2025-12-04T08:53:38.2311959Z Current runner version: '2.329.0' 2025-12-04T08:53:38.2315082Z Runner name: 'linux.rocm.gpu.gfx942.1.b-gwk9b-runner-kfwnw' 2025-12-04T08:53:38.2315493Z Runner group name: 'default' 2025-12-04T08:53:38.2315952Z Machine name: 'linux' 2025-12-04T08:53:38.2317098Z ##[group]GITHUB_TOKEN Permissions 2025-12-04T08:53:38.2318195Z Contents: read 2025-12-04T08:53:38.2318434Z Metadata: read 2025-12-04T08:53:38.2318702Z ##[endgroup] 2025-12-04T08:53:38.2319780Z Secret source: Actions 2025-12-04T08:53:38.2320066Z Prepare workflow directory 2025-12-04T08:53:38.2564329Z Prepare all required actions 2025-12-04T08:53:38.2583900Z Getting action download info 2025-12-04T08:53:38.7681906Z Download action repository 'pytorch/pytorch@main' (SHA:ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32) 2025-12-04T08:53:46.7332002Z Download action repository 'pytorch/test-infra@main' (SHA:39aa74d619174326f4e2fb0e216151c2f29d9ffd) 2025-12-04T08:53:47.8883458Z Download action repository 'actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-12-04T08:53:48.9299628Z Download action repository 'aws-actions/configure-aws-credentials@ececac1a45f3b08a01d2dd070d28d111c5fe6722' (SHA:ececac1a45f3b08a01d2dd070d28d111c5fe6722) 2025-12-04T08:53:49.8066607Z Getting action download info 2025-12-04T08:53:50.0278626Z Download action repository 'actions/checkout@v4' (SHA:34e114876b0b11c390a56381ad16ebd13914f8d5) 2025-12-04T08:53:50.8600640Z Getting action download info 2025-12-04T08:53:51.0810688Z Download action repository 'nick-fields/retry@v3.0.0' (SHA:7152eba30c6575329ac0576536151aca5a72780e) 2025-12-04T08:53:51.8724823Z Getting action download info 2025-12-04T08:53:52.0998470Z Uses: pytorch/pytorch/.github/workflows/_rocm-test.yml@refs/heads/main (ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32) 2025-12-04T08:53:52.1000576Z ##[group] Inputs 2025-12-04T08:53:52.1000741Z build-environment: linux-jammy-rocm-py3.10 
2025-12-04T08:53:52.1003957Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, 
"runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]} 2025-12-04T08:53:52.1007295Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:53:52.1007587Z sync-tag: 2025-12-04T08:53:52.1008014Z timeout-minutes: 300 2025-12-04T08:53:52.1008127Z tests-to-include: 2025-12-04T08:53:52.1008231Z dashboard-tag: 2025-12-04T08:53:52.1008460Z disable-monitor: true 2025-12-04T08:53:52.1008585Z monitor-log-interval: 5 2025-12-04T08:53:52.1008709Z monitor-data-collect-interval: 1 2025-12-04T08:53:52.1008848Z ##[endgroup] 2025-12-04T08:53:52.1009062Z Complete job name: linux-jammy-rocm-py3.10 / test (default, 2, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T08:53:52.1277204Z ##[group]Run pytorch/pytorch/.github/actions/checkout-pytorch@main 2025-12-04T08:53:52.1277480Z with: 2025-12-04T08:53:52.1277573Z no-sudo: true 2025-12-04T08:53:52.1277665Z submodules: recursive 2025-12-04T08:53:52.1277764Z fetch-depth: 0 2025-12-04T08:53:52.1277894Z env: 2025-12-04T08:53:52.1293879Z GIT_DEFAULT_BRANCH: main 
2025-12-04T08:53:52.1294021Z ##[endgroup] 2025-12-04T08:53:52.1340219Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T08:53:52.1340606Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T08:53:52.1348217Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:53:52.1348382Z env: 2025-12-04T08:53:52.1348473Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:53:52.1348578Z ##[endgroup] 2025-12-04T08:53:52.1528282Z ##[group]Run actions/checkout@v4 2025-12-04T08:53:52.1528467Z with: 2025-12-04T08:53:52.1528596Z ref: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:53:52.1528737Z fetch-depth: 0 2025-12-04T08:53:52.1528838Z submodules: recursive 2025-12-04T08:53:52.1528944Z show-progress: false 2025-12-04T08:53:52.1529052Z repository: pytorch/pytorch 2025-12-04T08:53:52.1529265Z token: *** 2025-12-04T08:53:52.1529359Z ssh-strict: true 2025-12-04T08:53:52.1529455Z ssh-user: git 2025-12-04T08:53:52.1529550Z persist-credentials: true 2025-12-04T08:53:52.1529658Z clean: true 2025-12-04T08:53:52.1529765Z sparse-checkout-cone-mode: true 2025-12-04T08:53:52.1529883Z fetch-tags: false 2025-12-04T08:53:52.1529976Z lfs: false 2025-12-04T08:53:52.1530064Z set-safe-directory: true 2025-12-04T08:53:52.1530228Z env: 2025-12-04T08:53:52.1530315Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:53:52.1530416Z ##[endgroup] 2025-12-04T08:53:52.2148800Z Syncing repository: pytorch/pytorch 2025-12-04T08:53:52.2149501Z ##[group]Getting Git version info 2025-12-04T08:53:52.2149669Z Working directory is '/home/runner/_work/pytorch/pytorch' 2025-12-04T08:53:52.2149908Z [command]/usr/bin/git version 2025-12-04T08:53:52.2150030Z git version 2.52.0 2025-12-04T08:53:52.2171004Z ##[endgroup] 2025-12-04T08:53:52.2174935Z Copying '/home/runner/.gitconfig' to 
'/home/runner/_work/_temp/4ab98931-fc69-4646-9aa1-8977239ede23/.gitconfig' 2025-12-04T08:53:52.2175417Z Temporarily overriding HOME='/home/runner/_work/_temp/4ab98931-fc69-4646-9aa1-8977239ede23' before making global git config changes 2025-12-04T08:53:52.2176056Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T08:53:52.2183238Z [command]/usr/bin/git config --global --add safe.directory /home/runner/_work/pytorch/pytorch 2025-12-04T08:53:52.2219529Z [command]/usr/bin/git config --local --get remote.origin.url 2025-12-04T08:53:52.2236904Z https://github.com/pytorch/pytorch 2025-12-04T08:53:52.2253574Z ##[group]Removing previously created refs, to avoid conflicts 2025-12-04T08:53:52.2257753Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-12-04T08:53:52.2276981Z refs/heads/main 2025-12-04T08:53:52.2282904Z [command]/usr/bin/git checkout --detach 2025-12-04T08:53:53.9202375Z HEAD is now at ffd9b0fb4355 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T08:53:53.9245305Z [command]/usr/bin/git branch --delete --force main 2025-12-04T08:53:53.9482190Z Deleted branch main (was ffd9b0fb4355). 
2025-12-04T08:53:53.9487045Z ##[endgroup] 2025-12-04T08:53:53.9489010Z [command]/usr/bin/git submodule status 2025-12-04T08:53:53.9680386Z 7e1e1fe3858c63c251c637ae41a20de425dde96f android/libs/fbjni (v0.1.0-12-g7e1e1fe) 2025-12-04T08:53:53.9716790Z 4dfe081cf6bcd15db339cf2680b9281b8451eeb3 third_party/FP16 (4dfe081) 2025-12-04T08:53:53.9754473Z b408327ac2a15ec3e43352421954f5b1967701d1 third_party/FXdiv (b408327) 2025-12-04T08:53:53.9800940Z c07e3a0400713d546e0dea2d5466dd22ea389c73 third_party/NNPACK (c07e3a0) 2025-12-04T08:53:53.9835381Z 3ebbc93ded7285963bff932c678fa367eb393ba6 third_party/NVTX (v3.1.0-313-g3ebbc93) 2025-12-04T08:53:53.9886677Z 1d8f600fd424278486eade7ed3e877c99f0846b1 third_party/VulkanMemoryAllocator (v2.1.0-982-g1d8f600) 2025-12-04T08:53:54.0163985Z 51a0103656eff6fc9bfd39a4597923c4b542c883 third_party/XNNPACK (remotes/origin/ds/ndk-1243-g51a0103656) 2025-12-04T08:53:54.0188013Z 01aae101b9e5e94d6c16a9514c9fb8df99c93150 third_party/aiter (v0.1.1-92-g01aae101) 2025-12-04T08:53:54.0200575Z 299e5928955cc62af9968370293b916f5130916f third_party/benchmark (v1.9.3) 2025-12-04T08:53:54.0250341Z 7fe50dc3da2069d6645d9deb8c017a876472a977 third_party/composable_kernel (rocm-6.4.3-459-g7fe50dc3d) 2025-12-04T08:53:54.0326817Z 89c932f313c6437c38f2982869beacc89c2f2246 third_party/cpp-httplib (v0.26.0) 2025-12-04T08:53:54.0475180Z f858c30bcb16f8effd5ff46996f0514539e17abc third_party/cpuinfo (f858c30) 2025-12-04T08:53:54.0495639Z 0b1577c8c83401237d601d0d0db5210506705396 third_party/cudnn_frontend (v0.5-61-g0b1577c) 2025-12-04T08:53:54.0554074Z f88806b1e31dfa579842638740216dd41fc6c588 third_party/cutlass (v4.3.1) 2025-12-04T08:53:54.0580405Z c0b988d39a9e47c794d699f29930ed4d7c7e13a4 third_party/fbgemm (v1.4.0-rc1-2-gc0b988d39) 2025-12-04T08:53:54.0624357Z 979702c87a8713a8e0a5e9fee122b90d2ef13be5 third_party/flash-attention (v2.7.4) 2025-12-04T08:53:54.0639409Z a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757 third_party/flatbuffers (v24.12.23) 2025-12-04T08:53:54.0886416Z 
407c905e45ad75fc29bf0f9bb7c5c2fd3475976f third_party/fmt (12.1.0) 2025-12-04T08:53:54.0950316Z 3fb5c176c17c765a3492cd2f0321b0dab712f350 third_party/gemmlowp/gemmlowp (remotes/origin/revert-87-master-135-g3fb5c17) 2025-12-04T08:53:54.1018212Z 54cbae0d3a67fa890b4c3d9ee162b7860315e341 third_party/gloo (remotes/origin/gh/c-p-i-o/1/base-37-g54cbae0) 2025-12-04T08:53:54.1152789Z 52eb8108c5bdec04579160ae17225d66034bd723 third_party/googletest (release-1.8.0-3544-g52eb8108) 2025-12-04T08:53:54.1202168Z 719d8e6cd7f7a0e01b155657526d693acf97c2b3 third_party/ideep (pytorch-rls-v3.7.1) 2025-12-04T08:53:54.1233354Z dec1d23ca65ab069d225dfe40dea14f455170959 third_party/ittapi (v3.25.5) 2025-12-04T08:53:54.1369408Z 31f85df8fbd89c188f14ef10f1ec65379786b943 third_party/kineto (heads/main) 2025-12-04T08:53:54.1382469Z d7770c89632329a9914ef1a90289917597639cbe third_party/kleidiai (v1.15.0) 2025-12-04T08:53:54.1395033Z fbd8b99c2b828428947d70fdc046bb55609be93e third_party/mimalloc (v2.2.4) 2025-12-04T08:53:54.1410007Z 55f93686c01528224f448c19128836e7df245f72 third_party/nlohmann (v3.12.0) 2025-12-04T08:53:54.1626172Z e709452ef2bbc1d113faf678c24e6d3467696e83 third_party/onnx (v1.18.0) 2025-12-04T08:53:54.1642639Z a799f4aed9c94b765dcdaabaeab7d5e7e2310878 third_party/opentelemetry-cpp (v1.14.2) 2025-12-04T08:53:54.1657644Z 0fa0ef591e38c2758e3184c6c23e497b9f732ffa third_party/pocketfft (release_for_eigen-40-g0fa0ef5) 2025-12-04T08:53:54.1864031Z d1eca4e4b421cd2997495c4b4e65cea6be4e9b8a third_party/protobuf (v3.7.0-rc.2-1279-gd1eca4e4b) 2025-12-04T08:53:54.1903943Z 072586a71b55b7f8c584153d223e95687148a900 third_party/psimd (heads/master) 2025-12-04T08:53:54.1937957Z 4fe0e1e183925bf8cfa6aae24237e724a96479b8 third_party/pthreadpool (0.1-144-g4fe0e1e) 2025-12-04T08:53:54.1955000Z f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8 third_party/pybind11 (v3.0.1) 2025-12-04T08:53:54.2018899Z f45429b087dd7d5bc78bb40dc7cf06425c252d67 third_party/python-peachpy (remotes/origin/pre-generated) 
2025-12-04T08:53:54.2066529Z 5a1d179df9cf652951b59010a2d2075372d67f68 third_party/sleef (3.8) 2025-12-04T08:53:54.2113439Z 2b4cd91092d335a697416b2a3cb398283246849d third_party/tensorpipe (heads/main) 2025-12-04T08:53:54.2128481Z ##[group]Cleaning the repository 2025-12-04T08:53:54.2133107Z [command]/usr/bin/git clean -ffdx 2025-12-04T08:53:54.2251242Z [command]/usr/bin/git reset --hard HEAD 2025-12-04T08:53:56.0733114Z HEAD is now at ffd9b0fb4355 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T08:53:56.0828490Z ##[endgroup] 2025-12-04T08:53:56.0831495Z ##[group]Disabling automatic garbage collection 2025-12-04T08:53:56.0850038Z [command]/usr/bin/git config --local gc.auto 0 2025-12-04T08:53:56.0879151Z ##[endgroup] 2025-12-04T08:53:56.0879346Z ##[group]Setting up auth 2025-12-04T08:53:56.0879510Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T08:53:56.0906110Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T08:53:56.1132944Z Entering 'android/libs/fbjni' 2025-12-04T08:53:56.1160926Z Entering 'third_party/FP16' 2025-12-04T08:53:56.1210346Z Entering 'third_party/FXdiv' 2025-12-04T08:53:56.1256035Z Entering 'third_party/NNPACK' 2025-12-04T08:53:56.1304078Z Entering 'third_party/NVTX' 2025-12-04T08:53:56.1322023Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:53:56.1343662Z Entering 'third_party/XNNPACK' 2025-12-04T08:53:56.1366462Z Entering 'third_party/aiter' 2025-12-04T08:53:56.1393371Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:53:56.1470521Z Entering 'third_party/benchmark' 2025-12-04T08:53:56.1478276Z Entering 'third_party/composable_kernel' 2025-12-04T08:53:56.1499044Z Entering 'third_party/cpp-httplib' 2025-12-04T08:53:56.1525304Z Entering 'third_party/cpuinfo' 2025-12-04T08:53:56.1550442Z Entering 
'third_party/cudnn_frontend' 2025-12-04T08:53:56.1646995Z Entering 'third_party/cutlass' 2025-12-04T08:53:56.1647175Z Entering 'third_party/fbgemm' 2025-12-04T08:53:56.1667868Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:53:56.1690040Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:53:56.1743013Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:53:56.1771903Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:53:56.1825151Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:53:56.1873931Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:53:56.1901407Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:53:56.1927995Z Entering 'third_party/flash-attention' 2025-12-04T08:53:56.1953305Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:53:56.1979385Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:53:56.2011489Z Entering 'third_party/flatbuffers' 2025-12-04T08:53:56.2032936Z Entering 'third_party/fmt' 2025-12-04T08:53:56.2052783Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:53:56.2074391Z Entering 'third_party/gloo' 2025-12-04T08:53:56.2097825Z Entering 'third_party/googletest' 2025-12-04T08:53:56.2119063Z Entering 'third_party/ideep' 2025-12-04T08:53:56.2139007Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:53:56.2163143Z Entering 'third_party/ittapi' 2025-12-04T08:53:56.2181340Z Entering 'third_party/kineto' 2025-12-04T08:53:56.2202150Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:53:56.2222825Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:53:56.2244618Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:53:56.2270666Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:53:56.2291328Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 
2025-12-04T08:53:56.2313264Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:53:56.2340280Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:53:56.2363790Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:53:56.2384168Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:53:56.2422745Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:53:56.2444953Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:53:56.2474350Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:53:56.2533134Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:53:56.2560764Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:53:56.2583671Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:53:56.2613310Z Entering 'third_party/kleidiai' 2025-12-04T08:53:56.2673839Z Entering 'third_party/mimalloc' 2025-12-04T08:53:56.2703262Z Entering 'third_party/nlohmann' 2025-12-04T08:53:56.2723178Z Entering 'third_party/onnx' 2025-12-04T08:53:56.2750651Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:53:56.2835635Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:53:56.2862021Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:53:56.2887037Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:53:56.2910131Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:53:56.2935100Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:53:56.2958895Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:53:56.2980865Z 
Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:53:56.3000755Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:53:56.3024448Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:53:56.3046240Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:53:56.3071597Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:53:56.3141992Z Entering 'third_party/pocketfft' 2025-12-04T08:53:56.3163024Z Entering 'third_party/protobuf' 2025-12-04T08:53:56.3187817Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:53:56.3214237Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:53:56.3237527Z Entering 'third_party/psimd' 2025-12-04T08:53:56.3280276Z Entering 'third_party/pthreadpool' 2025-12-04T08:53:56.3304466Z Entering 'third_party/pybind11' 2025-12-04T08:53:56.3331921Z Entering 'third_party/python-peachpy' 2025-12-04T08:53:56.3384250Z Entering 'third_party/sleef' 2025-12-04T08:53:56.3445124Z Entering 'third_party/tensorpipe' 2025-12-04T08:53:56.3505282Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:53:56.3531724Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:53:56.3563587Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:53:56.3591032Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:53:56.3629470Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:53:56.3679902Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T08:53:56.3725530Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T08:53:56.4008496Z Entering 
'android/libs/fbjni' 2025-12-04T08:53:56.4037447Z Entering 'third_party/FP16' 2025-12-04T08:53:56.4062352Z Entering 'third_party/FXdiv' 2025-12-04T08:53:56.4084628Z Entering 'third_party/NNPACK' 2025-12-04T08:53:56.4138927Z Entering 'third_party/NVTX' 2025-12-04T08:53:56.4162727Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:53:56.4187174Z Entering 'third_party/XNNPACK' 2025-12-04T08:53:56.4217256Z Entering 'third_party/aiter' 2025-12-04T08:53:56.4238438Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:53:56.4266975Z Entering 'third_party/benchmark' 2025-12-04T08:53:56.4294359Z Entering 'third_party/composable_kernel' 2025-12-04T08:53:56.4323347Z Entering 'third_party/cpp-httplib' 2025-12-04T08:53:56.4347758Z Entering 'third_party/cpuinfo' 2025-12-04T08:53:56.4369968Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:53:56.4390223Z Entering 'third_party/cutlass' 2025-12-04T08:53:56.4418110Z Entering 'third_party/fbgemm' 2025-12-04T08:53:56.4439831Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:53:56.4465244Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:53:56.4489735Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:53:56.4515377Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:53:56.4581750Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:53:56.4604147Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:53:56.4625431Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:53:56.4673470Z Entering 'third_party/flash-attention' 2025-12-04T08:53:56.4699127Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:53:56.4746903Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:53:56.4779837Z Entering 'third_party/flatbuffers' 2025-12-04T08:53:56.4806539Z Entering 'third_party/fmt' 2025-12-04T08:53:56.4837855Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:53:56.4860639Z Entering 
'third_party/gloo' 2025-12-04T08:53:56.4881700Z Entering 'third_party/googletest' 2025-12-04T08:53:56.4903367Z Entering 'third_party/ideep' 2025-12-04T08:53:56.4927726Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:53:56.4981551Z Entering 'third_party/ittapi' 2025-12-04T08:53:56.5004280Z Entering 'third_party/kineto' 2025-12-04T08:53:56.5056253Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:53:56.5079399Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:53:56.5103188Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:53:56.5129207Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:53:56.5152018Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:53:56.5168951Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:53:56.5215639Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:53:56.5232181Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:53:56.5248088Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:53:56.5264358Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:53:56.5283996Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:53:56.5330380Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:53:56.5358715Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:53:56.5444717Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:53:56.5471948Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:53:56.5516296Z Entering 
'third_party/kleidiai' 2025-12-04T08:53:56.5542149Z Entering 'third_party/mimalloc' 2025-12-04T08:53:56.5564755Z Entering 'third_party/nlohmann' 2025-12-04T08:53:56.5584935Z Entering 'third_party/onnx' 2025-12-04T08:53:56.5617593Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:53:56.5642473Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:53:56.5662840Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:53:56.5681925Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:53:56.5701713Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:53:56.5723830Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:53:56.5743619Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:53:56.5790506Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:53:56.5791490Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:53:56.5840838Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:53:56.5841183Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:53:56.5848785Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:53:56.5894372Z Entering 'third_party/pocketfft' 2025-12-04T08:53:56.5919965Z Entering 'third_party/protobuf' 2025-12-04T08:53:56.5990741Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:53:56.5997046Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:53:56.6049878Z Entering 'third_party/psimd' 2025-12-04T08:53:56.6075900Z Entering 'third_party/pthreadpool' 2025-12-04T08:53:56.6100084Z Entering 'third_party/pybind11' 2025-12-04T08:53:56.6126277Z Entering 'third_party/python-peachpy' 2025-12-04T08:53:56.6149762Z Entering 'third_party/sleef' 2025-12-04T08:53:56.6174627Z Entering 'third_party/tensorpipe' 
2025-12-04T08:53:56.6197398Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:53:56.6229164Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:53:56.6246965Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:53:56.6264558Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:53:56.6291704Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:53:56.6345983Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.6381184Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T08:53:56.6538865Z Entering 'android/libs/fbjni' 2025-12-04T08:53:56.6551682Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T08:53:56.6579504Z Entering 'third_party/FP16' 2025-12-04T08:53:56.6579806Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T08:53:56.6581414Z Entering 'third_party/FXdiv' 2025-12-04T08:53:56.6610410Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T08:53:56.6610967Z Entering 'third_party/NNPACK' 2025-12-04T08:53:56.6623245Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T08:53:56.6638640Z Entering 'third_party/NVTX' 2025-12-04T08:53:56.6649532Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T08:53:56.6703861Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:53:56.6716984Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T08:53:56.6727384Z Entering 'third_party/XNNPACK' 2025-12-04T08:53:56.6738840Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T08:53:56.6753769Z Entering 'third_party/aiter' 2025-12-04T08:53:56.6767536Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T08:53:56.6780622Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:53:56.6789696Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T08:53:56.6805165Z Entering 'third_party/benchmark' 2025-12-04T08:53:56.6816968Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:53:56.6827058Z Entering 'third_party/composable_kernel' 2025-12-04T08:53:56.6839656Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T08:53:56.6853161Z Entering 'third_party/cpp-httplib' 2025-12-04T08:53:56.6866027Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T08:53:56.6874819Z Entering 'third_party/cpuinfo' 2025-12-04T08:53:56.6886210Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T08:53:56.6895324Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:53:56.6906193Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T08:53:56.6915386Z Entering 'third_party/cutlass' 2025-12-04T08:53:56.6926443Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T08:53:56.6939075Z Entering 'third_party/fbgemm' 2025-12-04T08:53:56.6949994Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T08:53:56.6960220Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:53:56.6970190Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T08:53:56.6976875Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:53:56.6987633Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T08:53:56.6997439Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:53:56.7008158Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T08:53:56.7023475Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:53:56.7035023Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T08:53:56.7047703Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:53:56.7057765Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T08:53:56.7066348Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:53:56.7076342Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T08:53:56.7087526Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:53:56.7099331Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T08:53:56.7107499Z Entering 'third_party/flash-attention' 2025-12-04T08:53:56.7118836Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T08:53:56.7130702Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:53:56.7142937Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 
2025-12-04T08:53:56.7158163Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:53:56.7172395Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T08:53:56.7185464Z Entering 'third_party/flatbuffers' 2025-12-04T08:53:56.7196597Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T08:53:56.7205803Z Entering 'third_party/fmt' 2025-12-04T08:53:56.7217548Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:53:56.7228179Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:53:56.7242413Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T08:53:56.7283576Z Entering 'third_party/gloo' 2025-12-04T08:53:56.7299345Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T08:53:56.7307653Z Entering 'third_party/googletest' 2025-12-04T08:53:56.7321813Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:53:56.7334012Z Entering 'third_party/ideep' 2025-12-04T08:53:56.7347538Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T08:53:56.7363792Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:53:56.7369340Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T08:53:56.7393229Z Entering 'third_party/ittapi' 2025-12-04T08:53:56.7406152Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T08:53:56.7417093Z Entering 'third_party/kineto' 2025-12-04T08:53:56.7427268Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T08:53:56.7443652Z Entering 
'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:53:56.7456532Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T08:53:56.7465269Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:53:56.7476461Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T08:53:56.7489231Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:53:56.7501410Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T08:53:56.7511739Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:53:56.7525917Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:53:56.7536503Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:53:56.7545947Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T08:53:56.7553784Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:53:56.7586326Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T08:53:56.7600154Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:53:56.7631811Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config 
remote.origin.url 2025-12-04T08:53:56.7643026Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:53:56.7651777Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:53:56.7661880Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:53:56.7741376Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T08:53:56.7742459Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:53:56.7742818Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T08:53:56.7743236Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:53:56.7743626Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:53:56.7744028Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:53:56.7745028Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:53:56.7745496Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:53:56.7756009Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:53:56.7794405Z Entering 
'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:53:56.7805113Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T08:53:56.7815724Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:53:56.7827019Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T08:53:56.7840057Z Entering 'third_party/kleidiai' 2025-12-04T08:53:56.7851764Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T08:53:56.7861319Z Entering 'third_party/mimalloc' 2025-12-04T08:53:56.7871619Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T08:53:56.7880738Z Entering 'third_party/nlohmann' 2025-12-04T08:53:56.7892439Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T08:53:56.7903032Z Entering 'third_party/onnx' 2025-12-04T08:53:56.7912197Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T08:53:56.7929110Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:53:56.7938773Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:53:56.7947508Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:53:56.7958028Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T08:53:56.7967135Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:53:56.7976535Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:53:56.7984393Z Entering 
'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:53:56.7996131Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:53:56.8007292Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:53:56.8016612Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T08:53:56.8024868Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:53:56.8038333Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T08:53:56.8050230Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:53:56.8061895Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T08:53:56.8069913Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:53:56.8092598Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T08:53:56.8142267Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:53:56.8157770Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:53:56.8168246Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:53:56.8186394Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:53:56.8231441Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 
2025-12-04T08:53:56.8246161Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:53:56.8257277Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:53:56.8274033Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T08:53:56.8290391Z Entering 'third_party/pocketfft' 2025-12-04T08:53:56.8306643Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T08:53:56.8317593Z Entering 'third_party/protobuf' 2025-12-04T08:53:56.8332543Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T08:53:56.8345181Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:53:56.8356160Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:53:56.8391390Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:53:56.8402712Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:53:56.8414573Z Entering 'third_party/psimd' 2025-12-04T08:53:56.8428011Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T08:53:56.8437272Z Entering 'third_party/pthreadpool' 2025-12-04T08:53:56.8458140Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T08:53:56.8465316Z Entering 'third_party/pybind11' 2025-12-04T08:53:56.8477609Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:53:56.8489987Z Entering 'third_party/python-peachpy' 2025-12-04T08:53:56.8503258Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T08:53:56.8514454Z Entering 'third_party/sleef' 2025-12-04T08:53:56.8528160Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T08:53:56.8542607Z Entering 'third_party/tensorpipe' 2025-12-04T08:53:56.8555941Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T08:53:56.8582295Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:53:56.8592820Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:53:56.8601687Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:53:56.8611761Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T08:53:56.8620357Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:53:56.8659618Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T08:53:56.8666873Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:53:56.8676537Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:53:56.8685008Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:53:56.8695002Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T08:53:56.8724341Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8746768Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8763621Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8778573Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8792594Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8841160Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8860921Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8878951Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8895954Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8914460Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8932350Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8950322Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8971637Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.8991126Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9026937Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9028012Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9038726Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9054614Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9071296Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9087984Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9103244Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9118629Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9134459Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9152598Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9182594Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9210929Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9227933Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9270564Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9316413Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9334431Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9351390Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9365778Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9383604Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9403032Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9420976Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9436182Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9467037Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9479725Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9490887Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9502848Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9513128Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9524355Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9534865Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9548760Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9564456Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9581619Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9599702Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9644748Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 
2025-12-04T08:53:56.9663309Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9697058Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9720469Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9805054Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9822435Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9848470Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9903849Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9958901Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9976131Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:56.9992382Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config 
--name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0006486Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0026647Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0044893Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0061342Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0078148Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0094457Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0111660Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0129696Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0147262Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0165462Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0183586Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0200512Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0217096Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0232813Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0249783Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0268614Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0288917Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0305583Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0323114Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0339683Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0358174Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0376837Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0393644Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:53:57.0413500Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T08:53:57.0461340Z ##[endgroup] 2025-12-04T08:53:57.0461548Z ##[group]Fetching the repository 2025-12-04T08:53:57.0461895Z [command]/usr/bin/git -c protocol.version=2 fetch --prune --no-recurse-submodules origin +refs/heads/*:refs/remotes/origin/* +refs/tags/*:refs/tags/* 2025-12-04T08:53:57.6958143Z From https://github.com/pytorch/pytorch 2025-12-04T08:53:57.6958409Z - [deleted] (none) -> ciflow/inductor/160174 2025-12-04T08:53:57.6958592Z - [deleted] (none) -> ciflow/trunk/160174 2025-12-04T08:54:02.2829231Z * [new branch] 2.6.0.dev20241004+ -> origin/2.6.0.dev20241004+ 2025-12-04T08:54:02.2829458Z * [new branch] 2.9.1 -> origin/2.9.1 2025-12-04T08:54:02.2829702Z * [new branch] AaronWang04_addmmfusion_perftest -> origin/AaronWang04_addmmfusion_perftest 2025-12-04T08:54:02.2829940Z * [new branch] 
Flamefire-patch-1 -> origin/Flamefire-patch-1 2025-12-04T08:54:02.2830336Z * [new branch] HDCharles-2.6.0-release-notes -> origin/HDCharles-2.6.0-release-notes 2025-12-04T08:54:02.2830550Z * [new branch] HOPrintFunc -> origin/HOPrintFunc 2025-12-04T08:54:02.2830734Z * [new branch] IvanKobzarev/stack/1 -> origin/IvanKobzarev/stack/1 2025-12-04T08:54:02.2830920Z * [new branch] NicoshevSVE128 -> origin/NicoshevSVE128 2025-12-04T08:54:02.2831106Z * [new branch] PR-AOTInductorNoneBug -> origin/PR-AOTInductorNoneBug 2025-12-04T08:54:02.2831318Z * [new branch] PR-AOTInductorNoneBugFix -> origin/PR-AOTInductorNoneBugFix 2025-12-04T08:54:02.2831519Z * [new branch] PR-FixConfigsIssue -> origin/PR-FixConfigsIssue 2025-12-04T08:54:02.2832010Z * [new branch] PR-NoneBugFix-viable -> origin/PR-NoneBugFix-viable 2025-12-04T08:54:02.2832193Z * [new branch] PR-ResetToZero -> origin/PR-ResetToZero 2025-12-04T08:54:02.2832386Z * [new branch] Update-Flash-Packaging -> origin/Update-Flash-Packaging 2025-12-04T08:54:02.2832581Z * [new branch] VLA_exp -> origin/VLA_exp 2025-12-04T08:54:02.2832754Z * [new branch] activation_bench -> origin/activation_bench 2025-12-04T08:54:02.2832930Z * [new branch] addmm-heuristic -> origin/addmm-heuristic 2025-12-04T08:54:02.2833110Z * [new branch] adi/onednn_aarch64 -> origin/adi/onednn_aarch64 2025-12-04T08:54:02.2833282Z * [new branch] adi/test -> origin/adi/test 2025-12-04T08:54:02.2833445Z * [new branch] adi/test_bgemm -> origin/adi/test_bgemm 2025-12-04T08:54:02.2833616Z * [new branch] adi/test_m8g -> origin/adi/test_m8g 2025-12-04T08:54:02.2833785Z * [new branch] adi/test_onednn -> origin/adi/test_onednn 2025-12-04T08:54:02.2833960Z * [new branch] adi/test_onednn_v3.9 -> origin/adi/test_onednn_v3.9 2025-12-04T08:54:02.2834149Z * [new branch] adi/test_presve_change -> origin/adi/test_presve_change 2025-12-04T08:54:02.2834443Z * [new branch] adi/test_timm -> origin/adi/test_timm 2025-12-04T08:54:02.2834623Z * [new branch] adi/testpresve_change -> 
origin/adi/testpresve_change 2025-12-04T08:54:02.2834817Z * [new branch] aditew01/test/vec_bf16 -> origin/aditew01/test/vec_bf16 2025-12-04T08:54:02.2835018Z * [new branch] ah-globalfeedback-hook -> origin/ah-globalfeedback-hook 2025-12-04T08:54:02.2835232Z * [new branch] albanD-patch-1 -> origin/albanD-patch-1 2025-12-04T08:54:02.2835419Z * [new branch] also-surround-shimh -> origin/also-surround-shimh 2025-12-04T08:54:02.2835607Z * [new branch] angelayi/aot_compile -> origin/angelayi/aot_compile 2025-12-04T08:54:02.2835820Z * [new branch] angelayi/aoti_additional_files -> origin/angelayi/aoti_additional_files 2025-12-04T08:54:02.2836031Z * [new branch] angelayi/benchmark -> origin/angelayi/benchmark 2025-12-04T08:54:02.2836472Z * [new branch] angelayi/change_pytree_serialization -> origin/angelayi/change_pytree_serialization 2025-12-04T08:54:02.2860410Z * [new branch] angelayi/cpp_loader -> origin/angelayi/cpp_loader 2025-12-04T08:54:02.2894580Z * [new branch] angelayi/inductor_const -> origin/angelayi/inductor_const 2025-12-04T08:54:02.2894867Z * [new branch] angelayi/lstm -> origin/angelayi/lstm 2025-12-04T08:54:02.2895050Z * [new branch] angelayi/no_so_weight -> origin/angelayi/no_so_weight 2025-12-04T08:54:02.2895248Z * [new branch] angelayi/scan_layers -> origin/angelayi/scan_layers 2025-12-04T08:54:02.2895438Z * [new branch] angelayi/side_eff -> origin/angelayi/side_eff 2025-12-04T08:54:02.2895620Z * [new branch] angelayi/state_dict -> origin/angelayi/state_dict 2025-12-04T08:54:02.2895817Z * [new branch] angelayi/symint_input -> origin/angelayi/symint_input 2025-12-04T08:54:02.2896009Z * [new branch] angelayi/symm_mem -> origin/angelayi/symm_mem 2025-12-04T08:54:02.2896189Z * [new branch] angelayi/test_cpp -> origin/angelayi/test_cpp 2025-12-04T08:54:02.2896375Z * [new branch] angelayi/torch_size -> origin/angelayi/torch_size 2025-12-04T08:54:02.2896560Z * [new branch] annotate_assert -> origin/annotate_assert 2025-12-04T08:54:02.2896750Z * [new branch] 
annotate_fallback_kernel -> origin/annotate_fallback_kernel 2025-12-04T08:54:02.2897084Z * [new branch] annotation_deepcopy -> origin/annotation_deepcopy 2025-12-04T08:54:02.2897272Z * [new branch] annotation_dynamo -> origin/annotation_dynamo 2025-12-04T08:54:02.2897455Z * [new branch] aot_eager_stack_trace -> origin/aot_eager_stack_trace 2025-12-04T08:54:02.2897642Z * [new branch] aoti-cuda-alloc -> origin/aoti-cuda-alloc 2025-12-04T08:54:02.2897824Z * [new branch] aoti_const_device -> origin/aoti_const_device 2025-12-04T08:54:02.2898007Z * [new branch] aoti_fqn_name_interface -> origin/aoti_fqn_name_interface 2025-12-04T08:54:02.2898220Z * [new branch] aoti_package_weights_binary -> origin/aoti_package_weights_binary 2025-12-04T08:54:02.2898420Z * [new branch] aoti_target_windows -> origin/aoti_target_windows 2025-12-04T08:54:02.2898644Z * [new branch] arsh/feat/inductor_check_profiling -> origin/arsh/feat/inductor_check_profiling 2025-12-04T08:54:02.2898861Z * [new branch] async_tp -> origin/async_tp 2025-12-04T08:54:02.2899064Z * [new branch] atalman-inductor-perf-cu124 -> origin/atalman-inductor-perf-cu124 2025-12-04T08:54:02.2899313Z * [new branch] atalman-inductor-perf-cu124.1 -> origin/atalman-inductor-perf-cu124.1 2025-12-04T08:54:02.2902679Z * [new branch] atalman-patch-2 -> origin/atalman-patch-2 2025-12-04T08:54:02.2902864Z * [new branch] atalman-patch-3 -> origin/atalman-patch-3 2025-12-04T08:54:02.2903040Z * [new branch] atalman-patch-4 -> origin/atalman-patch-4 2025-12-04T08:54:02.2903219Z * [new branch] atalman-patch-5 -> origin/atalman-patch-5 2025-12-04T08:54:02.2903396Z * [new branch] atalman-patch-6 -> origin/atalman-patch-6 2025-12-04T08:54:02.2903571Z * [new branch] atalman-patch-7 -> origin/atalman-patch-7 2025-12-04T08:54:02.2903748Z * [new branch] atalman-patch-8 -> origin/atalman-patch-8 2025-12-04T08:54:02.2903933Z * [new branch] atalman_inductor_2.3.1 -> origin/atalman_inductor_2.3.1 2025-12-04T08:54:02.2904131Z * [new branch] 
atalman_inductor_2.4.0 -> origin/atalman_inductor_2.4.0 2025-12-04T08:54:02.2904327Z * [new branch] atalman_inductor_2.4.x -> origin/atalman_inductor_2.4.x 2025-12-04T08:54:02.2904539Z * [new branch] attention_benchmarking_clean -> origin/attention_benchmarking_clean 2025-12-04T08:54:02.2904761Z * [new branch] bahuang/dt_fix_scalar_add -> origin/bahuang/dt_fix_scalar_add 2025-12-04T08:54:02.2904965Z * [new branch] bahuang/fix_debug_mode -> origin/bahuang/fix_debug_mode 2025-12-04T08:54:02.2905154Z * [new branch] bahuang/fix_expand -> origin/bahuang/fix_expand 2025-12-04T08:54:02.2905337Z * [new branch] bahuang/test -> origin/bahuang/test 2025-12-04T08:54:02.2905508Z * [new branch] base/1.5 -> origin/base/1.5 2025-12-04T08:54:02.2905716Z * [new branch] batching_sdpa_efficient_attention -> origin/batching_sdpa_efficient_attention 2025-12-04T08:54:02.2905940Z * [new branch] bench_scaled_mm_ops -> origin/bench_scaled_mm_ops 2025-12-04T08:54:02.2906128Z * [new branch] benchmark-updates -> origin/benchmark-updates 2025-12-04T08:54:02.2906318Z * [new branch] benchmarking-script -> origin/benchmarking-script 2025-12-04T08:54:02.2906511Z * [new branch] bertmaher/pinbump26 -> origin/bertmaher/pinbump26 2025-12-04T08:54:02.2906693Z * [new branch] bertrand/cutlass -> origin/bertrand/cutlass 2025-12-04T08:54:02.2906877Z * [new branch] bf/bug-static-input -> origin/bf/bug-static-input 2025-12-04T08:54:02.2907090Z * [new branch] bf/cg-backend -> origin/bf/cg-backend 2025-12-04T08:54:02.2907259Z * [new branch] bf/cg-nccl-test -> origin/bf/cg-nccl-test 2025-12-04T08:54:02.2907441Z * [new branch] bf/cg-remove-check -> origin/bf/cg-remove-check 2025-12-04T08:54:02.2907634Z * [new branch] bf/clean-torchbench-hf -> origin/bf/clean-torchbench-hf 2025-12-04T08:54:02.2907825Z * [new branch] bf/combo-debug-log -> origin/bf/combo-debug-log 2025-12-04T08:54:02.2908002Z * [new branch] bf/cudagraph -> origin/bf/cudagraph 2025-12-04T08:54:02.2908230Z * [new branch] 
bf/cudagraph-disable-input-mutation -> origin/bf/cudagraph-disable-input-mutation 2025-12-04T08:54:02.2908582Z * [new branch] bf/cudagraph-enable-input-mutation-support-benchmark -> origin/bf/cudagraph-enable-input-mutation-support-benchmark 2025-12-04T08:54:02.2908896Z * [new branch] bf/cudagraph-partition -> origin/bf/cudagraph-partition 2025-12-04T08:54:02.2909114Z * [new branch] bf/donated-buffer-bench -> origin/bf/donated-buffer-bench 2025-12-04T08:54:02.2909316Z * [new branch] bf/dynamo-partition -> origin/bf/dynamo-partition 2025-12-04T08:54:02.2909496Z * [new branch] bf/lite -> origin/bf/lite 2025-12-04T08:54:02.2909730Z * [new branch] bf/pa-non-divisible -> origin/bf/pa-non-divisible 2025-12-04T08:54:02.2909954Z * [new branch] bf/partition-cache-free-symbols -> origin/bf/partition-cache-free-symbols 2025-12-04T08:54:02.2910246Z * [new branch] bf/partition-memory-plan -> origin/bf/partition-memory-plan 2025-12-04T08:54:02.2910458Z * [new branch] bf/partition-move-cpu -> origin/bf/partition-move-cpu 2025-12-04T08:54:02.2910671Z * [new branch] bf/partition-view-fallback -> origin/bf/partition-view-fallback 2025-12-04T08:54:02.2910892Z * [new branch] bf/remove-check-55b0c39d -> origin/bf/remove-check-55b0c39d 2025-12-04T08:54:02.2911091Z * [new branch] bf/timm-nov-26-2025 -> origin/bf/timm-nov-26-2025 2025-12-04T08:54:02.2911297Z * [new branch] bf/transformer-pin-4-57-3 -> origin/bf/transformer-pin-4-57-3 2025-12-04T08:54:02.2911527Z * [new branch] bisect_perf_hf_T5_3acc6eac492 -> origin/bisect_perf_hf_T5_3acc6eac492 2025-12-04T08:54:02.2911752Z * [new branch] bisect_perf_hf_T5_3fcf66f61fb -> origin/bisect_perf_hf_T5_3fcf66f61fb 2025-12-04T08:54:02.2911971Z * [new branch] bisect_perf_hf_T5_4009d154129 -> origin/bisect_perf_hf_T5_4009d154129 2025-12-04T08:54:02.2912188Z * [new branch] bisect_perf_hf_T5_40d0740e73d -> origin/bisect_perf_hf_T5_40d0740e73d 2025-12-04T08:54:02.2912398Z * [new branch] bisect_perf_hf_T5_5268754e -> origin/bisect_perf_hf_T5_5268754e 
2025-12-04T08:54:02.2912616Z * [new branch] bisect_perf_hf_T5_7d89a8d385c -> origin/bisect_perf_hf_T5_7d89a8d385c 2025-12-04T08:54:02.2912836Z * [new branch] bisect_perf_hf_T5_b7a25c1ee7c -> origin/bisect_perf_hf_T5_b7a25c1ee7c 2025-12-04T08:54:02.2913051Z * [new branch] bisect_perf_hf_T5_c25b201583f -> origin/bisect_perf_hf_T5_c25b201583f 2025-12-04T08:54:02.2913271Z * [new branch] bisect_perf_hf_T5_c93e57efac0 -> origin/bisect_perf_hf_T5_c93e57efac0 2025-12-04T08:54:02.2913488Z * [new branch] bisect_perf_hf_T5_ca9813ea149 -> origin/bisect_perf_hf_T5_ca9813ea149 2025-12-04T08:54:02.2913700Z * [new branch] bisect_perf_hf_T5_d65f194a -> origin/bisect_perf_hf_T5_d65f194a 2025-12-04T08:54:02.2913911Z * [new branch] bisect_perf_hf_T5_da94ab0b -> origin/bisect_perf_hf_T5_da94ab0b 2025-12-04T08:54:02.2914126Z * [new branch] bisect_perf_hf_T5_da94ab0b_new -> origin/bisect_perf_hf_T5_da94ab0b_new 2025-12-04T08:54:02.2914409Z * [new branch] bisect_perf_hf_T5_db4e8a1d8a8 -> origin/bisect_perf_hf_T5_db4e8a1d8a8 2025-12-04T08:54:02.2914627Z * [new branch] bisect_perf_hf_T5_e0d97e936a2 -> origin/bisect_perf_hf_T5_e0d97e936a2 2025-12-04T08:54:02.2914846Z * [new branch] bisect_perf_hf_T5_f23621ec563 -> origin/bisect_perf_hf_T5_f23621ec563 2025-12-04T08:54:02.2915055Z * [new branch] brister/fx_device_type -> origin/brister/fx_device_type 2025-12-04T08:54:02.2915273Z * [new branch] brister/test_inductor_all_fx -> origin/brister/test_inductor_all_fx 2025-12-04T08:54:02.2915527Z * [new branch] brister/tiled_reduction_no_numel_check -> origin/brister/tiled_reduction_no_numel_check 2025-12-04T08:54:02.2915752Z * [new branch] bwd-backup -> origin/bwd-backup 2025-12-04T08:54:02.2915927Z * [new branch] c57382a49 -> origin/c57382a49 2025-12-04T08:54:02.2916097Z * [new branch] ca_0431d47eaa -> origin/ca_0431d47eaa 2025-12-04T08:54:02.2916269Z * [new branch] ca_fix_0431d47eaa -> origin/ca_fix_0431d47eaa 2025-12-04T08:54:02.2916475Z * [new branch] camyllh/test_setup_hooks_push -> 
origin/camyllh/test_setup_hooks_push 2025-12-04T08:54:02.2916686Z * [new branch] cccclai-patch-1 -> origin/cccclai-patch-1 2025-12-04T08:54:02.2916955Z * [new branch] cherry-pick-159969-by-pytorch_bot_bot_ -> origin/cherry-pick-159969-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2917241Z * [new branch] cherry-pick-160586-by-pytorch_bot_bot_ -> origin/cherry-pick-160586-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2917521Z * [new branch] cherry-pick-162208-by-pytorch_bot_bot_ -> origin/cherry-pick-162208-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2917795Z * [new branch] cherry-pick-163169-by-pytorch_bot_bot_ -> origin/cherry-pick-163169-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2918076Z * [new branch] cherry-pick-165086-by-pytorch_bot_bot_ -> origin/cherry-pick-165086-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2918355Z * [new branch] cherry-pick-165514-by-pytorch_bot_bot_ -> origin/cherry-pick-165514-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2918629Z * [new branch] cherry-pick-165601-by-pytorch_bot_bot_ -> origin/cherry-pick-165601-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2918910Z * [new branch] cherry-pick-165667-by-pytorch_bot_bot_ -> origin/cherry-pick-165667-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2919187Z * [new branch] cherry-pick-165815-by-pytorch_bot_bot_ -> origin/cherry-pick-165815-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2919462Z * [new branch] cherry-pick-165922-by-pytorch_bot_bot_ -> origin/cherry-pick-165922-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2919739Z * [new branch] cherry-pick-166148-by-pytorch_bot_bot_ -> origin/cherry-pick-166148-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2920020Z * [new branch] cherry-pick-166181-by-pytorch_bot_bot_ -> origin/cherry-pick-166181-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2920319Z * [new branch] cherry-pick-166404-by-pytorch_bot_bot_ -> origin/cherry-pick-166404-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2920595Z * [new branch] cherry-pick-166427-by-pytorch_bot_bot_ -> origin/cherry-pick-166427-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2920878Z 
* [new branch] cherry-pick-166480-by-pytorch_bot_bot_ -> origin/cherry-pick-166480-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2921151Z * [new branch] cherry-pick-166570-by-pytorch_bot_bot_ -> origin/cherry-pick-166570-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2921427Z * [new branch] cherry-pick-166993-by-pytorch_bot_bot_ -> origin/cherry-pick-166993-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2921707Z * [new branch] cherry-pick-167111-by-pytorch_bot_bot_ -> origin/cherry-pick-167111-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2922016Z * [new branch] cherry-pick-167478-by-pytorch_bot_bot_ -> origin/cherry-pick-167478-by-pytorch_bot_bot_ 2025-12-04T08:54:02.2922259Z * [new branch] cherry_pick_166036_166040 -> origin/cherry_pick_166036_166040 2025-12-04T08:54:02.2922455Z * [new branch] cherry_pick_166457 -> origin/cherry_pick_166457 2025-12-04T08:54:02.2922638Z * [new branch] cherrypick_166338 -> origin/cherrypick_166338 2025-12-04T08:54:02.2922822Z * [new branch] cherrypick_166458 -> origin/cherrypick_166458 2025-12-04T08:54:02.2923003Z * [new branch] cherrypick_166586 -> origin/cherrypick_166586 2025-12-04T08:54:02.2923184Z * [new branch] cherrypick_166956 -> origin/cherrypick_166956 2025-12-04T08:54:02.2923361Z * [new branch] ci_attn -> origin/ci_attn 2025-12-04T08:54:02.2923533Z * [new branch] codex-testing -> origin/codex-testing 2025-12-04T08:54:02.2923798Z * [new branch] codex/add-check_memory_overlap-helper-functions -> origin/codex/add-check_memory_overlap-helper-functions 2025-12-04T08:54:02.2924104Z * [new branch] codex/fix-issue-121219-in-pytorch -> origin/codex/fix-issue-121219-in-pytorch 2025-12-04T08:54:02.2924453Z * [new branch] codex/investigate-segfaults-in-get_tensor_storage_id -> origin/codex/investigate-segfaults-in-get_tensor_storage_id 2025-12-04T08:54:02.2924823Z * [new branch] codex/refactor-lintrunner-config-to-use-uv-run -> origin/codex/refactor-lintrunner-config-to-use-uv-run 2025-12-04T08:54:02.2925096Z * [new branch] compatiblpy39util -> 
origin/compatiblpy39util 2025-12-04T08:54:02.2925282Z * [new branch] cond_hop_device -> origin/cond_hop_device 2025-12-04T08:54:02.2925454Z * [new branch] context_test -> origin/context_test 2025-12-04T08:54:02.2925696Z * [new branch] copilot/code-style-cleanup-python-pip -> origin/copilot/code-style-cleanup-python-pip 2025-12-04T08:54:02.2925946Z * [new branch] cpio/fix_new_ami_tests -> origin/cpio/fix_new_ami_tests 2025-12-04T08:54:02.2926164Z * [new branch] cpp-docs-dependency-upgrade -> origin/cpp-docs-dependency-upgrade 2025-12-04T08:54:02.2926387Z * [new branch] csl/always_produce_xml -> origin/csl/always_produce_xml 2025-12-04T08:54:02.2926591Z * [new branch] csl/build_test_more_procs -> origin/csl/build_test_more_procs 2025-12-04T08:54:02.2926799Z * [new branch] csl/build_test_more_procs2 -> origin/csl/build_test_more_procs2 2025-12-04T08:54:02.2926995Z * [new branch] csl/clean_up -> origin/csl/clean_up 2025-12-04T08:54:02.2927192Z * [new branch] csl/fix_retry_segfault_exit -> origin/csl/fix_retry_segfault_exit 2025-12-04T08:54:02.2927401Z * [new branch] csl/katex -> origin/csl/katex 2025-12-04T08:54:02.2927572Z * [new branch] csl/larger_runner -> origin/csl/larger_runner 2025-12-04T08:54:02.2927747Z * [new branch] csl/lint_testing -> origin/csl/lint_testing 2025-12-04T08:54:02.2927920Z * [new branch] csl/lint_thing -> origin/csl/lint_thing 2025-12-04T08:54:02.2928105Z * [new branch] csl/lintrunner_stuff -> origin/csl/lintrunner_stuff 2025-12-04T08:54:02.2928295Z * [new branch] csl/manually_gen_json -> origin/csl/manually_gen_json 2025-12-04T08:54:02.2928474Z * [new branch] csl/mps_sharding -> origin/csl/mps_sharding 2025-12-04T08:54:02.2928660Z * [new branch] csl/multistage_docker -> origin/csl/multistage_docker 2025-12-04T08:54:02.2928844Z * [new branch] csl/print_timing -> origin/csl/print_timing 2025-12-04T08:54:02.2929024Z * [new branch] csl/remove_experiment -> origin/csl/remove_experiment 2025-12-04T08:54:02.2929249Z * [new branch] 
csl/remove_maybe_unused_var -> origin/csl/remove_maybe_unused_var 2025-12-04T08:54:02.2929481Z * [new branch] csl/remove_repo_specific_autolabel -> origin/csl/remove_repo_specific_autolabel 2025-12-04T08:54:02.2929702Z * [new branch] csl/remove_run_parallel -> origin/csl/remove_run_parallel 2025-12-04T08:54:02.2929899Z * [new branch] csl/remove_unused_vars -> origin/csl/remove_unused_vars 2025-12-04T08:54:02.2930083Z * [new branch] csl/revert_open -> origin/csl/revert_open 2025-12-04T08:54:02.2930298Z * [new branch] csl/skip_build -> origin/csl/skip_build 2025-12-04T08:54:02.2930492Z * [new branch] csl/smaller_avx_amx_runenrs -> origin/csl/smaller_avx_amx_runenrs 2025-12-04T08:54:02.2930684Z * [new branch] csl/td_job_level -> origin/csl/td_job_level 2025-12-04T08:54:02.2930887Z * [new branch] csl/test_cuda_build_large_runner -> origin/csl/test_cuda_build_large_runner 2025-12-04T08:54:02.2931137Z * [new branch] csl/test_owners_autograd_dispatch_nn -> origin/csl/test_owners_autograd_dispatch_nn 2025-12-04T08:54:02.2931388Z * [new branch] csl/test_owners_higher_confidence -> origin/csl/test_owners_higher_confidence 2025-12-04T08:54:02.2931645Z * [new branch] csl/upload_json_running -> origin/csl/upload_json_running 2025-12-04T08:54:02.2931833Z * [new branch] csl/win_sccache -> origin/csl/win_sccache 2025-12-04T08:54:02.2932004Z * [new branch] csl/xml_stuff -> origin/csl/xml_stuff 2025-12-04T08:54:02.2932174Z * [new branch] cublasrelax2 -> origin/cublasrelax2 2025-12-04T08:54:02.2932343Z * [new branch] cuda_mempool -> origin/cuda_mempool 2025-12-04T08:54:02.2932519Z * [new branch] custom_lowering_dict -> origin/custom_lowering_dict 2025-12-04T08:54:02.2932718Z * [new branch] d4l3k/debug_plane_frtrace -> origin/d4l3k/debug_plane_frtrace 2025-12-04T08:54:02.2932904Z * [new branch] daxia6/2.8o3 -> origin/daxia6/2.8o3 2025-12-04T08:54:02.2933069Z * [new branch] debug-guard -> origin/debug-guard 2025-12-04T08:54:02.2933247Z * [new branch] delete-quant-docs -> 
origin/delete-quant-docs 2025-12-04T08:54:02.2933582Z * [new branch] dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.0 -> origin/dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.0 2025-12-04T08:54:02.2934033Z * [new branch] dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.1 -> origin/dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.1 2025-12-04T08:54:02.2934365Z * [new branch] desertfire/test_cpp_wrapper -> origin/desertfire/test_cpp_wrapper 2025-12-04T08:54:02.2934611Z * [new branch] desertfire/triton-cpu-for-aarch64 -> origin/desertfire/triton-cpu-for-aarch64 2025-12-04T08:54:02.2934843Z * [new branch] dev/dhruva/flex_attn_opt -> origin/dev/dhruva/flex_attn_opt 2025-12-04T08:54:02.2935042Z * [new branch] dev/joona/MPSNDArrayAdd -> origin/dev/joona/MPSNDArrayAdd 2025-12-04T08:54:02.2935237Z * [new branch] dev/joona/Unranked -> origin/dev/joona/Unranked 2025-12-04T08:54:02.2935411Z * [new branch] dev/joona/cat -> origin/dev/joona/cat 2025-12-04T08:54:02.2935594Z * [new branch] dev/joona/embeddingbag -> origin/dev/joona/embeddingbag 2025-12-04T08:54:02.2935798Z * [new branch] dev/joona/fix_sdpa_memtest -> origin/dev/joona/fix_sdpa_memtest 2025-12-04T08:54:02.2936013Z * [new branch] dev/joona/getTensorsString -> origin/dev/joona/getTensorsString 2025-12-04T08:54:02.2936233Z * [new branch] dev/joona/mps_linear_macos14 -> origin/dev/joona/mps_linear_macos14 2025-12-04T08:54:02.2936470Z * [new branch] dev/joona/scalar_clamp -> origin/dev/joona/scalar_clamp 2025-12-04T08:54:02.2936650Z * [new branch] dev/joona/sdpa -> origin/dev/joona/sdpa 2025-12-04T08:54:02.2936829Z * [new branch] dev/joona/sdpa_api -> origin/dev/joona/sdpa_api 2025-12-04T08:54:02.2937012Z * [new branch] dev/joona/type_inf -> origin/dev/joona/type_inf 2025-12-04T08:54:02.2937202Z * [new branch] dev/joona/ulpAssertClose -> origin/dev/joona/ulpAssertClose 2025-12-04T08:54:02.2937396Z * [new branch] dev/joona/upsize3d -> 
origin/dev/joona/upsize3d 2025-12-04T08:54:02.2937572Z * [new branch] disp_counter -> origin/disp_counter 2025-12-04T08:54:02.2937747Z * [new branch] divyanshk-patch-1 -> origin/divyanshk-patch-1 2025-12-04T08:54:02.2937919Z * [new branch] docs -> origin/docs 2025-12-04T08:54:02.2938084Z * [new branch] documentation -> origin/documentation 2025-12-04T08:54:02.2938264Z * [new branch] eager_model_benchmarks -> origin/eager_model_benchmarks 2025-12-04T08:54:02.2938477Z * [new branch] embg/test_inductor_ci_control -> origin/embg/test_inductor_ci_control 2025-12-04T08:54:02.2938723Z * [new branch] embg/triton_l2_prefetch_128B -> origin/embg/triton_l2_prefetch_128B 2025-12-04T08:54:02.2938937Z * [new branch] embg/triton_l2_prefetch_256B -> origin/embg/triton_l2_prefetch_256B 2025-12-04T08:54:02.2939132Z * [new branch] eqy-patch-1 -> origin/eqy-patch-1 2025-12-04T08:54:02.2939299Z * [new branch] eqy-patch-2 -> origin/eqy-patch-2 2025-12-04T08:54:02.2939462Z * [new branch] eqy-patch-3 -> origin/eqy-patch-3 2025-12-04T08:54:02.2939625Z * [new branch] eqy-patch-4 -> origin/eqy-patch-4 2025-12-04T08:54:02.2939789Z * [new branch] eqy-patch-5 -> origin/eqy-patch-5 2025-12-04T08:54:02.2939952Z * [new branch] eqy-patch-6 -> origin/eqy-patch-6 2025-12-04T08:54:02.2940193Z * [new branch] exclamaforte/amd-ma -> origin/exclamaforte/amd-ma 2025-12-04T08:54:02.2940433Z * [new branch] exclamaforte/combo-kernels-perf-run -> origin/exclamaforte/combo-kernels-perf-run 2025-12-04T08:54:02.2940693Z * [new branch] exclamaforte/do_bench_refactor -> origin/exclamaforte/do_bench_refactor 2025-12-04T08:54:02.2940943Z * [new branch] exclamaforte/enable-mem-dep-fusion -> origin/exclamaforte/enable-mem-dep-fusion 2025-12-04T08:54:02.2941222Z * [new branch] exclamaforte/fix-exhaustive-autotuning -> origin/exclamaforte/fix-exhaustive-autotuning 2025-12-04T08:54:02.2941513Z * [new branch] exclamaforte/fix-trace-parsing-fx-svg -> origin/exclamaforte/fix-trace-parsing-fx-svg 2025-12-04T08:54:02.2941816Z 
* [new branch] exclamaforte/force-pointwise-cat-perf-run -> origin/exclamaforte/force-pointwise-cat-perf-run 2025-12-04T08:54:02.2942077Z * [new branch] exclamaforte/fusion-data -> origin/exclamaforte/fusion-data 2025-12-04T08:54:02.2942307Z * [new branch] exclamaforte/gemm-benchmark-run -> origin/exclamaforte/gemm-benchmark-run 2025-12-04T08:54:02.2942554Z * [new branch] exclamaforte/gemm-export-model -> origin/exclamaforte/gemm-export-model 2025-12-04T08:54:02.2942773Z * [new branch] exclamaforte/gemm-model -> origin/exclamaforte/gemm-model 2025-12-04T08:54:02.2943042Z * [new branch] exclamaforte/gemm-model-all-data-collection -> origin/exclamaforte/gemm-model-all-data-collection 2025-12-04T08:54:02.2943308Z * [new branch] exclamaforte/gemm-to-amd -> origin/exclamaforte/gemm-to-amd 2025-12-04T08:54:02.2943584Z * [new branch] exclamaforte/just-gemm-model -> origin/exclamaforte/just-gemm-model 2025-12-04T08:54:02.2943850Z * [new branch] exclamaforte/just-gemm-model-no-refactor -> origin/exclamaforte/just-gemm-model-no-refactor 2025-12-04T08:54:02.2944125Z * [new branch] exclamaforte/profile-diff-algo -> origin/exclamaforte/profile-diff-algo 2025-12-04T08:54:02.2944471Z * [new branch] exclamaforte/profiler-visualization -> origin/exclamaforte/profiler-visualization 2025-12-04T08:54:02.2944736Z * [new branch] exclamaforte/test_cpp_wrapper_mode -> origin/exclamaforte/test_cpp_wrapper_mode 2025-12-04T08:54:02.2945003Z * [new branch] exclamaforte/update-autotune-configs -> origin/exclamaforte/update-autotune-configs 2025-12-04T08:54:02.2945288Z * [new branch] exclamaforte/update-autotune-configs-2 -> origin/exclamaforte/update-autotune-configs-2 2025-12-04T08:54:02.2945514Z * [new branch] exec -> origin/exec 2025-12-04T08:54:02.2945690Z * [new branch] experimental-mosaic -> origin/experimental-mosaic 2025-12-04T08:54:02.2945876Z * [new branch] export-D61047529 -> origin/export-D61047529 2025-12-04T08:54:02.2946051Z * [new branch] export-D71412006 -> 
origin/export-D71412006 2025-12-04T08:54:02.2946256Z * [new branch] export-D73042989 -> origin/export-D73042989 2025-12-04T08:54:02.2946484Z * [new branch] export-D78957093 -> origin/export-D78957093 2025-12-04T08:54:02.2946659Z * [new branch] export-D78996107 -> origin/export-D78996107 2025-12-04T08:54:02.2946832Z * [new branch] export-D80823877 -> origin/export-D80823877 2025-12-04T08:54:02.2947005Z * [new branch] export-D80958642 -> origin/export-D80958642 2025-12-04T08:54:02.2947176Z * [new branch] export-D81054193 -> origin/export-D81054193 2025-12-04T08:54:02.2947353Z * [new branch] export-D81204584 -> origin/export-D81204584 2025-12-04T08:54:02.2947522Z * [new branch] export-D81429090 -> origin/export-D81429090 2025-12-04T08:54:02.2947693Z * [new branch] export-D82250826 -> origin/export-D82250826 2025-12-04T08:54:02.2947865Z * [new branch] export-D82253817 -> origin/export-D82253817 2025-12-04T08:54:02.2948034Z * [new branch] export-D83541846 -> origin/export-D83541846 2025-12-04T08:54:02.2948205Z * [new branch] export-D83627170 -> origin/export-D83627170 2025-12-04T08:54:02.2948375Z * [new branch] export-D83766701 -> origin/export-D83766701 2025-12-04T08:54:02.2948547Z * [new branch] export-D83768878 -> origin/export-D83768878 2025-12-04T08:54:02.2948718Z * [new branch] export-D83769447 -> origin/export-D83769447 2025-12-04T08:54:02.2948888Z * [new branch] export-D84089824 -> origin/export-D84089824 2025-12-04T08:54:02.2949058Z * [new branch] export-D84213020 -> origin/export-D84213020 2025-12-04T08:54:02.2949230Z * [new branch] export-D84373821 -> origin/export-D84373821 2025-12-04T08:54:02.2949397Z * [new branch] export-D84612194 -> origin/export-D84612194 2025-12-04T08:54:02.2949569Z * [new branch] export-D84890985 -> origin/export-D84890985 2025-12-04T08:54:02.2949740Z * [new branch] export-D85122326 -> origin/export-D85122326 2025-12-04T08:54:02.2949911Z * [new branch] export-D86256198 -> origin/export-D86256198 2025-12-04T08:54:02.2950087Z * [new 
branch] export-D86460608 -> origin/export-D86460608 2025-12-04T08:54:02.2950295Z * [new branch] export-D86474796 -> origin/export-D86474796 2025-12-04T08:54:02.2950463Z * [new branch] export-D86712396 -> origin/export-D86712396 2025-12-04T08:54:02.2950673Z * [new branch] export-D87022129 -> origin/export-D87022129 2025-12-04T08:54:02.2950844Z * [new branch] export-D87838959 -> origin/export-D87838959 2025-12-04T08:54:02.2951011Z * [new branch] export-D88319437 -> origin/export-D88319437 2025-12-04T08:54:02.2951233Z * [new branch] exported-model-train-idempotent -> origin/exported-model-train-idempotent 2025-12-04T08:54:02.2951463Z * [new branch] ezyang-titan-october -> origin/ezyang-titan-october 2025-12-04T08:54:02.2951658Z * [new branch] ezyang-titan-october2 -> origin/ezyang-titan-october2 2025-12-04T08:54:02.2951844Z * [new branch] ezyang-war -> origin/ezyang-war 2025-12-04T08:54:02.2952037Z * [new branch] ezyang/wip-aot-descriptors -> origin/ezyang/wip-aot-descriptors 2025-12-04T08:54:02.2952234Z * [new branch] fa_u8_brgemm -> origin/fa_u8_brgemm 2025-12-04T08:54:02.2952421Z * [new branch] fadeputr/sequence_fbgemm -> origin/fadeputr/sequence_fbgemm 2025-12-04T08:54:02.2952611Z * [new branch] fastmath_baseline -> origin/fastmath_baseline 2025-12-04T08:54:02.2952784Z * [new branch] fbcode/warm -> origin/fbcode/warm 2025-12-04T08:54:02.2952987Z * [new branch] fca -> origin/fca 2025-12-04T08:54:02.2953144Z * [new branch] fca2_ca5984c -> origin/fca2_ca5984c 2025-12-04T08:54:02.2953306Z * [new branch] fca5 -> origin/fca5 2025-12-04T08:54:02.2953484Z * [new branch] feature/justknobs-cpp -> origin/feature/justknobs-cpp 2025-12-04T08:54:02.2953681Z * [new branch] feature/numa-forkserver -> origin/feature/numa-forkserver 2025-12-04T08:54:02.2953873Z * [new branch] ffast_math_baseline -> origin/ffast_math_baseline 2025-12-04T08:54:02.2954054Z * [new branch] ffast_math_target -> origin/ffast_math_target 2025-12-04T08:54:02.2954234Z * [new branch] findhao/base_commit -> 
origin/findhao/base_commit 2025-12-04T08:54:02.2954420Z * [new branch] findhao/base_commit1 -> origin/findhao/base_commit1 2025-12-04T08:54:02.2954609Z * [new branch] findhao/multistream2 -> origin/findhao/multistream2 2025-12-04T08:54:02.2954794Z * [new branch] findhao/multistream5 -> origin/findhao/multistream5 2025-12-04T08:54:02.2954980Z * [new branch] findhao/multistream6 -> origin/findhao/multistream6 2025-12-04T08:54:02.2955178Z * [new branch] findhao/operatorbench3 -> origin/findhao/operatorbench3 2025-12-04T08:54:02.2955380Z * [new branch] findhao/operatorbench5 -> origin/findhao/operatorbench5 2025-12-04T08:54:02.2955579Z * [new branch] findhao/tritonparse -> origin/findhao/tritonparse 2025-12-04T08:54:02.2955794Z * [new branch] fix-ck-gemm-template-format -> origin/fix-ck-gemm-template-format 2025-12-04T08:54:02.2956009Z * [new branch] fix-config-ignore -> origin/fix-config-ignore 2025-12-04T08:54:02.2956196Z * [new branch] fix-dict-guard -> origin/fix-dict-guard 2025-12-04T08:54:02.2956376Z * [new branch] fix_addmm_issue -> origin/fix_addmm_issue 2025-12-04T08:54:02.2956578Z * [new branch] fix_amd_missing_cluster_dims -> origin/fix_amd_missing_cluster_dims 2025-12-04T08:54:02.2956783Z * [new branch] fix_bench_bwd_pass -> origin/fix_bench_bwd_pass 2025-12-04T08:54:02.2956971Z * [new branch] fix_mem_profiler_config -> origin/fix_mem_profiler_config 2025-12-04T08:54:02.2957164Z * [new branch] fix_nvrtc_discovery -> origin/fix_nvrtc_discovery 2025-12-04T08:54:02.2957345Z * [new branch] fix_op_runner -> origin/fix_op_runner 2025-12-04T08:54:02.2957538Z * [new branch] fix_ubn_159469 -> origin/fix_ubn_159469 2025-12-04T08:54:02.2957714Z * [new branch] fixes-triage -> origin/fixes-triage 2025-12-04T08:54:02.2957892Z * [new branch] fixflashinfer -> origin/fixflashinfer 2025-12-04T08:54:02.2958073Z * [new branch] flash_decoding_cpu -> origin/flash_decoding_cpu 2025-12-04T08:54:02.2958256Z * [new branch] flex-flash -> origin/flex-flash 2025-12-04T08:54:02.2958457Z 
* [new branch] flex_attention_functorch_grad -> origin/flex_attention_functorch_grad 2025-12-04T08:54:02.2958657Z * [new branch] flex_flash -> origin/flex_flash 2025-12-04T08:54:02.2958862Z * [new branch] fmassa/fix_memeff_sharding_rule -> origin/fmassa/fix_memeff_sharding_rule 2025-12-04T08:54:02.2959113Z * [new branch] fmassa/tests_comm_compute_scheduler -> origin/fmassa/tests_comm_compute_scheduler 2025-12-04T08:54:02.2959331Z * [new branch] forkserver_fix -> origin/forkserver_fix 2025-12-04T08:54:02.2959514Z * [new branch] fsdp2_trace_rules -> origin/fsdp2_trace_rules 2025-12-04T08:54:02.2959690Z * [new branch] fx_cpp -> origin/fx_cpp 2025-12-04T08:54:02.2959878Z * [new branch] fy/fix-win -> origin/fy/fix-win 2025-12-04T08:54:02.2960167Z * [new branch] galv-patch-1 -> origin/galv-patch-1 2025-12-04T08:54:02.2960401Z * [new branch] galv/cudagraphs-conditional-nodes-4 -> origin/galv/cudagraphs-conditional-nodes-4 2025-12-04T08:54:02.2960662Z * [new branch] georgehong/cmakelists-patch -> origin/georgehong/cmakelists-patch 2025-12-04T08:54:02.2960877Z * [new branch] gh/AlnisM/1/base -> origin/gh/AlnisM/1/base 2025-12-04T08:54:02.2961057Z * [new branch] gh/AlnisM/1/head -> origin/gh/AlnisM/1/head 2025-12-04T08:54:02.2961247Z * [new branch] gh/EikanWang/67/base -> origin/gh/EikanWang/67/base 2025-12-04T08:54:02.2961443Z * [new branch] gh/EikanWang/67/head -> origin/gh/EikanWang/67/head 2025-12-04T08:54:02.2961633Z * [new branch] gh/Gasoonjia/1/base -> origin/gh/Gasoonjia/1/base 2025-12-04T08:54:02.2961825Z * [new branch] gh/Gasoonjia/1/head -> origin/gh/Gasoonjia/1/head 2025-12-04T08:54:02.2962012Z * [new branch] gh/H-Huang/131/base -> origin/gh/H-Huang/131/base 2025-12-04T08:54:02.2962196Z * [new branch] gh/H-Huang/131/head -> origin/gh/H-Huang/131/head 2025-12-04T08:54:02.2962378Z * [new branch] gh/H-Huang/131/orig -> origin/gh/H-Huang/131/orig 2025-12-04T08:54:02.2962560Z * [new branch] gh/H-Huang/132/base -> origin/gh/H-Huang/132/base 2025-12-04T08:54:02.2962739Z 
* [new branch] gh/H-Huang/132/head -> origin/gh/H-Huang/132/head 2025-12-04T08:54:02.2962921Z * [new branch] gh/H-Huang/132/orig -> origin/gh/H-Huang/132/orig 2025-12-04T08:54:02.2963103Z * [new branch] gh/H-Huang/180/base -> origin/gh/H-Huang/180/base 2025-12-04T08:54:02.2963281Z * [new branch] gh/H-Huang/180/head -> origin/gh/H-Huang/180/head 2025-12-04T08:54:02.2963463Z * [new branch] gh/H-Huang/180/orig -> origin/gh/H-Huang/180/orig 2025-12-04T08:54:02.2963644Z * [new branch] gh/H-Huang/182/base -> origin/gh/H-Huang/182/base 2025-12-04T08:54:02.2963823Z * [new branch] gh/H-Huang/182/head -> origin/gh/H-Huang/182/head 2025-12-04T08:54:02.2964004Z * [new branch] gh/H-Huang/182/orig -> origin/gh/H-Huang/182/orig 2025-12-04T08:54:02.2964183Z * [new branch] gh/H-Huang/226/base -> origin/gh/H-Huang/226/base 2025-12-04T08:54:02.2964407Z * [new branch] gh/H-Huang/226/head -> origin/gh/H-Huang/226/head 2025-12-04T08:54:02.2964587Z * [new branch] gh/H-Huang/226/orig -> origin/gh/H-Huang/226/orig 2025-12-04T08:54:02.2964761Z * [new branch] gh/H-Huang/228/base -> origin/gh/H-Huang/228/base 2025-12-04T08:54:02.2964941Z * [new branch] gh/H-Huang/228/head -> origin/gh/H-Huang/228/head 2025-12-04T08:54:02.2965119Z * [new branch] gh/H-Huang/228/orig -> origin/gh/H-Huang/228/orig 2025-12-04T08:54:02.2965312Z * [new branch] gh/IvanKobzarev/150/base -> origin/gh/IvanKobzarev/150/base 2025-12-04T08:54:02.2965518Z * [new branch] gh/IvanKobzarev/150/head -> origin/gh/IvanKobzarev/150/head 2025-12-04T08:54:02.2965719Z * [new branch] gh/IvanKobzarev/150/orig -> origin/gh/IvanKobzarev/150/orig 2025-12-04T08:54:02.2965917Z * [new branch] gh/IvanKobzarev/157/base -> origin/gh/IvanKobzarev/157/base 2025-12-04T08:54:02.2966118Z * [new branch] gh/IvanKobzarev/157/head -> origin/gh/IvanKobzarev/157/head 2025-12-04T08:54:02.2966317Z * [new branch] gh/IvanKobzarev/157/orig -> origin/gh/IvanKobzarev/157/orig 2025-12-04T08:54:02.2966514Z * [new branch] gh/IvanKobzarev/159/base -> 
origin/gh/IvanKobzarev/159/base 2025-12-04T08:54:02.2966755Z * [new branch] gh/IvanKobzarev/159/head -> origin/gh/IvanKobzarev/159/head 2025-12-04T08:54:02.2966955Z * [new branch] gh/IvanKobzarev/159/orig -> origin/gh/IvanKobzarev/159/orig 2025-12-04T08:54:02.2967151Z * [new branch] gh/IvanKobzarev/162/base -> origin/gh/IvanKobzarev/162/base 2025-12-04T08:54:02.2967350Z * [new branch] gh/IvanKobzarev/162/head -> origin/gh/IvanKobzarev/162/head 2025-12-04T08:54:02.2967548Z * [new branch] gh/IvanKobzarev/162/orig -> origin/gh/IvanKobzarev/162/orig 2025-12-04T08:54:02.2967743Z * [new branch] gh/IvanKobzarev/163/base -> origin/gh/IvanKobzarev/163/base 2025-12-04T08:54:02.2967942Z * [new branch] gh/IvanKobzarev/163/head -> origin/gh/IvanKobzarev/163/head 2025-12-04T08:54:02.2968140Z * [new branch] gh/IvanKobzarev/163/orig -> origin/gh/IvanKobzarev/163/orig 2025-12-04T08:54:02.2968339Z * [new branch] gh/IvanKobzarev/166/base -> origin/gh/IvanKobzarev/166/base 2025-12-04T08:54:02.2968539Z * [new branch] gh/IvanKobzarev/166/head -> origin/gh/IvanKobzarev/166/head 2025-12-04T08:54:02.2968735Z * [new branch] gh/IvanKobzarev/166/orig -> origin/gh/IvanKobzarev/166/orig 2025-12-04T08:54:02.2968930Z * [new branch] gh/IvanKobzarev/167/base -> origin/gh/IvanKobzarev/167/base 2025-12-04T08:54:02.2969128Z * [new branch] gh/IvanKobzarev/167/head -> origin/gh/IvanKobzarev/167/head 2025-12-04T08:54:02.2969326Z * [new branch] gh/IvanKobzarev/167/orig -> origin/gh/IvanKobzarev/167/orig 2025-12-04T08:54:02.2969524Z * [new branch] gh/IvanKobzarev/168/base -> origin/gh/IvanKobzarev/168/base 2025-12-04T08:54:02.2969725Z * [new branch] gh/IvanKobzarev/168/head -> origin/gh/IvanKobzarev/168/head 2025-12-04T08:54:02.2969921Z * [new branch] gh/IvanKobzarev/168/orig -> origin/gh/IvanKobzarev/168/orig 2025-12-04T08:54:02.2970165Z * [new branch] gh/IvanKobzarev/169/base -> origin/gh/IvanKobzarev/169/base 2025-12-04T08:54:02.2970366Z * [new branch] gh/IvanKobzarev/169/head -> 
origin/gh/IvanKobzarev/169/head 2025-12-04T08:54:02.2970564Z * [new branch] gh/IvanKobzarev/169/orig -> origin/gh/IvanKobzarev/169/orig 2025-12-04T08:54:02.2970763Z * [new branch] gh/IvanKobzarev/170/base -> origin/gh/IvanKobzarev/170/base 2025-12-04T08:54:02.2970962Z * [new branch] gh/IvanKobzarev/170/head -> origin/gh/IvanKobzarev/170/head 2025-12-04T08:54:02.2971158Z * [new branch] gh/IvanKobzarev/170/orig -> origin/gh/IvanKobzarev/170/orig 2025-12-04T08:54:02.2971394Z * [new branch] gh/IvanKobzarev/171/base -> origin/gh/IvanKobzarev/171/base 2025-12-04T08:54:02.2971593Z * [new branch] gh/IvanKobzarev/171/head -> origin/gh/IvanKobzarev/171/head 2025-12-04T08:54:02.2971789Z * [new branch] gh/IvanKobzarev/171/orig -> origin/gh/IvanKobzarev/171/orig 2025-12-04T08:54:02.2971993Z * [new branch] gh/IvanKobzarev/172/base -> origin/gh/IvanKobzarev/172/base 2025-12-04T08:54:02.2972192Z * [new branch] gh/IvanKobzarev/172/head -> origin/gh/IvanKobzarev/172/head 2025-12-04T08:54:02.2972388Z * [new branch] gh/IvanKobzarev/172/orig -> origin/gh/IvanKobzarev/172/orig 2025-12-04T08:54:02.2972587Z * [new branch] gh/IvanKobzarev/173/base -> origin/gh/IvanKobzarev/173/base 2025-12-04T08:54:02.2972786Z * [new branch] gh/IvanKobzarev/173/head -> origin/gh/IvanKobzarev/173/head 2025-12-04T08:54:02.2972985Z * [new branch] gh/IvanKobzarev/173/orig -> origin/gh/IvanKobzarev/173/orig 2025-12-04T08:54:02.2973183Z * [new branch] gh/IvanKobzarev/174/base -> origin/gh/IvanKobzarev/174/base 2025-12-04T08:54:02.2973382Z * [new branch] gh/IvanKobzarev/174/head -> origin/gh/IvanKobzarev/174/head 2025-12-04T08:54:02.2973611Z * [new branch] gh/IvanKobzarev/174/orig -> origin/gh/IvanKobzarev/174/orig 2025-12-04T08:54:02.2973810Z * [new branch] gh/IvanKobzarev/175/base -> origin/gh/IvanKobzarev/175/base 2025-12-04T08:54:02.2974010Z * [new branch] gh/IvanKobzarev/175/head -> origin/gh/IvanKobzarev/175/head 2025-12-04T08:54:02.2974211Z * [new branch] gh/IvanKobzarev/175/orig -> 
origin/gh/IvanKobzarev/175/orig 2025-12-04T08:54:02.2974412Z * [new branch] gh/IvanKobzarev/176/base -> origin/gh/IvanKobzarev/176/base 2025-12-04T08:54:02.2974612Z * [new branch] gh/IvanKobzarev/176/head -> origin/gh/IvanKobzarev/176/head 2025-12-04T08:54:02.2974812Z * [new branch] gh/IvanKobzarev/176/orig -> origin/gh/IvanKobzarev/176/orig 2025-12-04T08:54:02.2975013Z * [new branch] gh/IvanKobzarev/177/base -> origin/gh/IvanKobzarev/177/base 2025-12-04T08:54:02.2975213Z * [new branch] gh/IvanKobzarev/177/head -> origin/gh/IvanKobzarev/177/head 2025-12-04T08:54:02.2975411Z * [new branch] gh/IvanKobzarev/177/orig -> origin/gh/IvanKobzarev/177/orig 2025-12-04T08:54:02.2975611Z * [new branch] gh/IvanKobzarev/178/base -> origin/gh/IvanKobzarev/178/base 2025-12-04T08:54:02.2975816Z * [new branch] gh/IvanKobzarev/178/head -> origin/gh/IvanKobzarev/178/head 2025-12-04T08:54:02.2976014Z * [new branch] gh/IvanKobzarev/178/orig -> origin/gh/IvanKobzarev/178/orig 2025-12-04T08:54:02.2976211Z * [new branch] gh/IvanKobzarev/179/base -> origin/gh/IvanKobzarev/179/base 2025-12-04T08:54:02.2976414Z * [new branch] gh/IvanKobzarev/179/head -> origin/gh/IvanKobzarev/179/head 2025-12-04T08:54:02.2976614Z * [new branch] gh/IvanKobzarev/179/orig -> origin/gh/IvanKobzarev/179/orig 2025-12-04T08:54:02.2976812Z * [new branch] gh/IvanKobzarev/180/base -> origin/gh/IvanKobzarev/180/base 2025-12-04T08:54:02.2977012Z * [new branch] gh/IvanKobzarev/180/head -> origin/gh/IvanKobzarev/180/head 2025-12-04T08:54:02.2977212Z * [new branch] gh/IvanKobzarev/180/orig -> origin/gh/IvanKobzarev/180/orig 2025-12-04T08:54:02.2977410Z * [new branch] gh/IvanKobzarev/181/base -> origin/gh/IvanKobzarev/181/base 2025-12-04T08:54:02.2977606Z * [new branch] gh/IvanKobzarev/181/head -> origin/gh/IvanKobzarev/181/head 2025-12-04T08:54:02.2977805Z * [new branch] gh/IvanKobzarev/181/orig -> origin/gh/IvanKobzarev/181/orig 2025-12-04T08:54:02.2978004Z * [new branch] gh/IvanKobzarev/182/base -> 
origin/gh/IvanKobzarev/182/base 2025-12-04T08:54:02.2978229Z * [new branch] gh/IvanKobzarev/182/head -> origin/gh/IvanKobzarev/182/head 2025-12-04T08:54:02.2978427Z * [new branch] gh/IvanKobzarev/182/orig -> origin/gh/IvanKobzarev/182/orig 2025-12-04T08:54:02.2978629Z * [new branch] gh/IvanKobzarev/183/base -> origin/gh/IvanKobzarev/183/base 2025-12-04T08:54:02.2978828Z * [new branch] gh/IvanKobzarev/183/head -> origin/gh/IvanKobzarev/183/head 2025-12-04T08:54:02.2979026Z * [new branch] gh/IvanKobzarev/183/orig -> origin/gh/IvanKobzarev/183/orig 2025-12-04T08:54:02.2979224Z * [new branch] gh/IvanKobzarev/184/base -> origin/gh/IvanKobzarev/184/base 2025-12-04T08:54:02.2979426Z * [new branch] gh/IvanKobzarev/184/head -> origin/gh/IvanKobzarev/184/head 2025-12-04T08:54:02.2979625Z * [new branch] gh/IvanKobzarev/184/orig -> origin/gh/IvanKobzarev/184/orig 2025-12-04T08:54:02.2979827Z * [new branch] gh/NikhilAPatel/1/base -> origin/gh/NikhilAPatel/1/base 2025-12-04T08:54:02.2980024Z * [new branch] gh/NikhilAPatel/1/head -> origin/gh/NikhilAPatel/1/head 2025-12-04T08:54:02.2980263Z * [new branch] gh/NikhilAPatel/2/base -> origin/gh/NikhilAPatel/2/base 2025-12-04T08:54:02.2980457Z * [new branch] gh/NikhilAPatel/2/head -> origin/gh/NikhilAPatel/2/head 2025-12-04T08:54:02.2980690Z * [new branch] gh/NikhilAPatel/4/base -> origin/gh/NikhilAPatel/4/base 2025-12-04T08:54:02.2980884Z * [new branch] gh/NikhilAPatel/4/head -> origin/gh/NikhilAPatel/4/head 2025-12-04T08:54:02.2981077Z * [new branch] gh/NikhilAPatel/5/base -> origin/gh/NikhilAPatel/5/base 2025-12-04T08:54:02.2981268Z * [new branch] gh/NikhilAPatel/5/head -> origin/gh/NikhilAPatel/5/head 2025-12-04T08:54:02.2981461Z * [new branch] gh/NikhilAPatel/5/orig -> origin/gh/NikhilAPatel/5/orig 2025-12-04T08:54:02.2981658Z * [new branch] gh/PaliC/17/base -> origin/gh/PaliC/17/base 2025-12-04T08:54:02.2981835Z * [new branch] gh/PaliC/17/head -> origin/gh/PaliC/17/head 2025-12-04T08:54:02.2982011Z * [new branch] 
gh/PaliC/17/orig -> origin/gh/PaliC/17/orig 2025-12-04T08:54:02.2982183Z * [new branch] gh/PaliC/18/base -> origin/gh/PaliC/18/base 2025-12-04T08:54:02.2982362Z * [new branch] gh/PaliC/18/head -> origin/gh/PaliC/18/head 2025-12-04T08:54:02.2982539Z * [new branch] gh/PaliC/18/orig -> origin/gh/PaliC/18/orig 2025-12-04T08:54:02.2982709Z * [new branch] gh/PaliC/20/base -> origin/gh/PaliC/20/base 2025-12-04T08:54:02.2982882Z * [new branch] gh/PaliC/20/head -> origin/gh/PaliC/20/head 2025-12-04T08:54:02.2983058Z * [new branch] gh/PaliC/20/orig -> origin/gh/PaliC/20/orig 2025-12-04T08:54:02.2983232Z * [new branch] gh/PaliC/21/base -> origin/gh/PaliC/21/base 2025-12-04T08:54:02.2983404Z * [new branch] gh/PaliC/21/head -> origin/gh/PaliC/21/head 2025-12-04T08:54:02.2983577Z * [new branch] gh/PaliC/21/orig -> origin/gh/PaliC/21/orig 2025-12-04T08:54:02.2983747Z * [new branch] gh/PaliC/23/base -> origin/gh/PaliC/23/base 2025-12-04T08:54:02.2983920Z * [new branch] gh/PaliC/23/head -> origin/gh/PaliC/23/head 2025-12-04T08:54:02.2984092Z * [new branch] gh/PaliC/23/orig -> origin/gh/PaliC/23/orig 2025-12-04T08:54:02.2984264Z * [new branch] gh/PaliC/24/base -> origin/gh/PaliC/24/base 2025-12-04T08:54:02.2984442Z * [new branch] gh/PaliC/24/head -> origin/gh/PaliC/24/head 2025-12-04T08:54:02.2984614Z * [new branch] gh/PaliC/24/orig -> origin/gh/PaliC/24/orig 2025-12-04T08:54:02.2984783Z * [new branch] gh/PaliC/25/head -> origin/gh/PaliC/25/head 2025-12-04T08:54:02.2984988Z * [new branch] gh/PaliC/25/next -> origin/gh/PaliC/25/next 2025-12-04T08:54:02.2985158Z * [new branch] gh/PaliC/25/orig -> origin/gh/PaliC/25/orig 2025-12-04T08:54:02.2985330Z * [new branch] gh/PaliC/26/head -> origin/gh/PaliC/26/head 2025-12-04T08:54:02.2985503Z * [new branch] gh/PaliC/26/next -> origin/gh/PaliC/26/next 2025-12-04T08:54:02.2985673Z * [new branch] gh/PaliC/26/orig -> origin/gh/PaliC/26/orig 2025-12-04T08:54:02.2985850Z * [new branch] gh/PaliC/27/next -> origin/gh/PaliC/27/next 
2025-12-04T08:54:02.2986023Z * [new branch] gh/PaliC/28/head -> origin/gh/PaliC/28/head 2025-12-04T08:54:02.2986194Z * [new branch] gh/PaliC/28/next -> origin/gh/PaliC/28/next 2025-12-04T08:54:02.2986367Z * [new branch] gh/PaliC/28/orig -> origin/gh/PaliC/28/orig 2025-12-04T08:54:02.2986540Z * [new branch] gh/PaliC/29/head -> origin/gh/PaliC/29/head 2025-12-04T08:54:02.2986712Z * [new branch] gh/PaliC/29/next -> origin/gh/PaliC/29/next 2025-12-04T08:54:02.2986885Z * [new branch] gh/PaliC/29/orig -> origin/gh/PaliC/29/orig 2025-12-04T08:54:02.2987087Z * [new branch] gh/PaliC/30/head -> origin/gh/PaliC/30/head 2025-12-04T08:54:02.2987258Z * [new branch] gh/PaliC/30/next -> origin/gh/PaliC/30/next 2025-12-04T08:54:02.2987432Z * [new branch] gh/PaliC/30/orig -> origin/gh/PaliC/30/orig 2025-12-04T08:54:02.2987605Z * [new branch] gh/PaliC/31/head -> origin/gh/PaliC/31/head 2025-12-04T08:54:02.2987776Z * [new branch] gh/PaliC/31/next -> origin/gh/PaliC/31/next 2025-12-04T08:54:02.2987949Z * [new branch] gh/PaliC/31/orig -> origin/gh/PaliC/31/orig 2025-12-04T08:54:02.2988137Z * [new branch] gh/PaulZhang12/25/base -> origin/gh/PaulZhang12/25/base 2025-12-04T08:54:02.2988328Z * [new branch] gh/PaulZhang12/25/head -> origin/gh/PaulZhang12/25/head 2025-12-04T08:54:02.2988519Z * [new branch] gh/PaulZhang12/25/orig -> origin/gh/PaulZhang12/25/orig 2025-12-04T08:54:02.2988709Z * [new branch] gh/PaulZhang12/28/base -> origin/gh/PaulZhang12/28/base 2025-12-04T08:54:02.2988902Z * [new branch] gh/PaulZhang12/28/head -> origin/gh/PaulZhang12/28/head 2025-12-04T08:54:02.2989094Z * [new branch] gh/PaulZhang12/28/orig -> origin/gh/PaulZhang12/28/orig 2025-12-04T08:54:02.2989282Z * [new branch] gh/PaulZhang12/31/base -> origin/gh/PaulZhang12/31/base 2025-12-04T08:54:02.2989475Z * [new branch] gh/PaulZhang12/31/head -> origin/gh/PaulZhang12/31/head 2025-12-04T08:54:02.2989665Z * [new branch] gh/PaulZhang12/31/orig -> origin/gh/PaulZhang12/31/orig 2025-12-04T08:54:02.2989853Z * [new branch] 
gh/PaulZhang12/37/base -> origin/gh/PaulZhang12/37/base 2025-12-04T08:54:02.2990046Z * [new branch] gh/PaulZhang12/37/head -> origin/gh/PaulZhang12/37/head 2025-12-04T08:54:02.2990269Z * [new branch] gh/PaulZhang12/37/orig -> origin/gh/PaulZhang12/37/orig 2025-12-04T08:54:02.2990459Z * [new branch] gh/PaulZhang12/40/base -> origin/gh/PaulZhang12/40/base 2025-12-04T08:54:02.2990650Z * [new branch] gh/PaulZhang12/40/head -> origin/gh/PaulZhang12/40/head 2025-12-04T08:54:02.2990844Z * [new branch] gh/PaulZhang12/40/orig -> origin/gh/PaulZhang12/40/orig 2025-12-04T08:54:02.2991034Z * [new branch] gh/PaulZhang12/42/base -> origin/gh/PaulZhang12/42/base 2025-12-04T08:54:02.2991224Z * [new branch] gh/PaulZhang12/42/head -> origin/gh/PaulZhang12/42/head 2025-12-04T08:54:02.2991413Z * [new branch] gh/PaulZhang12/43/base -> origin/gh/PaulZhang12/43/base 2025-12-04T08:54:02.2991640Z * [new branch] gh/PaulZhang12/43/head -> origin/gh/PaulZhang12/43/head 2025-12-04T08:54:02.2991829Z * [new branch] gh/PaulZhang12/43/orig -> origin/gh/PaulZhang12/43/orig 2025-12-04T08:54:02.2992019Z * [new branch] gh/PaulZhang12/44/base -> origin/gh/PaulZhang12/44/base 2025-12-04T08:54:02.2992209Z * [new branch] gh/PaulZhang12/44/head -> origin/gh/PaulZhang12/44/head 2025-12-04T08:54:02.2992402Z * [new branch] gh/PaulZhang12/45/base -> origin/gh/PaulZhang12/45/base 2025-12-04T08:54:02.2992592Z * [new branch] gh/PaulZhang12/45/head -> origin/gh/PaulZhang12/45/head 2025-12-04T08:54:02.2992782Z * [new branch] gh/PaulZhang12/45/orig -> origin/gh/PaulZhang12/45/orig 2025-12-04T08:54:02.2992975Z * [new branch] gh/PaulZhang12/46/base -> origin/gh/PaulZhang12/46/base 2025-12-04T08:54:02.2993168Z * [new branch] gh/PaulZhang12/46/head -> origin/gh/PaulZhang12/46/head 2025-12-04T08:54:02.2993358Z * [new branch] gh/PaulZhang12/46/orig -> origin/gh/PaulZhang12/46/orig 2025-12-04T08:54:02.2993548Z * [new branch] gh/PaulZhang12/47/base -> origin/gh/PaulZhang12/47/base 2025-12-04T08:54:02.2993745Z * [new branch] 
gh/PaulZhang12/47/head -> origin/gh/PaulZhang12/47/head 2025-12-04T08:54:02.2993977Z * [new branch] gh/PaulZhang12/47/orig -> origin/gh/PaulZhang12/47/orig 2025-12-04T08:54:02.2994168Z * [new branch] gh/PaulZhang12/48/base -> origin/gh/PaulZhang12/48/base 2025-12-04T08:54:02.2994356Z * [new branch] gh/PaulZhang12/48/head -> origin/gh/PaulZhang12/48/head 2025-12-04T08:54:02.2994545Z * [new branch] gh/PaulZhang12/48/orig -> origin/gh/PaulZhang12/48/orig 2025-12-04T08:54:02.2994736Z * [new branch] gh/SamGinzburg/11/base -> origin/gh/SamGinzburg/11/base 2025-12-04T08:54:02.2994927Z * [new branch] gh/SamGinzburg/11/head -> origin/gh/SamGinzburg/11/head 2025-12-04T08:54:02.2995130Z * [new branch] gh/SherlockNoMad/1/base -> origin/gh/SherlockNoMad/1/base 2025-12-04T08:54:02.2995332Z * [new branch] gh/SherlockNoMad/1/head -> origin/gh/SherlockNoMad/1/head 2025-12-04T08:54:02.2995532Z * [new branch] gh/SherlockNoMad/10/base -> origin/gh/SherlockNoMad/10/base 2025-12-04T08:54:02.2995735Z * [new branch] gh/SherlockNoMad/10/head -> origin/gh/SherlockNoMad/10/head 2025-12-04T08:54:02.2995937Z * [new branch] gh/SherlockNoMad/10/orig -> origin/gh/SherlockNoMad/10/orig 2025-12-04T08:54:02.2996137Z * [new branch] gh/SherlockNoMad/11/base -> origin/gh/SherlockNoMad/11/base 2025-12-04T08:54:02.2996339Z * [new branch] gh/SherlockNoMad/11/head -> origin/gh/SherlockNoMad/11/head 2025-12-04T08:54:02.2996539Z * [new branch] gh/SherlockNoMad/11/orig -> origin/gh/SherlockNoMad/11/orig 2025-12-04T08:54:02.2996738Z * [new branch] gh/SherlockNoMad/12/base -> origin/gh/SherlockNoMad/12/base 2025-12-04T08:54:02.2996939Z * [new branch] gh/SherlockNoMad/12/head -> origin/gh/SherlockNoMad/12/head 2025-12-04T08:54:02.2997139Z * [new branch] gh/SherlockNoMad/12/orig -> origin/gh/SherlockNoMad/12/orig 2025-12-04T08:54:02.2997338Z * [new branch] gh/SherlockNoMad/15/base -> origin/gh/SherlockNoMad/15/base 2025-12-04T08:54:02.2997537Z * [new branch] gh/SherlockNoMad/15/head -> 
origin/gh/SherlockNoMad/15/head 2025-12-04T08:54:02.2997737Z * [new branch] gh/SherlockNoMad/15/orig -> origin/gh/SherlockNoMad/15/orig 2025-12-04T08:54:02.2997935Z * [new branch] gh/SherlockNoMad/17/base -> origin/gh/SherlockNoMad/17/base 2025-12-04T08:54:02.2998142Z * [new branch] gh/SherlockNoMad/17/head -> origin/gh/SherlockNoMad/17/head 2025-12-04T08:54:02.2998379Z * [new branch] gh/SherlockNoMad/17/orig -> origin/gh/SherlockNoMad/17/orig 2025-12-04T08:54:02.2998577Z * [new branch] gh/SherlockNoMad/18/base -> origin/gh/SherlockNoMad/18/base 2025-12-04T08:54:02.2998777Z * [new branch] gh/SherlockNoMad/18/head -> origin/gh/SherlockNoMad/18/head 2025-12-04T08:54:02.2998977Z * [new branch] gh/SherlockNoMad/18/orig -> origin/gh/SherlockNoMad/18/orig 2025-12-04T08:54:02.2999176Z * [new branch] gh/SherlockNoMad/19/base -> origin/gh/SherlockNoMad/19/base 2025-12-04T08:54:02.2999377Z * [new branch] gh/SherlockNoMad/19/head -> origin/gh/SherlockNoMad/19/head 2025-12-04T08:54:02.2999574Z * [new branch] gh/SherlockNoMad/19/orig -> origin/gh/SherlockNoMad/19/orig 2025-12-04T08:54:02.2999780Z * [new branch] gh/SherlockNoMad/2/base -> origin/gh/SherlockNoMad/2/base 2025-12-04T08:54:02.2999977Z * [new branch] gh/SherlockNoMad/2/head -> origin/gh/SherlockNoMad/2/head 2025-12-04T08:54:02.3000206Z * [new branch] gh/SherlockNoMad/20/base -> origin/gh/SherlockNoMad/20/base 2025-12-04T08:54:02.3000406Z * [new branch] gh/SherlockNoMad/20/head -> origin/gh/SherlockNoMad/20/head 2025-12-04T08:54:02.3000608Z * [new branch] gh/SherlockNoMad/20/orig -> origin/gh/SherlockNoMad/20/orig 2025-12-04T08:54:02.3000841Z * [new branch] gh/SherlockNoMad/21/base -> origin/gh/SherlockNoMad/21/base 2025-12-04T08:54:02.3001042Z * [new branch] gh/SherlockNoMad/21/head -> origin/gh/SherlockNoMad/21/head 2025-12-04T08:54:02.3001246Z * [new branch] gh/SherlockNoMad/21/orig -> origin/gh/SherlockNoMad/21/orig 2025-12-04T08:54:02.3001444Z * [new branch] gh/SherlockNoMad/3/base -> 
origin/gh/SherlockNoMad/3/base 2025-12-04T08:54:02.3001642Z * [new branch] gh/SherlockNoMad/3/head -> origin/gh/SherlockNoMad/3/head 2025-12-04T08:54:02.3001841Z * [new branch] gh/SherlockNoMad/4/base -> origin/gh/SherlockNoMad/4/base 2025-12-04T08:54:02.3002037Z * [new branch] gh/SherlockNoMad/4/head -> origin/gh/SherlockNoMad/4/head 2025-12-04T08:54:02.3002233Z * [new branch] gh/SherlockNoMad/5/base -> origin/gh/SherlockNoMad/5/base 2025-12-04T08:54:02.3002432Z * [new branch] gh/SherlockNoMad/5/head -> origin/gh/SherlockNoMad/5/head 2025-12-04T08:54:02.3002641Z * [new branch] gh/Sidharth123-cpu/24/base -> origin/gh/Sidharth123-cpu/24/base 2025-12-04T08:54:02.3002920Z * [new branch] gh/Sidharth123-cpu/25/base -> origin/gh/Sidharth123-cpu/25/base 2025-12-04T08:54:02.3003182Z * [new branch] gh/Sidharth123-cpu/26/base -> origin/gh/Sidharth123-cpu/26/base 2025-12-04T08:54:02.3003436Z * [new branch] gh/Sidharth123-cpu/27/base -> origin/gh/Sidharth123-cpu/27/base 2025-12-04T08:54:02.3003871Z * [new branch] gh/StrongerXi/1/base -> origin/gh/StrongerXi/1/base 2025-12-04T08:54:02.3004152Z * [new branch] gh/StrongerXi/1/head -> origin/gh/StrongerXi/1/head 2025-12-04T08:54:02.3004374Z * [new branch] gh/StrongerXi/71/base -> origin/gh/StrongerXi/71/base 2025-12-04T08:54:02.3004627Z * [new branch] gh/StrongerXi/71/head -> origin/gh/StrongerXi/71/head 2025-12-04T08:54:02.3004844Z * [new branch] gh/StrongerXi/72/base -> origin/gh/StrongerXi/72/base 2025-12-04T08:54:02.3005068Z * [new branch] gh/StrongerXi/72/head -> origin/gh/StrongerXi/72/head 2025-12-04T08:54:02.3005313Z * [new branch] gh/StrongerXi/73/base -> origin/gh/StrongerXi/73/base 2025-12-04T08:54:02.3005519Z * [new branch] gh/StrongerXi/73/head -> origin/gh/StrongerXi/73/head 2025-12-04T08:54:02.3005741Z * [new branch] gh/StrongerXi/73/orig -> origin/gh/StrongerXi/73/orig 2025-12-04T08:54:02.3005980Z * [new branch] gh/XilunWu/160/base -> origin/gh/XilunWu/160/base 2025-12-04T08:54:02.3006228Z * [new branch] 
gh/XilunWu/160/head -> origin/gh/XilunWu/160/head 2025-12-04T08:54:02.3006436Z * [new branch] gh/XilunWu/160/orig -> origin/gh/XilunWu/160/orig 2025-12-04T08:54:02.3006816Z * [new branch] gh/XilunWu/163/base -> origin/gh/XilunWu/163/base 2025-12-04T08:54:02.3007021Z * [new branch] gh/XilunWu/163/head -> origin/gh/XilunWu/163/head 2025-12-04T08:54:02.3007229Z * [new branch] gh/XilunWu/163/orig -> origin/gh/XilunWu/163/orig 2025-12-04T08:54:02.3007470Z * [new branch] gh/XilunWu/168/base -> origin/gh/XilunWu/168/base 2025-12-04T08:54:02.3007667Z * [new branch] gh/XilunWu/168/head -> origin/gh/XilunWu/168/head 2025-12-04T08:54:02.3025618Z * [new branch] gh/XilunWu/168/orig -> origin/gh/XilunWu/168/orig 2025-12-04T08:54:02.3025976Z * [new branch] gh/XilunWu/169/base -> origin/gh/XilunWu/169/base 2025-12-04T08:54:02.3026201Z * [new branch] gh/XilunWu/169/head -> origin/gh/XilunWu/169/head 2025-12-04T08:54:02.3026437Z * [new branch] gh/XilunWu/169/orig -> origin/gh/XilunWu/169/orig 2025-12-04T08:54:02.3026649Z * [new branch] gh/XilunWu/170/base -> origin/gh/XilunWu/170/base 2025-12-04T08:54:02.3026924Z * [new branch] gh/XilunWu/170/head -> origin/gh/XilunWu/170/head 2025-12-04T08:54:02.3027119Z * [new branch] gh/XilunWu/170/orig -> origin/gh/XilunWu/170/orig 2025-12-04T08:54:02.3027315Z * [new branch] gh/XilunWu/171/base -> origin/gh/XilunWu/171/base 2025-12-04T08:54:02.3027501Z * [new branch] gh/XilunWu/171/head -> origin/gh/XilunWu/171/head 2025-12-04T08:54:02.3027694Z * [new branch] gh/XilunWu/171/orig -> origin/gh/XilunWu/171/orig 2025-12-04T08:54:02.3027880Z * [new branch] gh/XilunWu/173/base -> origin/gh/XilunWu/173/base 2025-12-04T08:54:02.3028073Z * [new branch] gh/XilunWu/173/head -> origin/gh/XilunWu/173/head 2025-12-04T08:54:02.3028264Z * [new branch] gh/XilunWu/173/orig -> origin/gh/XilunWu/173/orig 2025-12-04T08:54:02.3028449Z * [new branch] gh/XilunWu/175/base -> origin/gh/XilunWu/175/base 2025-12-04T08:54:02.3028641Z * [new branch] gh/XilunWu/175/head -> 
origin/gh/XilunWu/175/head 2025-12-04T08:54:02.3028833Z * [new branch] gh/XilunWu/175/orig -> origin/gh/XilunWu/175/orig 2025-12-04T08:54:02.3029019Z * [new branch] gh/XilunWu/176/base -> origin/gh/XilunWu/176/base 2025-12-04T08:54:02.3029209Z * [new branch] gh/XilunWu/176/head -> origin/gh/XilunWu/176/head 2025-12-04T08:54:02.3029403Z * [new branch] gh/XilunWu/176/orig -> origin/gh/XilunWu/176/orig 2025-12-04T08:54:02.3029595Z * [new branch] gh/XuehaiPan/14/base -> origin/gh/XuehaiPan/14/base 2025-12-04T08:54:02.3029797Z * [new branch] gh/XuehaiPan/14/head -> origin/gh/XuehaiPan/14/head 2025-12-04T08:54:02.3029996Z * [new branch] gh/XuehaiPan/14/orig -> origin/gh/XuehaiPan/14/orig 2025-12-04T08:54:02.3030236Z * [new branch] gh/XuehaiPan/179/base -> origin/gh/XuehaiPan/179/base 2025-12-04T08:54:02.3030438Z * [new branch] gh/XuehaiPan/179/head -> origin/gh/XuehaiPan/179/head 2025-12-04T08:54:02.3030639Z * [new branch] gh/XuehaiPan/179/orig -> origin/gh/XuehaiPan/179/orig 2025-12-04T08:54:02.3030832Z * [new branch] gh/XuehaiPan/249/base -> origin/gh/XuehaiPan/249/base 2025-12-04T08:54:02.3031030Z * [new branch] gh/XuehaiPan/249/head -> origin/gh/XuehaiPan/249/head 2025-12-04T08:54:02.3031226Z * [new branch] gh/XuehaiPan/249/orig -> origin/gh/XuehaiPan/249/orig 2025-12-04T08:54:02.3031477Z * [new branch] gh/XuehaiPan/253/base -> origin/gh/XuehaiPan/253/base 2025-12-04T08:54:02.3031675Z * [new branch] gh/XuehaiPan/253/head -> origin/gh/XuehaiPan/253/head 2025-12-04T08:54:02.3031877Z * [new branch] gh/XuehaiPan/253/orig -> origin/gh/XuehaiPan/253/orig 2025-12-04T08:54:02.3032070Z * [new branch] gh/XuehaiPan/254/base -> origin/gh/XuehaiPan/254/base 2025-12-04T08:54:02.3032270Z * [new branch] gh/XuehaiPan/254/head -> origin/gh/XuehaiPan/254/head 2025-12-04T08:54:02.3032461Z * [new branch] gh/XuehaiPan/254/orig -> origin/gh/XuehaiPan/254/orig 2025-12-04T08:54:02.3032659Z * [new branch] gh/XuehaiPan/255/base -> origin/gh/XuehaiPan/255/base 2025-12-04T08:54:02.3032857Z * 
[new branch] gh/XuehaiPan/255/head -> origin/gh/XuehaiPan/255/head 2025-12-04T08:54:02.3033040Z * [new branch] gh/XuehaiPan/255/orig -> origin/gh/XuehaiPan/255/orig 2025-12-04T08:54:02.3033226Z * [new branch] gh/XuehaiPan/271/base -> origin/gh/XuehaiPan/271/base 2025-12-04T08:54:02.3033410Z * [new branch] gh/XuehaiPan/271/head -> origin/gh/XuehaiPan/271/head 2025-12-04T08:54:02.3033593Z * [new branch] gh/XuehaiPan/271/orig -> origin/gh/XuehaiPan/271/orig 2025-12-04T08:54:02.3033776Z * [new branch] gh/XuehaiPan/343/base -> origin/gh/XuehaiPan/343/base 2025-12-04T08:54:02.3033988Z * [new branch] gh/XuehaiPan/343/head -> origin/gh/XuehaiPan/343/head 2025-12-04T08:54:02.3034171Z * [new branch] gh/XuehaiPan/343/orig -> origin/gh/XuehaiPan/343/orig 2025-12-04T08:54:02.3034353Z * [new branch] gh/XuehaiPan/347/base -> origin/gh/XuehaiPan/347/base 2025-12-04T08:54:02.3034538Z * [new branch] gh/XuehaiPan/347/head -> origin/gh/XuehaiPan/347/head 2025-12-04T08:54:02.3034722Z * [new branch] gh/XuehaiPan/347/orig -> origin/gh/XuehaiPan/347/orig 2025-12-04T08:54:02.3034906Z * [new branch] gh/XuehaiPan/348/base -> origin/gh/XuehaiPan/348/base 2025-12-04T08:54:02.3035091Z * [new branch] gh/XuehaiPan/348/head -> origin/gh/XuehaiPan/348/head 2025-12-04T08:54:02.3035275Z * [new branch] gh/XuehaiPan/348/orig -> origin/gh/XuehaiPan/348/orig 2025-12-04T08:54:02.3035464Z * [new branch] gh/XuehaiPan/350/base -> origin/gh/XuehaiPan/350/base 2025-12-04T08:54:02.3035647Z * [new branch] gh/XuehaiPan/350/head -> origin/gh/XuehaiPan/350/head 2025-12-04T08:54:02.3035831Z * [new branch] gh/XuehaiPan/350/orig -> origin/gh/XuehaiPan/350/orig 2025-12-04T08:54:02.3036019Z * [new branch] gh/XuehaiPan/365/base -> origin/gh/XuehaiPan/365/base 2025-12-04T08:54:02.3036207Z * [new branch] gh/XuehaiPan/365/head -> origin/gh/XuehaiPan/365/head 2025-12-04T08:54:02.3036391Z * [new branch] gh/XuehaiPan/365/orig -> origin/gh/XuehaiPan/365/orig 2025-12-04T08:54:02.3036576Z * [new branch] gh/XuehaiPan/366/base -> 
origin/gh/XuehaiPan/366/base 2025-12-04T08:54:02.3036760Z * [new branch] gh/XuehaiPan/366/head -> origin/gh/XuehaiPan/366/head 2025-12-04T08:54:02.3036944Z * [new branch] gh/XuehaiPan/370/base -> origin/gh/XuehaiPan/370/base 2025-12-04T08:54:02.3037128Z * [new branch] gh/XuehaiPan/370/head -> origin/gh/XuehaiPan/370/head 2025-12-04T08:54:02.3037310Z * [new branch] gh/XuehaiPan/370/orig -> origin/gh/XuehaiPan/370/orig 2025-12-04T08:54:02.3037493Z * [new branch] gh/XuehaiPan/390/base -> origin/gh/XuehaiPan/390/base 2025-12-04T08:54:02.3037677Z * [new branch] gh/XuehaiPan/390/head -> origin/gh/XuehaiPan/390/head 2025-12-04T08:54:02.3037861Z * [new branch] gh/XuehaiPan/390/orig -> origin/gh/XuehaiPan/390/orig 2025-12-04T08:54:02.3038046Z * [new branch] gh/XuehaiPan/391/base -> origin/gh/XuehaiPan/391/base 2025-12-04T08:54:02.3038257Z * [new branch] gh/XuehaiPan/391/head -> origin/gh/XuehaiPan/391/head 2025-12-04T08:54:02.3038440Z * [new branch] gh/XuehaiPan/391/orig -> origin/gh/XuehaiPan/391/orig 2025-12-04T08:54:02.3038625Z * [new branch] gh/XuehaiPan/392/base -> origin/gh/XuehaiPan/392/base 2025-12-04T08:54:02.3038811Z * [new branch] gh/XuehaiPan/392/head -> origin/gh/XuehaiPan/392/head 2025-12-04T08:54:02.3038996Z * [new branch] gh/XuehaiPan/392/orig -> origin/gh/XuehaiPan/392/orig 2025-12-04T08:54:02.3039180Z * [new branch] gh/XuehaiPan/394/base -> origin/gh/XuehaiPan/394/base 2025-12-04T08:54:02.3039362Z * [new branch] gh/XuehaiPan/394/head -> origin/gh/XuehaiPan/394/head 2025-12-04T08:54:02.3039549Z * [new branch] gh/XuehaiPan/394/orig -> origin/gh/XuehaiPan/394/orig 2025-12-04T08:54:02.3039736Z * [new branch] gh/XuehaiPan/397/base -> origin/gh/XuehaiPan/397/base 2025-12-04T08:54:02.3039924Z * [new branch] gh/XuehaiPan/397/head -> origin/gh/XuehaiPan/397/head 2025-12-04T08:54:02.3040147Z * [new branch] gh/XuehaiPan/397/orig -> origin/gh/XuehaiPan/397/orig 2025-12-04T08:54:02.3040339Z * [new branch] gh/XuehaiPan/398/base -> origin/gh/XuehaiPan/398/base 
2025-12-04T08:54:02.3040562Z * [new branch] gh/XuehaiPan/398/head -> origin/gh/XuehaiPan/398/head 2025-12-04T08:54:02.3040752Z * [new branch] gh/XuehaiPan/398/orig -> origin/gh/XuehaiPan/398/orig 2025-12-04T08:54:02.3040941Z * [new branch] gh/XuehaiPan/399/base -> origin/gh/XuehaiPan/399/base 2025-12-04T08:54:02.3041125Z * [new branch] gh/XuehaiPan/399/head -> origin/gh/XuehaiPan/399/head 2025-12-04T08:54:02.3041314Z * [new branch] gh/XuehaiPan/399/orig -> origin/gh/XuehaiPan/399/orig 2025-12-04T08:54:02.3041505Z * [new branch] gh/XuehaiPan/400/base -> origin/gh/XuehaiPan/400/base 2025-12-04T08:54:02.3041691Z * [new branch] gh/XuehaiPan/400/head -> origin/gh/XuehaiPan/400/head 2025-12-04T08:54:02.3041880Z * [new branch] gh/XuehaiPan/400/orig -> origin/gh/XuehaiPan/400/orig 2025-12-04T08:54:02.3042079Z * [new branch] gh/ZhiweiYan-96/39/base -> origin/gh/ZhiweiYan-96/39/base 2025-12-04T08:54:02.3042277Z * [new branch] gh/ZhiweiYan-96/39/head -> origin/gh/ZhiweiYan-96/39/head 2025-12-04T08:54:02.3042471Z * [new branch] gh/ZhiweiYan-96/39/orig -> origin/gh/ZhiweiYan-96/39/orig 2025-12-04T08:54:02.3042664Z * [new branch] gh/ZhiweiYan-96/44/base -> origin/gh/ZhiweiYan-96/44/base 2025-12-04T08:54:02.3042852Z * [new branch] gh/ZhiweiYan-96/44/head -> origin/gh/ZhiweiYan-96/44/head 2025-12-04T08:54:02.3043045Z * [new branch] gh/ZhiweiYan-96/45/base -> origin/gh/ZhiweiYan-96/45/base 2025-12-04T08:54:02.3043238Z * [new branch] gh/ZhiweiYan-96/45/head -> origin/gh/ZhiweiYan-96/45/head 2025-12-04T08:54:02.3043430Z * [new branch] gh/ZhiweiYan-96/49/base -> origin/gh/ZhiweiYan-96/49/base 2025-12-04T08:54:02.3043620Z * [new branch] gh/ZhiweiYan-96/49/head -> origin/gh/ZhiweiYan-96/49/head 2025-12-04T08:54:02.3043812Z * [new branch] gh/ZhiweiYan-96/62/base -> origin/gh/ZhiweiYan-96/62/base 2025-12-04T08:54:02.3044005Z * [new branch] gh/ZhiweiYan-96/62/head -> origin/gh/ZhiweiYan-96/62/head 2025-12-04T08:54:02.3044195Z * [new branch] gh/ZhiweiYan-96/66/base -> 
origin/gh/ZhiweiYan-96/66/base 2025-12-04T08:54:02.3044386Z * [new branch] gh/ZhiweiYan-96/66/head -> origin/gh/ZhiweiYan-96/66/head 2025-12-04T08:54:02.3044576Z * [new branch] gh/ZhiweiYan-96/67/base -> origin/gh/ZhiweiYan-96/67/base 2025-12-04T08:54:02.3044766Z * [new branch] gh/ZhiweiYan-96/67/head -> origin/gh/ZhiweiYan-96/67/head 2025-12-04T08:54:02.3044985Z * [new branch] gh/ZhiweiYan-96/68/base -> origin/gh/ZhiweiYan-96/68/base 2025-12-04T08:54:02.3045175Z * [new branch] gh/ZhiweiYan-96/68/head -> origin/gh/ZhiweiYan-96/68/head 2025-12-04T08:54:02.3045366Z * [new branch] gh/ZhiweiYan-96/68/orig -> origin/gh/ZhiweiYan-96/68/orig 2025-12-04T08:54:02.3045558Z * [new branch] gh/aakhundov/1/base -> origin/gh/aakhundov/1/base 2025-12-04T08:54:02.3045744Z * [new branch] gh/aakhundov/1/head -> origin/gh/aakhundov/1/head 2025-12-04T08:54:02.3045929Z * [new branch] gh/aakhundov/2/base -> origin/gh/aakhundov/2/base 2025-12-04T08:54:02.3046109Z * [new branch] gh/aakhundov/2/head -> origin/gh/aakhundov/2/head 2025-12-04T08:54:02.3046298Z * [new branch] gh/aditew01/openblas -> origin/gh/aditew01/openblas 2025-12-04T08:54:02.3046486Z * [new branch] gh/aditew01/sbgemm -> origin/gh/aditew01/sbgemm 2025-12-04T08:54:02.3046669Z * [new branch] gh/aditew01/vecbf16 -> origin/gh/aditew01/vecbf16 2025-12-04T08:54:02.3046851Z * [new branch] gh/albanD/4/base -> origin/gh/albanD/4/base 2025-12-04T08:54:02.3047029Z * [new branch] gh/albanD/4/head -> origin/gh/albanD/4/head 2025-12-04T08:54:02.3047223Z * [new branch] gh/albanD/4/orig -> origin/gh/albanD/4/orig 2025-12-04T08:54:02.3047493Z * [new branch] gh/alexbrauckmann/paddedtensor_faketensor_init -> origin/gh/alexbrauckmann/paddedtensor_faketensor_init 2025-12-04T08:54:02.3047768Z * [new branch] gh/alexsamardzic/12/base -> origin/gh/alexsamardzic/12/base 2025-12-04T08:54:02.3047971Z * [new branch] gh/alexsamardzic/12/head -> origin/gh/alexsamardzic/12/head 2025-12-04T08:54:02.3048172Z * [new branch] gh/alexsamardzic/12/orig -> 
origin/gh/alexsamardzic/12/orig 2025-12-04T08:54:02.3048372Z * [new branch] gh/alexsamardzic/14/base -> origin/gh/alexsamardzic/14/base 2025-12-04T08:54:02.3048571Z * [new branch] gh/alexsamardzic/14/head -> origin/gh/alexsamardzic/14/head 2025-12-04T08:54:02.3048771Z * [new branch] gh/alexsamardzic/14/orig -> origin/gh/alexsamardzic/14/orig 2025-12-04T08:54:02.3048970Z * [new branch] gh/alexsamardzic/15/base -> origin/gh/alexsamardzic/15/base 2025-12-04T08:54:02.3049169Z * [new branch] gh/alexsamardzic/15/head -> origin/gh/alexsamardzic/15/head 2025-12-04T08:54:02.3049368Z * [new branch] gh/alexsamardzic/15/orig -> origin/gh/alexsamardzic/15/orig 2025-12-04T08:54:02.3049560Z * [new branch] gh/amjames/18/base -> origin/gh/amjames/18/base 2025-12-04T08:54:02.3049745Z * [new branch] gh/amjames/18/head -> origin/gh/amjames/18/head 2025-12-04T08:54:02.3049926Z * [new branch] gh/amjames/18/orig -> origin/gh/amjames/18/orig 2025-12-04T08:54:02.3050154Z * [new branch] gh/andrewor14/35/base -> origin/gh/andrewor14/35/base 2025-12-04T08:54:02.3050346Z * [new branch] gh/andrewor14/35/head -> origin/gh/andrewor14/35/head 2025-12-04T08:54:02.3050540Z * [new branch] gh/andrewor14/35/orig -> origin/gh/andrewor14/35/orig 2025-12-04T08:54:02.3050729Z * [new branch] gh/andrewor14/50/base -> origin/gh/andrewor14/50/base 2025-12-04T08:54:02.3050920Z * [new branch] gh/andrewor14/50/head -> origin/gh/andrewor14/50/head 2025-12-04T08:54:02.3051116Z * [new branch] gh/andrewor14/50/orig -> origin/gh/andrewor14/50/orig 2025-12-04T08:54:02.3051304Z * [new branch] gh/andyanwang/30/base -> origin/gh/andyanwang/30/base 2025-12-04T08:54:02.3051501Z * [new branch] gh/andyanwang/30/orig -> origin/gh/andyanwang/30/orig 2025-12-04T08:54:02.3051692Z * [new branch] gh/andyanwang/31/base -> origin/gh/andyanwang/31/base 2025-12-04T08:54:02.3051933Z * [new branch] gh/andyanwang/31/orig -> origin/gh/andyanwang/31/orig 2025-12-04T08:54:02.3052141Z * [new branch] gh/andyanwang/39/base -> 
origin/gh/andyanwang/39/base 2025-12-04T08:54:02.3052331Z * [new branch] gh/andyanwang/39/head -> origin/gh/andyanwang/39/head 2025-12-04T08:54:02.3052521Z * [new branch] gh/andyanwang/39/orig -> origin/gh/andyanwang/39/orig 2025-12-04T08:54:02.3052712Z * [new branch] gh/andyanwang/42/base -> origin/gh/andyanwang/42/base 2025-12-04T08:54:02.3052899Z * [new branch] gh/andyanwang/42/head -> origin/gh/andyanwang/42/head 2025-12-04T08:54:02.3053085Z * [new branch] gh/andyanwang/42/orig -> origin/gh/andyanwang/42/orig 2025-12-04T08:54:02.3053275Z * [new branch] gh/andyanwang/45/base -> origin/gh/andyanwang/45/base 2025-12-04T08:54:02.3053466Z * [new branch] gh/andyanwang/45/head -> origin/gh/andyanwang/45/head 2025-12-04T08:54:02.3053660Z * [new branch] gh/andyanwang/45/orig -> origin/gh/andyanwang/45/orig 2025-12-04T08:54:02.3053863Z * [new branch] gh/angelayi/107/base -> origin/gh/angelayi/107/base 2025-12-04T08:54:02.3054054Z * [new branch] gh/angelayi/107/head -> origin/gh/angelayi/107/head 2025-12-04T08:54:02.3054270Z * [new branch] gh/angelayi/114/base -> origin/gh/angelayi/114/base 2025-12-04T08:54:02.3054703Z * [new branch] gh/angelayi/114/head -> origin/gh/angelayi/114/head 2025-12-04T08:54:02.3054886Z * [new branch] gh/angelayi/114/orig -> origin/gh/angelayi/114/orig 2025-12-04T08:54:02.3055066Z * [new branch] gh/angelayi/116/base -> origin/gh/angelayi/116/base 2025-12-04T08:54:02.3055250Z * [new branch] gh/angelayi/116/head -> origin/gh/angelayi/116/head 2025-12-04T08:54:02.3055434Z * [new branch] gh/angelayi/116/orig -> origin/gh/angelayi/116/orig 2025-12-04T08:54:02.3055616Z * [new branch] gh/angelayi/122/base -> origin/gh/angelayi/122/base 2025-12-04T08:54:02.3055800Z * [new branch] gh/angelayi/122/head -> origin/gh/angelayi/122/head 2025-12-04T08:54:02.3055981Z * [new branch] gh/angelayi/122/orig -> origin/gh/angelayi/122/orig 2025-12-04T08:54:02.3056165Z * [new branch] gh/angelayi/124/base -> origin/gh/angelayi/124/base 2025-12-04T08:54:02.3056351Z * 
[new branch] gh/angelayi/124/head -> origin/gh/angelayi/124/head 2025-12-04T08:54:02.3056532Z * [new branch] gh/angelayi/124/orig -> origin/gh/angelayi/124/orig 2025-12-04T08:54:02.3056718Z * [new branch] gh/angelayi/128/base -> origin/gh/angelayi/128/base 2025-12-04T08:54:02.3056901Z * [new branch] gh/angelayi/128/head -> origin/gh/angelayi/128/head 2025-12-04T08:54:02.3057087Z * [new branch] gh/angelayi/128/orig -> origin/gh/angelayi/128/orig 2025-12-04T08:54:02.3057272Z * [new branch] gh/angelayi/131/base -> origin/gh/angelayi/131/base 2025-12-04T08:54:02.3057455Z * [new branch] gh/angelayi/131/head -> origin/gh/angelayi/131/head 2025-12-04T08:54:02.3057636Z * [new branch] gh/angelayi/131/orig -> origin/gh/angelayi/131/orig 2025-12-04T08:54:02.3057824Z * [new branch] gh/angelayi/132/base -> origin/gh/angelayi/132/base 2025-12-04T08:54:02.3058008Z * [new branch] gh/angelayi/132/head -> origin/gh/angelayi/132/head 2025-12-04T08:54:02.3058189Z * [new branch] gh/angelayi/132/orig -> origin/gh/angelayi/132/orig 2025-12-04T08:54:02.3058372Z * [new branch] gh/angelayi/133/base -> origin/gh/angelayi/133/base 2025-12-04T08:54:02.3058555Z * [new branch] gh/angelayi/133/head -> origin/gh/angelayi/133/head 2025-12-04T08:54:02.3058760Z * [new branch] gh/angelayi/133/orig -> origin/gh/angelayi/133/orig 2025-12-04T08:54:02.3058945Z * [new branch] gh/angelayi/134/base -> origin/gh/angelayi/134/base 2025-12-04T08:54:02.3059127Z * [new branch] gh/angelayi/134/head -> origin/gh/angelayi/134/head 2025-12-04T08:54:02.3059309Z * [new branch] gh/angelayi/134/orig -> origin/gh/angelayi/134/orig 2025-12-04T08:54:02.3059496Z * [new branch] gh/angelayi/135/base -> origin/gh/angelayi/135/base 2025-12-04T08:54:02.3059677Z * [new branch] gh/angelayi/135/head -> origin/gh/angelayi/135/head 2025-12-04T08:54:02.3059859Z * [new branch] gh/angelayi/135/orig -> origin/gh/angelayi/135/orig 2025-12-04T08:54:02.3060044Z * [new branch] gh/angelayi/136/base -> origin/gh/angelayi/136/base 
2025-12-04T08:54:02.3060288Z * [new branch] gh/angelayi/136/head -> origin/gh/angelayi/136/head 2025-12-04T08:54:02.3060471Z * [new branch] gh/angelayi/136/orig -> origin/gh/angelayi/136/orig 2025-12-04T08:54:02.3060654Z * [new branch] gh/angelayi/137/base -> origin/gh/angelayi/137/base 2025-12-04T08:54:02.3060836Z * [new branch] gh/angelayi/137/head -> origin/gh/angelayi/137/head 2025-12-04T08:54:02.3061048Z * [new branch] gh/angelayi/137/orig -> origin/gh/angelayi/137/orig 2025-12-04T08:54:02.3061231Z * [new branch] gh/angelayi/138/base -> origin/gh/angelayi/138/base 2025-12-04T08:54:02.3061412Z * [new branch] gh/angelayi/138/head -> origin/gh/angelayi/138/head 2025-12-04T08:54:02.3061593Z * [new branch] gh/angelayi/138/orig -> origin/gh/angelayi/138/orig 2025-12-04T08:54:02.3061775Z * [new branch] gh/angelayi/139/base -> origin/gh/angelayi/139/base 2025-12-04T08:54:02.3061955Z * [new branch] gh/angelayi/139/head -> origin/gh/angelayi/139/head 2025-12-04T08:54:02.3062138Z * [new branch] gh/angelayi/139/orig -> origin/gh/angelayi/139/orig 2025-12-04T08:54:02.3062319Z * [new branch] gh/angelayi/140/base -> origin/gh/angelayi/140/base 2025-12-04T08:54:02.3062499Z * [new branch] gh/angelayi/140/head -> origin/gh/angelayi/140/head 2025-12-04T08:54:02.3062681Z * [new branch] gh/angelayi/140/orig -> origin/gh/angelayi/140/orig 2025-12-04T08:54:02.3062862Z * [new branch] gh/angelayi/141/base -> origin/gh/angelayi/141/base 2025-12-04T08:54:02.3063044Z * [new branch] gh/angelayi/141/head -> origin/gh/angelayi/141/head 2025-12-04T08:54:02.3063224Z * [new branch] gh/angelayi/141/orig -> origin/gh/angelayi/141/orig 2025-12-04T08:54:02.3063403Z * [new branch] gh/angelayi/142/base -> origin/gh/angelayi/142/base 2025-12-04T08:54:02.3063583Z * [new branch] gh/angelayi/142/head -> origin/gh/angelayi/142/head 2025-12-04T08:54:02.3063769Z * [new branch] gh/angelayi/142/orig -> origin/gh/angelayi/142/orig 2025-12-04T08:54:02.3063947Z * [new branch] gh/angelayi/143/base -> 
origin/gh/angelayi/143/base 2025-12-04T08:54:02.3064128Z * [new branch] gh/angelayi/143/head -> origin/gh/angelayi/143/head 2025-12-04T08:54:02.3064313Z * [new branch] gh/angelayi/143/orig -> origin/gh/angelayi/143/orig 2025-12-04T08:54:02.3064546Z * [new branch] gh/angelayi/144/base -> origin/gh/angelayi/144/base 2025-12-04T08:54:02.3064728Z * [new branch] gh/angelayi/144/head -> origin/gh/angelayi/144/head 2025-12-04T08:54:02.3064910Z * [new branch] gh/angelayi/144/orig -> origin/gh/angelayi/144/orig 2025-12-04T08:54:02.3065098Z * [new branch] gh/anijain2305/753/base -> origin/gh/anijain2305/753/base 2025-12-04T08:54:02.3065290Z * [new branch] gh/anijain2305/753/head -> origin/gh/anijain2305/753/head 2025-12-04T08:54:02.3065516Z * [new branch] gh/anijain2305/753/orig -> origin/gh/anijain2305/753/orig 2025-12-04T08:54:02.3065703Z * [new branch] gh/anijain2305/810/base -> origin/gh/anijain2305/810/base 2025-12-04T08:54:02.3065891Z * [new branch] gh/anijain2305/810/head -> origin/gh/anijain2305/810/head 2025-12-04T08:54:02.3066082Z * [new branch] gh/anijain2305/810/orig -> origin/gh/anijain2305/810/orig 2025-12-04T08:54:02.3066270Z * [new branch] gh/anijain2305/854/base -> origin/gh/anijain2305/854/base 2025-12-04T08:54:02.3066464Z * [new branch] gh/anijain2305/854/head -> origin/gh/anijain2305/854/head 2025-12-04T08:54:02.3066657Z * [new branch] gh/anijain2305/854/orig -> origin/gh/anijain2305/854/orig 2025-12-04T08:54:02.3066846Z * [new branch] gh/anijain2305/864/base -> origin/gh/anijain2305/864/base 2025-12-04T08:54:02.3067050Z * [new branch] gh/anijain2305/864/head -> origin/gh/anijain2305/864/head 2025-12-04T08:54:02.3067243Z * [new branch] gh/anijain2305/864/orig -> origin/gh/anijain2305/864/orig 2025-12-04T08:54:02.3067432Z * [new branch] gh/anijain2305/870/base -> origin/gh/anijain2305/870/base 2025-12-04T08:54:02.3067626Z * [new branch] gh/anijain2305/870/head -> origin/gh/anijain2305/870/head 2025-12-04T08:54:02.3067840Z * [new branch] 
gh/anijain2305/870/orig -> origin/gh/anijain2305/870/orig 2025-12-04T08:54:02.3068030Z * [new branch] gh/anijain2305/873/base -> origin/gh/anijain2305/873/base 2025-12-04T08:54:02.3068223Z * [new branch] gh/anijain2305/873/head -> origin/gh/anijain2305/873/head 2025-12-04T08:54:02.3068412Z * [new branch] gh/anijain2305/873/orig -> origin/gh/anijain2305/873/orig 2025-12-04T08:54:02.3068600Z * [new branch] gh/anijain2305/894/base -> origin/gh/anijain2305/894/base 2025-12-04T08:54:02.3068792Z * [new branch] gh/anijain2305/894/head -> origin/gh/anijain2305/894/head 2025-12-04T08:54:02.3068978Z * [new branch] gh/anijain2305/894/orig -> origin/gh/anijain2305/894/orig 2025-12-04T08:54:02.3069167Z * [new branch] gh/anijain2305/895/base -> origin/gh/anijain2305/895/base 2025-12-04T08:54:02.3069358Z * [new branch] gh/anijain2305/895/head -> origin/gh/anijain2305/895/head 2025-12-04T08:54:02.3069545Z * [new branch] gh/anijain2305/895/orig -> origin/gh/anijain2305/895/orig 2025-12-04T08:54:02.3069735Z * [new branch] gh/anijain2305/910/base -> origin/gh/anijain2305/910/base 2025-12-04T08:54:02.3069926Z * [new branch] gh/anijain2305/910/head -> origin/gh/anijain2305/910/head 2025-12-04T08:54:02.3070171Z * [new branch] gh/anijain2305/910/orig -> origin/gh/anijain2305/910/orig 2025-12-04T08:54:02.3070366Z * [new branch] gh/anijain2305/919/base -> origin/gh/anijain2305/919/base 2025-12-04T08:54:02.3070558Z * [new branch] gh/anijain2305/919/head -> origin/gh/anijain2305/919/head 2025-12-04T08:54:02.3070745Z * [new branch] gh/anijain2305/919/orig -> origin/gh/anijain2305/919/orig 2025-12-04T08:54:02.3070936Z * [new branch] gh/anijain2305/922/base -> origin/gh/anijain2305/922/base 2025-12-04T08:54:02.3071124Z * [new branch] gh/anijain2305/922/head -> origin/gh/anijain2305/922/head 2025-12-04T08:54:02.3071311Z * [new branch] gh/anijain2305/922/orig -> origin/gh/anijain2305/922/orig 2025-12-04T08:54:02.3071501Z * [new branch] gh/anijain2305/932/base -> origin/gh/anijain2305/932/base 
2025-12-04T08:54:02.3071692Z * [new branch] gh/anijain2305/932/head -> origin/gh/anijain2305/932/head 2025-12-04T08:54:02.3071879Z * [new branch] gh/anijain2305/932/orig -> origin/gh/anijain2305/932/orig 2025-12-04T08:54:02.3072106Z * [new branch] gh/anijain2305/940/base -> origin/gh/anijain2305/940/base 2025-12-04T08:54:02.3072295Z * [new branch] gh/anijain2305/940/head -> origin/gh/anijain2305/940/head 2025-12-04T08:54:02.3072482Z * [new branch] gh/anijain2305/940/orig -> origin/gh/anijain2305/940/orig 2025-12-04T08:54:02.3072670Z * [new branch] gh/anijain2305/941/base -> origin/gh/anijain2305/941/base 2025-12-04T08:54:02.3072861Z * [new branch] gh/anijain2305/941/head -> origin/gh/anijain2305/941/head 2025-12-04T08:54:02.3073048Z * [new branch] gh/anijain2305/941/orig -> origin/gh/anijain2305/941/orig 2025-12-04T08:54:02.3073242Z * [new branch] gh/anijain2305/942/base -> origin/gh/anijain2305/942/base 2025-12-04T08:54:02.3073435Z * [new branch] gh/anijain2305/942/head -> origin/gh/anijain2305/942/head 2025-12-04T08:54:02.3073625Z * [new branch] gh/anijain2305/942/orig -> origin/gh/anijain2305/942/orig 2025-12-04T08:54:02.3073820Z * [new branch] gh/anijain2305/943/base -> origin/gh/anijain2305/943/base 2025-12-04T08:54:02.3074010Z * [new branch] gh/anijain2305/943/head -> origin/gh/anijain2305/943/head 2025-12-04T08:54:02.3074203Z * [new branch] gh/anijain2305/943/orig -> origin/gh/anijain2305/943/orig 2025-12-04T08:54:02.3074428Z * [new branch] gh/anijain2305/944/base -> origin/gh/anijain2305/944/base 2025-12-04T08:54:02.3074614Z * [new branch] gh/anijain2305/944/head -> origin/gh/anijain2305/944/head 2025-12-04T08:54:02.3074808Z * [new branch] gh/anijain2305/944/orig -> origin/gh/anijain2305/944/orig 2025-12-04T08:54:02.3074994Z * [new branch] gh/anijain2305/945/base -> origin/gh/anijain2305/945/base 2025-12-04T08:54:02.3075179Z * [new branch] gh/anijain2305/945/head -> origin/gh/anijain2305/945/head 2025-12-04T08:54:02.3075369Z * [new branch] 
gh/anijain2305/945/orig -> origin/gh/anijain2305/945/orig 2025-12-04T08:54:02.3075560Z * [new branch] gh/anijain2305/946/base -> origin/gh/anijain2305/946/base 2025-12-04T08:54:02.3075748Z * [new branch] gh/anijain2305/946/head -> origin/gh/anijain2305/946/head 2025-12-04T08:54:02.3075937Z * [new branch] gh/anijain2305/946/orig -> origin/gh/anijain2305/946/orig 2025-12-04T08:54:02.3076130Z * [new branch] gh/anijain2305/947/base -> origin/gh/anijain2305/947/base 2025-12-04T08:54:02.3076317Z * [new branch] gh/anijain2305/947/head -> origin/gh/anijain2305/947/head 2025-12-04T08:54:02.3076508Z * [new branch] gh/anijain2305/947/orig -> origin/gh/anijain2305/947/orig 2025-12-04T08:54:02.3076696Z * [new branch] gh/anijain2305/948/base -> origin/gh/anijain2305/948/base 2025-12-04T08:54:02.3076883Z * [new branch] gh/anijain2305/948/head -> origin/gh/anijain2305/948/head 2025-12-04T08:54:02.3077076Z * [new branch] gh/anijain2305/948/orig -> origin/gh/anijain2305/948/orig 2025-12-04T08:54:02.3077268Z * [new branch] gh/anijain2305/949/base -> origin/gh/anijain2305/949/base 2025-12-04T08:54:02.3077453Z * [new branch] gh/anijain2305/949/head -> origin/gh/anijain2305/949/head 2025-12-04T08:54:02.3077640Z * [new branch] gh/anijain2305/949/orig -> origin/gh/anijain2305/949/orig 2025-12-04T08:54:02.3077826Z * [new branch] gh/anijain2305/950/base -> origin/gh/anijain2305/950/base 2025-12-04T08:54:02.3078011Z * [new branch] gh/anijain2305/950/head -> origin/gh/anijain2305/950/head 2025-12-04T08:54:02.3078198Z * [new branch] gh/anijain2305/950/orig -> origin/gh/anijain2305/950/orig 2025-12-04T08:54:02.3078383Z * [new branch] gh/anijain2305/951/base -> origin/gh/anijain2305/951/base 2025-12-04T08:54:02.3078569Z * [new branch] gh/anijain2305/951/head -> origin/gh/anijain2305/951/head 2025-12-04T08:54:02.3078794Z * [new branch] gh/anijain2305/951/orig -> origin/gh/anijain2305/951/orig 2025-12-04T08:54:02.3078984Z * [new branch] gh/anijain2305/952/base -> origin/gh/anijain2305/952/base 
2025-12-04T08:54:02.3079174Z * [new branch] gh/anijain2305/952/head -> origin/gh/anijain2305/952/head 2025-12-04T08:54:02.3079368Z * [new branch] gh/anijain2305/952/orig -> origin/gh/anijain2305/952/orig 2025-12-04T08:54:02.3079557Z * [new branch] gh/anijain2305/953/base -> origin/gh/anijain2305/953/base 2025-12-04T08:54:02.3079745Z * [new branch] gh/anijain2305/953/head -> origin/gh/anijain2305/953/head 2025-12-04T08:54:02.3079937Z * [new branch] gh/anijain2305/953/orig -> origin/gh/anijain2305/953/orig 2025-12-04T08:54:02.3080219Z * [new branch] gh/anijain2305/954/base -> origin/gh/anijain2305/954/base 2025-12-04T08:54:02.3080417Z * [new branch] gh/anijain2305/954/head -> origin/gh/anijain2305/954/head 2025-12-04T08:54:02.3080608Z * [new branch] gh/anijain2305/954/orig -> origin/gh/anijain2305/954/orig 2025-12-04T08:54:02.3080795Z * [new branch] gh/anijain2305/955/base -> origin/gh/anijain2305/955/base 2025-12-04T08:54:02.3080986Z * [new branch] gh/anijain2305/955/head -> origin/gh/anijain2305/955/head 2025-12-04T08:54:02.3081205Z * [new branch] gh/anijain2305/955/orig -> origin/gh/anijain2305/955/orig 2025-12-04T08:54:02.3081392Z * [new branch] gh/anijain2305/956/base -> origin/gh/anijain2305/956/base 2025-12-04T08:54:02.3081581Z * [new branch] gh/anijain2305/956/head -> origin/gh/anijain2305/956/head 2025-12-04T08:54:02.3081768Z * [new branch] gh/anijain2305/956/orig -> origin/gh/anijain2305/956/orig 2025-12-04T08:54:02.3081954Z * [new branch] gh/anijain2305/957/base -> origin/gh/anijain2305/957/base 2025-12-04T08:54:02.3082143Z * [new branch] gh/anijain2305/957/head -> origin/gh/anijain2305/957/head 2025-12-04T08:54:02.3082335Z * [new branch] gh/anijain2305/957/orig -> origin/gh/anijain2305/957/orig 2025-12-04T08:54:02.3082521Z * [new branch] gh/anijain2305/958/base -> origin/gh/anijain2305/958/base 2025-12-04T08:54:02.3082709Z * [new branch] gh/anijain2305/958/head -> origin/gh/anijain2305/958/head 2025-12-04T08:54:02.3082899Z * [new branch] 
gh/anijain2305/958/orig -> origin/gh/anijain2305/958/orig 2025-12-04T08:54:02.3083086Z * [new branch] gh/anijain2305/959/base -> origin/gh/anijain2305/959/base 2025-12-04T08:54:02.3083275Z * [new branch] gh/anijain2305/959/head -> origin/gh/anijain2305/959/head 2025-12-04T08:54:02.3083459Z * [new branch] gh/anijain2305/959/orig -> origin/gh/anijain2305/959/orig 2025-12-04T08:54:02.3083647Z * [new branch] gh/anijain2305/960/base -> origin/gh/anijain2305/960/base 2025-12-04T08:54:02.3083839Z * [new branch] gh/anijain2305/960/head -> origin/gh/anijain2305/960/head 2025-12-04T08:54:02.3084027Z * [new branch] gh/anijain2305/960/orig -> origin/gh/anijain2305/960/orig 2025-12-04T08:54:02.3084217Z * [new branch] gh/anijain2305/961/base -> origin/gh/anijain2305/961/base 2025-12-04T08:54:02.3084408Z * [new branch] gh/anijain2305/961/head -> origin/gh/anijain2305/961/head 2025-12-04T08:54:02.3084596Z * [new branch] gh/anijain2305/961/orig -> origin/gh/anijain2305/961/orig 2025-12-04T08:54:02.3084788Z * [new branch] gh/anijain2305/962/base -> origin/gh/anijain2305/962/base 2025-12-04T08:54:02.3084979Z * [new branch] gh/anijain2305/962/head -> origin/gh/anijain2305/962/head 2025-12-04T08:54:02.3085167Z * [new branch] gh/anijain2305/962/orig -> origin/gh/anijain2305/962/orig 2025-12-04T08:54:02.3085361Z * [new branch] gh/anijain2305/963/base -> origin/gh/anijain2305/963/base 2025-12-04T08:54:02.3085585Z * [new branch] gh/anijain2305/963/head -> origin/gh/anijain2305/963/head 2025-12-04T08:54:02.3085773Z * [new branch] gh/anijain2305/963/orig -> origin/gh/anijain2305/963/orig 2025-12-04T08:54:02.3085964Z * [new branch] gh/anijain2305/964/base -> origin/gh/anijain2305/964/base 2025-12-04T08:54:02.3086154Z * [new branch] gh/anijain2305/964/head -> origin/gh/anijain2305/964/head 2025-12-04T08:54:02.3086340Z * [new branch] gh/anijain2305/964/orig -> origin/gh/anijain2305/964/orig 2025-12-04T08:54:02.3086534Z * [new branch] gh/anijain2305/965/base -> origin/gh/anijain2305/965/base 
2025-12-04T08:54:02.3086726Z * [new branch] gh/anijain2305/965/head -> origin/gh/anijain2305/965/head 2025-12-04T08:54:02.3086916Z * [new branch] gh/anijain2305/965/orig -> origin/gh/anijain2305/965/orig 2025-12-04T08:54:02.3087104Z * [new branch] gh/anijain2305/966/base -> origin/gh/anijain2305/966/base 2025-12-04T08:54:02.3087295Z * [new branch] gh/anijain2305/966/head -> origin/gh/anijain2305/966/head 2025-12-04T08:54:02.3087480Z * [new branch] gh/anijain2305/966/orig -> origin/gh/anijain2305/966/orig 2025-12-04T08:54:02.3087670Z * [new branch] gh/anijain2305/967/base -> origin/gh/anijain2305/967/base 2025-12-04T08:54:02.3087877Z * [new branch] gh/anijain2305/967/head -> origin/gh/anijain2305/967/head 2025-12-04T08:54:02.3088066Z * [new branch] gh/anijain2305/967/orig -> origin/gh/anijain2305/967/orig 2025-12-04T08:54:02.3088257Z * [new branch] gh/anijain2305/968/base -> origin/gh/anijain2305/968/base 2025-12-04T08:54:02.3088444Z * [new branch] gh/anijain2305/968/head -> origin/gh/anijain2305/968/head 2025-12-04T08:54:02.3088637Z * [new branch] gh/anijain2305/968/orig -> origin/gh/anijain2305/968/orig 2025-12-04T08:54:02.3088826Z * [new branch] gh/anijain2305/969/base -> origin/gh/anijain2305/969/base 2025-12-04T08:54:02.3089012Z * [new branch] gh/anijain2305/969/head -> origin/gh/anijain2305/969/head 2025-12-04T08:54:02.3089207Z * [new branch] gh/anijain2305/969/orig -> origin/gh/anijain2305/969/orig 2025-12-04T08:54:02.3089398Z * [new branch] gh/anijain2305/970/base -> origin/gh/anijain2305/970/base 2025-12-04T08:54:02.3089583Z * [new branch] gh/anijain2305/970/head -> origin/gh/anijain2305/970/head 2025-12-04T08:54:02.3089772Z * [new branch] gh/anijain2305/970/orig -> origin/gh/anijain2305/970/orig 2025-12-04T08:54:02.3089961Z * [new branch] gh/anjali411/216/base -> origin/gh/anjali411/216/base 2025-12-04T08:54:02.3090244Z * [new branch] gh/anjali411/216/head -> origin/gh/anjali411/216/head 2025-12-04T08:54:02.3090432Z * [new branch] gh/anjali411/216/orig -> 
origin/gh/anjali411/216/orig 2025-12-04T08:54:02.3090620Z * [new branch] gh/anshul-si/1/base -> origin/gh/anshul-si/1/base 2025-12-04T08:54:02.3090800Z * [new branch] gh/anshul-si/1/head -> origin/gh/anshul-si/1/head 2025-12-04T08:54:02.3090981Z * [new branch] gh/anshul-si/2/base -> origin/gh/anshul-si/2/base 2025-12-04T08:54:02.3091164Z * [new branch] gh/anshul-si/2/head -> origin/gh/anshul-si/2/head 2025-12-04T08:54:02.3091343Z * [new branch] gh/anshul-si/3/base -> origin/gh/anshul-si/3/base 2025-12-04T08:54:02.3091522Z * [new branch] gh/anshul-si/3/head -> origin/gh/anshul-si/3/head 2025-12-04T08:54:02.3091700Z * [new branch] gh/anshul-si/4/base -> origin/gh/anshul-si/4/base 2025-12-04T08:54:02.3091877Z * [new branch] gh/anshul-si/4/head -> origin/gh/anshul-si/4/head 2025-12-04T08:54:02.3092058Z * [new branch] gh/anshul-si/5/base -> origin/gh/anshul-si/5/base 2025-12-04T08:54:02.3092271Z * [new branch] gh/anshul-si/5/head -> origin/gh/anshul-si/5/head 2025-12-04T08:54:02.3092454Z * [new branch] gh/anshul-si/53/base -> origin/gh/anshul-si/53/base 2025-12-04T08:54:02.3092639Z * [new branch] gh/anshul-si/53/head -> origin/gh/anshul-si/53/head 2025-12-04T08:54:02.3092819Z * [new branch] gh/anshul-si/58/base -> origin/gh/anshul-si/58/base 2025-12-04T08:54:02.3093002Z * [new branch] gh/anshul-si/58/head -> origin/gh/anshul-si/58/head 2025-12-04T08:54:02.3093188Z * [new branch] gh/anshul-si/66/base -> origin/gh/anshul-si/66/base 2025-12-04T08:54:02.3093366Z * [new branch] gh/anshul-si/66/head -> origin/gh/anshul-si/66/head 2025-12-04T08:54:02.3093550Z * [new branch] gh/anshul-si/66/orig -> origin/gh/anshul-si/66/orig 2025-12-04T08:54:02.3093733Z * [new branch] gh/anshul-si/67/base -> origin/gh/anshul-si/67/base 2025-12-04T08:54:02.3093914Z * [new branch] gh/anshul-si/67/head -> origin/gh/anshul-si/67/head 2025-12-04T08:54:02.3094098Z * [new branch] gh/anshul-si/67/orig -> origin/gh/anshul-si/67/orig 2025-12-04T08:54:02.3094280Z * [new branch] gh/anshul-si/68/base -> 
origin/gh/anshul-si/68/base 2025-12-04T08:54:02.3094488Z * [new branch] gh/anshul-si/68/head -> origin/gh/anshul-si/68/head 2025-12-04T08:54:02.3094670Z * [new branch] gh/anshul-si/68/orig -> origin/gh/anshul-si/68/orig 2025-12-04T08:54:02.3094852Z * [new branch] gh/anshul-si/69/base -> origin/gh/anshul-si/69/base 2025-12-04T08:54:02.3095030Z * [new branch] gh/anshul-si/69/head -> origin/gh/anshul-si/69/head 2025-12-04T08:54:02.3095221Z * [new branch] gh/anshul-si/69/orig -> origin/gh/anshul-si/69/orig 2025-12-04T08:54:02.3095399Z * [new branch] gh/anshul-si/70/base -> origin/gh/anshul-si/70/base 2025-12-04T08:54:02.3095577Z * [new branch] gh/anshul-si/70/head -> origin/gh/anshul-si/70/head 2025-12-04T08:54:02.3095758Z * [new branch] gh/anshul-si/70/orig -> origin/gh/anshul-si/70/orig 2025-12-04T08:54:02.3095936Z * [new branch] gh/anshul-si/71/base -> origin/gh/anshul-si/71/base 2025-12-04T08:54:02.3096114Z * [new branch] gh/anshul-si/71/head -> origin/gh/anshul-si/71/head 2025-12-04T08:54:02.3096292Z * [new branch] gh/anshul-si/71/orig -> origin/gh/anshul-si/71/orig 2025-12-04T08:54:02.3096471Z * [new branch] gh/anshul-si/72/base -> origin/gh/anshul-si/72/base 2025-12-04T08:54:02.3096646Z * [new branch] gh/anshul-si/72/head -> origin/gh/anshul-si/72/head 2025-12-04T08:54:02.3096825Z * [new branch] gh/anshul-si/72/orig -> origin/gh/anshul-si/72/orig 2025-12-04T08:54:02.3097004Z * [new branch] gh/anshul-si/73/base -> origin/gh/anshul-si/73/base 2025-12-04T08:54:02.3097183Z * [new branch] gh/anshul-si/73/head -> origin/gh/anshul-si/73/head 2025-12-04T08:54:02.3097360Z * [new branch] gh/anshul-si/73/orig -> origin/gh/anshul-si/73/orig 2025-12-04T08:54:02.3097540Z * [new branch] gh/aorenste/132/base -> origin/gh/aorenste/132/base 2025-12-04T08:54:02.3097722Z * [new branch] gh/aorenste/132/head -> origin/gh/aorenste/132/head 2025-12-04T08:54:02.3097902Z * [new branch] gh/aorenste/134/base -> origin/gh/aorenste/134/base 2025-12-04T08:54:02.3098081Z * [new branch] 
gh/aorenste/134/head -> origin/gh/aorenste/134/head 2025-12-04T08:54:02.3098259Z * [new branch] gh/aorenste/134/orig -> origin/gh/aorenste/134/orig 2025-12-04T08:54:02.3098439Z * [new branch] gh/aorenste/139/base -> origin/gh/aorenste/139/base 2025-12-04T08:54:02.3098618Z * [new branch] gh/aorenste/139/head -> origin/gh/aorenste/139/head 2025-12-04T08:54:02.3098825Z * [new branch] gh/aorenste/139/orig -> origin/gh/aorenste/139/orig 2025-12-04T08:54:02.3099006Z * [new branch] gh/aorenste/141/base -> origin/gh/aorenste/141/base 2025-12-04T08:54:02.3099186Z * [new branch] gh/aorenste/141/head -> origin/gh/aorenste/141/head 2025-12-04T08:54:02.3099368Z * [new branch] gh/aorenste/145/base -> origin/gh/aorenste/145/base 2025-12-04T08:54:02.3099547Z * [new branch] gh/aorenste/145/head -> origin/gh/aorenste/145/head 2025-12-04T08:54:02.3099726Z * [new branch] gh/aorenste/145/orig -> origin/gh/aorenste/145/orig 2025-12-04T08:54:02.3099907Z * [new branch] gh/aorenste/146/base -> origin/gh/aorenste/146/base 2025-12-04T08:54:02.3100087Z * [new branch] gh/aorenste/146/head -> origin/gh/aorenste/146/head 2025-12-04T08:54:02.3100308Z * [new branch] gh/aorenste/146/orig -> origin/gh/aorenste/146/orig 2025-12-04T08:54:02.3100489Z * [new branch] gh/aorenste/147/base -> origin/gh/aorenste/147/base 2025-12-04T08:54:02.3100669Z * [new branch] gh/aorenste/147/head -> origin/gh/aorenste/147/head 2025-12-04T08:54:02.3100848Z * [new branch] gh/aorenste/147/orig -> origin/gh/aorenste/147/orig 2025-12-04T08:54:02.3101078Z * [new branch] gh/aorenste/148/base -> origin/gh/aorenste/148/base 2025-12-04T08:54:02.3101259Z * [new branch] gh/aorenste/148/head -> origin/gh/aorenste/148/head 2025-12-04T08:54:02.3101439Z * [new branch] gh/aorenste/148/orig -> origin/gh/aorenste/148/orig 2025-12-04T08:54:02.3101619Z * [new branch] gh/aorenste/149/base -> origin/gh/aorenste/149/base 2025-12-04T08:54:02.3101799Z * [new branch] gh/aorenste/149/head -> origin/gh/aorenste/149/head 
2025-12-04T08:54:02.3101980Z * [new branch] gh/aorenste/149/orig -> origin/gh/aorenste/149/orig 2025-12-04T08:54:02.3102164Z * [new branch] gh/aorenste/150/base -> origin/gh/aorenste/150/base 2025-12-04T08:54:02.3102345Z * [new branch] gh/aorenste/150/head -> origin/gh/aorenste/150/head 2025-12-04T08:54:02.3102533Z * [new branch] gh/aorenste/150/orig -> origin/gh/aorenste/150/orig 2025-12-04T08:54:02.3102726Z * [new branch] gh/aorenste/151/base -> origin/gh/aorenste/151/base 2025-12-04T08:54:02.3102911Z * [new branch] gh/aorenste/151/head -> origin/gh/aorenste/151/head 2025-12-04T08:54:02.3103089Z * [new branch] gh/aorenste/151/orig -> origin/gh/aorenste/151/orig 2025-12-04T08:54:02.3103268Z * [new branch] gh/aorenste/152/base -> origin/gh/aorenste/152/base 2025-12-04T08:54:02.3103446Z * [new branch] gh/aorenste/152/head -> origin/gh/aorenste/152/head 2025-12-04T08:54:02.3103630Z * [new branch] gh/aorenste/152/orig -> origin/gh/aorenste/152/orig 2025-12-04T08:54:02.3103812Z * [new branch] gh/aorenste/153/base -> origin/gh/aorenste/153/base 2025-12-04T08:54:02.3103991Z * [new branch] gh/aorenste/153/head -> origin/gh/aorenste/153/head 2025-12-04T08:54:02.3104174Z * [new branch] gh/aorenste/153/orig -> origin/gh/aorenste/153/orig 2025-12-04T08:54:02.3104357Z * [new branch] gh/aorenste/154/base -> origin/gh/aorenste/154/base 2025-12-04T08:54:02.3104538Z * [new branch] gh/aorenste/154/head -> origin/gh/aorenste/154/head 2025-12-04T08:54:02.3104720Z * [new branch] gh/aorenste/154/orig -> origin/gh/aorenste/154/orig 2025-12-04T08:54:02.3104901Z * [new branch] gh/aorenste/155/base -> origin/gh/aorenste/155/base 2025-12-04T08:54:02.3105080Z * [new branch] gh/aorenste/155/head -> origin/gh/aorenste/155/head 2025-12-04T08:54:02.3105288Z * [new branch] gh/aorenste/155/orig -> origin/gh/aorenste/155/orig 2025-12-04T08:54:02.3105471Z * [new branch] gh/aorenste/156/base -> origin/gh/aorenste/156/base 2025-12-04T08:54:02.3105652Z * [new branch] gh/aorenste/156/head -> 
origin/gh/aorenste/156/head 2025-12-04T08:54:02.3105842Z * [new branch] gh/aorenste/156/orig -> origin/gh/aorenste/156/orig 2025-12-04T08:54:02.3106024Z * [new branch] gh/aorenste/157/base -> origin/gh/aorenste/157/base 2025-12-04T08:54:02.3106202Z * [new branch] gh/aorenste/157/head -> origin/gh/aorenste/157/head 2025-12-04T08:54:02.3106382Z * [new branch] gh/aorenste/157/orig -> origin/gh/aorenste/157/orig 2025-12-04T08:54:02.3106562Z * [new branch] gh/aorenste/158/base -> origin/gh/aorenste/158/base 2025-12-04T08:54:02.3106742Z * [new branch] gh/aorenste/158/head -> origin/gh/aorenste/158/head 2025-12-04T08:54:02.3106926Z * [new branch] gh/aorenste/158/orig -> origin/gh/aorenste/158/orig 2025-12-04T08:54:02.3107103Z * [new branch] gh/aorenste/159/base -> origin/gh/aorenste/159/base 2025-12-04T08:54:02.3107286Z * [new branch] gh/aorenste/159/head -> origin/gh/aorenste/159/head 2025-12-04T08:54:02.3107487Z * [new branch] gh/aorenste/159/orig -> origin/gh/aorenste/159/orig 2025-12-04T08:54:02.3107678Z * [new branch] gh/avikchaudhuri/1/base -> origin/gh/avikchaudhuri/1/base 2025-12-04T08:54:02.3107876Z * [new branch] gh/avikchaudhuri/1/head -> origin/gh/avikchaudhuri/1/head 2025-12-04T08:54:02.3108069Z * [new branch] gh/avikchaudhuri/2/base -> origin/gh/avikchaudhuri/2/base 2025-12-04T08:54:02.3108263Z * [new branch] gh/avikchaudhuri/2/head -> origin/gh/avikchaudhuri/2/head 2025-12-04T08:54:02.3108454Z * [new branch] gh/avikchaudhuri/2/orig -> origin/gh/avikchaudhuri/2/orig 2025-12-04T08:54:02.3108641Z * [new branch] gh/bdhirsh/666/base -> origin/gh/bdhirsh/666/base 2025-12-04T08:54:02.3108820Z * [new branch] gh/bdhirsh/666/head -> origin/gh/bdhirsh/666/head 2025-12-04T08:54:02.3109004Z * [new branch] gh/bdhirsh/666/orig -> origin/gh/bdhirsh/666/orig 2025-12-04T08:54:02.3109183Z * [new branch] gh/bdhirsh/668/base -> origin/gh/bdhirsh/668/base 2025-12-04T08:54:02.3109359Z * [new branch] gh/bdhirsh/668/head -> origin/gh/bdhirsh/668/head 2025-12-04T08:54:02.3109537Z * 
[new branch] gh/bdhirsh/668/orig -> origin/gh/bdhirsh/668/orig 2025-12-04T08:54:02.3109718Z * [new branch] gh/bdhirsh/669/base -> origin/gh/bdhirsh/669/base 2025-12-04T08:54:02.3109896Z * [new branch] gh/bdhirsh/669/head -> origin/gh/bdhirsh/669/head 2025-12-04T08:54:02.3110073Z * [new branch] gh/bdhirsh/669/orig -> origin/gh/bdhirsh/669/orig 2025-12-04T08:54:02.3110302Z * [new branch] gh/bdhirsh/670/base -> origin/gh/bdhirsh/670/base 2025-12-04T08:54:02.3110482Z * [new branch] gh/bdhirsh/670/head -> origin/gh/bdhirsh/670/head 2025-12-04T08:54:02.3110657Z * [new branch] gh/bdhirsh/670/orig -> origin/gh/bdhirsh/670/orig 2025-12-04T08:54:02.3110836Z * [new branch] gh/bdhirsh/672/base -> origin/gh/bdhirsh/672/base 2025-12-04T08:54:02.3111015Z * [new branch] gh/bdhirsh/672/head -> origin/gh/bdhirsh/672/head 2025-12-04T08:54:02.3111191Z * [new branch] gh/bdhirsh/672/orig -> origin/gh/bdhirsh/672/orig 2025-12-04T08:54:02.3111366Z * [new branch] gh/bdhirsh/675/base -> origin/gh/bdhirsh/675/base 2025-12-04T08:54:02.3111542Z * [new branch] gh/bdhirsh/675/head -> origin/gh/bdhirsh/675/head 2025-12-04T08:54:02.3111720Z * [new branch] gh/bdhirsh/675/orig -> origin/gh/bdhirsh/675/orig 2025-12-04T08:54:02.3111930Z * [new branch] gh/bdhirsh/676/base -> origin/gh/bdhirsh/676/base 2025-12-04T08:54:02.3112107Z * [new branch] gh/bdhirsh/676/head -> origin/gh/bdhirsh/676/head 2025-12-04T08:54:02.3112284Z * [new branch] gh/bdhirsh/676/orig -> origin/gh/bdhirsh/676/orig 2025-12-04T08:54:02.3112460Z * [new branch] gh/bdhirsh/677/base -> origin/gh/bdhirsh/677/base 2025-12-04T08:54:02.3112641Z * [new branch] gh/bdhirsh/677/head -> origin/gh/bdhirsh/677/head 2025-12-04T08:54:02.3112710Z * [new branch] gh/bdhirsh/677/orig -> origin/gh/bdhirsh/677/orig 2025-12-04T08:54:02.3112780Z * [new branch] gh/bdhirsh/678/base -> origin/gh/bdhirsh/678/base 2025-12-04T08:54:02.3112848Z * [new branch] gh/bdhirsh/678/head -> origin/gh/bdhirsh/678/head 2025-12-04T08:54:02.3112916Z * [new branch] 
gh/bdhirsh/678/orig -> origin/gh/bdhirsh/678/orig 2025-12-04T08:54:02.3112988Z * [new branch] gh/bdhirsh/679/base -> origin/gh/bdhirsh/679/base 2025-12-04T08:54:02.3113057Z * [new branch] gh/bdhirsh/679/head -> origin/gh/bdhirsh/679/head 2025-12-04T08:54:02.3113125Z * [new branch] gh/bdhirsh/679/orig -> origin/gh/bdhirsh/679/orig 2025-12-04T08:54:02.3113220Z * [new branch] gh/bdhirsh/680/base -> origin/gh/bdhirsh/680/base 2025-12-04T08:54:02.3113289Z * [new branch] gh/bdhirsh/680/head -> origin/gh/bdhirsh/680/head 2025-12-04T08:54:02.3113357Z * [new branch] gh/bdhirsh/680/orig -> origin/gh/bdhirsh/680/orig 2025-12-04T08:54:02.3113427Z * [new branch] gh/bdhirsh/681/base -> origin/gh/bdhirsh/681/base 2025-12-04T08:54:02.3113495Z * [new branch] gh/bdhirsh/681/head -> origin/gh/bdhirsh/681/head 2025-12-04T08:54:02.3113564Z * [new branch] gh/bdhirsh/681/orig -> origin/gh/bdhirsh/681/orig 2025-12-04T08:54:02.3113660Z * [new branch] gh/benjaminglass1/101/base -> origin/gh/benjaminglass1/101/base 2025-12-04T08:54:02.3113751Z * [new branch] gh/benjaminglass1/101/head -> origin/gh/benjaminglass1/101/head 2025-12-04T08:54:02.3113840Z * [new branch] gh/benjaminglass1/101/orig -> origin/gh/benjaminglass1/101/orig 2025-12-04T08:54:02.3113928Z * [new branch] gh/benjaminglass1/102/base -> origin/gh/benjaminglass1/102/base 2025-12-04T08:54:02.3114013Z * [new branch] gh/benjaminglass1/102/head -> origin/gh/benjaminglass1/102/head 2025-12-04T08:54:02.3114099Z * [new branch] gh/benjaminglass1/102/orig -> origin/gh/benjaminglass1/102/orig 2025-12-04T08:54:02.3114183Z * [new branch] gh/benjaminglass1/106/base -> origin/gh/benjaminglass1/106/base 2025-12-04T08:54:02.3114268Z * [new branch] gh/benjaminglass1/106/head -> origin/gh/benjaminglass1/106/head 2025-12-04T08:54:02.3114356Z * [new branch] gh/benjaminglass1/106/orig -> origin/gh/benjaminglass1/106/orig 2025-12-04T08:54:02.3114440Z * [new branch] gh/benjaminglass1/107/base -> origin/gh/benjaminglass1/107/base 
2025-12-04T08:54:02.3114525Z * [new branch] gh/benjaminglass1/107/head -> origin/gh/benjaminglass1/107/head 2025-12-04T08:54:02.3114613Z * [new branch] gh/benjaminglass1/107/orig -> origin/gh/benjaminglass1/107/orig 2025-12-04T08:54:02.3114697Z * [new branch] gh/benjaminglass1/108/base -> origin/gh/benjaminglass1/108/base 2025-12-04T08:54:02.3114781Z * [new branch] gh/benjaminglass1/108/head -> origin/gh/benjaminglass1/108/head 2025-12-04T08:54:02.3114867Z * [new branch] gh/benjaminglass1/108/orig -> origin/gh/benjaminglass1/108/orig 2025-12-04T08:54:02.3114952Z * [new branch] gh/benjaminglass1/109/base -> origin/gh/benjaminglass1/109/base 2025-12-04T08:54:02.3115062Z * [new branch] gh/benjaminglass1/109/head -> origin/gh/benjaminglass1/109/head 2025-12-04T08:54:02.3115149Z * [new branch] gh/benjaminglass1/109/orig -> origin/gh/benjaminglass1/109/orig 2025-12-04T08:54:02.3115235Z * [new branch] gh/benjaminglass1/97/base -> origin/gh/benjaminglass1/97/base 2025-12-04T08:54:02.3115320Z * [new branch] gh/benjaminglass1/97/head -> origin/gh/benjaminglass1/97/head 2025-12-04T08:54:02.3115403Z * [new branch] gh/benjaminglass1/97/orig -> origin/gh/benjaminglass1/97/orig 2025-12-04T08:54:02.3115482Z * [new branch] gh/bobrenjc93/570/base -> origin/gh/bobrenjc93/570/base 2025-12-04T08:54:02.3115559Z * [new branch] gh/bobrenjc93/570/head -> origin/gh/bobrenjc93/570/head 2025-12-04T08:54:02.3115633Z * [new branch] gh/bobrenjc93/570/orig -> origin/gh/bobrenjc93/570/orig 2025-12-04T08:54:02.3115707Z * [new branch] gh/bobrenjc93/604/base -> origin/gh/bobrenjc93/604/base 2025-12-04T08:54:02.3115781Z * [new branch] gh/bobrenjc93/604/head -> origin/gh/bobrenjc93/604/head 2025-12-04T08:54:02.3115853Z * [new branch] gh/bobrenjc93/604/orig -> origin/gh/bobrenjc93/604/orig 2025-12-04T08:54:02.3115926Z * [new branch] gh/bobrenjc93/638/base -> origin/gh/bobrenjc93/638/base 2025-12-04T08:54:02.3116020Z * [new branch] gh/bobrenjc93/638/head -> origin/gh/bobrenjc93/638/head 
2025-12-04T08:54:02.3116093Z * [new branch] gh/bobrenjc93/638/orig -> origin/gh/bobrenjc93/638/orig 2025-12-04T08:54:02.3116166Z * [new branch] gh/bobrenjc93/653/base -> origin/gh/bobrenjc93/653/base 2025-12-04T08:54:02.3116242Z * [new branch] gh/bobrenjc93/653/head -> origin/gh/bobrenjc93/653/head 2025-12-04T08:54:02.3116316Z * [new branch] gh/bobrenjc93/653/orig -> origin/gh/bobrenjc93/653/orig 2025-12-04T08:54:02.3116390Z * [new branch] gh/bobrenjc93/654/base -> origin/gh/bobrenjc93/654/base 2025-12-04T08:54:02.3116467Z * [new branch] gh/bobrenjc93/654/head -> origin/gh/bobrenjc93/654/head 2025-12-04T08:54:02.3116541Z * [new branch] gh/bobrenjc93/654/orig -> origin/gh/bobrenjc93/654/orig 2025-12-04T08:54:02.3116615Z * [new branch] gh/bobrenjc93/657/base -> origin/gh/bobrenjc93/657/base 2025-12-04T08:54:02.3116688Z * [new branch] gh/bobrenjc93/657/head -> origin/gh/bobrenjc93/657/head 2025-12-04T08:54:02.3116761Z * [new branch] gh/bobrenjc93/657/orig -> origin/gh/bobrenjc93/657/orig 2025-12-04T08:54:02.3116835Z * [new branch] gh/bobrenjc93/672/base -> origin/gh/bobrenjc93/672/base 2025-12-04T08:54:02.3116911Z * [new branch] gh/bobrenjc93/672/head -> origin/gh/bobrenjc93/672/head 2025-12-04T08:54:02.3116983Z * [new branch] gh/bobrenjc93/672/orig -> origin/gh/bobrenjc93/672/orig 2025-12-04T08:54:02.3117059Z * [new branch] gh/bobrenjc93/679/base -> origin/gh/bobrenjc93/679/base 2025-12-04T08:54:02.3117131Z * [new branch] gh/bobrenjc93/679/head -> origin/gh/bobrenjc93/679/head 2025-12-04T08:54:02.3117205Z * [new branch] gh/bobrenjc93/679/orig -> origin/gh/bobrenjc93/679/orig 2025-12-04T08:54:02.3117277Z * [new branch] gh/bobrenjc93/680/base -> origin/gh/bobrenjc93/680/base 2025-12-04T08:54:02.3117350Z * [new branch] gh/bobrenjc93/680/head -> origin/gh/bobrenjc93/680/head 2025-12-04T08:54:02.3117422Z * [new branch] gh/bobrenjc93/680/orig -> origin/gh/bobrenjc93/680/orig 2025-12-04T08:54:02.3117495Z * [new branch] gh/bobrenjc93/681/base -> origin/gh/bobrenjc93/681/base 
2025-12-04T08:54:02.3117567Z * [new branch] gh/bobrenjc93/681/head -> origin/gh/bobrenjc93/681/head 2025-12-04T08:54:02.3117639Z * [new branch] gh/bobrenjc93/681/orig -> origin/gh/bobrenjc93/681/orig 2025-12-04T08:54:02.3117744Z * [new branch] gh/bobrenjc93/682/base -> origin/gh/bobrenjc93/682/base 2025-12-04T08:54:02.3117817Z * [new branch] gh/bobrenjc93/682/head -> origin/gh/bobrenjc93/682/head 2025-12-04T08:54:02.3117889Z * [new branch] gh/bobrenjc93/682/orig -> origin/gh/bobrenjc93/682/orig 2025-12-04T08:54:02.3117967Z * [new branch] gh/bobrenjc93/683/base -> origin/gh/bobrenjc93/683/base 2025-12-04T08:54:02.3118038Z * [new branch] gh/bobrenjc93/683/head -> origin/gh/bobrenjc93/683/head 2025-12-04T08:54:02.3118112Z * [new branch] gh/bobrenjc93/683/orig -> origin/gh/bobrenjc93/683/orig 2025-12-04T08:54:02.3118185Z * [new branch] gh/bobrenjc93/684/base -> origin/gh/bobrenjc93/684/base 2025-12-04T08:54:02.3118257Z * [new branch] gh/bobrenjc93/684/head -> origin/gh/bobrenjc93/684/head 2025-12-04T08:54:02.3118330Z * [new branch] gh/bobrenjc93/684/orig -> origin/gh/bobrenjc93/684/orig 2025-12-04T08:54:02.3118404Z * [new branch] gh/bobrenjc93/685/base -> origin/gh/bobrenjc93/685/base 2025-12-04T08:54:02.3118477Z * [new branch] gh/bobrenjc93/685/head -> origin/gh/bobrenjc93/685/head 2025-12-04T08:54:02.3118550Z * [new branch] gh/bobrenjc93/685/orig -> origin/gh/bobrenjc93/685/orig 2025-12-04T08:54:02.3118643Z * [new branch] gh/bobrenjc93/686/base -> origin/gh/bobrenjc93/686/base 2025-12-04T08:54:02.3118716Z * [new branch] gh/bobrenjc93/686/head -> origin/gh/bobrenjc93/686/head 2025-12-04T08:54:02.3118789Z * [new branch] gh/bobrenjc93/686/orig -> origin/gh/bobrenjc93/686/orig 2025-12-04T08:54:02.3118861Z * [new branch] gh/bobrenjc93/687/base -> origin/gh/bobrenjc93/687/base 2025-12-04T08:54:02.3118936Z * [new branch] gh/bobrenjc93/687/head -> origin/gh/bobrenjc93/687/head 2025-12-04T08:54:02.3119011Z * [new branch] gh/bobrenjc93/687/orig -> origin/gh/bobrenjc93/687/orig 
2025-12-04T08:54:02.3119084Z * [new branch] gh/bobrenjc93/688/base -> origin/gh/bobrenjc93/688/base 2025-12-04T08:54:02.3119156Z * [new branch] gh/bobrenjc93/688/head -> origin/gh/bobrenjc93/688/head 2025-12-04T08:54:02.3119231Z * [new branch] gh/bobrenjc93/688/orig -> origin/gh/bobrenjc93/688/orig 2025-12-04T08:54:02.3119305Z * [new branch] gh/bobrenjc93/689/base -> origin/gh/bobrenjc93/689/base 2025-12-04T08:54:02.3119377Z * [new branch] gh/bobrenjc93/689/head -> origin/gh/bobrenjc93/689/head 2025-12-04T08:54:02.3119451Z * [new branch] gh/bobrenjc93/689/orig -> origin/gh/bobrenjc93/689/orig 2025-12-04T08:54:02.3119523Z * [new branch] gh/bobrenjc93/690/base -> origin/gh/bobrenjc93/690/base 2025-12-04T08:54:02.3119596Z * [new branch] gh/bobrenjc93/690/head -> origin/gh/bobrenjc93/690/head 2025-12-04T08:54:02.3119670Z * [new branch] gh/bobrenjc93/690/orig -> origin/gh/bobrenjc93/690/orig 2025-12-04T08:54:02.3119742Z * [new branch] gh/bobrenjc93/691/base -> origin/gh/bobrenjc93/691/base 2025-12-04T08:54:02.3119815Z * [new branch] gh/bobrenjc93/691/head -> origin/gh/bobrenjc93/691/head 2025-12-04T08:54:02.3119888Z * [new branch] gh/bobrenjc93/691/orig -> origin/gh/bobrenjc93/691/orig 2025-12-04T08:54:02.3119960Z * [new branch] gh/bobrenjc93/692/base -> origin/gh/bobrenjc93/692/base 2025-12-04T08:54:02.3120034Z * [new branch] gh/bobrenjc93/692/head -> origin/gh/bobrenjc93/692/head 2025-12-04T08:54:02.3120171Z * [new branch] gh/bobrenjc93/692/orig -> origin/gh/bobrenjc93/692/orig 2025-12-04T08:54:02.3120245Z * [new branch] gh/bobrenjc93/693/base -> origin/gh/bobrenjc93/693/base 2025-12-04T08:54:02.3120319Z * [new branch] gh/bobrenjc93/693/head -> origin/gh/bobrenjc93/693/head 2025-12-04T08:54:02.3120424Z * [new branch] gh/bobrenjc93/693/orig -> origin/gh/bobrenjc93/693/orig 2025-12-04T08:54:02.3120495Z * [new branch] gh/bobrenjc93/694/base -> origin/gh/bobrenjc93/694/base 2025-12-04T08:54:02.3120569Z * [new branch] gh/bobrenjc93/694/head -> origin/gh/bobrenjc93/694/head 
2025-12-04T08:54:02.3120643Z * [new branch] gh/bobrenjc93/694/orig -> origin/gh/bobrenjc93/694/orig 2025-12-04T08:54:02.3120717Z * [new branch] gh/bobrenjc93/695/base -> origin/gh/bobrenjc93/695/base 2025-12-04T08:54:02.3120792Z * [new branch] gh/bobrenjc93/695/head -> origin/gh/bobrenjc93/695/head 2025-12-04T08:54:02.3120864Z * [new branch] gh/bobrenjc93/695/orig -> origin/gh/bobrenjc93/695/orig 2025-12-04T08:54:02.3120933Z * [new branch] gh/c00w/23/base -> origin/gh/c00w/23/base 2025-12-04T08:54:02.3121000Z * [new branch] gh/c00w/23/head -> origin/gh/c00w/23/head 2025-12-04T08:54:02.3121066Z * [new branch] gh/c00w/53/base -> origin/gh/c00w/53/base 2025-12-04T08:54:02.3121129Z * [new branch] gh/c00w/53/head -> origin/gh/c00w/53/head 2025-12-04T08:54:02.3121196Z * [new branch] gh/c00w/53/orig -> origin/gh/c00w/53/orig 2025-12-04T08:54:02.3121282Z * [new branch] gh/c00w/54/base -> origin/gh/c00w/54/base 2025-12-04T08:54:02.3121346Z * [new branch] gh/c00w/54/head -> origin/gh/c00w/54/head 2025-12-04T08:54:02.3121409Z * [new branch] gh/c00w/54/orig -> origin/gh/c00w/54/orig 2025-12-04T08:54:02.3121471Z * [new branch] gh/c00w/56/base -> origin/gh/c00w/56/base 2025-12-04T08:54:02.3121535Z * [new branch] gh/c00w/56/head -> origin/gh/c00w/56/head 2025-12-04T08:54:02.3121598Z * [new branch] gh/c00w/56/orig -> origin/gh/c00w/56/orig 2025-12-04T08:54:02.3121662Z * [new branch] gh/c00w/57/base -> origin/gh/c00w/57/base 2025-12-04T08:54:02.3121728Z * [new branch] gh/c00w/57/head -> origin/gh/c00w/57/head 2025-12-04T08:54:02.3121793Z * [new branch] gh/c00w/57/orig -> origin/gh/c00w/57/orig 2025-12-04T08:54:02.3121855Z * [new branch] gh/c00w/58/base -> origin/gh/c00w/58/base 2025-12-04T08:54:02.3121920Z * [new branch] gh/c00w/58/head -> origin/gh/c00w/58/head 2025-12-04T08:54:02.3121983Z * [new branch] gh/c00w/58/orig -> origin/gh/c00w/58/orig 2025-12-04T08:54:02.3122055Z * [new branch] gh/clee2000/1/base -> origin/gh/clee2000/1/base 2025-12-04T08:54:02.3122127Z * [new branch] 
gh/clee2000/1/head -> origin/gh/clee2000/1/head 2025-12-04T08:54:02.3122195Z * [new branch] gh/clee2000/1/orig -> origin/gh/clee2000/1/orig 2025-12-04T08:54:02.3122276Z * [new branch] gh/coconutruben/1/base -> origin/gh/coconutruben/1/base 2025-12-04T08:54:02.3122355Z * [new branch] gh/coconutruben/1/head -> origin/gh/coconutruben/1/head 2025-12-04T08:54:02.3122435Z * [new branch] gh/coconutruben/55/base -> origin/gh/coconutruben/55/base 2025-12-04T08:54:02.3122513Z * [new branch] gh/coconutruben/55/head -> origin/gh/coconutruben/55/head 2025-12-04T08:54:02.3122593Z * [new branch] gh/coconutruben/55/orig -> origin/gh/coconutruben/55/orig 2025-12-04T08:54:02.3122669Z * [new branch] gh/coconutruben/57/base -> origin/gh/coconutruben/57/base 2025-12-04T08:54:02.3122747Z * [new branch] gh/coconutruben/57/head -> origin/gh/coconutruben/57/head 2025-12-04T08:54:02.3122828Z * [new branch] gh/coconutruben/57/orig -> origin/gh/coconutruben/57/orig 2025-12-04T08:54:02.3122903Z * [new branch] gh/coconutruben/70/base -> origin/gh/coconutruben/70/base 2025-12-04T08:54:02.3123007Z * [new branch] gh/coconutruben/70/head -> origin/gh/coconutruben/70/head 2025-12-04T08:54:02.3123084Z * [new branch] gh/coconutruben/70/orig -> origin/gh/coconutruben/70/orig 2025-12-04T08:54:02.3123160Z * [new branch] gh/coconutruben/71/base -> origin/gh/coconutruben/71/base 2025-12-04T08:54:02.3123237Z * [new branch] gh/coconutruben/71/head -> origin/gh/coconutruben/71/head 2025-12-04T08:54:02.3123313Z * [new branch] gh/coconutruben/71/orig -> origin/gh/coconutruben/71/orig 2025-12-04T08:54:02.3123389Z * [new branch] gh/coconutruben/72/base -> origin/gh/coconutruben/72/base 2025-12-04T08:54:02.3123467Z * [new branch] gh/coconutruben/72/head -> origin/gh/coconutruben/72/head 2025-12-04T08:54:02.3123542Z * [new branch] gh/coconutruben/72/orig -> origin/gh/coconutruben/72/orig 2025-12-04T08:54:02.3123618Z * [new branch] gh/coconutruben/73/base -> origin/gh/coconutruben/73/base 
2025-12-04T08:54:02.3123696Z * [new branch] gh/coconutruben/73/head -> origin/gh/coconutruben/73/head 2025-12-04T08:54:02.3123771Z * [new branch] gh/coconutruben/73/orig -> origin/gh/coconutruben/73/orig 2025-12-04T08:54:02.3123846Z * [new branch] gh/coconutruben/74/base -> origin/gh/coconutruben/74/base 2025-12-04T08:54:02.3123943Z * [new branch] gh/coconutruben/74/head -> origin/gh/coconutruben/74/head 2025-12-04T08:54:02.3124020Z * [new branch] gh/coconutruben/74/orig -> origin/gh/coconutruben/74/orig 2025-12-04T08:54:02.3124096Z * [new branch] gh/coconutruben/79/base -> origin/gh/coconutruben/79/base 2025-12-04T08:54:02.3124172Z * [new branch] gh/coconutruben/79/head -> origin/gh/coconutruben/79/head 2025-12-04T08:54:02.3124248Z * [new branch] gh/coconutruben/79/orig -> origin/gh/coconutruben/79/orig 2025-12-04T08:54:02.3124324Z * [new branch] gh/coconutruben/80/base -> origin/gh/coconutruben/80/base 2025-12-04T08:54:02.3124402Z * [new branch] gh/coconutruben/80/head -> origin/gh/coconutruben/80/head 2025-12-04T08:54:02.3124478Z * [new branch] gh/coconutruben/80/orig -> origin/gh/coconutruben/80/orig 2025-12-04T08:54:02.3124553Z * [new branch] gh/coconutruben/82/base -> origin/gh/coconutruben/82/base 2025-12-04T08:54:02.3124631Z * [new branch] gh/coconutruben/82/head -> origin/gh/coconutruben/82/head 2025-12-04T08:54:02.3124707Z * [new branch] gh/coconutruben/82/orig -> origin/gh/coconutruben/82/orig 2025-12-04T08:54:02.3124784Z * [new branch] gh/coconutruben/83/base -> origin/gh/coconutruben/83/base 2025-12-04T08:54:02.3124860Z * [new branch] gh/coconutruben/83/head -> origin/gh/coconutruben/83/head 2025-12-04T08:54:02.3124935Z * [new branch] gh/coconutruben/83/orig -> origin/gh/coconutruben/83/orig 2025-12-04T08:54:02.3125013Z * [new branch] gh/coconutruben/84/base -> origin/gh/coconutruben/84/base 2025-12-04T08:54:02.3125088Z * [new branch] gh/coconutruben/84/head -> origin/gh/coconutruben/84/head 2025-12-04T08:54:02.3125163Z * [new branch] 
gh/coconutruben/84/orig -> origin/gh/coconutruben/84/orig 2025-12-04T08:54:02.3125243Z * [new branch] gh/coconutruben/85/base -> origin/gh/coconutruben/85/base 2025-12-04T08:54:02.3125320Z * [new branch] gh/coconutruben/85/head -> origin/gh/coconutruben/85/head 2025-12-04T08:54:02.3125398Z * [new branch] gh/coconutruben/85/orig -> origin/gh/coconutruben/85/orig 2025-12-04T08:54:02.3125479Z * [new branch] gh/coconutruben/86/base -> origin/gh/coconutruben/86/base 2025-12-04T08:54:02.3125556Z * [new branch] gh/coconutruben/86/head -> origin/gh/coconutruben/86/head 2025-12-04T08:54:02.3125631Z * [new branch] gh/coconutruben/86/orig -> origin/gh/coconutruben/86/orig 2025-12-04T08:54:02.3125728Z * [new branch] gh/colinchan15/1/base -> origin/gh/colinchan15/1/base 2025-12-04T08:54:02.3125802Z * [new branch] gh/colinchan15/1/head -> origin/gh/colinchan15/1/head 2025-12-04T08:54:02.3125876Z * [new branch] gh/colinchan15/2/base -> origin/gh/colinchan15/2/base 2025-12-04T08:54:02.3125951Z * [new branch] gh/colinchan15/2/head -> origin/gh/colinchan15/2/head 2025-12-04T08:54:02.3126025Z * [new branch] gh/colinchan15/3/base -> origin/gh/colinchan15/3/base 2025-12-04T08:54:02.3126097Z * [new branch] gh/colinchan15/3/head -> origin/gh/colinchan15/3/head 2025-12-04T08:54:02.3126170Z * [new branch] gh/colinchan15/6/base -> origin/gh/colinchan15/6/base 2025-12-04T08:54:02.3126243Z * [new branch] gh/colinchan15/6/head -> origin/gh/colinchan15/6/head 2025-12-04T08:54:02.3126310Z * [new branch] gh/d4l3k/1/base -> origin/gh/d4l3k/1/base 2025-12-04T08:54:02.3126376Z * [new branch] gh/d4l3k/1/head -> origin/gh/d4l3k/1/head 2025-12-04T08:54:02.3126440Z * [new branch] gh/d4l3k/2/base -> origin/gh/d4l3k/2/base 2025-12-04T08:54:02.3126510Z * [new branch] gh/d4l3k/2/head -> origin/gh/d4l3k/2/head 2025-12-04T08:54:02.3126598Z * [new branch] gh/d4l3k/2/orig -> origin/gh/d4l3k/2/orig 2025-12-04T08:54:02.3126663Z * [new branch] gh/d4l3k/3/base -> origin/gh/d4l3k/3/base 2025-12-04T08:54:02.3126728Z 
* [new branch] gh/d4l3k/3/head -> origin/gh/d4l3k/3/head 2025-12-04T08:54:02.3126791Z * [new branch] gh/d4l3k/3/orig -> origin/gh/d4l3k/3/orig 2025-12-04T08:54:02.3126854Z * [new branch] gh/d4l3k/4/base -> origin/gh/d4l3k/4/base 2025-12-04T08:54:02.3126920Z * [new branch] gh/d4l3k/4/head -> origin/gh/d4l3k/4/head 2025-12-04T08:54:02.3126985Z * [new branch] gh/d4l3k/4/orig -> origin/gh/d4l3k/4/orig 2025-12-04T08:54:02.3127048Z * [new branch] gh/d4l3k/5/base -> origin/gh/d4l3k/5/base 2025-12-04T08:54:02.3127112Z * [new branch] gh/d4l3k/5/orig -> origin/gh/d4l3k/5/orig 2025-12-04T08:54:02.3127202Z * [new branch] gh/davidberard98/392/base -> origin/gh/davidberard98/392/base 2025-12-04T08:54:02.3127288Z * [new branch] gh/davidberard98/392/head -> origin/gh/davidberard98/392/head 2025-12-04T08:54:02.3127372Z * [new branch] gh/davidberard98/392/orig -> origin/gh/davidberard98/392/orig 2025-12-04T08:54:02.3127455Z * [new branch] gh/davidberard98/399/base -> origin/gh/davidberard98/399/base 2025-12-04T08:54:02.3127537Z * [new branch] gh/davidberard98/399/head -> origin/gh/davidberard98/399/head 2025-12-04T08:54:02.3127621Z * [new branch] gh/davidberard98/399/orig -> origin/gh/davidberard98/399/orig 2025-12-04T08:54:02.3127698Z * [new branch] gh/desertfire/605/base -> origin/gh/desertfire/605/base 2025-12-04T08:54:02.3127773Z * [new branch] gh/desertfire/605/head -> origin/gh/desertfire/605/head 2025-12-04T08:54:02.3127849Z * [new branch] gh/desertfire/605/orig -> origin/gh/desertfire/605/orig 2025-12-04T08:54:02.3127924Z * [new branch] gh/desertfire/606/base -> origin/gh/desertfire/606/base 2025-12-04T08:54:02.3127999Z * [new branch] gh/desertfire/606/head -> origin/gh/desertfire/606/head 2025-12-04T08:54:02.3128072Z * [new branch] gh/desertfire/606/orig -> origin/gh/desertfire/606/orig 2025-12-04T08:54:02.3128145Z * [new branch] gh/desertfire/607/base -> origin/gh/desertfire/607/base 2025-12-04T08:54:02.3128221Z * [new branch] gh/desertfire/607/head -> 
origin/gh/desertfire/607/head 2025-12-04T08:54:02.3128323Z * [new branch] gh/desertfire/607/orig -> origin/gh/desertfire/607/orig 2025-12-04T08:54:02.3128397Z * [new branch] gh/desertfire/608/base -> origin/gh/desertfire/608/base 2025-12-04T08:54:02.3128472Z * [new branch] gh/desertfire/608/head -> origin/gh/desertfire/608/head 2025-12-04T08:54:02.3128545Z * [new branch] gh/desertfire/608/orig -> origin/gh/desertfire/608/orig 2025-12-04T08:54:02.3128619Z * [new branch] gh/desertfire/609/base -> origin/gh/desertfire/609/base 2025-12-04T08:54:02.3128693Z * [new branch] gh/desertfire/609/head -> origin/gh/desertfire/609/head 2025-12-04T08:54:02.3128766Z * [new branch] gh/desertfire/609/orig -> origin/gh/desertfire/609/orig 2025-12-04T08:54:02.3128839Z * [new branch] gh/desertfire/610/base -> origin/gh/desertfire/610/base 2025-12-04T08:54:02.3128914Z * [new branch] gh/desertfire/610/head -> origin/gh/desertfire/610/head 2025-12-04T08:54:02.3128987Z * [new branch] gh/desertfire/610/orig -> origin/gh/desertfire/610/orig 2025-12-04T08:54:02.3129061Z * [new branch] gh/desertfire/611/base -> origin/gh/desertfire/611/base 2025-12-04T08:54:02.3129135Z * [new branch] gh/desertfire/611/head -> origin/gh/desertfire/611/head 2025-12-04T08:54:02.3129209Z * [new branch] gh/desertfire/611/orig -> origin/gh/desertfire/611/orig 2025-12-04T08:54:02.3129301Z * [new branch] gh/desertfire/612/base -> origin/gh/desertfire/612/base 2025-12-04T08:54:02.3129378Z * [new branch] gh/desertfire/612/head -> origin/gh/desertfire/612/head 2025-12-04T08:54:02.3129451Z * [new branch] gh/desertfire/612/orig -> origin/gh/desertfire/612/orig 2025-12-04T08:54:02.3129524Z * [new branch] gh/desertfire/613/base -> origin/gh/desertfire/613/base 2025-12-04T08:54:02.3129598Z * [new branch] gh/desertfire/613/head -> origin/gh/desertfire/613/head 2025-12-04T08:54:02.3129673Z * [new branch] gh/desertfire/613/orig -> origin/gh/desertfire/613/orig 2025-12-04T08:54:02.3129749Z * [new branch] gh/desertfire/614/base -> 
origin/gh/desertfire/614/base 2025-12-04T08:54:02.3129824Z * [new branch] gh/desertfire/614/head -> origin/gh/desertfire/614/head 2025-12-04T08:54:02.3129898Z * [new branch] gh/desertfire/614/orig -> origin/gh/desertfire/614/orig 2025-12-04T08:54:02.3129973Z * [new branch] gh/desertfire/615/base -> origin/gh/desertfire/615/base 2025-12-04T08:54:02.3130046Z * [new branch] gh/desertfire/615/head -> origin/gh/desertfire/615/head 2025-12-04T08:54:02.3130381Z * [new branch] gh/desertfire/615/orig -> origin/gh/desertfire/615/orig 2025-12-04T08:54:02.3130458Z * [new branch] gh/desertfire/616/base -> origin/gh/desertfire/616/base 2025-12-04T08:54:02.3130531Z * [new branch] gh/desertfire/616/head -> origin/gh/desertfire/616/head 2025-12-04T08:54:02.3130606Z * [new branch] gh/desertfire/616/orig -> origin/gh/desertfire/616/orig 2025-12-04T08:54:02.3130681Z * [new branch] gh/desertfire/617/base -> origin/gh/desertfire/617/base 2025-12-04T08:54:02.3130755Z * [new branch] gh/desertfire/617/head -> origin/gh/desertfire/617/head 2025-12-04T08:54:02.3130830Z * [new branch] gh/desertfire/617/orig -> origin/gh/desertfire/617/orig 2025-12-04T08:54:02.3130904Z * [new branch] gh/dharakk/1/base -> origin/gh/dharakk/1/base 2025-12-04T08:54:02.3130975Z * [new branch] gh/dharakk/1/head -> origin/gh/dharakk/1/head 2025-12-04T08:54:02.3131046Z * [new branch] gh/drisspg/170/base -> origin/gh/drisspg/170/base 2025-12-04T08:54:02.3131120Z * [new branch] gh/drisspg/170/head -> origin/gh/drisspg/170/head 2025-12-04T08:54:02.3131190Z * [new branch] gh/drisspg/170/orig -> origin/gh/drisspg/170/orig 2025-12-04T08:54:02.3131295Z * [new branch] gh/drisspg/182/base -> origin/gh/drisspg/182/base 2025-12-04T08:54:02.3131368Z * [new branch] gh/drisspg/182/head -> origin/gh/drisspg/182/head 2025-12-04T08:54:02.3131440Z * [new branch] gh/drisspg/183/base -> origin/gh/drisspg/183/base 2025-12-04T08:54:02.3131514Z * [new branch] gh/drisspg/183/head -> origin/gh/drisspg/183/head 2025-12-04T08:54:02.3131585Z * 
[new branch] gh/drisspg/184/base -> origin/gh/drisspg/184/base 2025-12-04T08:54:02.3131656Z * [new branch] gh/drisspg/184/head -> origin/gh/drisspg/184/head 2025-12-04T08:54:02.3131730Z * [new branch] gh/drisspg/185/base -> origin/gh/drisspg/185/base 2025-12-04T08:54:02.3131801Z * [new branch] gh/drisspg/185/head -> origin/gh/drisspg/185/head 2025-12-04T08:54:02.3131871Z * [new branch] gh/drisspg/194/base -> origin/gh/drisspg/194/base 2025-12-04T08:54:02.3131947Z * [new branch] gh/drisspg/194/head -> origin/gh/drisspg/194/head 2025-12-04T08:54:02.3132016Z * [new branch] gh/drisspg/194/orig -> origin/gh/drisspg/194/orig 2025-12-04T08:54:02.3132086Z * [new branch] gh/drisspg/200/base -> origin/gh/drisspg/200/base 2025-12-04T08:54:02.3132184Z * [new branch] gh/drisspg/200/head -> origin/gh/drisspg/200/head 2025-12-04T08:54:02.3132255Z * [new branch] gh/drisspg/200/orig -> origin/gh/drisspg/200/orig 2025-12-04T08:54:02.3132326Z * [new branch] gh/drisspg/218/base -> origin/gh/drisspg/218/base 2025-12-04T08:54:02.3132400Z * [new branch] gh/drisspg/218/head -> origin/gh/drisspg/218/head 2025-12-04T08:54:02.3132469Z * [new branch] gh/drisspg/218/orig -> origin/gh/drisspg/218/orig 2025-12-04T08:54:02.3132537Z * [new branch] gh/drisspg/219/base -> origin/gh/drisspg/219/base 2025-12-04T08:54:02.3132609Z * [new branch] gh/drisspg/219/head -> origin/gh/drisspg/219/head 2025-12-04T08:54:02.3132678Z * [new branch] gh/drisspg/219/orig -> origin/gh/drisspg/219/orig 2025-12-04T08:54:02.3132746Z * [new branch] gh/drisspg/220/base -> origin/gh/drisspg/220/base 2025-12-04T08:54:02.3132817Z * [new branch] gh/drisspg/220/head -> origin/gh/drisspg/220/head 2025-12-04T08:54:02.3132887Z * [new branch] gh/drisspg/220/orig -> origin/gh/drisspg/220/orig 2025-12-04T08:54:02.3132955Z * [new branch] gh/drisspg/221/base -> origin/gh/drisspg/221/base 2025-12-04T08:54:02.3133026Z * [new branch] gh/drisspg/221/head -> origin/gh/drisspg/221/head 2025-12-04T08:54:02.3133096Z * [new branch] 
gh/drisspg/221/orig -> origin/gh/drisspg/221/orig 2025-12-04T08:54:02.3133165Z * [new branch] gh/drisspg/222/base -> origin/gh/drisspg/222/base 2025-12-04T08:54:02.3133236Z * [new branch] gh/drisspg/222/head -> origin/gh/drisspg/222/head 2025-12-04T08:54:02.3133306Z * [new branch] gh/drisspg/222/orig -> origin/gh/drisspg/222/orig 2025-12-04T08:54:02.3133377Z * [new branch] gh/drisspg/223/base -> origin/gh/drisspg/223/base 2025-12-04T08:54:02.3133447Z * [new branch] gh/drisspg/223/head -> origin/gh/drisspg/223/head 2025-12-04T08:54:02.3133516Z * [new branch] gh/drisspg/223/orig -> origin/gh/drisspg/223/orig 2025-12-04T08:54:02.3133586Z * [new branch] gh/drisspg/224/base -> origin/gh/drisspg/224/base 2025-12-04T08:54:02.3133655Z * [new branch] gh/drisspg/224/head -> origin/gh/drisspg/224/head 2025-12-04T08:54:02.3133724Z * [new branch] gh/drisspg/224/orig -> origin/gh/drisspg/224/orig 2025-12-04T08:54:02.3133794Z * [new branch] gh/drisspg/225/base -> origin/gh/drisspg/225/base 2025-12-04T08:54:02.3133889Z * [new branch] gh/drisspg/225/head -> origin/gh/drisspg/225/head 2025-12-04T08:54:02.3133958Z * [new branch] gh/drisspg/225/orig -> origin/gh/drisspg/225/orig 2025-12-04T08:54:02.3134029Z * [new branch] gh/drisspg/226/base -> origin/gh/drisspg/226/base 2025-12-04T08:54:02.3134100Z * [new branch] gh/drisspg/226/head -> origin/gh/drisspg/226/head 2025-12-04T08:54:02.3134169Z * [new branch] gh/drisspg/226/orig -> origin/gh/drisspg/226/orig 2025-12-04T08:54:02.3134238Z * [new branch] gh/drisspg/227/base -> origin/gh/drisspg/227/base 2025-12-04T08:54:02.3134307Z * [new branch] gh/drisspg/227/head -> origin/gh/drisspg/227/head 2025-12-04T08:54:02.3134374Z * [new branch] gh/drisspg/227/orig -> origin/gh/drisspg/227/orig 2025-12-04T08:54:02.3134444Z * [new branch] gh/drisspg/228/base -> origin/gh/drisspg/228/base 2025-12-04T08:54:02.3134514Z * [new branch] gh/drisspg/228/head -> origin/gh/drisspg/228/head 2025-12-04T08:54:02.3134584Z * [new branch] gh/drisspg/228/orig -> 
origin/gh/drisspg/228/orig 2025-12-04T08:54:02.3134654Z * [new branch] gh/drisspg/229/base -> origin/gh/drisspg/229/base 2025-12-04T08:54:02.3134750Z * [new branch] gh/drisspg/229/head -> origin/gh/drisspg/229/head 2025-12-04T08:54:02.3134820Z * [new branch] gh/drisspg/229/orig -> origin/gh/drisspg/229/orig 2025-12-04T08:54:02.3134891Z * [new branch] gh/drisspg/230/base -> origin/gh/drisspg/230/base 2025-12-04T08:54:02.3134960Z * [new branch] gh/drisspg/230/head -> origin/gh/drisspg/230/head 2025-12-04T08:54:02.3135030Z * [new branch] gh/drisspg/230/orig -> origin/gh/drisspg/230/orig 2025-12-04T08:54:02.3135102Z * [new branch] gh/dsjohns2/1/base -> origin/gh/dsjohns2/1/base 2025-12-04T08:54:02.3135175Z * [new branch] gh/dsjohns2/1/head -> origin/gh/dsjohns2/1/head 2025-12-04T08:54:02.3135258Z * [new branch] gh/dzmitry-huba/1/base -> origin/gh/dzmitry-huba/1/base 2025-12-04T08:54:02.3135335Z * [new branch] gh/dzmitry-huba/1/head -> origin/gh/dzmitry-huba/1/head 2025-12-04T08:54:02.3135414Z * [new branch] gh/dzmitry-huba/12/base -> origin/gh/dzmitry-huba/12/base 2025-12-04T08:54:02.3135493Z * [new branch] gh/dzmitry-huba/12/head -> origin/gh/dzmitry-huba/12/head 2025-12-04T08:54:02.3135570Z * [new branch] gh/dzmitry-huba/12/orig -> origin/gh/dzmitry-huba/12/orig 2025-12-04T08:54:02.3135646Z * [new branch] gh/dzmitry-huba/13/base -> origin/gh/dzmitry-huba/13/base 2025-12-04T08:54:02.3135722Z * [new branch] gh/dzmitry-huba/13/head -> origin/gh/dzmitry-huba/13/head 2025-12-04T08:54:02.3135799Z * [new branch] gh/dzmitry-huba/13/orig -> origin/gh/dzmitry-huba/13/orig 2025-12-04T08:54:02.3135875Z * [new branch] gh/dzmitry-huba/14/base -> origin/gh/dzmitry-huba/14/base 2025-12-04T08:54:02.3135951Z * [new branch] gh/dzmitry-huba/14/head -> origin/gh/dzmitry-huba/14/head 2025-12-04T08:54:02.3136026Z * [new branch] gh/dzmitry-huba/14/orig -> origin/gh/dzmitry-huba/14/orig 2025-12-04T08:54:02.3136102Z * [new branch] gh/dzmitry-huba/15/base -> origin/gh/dzmitry-huba/15/base 
2025-12-04T08:54:02.3136178Z * [new branch] gh/dzmitry-huba/15/head -> origin/gh/dzmitry-huba/15/head 2025-12-04T08:54:02.3136253Z * [new branch] gh/dzmitry-huba/15/orig -> origin/gh/dzmitry-huba/15/orig 2025-12-04T08:54:02.3136329Z * [new branch] gh/dzmitry-huba/16/base -> origin/gh/dzmitry-huba/16/base 2025-12-04T08:54:02.3136407Z * [new branch] gh/dzmitry-huba/16/head -> origin/gh/dzmitry-huba/16/head 2025-12-04T08:54:02.3136507Z * [new branch] gh/dzmitry-huba/16/orig -> origin/gh/dzmitry-huba/16/orig 2025-12-04T08:54:02.3136584Z * [new branch] gh/dzmitry-huba/17/base -> origin/gh/dzmitry-huba/17/base 2025-12-04T08:54:02.3136659Z * [new branch] gh/dzmitry-huba/17/head -> origin/gh/dzmitry-huba/17/head 2025-12-04T08:54:02.3136736Z * [new branch] gh/dzmitry-huba/17/orig -> origin/gh/dzmitry-huba/17/orig 2025-12-04T08:54:02.3136813Z * [new branch] gh/dzmitry-huba/2/base -> origin/gh/dzmitry-huba/2/base 2025-12-04T08:54:02.3136889Z * [new branch] gh/dzmitry-huba/2/head -> origin/gh/dzmitry-huba/2/head 2025-12-04T08:54:02.3136962Z * [new branch] gh/dzmitry-huba/3/base -> origin/gh/dzmitry-huba/3/base 2025-12-04T08:54:02.3137038Z * [new branch] gh/dzmitry-huba/3/head -> origin/gh/dzmitry-huba/3/head 2025-12-04T08:54:02.3137112Z * [new branch] gh/eellison/808/base -> origin/gh/eellison/808/base 2025-12-04T08:54:02.3137187Z * [new branch] gh/eellison/808/head -> origin/gh/eellison/808/head 2025-12-04T08:54:02.3137261Z * [new branch] gh/eellison/808/orig -> origin/gh/eellison/808/orig 2025-12-04T08:54:02.3137332Z * [new branch] gh/eellison/822/base -> origin/gh/eellison/822/base 2025-12-04T08:54:02.3137427Z * [new branch] gh/eellison/822/head -> origin/gh/eellison/822/head 2025-12-04T08:54:02.3137500Z * [new branch] gh/eellison/822/orig -> origin/gh/eellison/822/orig 2025-12-04T08:54:02.3137571Z * [new branch] gh/eellison/823/base -> origin/gh/eellison/823/base 2025-12-04T08:54:02.3137641Z * [new branch] gh/eellison/823/head -> origin/gh/eellison/823/head 
2025-12-04T08:54:02.3137716Z * [new branch] gh/eellison/823/orig -> origin/gh/eellison/823/orig 2025-12-04T08:54:02.3137788Z * [new branch] gh/eellison/862/base -> origin/gh/eellison/862/base 2025-12-04T08:54:02.3137859Z * [new branch] gh/eellison/862/head -> origin/gh/eellison/862/head 2025-12-04T08:54:02.3137932Z * [new branch] gh/eellison/862/orig -> origin/gh/eellison/862/orig 2025-12-04T08:54:02.3138002Z * [new branch] gh/eellison/863/base -> origin/gh/eellison/863/base 2025-12-04T08:54:02.3138073Z * [new branch] gh/eellison/863/head -> origin/gh/eellison/863/head 2025-12-04T08:54:02.3138148Z * [new branch] gh/eellison/863/orig -> origin/gh/eellison/863/orig 2025-12-04T08:54:02.3138220Z * [new branch] gh/eellison/864/base -> origin/gh/eellison/864/base 2025-12-04T08:54:02.3138290Z * [new branch] gh/eellison/864/head -> origin/gh/eellison/864/head 2025-12-04T08:54:02.3138361Z * [new branch] gh/eellison/864/orig -> origin/gh/eellison/864/orig 2025-12-04T08:54:02.3138432Z * [new branch] gh/eellison/865/base -> origin/gh/eellison/865/base 2025-12-04T08:54:02.3138507Z * [new branch] gh/eellison/865/head -> origin/gh/eellison/865/head 2025-12-04T08:54:02.3138578Z * [new branch] gh/eellison/865/orig -> origin/gh/eellison/865/orig 2025-12-04T08:54:02.3138650Z * [new branch] gh/eellison/866/base -> origin/gh/eellison/866/base 2025-12-04T08:54:02.3138721Z * [new branch] gh/eellison/866/head -> origin/gh/eellison/866/head 2025-12-04T08:54:02.3138791Z * [new branch] gh/eellison/866/orig -> origin/gh/eellison/866/orig 2025-12-04T08:54:02.3138862Z * [new branch] gh/eellison/867/base -> origin/gh/eellison/867/base 2025-12-04T08:54:02.3138934Z * [new branch] gh/eellison/867/head -> origin/gh/eellison/867/head 2025-12-04T08:54:02.3139004Z * [new branch] gh/eellison/867/orig -> origin/gh/eellison/867/orig 2025-12-04T08:54:02.3139093Z * [new branch] gh/eellison/868/base -> origin/gh/eellison/868/base 2025-12-04T08:54:02.3139165Z * [new branch] gh/eellison/868/head -> 
origin/gh/eellison/868/head 2025-12-04T08:54:02.3139235Z * [new branch] gh/eellison/868/orig -> origin/gh/eellison/868/orig 2025-12-04T08:54:02.3139307Z * [new branch] gh/eellison/869/base -> origin/gh/eellison/869/base 2025-12-04T08:54:02.3139381Z * [new branch] gh/eellison/869/head -> origin/gh/eellison/869/head 2025-12-04T08:54:02.3139450Z * [new branch] gh/eellison/869/orig -> origin/gh/eellison/869/orig 2025-12-04T08:54:02.3139521Z * [new branch] gh/eellison/870/base -> origin/gh/eellison/870/base 2025-12-04T08:54:02.3139593Z * [new branch] gh/eellison/870/head -> origin/gh/eellison/870/head 2025-12-04T08:54:02.3139663Z * [new branch] gh/eellison/870/orig -> origin/gh/eellison/870/orig 2025-12-04T08:54:02.3139736Z * [new branch] gh/eellison/871/base -> origin/gh/eellison/871/base 2025-12-04T08:54:02.3139805Z * [new branch] gh/eellison/871/head -> origin/gh/eellison/871/head 2025-12-04T08:54:02.3139874Z * [new branch] gh/eellison/871/orig -> origin/gh/eellison/871/orig 2025-12-04T08:54:02.3139946Z * [new branch] gh/eellison/872/base -> origin/gh/eellison/872/base 2025-12-04T08:54:02.3140036Z * [new branch] gh/eellison/872/head -> origin/gh/eellison/872/head 2025-12-04T08:54:02.3140155Z * [new branch] gh/eellison/872/orig -> origin/gh/eellison/872/orig 2025-12-04T08:54:02.3140230Z * [new branch] gh/eellison/873/base -> origin/gh/eellison/873/base 2025-12-04T08:54:02.3140300Z * [new branch] gh/eellison/873/head -> origin/gh/eellison/873/head 2025-12-04T08:54:02.3140369Z * [new branch] gh/eellison/873/orig -> origin/gh/eellison/873/orig 2025-12-04T08:54:02.3140443Z * [new branch] gh/eellison/874/base -> origin/gh/eellison/874/base 2025-12-04T08:54:02.3140513Z * [new branch] gh/eellison/874/head -> origin/gh/eellison/874/head 2025-12-04T08:54:02.3140583Z * [new branch] gh/eellison/874/orig -> origin/gh/eellison/874/orig 2025-12-04T08:54:02.3140654Z * [new branch] gh/eellison/875/base -> origin/gh/eellison/875/base 2025-12-04T08:54:02.3140726Z * [new branch] 
gh/eellison/875/head -> origin/gh/eellison/875/head 2025-12-04T08:54:02.3140798Z * [new branch] gh/eellison/875/orig -> origin/gh/eellison/875/orig 2025-12-04T08:54:02.3140871Z * [new branch] gh/eellison/876/base -> origin/gh/eellison/876/base 2025-12-04T08:54:02.3140941Z * [new branch] gh/eellison/876/head -> origin/gh/eellison/876/head 2025-12-04T08:54:02.3141012Z * [new branch] gh/eellison/876/orig -> origin/gh/eellison/876/orig 2025-12-04T08:54:02.3141085Z * [new branch] gh/eellison/877/base -> origin/gh/eellison/877/base 2025-12-04T08:54:02.3141154Z * [new branch] gh/eellison/877/head -> origin/gh/eellison/877/head 2025-12-04T08:54:02.3141225Z * [new branch] gh/eellison/877/orig -> origin/gh/eellison/877/orig 2025-12-04T08:54:02.3141296Z * [new branch] gh/eellison/878/base -> origin/gh/eellison/878/base 2025-12-04T08:54:02.3141366Z * [new branch] gh/eellison/878/head -> origin/gh/eellison/878/head 2025-12-04T08:54:02.3141438Z * [new branch] gh/eellison/878/orig -> origin/gh/eellison/878/orig 2025-12-04T08:54:02.3141508Z * [new branch] gh/eellison/879/base -> origin/gh/eellison/879/base 2025-12-04T08:54:02.3141577Z * [new branch] gh/eellison/879/head -> origin/gh/eellison/879/head 2025-12-04T08:54:02.3141648Z * [new branch] gh/eellison/879/orig -> origin/gh/eellison/879/orig 2025-12-04T08:54:02.3142024Z * [new branch] gh/eellison/880/base -> origin/gh/eellison/880/base 2025-12-04T08:54:02.3142094Z * [new branch] gh/eellison/880/head -> origin/gh/eellison/880/head 2025-12-04T08:54:02.3142166Z * [new branch] gh/eellison/880/orig -> origin/gh/eellison/880/orig 2025-12-04T08:54:02.3142238Z * [new branch] gh/eellison/881/base -> origin/gh/eellison/881/base 2025-12-04T08:54:02.3142309Z * [new branch] gh/eellison/881/head -> origin/gh/eellison/881/head 2025-12-04T08:54:02.3142381Z * [new branch] gh/eellison/881/orig -> origin/gh/eellison/881/orig 2025-12-04T08:54:02.3142450Z * [new branch] gh/eellison/882/base -> origin/gh/eellison/882/base 
2025-12-04T08:54:02.3142519Z * [new branch] gh/eellison/882/head -> origin/gh/eellison/882/head 2025-12-04T08:54:02.3142592Z * [new branch] gh/eellison/882/orig -> origin/gh/eellison/882/orig 2025-12-04T08:54:02.3142663Z * [new branch] gh/eellison/883/base -> origin/gh/eellison/883/base 2025-12-04T08:54:02.3142734Z * [new branch] gh/eellison/883/head -> origin/gh/eellison/883/head 2025-12-04T08:54:02.3142805Z * [new branch] gh/eellison/883/orig -> origin/gh/eellison/883/orig 2025-12-04T08:54:02.3142898Z * [new branch] gh/eellison/884/base -> origin/gh/eellison/884/base 2025-12-04T08:54:02.3142969Z * [new branch] gh/eellison/884/head -> origin/gh/eellison/884/head 2025-12-04T08:54:02.3143040Z * [new branch] gh/eellison/884/orig -> origin/gh/eellison/884/orig 2025-12-04T08:54:02.3143108Z * [new branch] gh/etaf/147/base -> origin/gh/etaf/147/base 2025-12-04T08:54:02.3143177Z * [new branch] gh/etaf/147/head -> origin/gh/etaf/147/head 2025-12-04T08:54:02.3143242Z * [new branch] gh/etaf/154/base -> origin/gh/etaf/154/base 2025-12-04T08:54:02.3143309Z * [new branch] gh/etaf/154/head -> origin/gh/etaf/154/head 2025-12-04T08:54:02.3143375Z * [new branch] gh/etaf/154/orig -> origin/gh/etaf/154/orig 2025-12-04T08:54:02.3143439Z * [new branch] gh/etaf/156/base -> origin/gh/etaf/156/base 2025-12-04T08:54:02.3143504Z * [new branch] gh/etaf/156/head -> origin/gh/etaf/156/head 2025-12-04T08:54:02.3143572Z * [new branch] gh/etaf/156/orig -> origin/gh/etaf/156/orig 2025-12-04T08:54:02.3143635Z * [new branch] gh/etaf/157/base -> origin/gh/etaf/157/base 2025-12-04T08:54:02.3143699Z * [new branch] gh/etaf/157/head -> origin/gh/etaf/157/head 2025-12-04T08:54:02.3143765Z * [new branch] gh/etaf/157/orig -> origin/gh/etaf/157/orig 2025-12-04T08:54:02.3143829Z * [new branch] gh/etaf/158/base -> origin/gh/etaf/158/base 2025-12-04T08:54:02.3143894Z * [new branch] gh/etaf/158/head -> origin/gh/etaf/158/head 2025-12-04T08:54:02.3143961Z * [new branch] gh/etaf/158/orig -> origin/gh/etaf/158/orig 
2025-12-04T08:54:02.3144025Z * [new branch] gh/etaf/159/base -> origin/gh/etaf/159/base 2025-12-04T08:54:02.3144089Z * [new branch] gh/etaf/159/head -> origin/gh/etaf/159/head 2025-12-04T08:54:02.3144155Z * [new branch] gh/etaf/159/orig -> origin/gh/etaf/159/orig 2025-12-04T08:54:02.3144219Z * [new branch] gh/etaf/160/base -> origin/gh/etaf/160/base 2025-12-04T08:54:02.3144283Z * [new branch] gh/etaf/160/head -> origin/gh/etaf/160/head 2025-12-04T08:54:02.3144348Z * [new branch] gh/etaf/160/orig -> origin/gh/etaf/160/orig 2025-12-04T08:54:02.3144413Z * [new branch] gh/etaf/161/base -> origin/gh/etaf/161/base 2025-12-04T08:54:02.3144495Z * [new branch] gh/etaf/161/head -> origin/gh/etaf/161/head 2025-12-04T08:54:02.3144561Z * [new branch] gh/etaf/161/orig -> origin/gh/etaf/161/orig 2025-12-04T08:54:02.3144625Z * [new branch] gh/etaf/166/base -> origin/gh/etaf/166/base 2025-12-04T08:54:02.3144691Z * [new branch] gh/etaf/166/head -> origin/gh/etaf/166/head 2025-12-04T08:54:02.3144757Z * [new branch] gh/etaf/166/orig -> origin/gh/etaf/166/orig 2025-12-04T08:54:02.3144822Z * [new branch] gh/etaf/167/base -> origin/gh/etaf/167/base 2025-12-04T08:54:02.3144888Z * [new branch] gh/etaf/167/head -> origin/gh/etaf/167/head 2025-12-04T08:54:02.3144951Z * [new branch] gh/etaf/167/orig -> origin/gh/etaf/167/orig 2025-12-04T08:54:02.3145015Z * [new branch] gh/etaf/168/base -> origin/gh/etaf/168/base 2025-12-04T08:54:02.3145082Z * [new branch] gh/etaf/168/head -> origin/gh/etaf/168/head 2025-12-04T08:54:02.3145147Z * [new branch] gh/etaf/168/orig -> origin/gh/etaf/168/orig 2025-12-04T08:54:02.3145212Z * [new branch] gh/etaf/172/base -> origin/gh/etaf/172/base 2025-12-04T08:54:02.3145279Z * [new branch] gh/etaf/172/head -> origin/gh/etaf/172/head 2025-12-04T08:54:02.3145362Z * [new branch] gh/etaf/172/orig -> origin/gh/etaf/172/orig 2025-12-04T08:54:02.3145427Z * [new branch] gh/etaf/173/base -> origin/gh/etaf/173/base 2025-12-04T08:54:02.3145493Z * [new branch] gh/etaf/173/head -> 
origin/gh/etaf/173/head 2025-12-04T08:54:02.3145557Z * [new branch] gh/etaf/173/orig -> origin/gh/etaf/173/orig 2025-12-04T08:54:02.3145621Z * [new branch] gh/etaf/174/base -> origin/gh/etaf/174/base 2025-12-04T08:54:02.3145688Z * [new branch] gh/etaf/174/head -> origin/gh/etaf/174/head 2025-12-04T08:54:02.3145753Z * [new branch] gh/etaf/175/base -> origin/gh/etaf/175/base 2025-12-04T08:54:02.3145817Z * [new branch] gh/etaf/175/head -> origin/gh/etaf/175/head 2025-12-04T08:54:02.3145882Z * [new branch] gh/etaf/175/orig -> origin/gh/etaf/175/orig 2025-12-04T08:54:02.3145947Z * [new branch] gh/etaf/176/base -> origin/gh/etaf/176/base 2025-12-04T08:54:02.3146011Z * [new branch] gh/etaf/176/head -> origin/gh/etaf/176/head 2025-12-04T08:54:02.3146077Z * [new branch] gh/etaf/176/orig -> origin/gh/etaf/176/orig 2025-12-04T08:54:02.3146141Z * [new branch] gh/etaf/177/base -> origin/gh/etaf/177/base 2025-12-04T08:54:02.3146206Z * [new branch] gh/etaf/177/head -> origin/gh/etaf/177/head 2025-12-04T08:54:02.3146270Z * [new branch] gh/etaf/177/orig -> origin/gh/etaf/177/orig 2025-12-04T08:54:02.3146336Z * [new branch] gh/etaf/178/base -> origin/gh/etaf/178/base 2025-12-04T08:54:02.3146402Z * [new branch] gh/etaf/178/head -> origin/gh/etaf/178/head 2025-12-04T08:54:02.3146466Z * [new branch] gh/etaf/178/orig -> origin/gh/etaf/178/orig 2025-12-04T08:54:02.3146530Z * [new branch] gh/etaf/179/base -> origin/gh/etaf/179/base 2025-12-04T08:54:02.3146597Z * [new branch] gh/etaf/179/head -> origin/gh/etaf/179/head 2025-12-04T08:54:02.3146661Z * [new branch] gh/etaf/179/orig -> origin/gh/etaf/179/orig 2025-12-04T08:54:02.3146724Z * [new branch] gh/etaf/180/base -> origin/gh/etaf/180/base 2025-12-04T08:54:02.3146790Z * [new branch] gh/etaf/180/head -> origin/gh/etaf/180/head 2025-12-04T08:54:02.3146854Z * [new branch] gh/etaf/180/orig -> origin/gh/etaf/180/orig 2025-12-04T08:54:02.3146955Z * [new branch] gh/exclamaforte/1/base -> origin/gh/exclamaforte/1/base 
2025-12-04T08:54:02.3147034Z * [new branch] gh/exclamaforte/1/head -> origin/gh/exclamaforte/1/head 2025-12-04T08:54:02.3147110Z * [new branch] gh/exclamaforte/2/base -> origin/gh/exclamaforte/2/base 2025-12-04T08:54:02.3147186Z * [new branch] gh/exclamaforte/2/head -> origin/gh/exclamaforte/2/head 2025-12-04T08:54:02.3147265Z * [new branch] gh/exclamaforte/3/base -> origin/gh/exclamaforte/3/base 2025-12-04T08:54:02.3147342Z * [new branch] gh/exclamaforte/3/head -> origin/gh/exclamaforte/3/head 2025-12-04T08:54:02.3147418Z * [new branch] gh/exclamaforte/4/base -> origin/gh/exclamaforte/4/base 2025-12-04T08:54:02.3147497Z * [new branch] gh/exclamaforte/4/head -> origin/gh/exclamaforte/4/head 2025-12-04T08:54:02.3147567Z * [new branch] gh/ezyang/2374/base -> origin/gh/ezyang/2374/base 2025-12-04T08:54:02.3147639Z * [new branch] gh/ezyang/2374/head -> origin/gh/ezyang/2374/head 2025-12-04T08:54:02.3147711Z * [new branch] gh/ezyang/2374/orig -> origin/gh/ezyang/2374/orig 2025-12-04T08:54:02.3147780Z * [new branch] gh/ezyang/2973/base -> origin/gh/ezyang/2973/base 2025-12-04T08:54:02.3147875Z * [new branch] gh/ezyang/2973/head -> origin/gh/ezyang/2973/head 2025-12-04T08:54:02.3147944Z * [new branch] gh/ezyang/2973/orig -> origin/gh/ezyang/2973/orig 2025-12-04T08:54:02.3148013Z * [new branch] gh/ezyang/2974/base -> origin/gh/ezyang/2974/base 2025-12-04T08:54:02.3148085Z * [new branch] gh/ezyang/2974/head -> origin/gh/ezyang/2974/head 2025-12-04T08:54:02.3148155Z * [new branch] gh/ezyang/2974/orig -> origin/gh/ezyang/2974/orig 2025-12-04T08:54:02.3148224Z * [new branch] gh/ezyang/3131/base -> origin/gh/ezyang/3131/base 2025-12-04T08:54:02.3148295Z * [new branch] gh/ezyang/3131/head -> origin/gh/ezyang/3131/head 2025-12-04T08:54:02.3148364Z * [new branch] gh/ezyang/3131/orig -> origin/gh/ezyang/3131/orig 2025-12-04T08:54:02.3148432Z * [new branch] gh/ezyang/3139/base -> origin/gh/ezyang/3139/base 2025-12-04T08:54:02.3148505Z * [new branch] gh/ezyang/3139/head -> 
origin/gh/ezyang/3139/head 2025-12-04T08:54:02.3148575Z * [new branch] gh/ezyang/3139/orig -> origin/gh/ezyang/3139/orig 2025-12-04T08:54:02.3148642Z * [new branch] gh/ezyang/3140/base -> origin/gh/ezyang/3140/base 2025-12-04T08:54:02.3148714Z * [new branch] gh/ezyang/3140/head -> origin/gh/ezyang/3140/head 2025-12-04T08:54:02.3148782Z * [new branch] gh/ezyang/3140/orig -> origin/gh/ezyang/3140/orig 2025-12-04T08:54:02.3148852Z * [new branch] gh/ezyang/3143/base -> origin/gh/ezyang/3143/base 2025-12-04T08:54:02.3148924Z * [new branch] gh/ezyang/3143/head -> origin/gh/ezyang/3143/head 2025-12-04T08:54:02.3148992Z * [new branch] gh/ezyang/3143/orig -> origin/gh/ezyang/3143/orig 2025-12-04T08:54:02.3149060Z * [new branch] gh/ezyang/3144/base -> origin/gh/ezyang/3144/base 2025-12-04T08:54:02.3149133Z * [new branch] gh/ezyang/3144/head -> origin/gh/ezyang/3144/head 2025-12-04T08:54:02.3149203Z * [new branch] gh/ezyang/3144/orig -> origin/gh/ezyang/3144/orig 2025-12-04T08:54:02.3149273Z * [new branch] gh/ezyang/3167/base -> origin/gh/ezyang/3167/base 2025-12-04T08:54:02.3149344Z * [new branch] gh/ezyang/3167/head -> origin/gh/ezyang/3167/head 2025-12-04T08:54:02.3149414Z * [new branch] gh/ezyang/3167/orig -> origin/gh/ezyang/3167/orig 2025-12-04T08:54:02.3149488Z * [new branch] gh/ezyang/3173/base -> origin/gh/ezyang/3173/base 2025-12-04T08:54:02.3149576Z * [new branch] gh/ezyang/3173/head -> origin/gh/ezyang/3173/head 2025-12-04T08:54:02.3149644Z * [new branch] gh/ezyang/3173/orig -> origin/gh/ezyang/3173/orig 2025-12-04T08:54:02.3149718Z * [new branch] gh/ezyang/3175/base -> origin/gh/ezyang/3175/base 2025-12-04T08:54:02.3149788Z * [new branch] gh/ezyang/3175/head -> origin/gh/ezyang/3175/head 2025-12-04T08:54:02.3149855Z * [new branch] gh/ezyang/3175/orig -> origin/gh/ezyang/3175/orig 2025-12-04T08:54:02.3149926Z * [new branch] gh/ezyang/3182/base -> origin/gh/ezyang/3182/base 2025-12-04T08:54:02.3149993Z * [new branch] gh/ezyang/3182/head -> 
origin/gh/ezyang/3182/head 2025-12-04T08:54:02.3150061Z * [new branch] gh/ezyang/3182/orig -> origin/gh/ezyang/3182/orig 2025-12-04T08:54:02.3150168Z * [new branch] gh/ezyang/3185/base -> origin/gh/ezyang/3185/base 2025-12-04T08:54:02.3150240Z * [new branch] gh/ezyang/3185/head -> origin/gh/ezyang/3185/head 2025-12-04T08:54:02.3150308Z * [new branch] gh/ezyang/3185/orig -> origin/gh/ezyang/3185/orig 2025-12-04T08:54:02.3150381Z * [new branch] gh/ezyang/3189/base -> origin/gh/ezyang/3189/base 2025-12-04T08:54:02.3150478Z * [new branch] gh/ezyang/3189/head -> origin/gh/ezyang/3189/head 2025-12-04T08:54:02.3150548Z * [new branch] gh/ezyang/3189/orig -> origin/gh/ezyang/3189/orig 2025-12-04T08:54:02.3150618Z * [new branch] gh/ezyang/3191/base -> origin/gh/ezyang/3191/base 2025-12-04T08:54:02.3150688Z * [new branch] gh/ezyang/3191/head -> origin/gh/ezyang/3191/head 2025-12-04T08:54:02.3150759Z * [new branch] gh/ezyang/3191/orig -> origin/gh/ezyang/3191/orig 2025-12-04T08:54:02.3150832Z * [new branch] gh/ezyang/3192/base -> origin/gh/ezyang/3192/base 2025-12-04T08:54:02.3150903Z * [new branch] gh/ezyang/3192/head -> origin/gh/ezyang/3192/head 2025-12-04T08:54:02.3150973Z * [new branch] gh/ezyang/3192/orig -> origin/gh/ezyang/3192/orig 2025-12-04T08:54:02.3151047Z * [new branch] gh/ezyang/3193/base -> origin/gh/ezyang/3193/base 2025-12-04T08:54:02.3151118Z * [new branch] gh/ezyang/3193/head -> origin/gh/ezyang/3193/head 2025-12-04T08:54:02.3151188Z * [new branch] gh/ezyang/3193/orig -> origin/gh/ezyang/3193/orig 2025-12-04T08:54:02.3151262Z * [new branch] gh/ezyang/3194/base -> origin/gh/ezyang/3194/base 2025-12-04T08:54:02.3151332Z * [new branch] gh/ezyang/3194/head -> origin/gh/ezyang/3194/head 2025-12-04T08:54:02.3151406Z * [new branch] gh/ezyang/3194/orig -> origin/gh/ezyang/3194/orig 2025-12-04T08:54:02.3151475Z * [new branch] gh/ezyang/3195/base -> origin/gh/ezyang/3195/base 2025-12-04T08:54:02.3151546Z * [new branch] gh/ezyang/3195/head -> 
origin/gh/ezyang/3195/head 2025-12-04T08:54:02.3151616Z * [new branch] gh/ezyang/3195/orig -> origin/gh/ezyang/3195/orig 2025-12-04T08:54:02.3151877Z * [new branch] gh/ezyang/3196/base -> origin/gh/ezyang/3196/base 2025-12-04T08:54:02.3151948Z * [new branch] gh/ezyang/3196/head -> origin/gh/ezyang/3196/head 2025-12-04T08:54:02.3152021Z * [new branch] gh/ezyang/3196/orig -> origin/gh/ezyang/3196/orig 2025-12-04T08:54:02.3152090Z * [new branch] gh/ezyang/3197/base -> origin/gh/ezyang/3197/base 2025-12-04T08:54:02.3152158Z * [new branch] gh/ezyang/3197/head -> origin/gh/ezyang/3197/head 2025-12-04T08:54:02.3152243Z * [new branch] gh/ezyang/3197/orig -> origin/gh/ezyang/3197/orig 2025-12-04T08:54:02.3152312Z * [new branch] gh/ezyang/3198/base -> origin/gh/ezyang/3198/base 2025-12-04T08:54:02.3152420Z * [new branch] gh/ezyang/3198/head -> origin/gh/ezyang/3198/head 2025-12-04T08:54:02.3152492Z * [new branch] gh/ezyang/3198/orig -> origin/gh/ezyang/3198/orig 2025-12-04T08:54:02.3152562Z * [new branch] gh/ezyang/3199/base -> origin/gh/ezyang/3199/base 2025-12-04T08:54:02.3152631Z * [new branch] gh/ezyang/3199/head -> origin/gh/ezyang/3199/head 2025-12-04T08:54:02.3152703Z * [new branch] gh/ezyang/3199/orig -> origin/gh/ezyang/3199/orig 2025-12-04T08:54:02.3152772Z * [new branch] gh/ezyang/3200/base -> origin/gh/ezyang/3200/base 2025-12-04T08:54:02.3152842Z * [new branch] gh/ezyang/3200/head -> origin/gh/ezyang/3200/head 2025-12-04T08:54:02.3152915Z * [new branch] gh/ezyang/3200/orig -> origin/gh/ezyang/3200/orig 2025-12-04T08:54:02.3152984Z * [new branch] gh/ezyang/3201/base -> origin/gh/ezyang/3201/base 2025-12-04T08:54:02.3153054Z * [new branch] gh/ezyang/3201/head -> origin/gh/ezyang/3201/head 2025-12-04T08:54:02.3153125Z * [new branch] gh/ezyang/3201/orig -> origin/gh/ezyang/3201/orig 2025-12-04T08:54:02.3153195Z * [new branch] gh/ezyang/3202/base -> origin/gh/ezyang/3202/base 2025-12-04T08:54:02.3153285Z * [new branch] gh/ezyang/3202/head -> 
origin/gh/ezyang/3202/head 2025-12-04T08:54:02.3153354Z * [new branch] gh/ezyang/3202/orig -> origin/gh/ezyang/3202/orig 2025-12-04T08:54:02.3153423Z * [new branch] gh/ezyang/3203/base -> origin/gh/ezyang/3203/base 2025-12-04T08:54:02.3153494Z * [new branch] gh/ezyang/3203/head -> origin/gh/ezyang/3203/head 2025-12-04T08:54:02.3153562Z * [new branch] gh/ezyang/3203/orig -> origin/gh/ezyang/3203/orig 2025-12-04T08:54:02.3153631Z * [new branch] gh/ezyang/3204/base -> origin/gh/ezyang/3204/base 2025-12-04T08:54:02.3153703Z * [new branch] gh/ezyang/3204/head -> origin/gh/ezyang/3204/head 2025-12-04T08:54:02.3153773Z * [new branch] gh/ezyang/3204/orig -> origin/gh/ezyang/3204/orig 2025-12-04T08:54:02.3153842Z * [new branch] gh/ezyang/3205/base -> origin/gh/ezyang/3205/base 2025-12-04T08:54:02.3153915Z * [new branch] gh/ezyang/3205/head -> origin/gh/ezyang/3205/head 2025-12-04T08:54:02.3153983Z * [new branch] gh/ezyang/3205/orig -> origin/gh/ezyang/3205/orig 2025-12-04T08:54:02.3154052Z * [new branch] gh/ezyang/3206/base -> origin/gh/ezyang/3206/base 2025-12-04T08:54:02.3154123Z * [new branch] gh/ezyang/3206/head -> origin/gh/ezyang/3206/head 2025-12-04T08:54:02.3154192Z * [new branch] gh/ezyang/3206/orig -> origin/gh/ezyang/3206/orig 2025-12-04T08:54:02.3154260Z * [new branch] gh/ezyang/3207/base -> origin/gh/ezyang/3207/base 2025-12-04T08:54:02.3154332Z * [new branch] gh/ezyang/3207/head -> origin/gh/ezyang/3207/head 2025-12-04T08:54:02.3154401Z * [new branch] gh/ezyang/3207/orig -> origin/gh/ezyang/3207/orig 2025-12-04T08:54:02.3154471Z * [new branch] gh/ezyang/3208/base -> origin/gh/ezyang/3208/base 2025-12-04T08:54:02.3154542Z * [new branch] gh/ezyang/3208/head -> origin/gh/ezyang/3208/head 2025-12-04T08:54:02.3154618Z * [new branch] gh/ezyang/3208/orig -> origin/gh/ezyang/3208/orig 2025-12-04T08:54:02.3163695Z * [new branch] gh/ezyang/3209/base -> origin/gh/ezyang/3209/base 2025-12-04T08:54:02.3163825Z * [new branch] gh/ezyang/3209/head -> 
origin/gh/ezyang/3209/head 2025-12-04T08:54:02.3163900Z * [new branch] gh/ezyang/3209/orig -> origin/gh/ezyang/3209/orig 2025-12-04T08:54:02.3163976Z * [new branch] gh/fadara01/3/base -> origin/gh/fadara01/3/base 2025-12-04T08:54:02.3164114Z * [new branch] gh/fadara01/3/head -> origin/gh/fadara01/3/head 2025-12-04T08:54:02.3164184Z * [new branch] gh/fadara01/3/orig -> origin/gh/fadara01/3/orig 2025-12-04T08:54:02.3164260Z * [new branch] gh/fadara01/5/base -> origin/gh/fadara01/5/base 2025-12-04T08:54:02.3164332Z * [new branch] gh/fadara01/5/head -> origin/gh/fadara01/5/head 2025-12-04T08:54:02.3164401Z * [new branch] gh/fadara01/5/orig -> origin/gh/fadara01/5/orig 2025-12-04T08:54:02.3164472Z * [new branch] gh/fadara01/6/base -> origin/gh/fadara01/6/base 2025-12-04T08:54:02.3164541Z * [new branch] gh/fadara01/6/head -> origin/gh/fadara01/6/head 2025-12-04T08:54:02.3164612Z * [new branch] gh/fadara01/6/orig -> origin/gh/fadara01/6/orig 2025-12-04T08:54:02.3164682Z * [new branch] gh/fadara01/7/base -> origin/gh/fadara01/7/base 2025-12-04T08:54:02.3164753Z * [new branch] gh/fadara01/7/head -> origin/gh/fadara01/7/head 2025-12-04T08:54:02.3164822Z * [new branch] gh/fadara01/7/orig -> origin/gh/fadara01/7/orig 2025-12-04T08:54:02.3164895Z * [new branch] gh/fadara01/8/base -> origin/gh/fadara01/8/base 2025-12-04T08:54:02.3164988Z * [new branch] gh/fadara01/8/head -> origin/gh/fadara01/8/head 2025-12-04T08:54:02.3165057Z * [new branch] gh/fadara01/8/orig -> origin/gh/fadara01/8/orig 2025-12-04T08:54:02.3165129Z * [new branch] gh/fadara01/9/base -> origin/gh/fadara01/9/base 2025-12-04T08:54:02.3165198Z * [new branch] gh/fadara01/9/head -> origin/gh/fadara01/9/head 2025-12-04T08:54:02.3165266Z * [new branch] gh/fadara01/9/orig -> origin/gh/fadara01/9/orig 2025-12-04T08:54:02.3165337Z * [new branch] gh/fduwjj/182/base -> origin/gh/fduwjj/182/base 2025-12-04T08:54:02.3165407Z * [new branch] gh/fduwjj/182/head -> origin/gh/fduwjj/182/head 2025-12-04T08:54:02.3165476Z * [new 
branch] gh/fduwjj/182/orig -> origin/gh/fduwjj/182/orig 2025-12-04T08:54:02.3165551Z * [new branch] gh/fduwjj/211/base -> origin/gh/fduwjj/211/base 2025-12-04T08:54:02.3165621Z * [new branch] gh/fduwjj/211/head -> origin/gh/fduwjj/211/head 2025-12-04T08:54:02.3165691Z * [new branch] gh/fduwjj/211/orig -> origin/gh/fduwjj/211/orig 2025-12-04T08:54:02.3165765Z * [new branch] gh/fduwjj/212/base -> origin/gh/fduwjj/212/base 2025-12-04T08:54:02.3165834Z * [new branch] gh/fduwjj/212/head -> origin/gh/fduwjj/212/head 2025-12-04T08:54:02.3165908Z * [new branch] gh/fduwjj/212/orig -> origin/gh/fduwjj/212/orig 2025-12-04T08:54:02.3165977Z * [new branch] gh/fduwjj/213/base -> origin/gh/fduwjj/213/base 2025-12-04T08:54:02.3166047Z * [new branch] gh/fduwjj/213/head -> origin/gh/fduwjj/213/head 2025-12-04T08:54:02.3166120Z * [new branch] gh/fduwjj/213/orig -> origin/gh/fduwjj/213/orig 2025-12-04T08:54:02.3166188Z * [new branch] gh/fduwjj/226/base -> origin/gh/fduwjj/226/base 2025-12-04T08:54:02.3166259Z * [new branch] gh/fduwjj/226/head -> origin/gh/fduwjj/226/head 2025-12-04T08:54:02.3166334Z * [new branch] gh/fduwjj/226/orig -> origin/gh/fduwjj/226/orig 2025-12-04T08:54:02.3166403Z * [new branch] gh/fduwjj/229/base -> origin/gh/fduwjj/229/base 2025-12-04T08:54:02.3166471Z * [new branch] gh/fduwjj/229/head -> origin/gh/fduwjj/229/head 2025-12-04T08:54:02.3166540Z * [new branch] gh/fduwjj/229/orig -> origin/gh/fduwjj/229/orig 2025-12-04T08:54:02.3166609Z * [new branch] gh/fduwjj/233/base -> origin/gh/fduwjj/233/base 2025-12-04T08:54:02.3166704Z * [new branch] gh/fduwjj/233/head -> origin/gh/fduwjj/233/head 2025-12-04T08:54:02.3166776Z * [new branch] gh/fduwjj/233/orig -> origin/gh/fduwjj/233/orig 2025-12-04T08:54:02.3166846Z * [new branch] gh/fduwjj/234/base -> origin/gh/fduwjj/234/base 2025-12-04T08:54:02.3166915Z * [new branch] gh/fduwjj/234/head -> origin/gh/fduwjj/234/head 2025-12-04T08:54:02.3166987Z * [new branch] gh/fduwjj/234/orig -> origin/gh/fduwjj/234/orig 
2025-12-04T08:54:02.3167055Z * [new branch] gh/fduwjj/235/base -> origin/gh/fduwjj/235/base 2025-12-04T08:54:02.3167124Z * [new branch] gh/fduwjj/235/head -> origin/gh/fduwjj/235/head 2025-12-04T08:54:02.3167195Z * [new branch] gh/fduwjj/235/orig -> origin/gh/fduwjj/235/orig 2025-12-04T08:54:02.3167262Z * [new branch] gh/fduwjj/236/base -> origin/gh/fduwjj/236/base 2025-12-04T08:54:02.3167332Z * [new branch] gh/fduwjj/236/head -> origin/gh/fduwjj/236/head 2025-12-04T08:54:02.3167405Z * [new branch] gh/fduwjj/236/orig -> origin/gh/fduwjj/236/orig 2025-12-04T08:54:02.3167473Z * [new branch] gh/fduwjj/237/base -> origin/gh/fduwjj/237/base 2025-12-04T08:54:02.3167541Z * [new branch] gh/fduwjj/237/head -> origin/gh/fduwjj/237/head 2025-12-04T08:54:02.3167633Z * [new branch] gh/fduwjj/237/orig -> origin/gh/fduwjj/237/orig 2025-12-04T08:54:02.3167705Z * [new branch] gh/fduwjj/238/base -> origin/gh/fduwjj/238/base 2025-12-04T08:54:02.3167775Z * [new branch] gh/fduwjj/238/head -> origin/gh/fduwjj/238/head 2025-12-04T08:54:02.3167844Z * [new branch] gh/fduwjj/238/orig -> origin/gh/fduwjj/238/orig 2025-12-04T08:54:02.3167915Z * [new branch] gh/fduwjj/239/base -> origin/gh/fduwjj/239/base 2025-12-04T08:54:02.3167990Z * [new branch] gh/fduwjj/239/head -> origin/gh/fduwjj/239/head 2025-12-04T08:54:02.3168059Z * [new branch] gh/fduwjj/239/orig -> origin/gh/fduwjj/239/orig 2025-12-04T08:54:02.3168131Z * [new branch] gh/fegin/332/base -> origin/gh/fegin/332/base 2025-12-04T08:54:02.3168205Z * [new branch] gh/fegin/332/head -> origin/gh/fegin/332/head 2025-12-04T08:54:02.3168274Z * [new branch] gh/fegin/332/orig -> origin/gh/fegin/332/orig 2025-12-04T08:54:02.3168342Z * [new branch] gh/fegin/333/base -> origin/gh/fegin/333/base 2025-12-04T08:54:02.3168412Z * [new branch] gh/fegin/333/head -> origin/gh/fegin/333/head 2025-12-04T08:54:02.3168479Z * [new branch] gh/fegin/333/orig -> origin/gh/fegin/333/orig 2025-12-04T08:54:02.3168547Z * [new branch] gh/fegin/334/base -> 
origin/gh/fegin/334/base 2025-12-04T08:54:02.3168618Z * [new branch] gh/fegin/334/head -> origin/gh/fegin/334/head 2025-12-04T08:54:02.3168683Z * [new branch] gh/fegin/334/orig -> origin/gh/fegin/334/orig 2025-12-04T08:54:02.3168754Z * [new branch] gh/fegin/335/base -> origin/gh/fegin/335/base 2025-12-04T08:54:02.3168824Z * [new branch] gh/fegin/335/head -> origin/gh/fegin/335/head 2025-12-04T08:54:02.3168892Z * [new branch] gh/fegin/335/orig -> origin/gh/fegin/335/orig 2025-12-04T08:54:02.3168961Z * [new branch] gh/fffrog/160/base -> origin/gh/fffrog/160/base 2025-12-04T08:54:02.3169031Z * [new branch] gh/fffrog/160/head -> origin/gh/fffrog/160/head 2025-12-04T08:54:02.3169099Z * [new branch] gh/fffrog/177/base -> origin/gh/fffrog/177/base 2025-12-04T08:54:02.3169166Z * [new branch] gh/fffrog/177/head -> origin/gh/fffrog/177/head 2025-12-04T08:54:02.3169255Z * [new branch] gh/fffrog/177/orig -> origin/gh/fffrog/177/orig 2025-12-04T08:54:02.3169323Z * [new branch] gh/fffrog/178/base -> origin/gh/fffrog/178/base 2025-12-04T08:54:02.3169389Z * [new branch] gh/fffrog/178/head -> origin/gh/fffrog/178/head 2025-12-04T08:54:02.3169456Z * [new branch] gh/fffrog/178/orig -> origin/gh/fffrog/178/orig 2025-12-04T08:54:02.3169525Z * [new branch] gh/fffrog/181/base -> origin/gh/fffrog/181/base 2025-12-04T08:54:02.3169592Z * [new branch] gh/fffrog/181/head -> origin/gh/fffrog/181/head 2025-12-04T08:54:02.3169660Z * [new branch] gh/fffrog/181/orig -> origin/gh/fffrog/181/orig 2025-12-04T08:54:02.3169727Z * [new branch] gh/fffrog/183/base -> origin/gh/fffrog/183/base 2025-12-04T08:54:02.3169793Z * [new branch] gh/fffrog/183/head -> origin/gh/fffrog/183/head 2025-12-04T08:54:02.3169861Z * [new branch] gh/fffrog/183/orig -> origin/gh/fffrog/183/orig 2025-12-04T08:54:02.3169931Z * [new branch] gh/fxdawnn/10/base -> origin/gh/fxdawnn/10/base 2025-12-04T08:54:02.3169999Z * [new branch] gh/fxdawnn/10/head -> origin/gh/fxdawnn/10/head 2025-12-04T08:54:02.3170068Z * [new branch] 
gh/fxdawnn/10/orig -> origin/gh/fxdawnn/10/orig 2025-12-04T08:54:02.3170209Z * [new branch] gh/fxdawnn/11/base -> origin/gh/fxdawnn/11/base 2025-12-04T08:54:02.3170282Z * [new branch] gh/fxdawnn/11/head -> origin/gh/fxdawnn/11/head 2025-12-04T08:54:02.3170350Z * [new branch] gh/fxdawnn/11/orig -> origin/gh/fxdawnn/11/orig 2025-12-04T08:54:02.3170418Z * [new branch] gh/fxdawnn/12/base -> origin/gh/fxdawnn/12/base 2025-12-04T08:54:02.3170487Z * [new branch] gh/fxdawnn/12/head -> origin/gh/fxdawnn/12/head 2025-12-04T08:54:02.3170555Z * [new branch] gh/fxdawnn/12/orig -> origin/gh/fxdawnn/12/orig 2025-12-04T08:54:02.3170624Z * [new branch] gh/fxdawnn/13/base -> origin/gh/fxdawnn/13/base 2025-12-04T08:54:02.3170693Z * [new branch] gh/fxdawnn/13/head -> origin/gh/fxdawnn/13/head 2025-12-04T08:54:02.3170761Z * [new branch] gh/fxdawnn/13/orig -> origin/gh/fxdawnn/13/orig 2025-12-04T08:54:02.3170830Z * [new branch] gh/fxdawnn/14/base -> origin/gh/fxdawnn/14/base 2025-12-04T08:54:02.3170900Z * [new branch] gh/fxdawnn/14/head -> origin/gh/fxdawnn/14/head 2025-12-04T08:54:02.3170968Z * [new branch] gh/fxdawnn/14/orig -> origin/gh/fxdawnn/14/orig 2025-12-04T08:54:02.3171036Z * [new branch] gh/fxdawnn/15/base -> origin/gh/fxdawnn/15/base 2025-12-04T08:54:02.3171105Z * [new branch] gh/fxdawnn/15/head -> origin/gh/fxdawnn/15/head 2025-12-04T08:54:02.3171173Z * [new branch] gh/fxdawnn/15/orig -> origin/gh/fxdawnn/15/orig 2025-12-04T08:54:02.3171244Z * [new branch] gh/fxdawnn/6/base -> origin/gh/fxdawnn/6/base 2025-12-04T08:54:02.3171314Z * [new branch] gh/fxdawnn/6/head -> origin/gh/fxdawnn/6/head 2025-12-04T08:54:02.3171381Z * [new branch] gh/fxdawnn/6/orig -> origin/gh/fxdawnn/6/orig 2025-12-04T08:54:02.3171450Z * [new branch] gh/fxdawnn/7/base -> origin/gh/fxdawnn/7/base 2025-12-04T08:54:02.3171520Z * [new branch] gh/fxdawnn/7/head -> origin/gh/fxdawnn/7/head 2025-12-04T08:54:02.3171587Z * [new branch] gh/fxdawnn/7/orig -> origin/gh/fxdawnn/7/orig 2025-12-04T08:54:02.3171653Z * 
[new branch] gh/fxdawnn/9/base -> origin/gh/fxdawnn/9/base 2025-12-04T08:54:02.3171721Z * [new branch] gh/fxdawnn/9/head -> origin/gh/fxdawnn/9/head 2025-12-04T08:54:02.3171788Z * [new branch] gh/fxdawnn/9/orig -> origin/gh/fxdawnn/9/orig 2025-12-04T08:54:02.3171879Z * [new branch] gh/galv/1/base -> origin/gh/galv/1/base 2025-12-04T08:54:02.3171947Z * [new branch] gh/galv/1/head -> origin/gh/galv/1/head 2025-12-04T08:54:02.3172011Z * [new branch] gh/galv/1/orig -> origin/gh/galv/1/orig 2025-12-04T08:54:02.3172077Z * [new branch] gh/galv/2/base -> origin/gh/galv/2/base 2025-12-04T08:54:02.3172140Z * [new branch] gh/galv/2/head -> origin/gh/galv/2/head 2025-12-04T08:54:02.3172203Z * [new branch] gh/galv/2/orig -> origin/gh/galv/2/orig 2025-12-04T08:54:02.3172267Z * [new branch] gh/galv/3/base -> origin/gh/galv/3/base 2025-12-04T08:54:02.3172331Z * [new branch] gh/galv/3/head -> origin/gh/galv/3/head 2025-12-04T08:54:02.3172394Z * [new branch] gh/galv/3/orig -> origin/gh/galv/3/orig 2025-12-04T08:54:02.3172474Z * [new branch] gh/guangyey/134/base -> origin/gh/guangyey/134/base 2025-12-04T08:54:02.3172549Z * [new branch] gh/guangyey/134/head -> origin/gh/guangyey/134/head 2025-12-04T08:54:02.3172621Z * [new branch] gh/guangyey/134/orig -> origin/gh/guangyey/134/orig 2025-12-04T08:54:02.3172694Z * [new branch] gh/guangyey/163/base -> origin/gh/guangyey/163/base 2025-12-04T08:54:02.3172784Z * [new branch] gh/guangyey/163/head -> origin/gh/guangyey/163/head 2025-12-04T08:54:02.3172855Z * [new branch] gh/guangyey/163/orig -> origin/gh/guangyey/163/orig 2025-12-04T08:54:02.3172929Z * [new branch] gh/guangyey/168/base -> origin/gh/guangyey/168/base 2025-12-04T08:54:02.3173000Z * [new branch] gh/guangyey/168/head -> origin/gh/guangyey/168/head 2025-12-04T08:54:02.3173070Z * [new branch] gh/guangyey/168/orig -> origin/gh/guangyey/168/orig 2025-12-04T08:54:02.3173143Z * [new branch] gh/guangyey/169/base -> origin/gh/guangyey/169/base 2025-12-04T08:54:02.3173352Z * [new branch] 
gh/guangyey/169/head -> origin/gh/guangyey/169/head 2025-12-04T08:54:02.3173423Z * [new branch] gh/guangyey/169/orig -> origin/gh/guangyey/169/orig 2025-12-04T08:54:02.3173496Z * [new branch] gh/guangyey/170/base -> origin/gh/guangyey/170/base 2025-12-04T08:54:02.3173567Z * [new branch] gh/guangyey/170/head -> origin/gh/guangyey/170/head 2025-12-04T08:54:02.3173638Z * [new branch] gh/guangyey/170/orig -> origin/gh/guangyey/170/orig 2025-12-04T08:54:02.3173710Z * [new branch] gh/guangyey/171/base -> origin/gh/guangyey/171/base 2025-12-04T08:54:02.3173780Z * [new branch] gh/guangyey/171/head -> origin/gh/guangyey/171/head 2025-12-04T08:54:02.3173851Z * [new branch] gh/guangyey/171/orig -> origin/gh/guangyey/171/orig 2025-12-04T08:54:02.3173922Z * [new branch] gh/guangyey/178/base -> origin/gh/guangyey/178/base 2025-12-04T08:54:02.3173993Z * [new branch] gh/guangyey/178/head -> origin/gh/guangyey/178/head 2025-12-04T08:54:02.3174066Z * [new branch] gh/guangyey/178/orig -> origin/gh/guangyey/178/orig 2025-12-04T08:54:02.3174137Z * [new branch] gh/guangyey/182/base -> origin/gh/guangyey/182/base 2025-12-04T08:54:02.3174208Z * [new branch] gh/guangyey/182/head -> origin/gh/guangyey/182/head 2025-12-04T08:54:02.3174279Z * [new branch] gh/guangyey/182/orig -> origin/gh/guangyey/182/orig 2025-12-04T08:54:02.3174350Z * [new branch] gh/guangyey/183/base -> origin/gh/guangyey/183/base 2025-12-04T08:54:02.3174420Z * [new branch] gh/guangyey/183/head -> origin/gh/guangyey/183/head 2025-12-04T08:54:02.3174492Z * [new branch] gh/guangyey/183/orig -> origin/gh/guangyey/183/orig 2025-12-04T08:54:02.3174637Z * [new branch] gh/guangyey/185/base -> origin/gh/guangyey/185/base 2025-12-04T08:54:02.3174707Z * [new branch] gh/guangyey/185/head -> origin/gh/guangyey/185/head 2025-12-04T08:54:02.3174779Z * [new branch] gh/guangyey/185/orig -> origin/gh/guangyey/185/orig 2025-12-04T08:54:02.3174850Z * [new branch] gh/guangyey/186/base -> origin/gh/guangyey/186/base 
2025-12-04T08:54:02.3174920Z * [new branch] gh/guangyey/186/head -> origin/gh/guangyey/186/head 2025-12-04T08:54:02.3174991Z * [new branch] gh/guangyey/186/orig -> origin/gh/guangyey/186/orig 2025-12-04T08:54:02.3175061Z * [new branch] gh/guangyey/187/base -> origin/gh/guangyey/187/base 2025-12-04T08:54:02.3175131Z * [new branch] gh/guangyey/187/head -> origin/gh/guangyey/187/head 2025-12-04T08:54:02.3175203Z * [new branch] gh/guangyey/187/orig -> origin/gh/guangyey/187/orig 2025-12-04T08:54:02.3175275Z * [new branch] gh/guangyey/188/base -> origin/gh/guangyey/188/base 2025-12-04T08:54:02.3175347Z * [new branch] gh/guangyey/188/head -> origin/gh/guangyey/188/head 2025-12-04T08:54:02.3175418Z * [new branch] gh/guangyey/188/orig -> origin/gh/guangyey/188/orig 2025-12-04T08:54:02.3175507Z * [new branch] gh/guangyey/190/base -> origin/gh/guangyey/190/base 2025-12-04T08:54:02.3175581Z * [new branch] gh/guangyey/190/head -> origin/gh/guangyey/190/head 2025-12-04T08:54:02.3175652Z * [new branch] gh/guangyey/190/orig -> origin/gh/guangyey/190/orig 2025-12-04T08:54:02.3175722Z * [new branch] gh/guangyey/208/base -> origin/gh/guangyey/208/base 2025-12-04T08:54:02.3175793Z * [new branch] gh/guangyey/208/head -> origin/gh/guangyey/208/head 2025-12-04T08:54:02.3175864Z * [new branch] gh/guangyey/208/orig -> origin/gh/guangyey/208/orig 2025-12-04T08:54:02.3175936Z * [new branch] gh/guangyey/228/base -> origin/gh/guangyey/228/base 2025-12-04T08:54:02.3176009Z * [new branch] gh/guangyey/228/head -> origin/gh/guangyey/228/head 2025-12-04T08:54:02.3176080Z * [new branch] gh/guangyey/228/orig -> origin/gh/guangyey/228/orig 2025-12-04T08:54:02.3176151Z * [new branch] gh/guangyey/230/base -> origin/gh/guangyey/230/base 2025-12-04T08:54:02.3176224Z * [new branch] gh/guangyey/230/head -> origin/gh/guangyey/230/head 2025-12-04T08:54:02.3176294Z * [new branch] gh/guangyey/230/orig -> origin/gh/guangyey/230/orig 2025-12-04T08:54:02.3176364Z * [new branch] gh/guangyey/231/base -> 
origin/gh/guangyey/231/base 2025-12-04T08:54:02.3176436Z * [new branch] gh/guangyey/231/head -> origin/gh/guangyey/231/head 2025-12-04T08:54:02.3176506Z * [new branch] gh/guangyey/231/orig -> origin/gh/guangyey/231/orig 2025-12-04T08:54:02.3176626Z * [new branch] gh/guangyey/232/base -> origin/gh/guangyey/232/base 2025-12-04T08:54:02.3176698Z * [new branch] gh/guangyey/232/head -> origin/gh/guangyey/232/head 2025-12-04T08:54:02.3176768Z * [new branch] gh/guangyey/232/orig -> origin/gh/guangyey/232/orig 2025-12-04T08:54:02.3176841Z * [new branch] gh/guangyey/233/base -> origin/gh/guangyey/233/base 2025-12-04T08:54:02.3176911Z * [new branch] gh/guangyey/233/head -> origin/gh/guangyey/233/head 2025-12-04T08:54:02.3176981Z * [new branch] gh/guangyey/233/orig -> origin/gh/guangyey/233/orig 2025-12-04T08:54:02.3177053Z * [new branch] gh/guangyey/234/base -> origin/gh/guangyey/234/base 2025-12-04T08:54:02.3177122Z * [new branch] gh/guangyey/234/head -> origin/gh/guangyey/234/head 2025-12-04T08:54:02.3177193Z * [new branch] gh/guangyey/234/orig -> origin/gh/guangyey/234/orig 2025-12-04T08:54:02.3177298Z * [new branch] gh/guangyey/235/base -> origin/gh/guangyey/235/base 2025-12-04T08:54:02.3177368Z * [new branch] gh/guangyey/235/head -> origin/gh/guangyey/235/head 2025-12-04T08:54:02.3177437Z * [new branch] gh/guangyey/235/orig -> origin/gh/guangyey/235/orig 2025-12-04T08:54:02.3177511Z * [new branch] gh/guangyey/236/base -> origin/gh/guangyey/236/base 2025-12-04T08:54:02.3177581Z * [new branch] gh/guangyey/236/head -> origin/gh/guangyey/236/head 2025-12-04T08:54:02.3177652Z * [new branch] gh/guangyey/236/orig -> origin/gh/guangyey/236/orig 2025-12-04T08:54:02.3177725Z * [new branch] gh/guangyey/237/base -> origin/gh/guangyey/237/base 2025-12-04T08:54:02.3177797Z * [new branch] gh/guangyey/237/head -> origin/gh/guangyey/237/head 2025-12-04T08:54:02.3177870Z * [new branch] gh/guangyey/237/orig -> origin/gh/guangyey/237/orig 2025-12-04T08:54:02.3177940Z * [new branch] 
gh/guangyey/238/base -> origin/gh/guangyey/238/base 2025-12-04T08:54:02.3178012Z * [new branch] gh/guangyey/238/head -> origin/gh/guangyey/238/head 2025-12-04T08:54:02.3178082Z * [new branch] gh/guangyey/239/base -> origin/gh/guangyey/239/base 2025-12-04T08:54:02.3178187Z * [new branch] gh/guangyey/239/head -> origin/gh/guangyey/239/head 2025-12-04T08:54:02.3178258Z * [new branch] gh/guangyey/239/orig -> origin/gh/guangyey/239/orig 2025-12-04T08:54:02.3178328Z * [new branch] gh/guangyey/240/base -> origin/gh/guangyey/240/base 2025-12-04T08:54:02.3178402Z * [new branch] gh/guangyey/240/head -> origin/gh/guangyey/240/head 2025-12-04T08:54:02.3178471Z * [new branch] gh/guangyey/240/orig -> origin/gh/guangyey/240/orig 2025-12-04T08:54:02.3178545Z * [new branch] gh/guangyey/241/base -> origin/gh/guangyey/241/base 2025-12-04T08:54:02.3178614Z * [new branch] gh/guangyey/241/head -> origin/gh/guangyey/241/head 2025-12-04T08:54:02.3178685Z * [new branch] gh/guangyey/241/orig -> origin/gh/guangyey/241/orig 2025-12-04T08:54:02.3178756Z * [new branch] gh/guangyey/242/base -> origin/gh/guangyey/242/base 2025-12-04T08:54:02.3178829Z * [new branch] gh/guangyey/242/head -> origin/gh/guangyey/242/head 2025-12-04T08:54:02.3178902Z * [new branch] gh/guangyey/242/orig -> origin/gh/guangyey/242/orig 2025-12-04T08:54:02.3178976Z * [new branch] gh/guangyey/243/base -> origin/gh/guangyey/243/base 2025-12-04T08:54:02.3179046Z * [new branch] gh/guangyey/243/head -> origin/gh/guangyey/243/head 2025-12-04T08:54:02.3179118Z * [new branch] gh/guangyey/243/orig -> origin/gh/guangyey/243/orig 2025-12-04T08:54:02.3179194Z * [new branch] gh/guangyey/244/base -> origin/gh/guangyey/244/base 2025-12-04T08:54:02.3179265Z * [new branch] gh/guangyey/244/head -> origin/gh/guangyey/244/head 2025-12-04T08:54:02.3179337Z * [new branch] gh/guangyey/244/orig -> origin/gh/guangyey/244/orig 2025-12-04T08:54:02.3179412Z * [new branch] gh/guangyey/245/base -> origin/gh/guangyey/245/base 
2025-12-04T08:54:02.3179485Z * [new branch] gh/guangyey/245/head -> origin/gh/guangyey/245/head 2025-12-04T08:54:02.3179557Z * [new branch] gh/guangyey/245/orig -> origin/gh/guangyey/245/orig 2025-12-04T08:54:02.3179633Z * [new branch] gh/guangyey/246/base -> origin/gh/guangyey/246/base 2025-12-04T08:54:02.3179705Z * [new branch] gh/guangyey/246/head -> origin/gh/guangyey/246/head 2025-12-04T08:54:02.3179777Z * [new branch] gh/guangyey/246/orig -> origin/gh/guangyey/246/orig 2025-12-04T08:54:02.3179878Z * [new branch] gh/guangyey/247/base -> origin/gh/guangyey/247/base 2025-12-04T08:54:02.3179950Z * [new branch] gh/guangyey/247/head -> origin/gh/guangyey/247/head 2025-12-04T08:54:02.3180026Z * [new branch] gh/guangyey/247/orig -> origin/gh/guangyey/247/orig 2025-12-04T08:54:02.3180142Z * [new branch] gh/guangyey/248/base -> origin/gh/guangyey/248/base 2025-12-04T08:54:02.3180215Z * [new branch] gh/guangyey/248/head -> origin/gh/guangyey/248/head 2025-12-04T08:54:02.3180290Z * [new branch] gh/guangyey/248/orig -> origin/gh/guangyey/248/orig 2025-12-04T08:54:02.3180363Z * [new branch] gh/guangyey/249/base -> origin/gh/guangyey/249/base 2025-12-04T08:54:02.3180435Z * [new branch] gh/guangyey/249/head -> origin/gh/guangyey/249/head 2025-12-04T08:54:02.3180511Z * [new branch] gh/guangyey/249/orig -> origin/gh/guangyey/249/orig 2025-12-04T08:54:02.3180585Z * [new branch] gh/guangyey/250/base -> origin/gh/guangyey/250/base 2025-12-04T08:54:02.3180657Z * [new branch] gh/guangyey/250/head -> origin/gh/guangyey/250/head 2025-12-04T08:54:02.3180733Z * [new branch] gh/guangyey/250/orig -> origin/gh/guangyey/250/orig 2025-12-04T08:54:02.3180848Z * [new branch] gh/guangyey/251/base -> origin/gh/guangyey/251/base 2025-12-04T08:54:02.3180921Z * [new branch] gh/guangyey/251/head -> origin/gh/guangyey/251/head 2025-12-04T08:54:02.3180997Z * [new branch] gh/guangyey/251/orig -> origin/gh/guangyey/251/orig 2025-12-04T08:54:02.3181069Z * [new branch] gh/guangyey/252/base -> 
origin/gh/guangyey/252/base 2025-12-04T08:54:02.3181142Z * [new branch] gh/guangyey/252/head -> origin/gh/guangyey/252/head 2025-12-04T08:54:02.3181217Z * [new branch] gh/guangyey/252/orig -> origin/gh/guangyey/252/orig 2025-12-04T08:54:02.3181292Z * [new branch] gh/guangyey/253/base -> origin/gh/guangyey/253/base 2025-12-04T08:54:02.3181364Z * [new branch] gh/guangyey/253/head -> origin/gh/guangyey/253/head 2025-12-04T08:54:02.3181440Z * [new branch] gh/guangyey/253/orig -> origin/gh/guangyey/253/orig 2025-12-04T08:54:02.3181513Z * [new branch] gh/guangyey/254/base -> origin/gh/guangyey/254/base 2025-12-04T08:54:02.3181585Z * [new branch] gh/guangyey/254/head -> origin/gh/guangyey/254/head 2025-12-04T08:54:02.3181655Z * [new branch] gh/guangyey/254/orig -> origin/gh/guangyey/254/orig 2025-12-04T08:54:02.3181725Z * [new branch] gh/guangyey/255/base -> origin/gh/guangyey/255/base 2025-12-04T08:54:02.3181797Z * [new branch] gh/guangyey/255/head -> origin/gh/guangyey/255/head 2025-12-04T08:54:02.3181868Z * [new branch] gh/guangyey/255/orig -> origin/gh/guangyey/255/orig 2025-12-04T08:54:02.3181967Z * [new branch] gh/guilhermeleobas/107/base -> origin/gh/guilhermeleobas/107/base 2025-12-04T08:54:02.3182059Z * [new branch] gh/guilhermeleobas/107/head -> origin/gh/guilhermeleobas/107/head 2025-12-04T08:54:02.3182147Z * [new branch] gh/guilhermeleobas/107/orig -> origin/gh/guilhermeleobas/107/orig 2025-12-04T08:54:02.3182237Z * [new branch] gh/guilhermeleobas/108/base -> origin/gh/guilhermeleobas/108/base 2025-12-04T08:54:02.3182325Z * [new branch] gh/guilhermeleobas/108/head -> origin/gh/guilhermeleobas/108/head 2025-12-04T08:54:02.3182413Z * [new branch] gh/guilhermeleobas/108/orig -> origin/gh/guilhermeleobas/108/orig 2025-12-04T08:54:02.3182501Z * [new branch] gh/guilhermeleobas/150/base -> origin/gh/guilhermeleobas/150/base 2025-12-04T08:54:02.3182591Z * [new branch] gh/guilhermeleobas/150/head -> origin/gh/guilhermeleobas/150/head 2025-12-04T08:54:02.3182718Z * [new 
branch] gh/guilhermeleobas/150/orig -> origin/gh/guilhermeleobas/150/orig 2025-12-04T08:54:02.3182806Z * [new branch] gh/guilhermeleobas/168/base -> origin/gh/guilhermeleobas/168/base 2025-12-04T08:54:02.3182896Z * [new branch] gh/guilhermeleobas/168/head -> origin/gh/guilhermeleobas/168/head 2025-12-04T08:54:02.3182987Z * [new branch] gh/guilhermeleobas/168/orig -> origin/gh/guilhermeleobas/168/orig 2025-12-04T08:54:02.3183076Z * [new branch] gh/guilhermeleobas/169/base -> origin/gh/guilhermeleobas/169/base 2025-12-04T08:54:02.3183170Z * [new branch] gh/guilhermeleobas/169/head -> origin/gh/guilhermeleobas/169/head 2025-12-04T08:54:02.3183260Z * [new branch] gh/guilhermeleobas/169/orig -> origin/gh/guilhermeleobas/169/orig 2025-12-04T08:54:02.3183352Z * [new branch] gh/guilhermeleobas/170/base -> origin/gh/guilhermeleobas/170/base 2025-12-04T08:54:02.3183444Z * [new branch] gh/guilhermeleobas/170/head -> origin/gh/guilhermeleobas/170/head 2025-12-04T08:54:02.3183533Z * [new branch] gh/guilhermeleobas/170/orig -> origin/gh/guilhermeleobas/170/orig 2025-12-04T08:54:02.3183626Z * [new branch] gh/guilhermeleobas/171/base -> origin/gh/guilhermeleobas/171/base 2025-12-04T08:54:02.3183750Z * [new branch] gh/guilhermeleobas/171/head -> origin/gh/guilhermeleobas/171/head 2025-12-04T08:54:02.3183840Z * [new branch] gh/guilhermeleobas/171/orig -> origin/gh/guilhermeleobas/171/orig 2025-12-04T08:54:02.3183933Z * [new branch] gh/guilhermeleobas/173/base -> origin/gh/guilhermeleobas/173/base 2025-12-04T08:54:02.3184023Z * [new branch] gh/guilhermeleobas/173/head -> origin/gh/guilhermeleobas/173/head 2025-12-04T08:54:02.3184112Z * [new branch] gh/guilhermeleobas/173/orig -> origin/gh/guilhermeleobas/173/orig 2025-12-04T08:54:02.3184206Z * [new branch] gh/guilhermeleobas/193/base -> origin/gh/guilhermeleobas/193/base 2025-12-04T08:54:02.3184295Z * [new branch] gh/guilhermeleobas/193/head -> origin/gh/guilhermeleobas/193/head 2025-12-04T08:54:02.3184383Z * [new branch] 
gh/guilhermeleobas/193/orig -> origin/gh/guilhermeleobas/193/orig 2025-12-04T08:54:02.3184475Z * [new branch] gh/guilhermeleobas/204/base -> origin/gh/guilhermeleobas/204/base 2025-12-04T08:54:02.3184563Z * [new branch] gh/guilhermeleobas/204/head -> origin/gh/guilhermeleobas/204/head 2025-12-04T08:54:02.3184652Z * [new branch] gh/guilhermeleobas/204/orig -> origin/gh/guilhermeleobas/204/orig 2025-12-04T08:54:02.3184739Z * [new branch] gh/guilhermeleobas/211/base -> origin/gh/guilhermeleobas/211/base 2025-12-04T08:54:02.3184825Z * [new branch] gh/guilhermeleobas/211/head -> origin/gh/guilhermeleobas/211/head 2025-12-04T08:54:02.3184914Z * [new branch] gh/guilhermeleobas/211/orig -> origin/gh/guilhermeleobas/211/orig 2025-12-04T08:54:02.3185005Z * [new branch] gh/guilhermeleobas/226/base -> origin/gh/guilhermeleobas/226/base 2025-12-04T08:54:02.3185093Z * [new branch] gh/guilhermeleobas/226/head -> origin/gh/guilhermeleobas/226/head 2025-12-04T08:54:02.3185183Z * [new branch] gh/guilhermeleobas/226/orig -> origin/gh/guilhermeleobas/226/orig 2025-12-04T08:54:02.3185271Z * [new branch] gh/guilhermeleobas/236/base -> origin/gh/guilhermeleobas/236/base 2025-12-04T08:54:02.3185359Z * [new branch] gh/guilhermeleobas/236/head -> origin/gh/guilhermeleobas/236/head 2025-12-04T08:54:02.3185449Z * [new branch] gh/guilhermeleobas/236/orig -> origin/gh/guilhermeleobas/236/orig 2025-12-04T08:54:02.3185536Z * [new branch] gh/guilhermeleobas/247/base -> origin/gh/guilhermeleobas/247/base 2025-12-04T08:54:02.3185626Z * [new branch] gh/guilhermeleobas/247/head -> origin/gh/guilhermeleobas/247/head 2025-12-04T08:54:02.3185751Z * [new branch] gh/guilhermeleobas/247/orig -> origin/gh/guilhermeleobas/247/orig 2025-12-04T08:54:02.3185838Z * [new branch] gh/guilhermeleobas/248/base -> origin/gh/guilhermeleobas/248/base 2025-12-04T08:54:02.3185925Z * [new branch] gh/guilhermeleobas/248/head -> origin/gh/guilhermeleobas/248/head 2025-12-04T08:54:02.3186014Z * [new branch] 
gh/guilhermeleobas/248/orig -> origin/gh/guilhermeleobas/248/orig 2025-12-04T08:54:02.3186101Z * [new branch] gh/guilhermeleobas/250/base -> origin/gh/guilhermeleobas/250/base 2025-12-04T08:54:02.3186189Z * [new branch] gh/guilhermeleobas/250/head -> origin/gh/guilhermeleobas/250/head 2025-12-04T08:54:02.3186276Z * [new branch] gh/guilhermeleobas/250/orig -> origin/gh/guilhermeleobas/250/orig 2025-12-04T08:54:02.3186363Z * [new branch] gh/guilhermeleobas/253/base -> origin/gh/guilhermeleobas/253/base 2025-12-04T08:54:02.3186454Z * [new branch] gh/guilhermeleobas/253/head -> origin/gh/guilhermeleobas/253/head 2025-12-04T08:54:02.3186541Z * [new branch] gh/guilhermeleobas/253/orig -> origin/gh/guilhermeleobas/253/orig 2025-12-04T08:54:02.3186628Z * [new branch] gh/guilhermeleobas/254/base -> origin/gh/guilhermeleobas/254/base 2025-12-04T08:54:02.3186760Z * [new branch] gh/guilhermeleobas/254/head -> origin/gh/guilhermeleobas/254/head 2025-12-04T08:54:02.3186849Z * [new branch] gh/guilhermeleobas/254/orig -> origin/gh/guilhermeleobas/254/orig 2025-12-04T08:54:02.3186936Z * [new branch] gh/guilhermeleobas/255/base -> origin/gh/guilhermeleobas/255/base 2025-12-04T08:54:02.3187026Z * [new branch] gh/guilhermeleobas/255/head -> origin/gh/guilhermeleobas/255/head 2025-12-04T08:54:02.3187113Z * [new branch] gh/guilhermeleobas/255/orig -> origin/gh/guilhermeleobas/255/orig 2025-12-04T08:54:02.3187202Z * [new branch] gh/guilhermeleobas/256/base -> origin/gh/guilhermeleobas/256/base 2025-12-04T08:54:02.3187290Z * [new branch] gh/guilhermeleobas/256/head -> origin/gh/guilhermeleobas/256/head 2025-12-04T08:54:02.3187379Z * [new branch] gh/guilhermeleobas/256/orig -> origin/gh/guilhermeleobas/256/orig 2025-12-04T08:54:02.3187469Z * [new branch] gh/guilhermeleobas/257/base -> origin/gh/guilhermeleobas/257/base 2025-12-04T08:54:02.3187558Z * [new branch] gh/guilhermeleobas/257/head -> origin/gh/guilhermeleobas/257/head 2025-12-04T08:54:02.3187645Z * [new branch] 
gh/guilhermeleobas/257/orig -> origin/gh/guilhermeleobas/257/orig 2025-12-04T08:54:02.3187735Z * [new branch] gh/guilhermeleobas/258/base -> origin/gh/guilhermeleobas/258/base 2025-12-04T08:54:02.3187821Z * [new branch] gh/guilhermeleobas/258/head -> origin/gh/guilhermeleobas/258/head 2025-12-04T08:54:02.3187910Z * [new branch] gh/guilhermeleobas/258/orig -> origin/gh/guilhermeleobas/258/orig 2025-12-04T08:54:02.3187999Z * [new branch] gh/guilhermeleobas/259/base -> origin/gh/guilhermeleobas/259/base 2025-12-04T08:54:02.3188086Z * [new branch] gh/guilhermeleobas/259/head -> origin/gh/guilhermeleobas/259/head 2025-12-04T08:54:02.3188174Z * [new branch] gh/guilhermeleobas/259/orig -> origin/gh/guilhermeleobas/259/orig 2025-12-04T08:54:02.3188264Z * [new branch] gh/guilhermeleobas/260/base -> origin/gh/guilhermeleobas/260/base 2025-12-04T08:54:02.3188351Z * [new branch] gh/guilhermeleobas/260/head -> origin/gh/guilhermeleobas/260/head 2025-12-04T08:54:02.3188439Z * [new branch] gh/guilhermeleobas/260/orig -> origin/gh/guilhermeleobas/260/orig 2025-12-04T08:54:02.3188528Z * [new branch] gh/guilhermeleobas/261/base -> origin/gh/guilhermeleobas/261/base 2025-12-04T08:54:02.3188641Z * [new branch] gh/guilhermeleobas/261/head -> origin/gh/guilhermeleobas/261/head 2025-12-04T08:54:02.3188729Z * [new branch] gh/guilhermeleobas/261/orig -> origin/gh/guilhermeleobas/261/orig 2025-12-04T08:54:02.3188818Z * [new branch] gh/guilhermeleobas/262/base -> origin/gh/guilhermeleobas/262/base 2025-12-04T08:54:02.3188907Z * [new branch] gh/guilhermeleobas/262/head -> origin/gh/guilhermeleobas/262/head 2025-12-04T08:54:02.3188994Z * [new branch] gh/guilhermeleobas/262/orig -> origin/gh/guilhermeleobas/262/orig 2025-12-04T08:54:02.3189081Z * [new branch] gh/guilhermeleobas/263/base -> origin/gh/guilhermeleobas/263/base 2025-12-04T08:54:02.3189169Z * [new branch] gh/guilhermeleobas/263/head -> origin/gh/guilhermeleobas/263/head 2025-12-04T08:54:02.3189256Z * [new branch] 
gh/guilhermeleobas/263/orig -> origin/gh/guilhermeleobas/263/orig 2025-12-04T08:54:02.3189345Z * [new branch] gh/guilhermeleobas/264/base -> origin/gh/guilhermeleobas/264/base 2025-12-04T08:54:02.3189432Z * [new branch] gh/guilhermeleobas/264/head -> origin/gh/guilhermeleobas/264/head 2025-12-04T08:54:02.3189520Z * [new branch] gh/guilhermeleobas/264/orig -> origin/gh/guilhermeleobas/264/orig 2025-12-04T08:54:02.3189639Z * [new branch] gh/guilhermeleobas/265/base -> origin/gh/guilhermeleobas/265/base 2025-12-04T08:54:02.3189727Z * [new branch] gh/guilhermeleobas/265/head -> origin/gh/guilhermeleobas/265/head 2025-12-04T08:54:02.3189817Z * [new branch] gh/guilhermeleobas/265/orig -> origin/gh/guilhermeleobas/265/orig 2025-12-04T08:54:02.3189903Z * [new branch] gh/guilhermeleobas/266/base -> origin/gh/guilhermeleobas/266/base 2025-12-04T08:54:02.3189991Z * [new branch] gh/guilhermeleobas/266/head -> origin/gh/guilhermeleobas/266/head 2025-12-04T08:54:02.3190081Z * [new branch] gh/guilhermeleobas/266/orig -> origin/gh/guilhermeleobas/266/orig 2025-12-04T08:54:02.3190224Z * [new branch] gh/guilhermeleobas/267/base -> origin/gh/guilhermeleobas/267/base 2025-12-04T08:54:02.3190313Z * [new branch] gh/guilhermeleobas/267/head -> origin/gh/guilhermeleobas/267/head 2025-12-04T08:54:02.3190406Z * [new branch] gh/guilhermeleobas/267/orig -> origin/gh/guilhermeleobas/267/orig 2025-12-04T08:54:02.3190490Z * [new branch] gh/hameerabbasi/1/base -> origin/gh/hameerabbasi/1/base 2025-12-04T08:54:02.3190570Z * [new branch] gh/hameerabbasi/1/head -> origin/gh/hameerabbasi/1/head 2025-12-04T08:54:02.3190648Z * [new branch] gh/hameerabbasi/2/base -> origin/gh/hameerabbasi/2/base 2025-12-04T08:54:02.3190723Z * [new branch] gh/hameerabbasi/2/head -> origin/gh/hameerabbasi/2/head 2025-12-04T08:54:02.3190799Z * [new branch] gh/hameerabbasi/2/orig -> origin/gh/hameerabbasi/2/orig 2025-12-04T08:54:02.3190876Z * [new branch] gh/hameerabbasi/3/base -> origin/gh/hameerabbasi/3/base 
2025-12-04T08:54:02.3190952Z * [new branch] gh/hameerabbasi/3/head -> origin/gh/hameerabbasi/3/head 2025-12-04T08:54:02.3191028Z * [new branch] gh/hameerabbasi/3/orig -> origin/gh/hameerabbasi/3/orig 2025-12-04T08:54:02.3191103Z * [new branch] gh/hameerabbasi/4/base -> origin/gh/hameerabbasi/4/base 2025-12-04T08:54:02.3191178Z * [new branch] gh/hameerabbasi/4/head -> origin/gh/hameerabbasi/4/head 2025-12-04T08:54:02.3191254Z * [new branch] gh/hameerabbasi/4/orig -> origin/gh/hameerabbasi/4/orig 2025-12-04T08:54:02.3191324Z * [new branch] gh/huydhn/1/next -> origin/gh/huydhn/1/next 2025-12-04T08:54:02.3191394Z * [new branch] gh/huydhn/2/next -> origin/gh/huydhn/2/next 2025-12-04T08:54:02.3191463Z * [new branch] gh/huydhn/3/next -> origin/gh/huydhn/3/next 2025-12-04T08:54:02.3191587Z * [new branch] gh/huydhn/4/next -> origin/gh/huydhn/4/next 2025-12-04T08:54:02.3191653Z * [new branch] gh/huydhn/5/next -> origin/gh/huydhn/5/next 2025-12-04T08:54:02.3191720Z * [new branch] gh/huydhn/6/next -> origin/gh/huydhn/6/next 2025-12-04T08:54:02.3191789Z * [new branch] gh/int3/97/base -> origin/gh/int3/97/base 2025-12-04T08:54:02.3191855Z * [new branch] gh/int3/97/head -> origin/gh/int3/97/head 2025-12-04T08:54:02.3191928Z * [new branch] gh/isuruf/101/base -> origin/gh/isuruf/101/base 2025-12-04T08:54:02.3191997Z * [new branch] gh/isuruf/101/head -> origin/gh/isuruf/101/head 2025-12-04T08:54:02.3192066Z * [new branch] gh/isuruf/146/base -> origin/gh/isuruf/146/base 2025-12-04T08:54:02.3192137Z * [new branch] gh/isuruf/146/head -> origin/gh/isuruf/146/head 2025-12-04T08:54:02.3192205Z * [new branch] gh/isuruf/146/orig -> origin/gh/isuruf/146/orig 2025-12-04T08:54:02.3192271Z * [new branch] gh/isuruf/158/base -> origin/gh/isuruf/158/base 2025-12-04T08:54:02.3192340Z * [new branch] gh/isuruf/158/head -> origin/gh/isuruf/158/head 2025-12-04T08:54:02.3192457Z * [new branch] gh/isuruf/159/base -> origin/gh/isuruf/159/base 2025-12-04T08:54:02.3192524Z * [new branch] gh/isuruf/159/head 
-> origin/gh/isuruf/159/head 2025-12-04T08:54:02.3192592Z * [new branch] gh/isuruf/160/base -> origin/gh/isuruf/160/base 2025-12-04T08:54:02.3192658Z * [new branch] gh/isuruf/160/head -> origin/gh/isuruf/160/head 2025-12-04T08:54:02.3192726Z * [new branch] gh/isuruf/160/orig -> origin/gh/isuruf/160/orig 2025-12-04T08:54:02.3192795Z * [new branch] gh/isuruf/81/base -> origin/gh/isuruf/81/base 2025-12-04T08:54:02.3192865Z * [new branch] gh/isuruf/81/head -> origin/gh/isuruf/81/head 2025-12-04T08:54:02.3192935Z * [new branch] gh/isuruf/81/orig -> origin/gh/isuruf/81/orig 2025-12-04T08:54:02.3193008Z * [new branch] gh/jamesjwu/176/base -> origin/gh/jamesjwu/176/base 2025-12-04T08:54:02.3193082Z * [new branch] gh/jamesjwu/176/head -> origin/gh/jamesjwu/176/head 2025-12-04T08:54:02.3193157Z * [new branch] gh/jamesjwu/176/orig -> origin/gh/jamesjwu/176/orig 2025-12-04T08:54:02.3193229Z * [new branch] gh/jamesjwu/187/base -> origin/gh/jamesjwu/187/base 2025-12-04T08:54:02.3193301Z * [new branch] gh/jamesjwu/187/head -> origin/gh/jamesjwu/187/head 2025-12-04T08:54:02.3193373Z * [new branch] gh/jamesjwu/187/orig -> origin/gh/jamesjwu/187/orig 2025-12-04T08:54:02.3193442Z * [new branch] gh/jamesjwu/196/base -> origin/gh/jamesjwu/196/base 2025-12-04T08:54:02.3193514Z * [new branch] gh/jamesjwu/196/head -> origin/gh/jamesjwu/196/head 2025-12-04T08:54:02.3193585Z * [new branch] gh/jamesjwu/196/orig -> origin/gh/jamesjwu/196/orig 2025-12-04T08:54:02.3193656Z * [new branch] gh/jamesjwu/198/base -> origin/gh/jamesjwu/198/base 2025-12-04T08:54:02.3193726Z * [new branch] gh/jamesjwu/198/head -> origin/gh/jamesjwu/198/head 2025-12-04T08:54:02.3193798Z * [new branch] gh/jamesjwu/198/orig -> origin/gh/jamesjwu/198/orig 2025-12-04T08:54:02.3193868Z * [new branch] gh/jamesjwu/207/base -> origin/gh/jamesjwu/207/base 2025-12-04T08:54:02.3193937Z * [new branch] gh/jamesjwu/207/head -> origin/gh/jamesjwu/207/head 2025-12-04T08:54:02.3194008Z * [new branch] gh/jamesjwu/207/orig -> 
origin/gh/jamesjwu/207/orig 2025-12-04T08:54:02.3194078Z * [new branch] gh/jamesjwu/208/base -> origin/gh/jamesjwu/208/base 2025-12-04T08:54:02.3194179Z * [new branch] gh/jamesjwu/208/head -> origin/gh/jamesjwu/208/head 2025-12-04T08:54:02.3194249Z * [new branch] gh/jamesjwu/208/orig -> origin/gh/jamesjwu/208/orig 2025-12-04T08:54:02.3194319Z * [new branch] gh/jamesjwu/52/base -> origin/gh/jamesjwu/52/base 2025-12-04T08:54:02.3194391Z * [new branch] gh/jamesjwu/52/head -> origin/gh/jamesjwu/52/head 2025-12-04T08:54:02.3194461Z * [new branch] gh/jamesjwu/53/base -> origin/gh/jamesjwu/53/base 2025-12-04T08:54:02.3194531Z * [new branch] gh/jamesjwu/53/head -> origin/gh/jamesjwu/53/head 2025-12-04T08:54:02.3194601Z * [new branch] gh/jamesjwu/54/base -> origin/gh/jamesjwu/54/base 2025-12-04T08:54:02.3194670Z * [new branch] gh/jamesjwu/54/head -> origin/gh/jamesjwu/54/head 2025-12-04T08:54:02.3194739Z * [new branch] gh/jamesjwu/55/base -> origin/gh/jamesjwu/55/base 2025-12-04T08:54:02.3194813Z * [new branch] gh/jamesjwu/55/head -> origin/gh/jamesjwu/55/head 2025-12-04T08:54:02.3194882Z * [new branch] gh/jamesjwu/56/base -> origin/gh/jamesjwu/56/base 2025-12-04T08:54:02.3194951Z * [new branch] gh/jamesjwu/56/head -> origin/gh/jamesjwu/56/head 2025-12-04T08:54:02.3195051Z * [new branch] gh/jamesjwu/57/base -> origin/gh/jamesjwu/57/base 2025-12-04T08:54:02.3195121Z * [new branch] gh/jamesjwu/57/head -> origin/gh/jamesjwu/57/head 2025-12-04T08:54:02.3195190Z * [new branch] gh/jamesjwu/58/base -> origin/gh/jamesjwu/58/base 2025-12-04T08:54:02.3195263Z * [new branch] gh/jamesjwu/58/head -> origin/gh/jamesjwu/58/head 2025-12-04T08:54:02.3195333Z * [new branch] gh/jamesjwu/59/base -> origin/gh/jamesjwu/59/base 2025-12-04T08:54:02.3195403Z * [new branch] gh/jamesjwu/59/head -> origin/gh/jamesjwu/59/head 2025-12-04T08:54:02.3195474Z * [new branch] gh/jamesjwu/60/base -> origin/gh/jamesjwu/60/base 2025-12-04T08:54:02.3195544Z * [new branch] gh/jamesjwu/60/head -> 
origin/gh/jamesjwu/60/head 2025-12-04T08:54:02.3195615Z * [new branch] gh/jamesjwu/61/base -> origin/gh/jamesjwu/61/base 2025-12-04T08:54:02.3195684Z * [new branch] gh/jamesjwu/61/head -> origin/gh/jamesjwu/61/head 2025-12-04T08:54:02.3195753Z * [new branch] gh/jamesjwu/62/base -> origin/gh/jamesjwu/62/base 2025-12-04T08:54:02.3195824Z * [new branch] gh/jamesjwu/62/head -> origin/gh/jamesjwu/62/head 2025-12-04T08:54:02.3195894Z * [new branch] gh/jamesjwu/63/base -> origin/gh/jamesjwu/63/base 2025-12-04T08:54:02.3195966Z * [new branch] gh/jamesjwu/63/head -> origin/gh/jamesjwu/63/head 2025-12-04T08:54:02.3196039Z * [new branch] gh/jamesjwu/64/base -> origin/gh/jamesjwu/64/base 2025-12-04T08:54:02.3196110Z * [new branch] gh/jamesjwu/64/head -> origin/gh/jamesjwu/64/head 2025-12-04T08:54:02.3196179Z * [new branch] gh/jamesjwu/65/base -> origin/gh/jamesjwu/65/base 2025-12-04T08:54:02.3196247Z * [new branch] gh/jamesjwu/65/head -> origin/gh/jamesjwu/65/head 2025-12-04T08:54:02.3196319Z * [new branch] gh/janeyx99/165/base -> origin/gh/janeyx99/165/base 2025-12-04T08:54:02.3196392Z * [new branch] gh/janeyx99/165/head -> origin/gh/janeyx99/165/head 2025-12-04T08:54:02.3196461Z * [new branch] gh/janeyx99/165/orig -> origin/gh/janeyx99/165/orig 2025-12-04T08:54:02.3196530Z * [new branch] gh/janeyx99/201/base -> origin/gh/janeyx99/201/base 2025-12-04T08:54:02.3196601Z * [new branch] gh/janeyx99/201/head -> origin/gh/janeyx99/201/head 2025-12-04T08:54:02.3196669Z * [new branch] gh/janeyx99/201/orig -> origin/gh/janeyx99/201/orig 2025-12-04T08:54:02.3196770Z * [new branch] gh/janeyx99/225/base -> origin/gh/janeyx99/225/base 2025-12-04T08:54:02.3196842Z * [new branch] gh/janeyx99/225/head -> origin/gh/janeyx99/225/head 2025-12-04T08:54:02.3196911Z * [new branch] gh/janeyx99/225/orig -> origin/gh/janeyx99/225/orig 2025-12-04T08:54:02.3196980Z * [new branch] gh/janeyx99/299/base -> origin/gh/janeyx99/299/base 2025-12-04T08:54:02.3197053Z * [new branch] gh/janeyx99/299/head -> 
origin/gh/janeyx99/299/head 2025-12-04T08:54:02.3197123Z * [new branch] gh/janeyx99/299/orig -> origin/gh/janeyx99/299/orig 2025-12-04T08:54:02.3197193Z * [new branch] gh/janeyx99/302/base -> origin/gh/janeyx99/302/base 2025-12-04T08:54:02.3197265Z * [new branch] gh/janeyx99/302/head -> origin/gh/janeyx99/302/head 2025-12-04T08:54:02.3197337Z * [new branch] gh/janeyx99/303/base -> origin/gh/janeyx99/303/base 2025-12-04T08:54:02.3197405Z * [new branch] gh/janeyx99/303/head -> origin/gh/janeyx99/303/head 2025-12-04T08:54:02.3197481Z * [new branch] gh/janeyx99/305/base -> origin/gh/janeyx99/305/base 2025-12-04T08:54:02.3197551Z * [new branch] gh/janeyx99/305/head -> origin/gh/janeyx99/305/head 2025-12-04T08:54:02.3197658Z * [new branch] gh/janeyx99/306/base -> origin/gh/janeyx99/306/base 2025-12-04T08:54:02.3197731Z * [new branch] gh/janeyx99/306/head -> origin/gh/janeyx99/306/head 2025-12-04T08:54:02.3197802Z * [new branch] gh/janeyx99/314/base -> origin/gh/janeyx99/314/base 2025-12-04T08:54:02.3197874Z * [new branch] gh/janeyx99/314/head -> origin/gh/janeyx99/314/head 2025-12-04T08:54:02.3197944Z * [new branch] gh/janeyx99/314/orig -> origin/gh/janeyx99/314/orig 2025-12-04T08:54:02.3198014Z * [new branch] gh/janeyx99/315/base -> origin/gh/janeyx99/315/base 2025-12-04T08:54:02.3198087Z * [new branch] gh/janeyx99/315/head -> origin/gh/janeyx99/315/head 2025-12-04T08:54:02.3198155Z * [new branch] gh/janeyx99/315/orig -> origin/gh/janeyx99/315/orig 2025-12-04T08:54:02.3198226Z * [new branch] gh/janeyx99/316/base -> origin/gh/janeyx99/316/base 2025-12-04T08:54:02.3198301Z * [new branch] gh/janeyx99/316/head -> origin/gh/janeyx99/316/head 2025-12-04T08:54:02.3198371Z * [new branch] gh/janeyx99/316/orig -> origin/gh/janeyx99/316/orig 2025-12-04T08:54:02.3198441Z * [new branch] gh/janeyx99/317/base -> origin/gh/janeyx99/317/base 2025-12-04T08:54:02.3198512Z * [new branch] gh/janeyx99/317/head -> origin/gh/janeyx99/317/head 2025-12-04T08:54:02.3198582Z * [new branch] 
gh/janeyx99/317/orig -> origin/gh/janeyx99/317/orig 2025-12-04T08:54:02.3198654Z * [new branch] gh/janeyx99/325/base -> origin/gh/janeyx99/325/base 2025-12-04T08:54:02.3198726Z * [new branch] gh/janeyx99/325/head -> origin/gh/janeyx99/325/head 2025-12-04T08:54:02.3198797Z * [new branch] gh/janeyx99/325/orig -> origin/gh/janeyx99/325/orig 2025-12-04T08:54:02.3198867Z * [new branch] gh/janeyx99/327/base -> origin/gh/janeyx99/327/base 2025-12-04T08:54:02.3198937Z * [new branch] gh/janeyx99/327/head -> origin/gh/janeyx99/327/head 2025-12-04T08:54:02.3199007Z * [new branch] gh/janeyx99/327/orig -> origin/gh/janeyx99/327/orig 2025-12-04T08:54:02.3199077Z * [new branch] gh/janeyx99/328/base -> origin/gh/janeyx99/328/base 2025-12-04T08:54:02.3199150Z * [new branch] gh/janeyx99/328/head -> origin/gh/janeyx99/328/head 2025-12-04T08:54:02.3199219Z * [new branch] gh/janeyx99/328/orig -> origin/gh/janeyx99/328/orig 2025-12-04T08:54:02.3199322Z * [new branch] gh/janeyx99/329/base -> origin/gh/janeyx99/329/base 2025-12-04T08:54:02.3199392Z * [new branch] gh/janeyx99/329/head -> origin/gh/janeyx99/329/head 2025-12-04T08:54:02.3199461Z * [new branch] gh/janeyx99/329/orig -> origin/gh/janeyx99/329/orig 2025-12-04T08:54:02.3199533Z * [new branch] gh/janeyx99/330/base -> origin/gh/janeyx99/330/base 2025-12-04T08:54:02.3199604Z * [new branch] gh/janeyx99/330/head -> origin/gh/janeyx99/330/head 2025-12-04T08:54:02.3199675Z * [new branch] gh/janeyx99/330/orig -> origin/gh/janeyx99/330/orig 2025-12-04T08:54:02.3199747Z * [new branch] gh/janeyx99/331/base -> origin/gh/janeyx99/331/base 2025-12-04T08:54:02.3199817Z * [new branch] gh/janeyx99/331/head -> origin/gh/janeyx99/331/head 2025-12-04T08:54:02.3199887Z * [new branch] gh/janeyx99/331/orig -> origin/gh/janeyx99/331/orig 2025-12-04T08:54:02.3199960Z * [new branch] gh/janeyx99/332/base -> origin/gh/janeyx99/332/base 2025-12-04T08:54:02.3200028Z * [new branch] gh/janeyx99/332/head -> origin/gh/janeyx99/332/head 
2025-12-04T08:54:02.3200149Z * [new branch] gh/janeyx99/332/orig -> origin/gh/janeyx99/332/orig 2025-12-04T08:54:02.3200263Z * [new branch] gh/janeyx99/333/base -> origin/gh/janeyx99/333/base 2025-12-04T08:54:02.3200333Z * [new branch] gh/janeyx99/333/head -> origin/gh/janeyx99/333/head 2025-12-04T08:54:02.3200403Z * [new branch] gh/janeyx99/333/orig -> origin/gh/janeyx99/333/orig 2025-12-04T08:54:02.3200475Z * [new branch] gh/janeyx99/88/base -> origin/gh/janeyx99/88/base 2025-12-04T08:54:02.3200545Z * [new branch] gh/janeyx99/88/head -> origin/gh/janeyx99/88/head 2025-12-04T08:54:02.3200613Z * [new branch] gh/janeyx99/88/orig -> origin/gh/janeyx99/88/orig 2025-12-04T08:54:02.3200685Z * [new branch] gh/jansel/360/base -> origin/gh/jansel/360/base 2025-12-04T08:54:02.3200753Z * [new branch] gh/jansel/360/head -> origin/gh/jansel/360/head 2025-12-04T08:54:02.3200822Z * [new branch] gh/jansel/451/base -> origin/gh/jansel/451/base 2025-12-04T08:54:02.3200892Z * [new branch] gh/jansel/451/head -> origin/gh/jansel/451/head 2025-12-04T08:54:02.3200959Z * [new branch] gh/jansel/451/orig -> origin/gh/jansel/451/orig 2025-12-04T08:54:02.3201029Z * [new branch] gh/jansel/462/base -> origin/gh/jansel/462/base 2025-12-04T08:54:02.3201097Z * [new branch] gh/jansel/462/head -> origin/gh/jansel/462/head 2025-12-04T08:54:02.3201165Z * [new branch] gh/jansel/462/orig -> origin/gh/jansel/462/orig 2025-12-04T08:54:02.3201234Z * [new branch] gh/jansel/533/base -> origin/gh/jansel/533/base 2025-12-04T08:54:02.3201303Z * [new branch] gh/jansel/533/head -> origin/gh/jansel/533/head 2025-12-04T08:54:02.3201372Z * [new branch] gh/jansel/533/orig -> origin/gh/jansel/533/orig 2025-12-04T08:54:02.3201441Z * [new branch] gh/jansel/552/base -> origin/gh/jansel/552/base 2025-12-04T08:54:02.3201510Z * [new branch] gh/jansel/552/head -> origin/gh/jansel/552/head 2025-12-04T08:54:02.3201578Z * [new branch] gh/jansel/552/orig -> origin/gh/jansel/552/orig 2025-12-04T08:54:02.3201650Z * [new branch] 
gh/jansel/553/base -> origin/gh/jansel/553/base 2025-12-04T08:54:02.3201718Z * [new branch] gh/jansel/553/head -> origin/gh/jansel/553/head 2025-12-04T08:54:02.3201785Z * [new branch] gh/jansel/553/orig -> origin/gh/jansel/553/orig 2025-12-04T08:54:02.3201853Z * [new branch] gh/jansel/554/base -> origin/gh/jansel/554/base 2025-12-04T08:54:02.3201961Z * [new branch] gh/jansel/554/head -> origin/gh/jansel/554/head 2025-12-04T08:54:02.3202028Z * [new branch] gh/jansel/554/orig -> origin/gh/jansel/554/orig 2025-12-04T08:54:02.3202098Z * [new branch] gh/jansel/555/base -> origin/gh/jansel/555/base 2025-12-04T08:54:02.3202166Z * [new branch] gh/jansel/555/head -> origin/gh/jansel/555/head 2025-12-04T08:54:02.3202235Z * [new branch] gh/jansel/555/orig -> origin/gh/jansel/555/orig 2025-12-04T08:54:02.3202306Z * [new branch] gh/jansel/556/base -> origin/gh/jansel/556/base 2025-12-04T08:54:02.3202375Z * [new branch] gh/jansel/556/head -> origin/gh/jansel/556/head 2025-12-04T08:54:02.3202444Z * [new branch] gh/jansel/556/orig -> origin/gh/jansel/556/orig 2025-12-04T08:54:02.3202515Z * [new branch] gh/jansel/557/base -> origin/gh/jansel/557/base 2025-12-04T08:54:02.3202584Z * [new branch] gh/jansel/557/head -> origin/gh/jansel/557/head 2025-12-04T08:54:02.3202652Z * [new branch] gh/jansel/557/orig -> origin/gh/jansel/557/orig 2025-12-04T08:54:02.3202721Z * [new branch] gh/jansel/558/base -> origin/gh/jansel/558/base 2025-12-04T08:54:02.3202788Z * [new branch] gh/jansel/558/head -> origin/gh/jansel/558/head 2025-12-04T08:54:02.3202895Z * [new branch] gh/jansel/558/orig -> origin/gh/jansel/558/orig 2025-12-04T08:54:02.3202965Z * [new branch] gh/jansel/559/base -> origin/gh/jansel/559/base 2025-12-04T08:54:02.3203033Z * [new branch] gh/jansel/559/head -> origin/gh/jansel/559/head 2025-12-04T08:54:02.3203102Z * [new branch] gh/jansel/559/orig -> origin/gh/jansel/559/orig 2025-12-04T08:54:02.3203169Z * [new branch] gh/jansel/560/base -> origin/gh/jansel/560/base 
2025-12-04T08:54:02.3203238Z * [new branch] gh/jansel/560/head -> origin/gh/jansel/560/head 2025-12-04T08:54:02.3203306Z * [new branch] gh/jansel/560/orig -> origin/gh/jansel/560/orig 2025-12-04T08:54:02.3203373Z * [new branch] gh/jansel/561/base -> origin/gh/jansel/561/base 2025-12-04T08:54:02.3203441Z * [new branch] gh/jansel/561/head -> origin/gh/jansel/561/head 2025-12-04T08:54:02.3203514Z * [new branch] gh/jansel/561/orig -> origin/gh/jansel/561/orig 2025-12-04T08:54:02.3203583Z * [new branch] gh/jansel/562/base -> origin/gh/jansel/562/base 2025-12-04T08:54:02.3203651Z * [new branch] gh/jansel/562/head -> origin/gh/jansel/562/head 2025-12-04T08:54:02.3203720Z * [new branch] gh/jansel/562/orig -> origin/gh/jansel/562/orig 2025-12-04T08:54:02.3203787Z * [new branch] gh/jansel/563/base -> origin/gh/jansel/563/base 2025-12-04T08:54:02.3203855Z * [new branch] gh/jansel/563/head -> origin/gh/jansel/563/head 2025-12-04T08:54:02.3203924Z * [new branch] gh/jansel/563/orig -> origin/gh/jansel/563/orig 2025-12-04T08:54:02.3203990Z * [new branch] gh/jansel/564/base -> origin/gh/jansel/564/base 2025-12-04T08:54:02.3204057Z * [new branch] gh/jansel/564/head -> origin/gh/jansel/564/head 2025-12-04T08:54:02.3204130Z * [new branch] gh/jansel/564/orig -> origin/gh/jansel/564/orig 2025-12-04T08:54:02.3204197Z * [new branch] gh/jansel/565/base -> origin/gh/jansel/565/base 2025-12-04T08:54:02.3204264Z * [new branch] gh/jansel/565/head -> origin/gh/jansel/565/head 2025-12-04T08:54:02.3204331Z * [new branch] gh/jansel/565/orig -> origin/gh/jansel/565/orig 2025-12-04T08:54:02.3204398Z * [new branch] gh/jansel/566/base -> origin/gh/jansel/566/base 2025-12-04T08:54:02.3204493Z * [new branch] gh/jansel/566/head -> origin/gh/jansel/566/head 2025-12-04T08:54:02.3204560Z * [new branch] gh/jansel/566/orig -> origin/gh/jansel/566/orig 2025-12-04T08:54:02.3204628Z * [new branch] gh/jansel/567/base -> origin/gh/jansel/567/base 2025-12-04T08:54:02.3204702Z * [new branch] gh/jansel/567/head -> 
origin/gh/jansel/567/head 2025-12-04T08:54:02.3204772Z * [new branch] gh/jansel/567/orig -> origin/gh/jansel/567/orig 2025-12-04T08:54:02.3204841Z * [new branch] gh/jansel/568/base -> origin/gh/jansel/568/base 2025-12-04T08:54:02.3204913Z * [new branch] gh/jansel/568/head -> origin/gh/jansel/568/head 2025-12-04T08:54:02.3204983Z * [new branch] gh/jansel/568/orig -> origin/gh/jansel/568/orig 2025-12-04T08:54:02.3205052Z * [new branch] gh/jansel/569/base -> origin/gh/jansel/569/base 2025-12-04T08:54:02.3205126Z * [new branch] gh/jansel/569/head -> origin/gh/jansel/569/head 2025-12-04T08:54:02.3205195Z * [new branch] gh/jansel/569/orig -> origin/gh/jansel/569/orig 2025-12-04T08:54:02.3205264Z * [new branch] gh/jansel/570/base -> origin/gh/jansel/570/base 2025-12-04T08:54:02.3205335Z * [new branch] gh/jansel/570/head -> origin/gh/jansel/570/head 2025-12-04T08:54:02.3205433Z * [new branch] gh/jansel/570/orig -> origin/gh/jansel/570/orig 2025-12-04T08:54:02.3205501Z * [new branch] gh/jansel/571/base -> origin/gh/jansel/571/base 2025-12-04T08:54:02.3205571Z * [new branch] gh/jansel/571/head -> origin/gh/jansel/571/head 2025-12-04T08:54:02.3205639Z * [new branch] gh/jansel/571/orig -> origin/gh/jansel/571/orig 2025-12-04T08:54:02.3205707Z * [new branch] gh/jansel/572/base -> origin/gh/jansel/572/base 2025-12-04T08:54:02.3205775Z * [new branch] gh/jansel/572/head -> origin/gh/jansel/572/head 2025-12-04T08:54:02.3205844Z * [new branch] gh/jansel/572/orig -> origin/gh/jansel/572/orig 2025-12-04T08:54:02.3205910Z * [new branch] gh/jansel/573/base -> origin/gh/jansel/573/base 2025-12-04T08:54:02.3205978Z * [new branch] gh/jansel/573/head -> origin/gh/jansel/573/head 2025-12-04T08:54:02.3206046Z * [new branch] gh/jansel/573/orig -> origin/gh/jansel/573/orig 2025-12-04T08:54:02.3206113Z * [new branch] gh/jansel/574/base -> origin/gh/jansel/574/base 2025-12-04T08:54:02.3206181Z * [new branch] gh/jansel/574/head -> origin/gh/jansel/574/head 2025-12-04T08:54:02.3206249Z * [new 
branch] gh/jansel/574/orig -> origin/gh/jansel/574/orig 2025-12-04T08:54:02.3206320Z * [new branch] gh/jansel/575/base -> origin/gh/jansel/575/base 2025-12-04T08:54:02.3206386Z * [new branch] gh/jansel/575/head -> origin/gh/jansel/575/head 2025-12-04T08:54:02.3206453Z * [new branch] gh/jansel/575/orig -> origin/gh/jansel/575/orig 2025-12-04T08:54:02.3206523Z * [new branch] gh/jansel/576/base -> origin/gh/jansel/576/base 2025-12-04T08:54:02.3206591Z * [new branch] gh/jansel/576/head -> origin/gh/jansel/576/head 2025-12-04T08:54:02.3206662Z * [new branch] gh/jansel/576/orig -> origin/gh/jansel/576/orig 2025-12-04T08:54:02.3206748Z * [new branch] gh/jbschlosser/247/base -> origin/gh/jbschlosser/247/base 2025-12-04T08:54:02.3206826Z * [new branch] gh/jbschlosser/247/head -> origin/gh/jbschlosser/247/head 2025-12-04T08:54:02.3206903Z * [new branch] gh/jbschlosser/247/orig -> origin/gh/jbschlosser/247/orig 2025-12-04T08:54:02.3206982Z * [new branch] gh/jbschlosser/250/base -> origin/gh/jbschlosser/250/base 2025-12-04T08:54:02.3207090Z * [new branch] gh/jbschlosser/250/head -> origin/gh/jbschlosser/250/head 2025-12-04T08:54:02.3207169Z * [new branch] gh/jbschlosser/250/orig -> origin/gh/jbschlosser/250/orig 2025-12-04T08:54:02.3207245Z * [new branch] gh/jerryzh168/1/base -> origin/gh/jerryzh168/1/base 2025-12-04T08:54:02.3207318Z * [new branch] gh/jerryzh168/1/head -> origin/gh/jerryzh168/1/head 2025-12-04T08:54:02.3207389Z * [new branch] gh/jerryzh168/1/orig -> origin/gh/jerryzh168/1/orig 2025-12-04T08:54:02.3207465Z * [new branch] gh/jiayisunx/59/base -> origin/gh/jiayisunx/59/base 2025-12-04T08:54:02.3207538Z * [new branch] gh/jiayisunx/59/head -> origin/gh/jiayisunx/59/head 2025-12-04T08:54:02.3207612Z * [new branch] gh/jiayisunx/59/orig -> origin/gh/jiayisunx/59/orig 2025-12-04T08:54:02.3207683Z * [new branch] gh/jiayisunx/61/base -> origin/gh/jiayisunx/61/base 2025-12-04T08:54:02.3207756Z * [new branch] gh/jiayisunx/61/head -> origin/gh/jiayisunx/61/head 
2025-12-04T08:54:02.3207830Z * [new branch] gh/jiayisunx/61/orig -> origin/gh/jiayisunx/61/orig 2025-12-04T08:54:02.3207900Z * [new branch] gh/jiayisunx/68/base -> origin/gh/jiayisunx/68/base 2025-12-04T08:54:02.3207970Z * [new branch] gh/jiayisunx/68/head -> origin/gh/jiayisunx/68/head 2025-12-04T08:54:02.3208071Z * [new branch] gh/jiayisunx/68/orig -> origin/gh/jiayisunx/68/orig 2025-12-04T08:54:02.3208143Z * [new branch] gh/jiayisunx/77/base -> origin/gh/jiayisunx/77/base 2025-12-04T08:54:02.3208214Z * [new branch] gh/jiayisunx/77/head -> origin/gh/jiayisunx/77/head 2025-12-04T08:54:02.3208287Z * [new branch] gh/jiayisunx/77/orig -> origin/gh/jiayisunx/77/orig 2025-12-04T08:54:02.3208359Z * [new branch] gh/jiayisunx/78/base -> origin/gh/jiayisunx/78/base 2025-12-04T08:54:02.3208434Z * [new branch] gh/jiayisunx/78/head -> origin/gh/jiayisunx/78/head 2025-12-04T08:54:02.3208506Z * [new branch] gh/jiayisunx/78/orig -> origin/gh/jiayisunx/78/orig 2025-12-04T08:54:02.3208577Z * [new branch] gh/jiayisunx/79/base -> origin/gh/jiayisunx/79/base 2025-12-04T08:54:02.3208647Z * [new branch] gh/jiayisunx/79/head -> origin/gh/jiayisunx/79/head 2025-12-04T08:54:02.3208721Z * [new branch] gh/jiayisunx/79/orig -> origin/gh/jiayisunx/79/orig 2025-12-04T08:54:02.3208791Z * [new branch] gh/jiayisunx/82/base -> origin/gh/jiayisunx/82/base 2025-12-04T08:54:02.3208862Z * [new branch] gh/jiayisunx/82/head -> origin/gh/jiayisunx/82/head 2025-12-04T08:54:02.3208937Z * [new branch] gh/jiayisunx/82/orig -> origin/gh/jiayisunx/82/orig 2025-12-04T08:54:02.3209009Z * [new branch] gh/jiayisunx/83/base -> origin/gh/jiayisunx/83/base 2025-12-04T08:54:02.3209083Z * [new branch] gh/jiayisunx/83/head -> origin/gh/jiayisunx/83/head 2025-12-04T08:54:02.3209154Z * [new branch] gh/jiayisunx/83/orig -> origin/gh/jiayisunx/83/orig 2025-12-04T08:54:02.3209226Z * [new branch] gh/jiayisunx/84/base -> origin/gh/jiayisunx/84/base 2025-12-04T08:54:02.3209301Z * [new branch] gh/jiayisunx/84/head -> 
origin/gh/jiayisunx/84/head 2025-12-04T08:54:02.3209374Z * [new branch] gh/jiayisunx/84/orig -> origin/gh/jiayisunx/84/orig 2025-12-04T08:54:02.3209445Z * [new branch] gh/jiayisunx/85/base -> origin/gh/jiayisunx/85/base 2025-12-04T08:54:02.3209517Z * [new branch] gh/jiayisunx/85/head -> origin/gh/jiayisunx/85/head 2025-12-04T08:54:02.3209587Z * [new branch] gh/jiayisunx/85/orig -> origin/gh/jiayisunx/85/orig 2025-12-04T08:54:02.3209658Z * [new branch] gh/jiayisunx/86/base -> origin/gh/jiayisunx/86/base 2025-12-04T08:54:02.3209762Z * [new branch] gh/jiayisunx/86/head -> origin/gh/jiayisunx/86/head 2025-12-04T08:54:02.3209834Z * [new branch] gh/jiayisunx/86/orig -> origin/gh/jiayisunx/86/orig 2025-12-04T08:54:02.3209905Z * [new branch] gh/jiayisunx/87/base -> origin/gh/jiayisunx/87/base 2025-12-04T08:54:02.3209980Z * [new branch] gh/jiayisunx/87/head -> origin/gh/jiayisunx/87/head 2025-12-04T08:54:02.3210051Z * [new branch] gh/jiayisunx/87/orig -> origin/gh/jiayisunx/87/orig 2025-12-04T08:54:02.3210180Z * [new branch] gh/jiayisunx/88/base -> origin/gh/jiayisunx/88/base 2025-12-04T08:54:02.3210255Z * [new branch] gh/jiayisunx/88/head -> origin/gh/jiayisunx/88/head 2025-12-04T08:54:02.3210327Z * [new branch] gh/jiayisunx/88/orig -> origin/gh/jiayisunx/88/orig 2025-12-04T08:54:02.3210399Z * [new branch] gh/jiayisunx/89/base -> origin/gh/jiayisunx/89/base 2025-12-04T08:54:02.3210474Z * [new branch] gh/jiayisunx/89/head -> origin/gh/jiayisunx/89/head 2025-12-04T08:54:02.3210546Z * [new branch] gh/jiayisunx/89/orig -> origin/gh/jiayisunx/89/orig 2025-12-04T08:54:02.3210618Z * [new branch] gh/jiayisunx/90/base -> origin/gh/jiayisunx/90/base 2025-12-04T08:54:02.3210744Z * [new branch] gh/jiayisunx/90/head -> origin/gh/jiayisunx/90/head 2025-12-04T08:54:02.3210817Z * [new branch] gh/jiayisunx/90/orig -> origin/gh/jiayisunx/90/orig 2025-12-04T08:54:02.3210899Z * [new branch] gh/jjwu@meta.com/1/base -> origin/gh/jjwu@meta.com/1/base 2025-12-04T08:54:02.3210978Z * [new branch] 
gh/jjwu@meta.com/1/head -> origin/gh/jjwu@meta.com/1/head 2025-12-04T08:54:02.3211049Z * [new branch] gh/jturney/1/base -> origin/gh/jturney/1/base 2025-12-04T08:54:02.3211122Z * [new branch] gh/jturney/1/head -> origin/gh/jturney/1/head 2025-12-04T08:54:02.3211193Z * [new branch] gh/jturney/1/orig -> origin/gh/jturney/1/orig 2025-12-04T08:54:02.3211261Z * [new branch] gh/jturney/2/base -> origin/gh/jturney/2/base 2025-12-04T08:54:02.3211332Z * [new branch] gh/jturney/2/head -> origin/gh/jturney/2/head 2025-12-04T08:54:02.3211401Z * [new branch] gh/jturney/2/orig -> origin/gh/jturney/2/orig 2025-12-04T08:54:02.3211478Z * [new branch] gh/karthickai/10/base -> origin/gh/karthickai/10/base 2025-12-04T08:54:02.3211556Z * [new branch] gh/karthickai/10/head -> origin/gh/karthickai/10/head 2025-12-04T08:54:02.3211629Z * [new branch] gh/karthickai/10/orig -> origin/gh/karthickai/10/orig 2025-12-04T08:54:02.3211702Z * [new branch] gh/karthickai/11/base -> origin/gh/karthickai/11/base 2025-12-04T08:54:02.3211779Z * [new branch] gh/karthickai/11/head -> origin/gh/karthickai/11/head 2025-12-04T08:54:02.3211853Z * [new branch] gh/karthickai/11/orig -> origin/gh/karthickai/11/orig 2025-12-04T08:54:02.3211927Z * [new branch] gh/karthickai/12/base -> origin/gh/karthickai/12/base 2025-12-04T08:54:02.3212002Z * [new branch] gh/karthickai/12/head -> origin/gh/karthickai/12/head 2025-12-04T08:54:02.3212076Z * [new branch] gh/karthickai/12/orig -> origin/gh/karthickai/12/orig 2025-12-04T08:54:02.3212149Z * [new branch] gh/karthickai/13/base -> origin/gh/karthickai/13/base 2025-12-04T08:54:02.3212224Z * [new branch] gh/karthickai/13/head -> origin/gh/karthickai/13/head 2025-12-04T08:54:02.3212298Z * [new branch] gh/karthickai/13/orig -> origin/gh/karthickai/13/orig 2025-12-04T08:54:02.3212373Z * [new branch] gh/karthickai/14/base -> origin/gh/karthickai/14/base 2025-12-04T08:54:02.3212446Z * [new branch] gh/karthickai/14/head -> origin/gh/karthickai/14/head 2025-12-04T08:54:02.3212562Z 
* [new branch] gh/karthickai/14/orig -> origin/gh/karthickai/14/orig 2025-12-04T08:54:02.3212638Z * [new branch] gh/karthickai/15/base -> origin/gh/karthickai/15/base 2025-12-04T08:54:02.3212711Z * [new branch] gh/karthickai/15/head -> origin/gh/karthickai/15/head 2025-12-04T08:54:02.3212784Z * [new branch] gh/karthickai/15/orig -> origin/gh/karthickai/15/orig 2025-12-04T08:54:02.3212858Z * [new branch] gh/karthickai/16/base -> origin/gh/karthickai/16/base 2025-12-04T08:54:02.3212929Z * [new branch] gh/karthickai/16/head -> origin/gh/karthickai/16/head 2025-12-04T08:54:02.3213003Z * [new branch] gh/karthickai/16/orig -> origin/gh/karthickai/16/orig 2025-12-04T08:54:02.3213079Z * [new branch] gh/karthickai/17/base -> origin/gh/karthickai/17/base 2025-12-04T08:54:02.3213155Z * [new branch] gh/karthickai/17/head -> origin/gh/karthickai/17/head 2025-12-04T08:54:02.3213230Z * [new branch] gh/karthickai/17/orig -> origin/gh/karthickai/17/orig 2025-12-04T08:54:02.3213305Z * [new branch] gh/karthickai/18/base -> origin/gh/karthickai/18/base 2025-12-04T08:54:02.3213379Z * [new branch] gh/karthickai/18/head -> origin/gh/karthickai/18/head 2025-12-04T08:54:02.3213478Z * [new branch] gh/karthickai/18/orig -> origin/gh/karthickai/18/orig 2025-12-04T08:54:02.3213552Z * [new branch] gh/karthickai/19/base -> origin/gh/karthickai/19/base 2025-12-04T08:54:02.3213624Z * [new branch] gh/karthickai/19/head -> origin/gh/karthickai/19/head 2025-12-04T08:54:02.3213697Z * [new branch] gh/karthickai/19/orig -> origin/gh/karthickai/19/orig 2025-12-04T08:54:02.3213773Z * [new branch] gh/karthickai/20/base -> origin/gh/karthickai/20/base 2025-12-04T08:54:02.3213847Z * [new branch] gh/karthickai/20/head -> origin/gh/karthickai/20/head 2025-12-04T08:54:02.3213920Z * [new branch] gh/karthickai/20/orig -> origin/gh/karthickai/20/orig 2025-12-04T08:54:02.3213996Z * [new branch] gh/karthickai/21/base -> origin/gh/karthickai/21/base 2025-12-04T08:54:02.3214068Z * [new branch] gh/karthickai/21/head -> 
origin/gh/karthickai/21/head 2025-12-04T08:54:02.3214144Z * [new branch] gh/karthickai/21/orig -> origin/gh/karthickai/21/orig 2025-12-04T08:54:02.3214217Z * [new branch] gh/karthickai/22/base -> origin/gh/karthickai/22/base 2025-12-04T08:54:02.3214290Z * [new branch] gh/karthickai/22/head -> origin/gh/karthickai/22/head 2025-12-04T08:54:02.3214365Z * [new branch] gh/karthickai/22/orig -> origin/gh/karthickai/22/orig 2025-12-04T08:54:02.3214438Z * [new branch] gh/karthickai/23/base -> origin/gh/karthickai/23/base 2025-12-04T08:54:02.3214513Z * [new branch] gh/karthickai/23/head -> origin/gh/karthickai/23/head 2025-12-04T08:54:02.3214587Z * [new branch] gh/karthickai/23/orig -> origin/gh/karthickai/23/orig 2025-12-04T08:54:02.3214661Z * [new branch] gh/karthickai/24/base -> origin/gh/karthickai/24/base 2025-12-04T08:54:02.3214734Z * [new branch] gh/karthickai/24/head -> origin/gh/karthickai/24/head 2025-12-04T08:54:02.3214810Z * [new branch] gh/karthickai/24/orig -> origin/gh/karthickai/24/orig 2025-12-04T08:54:02.3214882Z * [new branch] gh/karthickai/25/base -> origin/gh/karthickai/25/base 2025-12-04T08:54:02.3214956Z * [new branch] gh/karthickai/25/head -> origin/gh/karthickai/25/head 2025-12-04T08:54:02.3215028Z * [new branch] gh/karthickai/25/orig -> origin/gh/karthickai/25/orig 2025-12-04T08:54:02.3215100Z * [new branch] gh/karthickai/26/base -> origin/gh/karthickai/26/base 2025-12-04T08:54:02.3215200Z * [new branch] gh/karthickai/26/head -> origin/gh/karthickai/26/head 2025-12-04T08:54:02.3215274Z * [new branch] gh/karthickai/26/orig -> origin/gh/karthickai/26/orig 2025-12-04T08:54:02.3215347Z * [new branch] gh/karthickai/6/base -> origin/gh/karthickai/6/base 2025-12-04T08:54:02.3215422Z * [new branch] gh/karthickai/6/head -> origin/gh/karthickai/6/head 2025-12-04T08:54:02.3215498Z * [new branch] gh/karthickai/6/orig -> origin/gh/karthickai/6/orig 2025-12-04T08:54:02.3215566Z * [new branch] gh/krocki/1/base -> origin/gh/krocki/1/base 
2025-12-04T08:54:02.3215637Z * [new branch] gh/krocki/1/head -> origin/gh/krocki/1/head 2025-12-04T08:54:02.3215705Z * [new branch] gh/krocki/1/orig -> origin/gh/krocki/1/orig 2025-12-04T08:54:02.3215772Z * [new branch] gh/krocki/2/base -> origin/gh/krocki/2/base 2025-12-04T08:54:02.3215845Z * [new branch] gh/krocki/2/head -> origin/gh/krocki/2/head 2025-12-04T08:54:02.3215911Z * [new branch] gh/krocki/2/orig -> origin/gh/krocki/2/orig 2025-12-04T08:54:02.3215991Z * [new branch] gh/kurtamohler/60/base -> origin/gh/kurtamohler/60/base 2025-12-04T08:54:02.3216108Z * [new branch] gh/kurtamohler/60/head -> origin/gh/kurtamohler/60/head 2025-12-04T08:54:02.3216185Z * [new branch] gh/kurtamohler/60/orig -> origin/gh/kurtamohler/60/orig 2025-12-04T08:54:02.3216260Z * [new branch] gh/kurtamohler/61/base -> origin/gh/kurtamohler/61/base 2025-12-04T08:54:02.3216340Z * [new branch] gh/kurtamohler/61/head -> origin/gh/kurtamohler/61/head 2025-12-04T08:54:02.3216415Z * [new branch] gh/kurtamohler/61/orig -> origin/gh/kurtamohler/61/orig 2025-12-04T08:54:02.3216489Z * [new branch] gh/kurtamohler/62/base -> origin/gh/kurtamohler/62/base 2025-12-04T08:54:02.3216568Z * [new branch] gh/kurtamohler/62/head -> origin/gh/kurtamohler/62/head 2025-12-04T08:54:02.3216644Z * [new branch] gh/kurtamohler/62/orig -> origin/gh/kurtamohler/62/orig 2025-12-04T08:54:02.3216719Z * [new branch] gh/kurtamohler/63/base -> origin/gh/kurtamohler/63/base 2025-12-04T08:54:02.3216796Z * [new branch] gh/kurtamohler/63/head -> origin/gh/kurtamohler/63/head 2025-12-04T08:54:02.3216870Z * [new branch] gh/kurtamohler/63/orig -> origin/gh/kurtamohler/63/orig 2025-12-04T08:54:02.3216945Z * [new branch] gh/kurtamohler/64/base -> origin/gh/kurtamohler/64/base 2025-12-04T08:54:02.3217020Z * [new branch] gh/kurtamohler/64/head -> origin/gh/kurtamohler/64/head 2025-12-04T08:54:02.3217095Z * [new branch] gh/kurtamohler/64/orig -> origin/gh/kurtamohler/64/orig 2025-12-04T08:54:02.3217169Z * [new branch] 
gh/kurtamohler/65/base -> origin/gh/kurtamohler/65/base 2025-12-04T08:54:02.3217246Z * [new branch] gh/kurtamohler/65/head -> origin/gh/kurtamohler/65/head 2025-12-04T08:54:02.3217321Z * [new branch] gh/kurtamohler/65/orig -> origin/gh/kurtamohler/65/orig 2025-12-04T08:54:02.3217397Z * [new branch] gh/kurtamohler/66/base -> origin/gh/kurtamohler/66/base 2025-12-04T08:54:02.3217473Z * [new branch] gh/kurtamohler/66/head -> origin/gh/kurtamohler/66/head 2025-12-04T08:54:02.3217548Z * [new branch] gh/kurtamohler/66/orig -> origin/gh/kurtamohler/66/orig 2025-12-04T08:54:02.3217625Z * [new branch] gh/kurtamohler/67/base -> origin/gh/kurtamohler/67/base 2025-12-04T08:54:02.3217699Z * [new branch] gh/kurtamohler/67/head -> origin/gh/kurtamohler/67/head 2025-12-04T08:54:02.3217774Z * [new branch] gh/kurtamohler/67/orig -> origin/gh/kurtamohler/67/orig 2025-12-04T08:54:02.3217894Z * [new branch] gh/kwen2501/130/base -> origin/gh/kwen2501/130/base 2025-12-04T08:54:02.3217964Z * [new branch] gh/kwen2501/130/head -> origin/gh/kwen2501/130/head 2025-12-04T08:54:02.3218038Z * [new branch] gh/kwen2501/130/orig -> origin/gh/kwen2501/130/orig 2025-12-04T08:54:02.3218106Z * [new branch] gh/kwen2501/170/base -> origin/gh/kwen2501/170/base 2025-12-04T08:54:02.3218176Z * [new branch] gh/kwen2501/170/head -> origin/gh/kwen2501/170/head 2025-12-04T08:54:02.3218246Z * [new branch] gh/kwen2501/187/base -> origin/gh/kwen2501/187/base 2025-12-04T08:54:02.3218314Z * [new branch] gh/kwen2501/187/head -> origin/gh/kwen2501/187/head 2025-12-04T08:54:02.3218382Z * [new branch] gh/kwen2501/187/orig -> origin/gh/kwen2501/187/orig 2025-12-04T08:54:02.3218452Z * [new branch] gh/kwen2501/188/base -> origin/gh/kwen2501/188/base 2025-12-04T08:54:02.3218522Z * [new branch] gh/kwen2501/188/head -> origin/gh/kwen2501/188/head 2025-12-04T08:54:02.3218590Z * [new branch] gh/kwen2501/188/orig -> origin/gh/kwen2501/188/orig 2025-12-04T08:54:02.3218659Z * [new branch] gh/kwen2501/211/base -> 
origin/gh/kwen2501/211/base 2025-12-04T08:54:02.3218727Z * [new branch] gh/kwen2501/211/head -> origin/gh/kwen2501/211/head 2025-12-04T08:54:02.3218823Z * [new branch] gh/kwen2501/224/base -> origin/gh/kwen2501/224/base 2025-12-04T08:54:02.3218894Z * [new branch] gh/kwen2501/224/head -> origin/gh/kwen2501/224/head 2025-12-04T08:54:02.3218962Z * [new branch] gh/kwen2501/224/orig -> origin/gh/kwen2501/224/orig 2025-12-04T08:54:02.3219030Z * [new branch] gh/kwen2501/228/base -> origin/gh/kwen2501/228/base 2025-12-04T08:54:02.3219100Z * [new branch] gh/kwen2501/228/head -> origin/gh/kwen2501/228/head 2025-12-04T08:54:02.3219170Z * [new branch] gh/kwen2501/228/orig -> origin/gh/kwen2501/228/orig 2025-12-04T08:54:02.3219237Z * [new branch] gh/kwen2501/234/base -> origin/gh/kwen2501/234/base 2025-12-04T08:54:02.3219307Z * [new branch] gh/kwen2501/234/head -> origin/gh/kwen2501/234/head 2025-12-04T08:54:02.3219376Z * [new branch] gh/kwen2501/234/orig -> origin/gh/kwen2501/234/orig 2025-12-04T08:54:02.3219445Z * [new branch] gh/kwen2501/235/base -> origin/gh/kwen2501/235/base 2025-12-04T08:54:02.3219513Z * [new branch] gh/kwen2501/235/head -> origin/gh/kwen2501/235/head 2025-12-04T08:54:02.3219581Z * [new branch] gh/kwen2501/235/orig -> origin/gh/kwen2501/235/orig 2025-12-04T08:54:02.3219652Z * [new branch] gh/kwen2501/236/base -> origin/gh/kwen2501/236/base 2025-12-04T08:54:02.3219720Z * [new branch] gh/kwen2501/236/head -> origin/gh/kwen2501/236/head 2025-12-04T08:54:02.3219789Z * [new branch] gh/kwen2501/236/orig -> origin/gh/kwen2501/236/orig 2025-12-04T08:54:02.3219858Z * [new branch] gh/kwen2501/237/base -> origin/gh/kwen2501/237/base 2025-12-04T08:54:02.3219926Z * [new branch] gh/kwen2501/237/head -> origin/gh/kwen2501/237/head 2025-12-04T08:54:02.3219995Z * [new branch] gh/kwen2501/237/orig -> origin/gh/kwen2501/237/orig 2025-12-04T08:54:02.3220064Z * [new branch] gh/kwen2501/238/base -> origin/gh/kwen2501/238/base 2025-12-04T08:54:02.3220179Z * [new branch] 
gh/kwen2501/238/head -> origin/gh/kwen2501/238/head 2025-12-04T08:54:02.3220248Z * [new branch] gh/kwen2501/238/orig -> origin/gh/kwen2501/238/orig 2025-12-04T08:54:02.3220317Z * [new branch] gh/kwen2501/240/base -> origin/gh/kwen2501/240/base 2025-12-04T08:54:02.3220384Z * [new branch] gh/kwen2501/240/head -> origin/gh/kwen2501/240/head 2025-12-04T08:54:02.3220500Z * [new branch] gh/kwen2501/240/orig -> origin/gh/kwen2501/240/orig 2025-12-04T08:54:02.3220569Z * [new branch] gh/kwen2501/241/base -> origin/gh/kwen2501/241/base 2025-12-04T08:54:02.3220638Z * [new branch] gh/kwen2501/241/head -> origin/gh/kwen2501/241/head 2025-12-04T08:54:02.3220708Z * [new branch] gh/kwen2501/241/orig -> origin/gh/kwen2501/241/orig 2025-12-04T08:54:02.3220778Z * [new branch] gh/kwen2501/247/base -> origin/gh/kwen2501/247/base 2025-12-04T08:54:02.3220845Z * [new branch] gh/kwen2501/247/head -> origin/gh/kwen2501/247/head 2025-12-04T08:54:02.3220914Z * [new branch] gh/kwen2501/247/orig -> origin/gh/kwen2501/247/orig 2025-12-04T08:54:02.3220982Z * [new branch] gh/kwen2501/252/base -> origin/gh/kwen2501/252/base 2025-12-04T08:54:02.3221051Z * [new branch] gh/kwen2501/252/head -> origin/gh/kwen2501/252/head 2025-12-04T08:54:02.3221123Z * [new branch] gh/kwen2501/252/orig -> origin/gh/kwen2501/252/orig 2025-12-04T08:54:02.3221193Z * [new branch] gh/kwen2501/259/base -> origin/gh/kwen2501/259/base 2025-12-04T08:54:02.3221261Z * [new branch] gh/kwen2501/259/head -> origin/gh/kwen2501/259/head 2025-12-04T08:54:02.3221390Z * [new branch] gh/kwen2501/259/orig -> origin/gh/kwen2501/259/orig 2025-12-04T08:54:02.3221460Z * [new branch] gh/kwen2501/260/base -> origin/gh/kwen2501/260/base 2025-12-04T08:54:02.3221528Z * [new branch] gh/kwen2501/260/head -> origin/gh/kwen2501/260/head 2025-12-04T08:54:02.3221597Z * [new branch] gh/kwen2501/260/orig -> origin/gh/kwen2501/260/orig 2025-12-04T08:54:02.3221665Z * [new branch] gh/kwen2501/268/base -> origin/gh/kwen2501/268/base 
2025-12-04T08:54:02.3221734Z * [new branch] gh/kwen2501/268/head -> origin/gh/kwen2501/268/head 2025-12-04T08:54:02.3221805Z * [new branch] gh/kwen2501/268/orig -> origin/gh/kwen2501/268/orig 2025-12-04T08:54:02.3221874Z * [new branch] gh/kwen2501/269/base -> origin/gh/kwen2501/269/base 2025-12-04T08:54:02.3221945Z * [new branch] gh/kwen2501/269/head -> origin/gh/kwen2501/269/head 2025-12-04T08:54:02.3222016Z * [new branch] gh/kwen2501/269/orig -> origin/gh/kwen2501/269/orig 2025-12-04T08:54:02.3222085Z * [new branch] gh/kwen2501/270/base -> origin/gh/kwen2501/270/base 2025-12-04T08:54:02.3222153Z * [new branch] gh/kwen2501/270/head -> origin/gh/kwen2501/270/head 2025-12-04T08:54:02.3222222Z * [new branch] gh/kwen2501/270/orig -> origin/gh/kwen2501/270/orig 2025-12-04T08:54:02.3222290Z * [new branch] gh/kwen2501/271/base -> origin/gh/kwen2501/271/base 2025-12-04T08:54:02.3222359Z * [new branch] gh/kwen2501/271/head -> origin/gh/kwen2501/271/head 2025-12-04T08:54:02.3222428Z * [new branch] gh/kwen2501/271/orig -> origin/gh/kwen2501/271/orig 2025-12-04T08:54:02.3222496Z * [new branch] gh/kwen2501/274/base -> origin/gh/kwen2501/274/base 2025-12-04T08:54:02.3222565Z * [new branch] gh/kwen2501/274/head -> origin/gh/kwen2501/274/head 2025-12-04T08:54:02.3222634Z * [new branch] gh/kwen2501/274/orig -> origin/gh/kwen2501/274/orig 2025-12-04T08:54:02.3222701Z * [new branch] gh/kwen2501/275/base -> origin/gh/kwen2501/275/base 2025-12-04T08:54:02.3222770Z * [new branch] gh/kwen2501/275/head -> origin/gh/kwen2501/275/head 2025-12-04T08:54:02.3222839Z * [new branch] gh/kwen2501/275/orig -> origin/gh/kwen2501/275/orig 2025-12-04T08:54:02.3222906Z * [new branch] gh/kwen2501/276/base -> origin/gh/kwen2501/276/base 2025-12-04T08:54:02.3222975Z * [new branch] gh/kwen2501/276/head -> origin/gh/kwen2501/276/head 2025-12-04T08:54:02.3223074Z * [new branch] gh/kwen2501/276/orig -> origin/gh/kwen2501/276/orig 2025-12-04T08:54:02.3223142Z * [new branch] gh/kwen2501/277/base -> 
origin/gh/kwen2501/277/base 2025-12-04T08:54:02.3223212Z * [new branch] gh/kwen2501/277/head -> origin/gh/kwen2501/277/head 2025-12-04T08:54:02.3223281Z * [new branch] gh/kwen2501/277/orig -> origin/gh/kwen2501/277/orig 2025-12-04T08:54:02.3223349Z * [new branch] gh/kwen2501/278/base -> origin/gh/kwen2501/278/base 2025-12-04T08:54:02.3223418Z * [new branch] gh/kwen2501/278/head -> origin/gh/kwen2501/278/head 2025-12-04T08:54:02.3223486Z * [new branch] gh/kwen2501/278/orig -> origin/gh/kwen2501/278/orig 2025-12-04T08:54:02.3223554Z * [new branch] gh/kwen2501/279/base -> origin/gh/kwen2501/279/base 2025-12-04T08:54:02.3223624Z * [new branch] gh/kwen2501/279/head -> origin/gh/kwen2501/279/head 2025-12-04T08:54:02.3223692Z * [new branch] gh/kwen2501/279/orig -> origin/gh/kwen2501/279/orig 2025-12-04T08:54:02.3223760Z * [new branch] gh/kwen2501/280/base -> origin/gh/kwen2501/280/base 2025-12-04T08:54:02.3223830Z * [new branch] gh/kwen2501/280/head -> origin/gh/kwen2501/280/head 2025-12-04T08:54:02.3223922Z * [new branch] gh/kwen2501/280/orig -> origin/gh/kwen2501/280/orig 2025-12-04T08:54:02.3223992Z * [new branch] gh/kwen2501/281/base -> origin/gh/kwen2501/281/base 2025-12-04T08:54:02.3224061Z * [new branch] gh/kwen2501/281/head -> origin/gh/kwen2501/281/head 2025-12-04T08:54:02.3224129Z * [new branch] gh/kwen2501/281/orig -> origin/gh/kwen2501/281/orig 2025-12-04T08:54:02.3224198Z * [new branch] gh/kwen2501/282/base -> origin/gh/kwen2501/282/base 2025-12-04T08:54:02.3224268Z * [new branch] gh/kwen2501/282/head -> origin/gh/kwen2501/282/head 2025-12-04T08:54:02.3224336Z * [new branch] gh/kwen2501/282/orig -> origin/gh/kwen2501/282/orig 2025-12-04T08:54:02.3224406Z * [new branch] gh/kwen2501/283/base -> origin/gh/kwen2501/283/base 2025-12-04T08:54:02.3224474Z * [new branch] gh/kwen2501/283/head -> origin/gh/kwen2501/283/head 2025-12-04T08:54:02.3224543Z * [new branch] gh/kwen2501/283/orig -> origin/gh/kwen2501/283/orig 2025-12-04T08:54:02.3224612Z * [new branch] 
gh/kwen2501/284/base -> origin/gh/kwen2501/284/base 2025-12-04T08:54:02.3224679Z * [new branch] gh/kwen2501/284/head -> origin/gh/kwen2501/284/head 2025-12-04T08:54:02.3224747Z * [new branch] gh/kwen2501/284/orig -> origin/gh/kwen2501/284/orig 2025-12-04T08:54:02.3224817Z * [new branch] gh/kwen2501/285/base -> origin/gh/kwen2501/285/base 2025-12-04T08:54:02.3224886Z * [new branch] gh/kwen2501/285/head -> origin/gh/kwen2501/285/head 2025-12-04T08:54:02.3224955Z * [new branch] gh/kwen2501/285/orig -> origin/gh/kwen2501/285/orig 2025-12-04T08:54:02.3225024Z * [new branch] gh/kwen2501/286/base -> origin/gh/kwen2501/286/base 2025-12-04T08:54:02.3225092Z * [new branch] gh/kwen2501/286/head -> origin/gh/kwen2501/286/head 2025-12-04T08:54:02.3225162Z * [new branch] gh/kwen2501/286/orig -> origin/gh/kwen2501/286/orig 2025-12-04T08:54:02.3225231Z * [new branch] gh/kwen2501/287/base -> origin/gh/kwen2501/287/base 2025-12-04T08:54:02.3225298Z * [new branch] gh/kwen2501/287/head -> origin/gh/kwen2501/287/head 2025-12-04T08:54:02.3225367Z * [new branch] gh/kwen2501/287/orig -> origin/gh/kwen2501/287/orig 2025-12-04T08:54:02.3225436Z * [new branch] gh/kwen2501/288/base -> origin/gh/kwen2501/288/base 2025-12-04T08:54:02.3225531Z * [new branch] gh/kwen2501/288/head -> origin/gh/kwen2501/288/head 2025-12-04T08:54:02.3225601Z * [new branch] gh/kwen2501/288/orig -> origin/gh/kwen2501/288/orig 2025-12-04T08:54:02.3225676Z * [new branch] gh/laithsakka/251/base -> origin/gh/laithsakka/251/base 2025-12-04T08:54:02.3225753Z * [new branch] gh/laithsakka/251/head -> origin/gh/laithsakka/251/head 2025-12-04T08:54:02.3225830Z * [new branch] gh/laithsakka/251/orig -> origin/gh/laithsakka/251/orig 2025-12-04T08:54:02.3225903Z * [new branch] gh/laithsakka/276/base -> origin/gh/laithsakka/276/base 2025-12-04T08:54:02.3225977Z * [new branch] gh/laithsakka/276/head -> origin/gh/laithsakka/276/head 2025-12-04T08:54:02.3226051Z * [new branch] gh/laithsakka/276/orig -> origin/gh/laithsakka/276/orig 
2025-12-04T08:54:02.3226125Z * [new branch] gh/laithsakka/28/base -> origin/gh/laithsakka/28/base 2025-12-04T08:54:02.3226199Z * [new branch] gh/laithsakka/29/base -> origin/gh/laithsakka/29/base 2025-12-04T08:54:02.3226272Z * [new branch] gh/laithsakka/30/base -> origin/gh/laithsakka/30/base 2025-12-04T08:54:02.3226344Z * [new branch] gh/laithsakka/30/head -> origin/gh/laithsakka/30/head 2025-12-04T08:54:02.3226455Z * [new branch] gh/laithsakka/31/base -> origin/gh/laithsakka/31/base 2025-12-04T08:54:02.3226529Z * [new branch] gh/laithsakka/31/head -> origin/gh/laithsakka/31/head 2025-12-04T08:54:02.3226603Z * [new branch] gh/laithsakka/313/base -> origin/gh/laithsakka/313/base 2025-12-04T08:54:02.3226677Z * [new branch] gh/laithsakka/313/head -> origin/gh/laithsakka/313/head 2025-12-04T08:54:02.3226751Z * [new branch] gh/laithsakka/313/orig -> origin/gh/laithsakka/313/orig 2025-12-04T08:54:02.3226824Z * [new branch] gh/laithsakka/316/base -> origin/gh/laithsakka/316/base 2025-12-04T08:54:02.3226901Z * [new branch] gh/laithsakka/316/head -> origin/gh/laithsakka/316/head 2025-12-04T08:54:02.3226973Z * [new branch] gh/laithsakka/316/orig -> origin/gh/laithsakka/316/orig 2025-12-04T08:54:02.3227046Z * [new branch] gh/laithsakka/317/base -> origin/gh/laithsakka/317/base 2025-12-04T08:54:02.3227120Z * [new branch] gh/laithsakka/317/head -> origin/gh/laithsakka/317/head 2025-12-04T08:54:02.3227193Z * [new branch] gh/laithsakka/317/orig -> origin/gh/laithsakka/317/orig 2025-12-04T08:54:02.3227266Z * [new branch] gh/laithsakka/319/base -> origin/gh/laithsakka/319/base 2025-12-04T08:54:02.3227340Z * [new branch] gh/laithsakka/319/head -> origin/gh/laithsakka/319/head 2025-12-04T08:54:02.3227411Z * [new branch] gh/laithsakka/319/orig -> origin/gh/laithsakka/319/orig 2025-12-04T08:54:02.3227485Z * [new branch] gh/laithsakka/32/base -> origin/gh/laithsakka/32/base 2025-12-04T08:54:02.3227562Z * [new branch] gh/laithsakka/32/head -> origin/gh/laithsakka/32/head 
2025-12-04T08:54:02.3227634Z * [new branch] gh/laithsakka/320/base -> origin/gh/laithsakka/320/base 2025-12-04T08:54:02.3227707Z * [new branch] gh/laithsakka/320/head -> origin/gh/laithsakka/320/head 2025-12-04T08:54:02.3227781Z * [new branch] gh/laithsakka/320/orig -> origin/gh/laithsakka/320/orig 2025-12-04T08:54:02.3227853Z * [new branch] gh/laithsakka/321/base -> origin/gh/laithsakka/321/base 2025-12-04T08:54:02.3227926Z * [new branch] gh/laithsakka/321/head -> origin/gh/laithsakka/321/head 2025-12-04T08:54:02.3227999Z * [new branch] gh/laithsakka/321/orig -> origin/gh/laithsakka/321/orig 2025-12-04T08:54:02.3228071Z * [new branch] gh/laithsakka/322/base -> origin/gh/laithsakka/322/base 2025-12-04T08:54:02.3228181Z * [new branch] gh/laithsakka/322/head -> origin/gh/laithsakka/322/head 2025-12-04T08:54:02.3228255Z * [new branch] gh/laithsakka/322/orig -> origin/gh/laithsakka/322/orig 2025-12-04T08:54:02.3228328Z * [new branch] gh/laithsakka/323/base -> origin/gh/laithsakka/323/base 2025-12-04T08:54:02.3228401Z * [new branch] gh/laithsakka/323/head -> origin/gh/laithsakka/323/head 2025-12-04T08:54:02.3228476Z * [new branch] gh/laithsakka/323/orig -> origin/gh/laithsakka/323/orig 2025-12-04T08:54:02.3228550Z * [new branch] gh/laithsakka/324/base -> origin/gh/laithsakka/324/base 2025-12-04T08:54:02.3228624Z * [new branch] gh/laithsakka/324/head -> origin/gh/laithsakka/324/head 2025-12-04T08:54:02.3228697Z * [new branch] gh/laithsakka/324/orig -> origin/gh/laithsakka/324/orig 2025-12-04T08:54:02.3228769Z * [new branch] gh/laithsakka/325/base -> origin/gh/laithsakka/325/base 2025-12-04T08:54:02.3228847Z * [new branch] gh/laithsakka/325/head -> origin/gh/laithsakka/325/head 2025-12-04T08:54:02.3228919Z * [new branch] gh/laithsakka/325/orig -> origin/gh/laithsakka/325/orig 2025-12-04T08:54:02.3228992Z * [new branch] gh/laithsakka/326/base -> origin/gh/laithsakka/326/base 2025-12-04T08:54:02.3229094Z * [new branch] gh/laithsakka/326/head -> origin/gh/laithsakka/326/head 
2025-12-04T08:54:02.3229168Z * [new branch] gh/laithsakka/326/orig -> origin/gh/laithsakka/326/orig 2025-12-04T08:54:02.3229243Z * [new branch] gh/laithsakka/327/base -> origin/gh/laithsakka/327/base 2025-12-04T08:54:02.3229317Z * [new branch] gh/laithsakka/327/head -> origin/gh/laithsakka/327/head 2025-12-04T08:54:02.3229389Z * [new branch] gh/laithsakka/327/orig -> origin/gh/laithsakka/327/orig 2025-12-04T08:54:02.3229461Z * [new branch] gh/laithsakka/328/base -> origin/gh/laithsakka/328/base 2025-12-04T08:54:02.3229539Z * [new branch] gh/laithsakka/328/head -> origin/gh/laithsakka/328/head 2025-12-04T08:54:02.3229611Z * [new branch] gh/laithsakka/328/orig -> origin/gh/laithsakka/328/orig 2025-12-04T08:54:02.3229681Z * [new branch] gh/liangel/4/base -> origin/gh/liangel/4/base 2025-12-04T08:54:02.3229752Z * [new branch] gh/liangel/4/head -> origin/gh/liangel/4/head 2025-12-04T08:54:02.3229820Z * [new branch] gh/liangel/4/orig -> origin/gh/liangel/4/orig 2025-12-04T08:54:02.3229895Z * [new branch] gh/lucaskabela/1/base -> origin/gh/lucaskabela/1/base 2025-12-04T08:54:02.3229972Z * [new branch] gh/lucaskabela/1/head -> origin/gh/lucaskabela/1/head 2025-12-04T08:54:02.3230036Z * [new branch] gh/lw/4/base -> origin/gh/lw/4/base 2025-12-04T08:54:02.3230128Z * [new branch] gh/lw/4/head -> origin/gh/lw/4/head 2025-12-04T08:54:02.3230193Z * [new branch] gh/lw/4/orig -> origin/gh/lw/4/orig 2025-12-04T08:54:02.3230254Z * [new branch] gh/lw/5/base -> origin/gh/lw/5/base 2025-12-04T08:54:02.3230316Z * [new branch] gh/lw/5/head -> origin/gh/lw/5/head 2025-12-04T08:54:02.3230378Z * [new branch] gh/lw/5/orig -> origin/gh/lw/5/orig 2025-12-04T08:54:02.3230439Z * [new branch] gh/lw/6/base -> origin/gh/lw/6/base 2025-12-04T08:54:02.3230501Z * [new branch] gh/lw/6/head -> origin/gh/lw/6/head 2025-12-04T08:54:02.3230562Z * [new branch] gh/lw/6/orig -> origin/gh/lw/6/orig 2025-12-04T08:54:02.3230630Z * [new branch] gh/malfet/14/base -> origin/gh/malfet/14/base 
2025-12-04T08:54:02.3230702Z * [new branch] gh/malfet/417/base -> origin/gh/malfet/417/base 2025-12-04T08:54:02.3230822Z * [new branch] gh/malfet/417/head -> origin/gh/malfet/417/head 2025-12-04T08:54:02.3230890Z * [new branch] gh/malfet/417/orig -> origin/gh/malfet/417/orig 2025-12-04T08:54:02.3230959Z * [new branch] gh/malfet/506/base -> origin/gh/malfet/506/base 2025-12-04T08:54:02.3231025Z * [new branch] gh/malfet/506/head -> origin/gh/malfet/506/head 2025-12-04T08:54:02.3231092Z * [new branch] gh/malfet/506/orig -> origin/gh/malfet/506/orig 2025-12-04T08:54:02.3231160Z * [new branch] gh/malfet/517/base -> origin/gh/malfet/517/base 2025-12-04T08:54:02.3231227Z * [new branch] gh/malfet/517/head -> origin/gh/malfet/517/head 2025-12-04T08:54:02.3231294Z * [new branch] gh/malfet/528/base -> origin/gh/malfet/528/base 2025-12-04T08:54:02.3231362Z * [new branch] gh/malfet/528/head -> origin/gh/malfet/528/head 2025-12-04T08:54:02.3231430Z * [new branch] gh/malfet/528/orig -> origin/gh/malfet/528/orig 2025-12-04T08:54:02.3231497Z * [new branch] gh/malfet/537/base -> origin/gh/malfet/537/base 2025-12-04T08:54:02.3231565Z * [new branch] gh/malfet/537/head -> origin/gh/malfet/537/head 2025-12-04T08:54:02.3231632Z * [new branch] gh/malfet/537/orig -> origin/gh/malfet/537/orig 2025-12-04T08:54:02.3231738Z * [new branch] gh/malfet/546/base -> origin/gh/malfet/546/base 2025-12-04T08:54:02.3231807Z * [new branch] gh/malfet/546/head -> origin/gh/malfet/546/head 2025-12-04T08:54:02.3231874Z * [new branch] gh/malfet/546/orig -> origin/gh/malfet/546/orig 2025-12-04T08:54:02.3231943Z * [new branch] gh/malfet/565/base -> origin/gh/malfet/565/base 2025-12-04T08:54:02.3232010Z * [new branch] gh/malfet/565/head -> origin/gh/malfet/565/head 2025-12-04T08:54:02.3232078Z * [new branch] gh/malfet/565/orig -> origin/gh/malfet/565/orig 2025-12-04T08:54:02.3232146Z * [new branch] gh/malfet/575/base -> origin/gh/malfet/575/base 2025-12-04T08:54:02.3232213Z * [new branch] gh/malfet/575/head -> 
origin/gh/malfet/575/head 2025-12-04T08:54:02.3232279Z * [new branch] gh/malfet/575/orig -> origin/gh/malfet/575/orig 2025-12-04T08:54:02.3232348Z * [new branch] gh/malfet/580/base -> origin/gh/malfet/580/base 2025-12-04T08:54:02.3232416Z * [new branch] gh/malfet/580/head -> origin/gh/malfet/580/head 2025-12-04T08:54:02.3232482Z * [new branch] gh/malfet/580/orig -> origin/gh/malfet/580/orig 2025-12-04T08:54:02.3232550Z * [new branch] gh/malfet/581/base -> origin/gh/malfet/581/base 2025-12-04T08:54:02.3232616Z * [new branch] gh/malfet/581/head -> origin/gh/malfet/581/head 2025-12-04T08:54:02.3232684Z * [new branch] gh/malfet/581/orig -> origin/gh/malfet/581/orig 2025-12-04T08:54:02.3232752Z * [new branch] gh/malfet/583/base -> origin/gh/malfet/583/base 2025-12-04T08:54:02.3232821Z * [new branch] gh/malfet/583/head -> origin/gh/malfet/583/head 2025-12-04T08:54:02.3232888Z * [new branch] gh/malfet/583/orig -> origin/gh/malfet/583/orig 2025-12-04T08:54:02.3232956Z * [new branch] gh/malfet/586/base -> origin/gh/malfet/586/base 2025-12-04T08:54:02.3233023Z * [new branch] gh/malfet/586/head -> origin/gh/malfet/586/head 2025-12-04T08:54:02.3233090Z * [new branch] gh/malfet/586/orig -> origin/gh/malfet/586/orig 2025-12-04T08:54:02.3233159Z * [new branch] gh/malfet/587/base -> origin/gh/malfet/587/base 2025-12-04T08:54:02.3233226Z * [new branch] gh/malfet/587/head -> origin/gh/malfet/587/head 2025-12-04T08:54:02.3233292Z * [new branch] gh/malfet/587/orig -> origin/gh/malfet/587/orig 2025-12-04T08:54:02.3233386Z * [new branch] gh/malfet/588/base -> origin/gh/malfet/588/base 2025-12-04T08:54:02.3233452Z * [new branch] gh/malfet/588/head -> origin/gh/malfet/588/head 2025-12-04T08:54:02.3233520Z * [new branch] gh/malfet/588/orig -> origin/gh/malfet/588/orig 2025-12-04T08:54:02.3233588Z * [new branch] gh/malfet/589/base -> origin/gh/malfet/589/base 2025-12-04T08:54:02.3233655Z * [new branch] gh/malfet/589/head -> origin/gh/malfet/589/head 2025-12-04T08:54:02.3233723Z * [new 
branch] gh/malfet/589/orig -> origin/gh/malfet/589/orig 2025-12-04T08:54:02.3233789Z * [new branch] gh/malfet/590/base -> origin/gh/malfet/590/base 2025-12-04T08:54:02.3233855Z * [new branch] gh/malfet/590/head -> origin/gh/malfet/590/head 2025-12-04T08:54:02.3233923Z * [new branch] gh/malfet/590/orig -> origin/gh/malfet/590/orig 2025-12-04T08:54:02.3233990Z * [new branch] gh/malfet/591/base -> origin/gh/malfet/591/base 2025-12-04T08:54:02.3234057Z * [new branch] gh/malfet/591/head -> origin/gh/malfet/591/head 2025-12-04T08:54:02.3234125Z * [new branch] gh/malfet/591/orig -> origin/gh/malfet/591/orig 2025-12-04T08:54:02.3234219Z * [new branch] gh/malfet/592/base -> origin/gh/malfet/592/base 2025-12-04T08:54:02.3234286Z * [new branch] gh/malfet/592/head -> origin/gh/malfet/592/head 2025-12-04T08:54:02.3234354Z * [new branch] gh/malfet/592/orig -> origin/gh/malfet/592/orig 2025-12-04T08:54:02.3234421Z * [new branch] gh/malfet/593/base -> origin/gh/malfet/593/base 2025-12-04T08:54:02.3234488Z * [new branch] gh/malfet/593/head -> origin/gh/malfet/593/head 2025-12-04T08:54:02.3234556Z * [new branch] gh/malfet/593/orig -> origin/gh/malfet/593/orig 2025-12-04T08:54:02.3234625Z * [new branch] gh/malfet/594/base -> origin/gh/malfet/594/base 2025-12-04T08:54:02.3234691Z * [new branch] gh/malfet/594/head -> origin/gh/malfet/594/head 2025-12-04T08:54:02.3234759Z * [new branch] gh/malfet/594/orig -> origin/gh/malfet/594/orig 2025-12-04T08:54:02.3234826Z * [new branch] gh/malfet/595/base -> origin/gh/malfet/595/base 2025-12-04T08:54:02.3234893Z * [new branch] gh/malfet/595/head -> origin/gh/malfet/595/head 2025-12-04T08:54:02.3234960Z * [new branch] gh/malfet/595/orig -> origin/gh/malfet/595/orig 2025-12-04T08:54:02.3235027Z * [new branch] gh/malfet/596/base -> origin/gh/malfet/596/base 2025-12-04T08:54:02.3235094Z * [new branch] gh/malfet/596/head -> origin/gh/malfet/596/head 2025-12-04T08:54:02.3235162Z * [new branch] gh/malfet/596/orig -> origin/gh/malfet/596/orig 
2025-12-04T08:54:02.3235230Z * [new branch] gh/malfet/597/base -> origin/gh/malfet/597/base 2025-12-04T08:54:02.3235298Z * [new branch] gh/malfet/597/head -> origin/gh/malfet/597/head 2025-12-04T08:54:02.3235365Z * [new branch] gh/malfet/597/orig -> origin/gh/malfet/597/orig 2025-12-04T08:54:02.3235432Z * [new branch] gh/malfet/598/base -> origin/gh/malfet/598/base 2025-12-04T08:54:02.3235500Z * [new branch] gh/malfet/598/head -> origin/gh/malfet/598/head 2025-12-04T08:54:02.3235567Z * [new branch] gh/malfet/598/orig -> origin/gh/malfet/598/orig 2025-12-04T08:54:02.3235633Z * [new branch] gh/malfet/599/base -> origin/gh/malfet/599/base 2025-12-04T08:54:02.3235700Z * [new branch] gh/malfet/599/head -> origin/gh/malfet/599/head 2025-12-04T08:54:02.3235766Z * [new branch] gh/malfet/599/orig -> origin/gh/malfet/599/orig 2025-12-04T08:54:02.3235859Z * [new branch] gh/malfet/600/base -> origin/gh/malfet/600/base 2025-12-04T08:54:02.3235928Z * [new branch] gh/malfet/600/head -> origin/gh/malfet/600/head 2025-12-04T08:54:02.3235994Z * [new branch] gh/malfet/600/orig -> origin/gh/malfet/600/orig 2025-12-04T08:54:02.3236062Z * [new branch] gh/malfet/601/base -> origin/gh/malfet/601/base 2025-12-04T08:54:02.3236129Z * [new branch] gh/malfet/601/head -> origin/gh/malfet/601/head 2025-12-04T08:54:02.3236196Z * [new branch] gh/malfet/601/orig -> origin/gh/malfet/601/orig 2025-12-04T08:54:02.3236263Z * [new branch] gh/malfet/602/base -> origin/gh/malfet/602/base 2025-12-04T08:54:02.3236332Z * [new branch] gh/malfet/602/head -> origin/gh/malfet/602/head 2025-12-04T08:54:02.3236398Z * [new branch] gh/malfet/602/orig -> origin/gh/malfet/602/orig 2025-12-04T08:54:02.3236466Z * [new branch] gh/malfet/603/base -> origin/gh/malfet/603/base 2025-12-04T08:54:02.3236533Z * [new branch] gh/malfet/603/head -> origin/gh/malfet/603/head 2025-12-04T08:54:02.3236600Z * [new branch] gh/malfet/603/orig -> origin/gh/malfet/603/orig 2025-12-04T08:54:02.3236703Z * [new branch] gh/malfet/604/base -> 
origin/gh/malfet/604/base 2025-12-04T08:54:02.3236773Z * [new branch] gh/malfet/604/head -> origin/gh/malfet/604/head 2025-12-04T08:54:02.3236839Z * [new branch] gh/malfet/604/orig -> origin/gh/malfet/604/orig 2025-12-04T08:54:02.3236907Z * [new branch] gh/malfet/605/base -> origin/gh/malfet/605/base 2025-12-04T08:54:02.3236973Z * [new branch] gh/malfet/605/head -> origin/gh/malfet/605/head 2025-12-04T08:54:02.3237040Z * [new branch] gh/malfet/605/orig -> origin/gh/malfet/605/orig 2025-12-04T08:54:02.3237110Z * [new branch] gh/malfet/606/base -> origin/gh/malfet/606/base 2025-12-04T08:54:02.3237177Z * [new branch] gh/malfet/606/head -> origin/gh/malfet/606/head 2025-12-04T08:54:02.3237243Z * [new branch] gh/malfet/606/orig -> origin/gh/malfet/606/orig 2025-12-04T08:54:02.3237312Z * [new branch] gh/malfet/607/base -> origin/gh/malfet/607/base 2025-12-04T08:54:02.3237379Z * [new branch] gh/malfet/607/head -> origin/gh/malfet/607/head 2025-12-04T08:54:02.3237445Z * [new branch] gh/malfet/607/orig -> origin/gh/malfet/607/orig 2025-12-04T08:54:02.3237513Z * [new branch] gh/malfet/608/base -> origin/gh/malfet/608/base 2025-12-04T08:54:02.3237580Z * [new branch] gh/malfet/608/head -> origin/gh/malfet/608/head 2025-12-04T08:54:02.3237646Z * [new branch] gh/malfet/608/orig -> origin/gh/malfet/608/orig 2025-12-04T08:54:02.3237716Z * [new branch] gh/malfet/609/base -> origin/gh/malfet/609/base 2025-12-04T08:54:02.3237783Z * [new branch] gh/malfet/609/head -> origin/gh/malfet/609/head 2025-12-04T08:54:02.3237851Z * [new branch] gh/malfet/609/orig -> origin/gh/malfet/609/orig 2025-12-04T08:54:02.3237922Z * [new branch] gh/malfet/610/base -> origin/gh/malfet/610/base 2025-12-04T08:54:02.3237990Z * [new branch] gh/malfet/610/head -> origin/gh/malfet/610/head 2025-12-04T08:54:02.3238058Z * [new branch] gh/malfet/610/orig -> origin/gh/malfet/610/orig 2025-12-04T08:54:02.3238126Z * [new branch] gh/malfet/611/base -> origin/gh/malfet/611/base 2025-12-04T08:54:02.3238194Z * [new 
branch] gh/malfet/611/head -> origin/gh/malfet/611/head 2025-12-04T08:54:02.3238263Z * [new branch] gh/malfet/611/orig -> origin/gh/malfet/611/orig 2025-12-04T08:54:02.3238359Z * [new branch] gh/malfet/612/base -> origin/gh/malfet/612/base 2025-12-04T08:54:02.3238427Z * [new branch] gh/malfet/612/head -> origin/gh/malfet/612/head 2025-12-04T08:54:02.3238495Z * [new branch] gh/malfet/612/orig -> origin/gh/malfet/612/orig 2025-12-04T08:54:02.3238565Z * [new branch] gh/malfet/64/base -> origin/gh/malfet/64/base 2025-12-04T08:54:02.3238634Z * [new branch] gh/malfet/64/head -> origin/gh/malfet/64/head 2025-12-04T08:54:02.3238726Z * [new branch] gh/manuelcandales/11/base -> origin/gh/manuelcandales/11/base 2025-12-04T08:54:02.3238813Z * [new branch] gh/manuelcandales/11/head -> origin/gh/manuelcandales/11/head 2025-12-04T08:54:02.3238896Z * [new branch] gh/manuelcandales/11/orig -> origin/gh/manuelcandales/11/orig 2025-12-04T08:54:02.3238965Z * [new branch] gh/markkm/1/base -> origin/gh/markkm/1/base 2025-12-04T08:54:02.3239038Z * [new branch] gh/masnesral/1/base -> origin/gh/masnesral/1/base 2025-12-04T08:54:02.3239111Z * [new branch] gh/masnesral/1/head -> origin/gh/masnesral/1/head 2025-12-04T08:54:02.3239184Z * [new branch] gh/masnesral/1/orig -> origin/gh/masnesral/1/orig 2025-12-04T08:54:02.3239282Z * [new branch] gh/mhorowitz/0/base -> origin/gh/mhorowitz/0/base 2025-12-04T08:54:02.3239353Z * [new branch] gh/mhorowitz/0/head -> origin/gh/mhorowitz/0/head 2025-12-04T08:54:02.3239426Z * [new branch] gh/mhorowitz/1/base -> origin/gh/mhorowitz/1/base 2025-12-04T08:54:02.3239497Z * [new branch] gh/mhorowitz/1/head -> origin/gh/mhorowitz/1/head 2025-12-04T08:54:02.3239570Z * [new branch] gh/mhorowitz/2/base -> origin/gh/mhorowitz/2/base 2025-12-04T08:54:02.3239642Z * [new branch] gh/mhorowitz/2/head -> origin/gh/mhorowitz/2/head 2025-12-04T08:54:02.3239712Z * [new branch] gh/mhorowitz/3/base -> origin/gh/mhorowitz/3/base 2025-12-04T08:54:02.3239782Z * [new branch] 
gh/mhorowitz/3/head -> origin/gh/mhorowitz/3/head 2025-12-04T08:54:02.3239855Z * [new branch] gh/mhorowitz/4/base -> origin/gh/mhorowitz/4/base 2025-12-04T08:54:02.3239926Z * [new branch] gh/mhorowitz/4/head -> origin/gh/mhorowitz/4/head 2025-12-04T08:54:02.3239997Z * [new branch] gh/mhorowitz/5/base -> origin/gh/mhorowitz/5/base 2025-12-04T08:54:02.3240070Z * [new branch] gh/mhorowitz/5/head -> origin/gh/mhorowitz/5/head 2025-12-04T08:54:02.3240220Z * [new branch] gh/mhorowitz/6/base -> origin/gh/mhorowitz/6/base 2025-12-04T08:54:02.3240295Z * [new branch] gh/mhorowitz/6/head -> origin/gh/mhorowitz/6/head 2025-12-04T08:54:02.3240398Z * [new branch] gh/mikaylagawarecki/234/base -> origin/gh/mikaylagawarecki/234/base 2025-12-04T08:54:02.3240496Z * [new branch] gh/mikaylagawarecki/234/head -> origin/gh/mikaylagawarecki/234/head 2025-12-04T08:54:02.3240593Z * [new branch] gh/mikaylagawarecki/235/base -> origin/gh/mikaylagawarecki/235/base 2025-12-04T08:54:02.3240685Z * [new branch] gh/mikaylagawarecki/235/head -> origin/gh/mikaylagawarecki/235/head 2025-12-04T08:54:02.3240779Z * [new branch] gh/mikaylagawarecki/236/base -> origin/gh/mikaylagawarecki/236/base 2025-12-04T08:54:02.3240871Z * [new branch] gh/mikaylagawarecki/236/head -> origin/gh/mikaylagawarecki/236/head 2025-12-04T08:54:02.3240964Z * [new branch] gh/mikaylagawarecki/237/base -> origin/gh/mikaylagawarecki/237/base 2025-12-04T08:54:02.3241056Z * [new branch] gh/mikaylagawarecki/237/head -> origin/gh/mikaylagawarecki/237/head 2025-12-04T08:54:02.3241148Z * [new branch] gh/mikaylagawarecki/238/base -> origin/gh/mikaylagawarecki/238/base 2025-12-04T08:54:02.3241289Z * [new branch] gh/mikaylagawarecki/238/head -> origin/gh/mikaylagawarecki/238/head 2025-12-04T08:54:02.3241383Z * [new branch] gh/mikaylagawarecki/336/base -> origin/gh/mikaylagawarecki/336/base 2025-12-04T08:54:02.3241478Z * [new branch] gh/mikaylagawarecki/336/head -> origin/gh/mikaylagawarecki/336/head 2025-12-04T08:54:02.3241571Z * [new 
branch] gh/mikaylagawarecki/336/orig -> origin/gh/mikaylagawarecki/336/orig 2025-12-04T08:54:02.3241662Z * [new branch] gh/mikaylagawarecki/341/base -> origin/gh/mikaylagawarecki/341/base 2025-12-04T08:54:02.3241755Z * [new branch] gh/mikaylagawarecki/341/head -> origin/gh/mikaylagawarecki/341/head 2025-12-04T08:54:02.3241847Z * [new branch] gh/mikaylagawarecki/341/orig -> origin/gh/mikaylagawarecki/341/orig 2025-12-04T08:54:02.3241940Z * [new branch] gh/mikaylagawarecki/342/base -> origin/gh/mikaylagawarecki/342/base 2025-12-04T08:54:02.3242032Z * [new branch] gh/mikaylagawarecki/342/head -> origin/gh/mikaylagawarecki/342/head 2025-12-04T08:54:02.3242124Z * [new branch] gh/mikaylagawarecki/342/orig -> origin/gh/mikaylagawarecki/342/orig 2025-12-04T08:54:02.3242221Z * [new branch] gh/mikaylagawarecki/345/base -> origin/gh/mikaylagawarecki/345/base 2025-12-04T08:54:02.3242351Z * [new branch] gh/mikaylagawarecki/345/head -> origin/gh/mikaylagawarecki/345/head 2025-12-04T08:54:02.3242443Z * [new branch] gh/mikaylagawarecki/345/orig -> origin/gh/mikaylagawarecki/345/orig 2025-12-04T08:54:02.3242535Z * [new branch] gh/mikaylagawarecki/346/base -> origin/gh/mikaylagawarecki/346/base 2025-12-04T08:54:02.3242628Z * [new branch] gh/mikaylagawarecki/346/head -> origin/gh/mikaylagawarecki/346/head 2025-12-04T08:54:02.3242721Z * [new branch] gh/mikaylagawarecki/346/orig -> origin/gh/mikaylagawarecki/346/orig 2025-12-04T08:54:02.3242816Z * [new branch] gh/mikaylagawarecki/347/base -> origin/gh/mikaylagawarecki/347/base 2025-12-04T08:54:02.3242908Z * [new branch] gh/mikaylagawarecki/347/head -> origin/gh/mikaylagawarecki/347/head 2025-12-04T08:54:02.3242999Z * [new branch] gh/mikaylagawarecki/347/orig -> origin/gh/mikaylagawarecki/347/orig 2025-12-04T08:54:02.3243095Z * [new branch] gh/mikaylagawarecki/350/base -> origin/gh/mikaylagawarecki/350/base 2025-12-04T08:54:02.3243187Z * [new branch] gh/mikaylagawarecki/350/head -> origin/gh/mikaylagawarecki/350/head 
2025-12-04T08:54:02.3243282Z * [new branch] gh/mikaylagawarecki/350/orig -> origin/gh/mikaylagawarecki/350/orig 2025-12-04T08:54:02.3243374Z * [new branch] gh/mikaylagawarecki/351/base -> origin/gh/mikaylagawarecki/351/base 2025-12-04T08:54:02.3243466Z * [new branch] gh/mikaylagawarecki/351/head -> origin/gh/mikaylagawarecki/351/head 2025-12-04T08:54:02.3243563Z * [new branch] gh/mikaylagawarecki/351/orig -> origin/gh/mikaylagawarecki/351/orig 2025-12-04T08:54:02.3243653Z * [new branch] gh/mikaylagawarecki/352/base -> origin/gh/mikaylagawarecki/352/base 2025-12-04T08:54:02.3243745Z * [new branch] gh/mikaylagawarecki/352/head -> origin/gh/mikaylagawarecki/352/head 2025-12-04T08:54:02.3243840Z * [new branch] gh/mikaylagawarecki/352/orig -> origin/gh/mikaylagawarecki/352/orig 2025-12-04T08:54:02.3243932Z * [new branch] gh/mikaylagawarecki/353/base -> origin/gh/mikaylagawarecki/353/base 2025-12-04T08:54:02.3244023Z * [new branch] gh/mikaylagawarecki/353/head -> origin/gh/mikaylagawarecki/353/head 2025-12-04T08:54:02.3244117Z * [new branch] gh/mikaylagawarecki/353/orig -> origin/gh/mikaylagawarecki/353/orig 2025-12-04T08:54:02.3244210Z * [new branch] gh/mikaylagawarecki/354/base -> origin/gh/mikaylagawarecki/354/base 2025-12-04T08:54:02.3244329Z * [new branch] gh/mikaylagawarecki/354/head -> origin/gh/mikaylagawarecki/354/head 2025-12-04T08:54:02.3244424Z * [new branch] gh/mikaylagawarecki/354/orig -> origin/gh/mikaylagawarecki/354/orig 2025-12-04T08:54:02.3244516Z * [new branch] gh/mikaylagawarecki/356/base -> origin/gh/mikaylagawarecki/356/base 2025-12-04T08:54:02.3244611Z * [new branch] gh/mikaylagawarecki/356/head -> origin/gh/mikaylagawarecki/356/head 2025-12-04T08:54:02.3244701Z * [new branch] gh/mikaylagawarecki/356/orig -> origin/gh/mikaylagawarecki/356/orig 2025-12-04T08:54:02.3244792Z * [new branch] gh/mikaylagawarecki/357/base -> origin/gh/mikaylagawarecki/357/base 2025-12-04T08:54:02.3244886Z * [new branch] gh/mikaylagawarecki/357/head -> 
origin/gh/mikaylagawarecki/357/head 2025-12-04T08:54:02.3244977Z * [new branch] gh/mikaylagawarecki/357/orig -> origin/gh/mikaylagawarecki/357/orig 2025-12-04T08:54:02.3245069Z * [new branch] gh/mikaylagawarecki/359/base -> origin/gh/mikaylagawarecki/359/base 2025-12-04T08:54:02.3245163Z * [new branch] gh/mikaylagawarecki/359/head -> origin/gh/mikaylagawarecki/359/head 2025-12-04T08:54:02.3245255Z * [new branch] gh/mikaylagawarecki/359/orig -> origin/gh/mikaylagawarecki/359/orig 2025-12-04T08:54:02.3245372Z * [new branch] gh/mikaylagawarecki/360/base -> origin/gh/mikaylagawarecki/360/base 2025-12-04T08:54:02.3245465Z * [new branch] gh/mikaylagawarecki/360/head -> origin/gh/mikaylagawarecki/360/head 2025-12-04T08:54:02.3245556Z * [new branch] gh/mikaylagawarecki/360/orig -> origin/gh/mikaylagawarecki/360/orig 2025-12-04T08:54:02.3245648Z * [new branch] gh/mikaylagawarecki/361/base -> origin/gh/mikaylagawarecki/361/base 2025-12-04T08:54:02.3245743Z * [new branch] gh/mikaylagawarecki/361/head -> origin/gh/mikaylagawarecki/361/head 2025-12-04T08:54:02.3245836Z * [new branch] gh/mikaylagawarecki/361/orig -> origin/gh/mikaylagawarecki/361/orig 2025-12-04T08:54:02.3245927Z * [new branch] gh/mikaylagawarecki/362/base -> origin/gh/mikaylagawarecki/362/base 2025-12-04T08:54:02.3246019Z * [new branch] gh/mikaylagawarecki/362/head -> origin/gh/mikaylagawarecki/362/head 2025-12-04T08:54:02.3246111Z * [new branch] gh/mikaylagawarecki/362/orig -> origin/gh/mikaylagawarecki/362/orig 2025-12-04T08:54:02.3246204Z * [new branch] gh/mikaylagawarecki/363/base -> origin/gh/mikaylagawarecki/363/base 2025-12-04T08:54:02.3246296Z * [new branch] gh/mikaylagawarecki/363/head -> origin/gh/mikaylagawarecki/363/head 2025-12-04T08:54:02.3246386Z * [new branch] gh/mikaylagawarecki/363/orig -> origin/gh/mikaylagawarecki/363/orig 2025-12-04T08:54:02.3246480Z * [new branch] gh/mikaylagawarecki/364/base -> origin/gh/mikaylagawarecki/364/base 2025-12-04T08:54:02.3246573Z * [new branch] 
gh/mikaylagawarecki/364/head -> origin/gh/mikaylagawarecki/364/head 2025-12-04T08:54:02.3246663Z * [new branch] gh/mikaylagawarecki/364/orig -> origin/gh/mikaylagawarecki/364/orig 2025-12-04T08:54:02.3246760Z * [new branch] gh/mikaylagawarecki/365/base -> origin/gh/mikaylagawarecki/365/base 2025-12-04T08:54:02.3246851Z * [new branch] gh/mikaylagawarecki/365/head -> origin/gh/mikaylagawarecki/365/head 2025-12-04T08:54:02.3246942Z * [new branch] gh/mikaylagawarecki/365/orig -> origin/gh/mikaylagawarecki/365/orig 2025-12-04T08:54:02.3247035Z * [new branch] gh/mikaylagawarecki/366/base -> origin/gh/mikaylagawarecki/366/base 2025-12-04T08:54:02.3247126Z * [new branch] gh/mikaylagawarecki/366/head -> origin/gh/mikaylagawarecki/366/head 2025-12-04T08:54:02.3247217Z * [new branch] gh/mikaylagawarecki/366/orig -> origin/gh/mikaylagawarecki/366/orig 2025-12-04T08:54:02.3247343Z * [new branch] gh/mikaylagawarecki/367/base -> origin/gh/mikaylagawarecki/367/base 2025-12-04T08:54:02.3247434Z * [new branch] gh/mikaylagawarecki/367/head -> origin/gh/mikaylagawarecki/367/head 2025-12-04T08:54:02.3247528Z * [new branch] gh/mikaylagawarecki/367/orig -> origin/gh/mikaylagawarecki/367/orig 2025-12-04T08:54:02.3247620Z * [new branch] gh/mikaylagawarecki/368/base -> origin/gh/mikaylagawarecki/368/base 2025-12-04T08:54:02.3247711Z * [new branch] gh/mikaylagawarecki/368/head -> origin/gh/mikaylagawarecki/368/head 2025-12-04T08:54:02.3247804Z * [new branch] gh/mikaylagawarecki/368/orig -> origin/gh/mikaylagawarecki/368/orig 2025-12-04T08:54:02.3247895Z * [new branch] gh/mikaylagawarecki/369/base -> origin/gh/mikaylagawarecki/369/base 2025-12-04T08:54:02.3247986Z * [new branch] gh/mikaylagawarecki/369/head -> origin/gh/mikaylagawarecki/369/head 2025-12-04T08:54:02.3248079Z * [new branch] gh/mikaylagawarecki/369/orig -> origin/gh/mikaylagawarecki/369/orig 2025-12-04T08:54:02.3248170Z * [new branch] gh/mikaylagawarecki/370/base -> origin/gh/mikaylagawarecki/370/base 
2025-12-04T08:54:02.3248261Z * [new branch] gh/mikaylagawarecki/370/head -> origin/gh/mikaylagawarecki/370/head 2025-12-04T08:54:02.3248386Z * [new branch] gh/mikaylagawarecki/370/orig -> origin/gh/mikaylagawarecki/370/orig 2025-12-04T08:54:02.3248477Z * [new branch] gh/mikaylagawarecki/371/base -> origin/gh/mikaylagawarecki/371/base 2025-12-04T08:54:02.3248569Z * [new branch] gh/mikaylagawarecki/371/head -> origin/gh/mikaylagawarecki/371/head 2025-12-04T08:54:02.3248661Z * [new branch] gh/mikaylagawarecki/371/orig -> origin/gh/mikaylagawarecki/371/orig 2025-12-04T08:54:02.3248752Z * [new branch] gh/mikaylagawarecki/372/base -> origin/gh/mikaylagawarecki/372/base 2025-12-04T08:54:02.3248845Z * [new branch] gh/mikaylagawarecki/372/head -> origin/gh/mikaylagawarecki/372/head 2025-12-04T08:54:02.3248936Z * [new branch] gh/mikaylagawarecki/372/orig -> origin/gh/mikaylagawarecki/372/orig 2025-12-04T08:54:02.3249028Z * [new branch] gh/mikaylagawarecki/373/base -> origin/gh/mikaylagawarecki/373/base 2025-12-04T08:54:02.3249123Z * [new branch] gh/mikaylagawarecki/373/head -> origin/gh/mikaylagawarecki/373/head 2025-12-04T08:54:02.3249214Z * [new branch] gh/mikaylagawarecki/373/orig -> origin/gh/mikaylagawarecki/373/orig 2025-12-04T08:54:02.3249306Z * [new branch] gh/mikaylagawarecki/374/base -> origin/gh/mikaylagawarecki/374/base 2025-12-04T08:54:02.3249398Z * [new branch] gh/mikaylagawarecki/374/head -> origin/gh/mikaylagawarecki/374/head 2025-12-04T08:54:02.3249488Z * [new branch] gh/mikaylagawarecki/374/orig -> origin/gh/mikaylagawarecki/374/orig 2025-12-04T08:54:02.3249580Z * [new branch] gh/mikaylagawarecki/375/base -> origin/gh/mikaylagawarecki/375/base 2025-12-04T08:54:02.3249673Z * [new branch] gh/mikaylagawarecki/375/head -> origin/gh/mikaylagawarecki/375/head 2025-12-04T08:54:02.3249764Z * [new branch] gh/mikaylagawarecki/375/orig -> origin/gh/mikaylagawarecki/375/orig 2025-12-04T08:54:02.3249857Z * [new branch] gh/mikaylagawarecki/376/base -> 
origin/gh/mikaylagawarecki/376/base 2025-12-04T08:54:02.3249950Z * [new branch] gh/mikaylagawarecki/376/head -> origin/gh/mikaylagawarecki/376/head 2025-12-04T08:54:02.3250041Z * [new branch] gh/mikaylagawarecki/376/orig -> origin/gh/mikaylagawarecki/376/orig 2025-12-04T08:54:02.3250170Z * [new branch] gh/mikaylagawarecki/377/base -> origin/gh/mikaylagawarecki/377/base 2025-12-04T08:54:02.3250265Z * [new branch] gh/mikaylagawarecki/377/head -> origin/gh/mikaylagawarecki/377/head 2025-12-04T08:54:02.3250402Z * [new branch] gh/mikaylagawarecki/377/orig -> origin/gh/mikaylagawarecki/377/orig 2025-12-04T08:54:02.3250496Z * [new branch] gh/mikaylagawarecki/378/base -> origin/gh/mikaylagawarecki/378/base 2025-12-04T08:54:02.3250588Z * [new branch] gh/mikaylagawarecki/378/head -> origin/gh/mikaylagawarecki/378/head 2025-12-04T08:54:02.3250680Z * [new branch] gh/mikaylagawarecki/378/orig -> origin/gh/mikaylagawarecki/378/orig 2025-12-04T08:54:02.3250773Z * [new branch] gh/mikaylagawarecki/379/base -> origin/gh/mikaylagawarecki/379/base 2025-12-04T08:54:02.3250864Z * [new branch] gh/mikaylagawarecki/379/head -> origin/gh/mikaylagawarecki/379/head 2025-12-04T08:54:02.3250955Z * [new branch] gh/mikaylagawarecki/379/orig -> origin/gh/mikaylagawarecki/379/orig 2025-12-04T08:54:02.3251049Z * [new branch] gh/mikaylagawarecki/380/base -> origin/gh/mikaylagawarecki/380/base 2025-12-04T08:54:02.3251141Z * [new branch] gh/mikaylagawarecki/380/head -> origin/gh/mikaylagawarecki/380/head 2025-12-04T08:54:02.3251233Z * [new branch] gh/mikaylagawarecki/380/orig -> origin/gh/mikaylagawarecki/380/orig 2025-12-04T08:54:02.3251325Z * [new branch] gh/mikaylagawarecki/381/base -> origin/gh/mikaylagawarecki/381/base 2025-12-04T08:54:02.3251468Z * [new branch] gh/mikaylagawarecki/381/head -> origin/gh/mikaylagawarecki/381/head 2025-12-04T08:54:02.3251559Z * [new branch] gh/mikaylagawarecki/381/orig -> origin/gh/mikaylagawarecki/381/orig 2025-12-04T08:54:02.3251651Z * [new branch] 
gh/mikaylagawarecki/382/base -> origin/gh/mikaylagawarecki/382/base 2025-12-04T08:54:02.3251742Z * [new branch] gh/mikaylagawarecki/382/head -> origin/gh/mikaylagawarecki/382/head 2025-12-04T08:54:02.3251833Z * [new branch] gh/mikaylagawarecki/382/orig -> origin/gh/mikaylagawarecki/382/orig 2025-12-04T08:54:02.3251927Z * [new branch] gh/mikaylagawarecki/383/base -> origin/gh/mikaylagawarecki/383/base 2025-12-04T08:54:02.3252020Z * [new branch] gh/mikaylagawarecki/383/head -> origin/gh/mikaylagawarecki/383/head 2025-12-04T08:54:02.3252115Z * [new branch] gh/mikaylagawarecki/383/orig -> origin/gh/mikaylagawarecki/383/orig 2025-12-04T08:54:02.3252207Z * [new branch] gh/mikaylagawarecki/384/base -> origin/gh/mikaylagawarecki/384/base 2025-12-04T08:54:02.3252298Z * [new branch] gh/mikaylagawarecki/384/head -> origin/gh/mikaylagawarecki/384/head 2025-12-04T08:54:02.3252390Z * [new branch] gh/mikaylagawarecki/384/orig -> origin/gh/mikaylagawarecki/384/orig 2025-12-04T08:54:02.3252481Z * [new branch] gh/mikaylagawarecki/385/base -> origin/gh/mikaylagawarecki/385/base 2025-12-04T08:54:02.3252572Z * [new branch] gh/mikaylagawarecki/385/head -> origin/gh/mikaylagawarecki/385/head 2025-12-04T08:54:02.3252666Z * [new branch] gh/mikaylagawarecki/385/orig -> origin/gh/mikaylagawarecki/385/orig 2025-12-04T08:54:02.3252756Z * [new branch] gh/mikaylagawarecki/386/base -> origin/gh/mikaylagawarecki/386/base 2025-12-04T08:54:02.3252847Z * [new branch] gh/mikaylagawarecki/386/head -> origin/gh/mikaylagawarecki/386/head 2025-12-04T08:54:02.3252941Z * [new branch] gh/mikaylagawarecki/386/orig -> origin/gh/mikaylagawarecki/386/orig 2025-12-04T08:54:02.3253032Z * [new branch] gh/mikaylagawarecki/387/base -> origin/gh/mikaylagawarecki/387/base 2025-12-04T08:54:02.3253122Z * [new branch] gh/mikaylagawarecki/387/head -> origin/gh/mikaylagawarecki/387/head 2025-12-04T08:54:02.3253214Z * [new branch] gh/mikaylagawarecki/387/orig -> origin/gh/mikaylagawarecki/387/orig 
2025-12-04T08:54:02.3253305Z * [new branch] gh/mikaylagawarecki/388/base -> origin/gh/mikaylagawarecki/388/base 2025-12-04T08:54:02.3253422Z * [new branch] gh/mikaylagawarecki/388/head -> origin/gh/mikaylagawarecki/388/head 2025-12-04T08:54:02.3253514Z * [new branch] gh/mikaylagawarecki/388/orig -> origin/gh/mikaylagawarecki/388/orig 2025-12-04T08:54:02.3253604Z * [new branch] gh/mikaylagawarecki/389/base -> origin/gh/mikaylagawarecki/389/base 2025-12-04T08:54:02.3253697Z * [new branch] gh/mikaylagawarecki/389/head -> origin/gh/mikaylagawarecki/389/head 2025-12-04T08:54:02.3253789Z * [new branch] gh/mikaylagawarecki/389/orig -> origin/gh/mikaylagawarecki/389/orig 2025-12-04T08:54:02.3253882Z * [new branch] gh/mikaylagawarecki/390/base -> origin/gh/mikaylagawarecki/390/base 2025-12-04T08:54:02.3253975Z * [new branch] gh/mikaylagawarecki/390/head -> origin/gh/mikaylagawarecki/390/head 2025-12-04T08:54:02.3254068Z * [new branch] gh/mikaylagawarecki/390/orig -> origin/gh/mikaylagawarecki/390/orig 2025-12-04T08:54:02.3254162Z * [new branch] gh/mikaylagawarecki/391/base -> origin/gh/mikaylagawarecki/391/base 2025-12-04T08:54:02.3254259Z * [new branch] gh/mikaylagawarecki/391/head -> origin/gh/mikaylagawarecki/391/head 2025-12-04T08:54:02.3254352Z * [new branch] gh/mikaylagawarecki/391/orig -> origin/gh/mikaylagawarecki/391/orig 2025-12-04T08:54:02.3254469Z * [new branch] gh/mikaylagawarecki/392/base -> origin/gh/mikaylagawarecki/392/base 2025-12-04T08:54:02.3254566Z * [new branch] gh/mikaylagawarecki/392/head -> origin/gh/mikaylagawarecki/392/head 2025-12-04T08:54:02.3254659Z * [new branch] gh/mikaylagawarecki/392/orig -> origin/gh/mikaylagawarecki/392/orig 2025-12-04T08:54:02.3254734Z * [new branch] gh/mlazos/41/base -> origin/gh/mlazos/41/base 2025-12-04T08:54:02.3254804Z * [new branch] gh/mlazos/41/head -> origin/gh/mlazos/41/head 2025-12-04T08:54:02.3254875Z * [new branch] gh/mlazos/41/orig -> origin/gh/mlazos/41/orig 2025-12-04T08:54:02.3254946Z * [new branch] 
gh/mlazos/42/base -> origin/gh/mlazos/42/base 2025-12-04T08:54:02.3255014Z * [new branch] gh/mlazos/42/head -> origin/gh/mlazos/42/head 2025-12-04T08:54:02.3255081Z * [new branch] gh/mlazos/42/orig -> origin/gh/mlazos/42/orig 2025-12-04T08:54:02.3255148Z * [new branch] gh/mlazos/43/base -> origin/gh/mlazos/43/base 2025-12-04T08:54:02.3255214Z * [new branch] gh/mlazos/43/head -> origin/gh/mlazos/43/head 2025-12-04T08:54:02.3255280Z * [new branch] gh/mlazos/43/orig -> origin/gh/mlazos/43/orig 2025-12-04T08:54:02.3255346Z * [new branch] gh/mlazos/44/base -> origin/gh/mlazos/44/base 2025-12-04T08:54:02.3255412Z * [new branch] gh/mlazos/44/head -> origin/gh/mlazos/44/head 2025-12-04T08:54:02.3255479Z * [new branch] gh/mlazos/44/orig -> origin/gh/mlazos/44/orig 2025-12-04T08:54:02.3255546Z * [new branch] gh/mlazos/47/base -> origin/gh/mlazos/47/base 2025-12-04T08:54:02.3255613Z * [new branch] gh/mlazos/47/head -> origin/gh/mlazos/47/head 2025-12-04T08:54:02.3255678Z * [new branch] gh/mlazos/47/orig -> origin/gh/mlazos/47/orig 2025-12-04T08:54:02.3255746Z * [new branch] gh/mlazos/48/base -> origin/gh/mlazos/48/base 2025-12-04T08:54:02.3255812Z * [new branch] gh/mlazos/48/head -> origin/gh/mlazos/48/head 2025-12-04T08:54:02.3255878Z * [new branch] gh/mlazos/48/orig -> origin/gh/mlazos/48/orig 2025-12-04T08:54:02.3255944Z * [new branch] gh/mlazos/49/base -> origin/gh/mlazos/49/base 2025-12-04T08:54:02.3256011Z * [new branch] gh/mlazos/49/head -> origin/gh/mlazos/49/head 2025-12-04T08:54:02.3256104Z * [new branch] gh/mlazos/49/orig -> origin/gh/mlazos/49/orig 2025-12-04T08:54:02.3256171Z * [new branch] gh/mlazos/50/base -> origin/gh/mlazos/50/base 2025-12-04T08:54:02.3256237Z * [new branch] gh/mlazos/50/head -> origin/gh/mlazos/50/head 2025-12-04T08:54:02.3256303Z * [new branch] gh/mlazos/50/orig -> origin/gh/mlazos/50/orig 2025-12-04T08:54:02.3256372Z * [new branch] gh/mlazos/51/base -> origin/gh/mlazos/51/base 2025-12-04T08:54:02.3256438Z * [new branch] gh/mlazos/51/head 
-> origin/gh/mlazos/51/head 2025-12-04T08:54:02.3256505Z * [new branch] gh/mlazos/51/orig -> origin/gh/mlazos/51/orig 2025-12-04T08:54:02.3256572Z * [new branch] gh/mlazos/52/base -> origin/gh/mlazos/52/base 2025-12-04T08:54:02.3256638Z * [new branch] gh/mlazos/52/head -> origin/gh/mlazos/52/head 2025-12-04T08:54:02.3256707Z * [new branch] gh/mlazos/52/orig -> origin/gh/mlazos/52/orig 2025-12-04T08:54:02.3256776Z * [new branch] gh/mlazos/53/base -> origin/gh/mlazos/53/base 2025-12-04T08:54:02.3256841Z * [new branch] gh/mlazos/53/head -> origin/gh/mlazos/53/head 2025-12-04T08:54:02.3256908Z * [new branch] gh/mlazos/53/orig -> origin/gh/mlazos/53/orig 2025-12-04T08:54:02.3257003Z * [new branch] gh/mlazos/54/base -> origin/gh/mlazos/54/base 2025-12-04T08:54:02.3257070Z * [new branch] gh/mlazos/54/head -> origin/gh/mlazos/54/head 2025-12-04T08:54:02.3257140Z * [new branch] gh/mlazos/54/orig -> origin/gh/mlazos/54/orig 2025-12-04T08:54:02.3257206Z * [new branch] gh/mlazos/55/base -> origin/gh/mlazos/55/base 2025-12-04T08:54:02.3257272Z * [new branch] gh/mlazos/55/head -> origin/gh/mlazos/55/head 2025-12-04T08:54:02.3257339Z * [new branch] gh/mlazos/55/orig -> origin/gh/mlazos/55/orig 2025-12-04T08:54:02.3257407Z * [new branch] gh/mlazos/56/base -> origin/gh/mlazos/56/base 2025-12-04T08:54:02.3257472Z * [new branch] gh/mlazos/56/head -> origin/gh/mlazos/56/head 2025-12-04T08:54:02.3257538Z * [new branch] gh/mlazos/56/orig -> origin/gh/mlazos/56/orig 2025-12-04T08:54:02.3257606Z * [new branch] gh/mlazos/57/base -> origin/gh/mlazos/57/base 2025-12-04T08:54:02.3257672Z * [new branch] gh/mlazos/57/head -> origin/gh/mlazos/57/head 2025-12-04T08:54:02.3257738Z * [new branch] gh/mlazos/57/orig -> origin/gh/mlazos/57/orig 2025-12-04T08:54:02.3257804Z * [new branch] gh/mlazos/58/base -> origin/gh/mlazos/58/base 2025-12-04T08:54:02.3257870Z * [new branch] gh/mlazos/58/head -> origin/gh/mlazos/58/head 2025-12-04T08:54:02.3257937Z * [new branch] gh/mlazos/58/orig -> 
origin/gh/mlazos/58/orig 2025-12-04T08:54:02.3258004Z * [new branch] gh/mlazos/59/base -> origin/gh/mlazos/59/base 2025-12-04T08:54:02.3258072Z * [new branch] gh/mlazos/59/head -> origin/gh/mlazos/59/head 2025-12-04T08:54:02.3258137Z * [new branch] gh/mlazos/59/orig -> origin/gh/mlazos/59/orig 2025-12-04T08:54:02.3258207Z * [new branch] gh/mlazos/60/base -> origin/gh/mlazos/60/base 2025-12-04T08:54:02.3258277Z * [new branch] gh/mlazos/60/head -> origin/gh/mlazos/60/head 2025-12-04T08:54:02.3258344Z * [new branch] gh/mlazos/60/orig -> origin/gh/mlazos/60/orig 2025-12-04T08:54:02.3258410Z * [new branch] gh/mlazos/61/base -> origin/gh/mlazos/61/base 2025-12-04T08:54:02.3258477Z * [new branch] gh/mlazos/61/head -> origin/gh/mlazos/61/head 2025-12-04T08:54:02.3258544Z * [new branch] gh/mlazos/61/orig -> origin/gh/mlazos/61/orig 2025-12-04T08:54:02.3258644Z * [new branch] gh/mlazos/62/base -> origin/gh/mlazos/62/base 2025-12-04T08:54:02.3258712Z * [new branch] gh/mlazos/62/head -> origin/gh/mlazos/62/head 2025-12-04T08:54:02.3258778Z * [new branch] gh/mlazos/62/orig -> origin/gh/mlazos/62/orig 2025-12-04T08:54:02.3258844Z * [new branch] gh/mlazos/63/base -> origin/gh/mlazos/63/base 2025-12-04T08:54:02.3258912Z * [new branch] gh/mlazos/63/head -> origin/gh/mlazos/63/head 2025-12-04T08:54:02.3258977Z * [new branch] gh/mlazos/63/orig -> origin/gh/mlazos/63/orig 2025-12-04T08:54:02.3259044Z * [new branch] gh/mlazos/64/base -> origin/gh/mlazos/64/base 2025-12-04T08:54:02.3259112Z * [new branch] gh/mlazos/64/head -> origin/gh/mlazos/64/head 2025-12-04T08:54:02.3259178Z * [new branch] gh/mlazos/64/orig -> origin/gh/mlazos/64/orig 2025-12-04T08:54:02.3259245Z * [new branch] gh/mlazos/65/base -> origin/gh/mlazos/65/base 2025-12-04T08:54:02.3259312Z * [new branch] gh/mlazos/65/head -> origin/gh/mlazos/65/head 2025-12-04T08:54:02.3259378Z * [new branch] gh/mlazos/65/orig -> origin/gh/mlazos/65/orig 2025-12-04T08:54:02.3259463Z * [new branch] gh/mlazos/66/base -> 
origin/gh/mlazos/66/base 2025-12-04T08:54:02.3259531Z * [new branch] gh/mlazos/66/head -> origin/gh/mlazos/66/head 2025-12-04T08:54:02.3259597Z * [new branch] gh/mlazos/66/orig -> origin/gh/mlazos/66/orig 2025-12-04T08:54:02.3259665Z * [new branch] gh/mlazos/67/base -> origin/gh/mlazos/67/base 2025-12-04T08:54:02.3259731Z * [new branch] gh/mlazos/67/head -> origin/gh/mlazos/67/head 2025-12-04T08:54:02.3259796Z * [new branch] gh/mlazos/67/orig -> origin/gh/mlazos/67/orig 2025-12-04T08:54:02.3259865Z * [new branch] gh/mlazos/68/base -> origin/gh/mlazos/68/base 2025-12-04T08:54:02.3259931Z * [new branch] gh/mlazos/68/head -> origin/gh/mlazos/68/head 2025-12-04T08:54:02.3259997Z * [new branch] gh/mlazos/68/orig -> origin/gh/mlazos/68/orig 2025-12-04T08:54:02.3260065Z * [new branch] gh/mlazos/69/base -> origin/gh/mlazos/69/base 2025-12-04T08:54:02.3260178Z * [new branch] gh/mlazos/69/head -> origin/gh/mlazos/69/head 2025-12-04T08:54:02.3260245Z * [new branch] gh/mlazos/69/orig -> origin/gh/mlazos/69/orig 2025-12-04T08:54:02.3260312Z * [new branch] gh/mlazos/70/base -> origin/gh/mlazos/70/base 2025-12-04T08:54:02.3260378Z * [new branch] gh/mlazos/70/head -> origin/gh/mlazos/70/head 2025-12-04T08:54:02.3260443Z * [new branch] gh/mlazos/70/orig -> origin/gh/mlazos/70/orig 2025-12-04T08:54:02.3260512Z * [new branch] gh/mlazos/71/base -> origin/gh/mlazos/71/base 2025-12-04T08:54:02.3260580Z * [new branch] gh/mlazos/71/head -> origin/gh/mlazos/71/head 2025-12-04T08:54:02.3260648Z * [new branch] gh/mlazos/71/orig -> origin/gh/mlazos/71/orig 2025-12-04T08:54:02.3260715Z * [new branch] gh/mlazos/72/base -> origin/gh/mlazos/72/base 2025-12-04T08:54:02.3260782Z * [new branch] gh/mlazos/72/head -> origin/gh/mlazos/72/head 2025-12-04T08:54:02.3260848Z * [new branch] gh/mlazos/72/orig -> origin/gh/mlazos/72/orig 2025-12-04T08:54:02.3260915Z * [new branch] gh/mlazos/73/base -> origin/gh/mlazos/73/base 2025-12-04T08:54:02.3260981Z * [new branch] gh/mlazos/73/head -> 
origin/gh/mlazos/73/head 2025-12-04T08:54:02.3261048Z * [new branch] gh/mlazos/73/orig -> origin/gh/mlazos/73/orig 2025-12-04T08:54:02.3261167Z * [new branch] gh/mrmiywj/1/base -> origin/gh/mrmiywj/1/base 2025-12-04T08:54:02.3261234Z * [new branch] gh/mrmiywj/1/head -> origin/gh/mrmiywj/1/head 2025-12-04T08:54:02.3261307Z * [new branch] gh/muchulee8/73/base -> origin/gh/muchulee8/73/base 2025-12-04T08:54:02.3261381Z * [new branch] gh/muchulee8/73/head -> origin/gh/muchulee8/73/head 2025-12-04T08:54:02.3261454Z * [new branch] gh/muchulee8/73/orig -> origin/gh/muchulee8/73/orig 2025-12-04T08:54:02.3261540Z * [new branch] gh/naveenthangudu/1/base -> origin/gh/naveenthangudu/1/base 2025-12-04T08:54:02.3261623Z * [new branch] gh/naveenthangudu/1/head -> origin/gh/naveenthangudu/1/head 2025-12-04T08:54:02.3261704Z * [new branch] gh/naveenthangudu/1/orig -> origin/gh/naveenthangudu/1/orig 2025-12-04T08:54:02.3261786Z * [new branch] gh/naveenthangudu/2/base -> origin/gh/naveenthangudu/2/base 2025-12-04T08:54:02.3261867Z * [new branch] gh/naveenthangudu/2/head -> origin/gh/naveenthangudu/2/head 2025-12-04T08:54:02.3261946Z * [new branch] gh/naveenthangudu/2/orig -> origin/gh/naveenthangudu/2/orig 2025-12-04T08:54:02.3262026Z * [new branch] gh/naveenthangudu/3/base -> origin/gh/naveenthangudu/3/base 2025-12-04T08:54:02.3262150Z * [new branch] gh/naveenthangudu/3/head -> origin/gh/naveenthangudu/3/head 2025-12-04T08:54:02.3262230Z * [new branch] gh/naveenthangudu/3/orig -> origin/gh/naveenthangudu/3/orig 2025-12-04T08:54:02.3262310Z * [new branch] gh/naveenthangudu/4/base -> origin/gh/naveenthangudu/4/base 2025-12-04T08:54:02.3262389Z * [new branch] gh/naveenthangudu/4/head -> origin/gh/naveenthangudu/4/head 2025-12-04T08:54:02.3262468Z * [new branch] gh/naveenthangudu/4/orig -> origin/gh/naveenthangudu/4/orig 2025-12-04T08:54:02.3262549Z * [new branch] gh/naveenthangudu/5/base -> origin/gh/naveenthangudu/5/base 2025-12-04T08:54:02.3262631Z * [new branch] 
gh/naveenthangudu/5/head -> origin/gh/naveenthangudu/5/head 2025-12-04T08:54:02.3262709Z * [new branch] gh/naveenthangudu/5/orig -> origin/gh/naveenthangudu/5/orig 2025-12-04T08:54:02.3262789Z * [new branch] gh/naveenthangudu/6/base -> origin/gh/naveenthangudu/6/base 2025-12-04T08:54:02.3262869Z * [new branch] gh/naveenthangudu/6/head -> origin/gh/naveenthangudu/6/head 2025-12-04T08:54:02.3262952Z * [new branch] gh/naveenthangudu/6/orig -> origin/gh/naveenthangudu/6/orig 2025-12-04T08:54:02.3263031Z * [new branch] gh/naveenthangudu/7/base -> origin/gh/naveenthangudu/7/base 2025-12-04T08:54:02.3263110Z * [new branch] gh/naveenthangudu/7/head -> origin/gh/naveenthangudu/7/head 2025-12-04T08:54:02.3263190Z * [new branch] gh/naveenthangudu/7/orig -> origin/gh/naveenthangudu/7/orig 2025-12-04T08:54:02.3263270Z * [new branch] gh/naveenthangudu/8/base -> origin/gh/naveenthangudu/8/base 2025-12-04T08:54:02.3263349Z * [new branch] gh/naveenthangudu/8/head -> origin/gh/naveenthangudu/8/head 2025-12-04T08:54:02.3263429Z * [new branch] gh/naveenthangudu/8/orig -> origin/gh/naveenthangudu/8/orig 2025-12-04T08:54:02.3263510Z * [new branch] gh/naveenthangudu/9/base -> origin/gh/naveenthangudu/9/base 2025-12-04T08:54:02.3263589Z * [new branch] gh/naveenthangudu/9/head -> origin/gh/naveenthangudu/9/head 2025-12-04T08:54:02.3263669Z * [new branch] gh/naveenthangudu/9/orig -> origin/gh/naveenthangudu/9/orig 2025-12-04T08:54:02.3263744Z * [new branch] gh/nikitaved/1/base -> origin/gh/nikitaved/1/base 2025-12-04T08:54:02.3263817Z * [new branch] gh/nikitaved/1/head -> origin/gh/nikitaved/1/head 2025-12-04T08:54:02.3263889Z * [new branch] gh/nikitaved/1/orig -> origin/gh/nikitaved/1/orig 2025-12-04T08:54:02.3263990Z * [new branch] gh/nikitaved/10/base -> origin/gh/nikitaved/10/base 2025-12-04T08:54:02.3264063Z * [new branch] gh/nikitaved/10/head -> origin/gh/nikitaved/10/head 2025-12-04T08:54:02.3264136Z * [new branch] gh/nikitaved/10/orig -> origin/gh/nikitaved/10/orig 
2025-12-04T08:54:02.3264208Z * [new branch] gh/nikitaved/11/base -> origin/gh/nikitaved/11/base 2025-12-04T08:54:02.3264279Z * [new branch] gh/nikitaved/11/head -> origin/gh/nikitaved/11/head 2025-12-04T08:54:02.3264351Z * [new branch] gh/nikitaved/11/orig -> origin/gh/nikitaved/11/orig 2025-12-04T08:54:02.3264422Z * [new branch] gh/nikitaved/12/base -> origin/gh/nikitaved/12/base 2025-12-04T08:54:02.3264494Z * [new branch] gh/nikitaved/12/head -> origin/gh/nikitaved/12/head 2025-12-04T08:54:02.3264565Z * [new branch] gh/nikitaved/12/orig -> origin/gh/nikitaved/12/orig 2025-12-04T08:54:02.3264637Z * [new branch] gh/nikitaved/13/base -> origin/gh/nikitaved/13/base 2025-12-04T08:54:02.3264708Z * [new branch] gh/nikitaved/13/head -> origin/gh/nikitaved/13/head 2025-12-04T08:54:02.3264778Z * [new branch] gh/nikitaved/13/orig -> origin/gh/nikitaved/13/orig 2025-12-04T08:54:02.3264878Z * [new branch] gh/nikitaved/14/base -> origin/gh/nikitaved/14/base 2025-12-04T08:54:02.3264951Z * [new branch] gh/nikitaved/14/head -> origin/gh/nikitaved/14/head 2025-12-04T08:54:02.3265022Z * [new branch] gh/nikitaved/14/orig -> origin/gh/nikitaved/14/orig 2025-12-04T08:54:02.3265093Z * [new branch] gh/nikitaved/15/base -> origin/gh/nikitaved/15/base 2025-12-04T08:54:02.3265167Z * [new branch] gh/nikitaved/15/head -> origin/gh/nikitaved/15/head 2025-12-04T08:54:02.3265238Z * [new branch] gh/nikitaved/15/orig -> origin/gh/nikitaved/15/orig 2025-12-04T08:54:02.3265311Z * [new branch] gh/nikitaved/16/base -> origin/gh/nikitaved/16/base 2025-12-04T08:54:02.3265383Z * [new branch] gh/nikitaved/16/head -> origin/gh/nikitaved/16/head 2025-12-04T08:54:02.3265454Z * [new branch] gh/nikitaved/16/orig -> origin/gh/nikitaved/16/orig 2025-12-04T08:54:02.3265526Z * [new branch] gh/nikitaved/2/base -> origin/gh/nikitaved/2/base 2025-12-04T08:54:02.3265598Z * [new branch] gh/nikitaved/2/head -> origin/gh/nikitaved/2/head 2025-12-04T08:54:02.3265669Z * [new branch] gh/nikitaved/2/orig -> 
origin/gh/nikitaved/2/orig 2025-12-04T08:54:02.3265739Z * [new branch] gh/nikitaved/4/base -> origin/gh/nikitaved/4/base 2025-12-04T08:54:02.3265810Z * [new branch] gh/nikitaved/4/head -> origin/gh/nikitaved/4/head 2025-12-04T08:54:02.3265880Z * [new branch] gh/nikitaved/4/orig -> origin/gh/nikitaved/4/orig 2025-12-04T08:54:02.3265951Z * [new branch] gh/nikitaved/5/base -> origin/gh/nikitaved/5/base 2025-12-04T08:54:02.3266024Z * [new branch] gh/nikitaved/5/head -> origin/gh/nikitaved/5/head 2025-12-04T08:54:02.3266094Z * [new branch] gh/nikitaved/5/orig -> origin/gh/nikitaved/5/orig 2025-12-04T08:54:02.3266165Z * [new branch] gh/nikitaved/6/base -> origin/gh/nikitaved/6/base 2025-12-04T08:54:02.3266236Z * [new branch] gh/nikitaved/6/head -> origin/gh/nikitaved/6/head 2025-12-04T08:54:02.3266307Z * [new branch] gh/nikitaved/6/orig -> origin/gh/nikitaved/6/orig 2025-12-04T08:54:02.3266378Z * [new branch] gh/nikitaved/8/base -> origin/gh/nikitaved/8/base 2025-12-04T08:54:02.3266448Z * [new branch] gh/nikitaved/8/head -> origin/gh/nikitaved/8/head 2025-12-04T08:54:02.3266517Z * [new branch] gh/nikitaved/8/orig -> origin/gh/nikitaved/8/orig 2025-12-04T08:54:02.3266617Z * [new branch] gh/nikitaved/9/base -> origin/gh/nikitaved/9/base 2025-12-04T08:54:02.3266687Z * [new branch] gh/nikitaved/9/head -> origin/gh/nikitaved/9/head 2025-12-04T08:54:02.3266757Z * [new branch] gh/nikitaved/9/orig -> origin/gh/nikitaved/9/orig 2025-12-04T08:54:02.3266827Z * [new branch] gh/oulgen/10/base -> origin/gh/oulgen/10/base 2025-12-04T08:54:02.3266895Z * [new branch] gh/oulgen/10/head -> origin/gh/oulgen/10/head 2025-12-04T08:54:02.3266962Z * [new branch] gh/oulgen/10/orig -> origin/gh/oulgen/10/orig 2025-12-04T08:54:02.3267030Z * [new branch] gh/oulgen/11/base -> origin/gh/oulgen/11/base 2025-12-04T08:54:02.3267096Z * [new branch] gh/oulgen/11/head -> origin/gh/oulgen/11/head 2025-12-04T08:54:02.3267162Z * [new branch] gh/oulgen/11/orig -> origin/gh/oulgen/11/orig 
2025-12-04T08:54:02.3267232Z * [new branch] gh/oulgen/12/base -> origin/gh/oulgen/12/base 2025-12-04T08:54:02.3267298Z * [new branch] gh/oulgen/12/head -> origin/gh/oulgen/12/head 2025-12-04T08:54:02.3267364Z * [new branch] gh/oulgen/12/orig -> origin/gh/oulgen/12/orig 2025-12-04T08:54:02.3267466Z * [new branch] gh/oulgen/13/base -> origin/gh/oulgen/13/base 2025-12-04T08:54:02.3267532Z * [new branch] gh/oulgen/13/head -> origin/gh/oulgen/13/head 2025-12-04T08:54:02.3267598Z * [new branch] gh/oulgen/13/orig -> origin/gh/oulgen/13/orig 2025-12-04T08:54:02.3267667Z * [new branch] gh/oulgen/14/base -> origin/gh/oulgen/14/base 2025-12-04T08:54:02.3267732Z * [new branch] gh/oulgen/14/head -> origin/gh/oulgen/14/head 2025-12-04T08:54:02.3267800Z * [new branch] gh/oulgen/14/orig -> origin/gh/oulgen/14/orig 2025-12-04T08:54:02.3267867Z * [new branch] gh/oulgen/15/base -> origin/gh/oulgen/15/base 2025-12-04T08:54:02.3267933Z * [new branch] gh/oulgen/15/head -> origin/gh/oulgen/15/head 2025-12-04T08:54:02.3268001Z * [new branch] gh/oulgen/15/orig -> origin/gh/oulgen/15/orig 2025-12-04T08:54:02.3268068Z * [new branch] gh/oulgen/16/base -> origin/gh/oulgen/16/base 2025-12-04T08:54:02.3268135Z * [new branch] gh/oulgen/16/head -> origin/gh/oulgen/16/head 2025-12-04T08:54:02.3268202Z * [new branch] gh/oulgen/16/orig -> origin/gh/oulgen/16/orig 2025-12-04T08:54:02.3268268Z * [new branch] gh/oulgen/17/base -> origin/gh/oulgen/17/base 2025-12-04T08:54:02.3268334Z * [new branch] gh/oulgen/17/head -> origin/gh/oulgen/17/head 2025-12-04T08:54:02.3268401Z * [new branch] gh/oulgen/17/orig -> origin/gh/oulgen/17/orig 2025-12-04T08:54:02.3268470Z * [new branch] gh/oulgen/18/base -> origin/gh/oulgen/18/base 2025-12-04T08:54:02.3268535Z * [new branch] gh/oulgen/18/head -> origin/gh/oulgen/18/head 2025-12-04T08:54:02.3268602Z * [new branch] gh/oulgen/18/orig -> origin/gh/oulgen/18/orig 2025-12-04T08:54:02.3268668Z * [new branch] gh/oulgen/19/base -> origin/gh/oulgen/19/base 
2025-12-04T08:54:02.3268735Z * [new branch] gh/oulgen/19/head -> origin/gh/oulgen/19/head 2025-12-04T08:54:02.3268803Z * [new branch] gh/oulgen/19/orig -> origin/gh/oulgen/19/orig 2025-12-04T08:54:02.3268869Z * [new branch] gh/oulgen/20/base -> origin/gh/oulgen/20/base 2025-12-04T08:54:02.3268935Z * [new branch] gh/oulgen/20/head -> origin/gh/oulgen/20/head 2025-12-04T08:54:02.3269002Z * [new branch] gh/oulgen/20/orig -> origin/gh/oulgen/20/orig 2025-12-04T08:54:02.3269096Z * [new branch] gh/oulgen/21/base -> origin/gh/oulgen/21/base 2025-12-04T08:54:02.3269163Z * [new branch] gh/oulgen/21/head -> origin/gh/oulgen/21/head 2025-12-04T08:54:02.3269231Z * [new branch] gh/oulgen/21/orig -> origin/gh/oulgen/21/orig 2025-12-04T08:54:02.3269297Z * [new branch] gh/oulgen/22/base -> origin/gh/oulgen/22/base 2025-12-04T08:54:02.3269366Z * [new branch] gh/oulgen/22/head -> origin/gh/oulgen/22/head 2025-12-04T08:54:02.3269432Z * [new branch] gh/oulgen/22/orig -> origin/gh/oulgen/22/orig 2025-12-04T08:54:02.3269498Z * [new branch] gh/oulgen/23/base -> origin/gh/oulgen/23/base 2025-12-04T08:54:02.3269566Z * [new branch] gh/oulgen/23/head -> origin/gh/oulgen/23/head 2025-12-04T08:54:02.3269632Z * [new branch] gh/oulgen/23/orig -> origin/gh/oulgen/23/orig 2025-12-04T08:54:02.3269699Z * [new branch] gh/oulgen/24/base -> origin/gh/oulgen/24/base 2025-12-04T08:54:02.3269766Z * [new branch] gh/oulgen/24/head -> origin/gh/oulgen/24/head 2025-12-04T08:54:02.3269831Z * [new branch] gh/oulgen/24/orig -> origin/gh/oulgen/24/orig 2025-12-04T08:54:02.3269896Z * [new branch] gh/oulgen/25/base -> origin/gh/oulgen/25/base 2025-12-04T08:54:02.3269998Z * [new branch] gh/oulgen/25/head -> origin/gh/oulgen/25/head 2025-12-04T08:54:02.3270065Z * [new branch] gh/oulgen/25/orig -> origin/gh/oulgen/25/orig 2025-12-04T08:54:02.3270152Z * [new branch] gh/oulgen/26/base -> origin/gh/oulgen/26/base 2025-12-04T08:54:02.3270222Z * [new branch] gh/oulgen/26/head -> origin/gh/oulgen/26/head 
2025-12-04T08:54:02.3270288Z * [new branch] gh/oulgen/26/orig -> origin/gh/oulgen/26/orig 2025-12-04T08:54:02.3270355Z * [new branch] gh/oulgen/4/base -> origin/gh/oulgen/4/base 2025-12-04T08:54:02.3270425Z * [new branch] gh/oulgen/4/head -> origin/gh/oulgen/4/head 2025-12-04T08:54:02.3270492Z * [new branch] gh/oulgen/4/orig -> origin/gh/oulgen/4/orig 2025-12-04T08:54:02.3270558Z * [new branch] gh/oulgen/7/base -> origin/gh/oulgen/7/base 2025-12-04T08:54:02.3270627Z * [new branch] gh/oulgen/7/head -> origin/gh/oulgen/7/head 2025-12-04T08:54:02.3270693Z * [new branch] gh/oulgen/7/orig -> origin/gh/oulgen/7/orig 2025-12-04T08:54:02.3270758Z * [new branch] gh/oulgen/8/base -> origin/gh/oulgen/8/base 2025-12-04T08:54:02.3270825Z * [new branch] gh/oulgen/8/head -> origin/gh/oulgen/8/head 2025-12-04T08:54:02.3270890Z * [new branch] gh/oulgen/8/orig -> origin/gh/oulgen/8/orig 2025-12-04T08:54:02.3270955Z * [new branch] gh/oulgen/9/base -> origin/gh/oulgen/9/base 2025-12-04T08:54:02.3271022Z * [new branch] gh/oulgen/9/head -> origin/gh/oulgen/9/head 2025-12-04T08:54:02.3271088Z * [new branch] gh/oulgen/9/orig -> origin/gh/oulgen/9/orig 2025-12-04T08:54:02.3271193Z * [new branch] gh/patvig/mtia-serialization -> origin/gh/patvig/mtia-serialization 2025-12-04T08:54:02.3271263Z * [new branch] gh/pearu/108/base -> origin/gh/pearu/108/base 2025-12-04T08:54:02.3271331Z * [new branch] gh/pearu/108/head -> origin/gh/pearu/108/head 2025-12-04T08:54:02.3271399Z * [new branch] gh/pearu/108/orig -> origin/gh/pearu/108/orig 2025-12-04T08:54:02.3271465Z * [new branch] gh/pearu/109/base -> origin/gh/pearu/109/base 2025-12-04T08:54:02.3271530Z * [new branch] gh/pearu/109/head -> origin/gh/pearu/109/head 2025-12-04T08:54:02.3271598Z * [new branch] gh/pearu/109/orig -> origin/gh/pearu/109/orig 2025-12-04T08:54:02.3271701Z * [new branch] gh/pearu/110/base -> origin/gh/pearu/110/base 2025-12-04T08:54:02.3271767Z * [new branch] gh/pearu/110/head -> origin/gh/pearu/110/head 
2025-12-04T08:54:02.3271834Z * [new branch] gh/pearu/110/orig -> origin/gh/pearu/110/orig 2025-12-04T08:54:02.3271902Z * [new branch] gh/pearu/111/base -> origin/gh/pearu/111/base 2025-12-04T08:54:02.3271967Z * [new branch] gh/pearu/111/head -> origin/gh/pearu/111/head 2025-12-04T08:54:02.3272035Z * [new branch] gh/pearu/111/orig -> origin/gh/pearu/111/orig 2025-12-04T08:54:02.3272101Z * [new branch] gh/pearu/112/base -> origin/gh/pearu/112/base 2025-12-04T08:54:02.3272166Z * [new branch] gh/pearu/112/head -> origin/gh/pearu/112/head 2025-12-04T08:54:02.3272235Z * [new branch] gh/pearu/112/orig -> origin/gh/pearu/112/orig 2025-12-04T08:54:02.3272302Z * [new branch] gh/pearu/115/base -> origin/gh/pearu/115/base 2025-12-04T08:54:02.3272368Z * [new branch] gh/pearu/115/head -> origin/gh/pearu/115/head 2025-12-04T08:54:02.3272436Z * [new branch] gh/pearu/115/orig -> origin/gh/pearu/115/orig 2025-12-04T08:54:02.3272527Z * [new branch] gh/pearu/116/base -> origin/gh/pearu/116/base 2025-12-04T08:54:02.3272593Z * [new branch] gh/pearu/116/head -> origin/gh/pearu/116/head 2025-12-04T08:54:02.3272661Z * [new branch] gh/pearu/116/orig -> origin/gh/pearu/116/orig 2025-12-04T08:54:02.3272728Z * [new branch] gh/pearu/117/base -> origin/gh/pearu/117/base 2025-12-04T08:54:02.3272796Z * [new branch] gh/pearu/117/head -> origin/gh/pearu/117/head 2025-12-04T08:54:02.3272862Z * [new branch] gh/pearu/117/orig -> origin/gh/pearu/117/orig 2025-12-04T08:54:02.3272929Z * [new branch] gh/pearu/118/base -> origin/gh/pearu/118/base 2025-12-04T08:54:02.3272996Z * [new branch] gh/pearu/118/head -> origin/gh/pearu/118/head 2025-12-04T08:54:02.3273061Z * [new branch] gh/pearu/118/orig -> origin/gh/pearu/118/orig 2025-12-04T08:54:02.3273127Z * [new branch] gh/pearu/119/base -> origin/gh/pearu/119/base 2025-12-04T08:54:02.3273195Z * [new branch] gh/pearu/119/head -> origin/gh/pearu/119/head 2025-12-04T08:54:02.3273260Z * [new branch] gh/pearu/119/orig -> origin/gh/pearu/119/orig 
2025-12-04T08:54:02.3273326Z * [new branch] gh/pearu/139/base -> origin/gh/pearu/139/base 2025-12-04T08:54:02.3273393Z * [new branch] gh/pearu/139/head -> origin/gh/pearu/139/head 2025-12-04T08:54:02.3273459Z * [new branch] gh/pearu/139/orig -> origin/gh/pearu/139/orig 2025-12-04T08:54:02.3273526Z * [new branch] gh/pearu/140/base -> origin/gh/pearu/140/base 2025-12-04T08:54:02.3273594Z * [new branch] gh/pearu/140/head -> origin/gh/pearu/140/head 2025-12-04T08:54:02.3273660Z * [new branch] gh/pearu/140/orig -> origin/gh/pearu/140/orig 2025-12-04T08:54:02.3273725Z * [new branch] gh/pearu/142/base -> origin/gh/pearu/142/base 2025-12-04T08:54:02.3273794Z * [new branch] gh/pearu/142/head -> origin/gh/pearu/142/head 2025-12-04T08:54:02.3273860Z * [new branch] gh/pearu/142/orig -> origin/gh/pearu/142/orig 2025-12-04T08:54:02.3273926Z * [new branch] gh/pearu/143/base -> origin/gh/pearu/143/base 2025-12-04T08:54:02.3273994Z * [new branch] gh/pearu/143/head -> origin/gh/pearu/143/head 2025-12-04T08:54:02.3274061Z * [new branch] gh/pearu/143/orig -> origin/gh/pearu/143/orig 2025-12-04T08:54:02.3274158Z * [new branch] gh/pearu/147/base -> origin/gh/pearu/147/base 2025-12-04T08:54:02.3274226Z * [new branch] gh/pearu/147/head -> origin/gh/pearu/147/head 2025-12-04T08:54:02.3274292Z * [new branch] gh/pearu/147/orig -> origin/gh/pearu/147/orig 2025-12-04T08:54:02.3274360Z * [new branch] gh/pearu/149/base -> origin/gh/pearu/149/base 2025-12-04T08:54:02.3274428Z * [new branch] gh/pearu/149/head -> origin/gh/pearu/149/head 2025-12-04T08:54:02.3274494Z * [new branch] gh/pearu/149/orig -> origin/gh/pearu/149/orig 2025-12-04T08:54:02.3274562Z * [new branch] gh/pearu/150/base -> origin/gh/pearu/150/base 2025-12-04T08:54:02.3274629Z * [new branch] gh/pearu/150/head -> origin/gh/pearu/150/head 2025-12-04T08:54:02.3274695Z * [new branch] gh/pearu/150/orig -> origin/gh/pearu/150/orig 2025-12-04T08:54:02.3274763Z * [new branch] gh/pearu/151/base -> origin/gh/pearu/151/base 
2025-12-04T08:54:02.3274829Z * [new branch] gh/pearu/151/head -> origin/gh/pearu/151/head 2025-12-04T08:54:02.3274895Z * [new branch] gh/pearu/151/orig -> origin/gh/pearu/151/orig 2025-12-04T08:54:02.3274962Z * [new branch] gh/pearu/152/base -> origin/gh/pearu/152/base 2025-12-04T08:54:02.3275057Z * [new branch] gh/pearu/152/head -> origin/gh/pearu/152/head 2025-12-04T08:54:02.3275124Z * [new branch] gh/pearu/152/orig -> origin/gh/pearu/152/orig 2025-12-04T08:54:02.3275191Z * [new branch] gh/pearu/153/base -> origin/gh/pearu/153/base 2025-12-04T08:54:02.3275258Z * [new branch] gh/pearu/153/head -> origin/gh/pearu/153/head 2025-12-04T08:54:02.3275324Z * [new branch] gh/pearu/153/orig -> origin/gh/pearu/153/orig 2025-12-04T08:54:02.3275391Z * [new branch] gh/pearu/154/base -> origin/gh/pearu/154/base 2025-12-04T08:54:02.3275460Z * [new branch] gh/pearu/154/head -> origin/gh/pearu/154/head 2025-12-04T08:54:02.3275526Z * [new branch] gh/pearu/154/orig -> origin/gh/pearu/154/orig 2025-12-04T08:54:02.3275596Z * [new branch] gh/pearu/155/base -> origin/gh/pearu/155/base 2025-12-04T08:54:02.3275664Z * [new branch] gh/pearu/155/head -> origin/gh/pearu/155/head 2025-12-04T08:54:02.3275731Z * [new branch] gh/pearu/155/orig -> origin/gh/pearu/155/orig 2025-12-04T08:54:02.3275799Z * [new branch] gh/pearu/156/base -> origin/gh/pearu/156/base 2025-12-04T08:54:02.3275867Z * [new branch] gh/pearu/156/head -> origin/gh/pearu/156/head 2025-12-04T08:54:02.3275935Z * [new branch] gh/pearu/156/orig -> origin/gh/pearu/156/orig 2025-12-04T08:54:02.3276002Z * [new branch] gh/pearu/56/base -> origin/gh/pearu/56/base 2025-12-04T08:54:02.3276070Z * [new branch] gh/pearu/56/head -> origin/gh/pearu/56/head 2025-12-04T08:54:02.3276139Z * [new branch] gh/pearu/56/orig -> origin/gh/pearu/56/orig 2025-12-04T08:54:02.3276205Z * [new branch] gh/pearu/97/base -> origin/gh/pearu/97/base 2025-12-04T08:54:02.3276272Z * [new branch] gh/pearu/97/head -> origin/gh/pearu/97/head 2025-12-04T08:54:02.3276339Z 
* [new branch] gh/pearu/97/orig -> origin/gh/pearu/97/orig 2025-12-04T08:54:02.3276413Z * [new branch] gh/pianpwk/21/base -> origin/gh/pianpwk/21/base 2025-12-04T08:54:02.3276486Z * [new branch] gh/pianpwk/21/head -> origin/gh/pianpwk/21/head 2025-12-04T08:54:02.3276558Z * [new branch] gh/pianpwk/28/base -> origin/gh/pianpwk/28/base 2025-12-04T08:54:02.3276629Z * [new branch] gh/pianpwk/28/head -> origin/gh/pianpwk/28/head 2025-12-04T08:54:02.3276724Z * [new branch] gh/pianpwk/28/orig -> origin/gh/pianpwk/28/orig 2025-12-04T08:54:02.3276795Z * [new branch] gh/pianpwk/29/base -> origin/gh/pianpwk/29/base 2025-12-04T08:54:02.3276865Z * [new branch] gh/pianpwk/29/head -> origin/gh/pianpwk/29/head 2025-12-04T08:54:02.3276936Z * [new branch] gh/pianpwk/29/orig -> origin/gh/pianpwk/29/orig 2025-12-04T08:54:02.3277006Z * [new branch] gh/pianpwk/30/base -> origin/gh/pianpwk/30/base 2025-12-04T08:54:02.3277075Z * [new branch] gh/pianpwk/30/head -> origin/gh/pianpwk/30/head 2025-12-04T08:54:02.3277145Z * [new branch] gh/pianpwk/30/orig -> origin/gh/pianpwk/30/orig 2025-12-04T08:54:02.3277214Z * [new branch] gh/pianpwk/31/base -> origin/gh/pianpwk/31/base 2025-12-04T08:54:02.3277283Z * [new branch] gh/pianpwk/31/head -> origin/gh/pianpwk/31/head 2025-12-04T08:54:02.3277353Z * [new branch] gh/pianpwk/31/orig -> origin/gh/pianpwk/31/orig 2025-12-04T08:54:02.3277425Z * [new branch] gh/pianpwk/32/base -> origin/gh/pianpwk/32/base 2025-12-04T08:54:02.3277494Z * [new branch] gh/pianpwk/32/head -> origin/gh/pianpwk/32/head 2025-12-04T08:54:02.3277594Z * [new branch] gh/pianpwk/32/orig -> origin/gh/pianpwk/32/orig 2025-12-04T08:54:02.3277665Z * [new branch] gh/pianpwk/33/base -> origin/gh/pianpwk/33/base 2025-12-04T08:54:02.3277735Z * [new branch] gh/pianpwk/33/head -> origin/gh/pianpwk/33/head 2025-12-04T08:54:02.3277806Z * [new branch] gh/pianpwk/33/orig -> origin/gh/pianpwk/33/orig 2025-12-04T08:54:02.3277875Z * [new branch] gh/pianpwk/34/base -> origin/gh/pianpwk/34/base 
2025-12-04T08:54:02.3277944Z * [new branch] gh/pianpwk/34/head -> origin/gh/pianpwk/34/head 2025-12-04T08:54:02.3278017Z * [new branch] gh/pianpwk/34/orig -> origin/gh/pianpwk/34/orig 2025-12-04T08:54:02.3278086Z * [new branch] gh/pianpwk/35/base -> origin/gh/pianpwk/35/base 2025-12-04T08:54:02.3278155Z * [new branch] gh/pianpwk/35/head -> origin/gh/pianpwk/35/head 2025-12-04T08:54:02.3278226Z * [new branch] gh/pianpwk/35/orig -> origin/gh/pianpwk/35/orig 2025-12-04T08:54:02.3278293Z * [new branch] gh/rec/141/base -> origin/gh/rec/141/base 2025-12-04T08:54:02.3278359Z * [new branch] gh/rec/141/head -> origin/gh/rec/141/head 2025-12-04T08:54:02.3278428Z * [new branch] gh/rec/153/base -> origin/gh/rec/153/base 2025-12-04T08:54:02.3278493Z * [new branch] gh/rec/153/head -> origin/gh/rec/153/head 2025-12-04T08:54:02.3278556Z * [new branch] gh/rec/153/orig -> origin/gh/rec/153/orig 2025-12-04T08:54:02.3282364Z * [new branch] gh/rec/154/base -> origin/gh/rec/154/base 2025-12-04T08:54:02.3282453Z * [new branch] gh/rec/154/head -> origin/gh/rec/154/head 2025-12-04T08:54:02.3282521Z * [new branch] gh/rec/154/orig -> origin/gh/rec/154/orig 2025-12-04T08:54:02.3282585Z * [new branch] gh/rec/164/base -> origin/gh/rec/164/base 2025-12-04T08:54:02.3282655Z * [new branch] gh/rec/164/head -> origin/gh/rec/164/head 2025-12-04T08:54:02.3282718Z * [new branch] gh/rec/164/orig -> origin/gh/rec/164/orig 2025-12-04T08:54:02.3282782Z * [new branch] gh/rec/166/base -> origin/gh/rec/166/base 2025-12-04T08:54:02.3282849Z * [new branch] gh/rec/166/head -> origin/gh/rec/166/head 2025-12-04T08:54:02.3282914Z * [new branch] gh/rec/166/orig -> origin/gh/rec/166/orig 2025-12-04T08:54:02.3282981Z * [new branch] gh/rec/167/base -> origin/gh/rec/167/base 2025-12-04T08:54:02.3283106Z * [new branch] gh/rec/167/head -> origin/gh/rec/167/head 2025-12-04T08:54:02.3283171Z * [new branch] gh/rec/167/orig -> origin/gh/rec/167/orig 2025-12-04T08:54:02.3283234Z * [new branch] gh/rec/168/base -> 
origin/gh/rec/168/base 2025-12-04T08:54:02.3283301Z * [new branch] gh/rec/168/head -> origin/gh/rec/168/head 2025-12-04T08:54:02.3283365Z * [new branch] gh/rec/168/orig -> origin/gh/rec/168/orig 2025-12-04T08:54:02.3283435Z * [new branch] gh/rec/169/base -> origin/gh/rec/169/base 2025-12-04T08:54:02.3283500Z * [new branch] gh/rec/169/head -> origin/gh/rec/169/head 2025-12-04T08:54:02.3283564Z * [new branch] gh/rec/169/orig -> origin/gh/rec/169/orig 2025-12-04T08:54:02.3283630Z * [new branch] gh/rec/170/base -> origin/gh/rec/170/base 2025-12-04T08:54:02.3283696Z * [new branch] gh/rec/170/head -> origin/gh/rec/170/head 2025-12-04T08:54:02.3283760Z * [new branch] gh/rec/170/orig -> origin/gh/rec/170/orig 2025-12-04T08:54:02.3283826Z * [new branch] gh/rec/171/base -> origin/gh/rec/171/base 2025-12-04T08:54:02.3283928Z * [new branch] gh/rec/171/head -> origin/gh/rec/171/head 2025-12-04T08:54:02.3283992Z * [new branch] gh/rec/171/orig -> origin/gh/rec/171/orig 2025-12-04T08:54:02.3284059Z * [new branch] gh/rec/172/base -> origin/gh/rec/172/base 2025-12-04T08:54:02.3284124Z * [new branch] gh/rec/172/head -> origin/gh/rec/172/head 2025-12-04T08:54:02.3284188Z * [new branch] gh/rec/172/orig -> origin/gh/rec/172/orig 2025-12-04T08:54:02.3284257Z * [new branch] gh/rec/173/base -> origin/gh/rec/173/base 2025-12-04T08:54:02.3284323Z * [new branch] gh/rec/173/head -> origin/gh/rec/173/head 2025-12-04T08:54:02.3284387Z * [new branch] gh/rec/173/orig -> origin/gh/rec/173/orig 2025-12-04T08:54:02.3284455Z * [new branch] gh/rec/174/base -> origin/gh/rec/174/base 2025-12-04T08:54:02.3284519Z * [new branch] gh/rec/174/head -> origin/gh/rec/174/head 2025-12-04T08:54:02.3284585Z * [new branch] gh/rec/174/orig -> origin/gh/rec/174/orig 2025-12-04T08:54:02.3284653Z * [new branch] gh/rec/175/base -> origin/gh/rec/175/base 2025-12-04T08:54:02.3284716Z * [new branch] gh/rec/175/head -> origin/gh/rec/175/head 2025-12-04T08:54:02.3284778Z * [new branch] gh/rec/175/orig -> 
origin/gh/rec/175/orig 2025-12-04T08:54:02.3284844Z * [new branch] gh/rec/176/base -> origin/gh/rec/176/base 2025-12-04T08:54:02.3284909Z * [new branch] gh/rec/176/head -> origin/gh/rec/176/head 2025-12-04T08:54:02.3284974Z * [new branch] gh/rec/176/orig -> origin/gh/rec/176/orig 2025-12-04T08:54:02.3285038Z * [new branch] gh/rec/177/base -> origin/gh/rec/177/base 2025-12-04T08:54:02.3285101Z * [new branch] gh/rec/177/head -> origin/gh/rec/177/head 2025-12-04T08:54:02.3285168Z * [new branch] gh/rec/177/orig -> origin/gh/rec/177/orig 2025-12-04T08:54:02.3285261Z * [new branch] gh/robert-hardwick/3/base -> origin/gh/robert-hardwick/3/base 2025-12-04T08:54:02.3285346Z * [new branch] gh/robert-hardwick/3/head -> origin/gh/robert-hardwick/3/head 2025-12-04T08:54:02.3285431Z * [new branch] gh/robert-hardwick/3/orig -> origin/gh/robert-hardwick/3/orig 2025-12-04T08:54:02.3285513Z * [new branch] gh/robert-hardwick/4/base -> origin/gh/robert-hardwick/4/base 2025-12-04T08:54:02.3285620Z * [new branch] gh/robert-hardwick/4/head -> origin/gh/robert-hardwick/4/head 2025-12-04T08:54:02.3285704Z * [new branch] gh/robert-hardwick/4/orig -> origin/gh/robert-hardwick/4/orig 2025-12-04T08:54:02.3285786Z * [new branch] gh/robert-hardwick/5/base -> origin/gh/robert-hardwick/5/base 2025-12-04T08:54:02.3285869Z * [new branch] gh/robert-hardwick/5/head -> origin/gh/robert-hardwick/5/head 2025-12-04T08:54:02.3285957Z * [new branch] gh/robert-hardwick/5/orig -> origin/gh/robert-hardwick/5/orig 2025-12-04T08:54:02.3286038Z * [new branch] gh/robert-hardwick/6/base -> origin/gh/robert-hardwick/6/base 2025-12-04T08:54:02.3286121Z * [new branch] gh/robert-hardwick/6/head -> origin/gh/robert-hardwick/6/head 2025-12-04T08:54:02.3286208Z * [new branch] gh/robert-hardwick/6/orig -> origin/gh/robert-hardwick/6/orig 2025-12-04T08:54:02.3286291Z * [new branch] gh/robert-hardwick/7/base -> origin/gh/robert-hardwick/7/base 2025-12-04T08:54:02.3286374Z * [new branch] gh/robert-hardwick/7/head -> 
origin/gh/robert-hardwick/7/head 2025-12-04T08:54:02.3286458Z * [new branch] gh/robert-hardwick/7/orig -> origin/gh/robert-hardwick/7/orig 2025-12-04T08:54:02.3286540Z * [new branch] gh/robert-hardwick/8/base -> origin/gh/robert-hardwick/8/base 2025-12-04T08:54:02.3286652Z * [new branch] gh/robert-hardwick/8/head -> origin/gh/robert-hardwick/8/head 2025-12-04T08:54:02.3286735Z * [new branch] gh/robert-hardwick/8/orig -> origin/gh/robert-hardwick/8/orig 2025-12-04T08:54:02.3286818Z * [new branch] gh/robert-hardwick/9/base -> origin/gh/robert-hardwick/9/base 2025-12-04T08:54:02.3286903Z * [new branch] gh/robert-hardwick/9/head -> origin/gh/robert-hardwick/9/head 2025-12-04T08:54:02.3286986Z * [new branch] gh/robert-hardwick/9/orig -> origin/gh/robert-hardwick/9/orig 2025-12-04T08:54:02.3287060Z * [new branch] gh/rtimpe/1/base -> origin/gh/rtimpe/1/base 2025-12-04T08:54:02.3287131Z * [new branch] gh/rtimpe/1/head -> origin/gh/rtimpe/1/head 2025-12-04T08:54:02.3287199Z * [new branch] gh/rtimpe/2/base -> origin/gh/rtimpe/2/base 2025-12-04T08:54:02.3287266Z * [new branch] gh/rtimpe/2/head -> origin/gh/rtimpe/2/head 2025-12-04T08:54:02.3287338Z * [new branch] gh/rtimpe/22/base -> origin/gh/rtimpe/22/base 2025-12-04T08:54:02.3287407Z * [new branch] gh/rtimpe/22/head -> origin/gh/rtimpe/22/head 2025-12-04T08:54:02.3287475Z * [new branch] gh/rtimpe/22/orig -> origin/gh/rtimpe/22/orig 2025-12-04T08:54:02.3287545Z * [new branch] gh/rtimpe/23/base -> origin/gh/rtimpe/23/base 2025-12-04T08:54:02.3287611Z * [new branch] gh/rtimpe/23/head -> origin/gh/rtimpe/23/head 2025-12-04T08:54:02.3287679Z * [new branch] gh/rtimpe/23/orig -> origin/gh/rtimpe/23/orig 2025-12-04T08:54:02.3287746Z * [new branch] gh/rtimpe/24/base -> origin/gh/rtimpe/24/base 2025-12-04T08:54:02.3287813Z * [new branch] gh/rtimpe/24/head -> origin/gh/rtimpe/24/head 2025-12-04T08:54:02.3287880Z * [new branch] gh/rtimpe/24/orig -> origin/gh/rtimpe/24/orig 2025-12-04T08:54:02.3287948Z * [new branch] gh/rtimpe/25/base 
-> origin/gh/rtimpe/25/base 2025-12-04T08:54:02.3288014Z * [new branch] gh/rtimpe/25/head -> origin/gh/rtimpe/25/head 2025-12-04T08:54:02.3288080Z * [new branch] gh/rtimpe/25/orig -> origin/gh/rtimpe/25/orig 2025-12-04T08:54:02.3288147Z * [new branch] gh/rtimpe/26/base -> origin/gh/rtimpe/26/base 2025-12-04T08:54:02.3288214Z * [new branch] gh/rtimpe/26/head -> origin/gh/rtimpe/26/head 2025-12-04T08:54:02.3288304Z * [new branch] gh/rtimpe/26/orig -> origin/gh/rtimpe/26/orig 2025-12-04T08:54:02.3288372Z * [new branch] gh/rtimpe/27/base -> origin/gh/rtimpe/27/base 2025-12-04T08:54:02.3288438Z * [new branch] gh/rtimpe/27/head -> origin/gh/rtimpe/27/head 2025-12-04T08:54:02.3288505Z * [new branch] gh/rtimpe/27/orig -> origin/gh/rtimpe/27/orig 2025-12-04T08:54:02.3288575Z * [new branch] gh/rtimpe/28/base -> origin/gh/rtimpe/28/base 2025-12-04T08:54:02.3288642Z * [new branch] gh/rtimpe/28/head -> origin/gh/rtimpe/28/head 2025-12-04T08:54:02.3288709Z * [new branch] gh/rtimpe/28/orig -> origin/gh/rtimpe/28/orig 2025-12-04T08:54:02.3288777Z * [new branch] gh/rtimpe/29/base -> origin/gh/rtimpe/29/base 2025-12-04T08:54:02.3288844Z * [new branch] gh/rtimpe/29/head -> origin/gh/rtimpe/29/head 2025-12-04T08:54:02.3288910Z * [new branch] gh/rtimpe/29/orig -> origin/gh/rtimpe/29/orig 2025-12-04T08:54:02.3288980Z * [new branch] gh/rtimpe/3/base -> origin/gh/rtimpe/3/base 2025-12-04T08:54:02.3289047Z * [new branch] gh/rtimpe/3/head -> origin/gh/rtimpe/3/head 2025-12-04T08:54:02.3289114Z * [new branch] gh/rtimpe/30/base -> origin/gh/rtimpe/30/base 2025-12-04T08:54:02.3289209Z * [new branch] gh/rtimpe/30/head -> origin/gh/rtimpe/30/head 2025-12-04T08:54:02.3289277Z * [new branch] gh/rtimpe/30/orig -> origin/gh/rtimpe/30/orig 2025-12-04T08:54:02.3289344Z * [new branch] gh/rtimpe/31/base -> origin/gh/rtimpe/31/base 2025-12-04T08:54:02.3289411Z * [new branch] gh/rtimpe/31/head -> origin/gh/rtimpe/31/head 2025-12-04T08:54:02.3289477Z * [new branch] gh/rtimpe/31/orig -> 
origin/gh/rtimpe/31/orig 2025-12-04T08:54:02.3289544Z * [new branch] gh/rtimpe/32/base -> origin/gh/rtimpe/32/base 2025-12-04T08:54:02.3289612Z * [new branch] gh/rtimpe/32/head -> origin/gh/rtimpe/32/head 2025-12-04T08:54:02.3289679Z * [new branch] gh/rtimpe/32/orig -> origin/gh/rtimpe/32/orig 2025-12-04T08:54:02.3289747Z * [new branch] gh/rtimpe/33/base -> origin/gh/rtimpe/33/base 2025-12-04T08:54:02.3289815Z * [new branch] gh/rtimpe/33/head -> origin/gh/rtimpe/33/head 2025-12-04T08:54:02.3289882Z * [new branch] gh/rtimpe/33/orig -> origin/gh/rtimpe/33/orig 2025-12-04T08:54:02.3289951Z * [new branch] gh/rtimpe/34/base -> origin/gh/rtimpe/34/base 2025-12-04T08:54:02.3290017Z * [new branch] gh/rtimpe/34/head -> origin/gh/rtimpe/34/head 2025-12-04T08:54:02.3290084Z * [new branch] gh/rtimpe/34/orig -> origin/gh/rtimpe/34/orig 2025-12-04T08:54:02.3290200Z * [new branch] gh/rtimpe/35/base -> origin/gh/rtimpe/35/base 2025-12-04T08:54:02.3290268Z * [new branch] gh/rtimpe/35/head -> origin/gh/rtimpe/35/head 2025-12-04T08:54:02.3290334Z * [new branch] gh/rtimpe/35/orig -> origin/gh/rtimpe/35/orig 2025-12-04T08:54:02.3290403Z * [new branch] gh/rtimpe/4/base -> origin/gh/rtimpe/4/base 2025-12-04T08:54:02.3290472Z * [new branch] gh/rtimpe/4/head -> origin/gh/rtimpe/4/head 2025-12-04T08:54:02.3290553Z * [new branch] gh/ruisizhang123/1/base -> origin/gh/ruisizhang123/1/base 2025-12-04T08:54:02.3290633Z * [new branch] gh/ruisizhang123/1/head -> origin/gh/ruisizhang123/1/head 2025-12-04T08:54:02.3290710Z * [new branch] gh/ruisizhang123/1/orig -> origin/gh/ruisizhang123/1/orig 2025-12-04T08:54:02.3290787Z * [new branch] gh/ruisizhang123/4/base -> origin/gh/ruisizhang123/4/base 2025-12-04T08:54:02.3290863Z * [new branch] gh/ruisizhang123/4/head -> origin/gh/ruisizhang123/4/head 2025-12-04T08:54:02.3290988Z * [new branch] gh/ruisizhang123/4/orig -> origin/gh/ruisizhang123/4/orig 2025-12-04T08:54:02.3291065Z * [new branch] gh/ruisizhang123/5/base -> origin/gh/ruisizhang123/5/base 
2025-12-04T08:54:02.3291141Z * [new branch] gh/ruisizhang123/5/head -> origin/gh/ruisizhang123/5/head 2025-12-04T08:54:02.3291218Z * [new branch] gh/ruisizhang123/5/orig -> origin/gh/ruisizhang123/5/orig 2025-12-04T08:54:02.3291295Z * [new branch] gh/ruisizhang123/6/base -> origin/gh/ruisizhang123/6/base 2025-12-04T08:54:02.3291372Z * [new branch] gh/ruisizhang123/6/head -> origin/gh/ruisizhang123/6/head 2025-12-04T08:54:02.3291447Z * [new branch] gh/ruisizhang123/6/orig -> origin/gh/ruisizhang123/6/orig 2025-12-04T08:54:02.3291525Z * [new branch] gh/ruisizhang123/7/base -> origin/gh/ruisizhang123/7/base 2025-12-04T08:54:02.3291602Z * [new branch] gh/ruisizhang123/7/head -> origin/gh/ruisizhang123/7/head 2025-12-04T08:54:02.3291678Z * [new branch] gh/ruisizhang123/7/orig -> origin/gh/ruisizhang123/7/orig 2025-12-04T08:54:02.3291756Z * [new branch] gh/ruisizhang123/8/base -> origin/gh/ruisizhang123/8/base 2025-12-04T08:54:02.3291831Z * [new branch] gh/ruisizhang123/8/head -> origin/gh/ruisizhang123/8/head 2025-12-04T08:54:02.3291955Z * [new branch] gh/ruisizhang123/8/orig -> origin/gh/ruisizhang123/8/orig 2025-12-04T08:54:02.3292032Z * [new branch] gh/ruisizhang123/9/base -> origin/gh/ruisizhang123/9/base 2025-12-04T08:54:02.3292109Z * [new branch] gh/ruisizhang123/9/head -> origin/gh/ruisizhang123/9/head 2025-12-04T08:54:02.3292184Z * [new branch] gh/ruisizhang123/9/orig -> origin/gh/ruisizhang123/9/orig 2025-12-04T08:54:02.3292263Z * [new branch] gh/seemethere/52/base -> origin/gh/seemethere/52/base 2025-12-04T08:54:02.3292340Z * [new branch] gh/seemethere/52/head -> origin/gh/seemethere/52/head 2025-12-04T08:54:02.3292413Z * [new branch] gh/seemethere/52/orig -> origin/gh/seemethere/52/orig 2025-12-04T08:54:02.3292488Z * [new branch] gh/seemethere/53/base -> origin/gh/seemethere/53/base 2025-12-04T08:54:02.3292562Z * [new branch] gh/seemethere/53/head -> origin/gh/seemethere/53/head 2025-12-04T08:54:02.3292636Z * [new branch] gh/seemethere/53/orig -> 
origin/gh/seemethere/53/orig 2025-12-04T08:54:02.3292708Z * [new branch] gh/seemethere/54/base -> origin/gh/seemethere/54/base 2025-12-04T08:54:02.3292781Z * [new branch] gh/seemethere/54/head -> origin/gh/seemethere/54/head 2025-12-04T08:54:02.3292855Z * [new branch] gh/seemethere/54/orig -> origin/gh/seemethere/54/orig 2025-12-04T08:54:02.3292928Z * [new branch] gh/seemethere/55/base -> origin/gh/seemethere/55/base 2025-12-04T08:54:02.3293003Z * [new branch] gh/seemethere/55/head -> origin/gh/seemethere/55/head 2025-12-04T08:54:02.3293076Z * [new branch] gh/seemethere/55/orig -> origin/gh/seemethere/55/orig 2025-12-04T08:54:02.3293149Z * [new branch] gh/seemethere/59/base -> origin/gh/seemethere/59/base 2025-12-04T08:54:02.3293223Z * [new branch] gh/seemethere/59/head -> origin/gh/seemethere/59/head 2025-12-04T08:54:02.3293297Z * [new branch] gh/seemethere/59/orig -> origin/gh/seemethere/59/orig 2025-12-04T08:54:02.3293370Z * [new branch] gh/seemethere/62/base -> origin/gh/seemethere/62/base 2025-12-04T08:54:02.3293442Z * [new branch] gh/seemethere/62/head -> origin/gh/seemethere/62/head 2025-12-04T08:54:02.3293516Z * [new branch] gh/seemethere/62/orig -> origin/gh/seemethere/62/orig 2025-12-04T08:54:02.3293589Z * [new branch] gh/seemethere/63/base -> origin/gh/seemethere/63/base 2025-12-04T08:54:02.3293688Z * [new branch] gh/seemethere/63/head -> origin/gh/seemethere/63/head 2025-12-04T08:54:02.3293762Z * [new branch] gh/seemethere/63/orig -> origin/gh/seemethere/63/orig 2025-12-04T08:54:02.3293834Z * [new branch] gh/seemethere/71/base -> origin/gh/seemethere/71/base 2025-12-04T08:54:02.3293909Z * [new branch] gh/seemethere/71/head -> origin/gh/seemethere/71/head 2025-12-04T08:54:02.3293983Z * [new branch] gh/seemethere/71/orig -> origin/gh/seemethere/71/orig 2025-12-04T08:54:02.3294055Z * [new branch] gh/seemethere/72/base -> origin/gh/seemethere/72/base 2025-12-04T08:54:02.3294129Z * [new branch] gh/seemethere/72/head -> origin/gh/seemethere/72/head 
2025-12-04T08:54:02.3294202Z * [new branch] gh/seemethere/72/orig -> origin/gh/seemethere/72/orig 2025-12-04T08:54:02.3294274Z * [new branch] gh/seemethere/73/base -> origin/gh/seemethere/73/base 2025-12-04T08:54:02.3294351Z * [new branch] gh/seemethere/73/head -> origin/gh/seemethere/73/head 2025-12-04T08:54:02.3294423Z * [new branch] gh/seemethere/73/orig -> origin/gh/seemethere/73/orig 2025-12-04T08:54:02.3294495Z * [new branch] gh/seemethere/74/base -> origin/gh/seemethere/74/base 2025-12-04T08:54:02.3294596Z * [new branch] gh/seemethere/74/head -> origin/gh/seemethere/74/head 2025-12-04T08:54:02.3294669Z * [new branch] gh/seemethere/74/orig -> origin/gh/seemethere/74/orig 2025-12-04T08:54:02.3294742Z * [new branch] gh/seemethere/75/base -> origin/gh/seemethere/75/base 2025-12-04T08:54:02.3294816Z * [new branch] gh/seemethere/75/head -> origin/gh/seemethere/75/head 2025-12-04T08:54:02.3294889Z * [new branch] gh/seemethere/75/orig -> origin/gh/seemethere/75/orig 2025-12-04T08:54:02.3294964Z * [new branch] gh/seemethere/76/base -> origin/gh/seemethere/76/base 2025-12-04T08:54:02.3295038Z * [new branch] gh/seemethere/76/head -> origin/gh/seemethere/76/head 2025-12-04T08:54:02.3295111Z * [new branch] gh/seemethere/76/orig -> origin/gh/seemethere/76/orig 2025-12-04T08:54:02.3295186Z * [new branch] gh/shunting314/145/base -> origin/gh/shunting314/145/base 2025-12-04T08:54:02.3295265Z * [new branch] gh/shunting314/145/head -> origin/gh/shunting314/145/head 2025-12-04T08:54:02.3295340Z * [new branch] gh/shunting314/145/orig -> origin/gh/shunting314/145/orig 2025-12-04T08:54:02.3295414Z * [new branch] gh/shunting314/176/base -> origin/gh/shunting314/176/base 2025-12-04T08:54:02.3295489Z * [new branch] gh/shunting314/176/head -> origin/gh/shunting314/176/head 2025-12-04T08:54:02.3295562Z * [new branch] gh/shunting314/176/orig -> origin/gh/shunting314/176/orig 2025-12-04T08:54:02.3295638Z * [new branch] gh/shunting314/249/base -> origin/gh/shunting314/249/base 
2025-12-04T08:54:02.3295713Z * [new branch] gh/shunting314/249/head -> origin/gh/shunting314/249/head 2025-12-04T08:54:02.3295787Z * [new branch] gh/shunting314/249/orig -> origin/gh/shunting314/249/orig 2025-12-04T08:54:02.3295864Z * [new branch] gh/shunting314/253/base -> origin/gh/shunting314/253/base 2025-12-04T08:54:02.3295938Z * [new branch] gh/shunting314/253/head -> origin/gh/shunting314/253/head 2025-12-04T08:54:02.3296013Z * [new branch] gh/shunting314/253/orig -> origin/gh/shunting314/253/orig 2025-12-04T08:54:02.3296089Z * [new branch] gh/shunting314/256/base -> origin/gh/shunting314/256/base 2025-12-04T08:54:02.3296163Z * [new branch] gh/shunting314/256/head -> origin/gh/shunting314/256/head 2025-12-04T08:54:02.3296237Z * [new branch] gh/shunting314/256/orig -> origin/gh/shunting314/256/orig 2025-12-04T08:54:02.3296343Z * [new branch] gh/shunting314/257/base -> origin/gh/shunting314/257/base 2025-12-04T08:54:02.3296417Z * [new branch] gh/shunting314/257/head -> origin/gh/shunting314/257/head 2025-12-04T08:54:02.3296491Z * [new branch] gh/shunting314/257/orig -> origin/gh/shunting314/257/orig 2025-12-04T08:54:02.3296569Z * [new branch] gh/shunting314/258/base -> origin/gh/shunting314/258/base 2025-12-04T08:54:02.3296643Z * [new branch] gh/shunting314/258/head -> origin/gh/shunting314/258/head 2025-12-04T08:54:02.3296717Z * [new branch] gh/shunting314/258/orig -> origin/gh/shunting314/258/orig 2025-12-04T08:54:02.3296792Z * [new branch] gh/shunting314/259/base -> origin/gh/shunting314/259/base 2025-12-04T08:54:02.3296867Z * [new branch] gh/shunting314/259/head -> origin/gh/shunting314/259/head 2025-12-04T08:54:02.3296941Z * [new branch] gh/shunting314/259/orig -> origin/gh/shunting314/259/orig 2025-12-04T08:54:02.3297018Z * [new branch] gh/shunting314/260/base -> origin/gh/shunting314/260/base 2025-12-04T08:54:02.3297092Z * [new branch] gh/shunting314/260/head -> origin/gh/shunting314/260/head 2025-12-04T08:54:02.3297167Z * [new branch] 
gh/shunting314/260/orig -> origin/gh/shunting314/260/orig 2025-12-04T08:54:02.3297269Z * [new branch] gh/shunting314/261/base -> origin/gh/shunting314/261/base 2025-12-04T08:54:02.3297344Z * [new branch] gh/shunting314/261/head -> origin/gh/shunting314/261/head 2025-12-04T08:54:02.3297421Z * [new branch] gh/shunting314/261/orig -> origin/gh/shunting314/261/orig 2025-12-04T08:54:02.3297496Z * [new branch] gh/shunting314/262/base -> origin/gh/shunting314/262/base 2025-12-04T08:54:02.3297570Z * [new branch] gh/shunting314/262/head -> origin/gh/shunting314/262/head 2025-12-04T08:54:02.3297647Z * [new branch] gh/shunting314/262/orig -> origin/gh/shunting314/262/orig 2025-12-04T08:54:02.3297721Z * [new branch] gh/shunting314/263/base -> origin/gh/shunting314/263/base 2025-12-04T08:54:02.3297795Z * [new branch] gh/shunting314/263/head -> origin/gh/shunting314/263/head 2025-12-04T08:54:02.3297870Z * [new branch] gh/shunting314/263/orig -> origin/gh/shunting314/263/orig 2025-12-04T08:54:02.3297945Z * [new branch] gh/shunting314/264/base -> origin/gh/shunting314/264/base 2025-12-04T08:54:02.3298019Z * [new branch] gh/shunting314/264/head -> origin/gh/shunting314/264/head 2025-12-04T08:54:02.3298095Z * [new branch] gh/shunting314/264/orig -> origin/gh/shunting314/264/orig 2025-12-04T08:54:02.3298168Z * [new branch] gh/shunting314/265/base -> origin/gh/shunting314/265/base 2025-12-04T08:54:02.3298243Z * [new branch] gh/shunting314/265/head -> origin/gh/shunting314/265/head 2025-12-04T08:54:02.3298322Z * [new branch] gh/shunting314/265/orig -> origin/gh/shunting314/265/orig 2025-12-04T08:54:02.3298397Z * [new branch] gh/shunting314/266/base -> origin/gh/shunting314/266/base 2025-12-04T08:54:02.3298471Z * [new branch] gh/shunting314/266/head -> origin/gh/shunting314/266/head 2025-12-04T08:54:02.3298548Z * [new branch] gh/shunting314/266/orig -> origin/gh/shunting314/266/orig 2025-12-04T08:54:02.3298623Z * [new branch] gh/shunting314/267/base -> origin/gh/shunting314/267/base 
2025-12-04T08:54:02.3298698Z * [new branch] gh/shunting314/267/head -> origin/gh/shunting314/267/head 2025-12-04T08:54:02.3298774Z * [new branch] gh/shunting314/267/orig -> origin/gh/shunting314/267/orig 2025-12-04T08:54:02.3298848Z * [new branch] gh/shunting314/268/base -> origin/gh/shunting314/268/base 2025-12-04T08:54:02.3298923Z * [new branch] gh/shunting314/268/head -> origin/gh/shunting314/268/head 2025-12-04T08:54:02.3299029Z * [new branch] gh/shunting314/268/orig -> origin/gh/shunting314/268/orig 2025-12-04T08:54:02.3299104Z * [new branch] gh/shunting314/269/base -> origin/gh/shunting314/269/base 2025-12-04T08:54:02.3299180Z * [new branch] gh/shunting314/269/head -> origin/gh/shunting314/269/head 2025-12-04T08:54:02.3299255Z * [new branch] gh/shunting314/269/orig -> origin/gh/shunting314/269/orig 2025-12-04T08:54:02.3299330Z * [new branch] gh/silverguo/1/base -> origin/gh/silverguo/1/base 2025-12-04T08:54:02.3299404Z * [new branch] gh/silverguo/1/head -> origin/gh/silverguo/1/head 2025-12-04T08:54:02.3299475Z * [new branch] gh/silverguo/2/base -> origin/gh/silverguo/2/base 2025-12-04T08:54:02.3299546Z * [new branch] gh/silverguo/2/head -> origin/gh/silverguo/2/head 2025-12-04T08:54:02.3299620Z * [new branch] gh/silverguo/3/base -> origin/gh/silverguo/3/base 2025-12-04T08:54:02.3299690Z * [new branch] gh/silverguo/3/head -> origin/gh/silverguo/3/head 2025-12-04T08:54:02.3299760Z * [new branch] gh/silverguo/4/base -> origin/gh/silverguo/4/base 2025-12-04T08:54:02.3299831Z * [new branch] gh/silverguo/4/head -> origin/gh/silverguo/4/head 2025-12-04T08:54:02.3299935Z * [new branch] gh/slayton58/39/base -> origin/gh/slayton58/39/base 2025-12-04T08:54:02.3300009Z * [new branch] gh/slayton58/39/head -> origin/gh/slayton58/39/head 2025-12-04T08:54:02.3300081Z * [new branch] gh/slayton58/39/orig -> origin/gh/slayton58/39/orig 2025-12-04T08:54:02.3300201Z * [new branch] gh/slayton58/42/base -> origin/gh/slayton58/42/base 2025-12-04T08:54:02.3300272Z * [new branch] 
gh/slayton58/42/head -> origin/gh/slayton58/42/head 2025-12-04T08:54:02.3300344Z * [new branch] gh/slayton58/42/orig -> origin/gh/slayton58/42/orig 2025-12-04T08:54:02.3300415Z * [new branch] gh/slayton58/43/base -> origin/gh/slayton58/43/base 2025-12-04T08:54:02.3300486Z * [new branch] gh/slayton58/43/head -> origin/gh/slayton58/43/head 2025-12-04T08:54:02.3300557Z * [new branch] gh/slayton58/43/orig -> origin/gh/slayton58/43/orig 2025-12-04T08:54:02.3300628Z * [new branch] gh/slayton58/44/base -> origin/gh/slayton58/44/base 2025-12-04T08:54:02.3300700Z * [new branch] gh/slayton58/44/head -> origin/gh/slayton58/44/head 2025-12-04T08:54:02.3300771Z * [new branch] gh/slayton58/44/orig -> origin/gh/slayton58/44/orig 2025-12-04T08:54:02.3300841Z * [new branch] gh/slayton58/45/base -> origin/gh/slayton58/45/base 2025-12-04T08:54:02.3300912Z * [new branch] gh/slayton58/45/head -> origin/gh/slayton58/45/head 2025-12-04T08:54:02.3300984Z * [new branch] gh/slayton58/45/orig -> origin/gh/slayton58/45/orig 2025-12-04T08:54:02.3301054Z * [new branch] gh/slayton58/46/base -> origin/gh/slayton58/46/base 2025-12-04T08:54:02.3301124Z * [new branch] gh/slayton58/46/head -> origin/gh/slayton58/46/head 2025-12-04T08:54:02.3301195Z * [new branch] gh/slayton58/46/orig -> origin/gh/slayton58/46/orig 2025-12-04T08:54:02.3301265Z * [new branch] gh/slayton58/6/base -> origin/gh/slayton58/6/base 2025-12-04T08:54:02.3301337Z * [new branch] gh/slayton58/6/head -> origin/gh/slayton58/6/head 2025-12-04T08:54:02.3301407Z * [new branch] gh/slayton58/7/base -> origin/gh/slayton58/7/base 2025-12-04T08:54:02.3301476Z * [new branch] gh/slayton58/7/head -> origin/gh/slayton58/7/head 2025-12-04T08:54:02.3301551Z * [new branch] gh/soulitzer/269/base -> origin/gh/soulitzer/269/base 2025-12-04T08:54:02.3301682Z * [new branch] gh/soulitzer/269/head -> origin/gh/soulitzer/269/head 2025-12-04T08:54:02.3301756Z * [new branch] gh/soulitzer/269/orig -> origin/gh/soulitzer/269/orig 2025-12-04T08:54:02.3301830Z 
* [new branch] gh/soulitzer/276/base -> origin/gh/soulitzer/276/base 2025-12-04T08:54:02.3301904Z * [new branch] gh/soulitzer/276/head -> origin/gh/soulitzer/276/head 2025-12-04T08:54:02.3301976Z * [new branch] gh/soulitzer/276/orig -> origin/gh/soulitzer/276/orig 2025-12-04T08:54:02.3302051Z * [new branch] gh/soulitzer/287/base -> origin/gh/soulitzer/287/base 2025-12-04T08:54:02.3302122Z * [new branch] gh/soulitzer/287/head -> origin/gh/soulitzer/287/head 2025-12-04T08:54:02.3302195Z * [new branch] gh/soulitzer/287/orig -> origin/gh/soulitzer/287/orig 2025-12-04T08:54:02.3302268Z * [new branch] gh/soulitzer/296/base -> origin/gh/soulitzer/296/base 2025-12-04T08:54:02.3302341Z * [new branch] gh/soulitzer/296/head -> origin/gh/soulitzer/296/head 2025-12-04T08:54:02.3302414Z * [new branch] gh/soulitzer/296/orig -> origin/gh/soulitzer/296/orig 2025-12-04T08:54:02.3302485Z * [new branch] gh/soulitzer/299/base -> origin/gh/soulitzer/299/base 2025-12-04T08:54:02.3302603Z * [new branch] gh/soulitzer/299/head -> origin/gh/soulitzer/299/head 2025-12-04T08:54:02.3302677Z * [new branch] gh/soulitzer/299/orig -> origin/gh/soulitzer/299/orig 2025-12-04T08:54:02.3302749Z * [new branch] gh/soulitzer/300/base -> origin/gh/soulitzer/300/base 2025-12-04T08:54:02.3302821Z * [new branch] gh/soulitzer/300/head -> origin/gh/soulitzer/300/head 2025-12-04T08:54:02.3302894Z * [new branch] gh/soulitzer/300/orig -> origin/gh/soulitzer/300/orig 2025-12-04T08:54:02.3302966Z * [new branch] gh/soulitzer/301/base -> origin/gh/soulitzer/301/base 2025-12-04T08:54:02.3303039Z * [new branch] gh/soulitzer/301/head -> origin/gh/soulitzer/301/head 2025-12-04T08:54:02.3303112Z * [new branch] gh/soulitzer/301/orig -> origin/gh/soulitzer/301/orig 2025-12-04T08:54:02.3303183Z * [new branch] gh/soulitzer/313/base -> origin/gh/soulitzer/313/base 2025-12-04T08:54:02.3303256Z * [new branch] gh/soulitzer/313/head -> origin/gh/soulitzer/313/head 2025-12-04T08:54:02.3303328Z * [new branch] gh/soulitzer/313/orig -> 
origin/gh/soulitzer/313/orig 2025-12-04T08:54:02.3303401Z * [new branch] gh/soulitzer/319/base -> origin/gh/soulitzer/319/base 2025-12-04T08:54:02.3303472Z * [new branch] gh/soulitzer/319/head -> origin/gh/soulitzer/319/head 2025-12-04T08:54:02.3303546Z * [new branch] gh/soulitzer/319/orig -> origin/gh/soulitzer/319/orig 2025-12-04T08:54:02.3303618Z * [new branch] gh/soulitzer/320/base -> origin/gh/soulitzer/320/base 2025-12-04T08:54:02.3303692Z * [new branch] gh/soulitzer/320/head -> origin/gh/soulitzer/320/head 2025-12-04T08:54:02.3303764Z * [new branch] gh/soulitzer/320/orig -> origin/gh/soulitzer/320/orig 2025-12-04T08:54:02.3303836Z * [new branch] gh/soulitzer/336/base -> origin/gh/soulitzer/336/base 2025-12-04T08:54:02.3303909Z * [new branch] gh/soulitzer/336/head -> origin/gh/soulitzer/336/head 2025-12-04T08:54:02.3303981Z * [new branch] gh/soulitzer/336/orig -> origin/gh/soulitzer/336/orig 2025-12-04T08:54:02.3304053Z * [new branch] gh/soulitzer/347/base -> origin/gh/soulitzer/347/base 2025-12-04T08:54:02.3304125Z * [new branch] gh/soulitzer/347/head -> origin/gh/soulitzer/347/head 2025-12-04T08:54:02.3304197Z * [new branch] gh/soulitzer/347/orig -> origin/gh/soulitzer/347/orig 2025-12-04T08:54:02.3304269Z * [new branch] gh/soulitzer/349/base -> origin/gh/soulitzer/349/base 2025-12-04T08:54:02.3304371Z * [new branch] gh/soulitzer/349/head -> origin/gh/soulitzer/349/head 2025-12-04T08:54:02.3304443Z * [new branch] gh/soulitzer/349/orig -> origin/gh/soulitzer/349/orig 2025-12-04T08:54:02.3304515Z * [new branch] gh/soulitzer/350/base -> origin/gh/soulitzer/350/base 2025-12-04T08:54:02.3304590Z * [new branch] gh/soulitzer/350/head -> origin/gh/soulitzer/350/head 2025-12-04T08:54:02.3304661Z * [new branch] gh/soulitzer/350/orig -> origin/gh/soulitzer/350/orig 2025-12-04T08:54:02.3304734Z * [new branch] gh/soulitzer/351/base -> origin/gh/soulitzer/351/base 2025-12-04T08:54:02.3304807Z * [new branch] gh/soulitzer/351/head -> origin/gh/soulitzer/351/head 
2025-12-04T08:54:02.3304880Z * [new branch] gh/soulitzer/351/orig -> origin/gh/soulitzer/351/orig 2025-12-04T08:54:02.3304953Z * [new branch] gh/soulitzer/353/base -> origin/gh/soulitzer/353/base 2025-12-04T08:54:02.3305028Z * [new branch] gh/soulitzer/353/head -> origin/gh/soulitzer/353/head 2025-12-04T08:54:02.3305099Z * [new branch] gh/soulitzer/353/orig -> origin/gh/soulitzer/353/orig 2025-12-04T08:54:02.3305171Z * [new branch] gh/soulitzer/358/base -> origin/gh/soulitzer/358/base 2025-12-04T08:54:02.3305270Z * [new branch] gh/soulitzer/358/head -> origin/gh/soulitzer/358/head 2025-12-04T08:54:02.3305342Z * [new branch] gh/soulitzer/358/orig -> origin/gh/soulitzer/358/orig 2025-12-04T08:54:02.3305416Z * [new branch] gh/soulitzer/359/base -> origin/gh/soulitzer/359/base 2025-12-04T08:54:02.3305488Z * [new branch] gh/soulitzer/359/head -> origin/gh/soulitzer/359/head 2025-12-04T08:54:02.3305559Z * [new branch] gh/soulitzer/359/orig -> origin/gh/soulitzer/359/orig 2025-12-04T08:54:02.3305633Z * [new branch] gh/soulitzer/374/base -> origin/gh/soulitzer/374/base 2025-12-04T08:54:02.3305705Z * [new branch] gh/soulitzer/374/head -> origin/gh/soulitzer/374/head 2025-12-04T08:54:02.3305777Z * [new branch] gh/soulitzer/374/orig -> origin/gh/soulitzer/374/orig 2025-12-04T08:54:02.3305851Z * [new branch] gh/soulitzer/375/base -> origin/gh/soulitzer/375/base 2025-12-04T08:54:02.3305923Z * [new branch] gh/soulitzer/375/head -> origin/gh/soulitzer/375/head 2025-12-04T08:54:02.3305995Z * [new branch] gh/soulitzer/375/orig -> origin/gh/soulitzer/375/orig 2025-12-04T08:54:02.3306067Z * [new branch] gh/soulitzer/380/base -> origin/gh/soulitzer/380/base 2025-12-04T08:54:02.3306138Z * [new branch] gh/soulitzer/380/head -> origin/gh/soulitzer/380/head 2025-12-04T08:54:02.3306210Z * [new branch] gh/soulitzer/380/orig -> origin/gh/soulitzer/380/orig 2025-12-04T08:54:02.3306284Z * [new branch] gh/soulitzer/385/base -> origin/gh/soulitzer/385/base 2025-12-04T08:54:02.3306355Z * [new 
branch] gh/soulitzer/385/head -> origin/gh/soulitzer/385/head 2025-12-04T08:54:02.3306427Z * [new branch] gh/soulitzer/385/orig -> origin/gh/soulitzer/385/orig 2025-12-04T08:54:02.3306502Z * [new branch] gh/soulitzer/386/base -> origin/gh/soulitzer/386/base 2025-12-04T08:54:02.3306574Z * [new branch] gh/soulitzer/386/head -> origin/gh/soulitzer/386/head 2025-12-04T08:54:02.3306645Z * [new branch] gh/soulitzer/386/orig -> origin/gh/soulitzer/386/orig 2025-12-04T08:54:02.3306718Z * [new branch] gh/soulitzer/387/base -> origin/gh/soulitzer/387/base 2025-12-04T08:54:02.3306789Z * [new branch] gh/soulitzer/387/head -> origin/gh/soulitzer/387/head 2025-12-04T08:54:02.3306862Z * [new branch] gh/soulitzer/387/orig -> origin/gh/soulitzer/387/orig 2025-12-04T08:54:02.3306969Z * [new branch] gh/soulitzer/388/base -> origin/gh/soulitzer/388/base 2025-12-04T08:54:02.3307041Z * [new branch] gh/soulitzer/388/head -> origin/gh/soulitzer/388/head 2025-12-04T08:54:02.3307114Z * [new branch] gh/soulitzer/388/orig -> origin/gh/soulitzer/388/orig 2025-12-04T08:54:02.3307187Z * [new branch] gh/soulitzer/389/base -> origin/gh/soulitzer/389/base 2025-12-04T08:54:02.3307259Z * [new branch] gh/soulitzer/389/head -> origin/gh/soulitzer/389/head 2025-12-04T08:54:02.3307331Z * [new branch] gh/soulitzer/389/orig -> origin/gh/soulitzer/389/orig 2025-12-04T08:54:02.3307403Z * [new branch] gh/soulitzer/390/base -> origin/gh/soulitzer/390/base 2025-12-04T08:54:02.3307476Z * [new branch] gh/soulitzer/390/head -> origin/gh/soulitzer/390/head 2025-12-04T08:54:02.3307549Z * [new branch] gh/soulitzer/390/orig -> origin/gh/soulitzer/390/orig 2025-12-04T08:54:02.3307622Z * [new branch] gh/soulitzer/391/base -> origin/gh/soulitzer/391/base 2025-12-04T08:54:02.3307694Z * [new branch] gh/soulitzer/391/head -> origin/gh/soulitzer/391/head 2025-12-04T08:54:02.3307768Z * [new branch] gh/soulitzer/391/orig -> origin/gh/soulitzer/391/orig 2025-12-04T08:54:02.3307867Z * [new branch] gh/soulitzer/392/base -> 
origin/gh/soulitzer/392/base 2025-12-04T08:54:02.3307939Z * [new branch] gh/soulitzer/392/head -> origin/gh/soulitzer/392/head 2025-12-04T08:54:02.3308013Z * [new branch] gh/soulitzer/392/orig -> origin/gh/soulitzer/392/orig 2025-12-04T08:54:02.3308085Z * [new branch] gh/swolchok/728/next -> origin/gh/swolchok/728/next 2025-12-04T08:54:02.3308155Z * [new branch] gh/swolchok/819/base -> origin/gh/swolchok/819/base 2025-12-04T08:54:02.3308228Z * [new branch] gh/swolchok/819/head -> origin/gh/swolchok/819/head 2025-12-04T08:54:02.3308298Z * [new branch] gh/swolchok/819/orig -> origin/gh/swolchok/819/orig 2025-12-04T08:54:02.3308370Z * [new branch] gh/swolchok/824/base -> origin/gh/swolchok/824/base 2025-12-04T08:54:02.3308440Z * [new branch] gh/swolchok/824/head -> origin/gh/swolchok/824/head 2025-12-04T08:54:02.3308510Z * [new branch] gh/swolchok/824/orig -> origin/gh/swolchok/824/orig 2025-12-04T08:54:02.3308581Z * [new branch] gh/swolchok/829/base -> origin/gh/swolchok/829/base 2025-12-04T08:54:02.3308651Z * [new branch] gh/swolchok/829/head -> origin/gh/swolchok/829/head 2025-12-04T08:54:02.3308721Z * [new branch] gh/swolchok/829/orig -> origin/gh/swolchok/829/orig 2025-12-04T08:54:02.3308792Z * [new branch] gh/swolchok/839/base -> origin/gh/swolchok/839/base 2025-12-04T08:54:02.3308863Z * [new branch] gh/swolchok/839/head -> origin/gh/swolchok/839/head 2025-12-04T08:54:02.3308932Z * [new branch] gh/swolchok/839/orig -> origin/gh/swolchok/839/orig 2025-12-04T08:54:02.3309004Z * [new branch] gh/swolchok/841/base -> origin/gh/swolchok/841/base 2025-12-04T08:54:02.3309074Z * [new branch] gh/swolchok/841/head -> origin/gh/swolchok/841/head 2025-12-04T08:54:02.3309144Z * [new branch] gh/swolchok/841/orig -> origin/gh/swolchok/841/orig 2025-12-04T08:54:02.3309215Z * [new branch] gh/swolchok/842/base -> origin/gh/swolchok/842/base 2025-12-04T08:54:02.3309285Z * [new branch] gh/swolchok/842/head -> origin/gh/swolchok/842/head 2025-12-04T08:54:02.3309355Z * [new branch] 
gh/swolchok/842/orig -> origin/gh/swolchok/842/orig 2025-12-04T08:54:02.3309426Z * [new branch] gh/swolchok/845/base -> origin/gh/swolchok/845/base 2025-12-04T08:54:02.3309521Z * [new branch] gh/swolchok/845/head -> origin/gh/swolchok/845/head 2025-12-04T08:54:02.3309592Z * [new branch] gh/swolchok/845/orig -> origin/gh/swolchok/845/orig 2025-12-04T08:54:02.3309662Z * [new branch] gh/swolchok/848/base -> origin/gh/swolchok/848/base 2025-12-04T08:54:02.3309732Z * [new branch] gh/swolchok/848/head -> origin/gh/swolchok/848/head 2025-12-04T08:54:02.3309803Z * [new branch] gh/swolchok/848/orig -> origin/gh/swolchok/848/orig 2025-12-04T08:54:02.3309874Z * [new branch] gh/swolchok/856/base -> origin/gh/swolchok/856/base 2025-12-04T08:54:02.3309943Z * [new branch] gh/swolchok/856/head -> origin/gh/swolchok/856/head 2025-12-04T08:54:02.3310013Z * [new branch] gh/swolchok/856/orig -> origin/gh/swolchok/856/orig 2025-12-04T08:54:02.3310084Z * [new branch] gh/swolchok/860/base -> origin/gh/swolchok/860/base 2025-12-04T08:54:02.3310189Z * [new branch] gh/swolchok/860/head -> origin/gh/swolchok/860/head 2025-12-04T08:54:02.3310260Z * [new branch] gh/swolchok/860/orig -> origin/gh/swolchok/860/orig 2025-12-04T08:54:02.3310330Z * [new branch] gh/swolchok/861/base -> origin/gh/swolchok/861/base 2025-12-04T08:54:02.3310399Z * [new branch] gh/swolchok/861/head -> origin/gh/swolchok/861/head 2025-12-04T08:54:02.3310533Z * [new branch] gh/swolchok/861/orig -> origin/gh/swolchok/861/orig 2025-12-04T08:54:02.3310604Z * [new branch] gh/swolchok/862/base -> origin/gh/swolchok/862/base 2025-12-04T08:54:02.3310674Z * [new branch] gh/swolchok/862/head -> origin/gh/swolchok/862/head 2025-12-04T08:54:02.3310745Z * [new branch] gh/swolchok/862/orig -> origin/gh/swolchok/862/orig 2025-12-04T08:54:02.3310815Z * [new branch] gh/swolchok/863/base -> origin/gh/swolchok/863/base 2025-12-04T08:54:02.3310886Z * [new branch] gh/swolchok/863/head -> origin/gh/swolchok/863/head 
2025-12-04T08:54:02.3310957Z * [new branch] gh/swolchok/863/orig -> origin/gh/swolchok/863/orig 2025-12-04T08:54:02.3311028Z * [new branch] gh/swolchok/864/base -> origin/gh/swolchok/864/base 2025-12-04T08:54:02.3311100Z * [new branch] gh/swolchok/864/head -> origin/gh/swolchok/864/head 2025-12-04T08:54:02.3311172Z * [new branch] gh/swolchok/864/orig -> origin/gh/swolchok/864/orig 2025-12-04T08:54:02.3311242Z * [new branch] gh/swolchok/865/base -> origin/gh/swolchok/865/base 2025-12-04T08:54:02.3311312Z * [new branch] gh/swolchok/865/head -> origin/gh/swolchok/865/head 2025-12-04T08:54:02.3311383Z * [new branch] gh/swolchok/865/orig -> origin/gh/swolchok/865/orig 2025-12-04T08:54:02.3311453Z * [new branch] gh/swolchok/866/base -> origin/gh/swolchok/866/base 2025-12-04T08:54:02.3311525Z * [new branch] gh/swolchok/866/head -> origin/gh/swolchok/866/head 2025-12-04T08:54:02.3311595Z * [new branch] gh/swolchok/866/orig -> origin/gh/swolchok/866/orig 2025-12-04T08:54:02.3311665Z * [new branch] gh/swolchok/867/base -> origin/gh/swolchok/867/base 2025-12-04T08:54:02.3311737Z * [new branch] gh/swolchok/867/head -> origin/gh/swolchok/867/head 2025-12-04T08:54:02.3311807Z * [new branch] gh/swolchok/867/orig -> origin/gh/swolchok/867/orig 2025-12-04T08:54:02.3311877Z * [new branch] gh/swolchok/868/base -> origin/gh/swolchok/868/base 2025-12-04T08:54:02.3311948Z * [new branch] gh/swolchok/868/head -> origin/gh/swolchok/868/head 2025-12-04T08:54:02.3312018Z * [new branch] gh/swolchok/868/orig -> origin/gh/swolchok/868/orig 2025-12-04T08:54:02.3312088Z * [new branch] gh/swolchok/869/base -> origin/gh/swolchok/869/base 2025-12-04T08:54:02.3312211Z * [new branch] gh/swolchok/869/head -> origin/gh/swolchok/869/head 2025-12-04T08:54:02.3312281Z * [new branch] gh/swolchok/869/orig -> origin/gh/swolchok/869/orig 2025-12-04T08:54:02.3312351Z * [new branch] gh/swolchok/870/base -> origin/gh/swolchok/870/base 2025-12-04T08:54:02.3312425Z * [new branch] gh/swolchok/870/head -> 
origin/gh/swolchok/870/head 2025-12-04T08:54:02.3312495Z * [new branch] gh/swolchok/870/orig -> origin/gh/swolchok/870/orig 2025-12-04T08:54:02.3312568Z * [new branch] gh/swolchok/871/base -> origin/gh/swolchok/871/base 2025-12-04T08:54:02.3312639Z * [new branch] gh/swolchok/871/head -> origin/gh/swolchok/871/head 2025-12-04T08:54:02.3312709Z * [new branch] gh/swolchok/871/orig -> origin/gh/swolchok/871/orig 2025-12-04T08:54:02.3312785Z * [new branch] gh/teja-rao/4/base -> origin/gh/teja-rao/4/base 2025-12-04T08:54:02.3312856Z * [new branch] gh/teja-rao/4/head -> origin/gh/teja-rao/4/head 2025-12-04T08:54:02.3312925Z * [new branch] gh/teja-rao/4/orig -> origin/gh/teja-rao/4/orig 2025-12-04T08:54:02.3312998Z * [new branch] gh/tianyu-l/2/base -> origin/gh/tianyu-l/2/base 2025-12-04T08:54:02.3313092Z * [new branch] gh/tianyu-l/2/head -> origin/gh/tianyu-l/2/head 2025-12-04T08:54:02.3313162Z * [new branch] gh/tianyu-l/2/orig -> origin/gh/tianyu-l/2/orig 2025-12-04T08:54:02.3313233Z * [new branch] gh/tianyu-l/3/base -> origin/gh/tianyu-l/3/base 2025-12-04T08:54:02.3313302Z * [new branch] gh/tianyu-l/3/orig -> origin/gh/tianyu-l/3/orig 2025-12-04T08:54:02.3313374Z * [new branch] gh/tianyu-l/4/base -> origin/gh/tianyu-l/4/base 2025-12-04T08:54:02.3313441Z * [new branch] gh/tianyu-l/4/head -> origin/gh/tianyu-l/4/head 2025-12-04T08:54:02.3313511Z * [new branch] gh/tianyu-l/4/orig -> origin/gh/tianyu-l/4/orig 2025-12-04T08:54:02.3313601Z * [new branch] gh/tugsbayasgalan/10/base -> origin/gh/tugsbayasgalan/10/base 2025-12-04T08:54:02.3313687Z * [new branch] gh/tugsbayasgalan/10/head -> origin/gh/tugsbayasgalan/10/head 2025-12-04T08:54:02.3313773Z * [new branch] gh/tugsbayasgalan/10/orig -> origin/gh/tugsbayasgalan/10/orig 2025-12-04T08:54:02.3313859Z * [new branch] gh/tugsbayasgalan/13/base -> origin/gh/tugsbayasgalan/13/base 2025-12-04T08:54:02.3313941Z * [new branch] gh/tugsbayasgalan/13/head -> origin/gh/tugsbayasgalan/13/head 2025-12-04T08:54:02.3314022Z * [new branch] 
gh/tugsbayasgalan/13/orig -> origin/gh/tugsbayasgalan/13/orig 2025-12-04T08:54:02.3314107Z * [new branch] gh/tugsbayasgalan/17/base -> origin/gh/tugsbayasgalan/17/base 2025-12-04T08:54:02.3314193Z * [new branch] gh/tugsbayasgalan/17/head -> origin/gh/tugsbayasgalan/17/head 2025-12-04T08:54:02.3314274Z * [new branch] gh/tugsbayasgalan/17/orig -> origin/gh/tugsbayasgalan/17/orig 2025-12-04T08:54:02.3314358Z * [new branch] gh/tugsbayasgalan/2/base -> origin/gh/tugsbayasgalan/2/base 2025-12-04T08:54:02.3314440Z * [new branch] gh/tugsbayasgalan/2/head -> origin/gh/tugsbayasgalan/2/head 2025-12-04T08:54:02.3314523Z * [new branch] gh/tugsbayasgalan/2/orig -> origin/gh/tugsbayasgalan/2/orig 2025-12-04T08:54:02.3314607Z * [new branch] gh/tugsbayasgalan/28/base -> origin/gh/tugsbayasgalan/28/base 2025-12-04T08:54:02.3314692Z * [new branch] gh/tugsbayasgalan/28/head -> origin/gh/tugsbayasgalan/28/head 2025-12-04T08:54:02.3314775Z * [new branch] gh/tugsbayasgalan/28/orig -> origin/gh/tugsbayasgalan/28/orig 2025-12-04T08:54:02.3314858Z * [new branch] gh/tugsbayasgalan/32/base -> origin/gh/tugsbayasgalan/32/base 2025-12-04T08:54:02.3314967Z * [new branch] gh/tugsbayasgalan/32/head -> origin/gh/tugsbayasgalan/32/head 2025-12-04T08:54:02.3315050Z * [new branch] gh/tugsbayasgalan/32/orig -> origin/gh/tugsbayasgalan/32/orig 2025-12-04T08:54:02.3315134Z * [new branch] gh/tugsbayasgalan/35/base -> origin/gh/tugsbayasgalan/35/base 2025-12-04T08:54:02.3315219Z * [new branch] gh/tugsbayasgalan/35/head -> origin/gh/tugsbayasgalan/35/head 2025-12-04T08:54:02.3315304Z * [new branch] gh/tugsbayasgalan/35/orig -> origin/gh/tugsbayasgalan/35/orig 2025-12-04T08:54:02.3315386Z * [new branch] gh/tugsbayasgalan/36/base -> origin/gh/tugsbayasgalan/36/base 2025-12-04T08:54:02.3315469Z * [new branch] gh/tugsbayasgalan/36/head -> origin/gh/tugsbayasgalan/36/head 2025-12-04T08:54:02.3315554Z * [new branch] gh/tugsbayasgalan/36/orig -> origin/gh/tugsbayasgalan/36/orig 2025-12-04T08:54:02.3315637Z * [new 
branch] gh/tugsbayasgalan/37/base -> origin/gh/tugsbayasgalan/37/base 2025-12-04T08:54:02.3315721Z * [new branch] gh/tugsbayasgalan/37/head -> origin/gh/tugsbayasgalan/37/head 2025-12-04T08:54:02.3315804Z * [new branch] gh/tugsbayasgalan/37/orig -> origin/gh/tugsbayasgalan/37/orig 2025-12-04T08:54:02.3315913Z * [new branch] gh/tugsbayasgalan/43/base -> origin/gh/tugsbayasgalan/43/base 2025-12-04T08:54:02.3315997Z * [new branch] gh/tugsbayasgalan/43/head -> origin/gh/tugsbayasgalan/43/head 2025-12-04T08:54:02.3316082Z * [new branch] gh/tugsbayasgalan/43/orig -> origin/gh/tugsbayasgalan/43/orig 2025-12-04T08:54:02.3316165Z * [new branch] gh/tugsbayasgalan/48/base -> origin/gh/tugsbayasgalan/48/base 2025-12-04T08:54:02.3316248Z * [new branch] gh/tugsbayasgalan/48/head -> origin/gh/tugsbayasgalan/48/head 2025-12-04T08:54:02.3316331Z * [new branch] gh/tugsbayasgalan/48/orig -> origin/gh/tugsbayasgalan/48/orig 2025-12-04T08:54:02.3316415Z * [new branch] gh/tugsbayasgalan/51/base -> origin/gh/tugsbayasgalan/51/base 2025-12-04T08:54:02.3316496Z * [new branch] gh/tugsbayasgalan/51/head -> origin/gh/tugsbayasgalan/51/head 2025-12-04T08:54:02.3316582Z * [new branch] gh/tugsbayasgalan/51/orig -> origin/gh/tugsbayasgalan/51/orig 2025-12-04T08:54:02.3316665Z * [new branch] gh/tugsbayasgalan/52/base -> origin/gh/tugsbayasgalan/52/base 2025-12-04T08:54:02.3316750Z * [new branch] gh/tugsbayasgalan/52/head -> origin/gh/tugsbayasgalan/52/head 2025-12-04T08:54:02.3316832Z * [new branch] gh/tugsbayasgalan/52/orig -> origin/gh/tugsbayasgalan/52/orig 2025-12-04T08:54:02.3316914Z * [new branch] gh/tugsbayasgalan/53/base -> origin/gh/tugsbayasgalan/53/base 2025-12-04T08:54:02.3317001Z * [new branch] gh/tugsbayasgalan/53/head -> origin/gh/tugsbayasgalan/53/head 2025-12-04T08:54:02.3317084Z * [new branch] gh/tugsbayasgalan/53/orig -> origin/gh/tugsbayasgalan/53/orig 2025-12-04T08:54:02.3317165Z * [new branch] gh/tugsbayasgalan/55/base -> origin/gh/tugsbayasgalan/55/base 
2025-12-04T08:54:02.3317249Z * [new branch] gh/tugsbayasgalan/55/head -> origin/gh/tugsbayasgalan/55/head 2025-12-04T08:54:02.3317333Z * [new branch] gh/tugsbayasgalan/55/orig -> origin/gh/tugsbayasgalan/55/orig 2025-12-04T08:54:02.3317415Z * [new branch] gh/tugsbayasgalan/59/base -> origin/gh/tugsbayasgalan/59/base 2025-12-04T08:54:02.3317499Z * [new branch] gh/tugsbayasgalan/59/head -> origin/gh/tugsbayasgalan/59/head 2025-12-04T08:54:02.3317581Z * [new branch] gh/tugsbayasgalan/59/orig -> origin/gh/tugsbayasgalan/59/orig 2025-12-04T08:54:02.3317661Z * [new branch] gh/tugsbayasgalan/6/base -> origin/gh/tugsbayasgalan/6/base 2025-12-04T08:54:02.3317773Z * [new branch] gh/tugsbayasgalan/6/head -> origin/gh/tugsbayasgalan/6/head 2025-12-04T08:54:02.3317853Z * [new branch] gh/tugsbayasgalan/6/orig -> origin/gh/tugsbayasgalan/6/orig 2025-12-04T08:54:02.3317935Z * [new branch] gh/tugsbayasgalan/60/base -> origin/gh/tugsbayasgalan/60/base 2025-12-04T08:54:02.3318019Z * [new branch] gh/tugsbayasgalan/60/head -> origin/gh/tugsbayasgalan/60/head 2025-12-04T08:54:02.3318103Z * [new branch] gh/tugsbayasgalan/60/orig -> origin/gh/tugsbayasgalan/60/orig 2025-12-04T08:54:02.3318187Z * [new branch] gh/tugsbayasgalan/61/base -> origin/gh/tugsbayasgalan/61/base 2025-12-04T08:54:02.3318269Z * [new branch] gh/tugsbayasgalan/61/head -> origin/gh/tugsbayasgalan/61/head 2025-12-04T08:54:02.3318352Z * [new branch] gh/tugsbayasgalan/61/orig -> origin/gh/tugsbayasgalan/61/orig 2025-12-04T08:54:02.3318436Z * [new branch] gh/tugsbayasgalan/63/base -> origin/gh/tugsbayasgalan/63/base 2025-12-04T08:54:02.3318521Z * [new branch] gh/tugsbayasgalan/63/head -> origin/gh/tugsbayasgalan/63/head 2025-12-04T08:54:02.3318602Z * [new branch] gh/tugsbayasgalan/63/orig -> origin/gh/tugsbayasgalan/63/orig 2025-12-04T08:54:02.3318685Z * [new branch] gh/tugsbayasgalan/67/base -> origin/gh/tugsbayasgalan/67/base 2025-12-04T08:54:02.3318792Z * [new branch] gh/tugsbayasgalan/67/head -> 
origin/gh/tugsbayasgalan/67/head 2025-12-04T08:54:02.3318875Z * [new branch] gh/tugsbayasgalan/67/orig -> origin/gh/tugsbayasgalan/67/orig 2025-12-04T08:54:02.3318958Z * [new branch] gh/tugsbayasgalan/68/base -> origin/gh/tugsbayasgalan/68/base 2025-12-04T08:54:02.3319040Z * [new branch] gh/tugsbayasgalan/68/head -> origin/gh/tugsbayasgalan/68/head 2025-12-04T08:54:02.3319125Z * [new branch] gh/tugsbayasgalan/68/orig -> origin/gh/tugsbayasgalan/68/orig 2025-12-04T08:54:02.3319209Z * [new branch] gh/tugsbayasgalan/7/base -> origin/gh/tugsbayasgalan/7/base 2025-12-04T08:54:02.3319291Z * [new branch] gh/tugsbayasgalan/7/head -> origin/gh/tugsbayasgalan/7/head 2025-12-04T08:54:02.3319371Z * [new branch] gh/tugsbayasgalan/7/orig -> origin/gh/tugsbayasgalan/7/orig 2025-12-04T08:54:02.3319457Z * [new branch] gh/tugsbayasgalan/70/base -> origin/gh/tugsbayasgalan/70/base 2025-12-04T08:54:02.3319539Z * [new branch] gh/tugsbayasgalan/70/head -> origin/gh/tugsbayasgalan/70/head 2025-12-04T08:54:02.3319621Z * [new branch] gh/tugsbayasgalan/70/orig -> origin/gh/tugsbayasgalan/70/orig 2025-12-04T08:54:02.3319707Z * [new branch] gh/tugsbayasgalan/71/base -> origin/gh/tugsbayasgalan/71/base 2025-12-04T08:54:02.3319788Z * [new branch] gh/tugsbayasgalan/71/head -> origin/gh/tugsbayasgalan/71/head 2025-12-04T08:54:02.3319871Z * [new branch] gh/tugsbayasgalan/71/orig -> origin/gh/tugsbayasgalan/71/orig 2025-12-04T08:54:02.3319956Z * [new branch] gh/tugsbayasgalan/72/base -> origin/gh/tugsbayasgalan/72/base 2025-12-04T08:54:02.3320037Z * [new branch] gh/tugsbayasgalan/72/head -> origin/gh/tugsbayasgalan/72/head 2025-12-04T08:54:02.3320168Z * [new branch] gh/tugsbayasgalan/72/orig -> origin/gh/tugsbayasgalan/72/orig 2025-12-04T08:54:02.3320252Z * [new branch] gh/tugsbayasgalan/73/base -> origin/gh/tugsbayasgalan/73/base 2025-12-04T08:54:02.3320334Z * [new branch] gh/tugsbayasgalan/73/head -> origin/gh/tugsbayasgalan/73/head 2025-12-04T08:54:02.3320417Z * [new branch] 
gh/tugsbayasgalan/73/orig -> origin/gh/tugsbayasgalan/73/orig 2025-12-04T08:54:02.3320499Z * [new branch] gh/tugsbayasgalan/74/base -> origin/gh/tugsbayasgalan/74/base 2025-12-04T08:54:02.3320583Z * [new branch] gh/tugsbayasgalan/74/head -> origin/gh/tugsbayasgalan/74/head 2025-12-04T08:54:02.3320715Z * [new branch] gh/tugsbayasgalan/74/orig -> origin/gh/tugsbayasgalan/74/orig 2025-12-04T08:54:02.3320798Z * [new branch] gh/tugsbayasgalan/75/base -> origin/gh/tugsbayasgalan/75/base 2025-12-04T08:54:02.3320882Z * [new branch] gh/tugsbayasgalan/75/head -> origin/gh/tugsbayasgalan/75/head 2025-12-04T08:54:02.3320968Z * [new branch] gh/tugsbayasgalan/75/orig -> origin/gh/tugsbayasgalan/75/orig 2025-12-04T08:54:02.3321050Z * [new branch] gh/tugsbayasgalan/76/base -> origin/gh/tugsbayasgalan/76/base 2025-12-04T08:54:02.3321132Z * [new branch] gh/tugsbayasgalan/76/head -> origin/gh/tugsbayasgalan/76/head 2025-12-04T08:54:02.3321216Z * [new branch] gh/tugsbayasgalan/76/orig -> origin/gh/tugsbayasgalan/76/orig 2025-12-04T08:54:02.3321299Z * [new branch] gh/tugsbayasgalan/77/base -> origin/gh/tugsbayasgalan/77/base 2025-12-04T08:54:02.3321386Z * [new branch] gh/tugsbayasgalan/77/head -> origin/gh/tugsbayasgalan/77/head 2025-12-04T08:54:02.3321468Z * [new branch] gh/tugsbayasgalan/77/orig -> origin/gh/tugsbayasgalan/77/orig 2025-12-04T08:54:02.3321549Z * [new branch] gh/tugsbayasgalan/78/base -> origin/gh/tugsbayasgalan/78/base 2025-12-04T08:54:02.3321633Z * [new branch] gh/tugsbayasgalan/78/head -> origin/gh/tugsbayasgalan/78/head 2025-12-04T08:54:02.3321762Z * [new branch] gh/tugsbayasgalan/78/orig -> origin/gh/tugsbayasgalan/78/orig 2025-12-04T08:54:02.3321845Z * [new branch] gh/tugsbayasgalan/79/base -> origin/gh/tugsbayasgalan/79/base 2025-12-04T08:54:02.3321927Z * [new branch] gh/tugsbayasgalan/79/head -> origin/gh/tugsbayasgalan/79/head 2025-12-04T08:54:02.3322009Z * [new branch] gh/tugsbayasgalan/79/orig -> origin/gh/tugsbayasgalan/79/orig 2025-12-04T08:54:02.3322091Z 
* [new branch] gh/tugsbayasgalan/8/base -> origin/gh/tugsbayasgalan/8/base 2025-12-04T08:54:02.3322175Z * [new branch] gh/tugsbayasgalan/8/head -> origin/gh/tugsbayasgalan/8/head 2025-12-04T08:54:02.3322254Z * [new branch] gh/tugsbayasgalan/8/orig -> origin/gh/tugsbayasgalan/8/orig 2025-12-04T08:54:02.3322336Z * [new branch] gh/tugsbayasgalan/80/base -> origin/gh/tugsbayasgalan/80/base 2025-12-04T08:54:02.3322421Z * [new branch] gh/tugsbayasgalan/80/head -> origin/gh/tugsbayasgalan/80/head 2025-12-04T08:54:02.3322504Z * [new branch] gh/tugsbayasgalan/80/orig -> origin/gh/tugsbayasgalan/80/orig 2025-12-04T08:54:02.3322585Z * [new branch] gh/tugsbayasgalan/81/base -> origin/gh/tugsbayasgalan/81/base 2025-12-04T08:54:02.3322667Z * [new branch] gh/tugsbayasgalan/81/head -> origin/gh/tugsbayasgalan/81/head 2025-12-04T08:54:02.3322749Z * [new branch] gh/tugsbayasgalan/81/orig -> origin/gh/tugsbayasgalan/81/orig 2025-12-04T08:54:02.3322832Z * [new branch] gh/tugsbayasgalan/82/base -> origin/gh/tugsbayasgalan/82/base 2025-12-04T08:54:02.3322915Z * [new branch] gh/tugsbayasgalan/82/head -> origin/gh/tugsbayasgalan/82/head 2025-12-04T08:54:02.3322997Z * [new branch] gh/tugsbayasgalan/82/orig -> origin/gh/tugsbayasgalan/82/orig 2025-12-04T08:54:02.3323080Z * [new branch] gh/tugsbayasgalan/83/base -> origin/gh/tugsbayasgalan/83/base 2025-12-04T08:54:02.3323162Z * [new branch] gh/tugsbayasgalan/83/head -> origin/gh/tugsbayasgalan/83/head 2025-12-04T08:54:02.3323244Z * [new branch] gh/tugsbayasgalan/83/orig -> origin/gh/tugsbayasgalan/83/orig 2025-12-04T08:54:02.3323326Z * [new branch] gh/tugsbayasgalan/84/base -> origin/gh/tugsbayasgalan/84/base 2025-12-04T08:54:02.3323408Z * [new branch] gh/tugsbayasgalan/84/head -> origin/gh/tugsbayasgalan/84/head 2025-12-04T08:54:02.3323489Z * [new branch] gh/tugsbayasgalan/84/orig -> origin/gh/tugsbayasgalan/84/orig 2025-12-04T08:54:02.3324129Z * [new branch] gh/tugsbayasgalan/85/base -> origin/gh/tugsbayasgalan/85/base 
2025-12-04T08:54:02.3324211Z * [new branch] gh/tugsbayasgalan/85/head -> origin/gh/tugsbayasgalan/85/head 2025-12-04T08:54:02.3324293Z * [new branch] gh/tugsbayasgalan/85/orig -> origin/gh/tugsbayasgalan/85/orig 2025-12-04T08:54:02.3324381Z * [new branch] gh/tugsbayasgalan/86/base -> origin/gh/tugsbayasgalan/86/base 2025-12-04T08:54:02.3324463Z * [new branch] gh/tugsbayasgalan/86/head -> origin/gh/tugsbayasgalan/86/head 2025-12-04T08:54:02.3324545Z * [new branch] gh/tugsbayasgalan/86/orig -> origin/gh/tugsbayasgalan/86/orig 2025-12-04T08:54:02.3324627Z * [new branch] gh/tugsbayasgalan/87/base -> origin/gh/tugsbayasgalan/87/base 2025-12-04T08:54:02.3324709Z * [new branch] gh/tugsbayasgalan/87/head -> origin/gh/tugsbayasgalan/87/head 2025-12-04T08:54:02.3324792Z * [new branch] gh/tugsbayasgalan/87/orig -> origin/gh/tugsbayasgalan/87/orig 2025-12-04T08:54:02.3324875Z * [new branch] gh/tugsbayasgalan/88/base -> origin/gh/tugsbayasgalan/88/base 2025-12-04T08:54:02.3324957Z * [new branch] gh/tugsbayasgalan/88/head -> origin/gh/tugsbayasgalan/88/head 2025-12-04T08:54:02.3325075Z * [new branch] gh/tugsbayasgalan/88/orig -> origin/gh/tugsbayasgalan/88/orig 2025-12-04T08:54:02.3325160Z * [new branch] gh/tugsbayasgalan/89/base -> origin/gh/tugsbayasgalan/89/base 2025-12-04T08:54:02.3325241Z * [new branch] gh/tugsbayasgalan/89/head -> origin/gh/tugsbayasgalan/89/head 2025-12-04T08:54:02.3325325Z * [new branch] gh/tugsbayasgalan/89/orig -> origin/gh/tugsbayasgalan/89/orig 2025-12-04T08:54:02.3325405Z * [new branch] gh/tugsbayasgalan/9/base -> origin/gh/tugsbayasgalan/9/base 2025-12-04T08:54:02.3325485Z * [new branch] gh/tugsbayasgalan/9/head -> origin/gh/tugsbayasgalan/9/head 2025-12-04T08:54:02.3325568Z * [new branch] gh/tugsbayasgalan/9/orig -> origin/gh/tugsbayasgalan/9/orig 2025-12-04T08:54:02.3325651Z * [new branch] gh/tugsbayasgalan/90/base -> origin/gh/tugsbayasgalan/90/base 2025-12-04T08:54:02.3325733Z * [new branch] gh/tugsbayasgalan/90/head -> 
origin/gh/tugsbayasgalan/90/head 2025-12-04T08:54:02.3325816Z * [new branch] gh/tugsbayasgalan/90/orig -> origin/gh/tugsbayasgalan/90/orig 2025-12-04T08:54:02.3325898Z * [new branch] gh/tugsbayasgalan/91/base -> origin/gh/tugsbayasgalan/91/base 2025-12-04T08:54:02.3325982Z * [new branch] gh/tugsbayasgalan/91/head -> origin/gh/tugsbayasgalan/91/head 2025-12-04T08:54:02.3326065Z * [new branch] gh/tugsbayasgalan/91/orig -> origin/gh/tugsbayasgalan/91/orig 2025-12-04T08:54:02.3326146Z * [new branch] gh/tugsbayasgalan/92/base -> origin/gh/tugsbayasgalan/92/base 2025-12-04T08:54:02.3326229Z * [new branch] gh/tugsbayasgalan/92/head -> origin/gh/tugsbayasgalan/92/head 2025-12-04T08:54:02.3326313Z * [new branch] gh/tugsbayasgalan/92/orig -> origin/gh/tugsbayasgalan/92/orig 2025-12-04T08:54:02.3326394Z * [new branch] gh/tugsbayasgalan/93/base -> origin/gh/tugsbayasgalan/93/base 2025-12-04T08:54:02.3326478Z * [new branch] gh/tugsbayasgalan/93/head -> origin/gh/tugsbayasgalan/93/head 2025-12-04T08:54:02.3326563Z * [new branch] gh/tugsbayasgalan/93/orig -> origin/gh/tugsbayasgalan/93/orig 2025-12-04T08:54:02.3326631Z * [new branch] gh/v0i0/14/base -> origin/gh/v0i0/14/base 2025-12-04T08:54:02.3326698Z * [new branch] gh/v0i0/14/head -> origin/gh/v0i0/14/head 2025-12-04T08:54:02.3326765Z * [new branch] gh/v0i0/14/orig -> origin/gh/v0i0/14/orig 2025-12-04T08:54:02.3326830Z * [new branch] gh/v0i0/15/base -> origin/gh/v0i0/15/base 2025-12-04T08:54:02.3326928Z * [new branch] gh/v0i0/15/head -> origin/gh/v0i0/15/head 2025-12-04T08:54:02.3326990Z * [new branch] gh/v0i0/15/orig -> origin/gh/v0i0/15/orig 2025-12-04T08:54:02.3327051Z * [new branch] gh/v0i0/16/base -> origin/gh/v0i0/16/base 2025-12-04T08:54:02.3327116Z * [new branch] gh/v0i0/16/head -> origin/gh/v0i0/16/head 2025-12-04T08:54:02.3327179Z * [new branch] gh/v0i0/16/orig -> origin/gh/v0i0/16/orig 2025-12-04T08:54:02.3327241Z * [new branch] gh/v0i0/17/base -> origin/gh/v0i0/17/base 2025-12-04T08:54:02.3327305Z * [new branch] 
gh/v0i0/17/head -> origin/gh/v0i0/17/head 2025-12-04T08:54:02.3327366Z * [new branch] gh/v0i0/17/orig -> origin/gh/v0i0/17/orig 2025-12-04T08:54:02.3327429Z * [new branch] gh/v0i0/18/base -> origin/gh/v0i0/18/base 2025-12-04T08:54:02.3327494Z * [new branch] gh/v0i0/18/head -> origin/gh/v0i0/18/head 2025-12-04T08:54:02.3327557Z * [new branch] gh/v0i0/18/orig -> origin/gh/v0i0/18/orig 2025-12-04T08:54:02.3327618Z * [new branch] gh/v0i0/19/base -> origin/gh/v0i0/19/base 2025-12-04T08:54:02.3327706Z * [new branch] gh/v0i0/19/head -> origin/gh/v0i0/19/head 2025-12-04T08:54:02.3327768Z * [new branch] gh/v0i0/19/orig -> origin/gh/v0i0/19/orig 2025-12-04T08:54:02.3327848Z * [new branch] gh/vishal9-team/1/base -> origin/gh/vishal9-team/1/base 2025-12-04T08:54:02.3327926Z * [new branch] gh/vishal9-team/1/head -> origin/gh/vishal9-team/1/head 2025-12-04T08:54:02.3328003Z * [new branch] gh/vishal9-team/2/base -> origin/gh/vishal9-team/2/base 2025-12-04T08:54:02.3328078Z * [new branch] gh/vishal9-team/2/head -> origin/gh/vishal9-team/2/head 2025-12-04T08:54:02.3328154Z * [new branch] gh/vishal9-team/2/orig -> origin/gh/vishal9-team/2/orig 2025-12-04T08:54:02.3328227Z * [new branch] gh/vishal9-team/3/base -> origin/gh/vishal9-team/3/base 2025-12-04T08:54:02.3328300Z * [new branch] gh/vishal9-team/3/head -> origin/gh/vishal9-team/3/head 2025-12-04T08:54:02.3328376Z * [new branch] gh/vishal9-team/3/orig -> origin/gh/vishal9-team/3/orig 2025-12-04T08:54:02.3328449Z * [new branch] gh/vishal9-team/4/base -> origin/gh/vishal9-team/4/base 2025-12-04T08:54:02.3328525Z * [new branch] gh/vishal9-team/4/head -> origin/gh/vishal9-team/4/head 2025-12-04T08:54:02.3328599Z * [new branch] gh/vishal9-team/4/orig -> origin/gh/vishal9-team/4/orig 2025-12-04T08:54:02.3328666Z * [new branch] gh/vkuzo/1/next -> origin/gh/vkuzo/1/next 2025-12-04T08:54:02.3328733Z * [new branch] gh/vkuzo/2/next -> origin/gh/vkuzo/2/next 2025-12-04T08:54:02.3328800Z * [new branch] gh/vkuzo/3/next -> 
origin/gh/vkuzo/3/next 2025-12-04T08:54:02.3328874Z * [new branch] gh/wconstab/424/base -> origin/gh/wconstab/424/base 2025-12-04T08:54:02.3328949Z * [new branch] gh/wconstab/424/head -> origin/gh/wconstab/424/head 2025-12-04T08:54:02.3329023Z * [new branch] gh/wconstab/424/orig -> origin/gh/wconstab/424/orig 2025-12-04T08:54:02.3329096Z * [new branch] gh/wconstab/435/base -> origin/gh/wconstab/435/base 2025-12-04T08:54:02.3329170Z * [new branch] gh/wconstab/435/head -> origin/gh/wconstab/435/head 2025-12-04T08:54:02.3329241Z * [new branch] gh/wconstab/435/orig -> origin/gh/wconstab/435/orig 2025-12-04T08:54:02.3329311Z * [new branch] gh/wconstab/444/base -> origin/gh/wconstab/444/base 2025-12-04T08:54:02.3329381Z * [new branch] gh/wconstab/444/head -> origin/gh/wconstab/444/head 2025-12-04T08:54:02.3329475Z * [new branch] gh/wconstab/444/orig -> origin/gh/wconstab/444/orig 2025-12-04T08:54:02.3329546Z * [new branch] gh/wconstab/447/base -> origin/gh/wconstab/447/base 2025-12-04T08:54:02.3329620Z * [new branch] gh/wconstab/447/head -> origin/gh/wconstab/447/head 2025-12-04T08:54:02.3329693Z * [new branch] gh/wconstab/447/orig -> origin/gh/wconstab/447/orig 2025-12-04T08:54:02.3329764Z * [new branch] gh/wconstab/448/base -> origin/gh/wconstab/448/base 2025-12-04T08:54:02.3329836Z * [new branch] gh/wconstab/448/head -> origin/gh/wconstab/448/head 2025-12-04T08:54:02.3329906Z * [new branch] gh/wconstab/448/orig -> origin/gh/wconstab/448/orig 2025-12-04T08:54:02.3329981Z * [new branch] gh/wconstab/449/base -> origin/gh/wconstab/449/base 2025-12-04T08:54:02.3330051Z * [new branch] gh/wconstab/449/head -> origin/gh/wconstab/449/head 2025-12-04T08:54:02.3330181Z * [new branch] gh/wconstab/449/orig -> origin/gh/wconstab/449/orig 2025-12-04T08:54:02.3330253Z * [new branch] gh/wconstab/450/base -> origin/gh/wconstab/450/base 2025-12-04T08:54:02.3330322Z * [new branch] gh/wconstab/450/head -> origin/gh/wconstab/450/head 2025-12-04T08:54:02.3330438Z * [new branch] 
gh/wconstab/450/orig -> origin/gh/wconstab/450/orig 2025-12-04T08:54:02.3330514Z * [new branch] gh/wconstab/451/base -> origin/gh/wconstab/451/base 2025-12-04T08:54:02.3330586Z * [new branch] gh/wconstab/451/head -> origin/gh/wconstab/451/head 2025-12-04T08:54:02.3330658Z * [new branch] gh/wconstab/451/orig -> origin/gh/wconstab/451/orig 2025-12-04T08:54:02.3330732Z * [new branch] gh/wconstab/452/base -> origin/gh/wconstab/452/base 2025-12-04T08:54:02.3330801Z * [new branch] gh/wconstab/452/head -> origin/gh/wconstab/452/head 2025-12-04T08:54:02.3330874Z * [new branch] gh/wconstab/452/orig -> origin/gh/wconstab/452/orig 2025-12-04T08:54:02.3330944Z * [new branch] gh/wconstab/453/base -> origin/gh/wconstab/453/base 2025-12-04T08:54:02.3331014Z * [new branch] gh/wconstab/453/head -> origin/gh/wconstab/453/head 2025-12-04T08:54:02.3331085Z * [new branch] gh/wconstab/453/orig -> origin/gh/wconstab/453/orig 2025-12-04T08:54:02.3331157Z * [new branch] gh/wconstab/454/base -> origin/gh/wconstab/454/base 2025-12-04T08:54:02.3331226Z * [new branch] gh/wconstab/454/head -> origin/gh/wconstab/454/head 2025-12-04T08:54:02.3331296Z * [new branch] gh/wconstab/454/orig -> origin/gh/wconstab/454/orig 2025-12-04T08:54:02.3331367Z * [new branch] gh/wconstab/455/base -> origin/gh/wconstab/455/base 2025-12-04T08:54:02.3331436Z * [new branch] gh/wconstab/455/head -> origin/gh/wconstab/455/head 2025-12-04T08:54:02.3331510Z * [new branch] gh/wconstab/455/orig -> origin/gh/wconstab/455/orig 2025-12-04T08:54:02.3331580Z * [new branch] gh/wconstab/456/base -> origin/gh/wconstab/456/base 2025-12-04T08:54:02.3331651Z * [new branch] gh/wconstab/456/head -> origin/gh/wconstab/456/head 2025-12-04T08:54:02.3331723Z * [new branch] gh/wconstab/456/orig -> origin/gh/wconstab/456/orig 2025-12-04T08:54:02.3331793Z * [new branch] gh/wconstab/457/base -> origin/gh/wconstab/457/base 2025-12-04T08:54:02.3331865Z * [new branch] gh/wconstab/457/head -> origin/gh/wconstab/457/head 
2025-12-04T08:54:02.3331936Z * [new branch] gh/wconstab/457/orig -> origin/gh/wconstab/457/orig 2025-12-04T08:54:02.3332006Z * [new branch] gh/wconstab/458/base -> origin/gh/wconstab/458/base 2025-12-04T08:54:02.3332122Z * [new branch] gh/wconstab/458/head -> origin/gh/wconstab/458/head 2025-12-04T08:54:02.3332196Z * [new branch] gh/wconstab/458/orig -> origin/gh/wconstab/458/orig 2025-12-04T08:54:02.3332266Z * [new branch] gh/wconstab/459/base -> origin/gh/wconstab/459/base 2025-12-04T08:54:02.3332336Z * [new branch] gh/wconstab/459/head -> origin/gh/wconstab/459/head 2025-12-04T08:54:02.3332408Z * [new branch] gh/wconstab/459/orig -> origin/gh/wconstab/459/orig 2025-12-04T08:54:02.3332478Z * [new branch] gh/wconstab/460/base -> origin/gh/wconstab/460/base 2025-12-04T08:54:02.3332547Z * [new branch] gh/wconstab/460/head -> origin/gh/wconstab/460/head 2025-12-04T08:54:02.3332619Z * [new branch] gh/wconstab/460/orig -> origin/gh/wconstab/460/orig 2025-12-04T08:54:02.3332690Z * [new branch] gh/wconstab/461/base -> origin/gh/wconstab/461/base 2025-12-04T08:54:02.3332761Z * [new branch] gh/wconstab/461/head -> origin/gh/wconstab/461/head 2025-12-04T08:54:02.3332834Z * [new branch] gh/wconstab/461/orig -> origin/gh/wconstab/461/orig 2025-12-04T08:54:02.3332904Z * [new branch] gh/wconstab/462/base -> origin/gh/wconstab/462/base 2025-12-04T08:54:02.3332973Z * [new branch] gh/wconstab/462/head -> origin/gh/wconstab/462/head 2025-12-04T08:54:02.3333071Z * [new branch] gh/wconstab/462/orig -> origin/gh/wconstab/462/orig 2025-12-04T08:54:02.3333141Z * [new branch] gh/wconstab/463/base -> origin/gh/wconstab/463/base 2025-12-04T08:54:02.3333212Z * [new branch] gh/wconstab/463/head -> origin/gh/wconstab/463/head 2025-12-04T08:54:02.3333283Z * [new branch] gh/wconstab/463/orig -> origin/gh/wconstab/463/orig 2025-12-04T08:54:02.3333352Z * [new branch] gh/wconstab/464/base -> origin/gh/wconstab/464/base 2025-12-04T08:54:02.3333426Z * [new branch] gh/wconstab/464/head -> 
origin/gh/wconstab/464/head 2025-12-04T08:54:02.3333495Z * [new branch] gh/wconstab/464/orig -> origin/gh/wconstab/464/orig 2025-12-04T08:54:02.3333565Z * [new branch] gh/wconstab/465/base -> origin/gh/wconstab/465/base 2025-12-04T08:54:02.3333637Z * [new branch] gh/wconstab/465/head -> origin/gh/wconstab/465/head 2025-12-04T08:54:02.3333709Z * [new branch] gh/wconstab/465/orig -> origin/gh/wconstab/465/orig 2025-12-04T08:54:02.3333779Z * [new branch] gh/wconstab/466/base -> origin/gh/wconstab/466/base 2025-12-04T08:54:02.3333850Z * [new branch] gh/wconstab/466/head -> origin/gh/wconstab/466/head 2025-12-04T08:54:02.3333919Z * [new branch] gh/wconstab/466/orig -> origin/gh/wconstab/466/orig 2025-12-04T08:54:02.3333990Z * [new branch] gh/wconstab/467/base -> origin/gh/wconstab/467/base 2025-12-04T08:54:02.3334064Z * [new branch] gh/wconstab/467/head -> origin/gh/wconstab/467/head 2025-12-04T08:54:02.3334134Z * [new branch] gh/wconstab/467/orig -> origin/gh/wconstab/467/orig 2025-12-04T08:54:02.3334203Z * [new branch] gh/wconstab/468/base -> origin/gh/wconstab/468/base 2025-12-04T08:54:02.3334275Z * [new branch] gh/wconstab/468/head -> origin/gh/wconstab/468/head 2025-12-04T08:54:02.3334346Z * [new branch] gh/wconstab/468/orig -> origin/gh/wconstab/468/orig 2025-12-04T08:54:02.3334417Z * [new branch] gh/weifengpy/39/base -> origin/gh/weifengpy/39/base 2025-12-04T08:54:02.3334489Z * [new branch] gh/weifengpy/39/head -> origin/gh/weifengpy/39/head 2025-12-04T08:54:02.3334560Z * [new branch] gh/weifengpy/39/orig -> origin/gh/weifengpy/39/orig 2025-12-04T08:54:02.3334633Z * [new branch] gh/weifengpy/40/base -> origin/gh/weifengpy/40/base 2025-12-04T08:54:02.3334734Z * [new branch] gh/weifengpy/40/head -> origin/gh/weifengpy/40/head 2025-12-04T08:54:02.3334805Z * [new branch] gh/weifengpy/40/orig -> origin/gh/weifengpy/40/orig 2025-12-04T08:54:02.3334878Z * [new branch] gh/weifengpy/41/base -> origin/gh/weifengpy/41/base 2025-12-04T08:54:02.3334951Z * [new branch] 
gh/weifengpy/41/head -> origin/gh/weifengpy/41/head 2025-12-04T08:54:02.3335022Z * [new branch] gh/weifengpy/41/orig -> origin/gh/weifengpy/41/orig 2025-12-04T08:54:02.3335106Z * [new branch] gh/williamwen42/250/base -> origin/gh/williamwen42/250/base 2025-12-04T08:54:02.3335188Z * [new branch] gh/williamwen42/250/head -> origin/gh/williamwen42/250/head 2025-12-04T08:54:02.3335268Z * [new branch] gh/williamwen42/250/orig -> origin/gh/williamwen42/250/orig 2025-12-04T08:54:02.3335350Z * [new branch] gh/williamwen42/279/base -> origin/gh/williamwen42/279/base 2025-12-04T08:54:02.3335429Z * [new branch] gh/williamwen42/279/head -> origin/gh/williamwen42/279/head 2025-12-04T08:54:02.3335506Z * [new branch] gh/williamwen42/279/orig -> origin/gh/williamwen42/279/orig 2025-12-04T08:54:02.3335584Z * [new branch] gh/williamwen42/282/base -> origin/gh/williamwen42/282/base 2025-12-04T08:54:02.3335688Z * [new branch] gh/williamwen42/282/head -> origin/gh/williamwen42/282/head 2025-12-04T08:54:02.3335766Z * [new branch] gh/williamwen42/282/orig -> origin/gh/williamwen42/282/orig 2025-12-04T08:54:02.3335848Z * [new branch] gh/williamwen42/287/base -> origin/gh/williamwen42/287/base 2025-12-04T08:54:02.3335924Z * [new branch] gh/williamwen42/287/head -> origin/gh/williamwen42/287/head 2025-12-04T08:54:02.3336002Z * [new branch] gh/williamwen42/287/orig -> origin/gh/williamwen42/287/orig 2025-12-04T08:54:02.3336082Z * [new branch] gh/williamwen42/288/base -> origin/gh/williamwen42/288/base 2025-12-04T08:54:02.3336161Z * [new branch] gh/williamwen42/288/head -> origin/gh/williamwen42/288/head 2025-12-04T08:54:02.3336242Z * [new branch] gh/williamwen42/288/orig -> origin/gh/williamwen42/288/orig 2025-12-04T08:54:02.3336320Z * [new branch] gh/williamwen42/296/base -> origin/gh/williamwen42/296/base 2025-12-04T08:54:02.3336398Z * [new branch] gh/williamwen42/296/head -> origin/gh/williamwen42/296/head 2025-12-04T08:54:02.3336476Z * [new branch] gh/williamwen42/296/orig -> 
origin/gh/williamwen42/296/orig 2025-12-04T08:54:02.3336554Z * [new branch] gh/williamwen42/297/base -> origin/gh/williamwen42/297/base 2025-12-04T08:54:02.3336631Z * [new branch] gh/williamwen42/297/head -> origin/gh/williamwen42/297/head 2025-12-04T08:54:02.3336709Z * [new branch] gh/williamwen42/297/orig -> origin/gh/williamwen42/297/orig 2025-12-04T08:54:02.3336789Z * [new branch] gh/williamwen42/306/base -> origin/gh/williamwen42/306/base 2025-12-04T08:54:02.3336866Z * [new branch] gh/williamwen42/306/head -> origin/gh/williamwen42/306/head 2025-12-04T08:54:02.3336944Z * [new branch] gh/williamwen42/306/orig -> origin/gh/williamwen42/306/orig 2025-12-04T08:54:02.3337022Z * [new branch] gh/williamwen42/309/base -> origin/gh/williamwen42/309/base 2025-12-04T08:54:02.3337099Z * [new branch] gh/williamwen42/309/head -> origin/gh/williamwen42/309/head 2025-12-04T08:54:02.3337179Z * [new branch] gh/williamwen42/309/orig -> origin/gh/williamwen42/309/orig 2025-12-04T08:54:02.3337256Z * [new branch] gh/williamwen42/310/base -> origin/gh/williamwen42/310/base 2025-12-04T08:54:02.3337334Z * [new branch] gh/williamwen42/310/head -> origin/gh/williamwen42/310/head 2025-12-04T08:54:02.3337437Z * [new branch] gh/williamwen42/310/orig -> origin/gh/williamwen42/310/orig 2025-12-04T08:54:02.3337514Z * [new branch] gh/williamwen42/311/base -> origin/gh/williamwen42/311/base 2025-12-04T08:54:02.3337594Z * [new branch] gh/williamwen42/311/head -> origin/gh/williamwen42/311/head 2025-12-04T08:54:02.3337671Z * [new branch] gh/williamwen42/311/orig -> origin/gh/williamwen42/311/orig 2025-12-04T08:54:02.3337748Z * [new branch] gh/williamwen42/319/base -> origin/gh/williamwen42/319/base 2025-12-04T08:54:02.3337827Z * [new branch] gh/williamwen42/319/head -> origin/gh/williamwen42/319/head 2025-12-04T08:54:02.3337903Z * [new branch] gh/williamwen42/319/orig -> origin/gh/williamwen42/319/orig 2025-12-04T08:54:02.3337981Z * [new branch] gh/williamwen42/325/base -> 
origin/gh/williamwen42/325/base 2025-12-04T08:54:02.3338059Z * [new branch] gh/williamwen42/325/head -> origin/gh/williamwen42/325/head 2025-12-04T08:54:02.3338137Z * [new branch] gh/williamwen42/325/orig -> origin/gh/williamwen42/325/orig 2025-12-04T08:54:02.3338218Z * [new branch] gh/williamwen42/326/base -> origin/gh/williamwen42/326/base 2025-12-04T08:54:02.3338296Z * [new branch] gh/williamwen42/326/head -> origin/gh/williamwen42/326/head 2025-12-04T08:54:02.3338401Z * [new branch] gh/williamwen42/326/orig -> origin/gh/williamwen42/326/orig 2025-12-04T08:54:02.3338481Z * [new branch] gh/williamwen42/327/base -> origin/gh/williamwen42/327/base 2025-12-04T08:54:02.3338558Z * [new branch] gh/williamwen42/327/head -> origin/gh/williamwen42/327/head 2025-12-04T08:54:02.3338635Z * [new branch] gh/williamwen42/327/orig -> origin/gh/williamwen42/327/orig 2025-12-04T08:54:02.3338713Z * [new branch] gh/williamwen42/328/base -> origin/gh/williamwen42/328/base 2025-12-04T08:54:02.3338790Z * [new branch] gh/williamwen42/328/head -> origin/gh/williamwen42/328/head 2025-12-04T08:54:02.3338869Z * [new branch] gh/williamwen42/328/orig -> origin/gh/williamwen42/328/orig 2025-12-04T08:54:02.3338947Z * [new branch] gh/williamwen42/329/base -> origin/gh/williamwen42/329/base 2025-12-04T08:54:02.3339024Z * [new branch] gh/williamwen42/329/head -> origin/gh/williamwen42/329/head 2025-12-04T08:54:02.3339103Z * [new branch] gh/williamwen42/329/orig -> origin/gh/williamwen42/329/orig 2025-12-04T08:54:02.3339181Z * [new branch] gh/williamwen42/330/base -> origin/gh/williamwen42/330/base 2025-12-04T08:54:02.3339257Z * [new branch] gh/williamwen42/330/head -> origin/gh/williamwen42/330/head 2025-12-04T08:54:02.3339334Z * [new branch] gh/williamwen42/330/orig -> origin/gh/williamwen42/330/orig 2025-12-04T08:54:02.3339412Z * [new branch] gh/williamwen42/331/base -> origin/gh/williamwen42/331/base 2025-12-04T08:54:02.3339490Z * [new branch] gh/williamwen42/331/head -> 
origin/gh/williamwen42/331/head 2025-12-04T08:54:02.3339567Z * [new branch] gh/williamwen42/331/orig -> origin/gh/williamwen42/331/orig 2025-12-04T08:54:02.3339645Z * [new branch] gh/williamwen42/332/base -> origin/gh/williamwen42/332/base 2025-12-04T08:54:02.3339724Z * [new branch] gh/williamwen42/332/head -> origin/gh/williamwen42/332/head 2025-12-04T08:54:02.3339802Z * [new branch] gh/williamwen42/332/orig -> origin/gh/williamwen42/332/orig 2025-12-04T08:54:02.3339879Z * [new branch] gh/williamwen42/333/base -> origin/gh/williamwen42/333/base 2025-12-04T08:54:02.3339956Z * [new branch] gh/williamwen42/333/head -> origin/gh/williamwen42/333/head 2025-12-04T08:54:02.3340034Z * [new branch] gh/williamwen42/333/orig -> origin/gh/williamwen42/333/orig 2025-12-04T08:54:02.3340139Z * [new branch] gh/williamwen42/334/base -> origin/gh/williamwen42/334/base 2025-12-04T08:54:02.3340255Z * [new branch] gh/williamwen42/334/head -> origin/gh/williamwen42/334/head 2025-12-04T08:54:02.3340334Z * [new branch] gh/williamwen42/334/orig -> origin/gh/williamwen42/334/orig 2025-12-04T08:54:02.3340411Z * [new branch] gh/williamwen42/335/base -> origin/gh/williamwen42/335/base 2025-12-04T08:54:02.3340489Z * [new branch] gh/williamwen42/335/head -> origin/gh/williamwen42/335/head 2025-12-04T08:54:02.3340568Z * [new branch] gh/williamwen42/335/orig -> origin/gh/williamwen42/335/orig 2025-12-04T08:54:02.3340645Z * [new branch] gh/williamwen42/336/base -> origin/gh/williamwen42/336/base 2025-12-04T08:54:02.3340722Z * [new branch] gh/williamwen42/336/head -> origin/gh/williamwen42/336/head 2025-12-04T08:54:02.3340801Z * [new branch] gh/williamwen42/336/orig -> origin/gh/williamwen42/336/orig 2025-12-04T08:54:02.3340879Z * [new branch] gh/williamwen42/337/base -> origin/gh/williamwen42/337/base 2025-12-04T08:54:02.3340957Z * [new branch] gh/williamwen42/337/head -> origin/gh/williamwen42/337/head 2025-12-04T08:54:02.3341035Z * [new branch] gh/williamwen42/337/orig -> 
origin/gh/williamwen42/337/orig 2025-12-04T08:54:02.3341155Z * [new branch] gh/williamwen42/338/base -> origin/gh/williamwen42/338/base 2025-12-04T08:54:02.3341234Z * [new branch] gh/williamwen42/338/head -> origin/gh/williamwen42/338/head 2025-12-04T08:54:02.3341312Z * [new branch] gh/williamwen42/338/orig -> origin/gh/williamwen42/338/orig 2025-12-04T08:54:02.3341389Z * [new branch] gh/williamwen42/339/base -> origin/gh/williamwen42/339/base 2025-12-04T08:54:02.3341467Z * [new branch] gh/williamwen42/339/head -> origin/gh/williamwen42/339/head 2025-12-04T08:54:02.3341544Z * [new branch] gh/williamwen42/339/orig -> origin/gh/williamwen42/339/orig 2025-12-04T08:54:02.3341623Z * [new branch] gh/williamwen42/340/base -> origin/gh/williamwen42/340/base 2025-12-04T08:54:02.3341702Z * [new branch] gh/williamwen42/340/head -> origin/gh/williamwen42/340/head 2025-12-04T08:54:02.3341779Z * [new branch] gh/williamwen42/340/orig -> origin/gh/williamwen42/340/orig 2025-12-04T08:54:02.3341858Z * [new branch] gh/williamwen42/341/base -> origin/gh/williamwen42/341/base 2025-12-04T08:54:02.3341936Z * [new branch] gh/williamwen42/341/head -> origin/gh/williamwen42/341/head 2025-12-04T08:54:02.3342013Z * [new branch] gh/williamwen42/341/orig -> origin/gh/williamwen42/341/orig 2025-12-04T08:54:02.3342090Z * [new branch] gh/williamwen42/342/base -> origin/gh/williamwen42/342/base 2025-12-04T08:54:02.3342168Z * [new branch] gh/williamwen42/342/head -> origin/gh/williamwen42/342/head 2025-12-04T08:54:02.3342246Z * [new branch] gh/williamwen42/342/orig -> origin/gh/williamwen42/342/orig 2025-12-04T08:54:02.3342323Z * [new branch] gh/williamwen42/343/base -> origin/gh/williamwen42/343/base 2025-12-04T08:54:02.3342401Z * [new branch] gh/williamwen42/343/head -> origin/gh/williamwen42/343/head 2025-12-04T08:54:02.3342479Z * [new branch] gh/williamwen42/343/orig -> origin/gh/williamwen42/343/orig 2025-12-04T08:54:02.3342556Z * [new branch] gh/williamwen42/344/base -> 
origin/gh/williamwen42/344/base 2025-12-04T08:54:02.3342634Z * [new branch] gh/williamwen42/344/head -> origin/gh/williamwen42/344/head 2025-12-04T08:54:02.3342711Z * [new branch] gh/williamwen42/344/orig -> origin/gh/williamwen42/344/orig 2025-12-04T08:54:02.3342790Z * [new branch] gh/williamwen42/345/base -> origin/gh/williamwen42/345/base 2025-12-04T08:54:02.3342869Z * [new branch] gh/williamwen42/345/head -> origin/gh/williamwen42/345/head 2025-12-04T08:54:02.3342981Z * [new branch] gh/williamwen42/345/orig -> origin/gh/williamwen42/345/orig 2025-12-04T08:54:02.3343059Z * [new branch] gh/williamwen42/346/base -> origin/gh/williamwen42/346/base 2025-12-04T08:54:02.3343136Z * [new branch] gh/williamwen42/346/head -> origin/gh/williamwen42/346/head 2025-12-04T08:54:02.3343215Z * [new branch] gh/williamwen42/346/orig -> origin/gh/williamwen42/346/orig 2025-12-04T08:54:02.3343292Z * [new branch] gh/williamwen42/347/base -> origin/gh/williamwen42/347/base 2025-12-04T08:54:02.3343369Z * [new branch] gh/williamwen42/347/head -> origin/gh/williamwen42/347/head 2025-12-04T08:54:02.3343447Z * [new branch] gh/williamwen42/347/orig -> origin/gh/williamwen42/347/orig 2025-12-04T08:54:02.3343525Z * [new branch] gh/williamwen42/348/base -> origin/gh/williamwen42/348/base 2025-12-04T08:54:02.3343604Z * [new branch] gh/williamwen42/348/head -> origin/gh/williamwen42/348/head 2025-12-04T08:54:02.3343681Z * [new branch] gh/williamwen42/348/orig -> origin/gh/williamwen42/348/orig 2025-12-04T08:54:02.3343760Z * [new branch] gh/williamwen42/349/base -> origin/gh/williamwen42/349/base 2025-12-04T08:54:02.3343869Z * [new branch] gh/williamwen42/349/head -> origin/gh/williamwen42/349/head 2025-12-04T08:54:02.3343947Z * [new branch] gh/williamwen42/349/orig -> origin/gh/williamwen42/349/orig 2025-12-04T08:54:02.3344025Z * [new branch] gh/williamwen42/350/base -> origin/gh/williamwen42/350/base 2025-12-04T08:54:02.3344102Z * [new branch] gh/williamwen42/350/head -> 
origin/gh/williamwen42/350/head 2025-12-04T08:54:02.3344180Z * [new branch] gh/williamwen42/350/orig -> origin/gh/williamwen42/350/orig 2025-12-04T08:54:02.3344257Z * [new branch] gh/williamwen42/351/base -> origin/gh/williamwen42/351/base 2025-12-04T08:54:02.3344336Z * [new branch] gh/williamwen42/351/head -> origin/gh/williamwen42/351/head 2025-12-04T08:54:02.3344415Z * [new branch] gh/williamwen42/351/orig -> origin/gh/williamwen42/351/orig 2025-12-04T08:54:02.3344492Z * [new branch] gh/williamwen42/352/base -> origin/gh/williamwen42/352/base 2025-12-04T08:54:02.3344570Z * [new branch] gh/williamwen42/352/head -> origin/gh/williamwen42/352/head 2025-12-04T08:54:02.3344648Z * [new branch] gh/williamwen42/352/orig -> origin/gh/williamwen42/352/orig 2025-12-04T08:54:02.3344724Z * [new branch] gh/williamwen42/353/base -> origin/gh/williamwen42/353/base 2025-12-04T08:54:02.3344801Z * [new branch] gh/williamwen42/353/head -> origin/gh/williamwen42/353/head 2025-12-04T08:54:02.3344879Z * [new branch] gh/williamwen42/353/orig -> origin/gh/williamwen42/353/orig 2025-12-04T08:54:02.3344957Z * [new branch] gh/williamwen42/354/base -> origin/gh/williamwen42/354/base 2025-12-04T08:54:02.3345034Z * [new branch] gh/williamwen42/354/head -> origin/gh/williamwen42/354/head 2025-12-04T08:54:02.3345112Z * [new branch] gh/williamwen42/354/orig -> origin/gh/williamwen42/354/orig 2025-12-04T08:54:02.3345190Z * [new branch] gh/williamwen42/355/base -> origin/gh/williamwen42/355/base 2025-12-04T08:54:02.3345267Z * [new branch] gh/williamwen42/355/head -> origin/gh/williamwen42/355/head 2025-12-04T08:54:02.3345345Z * [new branch] gh/williamwen42/355/orig -> origin/gh/williamwen42/355/orig 2025-12-04T08:54:02.3345421Z * [new branch] gh/williamwen42/356/base -> origin/gh/williamwen42/356/base 2025-12-04T08:54:02.3345498Z * [new branch] gh/williamwen42/356/head -> origin/gh/williamwen42/356/head 2025-12-04T08:54:02.3345576Z * [new branch] gh/williamwen42/356/orig -> 
origin/gh/williamwen42/356/orig 2025-12-04T08:54:02.3345680Z * [new branch] gh/williamwen42/357/base -> origin/gh/williamwen42/357/base 2025-12-04T08:54:02.3345758Z * [new branch] gh/williamwen42/357/head -> origin/gh/williamwen42/357/head 2025-12-04T08:54:02.3345835Z * [new branch] gh/williamwen42/357/orig -> origin/gh/williamwen42/357/orig 2025-12-04T08:54:02.3345914Z * [new branch] gh/williamwen42/358/base -> origin/gh/williamwen42/358/base 2025-12-04T08:54:02.3345993Z * [new branch] gh/williamwen42/358/head -> origin/gh/williamwen42/358/head 2025-12-04T08:54:02.3346071Z * [new branch] gh/williamwen42/358/orig -> origin/gh/williamwen42/358/orig 2025-12-04T08:54:02.3346140Z * [new branch] gh/xmfan/169/base -> origin/gh/xmfan/169/base 2025-12-04T08:54:02.3346209Z * [new branch] gh/xmfan/169/head -> origin/gh/xmfan/169/head 2025-12-04T08:54:02.3346277Z * [new branch] gh/xmfan/170/base -> origin/gh/xmfan/170/base 2025-12-04T08:54:02.3346344Z * [new branch] gh/xmfan/170/head -> origin/gh/xmfan/170/head 2025-12-04T08:54:02.3346412Z * [new branch] gh/xmfan/274/base -> origin/gh/xmfan/274/base 2025-12-04T08:54:02.3346478Z * [new branch] gh/xmfan/274/head -> origin/gh/xmfan/274/head 2025-12-04T08:54:02.3346570Z * [new branch] gh/xmfan/274/orig -> origin/gh/xmfan/274/orig 2025-12-04T08:54:02.3346639Z * [new branch] gh/xmfan/277/base -> origin/gh/xmfan/277/base 2025-12-04T08:54:02.3346705Z * [new branch] gh/xmfan/277/head -> origin/gh/xmfan/277/head 2025-12-04T08:54:02.3346771Z * [new branch] gh/xmfan/277/orig -> origin/gh/xmfan/277/orig 2025-12-04T08:54:02.3346837Z * [new branch] gh/xmfan/301/base -> origin/gh/xmfan/301/base 2025-12-04T08:54:02.3346903Z * [new branch] gh/xmfan/301/head -> origin/gh/xmfan/301/head 2025-12-04T08:54:02.3346971Z * [new branch] gh/xmfan/301/orig -> origin/gh/xmfan/301/orig 2025-12-04T08:54:02.3347038Z * [new branch] gh/xmfan/304/base -> origin/gh/xmfan/304/base 2025-12-04T08:54:02.3347104Z * [new branch] gh/xmfan/304/head -> 
origin/gh/xmfan/304/head 2025-12-04T08:54:02.3347172Z * [new branch] gh/xmfan/304/orig -> origin/gh/xmfan/304/orig 2025-12-04T08:54:02.3347238Z * [new branch] gh/xmfan/309/base -> origin/gh/xmfan/309/base 2025-12-04T08:54:02.3347304Z * [new branch] gh/xmfan/309/head -> origin/gh/xmfan/309/head 2025-12-04T08:54:02.3347371Z * [new branch] gh/xmfan/309/orig -> origin/gh/xmfan/309/orig 2025-12-04T08:54:02.3347437Z * [new branch] gh/xmfan/310/base -> origin/gh/xmfan/310/base 2025-12-04T08:54:02.3347503Z * [new branch] gh/xmfan/310/head -> origin/gh/xmfan/310/head 2025-12-04T08:54:02.3347636Z * [new branch] gh/xmfan/310/orig -> origin/gh/xmfan/310/orig 2025-12-04T08:54:02.3347701Z * [new branch] gh/xmfan/311/base -> origin/gh/xmfan/311/base 2025-12-04T08:54:02.3347767Z * [new branch] gh/xmfan/311/head -> origin/gh/xmfan/311/head 2025-12-04T08:54:02.3347836Z * [new branch] gh/xmfan/311/orig -> origin/gh/xmfan/311/orig 2025-12-04T08:54:02.3347902Z * [new branch] gh/xmfan/312/base -> origin/gh/xmfan/312/base 2025-12-04T08:54:02.3347967Z * [new branch] gh/xmfan/312/head -> origin/gh/xmfan/312/head 2025-12-04T08:54:02.3348034Z * [new branch] gh/xmfan/312/orig -> origin/gh/xmfan/312/orig 2025-12-04T08:54:02.3348099Z * [new branch] gh/xmfan/313/base -> origin/gh/xmfan/313/base 2025-12-04T08:54:02.3348165Z * [new branch] gh/xmfan/313/head -> origin/gh/xmfan/313/head 2025-12-04T08:54:02.3348259Z * [new branch] gh/xmfan/313/orig -> origin/gh/xmfan/313/orig 2025-12-04T08:54:02.3348338Z * [new branch] gh/xuanzhang816/27/base -> origin/gh/xuanzhang816/27/base 2025-12-04T08:54:02.3348416Z * [new branch] gh/xuanzhang816/27/head -> origin/gh/xuanzhang816/27/head 2025-12-04T08:54:02.3348495Z * [new branch] gh/xuanzhang816/27/orig -> origin/gh/xuanzhang816/27/orig 2025-12-04T08:54:02.3348571Z * [new branch] gh/xuanzhang816/32/base -> origin/gh/xuanzhang816/32/base 2025-12-04T08:54:02.3348647Z * [new branch] gh/xuanzhang816/32/head -> origin/gh/xuanzhang816/32/head 
2025-12-04T08:54:02.3348722Z * [new branch] gh/xuanzhang816/32/orig -> origin/gh/xuanzhang816/32/orig 2025-12-04T08:54:02.3348796Z * [new branch] gh/xuanzhang816/33/base -> origin/gh/xuanzhang816/33/base 2025-12-04T08:54:02.3348873Z * [new branch] gh/xuanzhang816/33/head -> origin/gh/xuanzhang816/33/head 2025-12-04T08:54:02.3348949Z * [new branch] gh/xuanzhang816/33/orig -> origin/gh/xuanzhang816/33/orig 2025-12-04T08:54:02.3349024Z * [new branch] gh/xuanzhang816/34/base -> origin/gh/xuanzhang816/34/base 2025-12-04T08:54:02.3349099Z * [new branch] gh/xuanzhang816/34/head -> origin/gh/xuanzhang816/34/head 2025-12-04T08:54:02.3349200Z * [new branch] gh/xuanzhang816/34/orig -> origin/gh/xuanzhang816/34/orig 2025-12-04T08:54:02.3349276Z * [new branch] gh/xuanzhang816/35/base -> origin/gh/xuanzhang816/35/base 2025-12-04T08:54:02.3349352Z * [new branch] gh/xuanzhang816/35/head -> origin/gh/xuanzhang816/35/head 2025-12-04T08:54:02.3349427Z * [new branch] gh/xuanzhang816/35/orig -> origin/gh/xuanzhang816/35/orig 2025-12-04T08:54:02.3349500Z * [new branch] gh/yanbing-j/11/base -> origin/gh/yanbing-j/11/base 2025-12-04T08:54:02.3349575Z * [new branch] gh/yanbing-j/11/head -> origin/gh/yanbing-j/11/head 2025-12-04T08:54:02.3349645Z * [new branch] gh/yanbing-j/11/orig -> origin/gh/yanbing-j/11/orig 2025-12-04T08:54:02.3349716Z * [new branch] gh/yanbing-j/12/base -> origin/gh/yanbing-j/12/base 2025-12-04T08:54:02.3349786Z * [new branch] gh/yanbing-j/12/head -> origin/gh/yanbing-j/12/head 2025-12-04T08:54:02.3349857Z * [new branch] gh/yanbing-j/12/orig -> origin/gh/yanbing-j/12/orig 2025-12-04T08:54:02.3349926Z * [new branch] gh/yanbing-j/13/base -> origin/gh/yanbing-j/13/base 2025-12-04T08:54:02.3349997Z * [new branch] gh/yanbing-j/13/head -> origin/gh/yanbing-j/13/head 2025-12-04T08:54:02.3350067Z * [new branch] gh/yanbing-j/13/orig -> origin/gh/yanbing-j/13/orig 2025-12-04T08:54:02.3350187Z * [new branch] gh/yanbing-j/14/base -> origin/gh/yanbing-j/14/base 
2025-12-04T08:54:02.3350262Z * [new branch] gh/yanbing-j/14/head -> origin/gh/yanbing-j/14/head 2025-12-04T08:54:02.3350332Z * [new branch] gh/yanbing-j/14/orig -> origin/gh/yanbing-j/14/orig 2025-12-04T08:54:02.3350401Z * [new branch] gh/yanbing-j/15/base -> origin/gh/yanbing-j/15/base 2025-12-04T08:54:02.3350470Z * [new branch] gh/yanbing-j/15/head -> origin/gh/yanbing-j/15/head 2025-12-04T08:54:02.3350540Z * [new branch] gh/yanbing-j/15/orig -> origin/gh/yanbing-j/15/orig 2025-12-04T08:54:02.3350611Z * [new branch] gh/yanbing-j/18/base -> origin/gh/yanbing-j/18/base 2025-12-04T08:54:02.3350680Z * [new branch] gh/yanbing-j/18/head -> origin/gh/yanbing-j/18/head 2025-12-04T08:54:02.3350750Z * [new branch] gh/yanbing-j/18/orig -> origin/gh/yanbing-j/18/orig 2025-12-04T08:54:02.3350821Z * [new branch] gh/yanbing-j/19/base -> origin/gh/yanbing-j/19/base 2025-12-04T08:54:02.3350964Z * [new branch] gh/yanbing-j/19/head -> origin/gh/yanbing-j/19/head 2025-12-04T08:54:02.3351033Z * [new branch] gh/yanbing-j/19/orig -> origin/gh/yanbing-j/19/orig 2025-12-04T08:54:02.3351104Z * [new branch] gh/yanbing-j/20/base -> origin/gh/yanbing-j/20/base 2025-12-04T08:54:02.3351173Z * [new branch] gh/yanbing-j/20/head -> origin/gh/yanbing-j/20/head 2025-12-04T08:54:02.3351244Z * [new branch] gh/yanbing-j/20/orig -> origin/gh/yanbing-j/20/orig 2025-12-04T08:54:02.3351314Z * [new branch] gh/yanbing-j/21/base -> origin/gh/yanbing-j/21/base 2025-12-04T08:54:02.3351383Z * [new branch] gh/yanbing-j/21/head -> origin/gh/yanbing-j/21/head 2025-12-04T08:54:02.3351452Z * [new branch] gh/yanbing-j/22/base -> origin/gh/yanbing-j/22/base 2025-12-04T08:54:02.3351523Z * [new branch] gh/yanbing-j/22/head -> origin/gh/yanbing-j/22/head 2025-12-04T08:54:02.3351593Z * [new branch] gh/yanbing-j/22/orig -> origin/gh/yanbing-j/22/orig 2025-12-04T08:54:02.3351662Z * [new branch] gh/yanbing-j/23/base -> origin/gh/yanbing-j/23/base 2025-12-04T08:54:02.3351733Z * [new branch] gh/yanbing-j/23/head -> 
origin/gh/yanbing-j/23/head 2025-12-04T08:54:02.3351852Z * [new branch] gh/yanbing-j/23/orig -> origin/gh/yanbing-j/23/orig 2025-12-04T08:54:02.3351923Z * [new branch] gh/yanbing-j/24/base -> origin/gh/yanbing-j/24/base 2025-12-04T08:54:02.3351993Z * [new branch] gh/yanbing-j/24/head -> origin/gh/yanbing-j/24/head 2025-12-04T08:54:02.3352062Z * [new branch] gh/yanbing-j/24/orig -> origin/gh/yanbing-j/24/orig 2025-12-04T08:54:02.3352133Z * [new branch] gh/yanbing-j/25/base -> origin/gh/yanbing-j/25/base 2025-12-04T08:54:02.3352202Z * [new branch] gh/yanbing-j/25/head -> origin/gh/yanbing-j/25/head 2025-12-04T08:54:02.3352273Z * [new branch] gh/yanbing-j/25/orig -> origin/gh/yanbing-j/25/orig 2025-12-04T08:54:02.3352343Z * [new branch] gh/yanbing-j/26/base -> origin/gh/yanbing-j/26/base 2025-12-04T08:54:02.3352412Z * [new branch] gh/yanbing-j/26/head -> origin/gh/yanbing-j/26/head 2025-12-04T08:54:02.3352483Z * [new branch] gh/yanbing-j/26/orig -> origin/gh/yanbing-j/26/orig 2025-12-04T08:54:02.3352562Z * [new branch] gh/yang-yu-hang/1/base -> origin/gh/yang-yu-hang/1/base 2025-12-04T08:54:02.3352636Z * [new branch] gh/yang-yu-hang/1/head -> origin/gh/yang-yu-hang/1/head 2025-12-04T08:54:02.3352710Z * [new branch] gh/yang-yu-hang/1/orig -> origin/gh/yang-yu-hang/1/orig 2025-12-04T08:54:02.3352785Z * [new branch] gh/yang-yu-hang/2/base -> origin/gh/yang-yu-hang/2/base 2025-12-04T08:54:02.3352859Z * [new branch] gh/yang-yu-hang/2/head -> origin/gh/yang-yu-hang/2/head 2025-12-04T08:54:02.3352932Z * [new branch] gh/yang-yu-hang/2/orig -> origin/gh/yang-yu-hang/2/orig 2025-12-04T08:54:02.3353005Z * [new branch] gh/yang-yu-hang/3/base -> origin/gh/yang-yu-hang/3/base 2025-12-04T08:54:02.3353076Z * [new branch] gh/yang-yu-hang/3/head -> origin/gh/yang-yu-hang/3/head 2025-12-04T08:54:02.3353149Z * [new branch] gh/yang-yu-hang/3/orig -> origin/gh/yang-yu-hang/3/orig 2025-12-04T08:54:02.3353223Z * [new branch] gh/yangw-dev/12/base -> origin/gh/yangw-dev/12/base 
2025-12-04T08:54:02.3353294Z * [new branch] gh/yangw-dev/12/head -> origin/gh/yangw-dev/12/head 2025-12-04T08:54:02.3353366Z * [new branch] gh/yangw-dev/12/orig -> origin/gh/yangw-dev/12/orig 2025-12-04T08:54:02.3353436Z * [new branch] gh/yangw-dev/13/base -> origin/gh/yangw-dev/13/base 2025-12-04T08:54:02.3353506Z * [new branch] gh/yangw-dev/13/head -> origin/gh/yangw-dev/13/head 2025-12-04T08:54:02.3353606Z * [new branch] gh/yangw-dev/13/orig -> origin/gh/yangw-dev/13/orig 2025-12-04T08:54:02.3353675Z * [new branch] gh/yangw-dev/14/base -> origin/gh/yangw-dev/14/base 2025-12-04T08:54:02.3353745Z * [new branch] gh/yangw-dev/14/head -> origin/gh/yangw-dev/14/head 2025-12-04T08:54:02.3353818Z * [new branch] gh/yangw-dev/14/orig -> origin/gh/yangw-dev/14/orig 2025-12-04T08:54:02.3353888Z * [new branch] gh/yangw-dev/15/base -> origin/gh/yangw-dev/15/base 2025-12-04T08:54:02.3353958Z * [new branch] gh/yangw-dev/15/head -> origin/gh/yangw-dev/15/head 2025-12-04T08:54:02.3354028Z * [new branch] gh/yangw-dev/15/orig -> origin/gh/yangw-dev/15/orig 2025-12-04T08:54:02.3354098Z * [new branch] gh/yangw-dev/19/base -> origin/gh/yangw-dev/19/base 2025-12-04T08:54:02.3354168Z * [new branch] gh/yangw-dev/19/head -> origin/gh/yangw-dev/19/head 2025-12-04T08:54:02.3354240Z * [new branch] gh/yangw-dev/19/orig -> origin/gh/yangw-dev/19/orig 2025-12-04T08:54:02.3354310Z * [new branch] gh/yangw-dev/26/base -> origin/gh/yangw-dev/26/base 2025-12-04T08:54:02.3354379Z * [new branch] gh/yangw-dev/26/head -> origin/gh/yangw-dev/26/head 2025-12-04T08:54:02.3354491Z * [new branch] gh/yangw-dev/26/orig -> origin/gh/yangw-dev/26/orig 2025-12-04T08:54:02.3354561Z * [new branch] gh/yangw-dev/27/base -> origin/gh/yangw-dev/27/base 2025-12-04T08:54:02.3354631Z * [new branch] gh/yangw-dev/27/head -> origin/gh/yangw-dev/27/head 2025-12-04T08:54:02.3354702Z * [new branch] gh/yangw-dev/27/orig -> origin/gh/yangw-dev/27/orig 2025-12-04T08:54:02.3354770Z * [new branch] gh/ydwu4/292/base -> 
origin/gh/ydwu4/292/base 2025-12-04T08:54:02.3354840Z * [new branch] gh/ydwu4/292/head -> origin/gh/ydwu4/292/head 2025-12-04T08:54:02.3354908Z * [new branch] gh/ydwu4/292/orig -> origin/gh/ydwu4/292/orig 2025-12-04T08:54:02.3354974Z * [new branch] gh/ydwu4/294/base -> origin/gh/ydwu4/294/base 2025-12-04T08:54:02.3355041Z * [new branch] gh/ydwu4/294/head -> origin/gh/ydwu4/294/head 2025-12-04T08:54:02.3355108Z * [new branch] gh/ydwu4/294/orig -> origin/gh/ydwu4/294/orig 2025-12-04T08:54:02.3355173Z * [new branch] gh/ydwu4/295/base -> origin/gh/ydwu4/295/base 2025-12-04T08:54:02.3355240Z * [new branch] gh/ydwu4/295/head -> origin/gh/ydwu4/295/head 2025-12-04T08:54:02.3355305Z * [new branch] gh/ydwu4/295/orig -> origin/gh/ydwu4/295/orig 2025-12-04T08:54:02.3355370Z * [new branch] gh/ydwu4/296/base -> origin/gh/ydwu4/296/base 2025-12-04T08:54:02.3355437Z * [new branch] gh/ydwu4/296/head -> origin/gh/ydwu4/296/head 2025-12-04T08:54:02.3355504Z * [new branch] gh/ydwu4/296/orig -> origin/gh/ydwu4/296/orig 2025-12-04T08:54:02.3355569Z * [new branch] gh/ydwu4/306/base -> origin/gh/ydwu4/306/base 2025-12-04T08:54:02.3355636Z * [new branch] gh/ydwu4/306/head -> origin/gh/ydwu4/306/head 2025-12-04T08:54:02.3355702Z * [new branch] gh/ydwu4/306/orig -> origin/gh/ydwu4/306/orig 2025-12-04T08:54:02.3355767Z * [new branch] gh/ydwu4/312/base -> origin/gh/ydwu4/312/base 2025-12-04T08:54:02.3355833Z * [new branch] gh/ydwu4/312/head -> origin/gh/ydwu4/312/head 2025-12-04T08:54:02.3355898Z * [new branch] gh/ydwu4/312/orig -> origin/gh/ydwu4/312/orig 2025-12-04T08:54:02.3355963Z * [new branch] gh/ydwu4/322/base -> origin/gh/ydwu4/322/base 2025-12-04T08:54:02.3356030Z * [new branch] gh/ydwu4/322/head -> origin/gh/ydwu4/322/head 2025-12-04T08:54:02.3356126Z * [new branch] gh/ydwu4/322/orig -> origin/gh/ydwu4/322/orig 2025-12-04T08:54:02.3356192Z * [new branch] gh/ydwu4/327/base -> origin/gh/ydwu4/327/base 2025-12-04T08:54:02.3356258Z * [new branch] gh/ydwu4/327/head -> 
origin/gh/ydwu4/327/head 2025-12-04T08:54:02.3356324Z * [new branch] gh/ydwu4/327/orig -> origin/gh/ydwu4/327/orig 2025-12-04T08:54:02.3356390Z * [new branch] gh/ydwu4/328/base -> origin/gh/ydwu4/328/base 2025-12-04T08:54:02.3356456Z * [new branch] gh/ydwu4/328/head -> origin/gh/ydwu4/328/head 2025-12-04T08:54:02.3356521Z * [new branch] gh/ydwu4/328/orig -> origin/gh/ydwu4/328/orig 2025-12-04T08:54:02.3356588Z * [new branch] gh/ydwu4/329/base -> origin/gh/ydwu4/329/base 2025-12-04T08:54:02.3356653Z * [new branch] gh/ydwu4/329/head -> origin/gh/ydwu4/329/head 2025-12-04T08:54:02.3356720Z * [new branch] gh/ydwu4/329/orig -> origin/gh/ydwu4/329/orig 2025-12-04T08:54:02.3356786Z * [new branch] gh/ydwu4/330/base -> origin/gh/ydwu4/330/base 2025-12-04T08:54:02.3356852Z * [new branch] gh/ydwu4/330/head -> origin/gh/ydwu4/330/head 2025-12-04T08:54:02.3356941Z * [new branch] gh/ydwu4/330/orig -> origin/gh/ydwu4/330/orig 2025-12-04T08:54:02.3357008Z * [new branch] gh/ydwu4/331/base -> origin/gh/ydwu4/331/base 2025-12-04T08:54:02.3357074Z * [new branch] gh/ydwu4/331/head -> origin/gh/ydwu4/331/head 2025-12-04T08:54:02.3357139Z * [new branch] gh/ydwu4/331/orig -> origin/gh/ydwu4/331/orig 2025-12-04T08:54:02.3357205Z * [new branch] gh/ydwu4/332/base -> origin/gh/ydwu4/332/base 2025-12-04T08:54:02.3357271Z * [new branch] gh/ydwu4/332/head -> origin/gh/ydwu4/332/head 2025-12-04T08:54:02.3357338Z * [new branch] gh/ydwu4/332/orig -> origin/gh/ydwu4/332/orig 2025-12-04T08:54:02.3357405Z * [new branch] gh/ydwu4/333/base -> origin/gh/ydwu4/333/base 2025-12-04T08:54:02.3357471Z * [new branch] gh/ydwu4/333/head -> origin/gh/ydwu4/333/head 2025-12-04T08:54:02.3357536Z * [new branch] gh/ydwu4/333/orig -> origin/gh/ydwu4/333/orig 2025-12-04T08:54:02.3357605Z * [new branch] gh/ydwu4/334/base -> origin/gh/ydwu4/334/base 2025-12-04T08:54:02.3357671Z * [new branch] gh/ydwu4/334/head -> origin/gh/ydwu4/334/head 2025-12-04T08:54:02.3357736Z * [new branch] gh/ydwu4/334/orig -> 
origin/gh/ydwu4/334/orig 2025-12-04T08:54:02.3357803Z * [new branch] gh/ydwu4/335/base -> origin/gh/ydwu4/335/base 2025-12-04T08:54:02.3357868Z * [new branch] gh/ydwu4/335/head -> origin/gh/ydwu4/335/head 2025-12-04T08:54:02.3357935Z * [new branch] gh/ydwu4/335/orig -> origin/gh/ydwu4/335/orig 2025-12-04T08:54:02.3358001Z * [new branch] gh/ydwu4/337/base -> origin/gh/ydwu4/337/base 2025-12-04T08:54:02.3358066Z * [new branch] gh/ydwu4/337/head -> origin/gh/ydwu4/337/head 2025-12-04T08:54:02.3358133Z * [new branch] gh/ydwu4/337/orig -> origin/gh/ydwu4/337/orig 2025-12-04T08:54:02.3358200Z * [new branch] gh/ydwu4/339/base -> origin/gh/ydwu4/339/base 2025-12-04T08:54:02.3358266Z * [new branch] gh/ydwu4/339/head -> origin/gh/ydwu4/339/head 2025-12-04T08:54:02.3358333Z * [new branch] gh/ydwu4/339/orig -> origin/gh/ydwu4/339/orig 2025-12-04T08:54:02.3358398Z * [new branch] gh/yf225/133/base -> origin/gh/yf225/133/base 2025-12-04T08:54:02.3358462Z * [new branch] gh/yf225/133/head -> origin/gh/yf225/133/head 2025-12-04T08:54:02.3358559Z * [new branch] gh/yf225/93/base -> origin/gh/yf225/93/base 2025-12-04T08:54:02.3358624Z * [new branch] gh/yf225/93/head -> origin/gh/yf225/93/head 2025-12-04T08:54:02.3358698Z * [new branch] gh/yifuwang/152/base -> origin/gh/yifuwang/152/base 2025-12-04T08:54:02.3358772Z * [new branch] gh/yifuwang/152/head -> origin/gh/yifuwang/152/head 2025-12-04T08:54:02.3358845Z * [new branch] gh/yifuwang/152/orig -> origin/gh/yifuwang/152/orig 2025-12-04T08:54:02.3358915Z * [new branch] gh/yifuwang/195/base -> origin/gh/yifuwang/195/base 2025-12-04T08:54:02.3358986Z * [new branch] gh/yifuwang/195/head -> origin/gh/yifuwang/195/head 2025-12-04T08:54:02.3359056Z * [new branch] gh/yifuwang/195/orig -> origin/gh/yifuwang/195/orig 2025-12-04T08:54:02.3359128Z * [new branch] gh/yiming0416/1/base -> origin/gh/yiming0416/1/base 2025-12-04T08:54:02.3359203Z * [new branch] gh/yiming0416/1/head -> origin/gh/yiming0416/1/head 2025-12-04T08:54:02.3359273Z * [new 
branch] gh/yiming0416/2/base -> origin/gh/yiming0416/2/base 2025-12-04T08:54:02.3359342Z * [new branch] gh/yiming0416/2/head -> origin/gh/yiming0416/2/head 2025-12-04T08:54:02.3359416Z * [new branch] gh/yushangdi/1/base -> origin/gh/yushangdi/1/base 2025-12-04T08:54:02.3359509Z * [new branch] gh/yushangdi/1/head -> origin/gh/yushangdi/1/head 2025-12-04T08:54:02.3359581Z * [new branch] gh/yushangdi/10/base -> origin/gh/yushangdi/10/base 2025-12-04T08:54:02.3359654Z * [new branch] gh/yushangdi/10/head -> origin/gh/yushangdi/10/head 2025-12-04T08:54:02.3359725Z * [new branch] gh/yushangdi/10/orig -> origin/gh/yushangdi/10/orig 2025-12-04T08:54:02.3359796Z * [new branch] gh/yushangdi/11/base -> origin/gh/yushangdi/11/base 2025-12-04T08:54:02.3359868Z * [new branch] gh/yushangdi/11/head -> origin/gh/yushangdi/11/head 2025-12-04T08:54:02.3359938Z * [new branch] gh/yushangdi/11/orig -> origin/gh/yushangdi/11/orig 2025-12-04T08:54:02.3360010Z * [new branch] gh/yushangdi/2/base -> origin/gh/yushangdi/2/base 2025-12-04T08:54:02.3360082Z * [new branch] gh/yushangdi/2/head -> origin/gh/yushangdi/2/head 2025-12-04T08:54:02.3360256Z * [new branch] gh/yushangdi/7/base -> origin/gh/yushangdi/7/base 2025-12-04T08:54:02.3360330Z * [new branch] gh/yushangdi/7/head -> origin/gh/yushangdi/7/head 2025-12-04T08:54:02.3360400Z * [new branch] gh/yushangdi/7/orig -> origin/gh/yushangdi/7/orig 2025-12-04T08:54:02.3360469Z * [new branch] gh/yushangdi/8/base -> origin/gh/yushangdi/8/base 2025-12-04T08:54:02.3360540Z * [new branch] gh/yushangdi/8/head -> origin/gh/yushangdi/8/head 2025-12-04T08:54:02.3360611Z * [new branch] gh/yushangdi/8/orig -> origin/gh/yushangdi/8/orig 2025-12-04T08:54:02.3360681Z * [new branch] gh/yushangdi/9/base -> origin/gh/yushangdi/9/base 2025-12-04T08:54:02.3360751Z * [new branch] gh/yushangdi/9/head -> origin/gh/yushangdi/9/head 2025-12-04T08:54:02.3360820Z * [new branch] gh/yushangdi/9/orig -> origin/gh/yushangdi/9/orig 2025-12-04T08:54:02.3360889Z * [new branch] 
gh/zklaus/19/base -> origin/gh/zklaus/19/base 2025-12-04T08:54:02.3360958Z * [new branch] gh/zklaus/19/head -> origin/gh/zklaus/19/head 2025-12-04T08:54:02.3361026Z * [new branch] gh/zklaus/19/orig -> origin/gh/zklaus/19/orig 2025-12-04T08:54:02.3361092Z * [new branch] gh/zklaus/20/base -> origin/gh/zklaus/20/base 2025-12-04T08:54:02.3361160Z * [new branch] gh/zklaus/20/head -> origin/gh/zklaus/20/head 2025-12-04T08:54:02.3361273Z * [new branch] gh/zklaus/20/orig -> origin/gh/zklaus/20/orig 2025-12-04T08:54:02.3361339Z * [new branch] gh/zklaus/21/base -> origin/gh/zklaus/21/base 2025-12-04T08:54:02.3361407Z * [new branch] gh/zklaus/21/head -> origin/gh/zklaus/21/head 2025-12-04T08:54:02.3361472Z * [new branch] gh/zklaus/21/orig -> origin/gh/zklaus/21/orig 2025-12-04T08:54:02.3361540Z * [new branch] gh/zklaus/22/base -> origin/gh/zklaus/22/base 2025-12-04T08:54:02.3361606Z * [new branch] gh/zklaus/22/head -> origin/gh/zklaus/22/head 2025-12-04T08:54:02.3361673Z * [new branch] gh/zklaus/22/orig -> origin/gh/zklaus/22/orig 2025-12-04T08:54:02.3361740Z * [new branch] gh/zklaus/23/base -> origin/gh/zklaus/23/base 2025-12-04T08:54:02.3361806Z * [new branch] gh/zklaus/23/head -> origin/gh/zklaus/23/head 2025-12-04T08:54:02.3361873Z * [new branch] gh/zklaus/23/orig -> origin/gh/zklaus/23/orig 2025-12-04T08:54:02.3361940Z * [new branch] gh/zklaus/24/base -> origin/gh/zklaus/24/base 2025-12-04T08:54:02.3362006Z * [new branch] gh/zklaus/24/head -> origin/gh/zklaus/24/head 2025-12-04T08:54:02.3362071Z * [new branch] gh/zklaus/24/orig -> origin/gh/zklaus/24/orig 2025-12-04T08:54:02.3362194Z * [new branch] gh/zou3519/1197/base -> origin/gh/zou3519/1197/base 2025-12-04T08:54:02.3362264Z * [new branch] gh/zou3519/1197/head -> origin/gh/zou3519/1197/head 2025-12-04T08:54:02.3362334Z * [new branch] gh/zou3519/1197/orig -> origin/gh/zou3519/1197/orig 2025-12-04T08:54:02.3362403Z * [new branch] gh/zou3519/1199/base -> origin/gh/zou3519/1199/base 2025-12-04T08:54:02.3362472Z * [new 
branch] gh/zou3519/1199/head -> origin/gh/zou3519/1199/head 2025-12-04T08:54:02.3362542Z * [new branch] gh/zou3519/1199/orig -> origin/gh/zou3519/1199/orig 2025-12-04T08:54:02.3362611Z * [new branch] gh/zou3519/1200/base -> origin/gh/zou3519/1200/base 2025-12-04T08:54:02.3362678Z * [new branch] gh/zou3519/1200/head -> origin/gh/zou3519/1200/head 2025-12-04T08:54:02.3362745Z * [new branch] gh/zou3519/1200/orig -> origin/gh/zou3519/1200/orig 2025-12-04T08:54:02.3362814Z * [new branch] gh/zou3519/1201/base -> origin/gh/zou3519/1201/base 2025-12-04T08:54:02.3362882Z * [new branch] gh/zou3519/1201/head -> origin/gh/zou3519/1201/head 2025-12-04T08:54:02.3362949Z * [new branch] gh/zou3519/1201/orig -> origin/gh/zou3519/1201/orig 2025-12-04T08:54:02.3363018Z * [new branch] gh/zou3519/1202/base -> origin/gh/zou3519/1202/base 2025-12-04T08:54:02.3363085Z * [new branch] gh/zou3519/1202/head -> origin/gh/zou3519/1202/head 2025-12-04T08:54:02.3363155Z * [new branch] gh/zou3519/1202/orig -> origin/gh/zou3519/1202/orig 2025-12-04T08:54:02.3363223Z * [new branch] gh/zpcore/1/base -> origin/gh/zpcore/1/base 2025-12-04T08:54:02.3363291Z * [new branch] gh/zpcore/1/head -> origin/gh/zpcore/1/head 2025-12-04T08:54:02.3363359Z * [new branch] gh/zpcore/11/base -> origin/gh/zpcore/11/base 2025-12-04T08:54:02.3363428Z * [new branch] gh/zpcore/11/head -> origin/gh/zpcore/11/head 2025-12-04T08:54:02.3363495Z * [new branch] gh/zpcore/11/orig -> origin/gh/zpcore/11/orig 2025-12-04T08:54:02.3363562Z * [new branch] gh/zpcore/12/base -> origin/gh/zpcore/12/base 2025-12-04T08:54:02.3363629Z * [new branch] gh/zpcore/12/head -> origin/gh/zpcore/12/head 2025-12-04T08:54:02.3363695Z * [new branch] gh/zpcore/12/orig -> origin/gh/zpcore/12/orig 2025-12-04T08:54:02.3363792Z * [new branch] gh/zpcore/13/base -> origin/gh/zpcore/13/base 2025-12-04T08:54:02.3363859Z * [new branch] gh/zpcore/13/head -> origin/gh/zpcore/13/head 2025-12-04T08:54:02.3363925Z * [new branch] gh/zpcore/13/orig -> 
origin/gh/zpcore/13/orig 2025-12-04T08:54:02.3363993Z * [new branch] gh/zpcore/14/base -> origin/gh/zpcore/14/base 2025-12-04T08:54:02.3364060Z * [new branch] gh/zpcore/14/head -> origin/gh/zpcore/14/head 2025-12-04T08:54:02.3364127Z * [new branch] gh/zpcore/14/orig -> origin/gh/zpcore/14/orig 2025-12-04T08:54:02.3364194Z * [new branch] gh/zpcore/15/base -> origin/gh/zpcore/15/base 2025-12-04T08:54:02.3364261Z * [new branch] gh/zpcore/15/head -> origin/gh/zpcore/15/head 2025-12-04T08:54:02.3364328Z * [new branch] gh/zpcore/15/orig -> origin/gh/zpcore/15/orig 2025-12-04T08:54:02.3364396Z * [new branch] gh/zpcore/2/base -> origin/gh/zpcore/2/base 2025-12-04T08:54:02.3364463Z * [new branch] gh/zpcore/2/head -> origin/gh/zpcore/2/head 2025-12-04T08:54:02.3364530Z * [new branch] gh/zpcore/21/base -> origin/gh/zpcore/21/base 2025-12-04T08:54:02.3364598Z * [new branch] gh/zpcore/21/head -> origin/gh/zpcore/21/head 2025-12-04T08:54:02.3364688Z * [new branch] gh/zpcore/21/orig -> origin/gh/zpcore/21/orig 2025-12-04T08:54:02.3364757Z * [new branch] gh/zpcore/22/base -> origin/gh/zpcore/22/base 2025-12-04T08:54:02.3364824Z * [new branch] gh/zpcore/22/head -> origin/gh/zpcore/22/head 2025-12-04T08:54:02.3364889Z * [new branch] gh/zpcore/22/orig -> origin/gh/zpcore/22/orig 2025-12-04T08:54:02.3364957Z * [new branch] gh/zpcore/23/base -> origin/gh/zpcore/23/base 2025-12-04T08:54:02.3365023Z * [new branch] gh/zpcore/23/head -> origin/gh/zpcore/23/head 2025-12-04T08:54:02.3365091Z * [new branch] gh/zpcore/23/orig -> origin/gh/zpcore/23/orig 2025-12-04T08:54:02.3365158Z * [new branch] gh/zpcore/24/base -> origin/gh/zpcore/24/base 2025-12-04T08:54:02.3365225Z * [new branch] gh/zpcore/24/head -> origin/gh/zpcore/24/head 2025-12-04T08:54:02.3365292Z * [new branch] gh/zpcore/24/orig -> origin/gh/zpcore/24/orig 2025-12-04T08:54:02.3365359Z * [new branch] gh/zpcore/25/base -> origin/gh/zpcore/25/base 2025-12-04T08:54:02.3365426Z * [new branch] gh/zpcore/25/head -> 
origin/gh/zpcore/25/head 2025-12-04T08:54:02.3365493Z * [new branch] gh/zpcore/25/orig -> origin/gh/zpcore/25/orig 2025-12-04T08:54:02.3365561Z * [new branch] gh/zpcore/26/base -> origin/gh/zpcore/26/base 2025-12-04T08:54:02.3365627Z * [new branch] gh/zpcore/26/head -> origin/gh/zpcore/26/head 2025-12-04T08:54:02.3365695Z * [new branch] gh/zpcore/26/orig -> origin/gh/zpcore/26/orig 2025-12-04T08:54:02.3365762Z * [new branch] gh/zpcore/27/base -> origin/gh/zpcore/27/base 2025-12-04T08:54:02.3365828Z * [new branch] gh/zpcore/27/head -> origin/gh/zpcore/27/head 2025-12-04T08:54:02.3365896Z * [new branch] gh/zpcore/27/orig -> origin/gh/zpcore/27/orig 2025-12-04T08:54:02.3365963Z * [new branch] gh/zpcore/28/base -> origin/gh/zpcore/28/base 2025-12-04T08:54:02.3366029Z * [new branch] gh/zpcore/28/head -> origin/gh/zpcore/28/head 2025-12-04T08:54:02.3366095Z * [new branch] gh/zpcore/28/orig -> origin/gh/zpcore/28/orig 2025-12-04T08:54:02.3366163Z * [new branch] gh/zpcore/3/base -> origin/gh/zpcore/3/base 2025-12-04T08:54:02.3366230Z * [new branch] gh/zpcore/3/head -> origin/gh/zpcore/3/head 2025-12-04T08:54:02.3366328Z * [new branch] gh/zpcore/4/base -> origin/gh/zpcore/4/base 2025-12-04T08:54:02.3366394Z * [new branch] gh/zpcore/4/head -> origin/gh/zpcore/4/head 2025-12-04T08:54:02.3366461Z * [new branch] gh/zpcore/5/base -> origin/gh/zpcore/5/base 2025-12-04T08:54:02.3366529Z * [new branch] gh/zpcore/5/head -> origin/gh/zpcore/5/head 2025-12-04T08:54:02.3366596Z * [new branch] gh/zpcore/6/base -> origin/gh/zpcore/6/base 2025-12-04T08:54:02.3366661Z * [new branch] gh/zpcore/6/head -> origin/gh/zpcore/6/head 2025-12-04T08:54:02.3366728Z * [new branch] gh/zpcore/7/base -> origin/gh/zpcore/7/base 2025-12-04T08:54:02.3366793Z * [new branch] gh/zpcore/7/head -> origin/gh/zpcore/7/head 2025-12-04T08:54:02.3366858Z * [new branch] gh/zpcore/8/base -> origin/gh/zpcore/8/base 2025-12-04T08:54:02.3366927Z * [new branch] gh/zpcore/8/head -> origin/gh/zpcore/8/head 
2025-12-04T08:54:02.3366995Z * [new branch] google-main -> origin/google-main 2025-12-04T08:54:02.3367080Z * [new branch] guangyey/external_stream -> origin/guangyey/external_stream 2025-12-04T08:54:02.3367153Z * [new branch] guangyey/test_2025 -> origin/guangyey/test_2025 2025-12-04T08:54:02.3367325Z * [new branch] guilhermeleobas/cherry-pick-55d87d9dfd9 -> origin/guilhermeleobas/cherry-pick-55d87d9dfd9 2025-12-04T08:54:02.3367443Z * [new branch] hameerabbasi/complex_tensor_subclass -> origin/hameerabbasi/complex_tensor_subclass 2025-12-04T08:54:02.3367585Z * [new branch] hameerabbasi/fix-ctensor-gradcheck-tests -> origin/hameerabbasi/fix-ctensor-gradcheck-tests 2025-12-04T08:54:02.3367693Z * [new branch] hameerabbasi/gradcheck-allclose -> origin/hameerabbasi/gradcheck-allclose 2025-12-04T08:54:02.3367760Z * [new branch] hc_baseline -> origin/hc_baseline 2025-12-04T08:54:02.3367824Z * [new branch] hhh_rand -> origin/hhh_rand 2025-12-04T08:54:02.3367886Z * [new branch] huba/f1 -> origin/huba/f1 2025-12-04T08:54:02.3368076Z * [new branch] increase-timeout-linux-jammy-cuda12_8-py3_10-gcc11-test -> origin/increase-timeout-linux-jammy-cuda12_8-py3_10-gcc11-test 2025-12-04T08:54:02.3368138Z * [new branch] inlining -> origin/inlining 2025-12-04T08:54:02.3368209Z * [new branch] inlining-ezyang -> origin/inlining-ezyang 2025-12-04T08:54:02.3368294Z * [new branch] install-torchao-0.13.0 -> origin/install-torchao-0.13.0 2025-12-04T08:54:02.3368475Z * [new branch] instrument-trunk-pull-linux-with-job-test-filters -> origin/instrument-trunk-pull-linux-with-job-test-filters 2025-12-04T08:54:02.3368545Z * [new branch] invoke-subgraph -> origin/invoke-subgraph 2025-12-04T08:54:02.3368612Z * [new branch] issue#58739 -> origin/issue#58739 2025-12-04T08:54:02.3368690Z * [new branch] jainapurva-patch-1 -> origin/jainapurva-patch-1 2025-12-04T08:54:02.3368751Z * [new branch] jathu/o3 -> origin/jathu/o3 2025-12-04T08:54:02.3368814Z * [new branch] jathu/sve -> origin/jathu/sve 
2025-12-04T08:54:02.3368940Z * [new branch] jcaip/test-cusparselt-version-0.6.2 -> origin/jcaip/test-cusparselt-version-0.6.2 2025-12-04T08:54:02.3369045Z * [new branch] jcaip/update-cusparselt-0.6.2 -> origin/jcaip/update-cusparselt-0.6.2 2025-12-04T08:54:02.3369159Z * [new branch] jiannanWang/memorysnapshot_filter -> origin/jiannanWang/memorysnapshot_filter 2025-12-04T08:54:02.3369269Z * [new branch] jiannanWang/profilerstepwarning -> origin/jiannanWang/profilerstepwarning 2025-12-04T08:54:02.3369385Z * [new branch] jithunnair-amd-patch-1 -> origin/jithunnair-amd-patch-1 2025-12-04T08:54:02.3369472Z * [new branch] jithunnair-amd-patch-10 -> origin/jithunnair-amd-patch-10 2025-12-04T08:54:02.3369554Z * [new branch] jithunnair-amd-patch-2 -> origin/jithunnair-amd-patch-2 2025-12-04T08:54:02.3369637Z * [new branch] jithunnair-amd-patch-3 -> origin/jithunnair-amd-patch-3 2025-12-04T08:54:02.3369716Z * [new branch] jithunnair-amd-patch-4 -> origin/jithunnair-amd-patch-4 2025-12-04T08:54:02.3369795Z * [new branch] jithunnair-amd-patch-5 -> origin/jithunnair-amd-patch-5 2025-12-04T08:54:02.3369878Z * [new branch] jithunnair-amd-patch-6 -> origin/jithunnair-amd-patch-6 2025-12-04T08:54:02.3369956Z * [new branch] jithunnair-amd-patch-7 -> origin/jithunnair-amd-patch-7 2025-12-04T08:54:02.3370034Z * [new branch] jithunnair-amd-patch-8 -> origin/jithunnair-amd-patch-8 2025-12-04T08:54:02.3370160Z * [new branch] jithunnair-amd-patch-9 -> origin/jithunnair-amd-patch-9 2025-12-04T08:54:02.3370237Z * [new branch] justinchu/native-qdq -> origin/justinchu/native-qdq 2025-12-04T08:54:02.3370309Z * [new branch] kainan666/xlf_debug -> origin/kainan666/xlf_debug 2025-12-04T08:54:02.3370414Z * [new branch] kainan_test -> origin/kainan_test 2025-12-04T08:54:02.3370492Z * [new branch] larryliu0820-patch-1 -> origin/larryliu0820-patch-1 2025-12-04T08:54:02.3370597Z * [new branch] leslie/test_group_gemm_epilogues -> origin/leslie/test_group_gemm_epilogues 2025-12-04T08:54:02.3370703Z * 
[new branch] lessw2020/fix_cutlass_cache_error -> origin/lessw2020/fix_cutlass_cache_error 2025-12-04T08:54:02.3370781Z * [new branch] liaoxuan/shm_all_reduce -> origin/liaoxuan/shm_all_reduce 2025-12-04T08:54:02.3370883Z * [new branch] liaoxuan/test_fa_disable_softmax -> origin/liaoxuan/test_fa_disable_softmax 2025-12-04T08:54:02.3370965Z * [new branch] liaoxuan/test_int8_sdpa -> origin/liaoxuan/test_int8_sdpa 2025-12-04T08:54:02.3371033Z * [new branch] llama4-stable -> origin/llama4-stable 2025-12-04T08:54:02.3371100Z * [new branch] lts/release/1.8 -> origin/lts/release/1.8 2025-12-04T08:54:02.3371175Z * [new branch] lucaskabela/#94773 -> origin/lucaskabela/#94773 2025-12-04T08:54:02.3371251Z * [new branch] lucaskabela/fix_164876 -> origin/lucaskabela/fix_164876 2025-12-04T08:54:02.3371335Z * [new branch] lucaskabela/flop_counter -> origin/lucaskabela/flop_counter 2025-12-04T08:54:02.3371432Z * [new branch] lucaskabela/func_under_decomp -> origin/lucaskabela/func_under_decomp 2025-12-04T08:54:02.3371537Z * [new branch] lucaskabela/functional_in_dynamo -> origin/lucaskabela/functional_in_dynamo 2025-12-04T08:54:02.3371666Z * [new branch] lucaskabela/install_params_as_graph_attr -> origin/lucaskabela/install_params_as_graph_attr 2025-12-04T08:54:02.3371781Z * [new branch] lucaskabela/parameters_as_graph_attr -> origin/lucaskabela/parameters_as_graph_attr 2025-12-04T08:54:02.3371915Z * [new branch] lucaskabela/remove_aot_dispatcher_metadata -> origin/lucaskabela/remove_aot_dispatcher_metadata 2025-12-04T08:54:02.3371996Z * [new branch] lucaskabela/rnn_decomp -> origin/lucaskabela/rnn_decomp 2025-12-04T08:54:02.3372088Z * [new branch] lucaskabela/typing_backends -> origin/lucaskabela/typing_backends 2025-12-04T08:54:02.3372185Z * [new branch] lucaskabela/typing_ctx_manager -> origin/lucaskabela/typing_ctx_manager 2025-12-04T08:54:02.3372281Z * [new branch] lucaskabela/typing_nn_module -> origin/lucaskabela/typing_nn_module 2025-12-04T08:54:02.3372435Z * [new branch] 
lucaskabela/typing_user_defined -> origin/lucaskabela/typing_user_defined 2025-12-04T08:54:02.3372530Z * [new branch] lucaskabela/typing_variables -> origin/lucaskabela/typing_variables 2025-12-04T08:54:02.3372640Z * [new branch] lucaskabela/typing_variables_dicts -> origin/lucaskabela/typing_variables_dicts 2025-12-04T08:54:02.3372762Z * [new branch] lucaskabela/typing_variables_functions -> origin/lucaskabela/typing_variables_functions 2025-12-04T08:54:02.3372871Z * [new branch] lucaskabela/typing_variables_lists -> origin/lucaskabela/typing_variables_lists 2025-12-04T08:54:02.3372944Z * [new branch] lw/torch_box_by_ref -> origin/lw/torch_box_by_ref 2025-12-04T08:54:02.3373005Z * [new branch] main -> origin/main 2025-12-04T08:54:02.3373077Z * [new branch] malfet-patch-1 -> origin/malfet-patch-1 2025-12-04T08:54:02.3373148Z * [new branch] malfet-patch-2 -> origin/malfet-patch-2 2025-12-04T08:54:02.3373214Z * [new branch] malfet-patch-3 -> origin/malfet-patch-3 2025-12-04T08:54:02.3373281Z * [new branch] malfet-patch-4 -> origin/malfet-patch-4 2025-12-04T08:54:02.3373346Z * [new branch] malfet-patch-5 -> origin/malfet-patch-5 2025-12-04T08:54:02.3373439Z * [new branch] malfet-patch-6 -> origin/malfet-patch-6 2025-12-04T08:54:02.3373506Z * [new branch] malfet-patch-7 -> origin/malfet-patch-7 2025-12-04T08:54:02.3373571Z * [new branch] malfet-patch-8 -> origin/malfet-patch-8 2025-12-04T08:54:02.3373645Z * [new branch] malfet/add-3.14-ci -> origin/malfet/add-3.14-ci 2025-12-04T08:54:02.3373809Z * [new branch] malfet/be-do-not-make-typos-in-build-artifacts -> origin/malfet/be-do-not-make-typos-in-build-artifacts 2025-12-04T08:54:02.3373979Z * [new branch] malfet/be-move-more-settings-to-checkout-pytorch -> origin/malfet/be-move-more-settings-to-checkout-pytorch 2025-12-04T08:54:02.3374106Z * [new branch] malfet/be-remove-misisng-neon-headers -> origin/malfet/be-remove-misisng-neon-headers 2025-12-04T08:54:02.3374205Z * [new branch] malfet/mps-implement-col2im -> 
origin/malfet/mps-implement-col2im 2025-12-04T08:54:02.3374321Z * [new branch] manuel/aoti_metal_shimify-thread_safe -> origin/manuel/aoti_metal_shimify-thread_safe 2025-12-04T08:54:02.3374414Z * [new branch] manuel/inductor_link_openmp -> origin/manuel/inductor_link_openmp 2025-12-04T08:54:02.3374488Z * [new branch] masnesral/metaconda -> origin/masnesral/metaconda 2025-12-04T08:54:02.3374563Z * [new branch] mem_profiler_flaky_fix -> origin/mem_profiler_flaky_fix 2025-12-04T08:54:02.3374644Z * [new branch] mem_profiler_stack_trace -> origin/mem_profiler_stack_trace 2025-12-04T08:54:02.3374721Z * [new branch] memory_profiler_stack -> origin/memory_profiler_stack 2025-12-04T08:54:02.3374796Z * [new branch] metascroy-patch-1 -> origin/metascroy-patch-1 2025-12-04T08:54:02.3374861Z * [new branch] mingw_posix -> origin/mingw_posix 2025-12-04T08:54:02.3374936Z * [new branch] mlazos/S429861-debug -> origin/mlazos/S429861-debug 2025-12-04T08:54:02.3374999Z * [new branch] mlazos/aa -> origin/mlazos/aa 2025-12-04T08:54:02.3375063Z * [new branch] mlazos/acts -> origin/mlazos/acts 2025-12-04T08:54:02.3375135Z * [new branch] mlazos/arg-renames -> origin/mlazos/arg-renames 2025-12-04T08:54:02.3375213Z * [new branch] mlazos/bad-cudagraphs -> origin/mlazos/bad-cudagraphs 2025-12-04T08:54:02.3375315Z * [new branch] mlazos/baseline-graph-breaks -> origin/mlazos/baseline-graph-breaks 2025-12-04T08:54:02.3375414Z * [new branch] mlazos/beta-tensor -> origin/mlazos/beta-tensor 2025-12-04T08:54:02.3375479Z * [new branch] mlazos/buffers -> origin/mlazos/buffers 2025-12-04T08:54:02.3375546Z * [new branch] mlazos/buffers2 -> origin/mlazos/buffers2 2025-12-04T08:54:02.3375612Z * [new branch] mlazos/buffers3 -> origin/mlazos/buffers3 2025-12-04T08:54:02.3375676Z * [new branch] mlazos/bwd -> origin/mlazos/bwd 2025-12-04T08:54:02.3375748Z * [new branch] mlazos/combo-test -> origin/mlazos/combo-test 2025-12-04T08:54:02.3375819Z * [new branch] mlazos/ctx-cleanup -> origin/mlazos/ctx-cleanup 
2025-12-04T08:54:02.3375892Z * [new branch] mlazos/cuda-cmd-log -> origin/mlazos/cuda-cmd-log 2025-12-04T08:54:02.3375974Z * [new branch] mlazos/cudagraph-tests -> origin/mlazos/cudagraph-tests 2025-12-04T08:54:02.3376077Z * [new branch] mlazos/cudagraphs-measurement -> origin/mlazos/cudagraphs-measurement 2025-12-04T08:54:02.3376152Z * [new branch] mlazos/cutlass-test -> origin/mlazos/cutlass-test 2025-12-04T08:54:02.3376233Z * [new branch] mlazos/cutlass-topo-bug -> origin/mlazos/cutlass-topo-bug 2025-12-04T08:54:02.3376343Z * [new branch] mlazos/dataclass-proxy -> origin/mlazos/dataclass-proxy 2025-12-04T08:54:02.3376412Z * [new branch] mlazos/dc-attrs -> origin/mlazos/dc-attrs 2025-12-04T08:54:02.3376481Z * [new branch] mlazos/dc-helion -> origin/mlazos/dc-helion 2025-12-04T08:54:02.3376548Z * [new branch] mlazos/dict-fix -> origin/mlazos/dict-fix 2025-12-04T08:54:02.3376619Z * [new branch] mlazos/disable-tf -> origin/mlazos/disable-tf 2025-12-04T08:54:02.3376685Z * [new branch] mlazos/dupe-fix -> origin/mlazos/dupe-fix 2025-12-04T08:54:02.3376755Z * [new branch] mlazos/dyn-batch -> origin/mlazos/dyn-batch 2025-12-04T08:54:02.3376817Z * [new branch] mlazos/evt -> origin/mlazos/evt 2025-12-04T08:54:02.3376898Z * [new branch] mlazos/extract-examples -> origin/mlazos/extract-examples 2025-12-04T08:54:02.3376967Z * [new branch] mlazos/foreach-op -> origin/mlazos/foreach-op 2025-12-04T08:54:02.3377032Z * [new branch] mlazos/fp8 -> origin/mlazos/fp8 2025-12-04T08:54:02.3377098Z * [new branch] mlazos/fp8-bias -> origin/mlazos/fp8-bias 2025-12-04T08:54:02.3377176Z * [new branch] mlazos/fp8-bias-fusion -> origin/mlazos/fp8-bias-fusion 2025-12-04T08:54:02.3377244Z * [new branch] mlazos/fp8-fixes -> origin/mlazos/fp8-fixes 2025-12-04T08:54:02.3377311Z * [new branch] mlazos/freezing -> origin/mlazos/freezing 2025-12-04T08:54:02.3377378Z * [new branch] mlazos/h-comp -> origin/mlazos/h-comp 2025-12-04T08:54:02.3377446Z * [new branch] mlazos/h-comp2 -> origin/mlazos/h-comp2 
2025-12-04T08:54:02.3377512Z * [new branch] mlazos/hash-hop -> origin/mlazos/hash-hop 2025-12-04T08:54:02.3377573Z * [new branch] mlazos/hc -> origin/mlazos/hc 2025-12-04T08:54:02.3377643Z * [new branch] mlazos/hc-cycles -> origin/mlazos/hc-cycles 2025-12-04T08:54:02.3377710Z * [new branch] mlazos/hc-fixes -> origin/mlazos/hc-fixes 2025-12-04T08:54:02.3377777Z * [new branch] mlazos/hc-fixes3 -> origin/mlazos/hc-fixes3 2025-12-04T08:54:02.3377846Z * [new branch] mlazos/hc-fixes4 -> origin/mlazos/hc-fixes4 2025-12-04T08:54:02.3377910Z * [new branch] mlazos/hc-hf -> origin/mlazos/hc-hf 2025-12-04T08:54:02.3377975Z * [new branch] mlazos/hc-mut -> origin/mlazos/hc-mut 2025-12-04T08:54:02.3378063Z * [new branch] mlazos/hc10 -> origin/mlazos/hc10 2025-12-04T08:54:02.3378125Z * [new branch] mlazos/hc11 -> origin/mlazos/hc11 2025-12-04T08:54:02.3378187Z * [new branch] mlazos/hc12 -> origin/mlazos/hc12 2025-12-04T08:54:02.3378249Z * [new branch] mlazos/hc13 -> origin/mlazos/hc13 2025-12-04T08:54:02.3378308Z * [new branch] mlazos/hc14 -> origin/mlazos/hc14 2025-12-04T08:54:02.3378369Z * [new branch] mlazos/hc15 -> origin/mlazos/hc15 2025-12-04T08:54:02.3378430Z * [new branch] mlazos/hc2 -> origin/mlazos/hc2 2025-12-04T08:54:02.3378492Z * [new branch] mlazos/hc4 -> origin/mlazos/hc4 2025-12-04T08:54:02.3378554Z * [new branch] mlazos/hc5 -> origin/mlazos/hc5 2025-12-04T08:54:02.3378614Z * [new branch] mlazos/hc6 -> origin/mlazos/hc6 2025-12-04T08:54:02.3378673Z * [new branch] mlazos/hc7 -> origin/mlazos/hc7 2025-12-04T08:54:02.3378733Z * [new branch] mlazos/hc8 -> origin/mlazos/hc8 2025-12-04T08:54:02.3378791Z * [new branch] mlazos/hc9 -> origin/mlazos/hc9 2025-12-04T08:54:02.3378896Z * [new branch] mlazos/hc_baseline2 -> origin/mlazos/hc_baseline2 2025-12-04T08:54:02.3378980Z * [new branch] mlazos/inductor-streams -> origin/mlazos/inductor-streams 2025-12-04T08:54:02.3379042Z * [new branch] mlazos/main -> origin/mlazos/main 2025-12-04T08:54:02.3379103Z * [new branch] 
mlazos/mcg2 -> origin/mlazos/mcg2 2025-12-04T08:54:02.3379176Z * [new branch] mlazos/meta-guards -> origin/mlazos/meta-guards 2025-12-04T08:54:02.3379280Z * [new branch] mlazos/mlazos/foreach-map-adam -> origin/mlazos/mlazos/foreach-map-adam 2025-12-04T08:54:02.3379380Z * [new branch] mlazos/mlazos/tf-mode-backup -> origin/mlazos/mlazos/tf-mode-backup 2025-12-04T08:54:02.3379449Z * [new branch] mlazos/mod-fix -> origin/mlazos/mod-fix 2025-12-04T08:54:02.3379515Z * [new branch] mlazos/mode-fix -> origin/mlazos/mode-fix 2025-12-04T08:54:02.3379581Z * [new branch] mlazos/offsets -> origin/mlazos/offsets 2025-12-04T08:54:02.3379655Z * [new branch] mlazos/overguarding -> origin/mlazos/overguarding 2025-12-04T08:54:02.3379729Z * [new branch] mlazos/proxy-ctors -> origin/mlazos/proxy-ctors 2025-12-04T08:54:02.3379797Z * [new branch] mlazos/quant-fix -> origin/mlazos/quant-fix 2025-12-04T08:54:02.3379868Z * [new branch] mlazos/resnet-fix -> origin/mlazos/resnet-fix 2025-12-04T08:54:02.3379941Z * [new branch] mlazos/rm-buf-names -> origin/mlazos/rm-buf-names 2025-12-04T08:54:02.3380009Z * [new branch] mlazos/rm-code -> origin/mlazos/rm-code 2025-12-04T08:54:02.3380075Z * [new branch] mlazos/rm-spam -> origin/mlazos/rm-spam 2025-12-04T08:54:02.3380191Z * [new branch] mlazos/rtp -> origin/mlazos/rtp 2025-12-04T08:54:02.3380271Z * [new branch] mlazos/static-idx-dbg -> origin/mlazos/static-idx-dbg 2025-12-04T08:54:02.3380357Z * [new branch] mlazos/static-inputs-log -> origin/mlazos/static-inputs-log 2025-12-04T08:54:02.3380421Z * [new branch] mlazos/stests -> origin/mlazos/stests 2025-12-04T08:54:02.3380492Z * [new branch] mlazos/stream-ops -> origin/mlazos/stream-ops 2025-12-04T08:54:02.3380556Z * [new branch] mlazos/td-fix2 -> origin/mlazos/td-fix2 2025-12-04T08:54:02.3380634Z * [new branch] mlazos/tensor-hasattr2 -> origin/mlazos/tensor-hasattr2 2025-12-04T08:54:02.3380737Z * [new branch] mlazos/test -> origin/mlazos/test 2025-12-04T08:54:02.3380802Z * [new branch] 
mlazos/tf-mode -> origin/mlazos/tf-mode 2025-12-04T08:54:02.3380879Z * [new branch] mlazos/tf-mode-backup2 -> origin/mlazos/tf-mode-backup2 2025-12-04T08:54:02.3380958Z * [new branch] mlazos/tf-mode-reland -> origin/mlazos/tf-mode-reland 2025-12-04T08:54:02.3381035Z * [new branch] mlazos/tf-mode-reland2 -> origin/mlazos/tf-mode-reland2 2025-12-04T08:54:02.3381111Z * [new branch] mlazos/tf-mode-reland3 -> origin/mlazos/tf-mode-reland3 2025-12-04T08:54:02.3381187Z * [new branch] mlazos/triton-no-epi -> origin/mlazos/triton-no-epi 2025-12-04T08:54:02.3381259Z * [new branch] mlazos/tune-proto -> origin/mlazos/tune-proto 2025-12-04T08:54:02.3381331Z * [new branch] mlazos/tuple-fixes -> origin/mlazos/tuple-fixes 2025-12-04T08:54:02.3381406Z * [new branch] mlazos/tuple-fixes2 -> origin/mlazos/tuple-fixes2 2025-12-04T08:54:02.3381481Z * [new branch] mlazos/tuple-handling -> origin/mlazos/tuple-handling 2025-12-04T08:54:02.3381561Z * [new branch] mlazos/user-stream-base -> origin/mlazos/user-stream-base 2025-12-04T08:54:02.3381672Z * [new branch] mlazos/user-streams -> origin/mlazos/user-streams 2025-12-04T08:54:02.3381764Z * [new branch] mlazos/user-streams-backup -> origin/mlazos/user-streams-backup 2025-12-04T08:54:02.3381858Z * [new branch] mlazos/user-streams-backup2 -> origin/mlazos/user-streams-backup2 2025-12-04T08:54:02.3381927Z * [new branch] mlazos/vary-beta -> origin/mlazos/vary-beta 2025-12-04T08:54:02.3381997Z * [new branch] mlazos/vary-beta2 -> origin/mlazos/vary-beta2 2025-12-04T08:54:02.3382072Z * [new branch] mlazos/weird-perf1 -> origin/mlazos/weird-perf1 2025-12-04T08:54:02.3382145Z * [new branch] mm_out_dtype_compile -> origin/mm_out_dtype_compile 2025-12-04T08:54:02.3382209Z * [new branch] module-shim -> origin/module-shim 2025-12-04T08:54:02.3382272Z * [new branch] move_config -> origin/move_config 2025-12-04T08:54:02.3382341Z * [new branch] msaroufim/reduce -> origin/msaroufim/reduce 2025-12-04T08:54:02.3382410Z * [new branch] mtia/basic-cmake -> 
origin/mtia/basic-cmake 2025-12-04T08:54:02.3382512Z * [new branch] mwizak/fix-triton-block-shape -> origin/mwizak/fix-triton-block-shape 2025-12-04T08:54:02.3382578Z * [new branch] my_varlen_backup -> origin/my_varlen_backup 2025-12-04T08:54:02.3382651Z * [new branch] nativert_num_outputs -> origin/nativert_num_outputs 2025-12-04T08:54:02.3382715Z * [new branch] new-codegen -> origin/new-codegen 2025-12-04T08:54:02.3382782Z * [new branch] newtest-base -> origin/newtest-base 2025-12-04T08:54:02.3382853Z * [new branch] ngimel/addmm_dtype -> origin/ngimel/addmm_dtype 2025-12-04T08:54:02.3382919Z * [new branch] ngimel/div_inv -> origin/ngimel/div_inv 2025-12-04T08:54:02.3382997Z * [new branch] ngimel/error_index_list -> origin/ngimel/error_index_list 2025-12-04T08:54:02.3383067Z * [new branch] ngimel/gather_grid -> origin/ngimel/gather_grid 2025-12-04T08:54:02.3383155Z * [new branch] ngimel/gather_grid_release -> origin/ngimel/gather_grid_release 2025-12-04T08:54:02.3383219Z * [new branch] ngimel/gg_new -> origin/ngimel/gg_new 2025-12-04T08:54:02.3383286Z * [new branch] ngimel/hostalloc -> origin/ngimel/hostalloc 2025-12-04T08:54:02.3383356Z * [new branch] ngimel/storage_id -> origin/ngimel/storage_id 2025-12-04T08:54:02.3383448Z * [new branch] nightly -> origin/nightly 2025-12-04T08:54:02.3383567Z * [new branch] nikitaved/addmm_1_rowcol_lt_path_check -> origin/nikitaved/addmm_1_rowcol_lt_path_check 2025-12-04T08:54:02.3383690Z * [new branch] nikitaved/addmm_epilogue_fusions_2d_bias -> origin/nikitaved/addmm_epilogue_fusions_2d_bias 2025-12-04T08:54:02.3383819Z * [new branch] nikitaved/addmm_epilogue_fusions_inductor -> origin/nikitaved/addmm_epilogue_fusions_inductor 2025-12-04T08:54:02.3383944Z * [new branch] nikitaved/addmm_epilogue_fusions_scratch -> origin/nikitaved/addmm_epilogue_fusions_scratch 2025-12-04T08:54:02.3384060Z * [new branch] nikitaved/grad_addmm_epilogue_fusions -> origin/nikitaved/grad_addmm_epilogue_fusions 2025-12-04T08:54:02.3384171Z * [new 
branch] nikitaved/simpler_can_use_32bit_index -> origin/nikitaved/simpler_can_use_32bit_index 2025-12-04T08:54:02.3384243Z * [new branch] nikitaved/test -> origin/nikitaved/test 2025-12-04T08:54:02.3384369Z * [new branch] nmacchioni-perf-test-async-autotune -> origin/nmacchioni-perf-test-async-autotune 2025-12-04T08:54:02.3384446Z * [new branch] no_distributed_log_spew -> origin/no_distributed_log_spew 2025-12-04T08:54:02.3384541Z * [new branch] nofun-hack -> origin/nofun-hack 2025-12-04T08:54:02.3384604Z * [new branch] norm_bench -> origin/norm_bench 2025-12-04T08:54:02.3384679Z * [new branch] nullplay/fuse_matmul -> origin/nullplay/fuse_matmul 2025-12-04T08:54:02.3384755Z * [new branch] nullplay_fuse_matmul -> origin/nullplay_fuse_matmul 2025-12-04T08:54:02.3384821Z * [new branch] optimizer_test -> origin/optimizer_test 2025-12-04T08:54:02.3384890Z * [new branch] orig/release/1.10 -> origin/orig/release/1.10 2025-12-04T08:54:02.3384960Z * [new branch] orig/release/1.11 -> origin/orig/release/1.11 2025-12-04T08:54:02.3385027Z * [new branch] orig/release/1.12 -> origin/orig/release/1.12 2025-12-04T08:54:02.3385094Z * [new branch] orig/release/1.13 -> origin/orig/release/1.13 2025-12-04T08:54:02.3385161Z * [new branch] orig/release/1.6 -> origin/orig/release/1.6 2025-12-04T08:54:02.3385227Z * [new branch] orig/release/1.7 -> origin/orig/release/1.7 2025-12-04T08:54:02.3385293Z * [new branch] orig/release/1.8 -> origin/orig/release/1.8 2025-12-04T08:54:02.3385358Z * [new branch] orig/release/1.9 -> origin/orig/release/1.9 2025-12-04T08:54:02.3385422Z * [new branch] orig/release/2.0 -> origin/orig/release/2.0 2025-12-04T08:54:02.3385487Z * [new branch] orig/release/2.1 -> origin/orig/release/2.1 2025-12-04T08:54:02.3385553Z * [new branch] orig/release/2.2 -> origin/orig/release/2.2 2025-12-04T08:54:02.3385617Z * [new branch] orig/release/2.3 -> origin/orig/release/2.3 2025-12-04T08:54:02.3385683Z * [new branch] orig/release/2.4 -> origin/orig/release/2.4 
2025-12-04T08:54:02.3385747Z * [new branch] orig/release/2.5 -> origin/orig/release/2.5 2025-12-04T08:54:02.3385812Z * [new branch] orig/release/2.6 -> origin/orig/release/2.6 2025-12-04T08:54:02.3385878Z * [new branch] orig/release/2.7 -> origin/orig/release/2.7 2025-12-04T08:54:02.3385943Z * [new branch] orig/release/2.8 -> origin/orig/release/2.8 2025-12-04T08:54:02.3386007Z * [new branch] orig/release/2.9 -> origin/orig/release/2.9 2025-12-04T08:54:02.3386093Z * [new branch] origin/gh/fxdawnn/1/base -> origin/origin/gh/fxdawnn/1/base 2025-12-04T08:54:02.3386202Z * [new branch] origin/gh/fxdawnn/1/orig -> origin/origin/gh/fxdawnn/1/orig 2025-12-04T08:54:02.3386283Z * [new branch] origin/gh/zpcore/14/orig -> origin/origin/gh/zpcore/14/orig 2025-12-04T08:54:02.3386353Z * [new branch] oulgen-patch-1 -> origin/oulgen-patch-1 2025-12-04T08:54:02.3386420Z * [new branch] oulgen-patch-2 -> origin/oulgen-patch-2 2025-12-04T08:54:02.3386488Z * [new branch] oulgen-patch-3 -> origin/oulgen-patch-3 2025-12-04T08:54:02.3386554Z * [new branch] oulgen-patch-4 -> origin/oulgen-patch-4 2025-12-04T08:54:02.3386620Z * [new branch] padded-tensor -> origin/padded-tensor 2025-12-04T08:54:02.3386683Z * [new branch] pca2 -> origin/pca2 2025-12-04T08:54:02.3386756Z * [new branch] per_channel_backup -> origin/per_channel_backup 2025-12-04T08:54:02.3386819Z * [new branch] perf_ops -> origin/perf_ops 2025-12-04T08:54:02.3386886Z * [new branch] perf_ops_2_9 -> origin/perf_ops_2_9 2025-12-04T08:54:02.3386957Z * [new branch] pianpwk-patch-1 -> origin/pianpwk-patch-1 2025-12-04T08:54:02.3387044Z * [new branch] pianpwk/__draft_debug_mode -> origin/pianpwk/__draft_debug_mode 2025-12-04T08:54:02.3387178Z * [new branch] pianpwk/_debug_mode_for_triton_draft -> origin/pianpwk/_debug_mode_for_triton_draft 2025-12-04T08:54:02.3387280Z * [new branch] pianpwk/_debug_nn_module_compile -> origin/pianpwk/_debug_nn_module_compile 2025-12-04T08:54:02.3387365Z * [new branch] pianpwk/_draft_triton_11_3 -> 
origin/pianpwk/_draft_triton_11_3 2025-12-04T08:54:02.3387459Z * [new branch] pianpwk/_manual_bucket_draft -> origin/pianpwk/_manual_bucket_draft 2025-12-04T08:54:02.3387560Z * [new branch] pianpwk/_profile_w_dispatch_keys -> origin/pianpwk/_profile_w_dispatch_keys 2025-12-04T08:54:02.3387659Z * [new branch] pianpwk/_super_draft_debug_mode -> origin/pianpwk/_super_draft_debug_mode 2025-12-04T08:54:02.3387766Z * [new branch] pianpwk/_unbacked_local_shard_size -> origin/pianpwk/_unbacked_local_shard_size 2025-12-04T08:54:02.3387841Z * [new branch] pianpwk/anomaly_tb -> origin/pianpwk/anomaly_tb 2025-12-04T08:54:02.3387924Z * [new branch] pianpwk/auto_fx_annotate -> origin/pianpwk/auto_fx_annotate 2025-12-04T08:54:02.3388038Z * [new branch] pianpwk/backed_size_oblivious_export -> origin/pianpwk/backed_size_oblivious_export 2025-12-04T08:54:02.3388124Z * [new branch] pianpwk/bert_dynamic_perf -> origin/pianpwk/bert_dynamic_perf 2025-12-04T08:54:02.3388223Z * [new branch] pianpwk/debug_fwd_stack_traces -> origin/pianpwk/debug_fwd_stack_traces 2025-12-04T08:54:02.3388307Z * [new branch] pianpwk/debug_hash_tensor -> origin/pianpwk/debug_hash_tensor 2025-12-04T08:54:02.3388399Z * [new branch] pianpwk/debug_mode_annotate -> origin/pianpwk/debug_mode_annotate 2025-12-04T08:54:02.3388490Z * [new branch] pianpwk/debug_mode_defaults -> origin/pianpwk/debug_mode_defaults 2025-12-04T08:54:02.3388572Z * [new branch] pianpwk/debug_mode_hacks -> origin/pianpwk/debug_mode_hacks 2025-12-04T08:54:02.3388679Z * [new branch] pianpwk/debug_mode_opcall_refactor -> origin/pianpwk/debug_mode_opcall_refactor 2025-12-04T08:54:02.3388768Z * [new branch] pianpwk/debug_mode_show_ids -> origin/pianpwk/debug_mode_show_ids 2025-12-04T08:54:02.3388851Z * [new branch] pianpwk/debug_mode_triton -> origin/pianpwk/debug_mode_triton 2025-12-04T08:54:02.3388947Z * [new branch] pianpwk/debug_show_stack_trace -> origin/pianpwk/debug_show_stack_trace 2025-12-04T08:54:02.3389049Z * [new branch] 
pianpwk/debug_wait_on_collective -> origin/pianpwk/debug_wait_on_collective 2025-12-04T08:54:02.3389174Z * [new branch] pianpwk/debugmode_compile_tf -> origin/pianpwk/debugmode_compile_tf 2025-12-04T08:54:02.3389298Z * [new branch] pianpwk/dispatch_key_debugging_for_debug -> origin/pianpwk/dispatch_key_debugging_for_debug 2025-12-04T08:54:02.3389405Z * [new branch] pianpwk/draft_debug_mode_tfcompile -> origin/pianpwk/draft_debug_mode_tfcompile 2025-12-04T08:54:02.3389503Z * [new branch] pianpwk/draft_multikernel_nn -> origin/pianpwk/draft_multikernel_nn 2025-12-04T08:54:02.3389621Z * [new branch] pianpwk/draft_multikernel_status_10_5 -> origin/pianpwk/draft_multikernel_status_10_5 2025-12-04T08:54:02.3389713Z * [new branch] pianpwk/dtensor_custom_chunk -> origin/pianpwk/dtensor_custom_chunk 2025-12-04T08:54:02.3389817Z * [new branch] pianpwk/dtensor_unbacked_keypath -> origin/pianpwk/dtensor_unbacked_keypath 2025-12-04T08:54:02.3389901Z * [new branch] pianpwk/event_list_tree -> origin/pianpwk/event_list_tree 2025-12-04T08:54:02.3389983Z * [new branch] pianpwk/false_numel_refs -> origin/pianpwk/false_numel_refs 2025-12-04T08:54:02.3390061Z * [new branch] pianpwk/maybe_guard_rel -> origin/pianpwk/maybe_guard_rel 2025-12-04T08:54:02.3390253Z * [new branch] pianpwk/multikernel_hints_draft -> origin/pianpwk/multikernel_hints_draft 2025-12-04T08:54:02.3390364Z * [new branch] pianpwk/no_size_oblivious_slice_scat -> origin/pianpwk/no_size_oblivious_slice_scat 2025-12-04T08:54:02.3390479Z * [new branch] pianpwk/oblivious_reshape_view_better -> origin/pianpwk/oblivious_reshape_view_better 2025-12-04T08:54:02.3390566Z * [new branch] pianpwk/pre_forward_hook -> origin/pianpwk/pre_forward_hook 2025-12-04T08:54:02.3390672Z * [new branch] pianpwk/skip_python_keys_alternate -> origin/pianpwk/skip_python_keys_alternate 2025-12-04T08:54:02.3390778Z * [new branch] pianpwk/skip_python_keys_in_guards -> origin/pianpwk/skip_python_keys_in_guards 2025-12-04T08:54:02.3390862Z * [new 
branch] pianpwk/sym_tokens_draft -> origin/pianpwk/sym_tokens_draft 2025-12-04T08:54:02.3390942Z * [new branch] pianpwk/symint_one_hot -> origin/pianpwk/symint_one_hot 2025-12-04T08:54:02.3391059Z * [new branch] pianpwk/test_pointwise_guard_or_false -> origin/pianpwk/test_pointwise_guard_or_false 2025-12-04T08:54:02.3391157Z * [new branch] pianpwk/totally_draft_sym_wrap -> origin/pianpwk/totally_draft_sym_wrap 2025-12-04T08:54:02.3391238Z * [new branch] pianpwk/try_dumb_stuff -> origin/pianpwk/try_dumb_stuff 2025-12-04T08:54:02.3391320Z * [new branch] pianpwk/try_dumb_stuff_2 -> origin/pianpwk/try_dumb_stuff_2 2025-12-04T08:54:02.3391411Z * [new branch] pianpwk/unbacked_dtensor_mm -> origin/pianpwk/unbacked_dtensor_mm 2025-12-04T08:54:02.3391508Z * [new branch] pianpwk/unbacked_tracing_12_2 -> origin/pianpwk/unbacked_tracing_12_2 2025-12-04T08:54:02.3391584Z * [new branch] pianpwk/user_symints -> origin/pianpwk/user_symints 2025-12-04T08:54:02.3391662Z * [new branch] pianpwk/wan21_reshape -> origin/pianpwk/wan21_reshape 2025-12-04T08:54:02.3391757Z * [new branch] piz/fix_partial_backward_1112 -> origin/piz/fix_partial_backward_1112 2025-12-04T08:54:02.3391833Z * [new branch] piz/prop_cache_clean -> origin/piz/prop_cache_clean 2025-12-04T08:54:02.3391901Z * [new branch] pool-separate -> origin/pool-separate 2025-12-04T08:54:02.3391965Z * [new branch] pr-156087 -> origin/pr-156087 2025-12-04T08:54:02.3392027Z * [new branch] pr/131860 -> origin/pr/131860 2025-12-04T08:54:02.3392096Z * [new branch] predispatch_to -> origin/predispatch_to 2025-12-04T08:54:02.3392200Z * [new branch] protect-c17 -> origin/protect-c17 2025-12-04T08:54:02.3392268Z * [new branch] pt-opt-cuda3 -> origin/pt-opt-cuda3 2025-12-04T08:54:02.3392349Z * [new branch] python_compiled_autograd -> origin/python_compiled_autograd 2025-12-04T08:54:02.3392480Z * [new branch] q1l1/fix_device_moved_constant_type_unknown -> origin/q1l1/fix_device_moved_constant_type_unknown 2025-12-04T08:54:02.3392620Z * [new 
branch] q1l1/fix_wrong_default_type_for_kernel_call_args -> origin/q1l1/fix_wrong_default_type_for_kernel_call_args 2025-12-04T08:54:02.3392700Z * [new branch] qchip/export-D54134695 -> origin/qchip/export-D54134695 2025-12-04T08:54:02.3392776Z * [new branch] quote-pytest_cache -> origin/quote-pytest_cache 2025-12-04T08:54:02.3392874Z * [new branch] reland-accgrad-stream-warn -> origin/reland-accgrad-stream-warn 2025-12-04T08:54:02.3392940Z * [new branch] release/1.10 -> origin/release/1.10 2025-12-04T08:54:02.3393005Z * [new branch] release/1.11 -> origin/release/1.11 2025-12-04T08:54:02.3393067Z * [new branch] release/1.12 -> origin/release/1.12 2025-12-04T08:54:02.3393128Z * [new branch] release/1.13 -> origin/release/1.13 2025-12-04T08:54:02.3393225Z * [new branch] release/1.4 -> origin/release/1.4 2025-12-04T08:54:02.3393290Z * [new branch] release/1.4.1 -> origin/release/1.4.1 2025-12-04T08:54:02.3393351Z * [new branch] release/1.5 -> origin/release/1.5 2025-12-04T08:54:02.3393414Z * [new branch] release/1.6 -> origin/release/1.6 2025-12-04T08:54:02.3393474Z * [new branch] release/1.7 -> origin/release/1.7 2025-12-04T08:54:02.3393536Z * [new branch] release/1.8 -> origin/release/1.8 2025-12-04T08:54:02.3393600Z * [new branch] release/1.9 -> origin/release/1.9 2025-12-04T08:54:02.3393660Z * [new branch] release/2.0 -> origin/release/2.0 2025-12-04T08:54:02.3393721Z * [new branch] release/2.1 -> origin/release/2.1 2025-12-04T08:54:02.3393781Z * [new branch] release/2.2 -> origin/release/2.2 2025-12-04T08:54:02.3393843Z * [new branch] release/2.3 -> origin/release/2.3 2025-12-04T08:54:02.3393904Z * [new branch] release/2.4 -> origin/release/2.4 2025-12-04T08:54:02.3393966Z * [new branch] release/2.5 -> origin/release/2.5 2025-12-04T08:54:02.3394025Z * [new branch] release/2.6 -> origin/release/2.6 2025-12-04T08:54:02.3394086Z * [new branch] release/2.7 -> origin/release/2.7 2025-12-04T08:54:02.3394148Z * [new branch] release/2.8 -> origin/release/2.8 
2025-12-04T08:54:02.3394209Z * [new branch] release/2.9 -> origin/release/2.9 2025-12-04T08:54:02.3394273Z * [new branch] release_notes -> origin/release_notes 2025-12-04T08:54:02.3394351Z * [new branch] remove_pyinterpreter -> origin/remove_pyinterpreter 2025-12-04T08:54:02.3394478Z * [new branch] replace-pytorch-labs-20250812-195836 -> origin/replace-pytorch-labs-20250812-195836 2025-12-04T08:54:02.3394600Z * [new branch] replace-pytorch-labs-20250812-200248 -> origin/replace-pytorch-labs-20250812-200248 2025-12-04T08:54:02.3394718Z * [new branch] replace-pytorch-labs-20250812-200324 -> origin/replace-pytorch-labs-20250812-200324 2025-12-04T08:54:02.3394837Z * [new branch] replace-pytorch-labs-20250812-204020 -> origin/replace-pytorch-labs-20250812-204020 2025-12-04T08:54:02.3394967Z * [new branch] revert-131069-gh/krzysztofjordan/1/head -> origin/revert-131069-gh/krzysztofjordan/1/head 2025-12-04T08:54:02.3395106Z * [new branch] revert-131469-gh/andrewor14/51/head -> origin/revert-131469-gh/andrewor14/51/head 2025-12-04T08:54:02.3395208Z * [new branch] revert-152361-gh/fadara01/1/head -> origin/revert-152361-gh/fadara01/1/head 2025-12-04T08:54:02.3395313Z * [new branch] revert-156870-gh/skarjala/3/head -> origin/revert-156870-gh/skarjala/3/head 2025-12-04T08:54:02.3395485Z * [new branch] revert-157914-cherry-pick-157503-by-pytorch_bot_bot_ -> origin/revert-157914-cherry-pick-157503-by-pytorch_bot_bot_ 2025-12-04T08:54:02.3395581Z * [new branch] revert-hoo-invoke-subgraph -> origin/revert-hoo-invoke-subgraph 2025-12-04T08:54:02.3395682Z * [new branch] revert_always_build_distributed -> origin/revert_always_build_distributed 2025-12-04T08:54:02.3395751Z * [new branch] rms_norm_patch -> origin/rms_norm_patch 2025-12-04T08:54:02.3395848Z * [new branch] ruisi/fix_all_to_all_estimation -> origin/ruisi/fix_all_to_all_estimation 2025-12-04T08:54:02.3395934Z * [new branch] ruisi/fix_comm_estimation -> origin/ruisi/fix_comm_estimation 2025-12-04T08:54:02.3396039Z * [new 
branch] ruisi/fix_dynamic_shape_estimation -> origin/ruisi/fix_dynamic_shape_estimation 2025-12-04T08:54:02.3396162Z * [new branch] ruisi/fix_llama3_autobucketing -> origin/ruisi/fix_llama3_autobucketing 2025-12-04T08:54:02.3396267Z * [new branch] ruisi/fix_manual_bucketing_ep_pass -> origin/ruisi/fix_manual_bucketing_ep_pass 2025-12-04T08:54:02.3396352Z * [new branch] ruisi/manual_bucket_pass -> origin/ruisi/manual_bucket_pass 2025-12-04T08:54:02.3396500Z * [new branch] ryanguo99/cleanup-dynamo-expected-failures -> origin/ryanguo99/cleanup-dynamo-expected-failures 2025-12-04T08:54:02.3396586Z * [new branch] ryanguo99/fix-closure-var -> origin/ryanguo99/fix-closure-var 2025-12-04T08:54:02.3396664Z * [new branch] rzou/faketensor_bench -> origin/rzou/faketensor_bench 2025-12-04T08:54:02.3396729Z * [new branch] rzou/njt -> origin/rzou/njt 2025-12-04T08:54:02.3396791Z * [new branch] rzou/pca -> origin/rzou/pca 2025-12-04T08:54:02.3396859Z * [new branch] rzou/realprop -> origin/rzou/realprop 2025-12-04T08:54:02.3396925Z * [new branch] samplevllm -> origin/samplevllm 2025-12-04T08:54:02.3397092Z * [new branch] sanchitintel/weird_thing_with_test_cpu_select_algorithm -> origin/sanchitintel/weird_thing_with_test_cpu_select_algorithm 2025-12-04T08:54:02.3397185Z * [new branch] sapling-pr-archive-SS-JIA -> origin/sapling-pr-archive-SS-JIA 2025-12-04T08:54:02.3397300Z * [new branch] sapling-pr-archive-tushar00jain -> origin/sapling-pr-archive-tushar00jain 2025-12-04T08:54:02.3397361Z * [new branch] save -> origin/save 2025-12-04T08:54:02.3397424Z * [new branch] scaled_mm -> origin/scaled_mm 2025-12-04T08:54:02.3397487Z * [new branch] scan_attempt -> origin/scan_attempt 2025-12-04T08:54:02.3397548Z * [new branch] sdym/2.5.1 -> origin/sdym/2.5.1 2025-12-04T08:54:02.3397659Z * [new branch] sekyondaMeta-dynamoconfig-fix -> origin/sekyondaMeta-dynamoconfig-fix 2025-12-04T08:54:02.3397736Z * [new branch] shengf/fx-xform-perf -> origin/shengf/fx-xform-perf 
2025-12-04T08:54:02.3397814Z * [new branch] shoumikhin-patch-1 -> origin/shoumikhin-patch-1 2025-12-04T08:54:02.3397890Z * [new branch] solve-accuracy-fix -> origin/solve-accuracy-fix 2025-12-04T08:54:02.3397969Z * [new branch] some_rocm_inductor_skips -> origin/some_rocm_inductor_skips 2025-12-04T08:54:02.3398079Z * [new branch] soulitzer/stash-tls-ac -> origin/soulitzer/stash-tls-ac 2025-12-04T08:54:02.3398163Z * [new branch] sparse-mm-bf16-support -> origin/sparse-mm-bf16-support 2025-12-04T08:54:02.3398236Z * [new branch] starterTaskUpdate -> origin/starterTaskUpdate 2025-12-04T08:54:02.3398295Z * [new branch] suo -> origin/suo 2025-12-04T08:54:02.3398359Z * [new branch] sve-poc -> origin/sve-poc 2025-12-04T08:54:02.3398421Z * [new branch] switch-bn -> origin/switch-bn 2025-12-04T08:54:02.3398513Z * [new branch] sy_annotation_in_autograd_hop -> origin/sy_annotation_in_autograd_hop 2025-12-04T08:54:02.3398584Z * [new branch] sy_aot_eager_record -> origin/sy_aot_eager_record 2025-12-04T08:54:02.3398653Z * [new branch] sy_custom_bucketing -> origin/sy_custom_bucketing 2025-12-04T08:54:02.3398721Z * [new branch] sy_debug_mode_test -> origin/sy_debug_mode_test 2025-12-04T08:54:02.3398788Z * [new branch] sy_deserialize -> origin/sy_deserialize 2025-12-04T08:54:02.3398853Z * [new branch] sy_dump_gm_code -> origin/sy_dump_gm_code 2025-12-04T08:54:02.3398914Z * [new branch] sy_exp -> origin/sy_exp 2025-12-04T08:54:02.3399010Z * [new branch] sy_export_annotation -> origin/sy_export_annotation 2025-12-04T08:54:02.3399078Z * [new branch] sy_invoke_subgraph -> origin/sy_invoke_subgraph 2025-12-04T08:54:02.3399146Z * [new branch] sy_kernel_bw_name -> origin/sy_kernel_bw_name 2025-12-04T08:54:02.3399209Z * [new branch] sy_multi_arch -> origin/sy_multi_arch 2025-12-04T08:54:02.3399276Z * [new branch] sy_nn_module_stack -> origin/sy_nn_module_stack 2025-12-04T08:54:02.3399347Z * [new branch] sy_original_dtensor -> origin/sy_original_dtensor 2025-12-04T08:54:02.3399414Z * [new 
branch] sy_profiler_cia -> origin/sy_profiler_cia 2025-12-04T08:54:02.3399478Z * [new branch] symm_mem_sync -> origin/symm_mem_sync 2025-12-04T08:54:02.3399563Z * [new branch] sympy-bottleneck-repro -> origin/sympy-bottleneck-repro 2025-12-04T08:54:02.3399642Z * [new branch] tensordict_integration -> origin/tensordict_integration 2025-12-04T08:54:02.3399722Z * [new branch] test-move-conda-builds -> origin/test-move-conda-builds 2025-12-04T08:54:02.3399785Z * [new branch] test-old -> origin/test-old 2025-12-04T08:54:02.3399847Z * [new branch] test/bmm_heur -> origin/test/bmm_heur 2025-12-04T08:54:02.3399945Z * [new branch] tianren/customOp_autotune_fix -> origin/tianren/customOp_autotune_fix 2025-12-04T08:54:02.3400058Z * [new branch] tianren/customOp_enable_max_autotune -> origin/tianren/customOp_enable_max_autotune 2025-12-04T08:54:02.3400174Z * [new branch] tianren/customOp_fusion -> origin/tianren/customOp_fusion 2025-12-04T08:54:02.3403265Z * [new branch] tianren/customop_collectiveop_benchmark -> origin/tianren/customop_collectiveop_benchmark 2025-12-04T08:54:02.3403415Z * [new branch] tianren/customop_collectiveop_benchmark_fix -> origin/tianren/customop_collectiveop_benchmark_fix 2025-12-04T08:54:02.3403521Z * [new branch] tianren/customop_dynamic_config -> origin/tianren/customop_dynamic_config 2025-12-04T08:54:02.3403616Z * [new branch] tianren/dynamic_range_input -> origin/tianren/dynamic_range_input 2025-12-04T08:54:02.3403721Z * [new branch] tianren/dynamic_range_input_fix -> origin/tianren/dynamic_range_input_fix 2025-12-04T08:54:02.3403825Z * [new branch] tianren/dynamic_range_input_merge -> origin/tianren/dynamic_range_input_merge 2025-12-04T08:54:02.3403989Z * [new branch] tianren/flex_paged_attn_fix_temp -> origin/tianren/flex_paged_attn_fix_temp 2025-12-04T08:54:02.3404073Z * [new branch] tianren/fx_codegen_dump -> origin/tianren/fx_codegen_dump 2025-12-04T08:54:02.3404159Z * [new branch] tianren/symmetric_memory -> origin/tianren/symmetric_memory 
2025-12-04T08:54:02.3404228Z * [new branch] tianren/test -> origin/tianren/test 2025-12-04T08:54:02.3404304Z * [new branch] tidy_performance_cyy -> origin/tidy_performance_cyy 2025-12-04T08:54:02.3404365Z * [new branch] tmp -> origin/tmp 2025-12-04T08:54:02.3404431Z * [new branch] torchtitan_ep -> origin/torchtitan_ep 2025-12-04T08:54:02.3404511Z * [new branch] torchtitan_integration -> origin/torchtitan_integration 2025-12-04T08:54:02.3404594Z * [new branch] trace_fsdp_torchtune_lora -> origin/trace_fsdp_torchtune_lora 2025-12-04T08:54:02.3404678Z * [new branch] traceable_fsdp_unit_tests -> origin/traceable_fsdp_unit_tests 2025-12-04T08:54:02.3404755Z * [new branch] tree_loop_vec_base -> origin/tree_loop_vec_base 2025-12-04T08:54:02.3404820Z * [new branch] triton_kernel -> origin/triton_kernel 2025-12-04T08:54:02.3404923Z * [new branch] tt_pkg_1908 -> origin/tt_pkg_1908 2025-12-04T08:54:02.3404987Z * [new branch] type_dec -> origin/type_dec 2025-12-04T08:54:02.3405081Z * [new branch] udate-sphinx-dependancies -> origin/udate-sphinx-dependancies 2025-12-04T08:54:02.3405221Z * [new branch] update-audio-commit-hash/17630256502-1803-1 -> origin/update-audio-commit-hash/17630256502-1803-1 2025-12-04T08:54:02.3405356Z * [new branch] update-audio-commit-hash/19087141161-1916-1 -> origin/update-audio-commit-hash/19087141161-1916-1 2025-12-04T08:54:02.3405489Z * [new branch] update-audio-commit-hash/19250643381-1929-1 -> origin/update-audio-commit-hash/19250643381-1929-1 2025-12-04T08:54:02.3405622Z * [new branch] update-audio-commit-hash/19397724337-1935-1 -> origin/update-audio-commit-hash/19397724337-1935-1 2025-12-04T08:54:02.3405754Z * [new branch] update-audio-commit-hash/19555670148-1941-1 -> origin/update-audio-commit-hash/19555670148-1941-1 2025-12-04T08:54:02.3405884Z * [new branch] update-audio-commit-hash/19750627930-1946-1 -> origin/update-audio-commit-hash/19750627930-1946-1 2025-12-04T08:54:02.3406022Z * [new branch] 
update-triton-commit-hash/13663274526-1487-2 -> origin/update-triton-commit-hash/13663274526-1487-2 2025-12-04T08:54:02.3406157Z * [new branch] update-vision-commit-hash/19087141161-1916-1 -> origin/update-vision-commit-hash/19087141161-1916-1 2025-12-04T08:54:02.3406288Z * [new branch] update-vision-commit-hash/19184897099-1925-1 -> origin/update-vision-commit-hash/19184897099-1925-1 2025-12-04T08:54:02.3406425Z * [new branch] update-vision-commit-hash/19250643381-1929-1 -> origin/update-vision-commit-hash/19250643381-1929-1 2025-12-04T08:54:02.3406558Z * [new branch] update-vision-commit-hash/19381328640-1934-1 -> origin/update-vision-commit-hash/19381328640-1934-1 2025-12-04T08:54:02.3406694Z * [new branch] update-vision-commit-hash/19485237164-1938-1 -> origin/update-vision-commit-hash/19485237164-1938-1 2025-12-04T08:54:02.3406825Z * [new branch] update-vllm-commit-hash/18451675449-1879-1 -> origin/update-vllm-commit-hash/18451675449-1879-1 2025-12-04T08:54:02.3406909Z * [new branch] update-vllm-dockerfile -> origin/update-vllm-dockerfile 2025-12-04T08:54:02.3407034Z * [new branch] update-xla-commit-hash/19224287370-211-1 -> origin/update-xla-commit-hash/19224287370-211-1 2025-12-04T08:54:02.3407183Z * [new branch] update-xla-commit-hash/19422028566-212-1 -> origin/update-xla-commit-hash/19422028566-212-1 2025-12-04T08:54:02.3407304Z * [new branch] update-xla-commit-hash/19626841311-213-1 -> origin/update-xla-commit-hash/19626841311-213-1 2025-12-04T08:54:02.3407432Z * [new branch] update_docs_torch_multinomial_issue#125388 -> origin/update_docs_torch_multinomial_issue#125388 2025-12-04T08:54:02.3407511Z * [new branch] update_operator_readme -> origin/update_operator_readme 2025-12-04T08:54:02.3407600Z * [new branch] update_slow_tests_1722488736 -> origin/update_slow_tests_1722488736 2025-12-04T08:54:02.3407688Z * [new branch] update_slow_tests_1722879173 -> origin/update_slow_tests_1722879173 2025-12-04T08:54:02.3407774Z * [new branch] 
update_slow_tests_1762155677 -> origin/update_slow_tests_1762155677 2025-12-04T08:54:02.3407859Z * [new branch] update_slow_tests_1763365283 -> origin/update_slow_tests_1763365283 2025-12-04T08:54:02.3407954Z * [new branch] update_submodule_FBGEMM -> origin/update_submodule_FBGEMM 2025-12-04T08:54:02.3408032Z * [new branch] update_submodule_kineto -> origin/update_submodule_kineto 2025-12-04T08:54:02.3408126Z * [new branch] update_submodule_tensorpipe -> origin/update_submodule_tensorpipe 2025-12-04T08:54:02.3408257Z * [new branch] upload-tests-for-autorevert -> origin/upload-tests-for-autorevert 2025-12-04T08:54:02.3408319Z * [new branch] v0.1.2 -> origin/v0.1.2 2025-12-04T08:54:02.3408383Z * [new branch] v1.0.1 -> origin/v1.0.1 2025-12-04T08:54:02.3408442Z * [new branch] v1.0.3 -> origin/v1.0.3 2025-12-04T08:54:02.3408498Z * [new branch] v1.1.0 -> origin/v1.1.0 2025-12-04T08:54:02.3408555Z * [new branch] v1.2.0 -> origin/v1.2.0 2025-12-04T08:54:02.3408615Z * [new branch] v1.3.0 -> origin/v1.3.0 2025-12-04T08:54:02.3408672Z * [new branch] v1.3.1 -> origin/v1.3.1 2025-12-04T08:54:02.3408739Z * [new branch] validate_fn -> origin/validate_fn 2025-12-04T08:54:02.3408807Z * [new branch] validations_2.6 -> origin/validations_2.6 2025-12-04T08:54:02.3408877Z * [new branch] validations_2.8 -> origin/validations_2.8 2025-12-04T08:54:02.3408945Z * [new branch] varlen-api -> origin/varlen-api 2025-12-04T08:54:02.3409021Z * [new branch] varlen-api-backup -> origin/varlen-api-backup 2025-12-04T08:54:02.3409100Z * [new branch] varlen_batch_invariance -> origin/varlen_batch_invariance 2025-12-04T08:54:02.3409165Z * [new branch] viable/strict -> origin/viable/strict 2025-12-04T08:54:02.3409283Z * [new branch] vishal9-team/dtensor_parallelism_toy -> origin/vishal9-team/dtensor_parallelism_toy 2025-12-04T08:54:02.3409349Z * [new branch] vllmbuildci -> origin/vllmbuildci 2025-12-04T08:54:02.3409413Z * [new branch] vllmpin -> origin/vllmpin 2025-12-04T08:54:02.3409502Z * [new branch] 
vscode-recommend-pyrefly -> origin/vscode-recommend-pyrefly 2025-12-04T08:54:02.3409571Z * [new branch] wdvr-patch-1 -> origin/wdvr-patch-1 2025-12-04T08:54:02.3409637Z * [new branch] wdvr/iss_145259 -> origin/wdvr/iss_145259 2025-12-04T08:54:02.3409698Z * [new branch] whc/pei -> origin/whc/pei 2025-12-04T08:54:02.3409765Z * [new branch] whc/pp_fix -> origin/whc/pp_fix 2025-12-04T08:54:02.3409831Z * [new branch] whc/sharding -> origin/whc/sharding 2025-12-04T08:54:02.3409895Z * [new branch] whc/sharding2 -> origin/whc/sharding2 2025-12-04T08:54:02.3409985Z * [new branch] whc/uneven -> origin/whc/uneven 2025-12-04T08:54:02.3410057Z * [new branch] whc/uneven-merge -> origin/whc/uneven-merge 2025-12-04T08:54:02.3410171Z * [new branch] win_warnings -> origin/win_warnings 2025-12-04T08:54:02.3410250Z * [new branch] windows_libtorch_free -> origin/windows_libtorch_free 2025-12-04T08:54:02.3410314Z * [new branch] xmfan-war -> origin/xmfan-war 2025-12-04T08:54:02.3410377Z * [new branch] xmfan/ca_0516 -> origin/xmfan/ca_0516 2025-12-04T08:54:02.3410446Z * [new branch] xmfan/ca_1051b93192 -> origin/xmfan/ca_1051b93192 2025-12-04T08:54:02.3410599Z * [new branch] xmfan/ca_1a722f62c248391fc4a542e8851a5559aa356ae8 -> origin/xmfan/ca_1a722f62c248391fc4a542e8851a5559aa356ae8 2025-12-04T08:54:02.3410669Z * [new branch] xmfan/ca_5a2be192d1 -> origin/xmfan/ca_5a2be192d1 2025-12-04T08:54:02.3410738Z * [new branch] xmfan/ca_9d59b516e9 -> origin/xmfan/ca_9d59b516e9 2025-12-04T08:54:02.3410804Z * [new branch] xmfan/ca_apr8 -> origin/xmfan/ca_apr8 2025-12-04T08:54:02.3410867Z * [new branch] xmfan/ca_base -> origin/xmfan/ca_base 2025-12-04T08:54:02.3410976Z * [new branch] xmfan/ca_dynamic -> origin/xmfan/ca_dynamic 2025-12-04T08:54:02.3411043Z * [new branch] xmfan/ca_fix_dyn -> origin/xmfan/ca_fix_dyn 2025-12-04T08:54:02.3411116Z * [new branch] xmfan/ca_fix_lowering -> origin/xmfan/ca_fix_lowering 2025-12-04T08:54:02.3411192Z * [new branch] xmfan/ca_fix_polyfills -> 
origin/xmfan/ca_fix_polyfills 2025-12-04T08:54:02.3411255Z * [new branch] xmfan/ca_jan3 -> origin/xmfan/ca_jan3 2025-12-04T08:54:02.3411319Z * [new branch] xmfan/ca_jun18 -> origin/xmfan/ca_jun18 2025-12-04T08:54:02.3411385Z * [new branch] xmfan/ca_jun24 -> origin/xmfan/ca_jun24 2025-12-04T08:54:02.3411451Z * [new branch] xmfan/ca_nested -> origin/xmfan/ca_nested 2025-12-04T08:54:02.3411518Z * [new branch] xmfan/ca_overhead -> origin/xmfan/ca_overhead 2025-12-04T08:54:02.3411612Z * [new branch] xmfan/ca_overhead_0eba7e5451 -> origin/xmfan/ca_overhead_0eba7e5451 2025-12-04T08:54:02.3411680Z * [new branch] xmfan/cacu_jun18 -> origin/xmfan/cacu_jun18 2025-12-04T08:54:02.3411747Z * [new branch] xmfan/cacu_jun19 -> origin/xmfan/cacu_jun19 2025-12-04T08:54:02.3411814Z * [new branch] xmfan/cacu_jun4 -> origin/xmfan/cacu_jun4 2025-12-04T08:54:02.3411895Z * [new branch] xmfan/disable_duck_shape -> origin/xmfan/disable_duck_shape 2025-12-04T08:54:02.3411993Z * [new branch] xmfan/fca_cpp_node_passthrough -> origin/xmfan/fca_cpp_node_passthrough 2025-12-04T08:54:02.3412150Z * [new branch] xmfan/post_3945954741e2d37023c5d6954f9483008e0892f9 -> origin/xmfan/post_3945954741e2d37023c5d6954f9483008e0892f9 2025-12-04T08:54:02.3412295Z * [new branch] xmfan/pre_3945954741e2d37023c5d6954f9483008e0892f9 -> origin/xmfan/pre_3945954741e2d37023c5d6954f9483008e0892f9 2025-12-04T08:54:02.3412367Z * [new branch] xmfan/single_step -> origin/xmfan/single_step 2025-12-04T08:54:02.3412431Z * [new branch] xmfan/sth_0829 -> origin/xmfan/sth_0829 2025-12-04T08:54:02.3412492Z * [new branch] xmfan/test -> origin/xmfan/test 2025-12-04T08:54:02.3412581Z * [new branch] yguo/debug-0226-constexpr -> origin/yguo/debug-0226-constexpr 2025-12-04T08:54:02.3412658Z * [new branch] yguo/new_latest_changes -> origin/yguo/new_latest_changes 2025-12-04T08:54:02.3412751Z * [new branch] yguo/patch_constexpr_changes -> origin/yguo/patch_constexpr_changes 2025-12-04T08:54:02.3412857Z * [new branch] yiming/bootcamp -> 
origin/yiming/bootcamp 2025-12-04T08:54:02.3412958Z * [new branch] yiming/run_with_start_end_rng_hop -> origin/yiming/run_with_start_end_rng_hop 2025-12-04T08:54:02.3413023Z * [new branch] yolo-llama3 -> origin/yolo-llama3 2025-12-04T08:54:02.3413096Z * [new branch] zainr/canary-test -> origin/zainr/canary-test 2025-12-04T08:54:02.3413183Z * [new branch] zainr/cleanup-gh-runners -> origin/zainr/cleanup-gh-runners 2025-12-04T08:54:02.3413264Z * [new branch] zainr/pull-migration-c -> origin/zainr/pull-migration-c 2025-12-04T08:54:02.3413327Z * [new branch] zainr/test2 -> origin/zainr/test2 2025-12-04T08:54:02.3413400Z * [new branch] zasdfgbnm-patch-3 -> origin/zasdfgbnm-patch-3 2025-12-04T08:54:02.3413460Z * [new branch] zb2p -> origin/zb2p 2025-12-04T08:54:02.3413546Z * [new branch] zeros-and-scatter-part2 -> origin/zeros-and-scatter-part2 2025-12-04T08:54:02.3413633Z * [new branch] zhxchen17/ci/vllm_lora_oom -> origin/zhxchen17/ci/vllm_lora_oom 2025-12-04T08:54:02.3413735Z * [new branch] zhxchen17/ci/vllm_multimodal_oom -> origin/zhxchen17/ci/vllm_multimodal_oom 2025-12-04T08:54:02.3413838Z * [new branch] zhxchen17/ci/vllm_pin -> origin/zhxchen17/ci/vllm_pin 2025-12-04T08:54:02.3413961Z * [new branch] zhxchen17/dynamo/unsafe_drop_all_guards -> origin/zhxchen17/dynamo/unsafe_drop_all_guards 2025-12-04T08:54:02.3414060Z * [new branch] zhxchen17/export/call_override -> origin/zhxchen17/export/call_override 2025-12-04T08:54:02.3414145Z * [new branch] zhxchen17/export/codemod1 -> origin/zhxchen17/export/codemod1 2025-12-04T08:54:02.3414234Z * [new branch] zhxchen17/export/ctx_return -> origin/zhxchen17/export/ctx_return 2025-12-04T08:54:02.3414366Z * [new branch] zhxchen17/export/disable_side_effect_warn -> origin/zhxchen17/export/disable_side_effect_warn 2025-12-04T08:54:02.3414463Z * [new branch] zhxchen17/export/pytree_check -> origin/zhxchen17/export/pytree_check 2025-12-04T08:54:02.3414552Z * [new branch] zhxchen17/precompile/aoti -> 
origin/zhxchen17/precompile/aoti 2025-12-04T08:54:02.3414652Z * [new branch] zhxchen17/precompile/globals -> origin/zhxchen17/precompile/globals 2025-12-04T08:54:02.3414768Z * [new branch] zhxchen17/precompile/inductor_guards -> origin/zhxchen17/precompile/inductor_guards 2025-12-04T08:54:02.3414841Z * [new branch] zhxchen17/scratch/0 -> origin/zhxchen17/scratch/0 2025-12-04T08:54:02.3414948Z * [new branch] zhxchen17/torch_export_api_update -> origin/zhxchen17/torch_export_api_update 2025-12-04T08:54:02.3415026Z * [new branch] zhxhcen17/moodycamel -> origin/zhxhcen17/moodycamel 2025-12-04T08:54:02.3415099Z * [new branch] zxiiro/build-times -> origin/zxiiro/build-times 2025-12-04T08:54:02.3415173Z * [new branch] zxiiro/c7i.2xlarge -> origin/zxiiro/c7i.2xlarge 2025-12-04T08:54:02.3415252Z * [new branch] zxiiro/c7i.2xlarge.h100 -> origin/zxiiro/c7i.2xlarge.h100 2025-12-04T08:54:02.3415316Z * [new branch] zxiiro/main -> origin/zxiiro/main 2025-12-04T08:54:02.3415383Z * [new branch] zxiiro/risc64 -> origin/zxiiro/risc64 2025-12-04T08:54:02.3415473Z * [new branch] zxiiro/test-multicloud-arc -> origin/zxiiro/test-multicloud-arc 2025-12-04T08:54:02.3415534Z * [new tag] ciflow/dynamo/169525 -> ciflow/dynamo/169525 2025-12-04T08:54:02.3415604Z t [tag update] ciflow/inductor/167647 -> ciflow/inductor/167647 2025-12-04T08:54:02.3415702Z t [tag update] ciflow/inductor/168266 -> ciflow/inductor/168266 2025-12-04T08:54:02.3415771Z t [tag update] ciflow/inductor/169535 -> ciflow/inductor/169535 2025-12-04T08:54:02.3415831Z * [new tag] ciflow/trunk/165728 -> ciflow/trunk/165728 2025-12-04T08:54:02.3415891Z * [new tag] ciflow/trunk/169048 -> ciflow/trunk/169048 2025-12-04T08:54:02.3415953Z * [new tag] ciflow/trunk/169125 -> ciflow/trunk/169125 2025-12-04T08:54:02.3416014Z * [new tag] ciflow/trunk/169555 -> ciflow/trunk/169555 2025-12-04T08:54:02.3416073Z * [new tag] ciflow/xpu/169555 -> ciflow/xpu/169555 2025-12-04T08:54:02.5720713Z [command]/usr/bin/git rev-parse --verify --quiet 
ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32^{object} 2025-12-04T08:54:02.5862018Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:02.5866260Z ##[endgroup] 2025-12-04T08:54:02.5866504Z ##[group]Determining the checkout info 2025-12-04T08:54:02.5866763Z ##[endgroup] 2025-12-04T08:54:02.5869754Z [command]/usr/bin/git sparse-checkout disable 2025-12-04T08:54:02.5960176Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-12-04T08:54:02.5979433Z ##[group]Checking out the ref 2025-12-04T08:54:02.5990480Z [command]/usr/bin/git checkout --progress --force ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:02.6409012Z HEAD is now at ffd9b0fb4355 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T08:54:02.6412473Z ##[endgroup] 2025-12-04T08:54:02.6412656Z ##[group]Setting up auth for fetching submodules 2025-12-04T08:54:02.6418893Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T08:54:02.6464603Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-12-04T08:54:02.6490218Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-12-04T08:54:02.6517634Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-12-04T08:54:02.6593511Z ##[endgroup] 2025-12-04T08:54:02.6593701Z ##[group]Fetching submodules 2025-12-04T08:54:02.6610647Z [command]/usr/bin/git submodule sync --recursive 2025-12-04T08:54:02.6795510Z Synchronizing submodule url for 'android/libs/fbjni' 2025-12-04T08:54:02.6809704Z Synchronizing submodule url for 'third_party/FP16' 2025-12-04T08:54:02.6825380Z Synchronizing submodule url for 'third_party/FXdiv' 2025-12-04T08:54:02.6836772Z Synchronizing submodule url for 'third_party/NNPACK' 2025-12-04T08:54:02.6846397Z Synchronizing submodule url for 'third_party/NVTX' 2025-12-04T08:54:02.6857124Z Synchronizing submodule 
url for 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:02.6870645Z Synchronizing submodule url for 'third_party/XNNPACK' 2025-12-04T08:54:02.6891111Z Synchronizing submodule url for 'third_party/aiter' 2025-12-04T08:54:02.6930541Z Synchronizing submodule url for 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:02.6932097Z Synchronizing submodule url for 'third_party/benchmark' 2025-12-04T08:54:02.6976738Z Synchronizing submodule url for 'third_party/composable_kernel' 2025-12-04T08:54:02.7008470Z Synchronizing submodule url for 'third_party/cpp-httplib' 2025-12-04T08:54:02.7008655Z Synchronizing submodule url for 'third_party/cpuinfo' 2025-12-04T08:54:02.7023958Z Synchronizing submodule url for 'third_party/cudnn_frontend' 2025-12-04T08:54:02.7035842Z Synchronizing submodule url for 'third_party/cutlass' 2025-12-04T08:54:02.7053362Z Synchronizing submodule url for 'third_party/fbgemm' 2025-12-04T08:54:02.7071859Z Synchronizing submodule url for 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:02.7083570Z Synchronizing submodule url for 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:02.7101851Z Synchronizing submodule url for 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:02.7112589Z Synchronizing submodule url for 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:02.7130700Z Synchronizing submodule url for 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:02.7144520Z Synchronizing submodule url for 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:02.7175199Z Synchronizing submodule url for 'third_party/fbgemm/external/json' 2025-12-04T08:54:02.7191987Z Synchronizing submodule url for 'third_party/flash-attention' 2025-12-04T08:54:02.7208039Z Synchronizing submodule url for 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:02.7227437Z Synchronizing submodule url for 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:02.7245615Z Synchronizing submodule url for 
'third_party/flatbuffers' 2025-12-04T08:54:02.7296834Z Synchronizing submodule url for 'third_party/fmt' 2025-12-04T08:54:02.7297022Z Synchronizing submodule url for 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:02.7312134Z Synchronizing submodule url for 'third_party/gloo' 2025-12-04T08:54:02.7325568Z Synchronizing submodule url for 'third_party/googletest' 2025-12-04T08:54:02.7359687Z Synchronizing submodule url for 'third_party/ideep' 2025-12-04T08:54:02.7359870Z Synchronizing submodule url for 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:02.7364999Z Synchronizing submodule url for 'third_party/ittapi' 2025-12-04T08:54:02.7376388Z Synchronizing submodule url for 'third_party/kineto' 2025-12-04T08:54:02.7390982Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:02.7404189Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:02.7417460Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:02.7429540Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:02.7441723Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:02.7452727Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:02.7468401Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:02.7480414Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:02.7492828Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:02.7504504Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 
2025-12-04T08:54:02.7518739Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:02.7529884Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:02.7546582Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:02.7561435Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:02.7583888Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:02.7597255Z Synchronizing submodule url for 'third_party/kleidiai' 2025-12-04T08:54:02.7609480Z Synchronizing submodule url for 'third_party/mimalloc' 2025-12-04T08:54:02.7622948Z Synchronizing submodule url for 'third_party/nlohmann' 2025-12-04T08:54:02.7633822Z Synchronizing submodule url for 'third_party/onnx' 2025-12-04T08:54:02.7654067Z Synchronizing submodule url for 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:02.7669841Z Synchronizing submodule url for 'third_party/opentelemetry-cpp' 2025-12-04T08:54:02.7681305Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:02.7696910Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:02.7708844Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:02.7719622Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:02.7731880Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:02.7743110Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:02.7753645Z Synchronizing submodule url for 
'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:02.7766300Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:02.7777788Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:02.7788164Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:02.7812785Z Synchronizing submodule url for 'third_party/pocketfft' 2025-12-04T08:54:02.7840492Z Synchronizing submodule url for 'third_party/protobuf' 2025-12-04T08:54:02.7840694Z Synchronizing submodule url for 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:02.7848656Z Synchronizing submodule url for 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:02.7864754Z Synchronizing submodule url for 'third_party/psimd' 2025-12-04T08:54:02.7875925Z Synchronizing submodule url for 'third_party/pthreadpool' 2025-12-04T08:54:02.7885978Z Synchronizing submodule url for 'third_party/pybind11' 2025-12-04T08:54:02.7899500Z Synchronizing submodule url for 'third_party/python-peachpy' 2025-12-04T08:54:02.7912225Z Synchronizing submodule url for 'third_party/sleef' 2025-12-04T08:54:02.7922535Z Synchronizing submodule url for 'third_party/tensorpipe' 2025-12-04T08:54:02.7933423Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:02.7944659Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:02.7957465Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:02.7968012Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:02.7980762Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:02.8008857Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --recursive 
2025-12-04T08:54:02.8320761Z Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2025-12-04T08:54:02.8393250Z Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2025-12-04T08:54:02.8457331Z Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2025-12-04T08:54:02.8578482Z Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2025-12-04T08:54:02.8657456Z Submodule path 'third_party/NVTX': checked out '3ebbc93ded7285963bff932c678fa367eb393ba6' 2025-12-04T08:54:02.8715057Z Submodule path 'third_party/VulkanMemoryAllocator': checked out '1d8f600fd424278486eade7ed3e877c99f0846b1' 2025-12-04T08:54:03.3720874Z Submodule path 'third_party/XNNPACK': checked out '51a0103656eff6fc9bfd39a4597923c4b542c883' 2025-12-04T08:54:03.3887084Z Submodule path 'third_party/aiter': checked out '01aae101b9e5e94d6c16a9514c9fb8df99c93150' 2025-12-04T08:54:03.4116232Z Submodule path 'third_party/aiter/3rdparty/composable_kernel': checked out 'cffe8fa2a442ac8e80dd236a1a5d24fe3d7e0cbf' 2025-12-04T08:54:03.4232543Z Submodule path 'third_party/benchmark': checked out '299e5928955cc62af9968370293b916f5130916f' 2025-12-04T08:54:03.4465674Z Submodule path 'third_party/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T08:54:03.4531819Z Submodule path 'third_party/cpp-httplib': checked out '89c932f313c6437c38f2982869beacc89c2f2246' 2025-12-04T08:54:03.5201700Z Submodule path 'third_party/cpuinfo': checked out 'f858c30bcb16f8effd5ff46996f0514539e17abc' 2025-12-04T08:54:03.5300483Z Submodule path 'third_party/cudnn_frontend': checked out '0b1577c8c83401237d601d0d0db5210506705396' 2025-12-04T08:54:03.5591340Z Submodule path 'third_party/cutlass': checked out 'f88806b1e31dfa579842638740216dd41fc6c588' 2025-12-04T08:54:03.5869795Z Submodule path 'third_party/fbgemm': checked out 
'c0b988d39a9e47c794d699f29930ed4d7c7e13a4' 2025-12-04T08:54:03.6173116Z Submodule path 'third_party/fbgemm/external/asmjit': checked out 'a3199e8857792cd10b7589ff5d58343d2c9008ea' 2025-12-04T08:54:03.7960942Z Submodule path 'third_party/fbgemm/external/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T08:54:03.8223067Z Submodule path 'third_party/fbgemm/external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-12-04T08:54:04.2756218Z Submodule path 'third_party/fbgemm/external/cutlass': checked out '98125ce499b0fdf7ffbe0e3052f5b8709f4840f8' 2025-12-04T08:54:04.2979734Z Submodule path 'third_party/fbgemm/external/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:04.3054220Z Submodule path 'third_party/fbgemm/external/hipify_torch': checked out '63b6a7b541fa7f08f8475ca7d74054db36ff2691' 2025-12-04T08:54:04.3624908Z Submodule path 'third_party/fbgemm/external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-12-04T08:54:04.3782221Z Submodule path 'third_party/flash-attention': checked out '979702c87a8713a8e0a5e9fee122b90d2ef13be5' 2025-12-04T08:54:04.3975700Z Submodule path 'third_party/flash-attention/csrc/composable_kernel': checked out '888317e698e9803c62bd38568abc9e05d7709f33' 2025-12-04T08:54:04.4139949Z Submodule path 'third_party/flash-attention/csrc/cutlass': checked out 'c506e16788cb08416a4a57e11a9067beeee29420' 2025-12-04T08:54:04.4307140Z Submodule path 'third_party/flatbuffers': checked out 'a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757' 2025-12-04T08:54:04.4448058Z Submodule path 'third_party/fmt': checked out '407c905e45ad75fc29bf0f9bb7c5c2fd3475976f' 2025-12-04T08:54:04.4663099Z Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2025-12-04T08:54:04.4796068Z Submodule path 'third_party/gloo': checked out '54cbae0d3a67fa890b4c3d9ee162b7860315e341' 2025-12-04T08:54:04.4978282Z Submodule path 
'third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:04.5058016Z Submodule path 'third_party/ideep': checked out '719d8e6cd7f7a0e01b155657526d693acf97c2b3' 2025-12-04T08:54:04.9267158Z Submodule path 'third_party/ideep/mkl-dnn': checked out '8d263e693366ef8db40acc569cc7d8edf644556d' 2025-12-04T08:54:04.9363526Z Submodule path 'third_party/ittapi': checked out 'dec1d23ca65ab069d225dfe40dea14f455170959' 2025-12-04T08:54:04.9436045Z Submodule path 'third_party/kineto': checked out '31f85df8fbd89c188f14ef10f1ec65379786b943' 2025-12-04T08:54:04.9516444Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog': checked out 'd2ffe0a4e3acace628db49974246b66fc3e85fb1' 2025-12-04T08:54:04.9589779Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2025-12-04T08:54:04.9720520Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2025-12-04T08:54:04.9760437Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2025-12-04T08:54:04.9782515Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2025-12-04T08:54:04.9825925Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2025-12-04T08:54:04.9886398Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2025-12-04T08:54:04.9943214Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:05.0023029Z Submodule path 
'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 2025-12-04T08:54:05.0072999Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2025-12-04T08:54:05.0139832Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp': checked out 'b1234816facfdda29845c46696a02998a4af115a' 2025-12-04T08:54:05.0232040Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'd7ba35bbb649209c66e582d5a0244ba988a15159' 2025-12-04T08:54:05.0288982Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T08:54:05.0342901Z Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out '40626af88bd7df9a5fb80be7b25ac85b122d6c21' 2025-12-04T08:54:05.0398225Z Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:05.0561395Z Submodule path 'third_party/kleidiai': checked out 'd7770c89632329a9914ef1a90289917597639cbe' 2025-12-04T08:54:05.0591919Z Submodule path 'third_party/mimalloc': checked out 'fbd8b99c2b828428947d70fdc046bb55609be93e' 2025-12-04T08:54:05.0717987Z Submodule path 'third_party/nlohmann': checked out '55f93686c01528224f448c19128836e7df245f72' 2025-12-04T08:54:05.2616142Z Submodule path 'third_party/onnx': checked out 'e709452ef2bbc1d113faf678c24e6d3467696e83' 2025-12-04T08:54:05.2810929Z Submodule path 'third_party/onnx/third_party/pybind11': checked out 'a2e59f0e7065404b44dfe92a28aca47ba1378dc4' 2025-12-04T08:54:05.2980759Z Submodule path 'third_party/opentelemetry-cpp': checked out 'a799f4aed9c94b765dcdaabaeab7d5e7e2310878' 2025-12-04T08:54:05.3007117Z Submodule path 
'third_party/opentelemetry-cpp/third_party/benchmark': checked out 'd572f4777349d43653b21d6c2fc63020ab326db2' 2025-12-04T08:54:05.3065525Z Submodule path 'third_party/opentelemetry-cpp/third_party/googletest': checked out 'b796f7d44681514f58a683a3a71ff17c94edb0c1' 2025-12-04T08:54:05.3180966Z Submodule path 'third_party/opentelemetry-cpp/third_party/ms-gsl': checked out '6f4529395c5b7c2d661812257cd6780c67e54afa' 2025-12-04T08:54:05.3310332Z Submodule path 'third_party/opentelemetry-cpp/third_party/nlohmann-json': checked out 'bc889afb4c5bf1c0d8ee29ef35eaaf4c8bef8a5d' 2025-12-04T08:54:05.3370003Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto': checked out '4ca4f0335c63cda7ab31ea7ed70d6553aee14dce' 2025-12-04T08:54:05.3440513Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp': checked out '06b57f48ded1fa3bdd3d4346f6ef29e40e08eaf5' 2025-12-04T08:54:05.3481641Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp': checked out 'c9ffcdda9086ffd9e1283ea7a0276d831f3c8a8d' 2025-12-04T08:54:05.3600770Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'eefb26f82b233268fc98577d265352720d477ba4' 2025-12-04T08:54:05.3719498Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T08:54:05.3876521Z Submodule path 'third_party/opentelemetry-cpp/tools/vcpkg': checked out '8eb57355a4ffb410a2e94c07b4dca2dffbee8e50' 2025-12-04T08:54:05.3940366Z Submodule path 'third_party/pocketfft': checked out '0fa0ef591e38c2758e3184c6c23e497b9f732ffa' 2025-12-04T08:54:05.5281603Z Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2025-12-04T08:54:05.5408154Z Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2025-12-04T08:54:05.5645833Z Submodule path 
'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2025-12-04T08:54:05.5706206Z Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2025-12-04T08:54:05.5776821Z Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2025-12-04T08:54:05.5969926Z Submodule path 'third_party/pybind11': checked out 'f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8' 2025-12-04T08:54:05.6230998Z Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 2025-12-04T08:54:05.6489074Z Submodule path 'third_party/sleef': checked out '5a1d179df9cf652951b59010a2d2075372d67f68' 2025-12-04T08:54:05.6612435Z Submodule path 'third_party/tensorpipe': checked out '2b4cd91092d335a697416b2a3cb398283246849d' 2025-12-04T08:54:05.6791987Z Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2025-12-04T08:54:05.6871830Z Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2025-12-04T08:54:05.7180859Z Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '5152db2cbfeb5582e9c27c5ea1dba2cd9e10759b' 2025-12-04T08:54:05.7307844Z Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2025-12-04T08:54:05.7378955Z Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2025-12-04T08:54:05.7407906Z [command]/usr/bin/git submodule foreach --recursive git config --local gc.auto 0 2025-12-04T08:54:05.7637834Z Entering 'android/libs/fbjni' 2025-12-04T08:54:05.7672196Z Entering 'third_party/FP16' 2025-12-04T08:54:05.7712584Z Entering 'third_party/FXdiv' 2025-12-04T08:54:05.7766972Z Entering 'third_party/NNPACK' 2025-12-04T08:54:05.7830793Z Entering 'third_party/NVTX' 
2025-12-04T08:54:05.7872504Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:05.7918138Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:05.7948287Z Entering 'third_party/aiter' 2025-12-04T08:54:05.7973409Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:05.7997686Z Entering 'third_party/benchmark' 2025-12-04T08:54:05.8019183Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:05.8051478Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:05.8088217Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:05.8112014Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:05.8145704Z Entering 'third_party/cutlass' 2025-12-04T08:54:05.8170409Z Entering 'third_party/fbgemm' 2025-12-04T08:54:05.8195519Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:05.8221237Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:05.8283537Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:05.8355260Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:05.8391987Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:05.8415826Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:05.8439347Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:05.8463151Z Entering 'third_party/flash-attention' 2025-12-04T08:54:05.8500503Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:05.8539524Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:05.8586591Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:05.8613636Z Entering 'third_party/fmt' 2025-12-04T08:54:05.8653061Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:05.8677216Z Entering 'third_party/gloo' 2025-12-04T08:54:05.8701313Z Entering 'third_party/googletest' 2025-12-04T08:54:05.8725736Z Entering 'third_party/ideep' 2025-12-04T08:54:05.8749597Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:05.8774115Z Entering 'third_party/ittapi' 
2025-12-04T08:54:05.8796723Z Entering 'third_party/kineto' 2025-12-04T08:54:05.8820045Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:05.8845971Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:05.8874442Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:05.8897599Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:05.8918346Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:05.8939244Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:05.8961366Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:05.8980271Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:05.9002116Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:05.9026371Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:05.9047928Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:05.9070339Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:05.9092670Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:05.9117684Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:05.9141638Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:05.9168852Z Entering 'third_party/kleidiai' 2025-12-04T08:54:05.9192607Z Entering 'third_party/mimalloc' 2025-12-04T08:54:05.9218584Z Entering 'third_party/nlohmann' 2025-12-04T08:54:05.9240888Z Entering 'third_party/onnx' 2025-12-04T08:54:05.9267019Z Entering 'third_party/onnx/third_party/pybind11' 
2025-12-04T08:54:05.9291594Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:05.9315326Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:05.9335600Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:05.9356915Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:05.9377382Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:05.9398512Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:05.9419532Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:05.9439535Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:05.9459941Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:05.9479990Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:05.9501848Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:05.9530960Z Entering 'third_party/pocketfft' 2025-12-04T08:54:05.9554570Z Entering 'third_party/protobuf' 2025-12-04T08:54:05.9578464Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:05.9598538Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:05.9621272Z Entering 'third_party/psimd' 2025-12-04T08:54:05.9644362Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:05.9666578Z Entering 'third_party/pybind11' 2025-12-04T08:54:05.9689115Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:05.9710064Z Entering 'third_party/sleef' 2025-12-04T08:54:05.9731052Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:05.9751829Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:05.9771522Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:05.9791625Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:05.9815561Z Entering 
'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:05.9833485Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:05.9868993Z ##[endgroup] 2025-12-04T08:54:05.9869191Z ##[group]Persisting credentials for submodules 2025-12-04T08:54:05.9876780Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-12-04T08:54:06.0043191Z Entering 'android/libs/fbjni' 2025-12-04T08:54:06.0066707Z Entering 'third_party/FP16' 2025-12-04T08:54:06.0091950Z Entering 'third_party/FXdiv' 2025-12-04T08:54:06.0115307Z Entering 'third_party/NNPACK' 2025-12-04T08:54:06.0138586Z Entering 'third_party/NVTX' 2025-12-04T08:54:06.0162891Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:06.0185949Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:06.0214458Z Entering 'third_party/aiter' 2025-12-04T08:54:06.0238196Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:06.0269017Z Entering 'third_party/benchmark' 2025-12-04T08:54:06.0293012Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:06.0319221Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:06.0342189Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:06.0366072Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:06.0389039Z Entering 'third_party/cutlass' 2025-12-04T08:54:06.0416814Z Entering 'third_party/fbgemm' 2025-12-04T08:54:06.0442865Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:06.0464616Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:06.0487794Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:06.0511457Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:06.0538263Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:06.0558475Z Entering 'third_party/fbgemm/external/hipify_torch' 
2025-12-04T08:54:06.0579781Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:06.0605127Z Entering 'third_party/flash-attention' 2025-12-04T08:54:06.0628059Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:06.0658126Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:06.0686052Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:06.0709581Z Entering 'third_party/fmt' 2025-12-04T08:54:06.0732548Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:06.0756542Z Entering 'third_party/gloo' 2025-12-04T08:54:06.0780789Z Entering 'third_party/googletest' 2025-12-04T08:54:06.0803106Z Entering 'third_party/ideep' 2025-12-04T08:54:06.0827264Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:06.0855361Z Entering 'third_party/ittapi' 2025-12-04T08:54:06.0876807Z Entering 'third_party/kineto' 2025-12-04T08:54:06.0897338Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:06.0925079Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:06.0951926Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:06.0980849Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:06.1006696Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:06.1030908Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:06.1058432Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:06.1082639Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:06.1106046Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:06.1132954Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:06.1158663Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:06.1181502Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:06.1207588Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:06.1234730Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:06.1258972Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:06.1283886Z Entering 'third_party/kleidiai' 2025-12-04T08:54:06.1305065Z Entering 'third_party/mimalloc' 2025-12-04T08:54:06.1325361Z Entering 'third_party/nlohmann' 2025-12-04T08:54:06.1345703Z Entering 'third_party/onnx' 2025-12-04T08:54:06.1374544Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:06.1402999Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:06.1429319Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:06.1452431Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:06.1476018Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:06.1495437Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:06.1515259Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:06.1534658Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:06.1558872Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:06.1582630Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:06.1605117Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:06.1629487Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:06.1662691Z Entering 'third_party/pocketfft' 2025-12-04T08:54:06.1686052Z Entering 
'third_party/protobuf' 2025-12-04T08:54:06.1709891Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:06.1733282Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:06.1756919Z Entering 'third_party/psimd' 2025-12-04T08:54:06.1779686Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:06.1803037Z Entering 'third_party/pybind11' 2025-12-04T08:54:06.1825294Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:06.1847183Z Entering 'third_party/sleef' 2025-12-04T08:54:06.1870339Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:06.1900337Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:06.1924243Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:06.1950605Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:06.1975454Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:06.2000842Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:06.2043815Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-12-04T08:54:06.2215436Z Entering 'android/libs/fbjni' 2025-12-04T08:54:06.2237735Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T08:54:06.2249903Z Entering 'third_party/FP16' 2025-12-04T08:54:06.2272879Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T08:54:06.2284347Z Entering 'third_party/FXdiv' 2025-12-04T08:54:06.2308073Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T08:54:06.2318699Z Entering 'third_party/NNPACK' 2025-12-04T08:54:06.2339867Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config 
remote.origin.url 2025-12-04T08:54:06.2349172Z Entering 'third_party/NVTX' 2025-12-04T08:54:06.2370753Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T08:54:06.2381533Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:06.2401072Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T08:54:06.2411077Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:06.2432197Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T08:54:06.2449026Z Entering 'third_party/aiter' 2025-12-04T08:54:06.2470368Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T08:54:06.2478964Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:06.2502597Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T08:54:06.2520339Z Entering 'third_party/benchmark' 2025-12-04T08:54:06.2539937Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:06.2550087Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:06.2570933Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T08:54:06.2584782Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:06.2606049Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T08:54:06.2618671Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:06.2637411Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T08:54:06.2647444Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:06.2671108Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config 
remote.origin.url 2025-12-04T08:54:06.2680422Z Entering 'third_party/cutlass' 2025-12-04T08:54:06.2707641Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T08:54:06.2722395Z Entering 'third_party/fbgemm' 2025-12-04T08:54:06.2743699Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T08:54:06.2754825Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:06.2775909Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T08:54:06.2785470Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:06.2809297Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T08:54:06.2822347Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:06.2843069Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T08:54:06.2852320Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:06.2871718Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T08:54:06.2883774Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:06.2904483Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T08:54:06.2916117Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:06.2939456Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T08:54:06.2949000Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:06.2972884Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config 
remote.origin.url 2025-12-04T08:54:06.2986774Z Entering 'third_party/flash-attention' 2025-12-04T08:54:06.3007942Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T08:54:06.3018361Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:06.3041589Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T08:54:06.3056498Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:06.3077369Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T08:54:06.3093425Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:06.3120709Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T08:54:06.3133545Z Entering 'third_party/fmt' 2025-12-04T08:54:06.3158399Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:06.3169140Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:06.3191061Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T08:54:06.3201517Z Entering 'third_party/gloo' 2025-12-04T08:54:06.3223152Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T08:54:06.3232835Z Entering 'third_party/googletest' 2025-12-04T08:54:06.3252557Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:06.3263575Z Entering 'third_party/ideep' 2025-12-04T08:54:06.3290036Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T08:54:06.3301552Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:06.3324446Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T08:54:06.3338980Z Entering 'third_party/ittapi' 2025-12-04T08:54:06.3366412Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T08:54:06.3378048Z Entering 'third_party/kineto' 2025-12-04T08:54:06.3399617Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T08:54:06.3409279Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:06.3434370Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T08:54:06.3443247Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:06.3467960Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T08:54:06.3479849Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:06.3503718Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T08:54:06.3513963Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:06.3535723Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:06.3545659Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:06.3565128Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T08:54:06.3576900Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:06.3603880Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T08:54:06.3617438Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:06.3646435Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T08:54:06.3661680Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:06.3690060Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:06.3703814Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:06.3725438Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T08:54:06.3734946Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:06.3765176Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T08:54:06.3778953Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:06.3801488Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:06.3809115Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:06.3835468Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:06.3848326Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:06.3873808Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:06.3888646Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:06.3910688Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T08:54:06.3920655Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:06.3950375Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T08:54:06.3963633Z Entering 'third_party/kleidiai' 2025-12-04T08:54:06.3985261Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T08:54:06.3995919Z Entering 'third_party/mimalloc' 2025-12-04T08:54:06.4016504Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T08:54:06.4025857Z Entering 'third_party/nlohmann' 2025-12-04T08:54:06.4046580Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T08:54:06.4060240Z Entering 'third_party/onnx' 2025-12-04T08:54:06.4081929Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T08:54:06.4099816Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:06.4124866Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:06.4146251Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:06.4173448Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T08:54:06.4183820Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:06.4209941Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:06.4220889Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:06.4242291Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:06.4255038Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:06.4280016Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T08:54:06.4296758Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:06.4318804Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T08:54:06.4329600Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:06.4353358Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T08:54:06.4365314Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:06.4394553Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T08:54:06.4404603Z Entering 
'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:06.4430583Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:06.4441170Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:06.4463250Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:06.4477346Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:06.4503283Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:06.4515462Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:06.4535703Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T08:54:06.4556531Z Entering 'third_party/pocketfft' 2025-12-04T08:54:06.4580203Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T08:54:06.4591851Z Entering 'third_party/protobuf' 2025-12-04T08:54:06.4619247Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T08:54:06.4631120Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:06.4656830Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:06.4668447Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:06.4687623Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:06.4698195Z Entering 
'third_party/psimd' 2025-12-04T08:54:06.4718387Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T08:54:06.4726689Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:06.4749725Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T08:54:06.4759092Z Entering 'third_party/pybind11' 2025-12-04T08:54:06.4782243Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:06.4792830Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:06.4814451Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T08:54:06.4829408Z Entering 'third_party/sleef' 2025-12-04T08:54:06.4848405Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T08:54:06.4863508Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:06.4885684Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T08:54:06.4895760Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:06.4921956Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:06.4932070Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:06.4952151Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T08:54:06.4961063Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:06.4979141Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T08:54:06.4989115Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:06.5036689Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:06.5044984Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:06.5068171Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T08:54:06.5263788Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-12-04T08:54:06.5435666Z Entering 'android/libs/fbjni' 2025-12-04T08:54:06.5459643Z Entering 'third_party/FP16' 2025-12-04T08:54:06.5481969Z Entering 'third_party/FXdiv' 2025-12-04T08:54:06.5507836Z Entering 'third_party/NNPACK' 2025-12-04T08:54:06.5527886Z Entering 'third_party/NVTX' 2025-12-04T08:54:06.5549295Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:06.5568251Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:06.5593339Z Entering 'third_party/aiter' 2025-12-04T08:54:06.5615226Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:06.5639581Z Entering 'third_party/benchmark' 2025-12-04T08:54:06.5663497Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:06.5689124Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:06.5715788Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:06.5737265Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:06.5760653Z Entering 'third_party/cutlass' 2025-12-04T08:54:06.5787078Z Entering 'third_party/fbgemm' 2025-12-04T08:54:06.5810001Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:06.5829896Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:06.5854314Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:06.5874983Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:06.5898120Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:06.5918161Z 
Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:06.5937593Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:06.5960358Z Entering 'third_party/flash-attention' 2025-12-04T08:54:06.5981555Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:06.6002056Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:06.6024397Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:06.6046184Z Entering 'third_party/fmt' 2025-12-04T08:54:06.6067014Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:06.6085920Z Entering 'third_party/gloo' 2025-12-04T08:54:06.6109326Z Entering 'third_party/googletest' 2025-12-04T08:54:06.6132228Z Entering 'third_party/ideep' 2025-12-04T08:54:06.6152196Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:06.6175162Z Entering 'third_party/ittapi' 2025-12-04T08:54:06.6196505Z Entering 'third_party/kineto' 2025-12-04T08:54:06.6218136Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:06.6236524Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:06.6258588Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:06.6279228Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:06.6298327Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:06.6316889Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:06.6340457Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:06.6360333Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:06.6380284Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:06.6400466Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:06.6420179Z 
Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:06.6439100Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:06.6460843Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:06.6484280Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:06.6504146Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:06.6527268Z Entering 'third_party/kleidiai' 2025-12-04T08:54:06.6546878Z Entering 'third_party/mimalloc' 2025-12-04T08:54:06.6569526Z Entering 'third_party/nlohmann' 2025-12-04T08:54:06.6591129Z Entering 'third_party/onnx' 2025-12-04T08:54:06.6621102Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:06.6648071Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:06.6669682Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:06.6690439Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:06.6711226Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:06.6733822Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:06.6752600Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:06.6780629Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:06.6807185Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:06.6833297Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:06.6856139Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:06.6877416Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:06.6912386Z Entering 'third_party/pocketfft' 2025-12-04T08:54:06.6934463Z 
Entering 'third_party/protobuf' 2025-12-04T08:54:06.6958181Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:06.6980038Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:06.7001890Z Entering 'third_party/psimd' 2025-12-04T08:54:06.7024029Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:06.7047802Z Entering 'third_party/pybind11' 2025-12-04T08:54:06.7070589Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:06.7092544Z Entering 'third_party/sleef' 2025-12-04T08:54:06.7113005Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:06.7134127Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:06.7155975Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:06.7176595Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:06.7194767Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:06.7214223Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:06.7253228Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-12-04T08:54:06.7405510Z Entering 'android/libs/fbjni' 2025-12-04T08:54:06.7426799Z Entering 'third_party/FP16' 2025-12-04T08:54:06.7447113Z Entering 'third_party/FXdiv' 2025-12-04T08:54:06.7467716Z Entering 'third_party/NNPACK' 2025-12-04T08:54:06.7488096Z Entering 'third_party/NVTX' 2025-12-04T08:54:06.7509238Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:06.7529410Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:06.7556804Z Entering 'third_party/aiter' 2025-12-04T08:54:06.7578010Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:06.7602448Z Entering 'third_party/benchmark' 2025-12-04T08:54:06.7622511Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:06.7646627Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:06.7666447Z Entering 'third_party/cpuinfo' 
2025-12-04T08:54:06.7685753Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:06.7706131Z Entering 'third_party/cutlass' 2025-12-04T08:54:06.7730308Z Entering 'third_party/fbgemm' 2025-12-04T08:54:06.7751801Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:06.7770322Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:06.7792898Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:06.7813311Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:06.7837308Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:06.7857100Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:06.7876789Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:06.7898424Z Entering 'third_party/flash-attention' 2025-12-04T08:54:06.7919160Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:06.7940392Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:06.7966494Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:06.7987473Z Entering 'third_party/fmt' 2025-12-04T08:54:06.8008354Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:06.8028006Z Entering 'third_party/gloo' 2025-12-04T08:54:06.8048934Z Entering 'third_party/googletest' 2025-12-04T08:54:06.8069709Z Entering 'third_party/ideep' 2025-12-04T08:54:06.8090582Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:06.8114396Z Entering 'third_party/ittapi' 2025-12-04T08:54:06.8134444Z Entering 'third_party/kineto' 2025-12-04T08:54:06.8155480Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:06.8184167Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:06.8206675Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:06.8227735Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:06.8250790Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:06.8273284Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:06.8307878Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:06.8337712Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:06.8359249Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:06.8380062Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:06.8402515Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:06.8421595Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:06.8444757Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:06.8473757Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:06.8495511Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:06.8518038Z Entering 'third_party/kleidiai' 2025-12-04T08:54:06.8543191Z Entering 'third_party/mimalloc' 2025-12-04T08:54:06.8565198Z Entering 'third_party/nlohmann' 2025-12-04T08:54:06.8588826Z Entering 'third_party/onnx' 2025-12-04T08:54:06.8620850Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:06.8649788Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:06.8673478Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:06.8695302Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:06.8722850Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:06.8744958Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:06.8768372Z Entering 
'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:06.8790618Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:06.8813361Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:06.8835377Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:06.8858757Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:06.8882657Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:06.8914962Z Entering 'third_party/pocketfft' 2025-12-04T08:54:06.8937995Z Entering 'third_party/protobuf' 2025-12-04T08:54:06.8965696Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:06.8987158Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:06.9012835Z Entering 'third_party/psimd' 2025-12-04T08:54:06.9035715Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:06.9059256Z Entering 'third_party/pybind11' 2025-12-04T08:54:06.9086630Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:06.9111519Z Entering 'third_party/sleef' 2025-12-04T08:54:06.9132703Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:06.9151387Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:06.9170669Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:06.9188431Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:06.9206404Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:06.9224268Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:06.9260504Z ##[endgroup] 2025-12-04T08:54:06.9433401Z [command]/usr/bin/git log -1 --format=%H 2025-12-04T08:54:06.9524090Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:06.9630247Z ##[group]Run actions/checkout@v4 2025-12-04T08:54:06.9630377Z with: 2025-12-04T08:54:06.9630499Z ref: 
ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:06.9630637Z fetch-depth: 0 2025-12-04T08:54:06.9630732Z submodules: recursive 2025-12-04T08:54:06.9630838Z show-progress: false 2025-12-04T08:54:06.9630951Z repository: pytorch/pytorch 2025-12-04T08:54:06.9631101Z token: *** 2025-12-04T08:54:06.9631195Z ssh-strict: true 2025-12-04T08:54:06.9631285Z ssh-user: git 2025-12-04T08:54:06.9631386Z persist-credentials: true 2025-12-04T08:54:06.9631489Z clean: true 2025-12-04T08:54:06.9631585Z sparse-checkout-cone-mode: true 2025-12-04T08:54:06.9631712Z fetch-tags: false 2025-12-04T08:54:06.9631798Z lfs: false 2025-12-04T08:54:06.9631890Z set-safe-directory: true 2025-12-04T08:54:06.9631988Z env: 2025-12-04T08:54:06.9632071Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:06.9632167Z ##[endgroup] 2025-12-04T08:54:07.0095087Z Syncing repository: pytorch/pytorch 2025-12-04T08:54:07.0095376Z ##[group]Getting Git version info 2025-12-04T08:54:07.0095550Z Working directory is '/home/runner/_work/pytorch/pytorch' 2025-12-04T08:54:07.0109388Z [command]/usr/bin/git version 2025-12-04T08:54:07.0129229Z git version 2.52.0 2025-12-04T08:54:07.0140052Z ##[endgroup] 2025-12-04T08:54:07.0144131Z Copying '/home/runner/.gitconfig' to '/home/runner/_work/_temp/38e62a01-ad31-4564-be30-0eda7ccc9760/.gitconfig' 2025-12-04T08:54:07.0151960Z Temporarily overriding HOME='/home/runner/_work/_temp/38e62a01-ad31-4564-be30-0eda7ccc9760' before making global git config changes 2025-12-04T08:54:07.0152272Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T08:54:07.0154291Z [command]/usr/bin/git config --global --add safe.directory /home/runner/_work/pytorch/pytorch 2025-12-04T08:54:07.0176512Z [command]/usr/bin/git config --local --get remote.origin.url 2025-12-04T08:54:07.0194813Z https://github.com/pytorch/pytorch 2025-12-04T08:54:07.0211046Z ##[group]Removing previously created refs, to avoid conflicts 2025-12-04T08:54:07.0214380Z [command]/usr/bin/git 
rev-parse --symbolic-full-name --verify --quiet HEAD 2025-12-04T08:54:07.0229059Z HEAD 2025-12-04T08:54:07.0257154Z ##[endgroup] 2025-12-04T08:54:07.0258816Z [command]/usr/bin/git submodule status 2025-12-04T08:54:07.0445209Z 7e1e1fe3858c63c251c637ae41a20de425dde96f android/libs/fbjni (v0.1.0-12-g7e1e1fe) 2025-12-04T08:54:07.0489811Z 4dfe081cf6bcd15db339cf2680b9281b8451eeb3 third_party/FP16 (4dfe081) 2025-12-04T08:54:07.0541670Z b408327ac2a15ec3e43352421954f5b1967701d1 third_party/FXdiv (b408327) 2025-12-04T08:54:07.0594246Z c07e3a0400713d546e0dea2d5466dd22ea389c73 third_party/NNPACK (c07e3a0) 2025-12-04T08:54:07.0628121Z 3ebbc93ded7285963bff932c678fa367eb393ba6 third_party/NVTX (v3.1.0-313-g3ebbc93) 2025-12-04T08:54:07.0680006Z 1d8f600fd424278486eade7ed3e877c99f0846b1 third_party/VulkanMemoryAllocator (v2.1.0-982-g1d8f600) 2025-12-04T08:54:07.0967588Z 51a0103656eff6fc9bfd39a4597923c4b542c883 third_party/XNNPACK (remotes/origin/ds/ndk-1243-g51a0103656) 2025-12-04T08:54:07.1006751Z 01aae101b9e5e94d6c16a9514c9fb8df99c93150 third_party/aiter (v0.1.1-92-g01aae101) 2025-12-04T08:54:07.1019233Z 299e5928955cc62af9968370293b916f5130916f third_party/benchmark (v1.9.3) 2025-12-04T08:54:07.1069043Z 7fe50dc3da2069d6645d9deb8c017a876472a977 third_party/composable_kernel (rocm-6.4.3-459-g7fe50dc3d) 2025-12-04T08:54:07.1147370Z 89c932f313c6437c38f2982869beacc89c2f2246 third_party/cpp-httplib (v0.26.0) 2025-12-04T08:54:07.1225450Z f858c30bcb16f8effd5ff46996f0514539e17abc third_party/cpuinfo (f858c30) 2025-12-04T08:54:07.1245884Z 0b1577c8c83401237d601d0d0db5210506705396 third_party/cudnn_frontend (v0.5-61-g0b1577c) 2025-12-04T08:54:07.1311840Z f88806b1e31dfa579842638740216dd41fc6c588 third_party/cutlass (v4.3.1) 2025-12-04T08:54:07.1334560Z c0b988d39a9e47c794d699f29930ed4d7c7e13a4 third_party/fbgemm (v1.4.0-rc1-2-gc0b988d39) 2025-12-04T08:54:07.1396572Z 979702c87a8713a8e0a5e9fee122b90d2ef13be5 third_party/flash-attention (v2.7.4) 2025-12-04T08:54:07.1408810Z 
a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757 third_party/flatbuffers (v24.12.23) 2025-12-04T08:54:07.1642988Z 407c905e45ad75fc29bf0f9bb7c5c2fd3475976f third_party/fmt (12.1.0) 2025-12-04T08:54:07.1712432Z 3fb5c176c17c765a3492cd2f0321b0dab712f350 third_party/gemmlowp/gemmlowp (remotes/origin/revert-87-master-135-g3fb5c17) 2025-12-04T08:54:07.1786652Z 54cbae0d3a67fa890b4c3d9ee162b7860315e341 third_party/gloo (remotes/origin/gh/c-p-i-o/1/base-37-g54cbae0) 2025-12-04T08:54:07.1927868Z 52eb8108c5bdec04579160ae17225d66034bd723 third_party/googletest (release-1.8.0-3544-g52eb8108) 2025-12-04T08:54:07.1981308Z 719d8e6cd7f7a0e01b155657526d693acf97c2b3 third_party/ideep (pytorch-rls-v3.7.1) 2025-12-04T08:54:07.2018060Z dec1d23ca65ab069d225dfe40dea14f455170959 third_party/ittapi (v3.25.5) 2025-12-04T08:54:07.2143846Z 31f85df8fbd89c188f14ef10f1ec65379786b943 third_party/kineto (heads/main) 2025-12-04T08:54:07.2173395Z d7770c89632329a9914ef1a90289917597639cbe third_party/kleidiai (v1.15.0) 2025-12-04T08:54:07.2184791Z fbd8b99c2b828428947d70fdc046bb55609be93e third_party/mimalloc (v2.2.4) 2025-12-04T08:54:07.2204797Z 55f93686c01528224f448c19128836e7df245f72 third_party/nlohmann (v3.12.0) 2025-12-04T08:54:07.2401709Z e709452ef2bbc1d113faf678c24e6d3467696e83 third_party/onnx (v1.18.0) 2025-12-04T08:54:07.2415295Z a799f4aed9c94b765dcdaabaeab7d5e7e2310878 third_party/opentelemetry-cpp (v1.14.2) 2025-12-04T08:54:07.2431564Z 0fa0ef591e38c2758e3184c6c23e497b9f732ffa third_party/pocketfft (release_for_eigen-40-g0fa0ef5) 2025-12-04T08:54:07.2632881Z d1eca4e4b421cd2997495c4b4e65cea6be4e9b8a third_party/protobuf (v3.7.0-rc.2-1279-gd1eca4e4b) 2025-12-04T08:54:07.2673709Z 072586a71b55b7f8c584153d223e95687148a900 third_party/psimd (heads/master) 2025-12-04T08:54:07.2719502Z 4fe0e1e183925bf8cfa6aae24237e724a96479b8 third_party/pthreadpool (0.1-144-g4fe0e1e) 2025-12-04T08:54:07.2740994Z f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8 third_party/pybind11 (v3.0.1) 2025-12-04T08:54:07.2781996Z 
f45429b087dd7d5bc78bb40dc7cf06425c252d67 third_party/python-peachpy (remotes/origin/pre-generated) 2025-12-04T08:54:07.2829137Z 5a1d179df9cf652951b59010a2d2075372d67f68 third_party/sleef (3.8) 2025-12-04T08:54:07.2871022Z 2b4cd91092d335a697416b2a3cb398283246849d third_party/tensorpipe (heads/main) 2025-12-04T08:54:07.2879212Z ##[group]Cleaning the repository 2025-12-04T08:54:07.2881504Z [command]/usr/bin/git clean -ffdx 2025-12-04T08:54:07.2998911Z [command]/usr/bin/git reset --hard HEAD 2025-12-04T08:54:07.3767669Z HEAD is now at ffd9b0fb4355 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T08:54:07.3823696Z ##[endgroup] 2025-12-04T08:54:07.3824745Z ##[group]Disabling automatic garbage collection 2025-12-04T08:54:07.3828462Z [command]/usr/bin/git config --local gc.auto 0 2025-12-04T08:54:07.3855303Z ##[endgroup] 2025-12-04T08:54:07.3855457Z ##[group]Setting up auth 2025-12-04T08:54:07.3858910Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T08:54:07.3881466Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T08:54:07.4076057Z Entering 'android/libs/fbjni' 2025-12-04T08:54:07.4101849Z Entering 'third_party/FP16' 2025-12-04T08:54:07.4128610Z Entering 'third_party/FXdiv' 2025-12-04T08:54:07.4152989Z Entering 'third_party/NNPACK' 2025-12-04T08:54:07.4176863Z Entering 'third_party/NVTX' 2025-12-04T08:54:07.4204926Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:07.4234342Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:07.4263966Z Entering 'third_party/aiter' 2025-12-04T08:54:07.4290923Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:07.4318738Z Entering 'third_party/benchmark' 2025-12-04T08:54:07.4341528Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:07.4366060Z Entering 'third_party/cpp-httplib' 
2025-12-04T08:54:07.4395335Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:07.4418697Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:07.4440608Z Entering 'third_party/cutlass' 2025-12-04T08:54:07.4466907Z Entering 'third_party/fbgemm' 2025-12-04T08:54:07.4496498Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:07.4519588Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:07.4545575Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:07.4575307Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:07.4604451Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:07.4631744Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:07.4654077Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:07.4678061Z Entering 'third_party/flash-attention' 2025-12-04T08:54:07.4707399Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:07.4731354Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:07.4757736Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:07.4784615Z Entering 'third_party/fmt' 2025-12-04T08:54:07.4811126Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:07.4838540Z Entering 'third_party/gloo' 2025-12-04T08:54:07.4862106Z Entering 'third_party/googletest' 2025-12-04T08:54:07.4884294Z Entering 'third_party/ideep' 2025-12-04T08:54:07.4907244Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:07.4934260Z Entering 'third_party/ittapi' 2025-12-04T08:54:07.4957296Z Entering 'third_party/kineto' 2025-12-04T08:54:07.4980178Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:07.5006335Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:07.5028183Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:07.5051039Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 
2025-12-04T08:54:07.5073407Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:07.5095010Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:07.5124282Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:07.5155940Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:07.5181937Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:07.5202303Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:07.5221608Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:07.5241272Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:07.5270417Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:07.5301078Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:07.5326707Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:07.5350754Z Entering 'third_party/kleidiai' 2025-12-04T08:54:07.5380659Z Entering 'third_party/mimalloc' 2025-12-04T08:54:07.5408543Z Entering 'third_party/nlohmann' 2025-12-04T08:54:07.5433794Z Entering 'third_party/onnx' 2025-12-04T08:54:07.5463779Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:07.5493888Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:07.5515226Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:07.5537126Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:07.5561160Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:07.5585292Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:07.5606644Z 
Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:07.5627625Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:07.5651502Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:07.5674503Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:07.5695959Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:07.5717521Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:07.5762761Z Entering 'third_party/pocketfft' 2025-12-04T08:54:07.5790458Z Entering 'third_party/protobuf' 2025-12-04T08:54:07.5819455Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:07.5842901Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:07.5867270Z Entering 'third_party/psimd' 2025-12-04T08:54:07.5888944Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:07.5909189Z Entering 'third_party/pybind11' 2025-12-04T08:54:07.5931693Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:07.5953402Z Entering 'third_party/sleef' 2025-12-04T08:54:07.5973539Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:07.5996182Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:07.6025087Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:07.6047512Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:07.6069843Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:07.6092577Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:07.6132286Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T08:54:07.6152885Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6163301Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 
2025-12-04T08:54:07.6181090Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T08:54:07.6351082Z Entering 'android/libs/fbjni' 2025-12-04T08:54:07.6364545Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6383648Z Entering 'third_party/FP16' 2025-12-04T08:54:07.6397047Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6414153Z Entering 'third_party/FXdiv' 2025-12-04T08:54:07.6428500Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6444921Z Entering 'third_party/NNPACK' 2025-12-04T08:54:07.6457905Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6482447Z Entering 'third_party/NVTX' 2025-12-04T08:54:07.6504645Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6527805Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:07.6543402Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6559521Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:07.6576000Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6598906Z Entering 'third_party/aiter' 2025-12-04T08:54:07.6615292Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6629505Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:07.6642663Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6662778Z Entering 'third_party/benchmark' 2025-12-04T08:54:07.6676235Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6700919Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:07.6714043Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6746560Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:07.6761326Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6777538Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:07.6790481Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6807674Z Entering 
'third_party/cudnn_frontend' 2025-12-04T08:54:07.6819987Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6835179Z Entering 'third_party/cutlass' 2025-12-04T08:54:07.6846878Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6867231Z Entering 'third_party/fbgemm' 2025-12-04T08:54:07.6880280Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6897366Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:07.6917068Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6935164Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:07.6952753Z http.https://github.com/.extraheader 2025-12-04T08:54:07.6974583Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:07.6997277Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7021758Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:07.7034967Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7055865Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:07.7067592Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7089237Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:07.7100783Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7118248Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:07.7131386Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7151131Z Entering 'third_party/flash-attention' 2025-12-04T08:54:07.7163280Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7179122Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:07.7193025Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7216164Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:07.7229238Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7257883Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:07.7269427Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7290051Z Entering 'third_party/fmt' 
2025-12-04T08:54:07.7304250Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7320289Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:07.7331628Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7347613Z Entering 'third_party/gloo' 2025-12-04T08:54:07.7360197Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7376457Z Entering 'third_party/googletest' 2025-12-04T08:54:07.7389081Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7406327Z Entering 'third_party/ideep' 2025-12-04T08:54:07.7418712Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7433724Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:07.7446251Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7466374Z Entering 'third_party/ittapi' 2025-12-04T08:54:07.7479049Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7493644Z Entering 'third_party/kineto' 2025-12-04T08:54:07.7504947Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7521042Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:07.7543537Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7563295Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:07.7576502Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7595895Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:07.7608557Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7625770Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:07.7638147Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7653735Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:07.7665727Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7686694Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:07.7709789Z http.https://github.com/.extraheader 
2025-12-04T08:54:07.7734607Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:07.7747348Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7765258Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:07.7778687Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7798739Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:07.7811120Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7832491Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:07.7844783Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7861347Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:07.7872604Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7888137Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:07.7905234Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7924454Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:07.7937449Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7959236Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:07.7971499Z http.https://github.com/.extraheader 2025-12-04T08:54:07.7990787Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:07.8003499Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8024151Z Entering 'third_party/kleidiai' 2025-12-04T08:54:07.8040381Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8063015Z Entering 'third_party/mimalloc' 2025-12-04T08:54:07.8077060Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8102525Z Entering 'third_party/nlohmann' 2025-12-04T08:54:07.8116274Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8138797Z 
Entering 'third_party/onnx' 2025-12-04T08:54:07.8149990Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8181258Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:07.8196444Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8225734Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:07.8240417Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8261300Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:07.8273401Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8293745Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:07.8307034Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8329807Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:07.8347741Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8368100Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:07.8381574Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8402516Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:07.8415182Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8434061Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:07.8446039Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8465391Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:07.8476721Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8492794Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:07.8505384Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8526183Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:07.8538861Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8559059Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:07.8581576Z 
http.https://github.com/.extraheader 2025-12-04T08:54:07.8617563Z Entering 'third_party/pocketfft' 2025-12-04T08:54:07.8630348Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8647321Z Entering 'third_party/protobuf' 2025-12-04T08:54:07.8659346Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8681209Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:07.8703103Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8722457Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:07.8736860Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8762853Z Entering 'third_party/psimd' 2025-12-04T08:54:07.8777102Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8801960Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:07.8814728Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8831576Z Entering 'third_party/pybind11' 2025-12-04T08:54:07.8847361Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8870184Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:07.8885530Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8913497Z Entering 'third_party/sleef' 2025-12-04T08:54:07.8927515Z http.https://github.com/.extraheader 2025-12-04T08:54:07.8948672Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:07.8980549Z http.https://github.com/.extraheader 2025-12-04T08:54:07.9000275Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:07.9019510Z http.https://github.com/.extraheader 2025-12-04T08:54:07.9039657Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:07.9062094Z http.https://github.com/.extraheader 2025-12-04T08:54:07.9079688Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:07.9096586Z http.https://github.com/.extraheader 2025-12-04T08:54:07.9116712Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:07.9131784Z http.https://github.com/.extraheader 2025-12-04T08:54:07.9150591Z Entering 
'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:07.9172307Z http.https://github.com/.extraheader 2025-12-04T08:54:07.9216464Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:07.9241012Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T08:54:07.9423715Z Entering 'android/libs/fbjni' 2025-12-04T08:54:07.9442435Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T08:54:07.9452641Z Entering 'third_party/FP16' 2025-12-04T08:54:07.9464329Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T08:54:07.9473765Z Entering 'third_party/FXdiv' 2025-12-04T08:54:07.9484348Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T08:54:07.9497841Z Entering 'third_party/NNPACK' 2025-12-04T08:54:07.9508814Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T08:54:07.9519832Z Entering 'third_party/NVTX' 2025-12-04T08:54:07.9532052Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T08:54:07.9542527Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:07.9554808Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T08:54:07.9568398Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:07.9581335Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T08:54:07.9596732Z Entering 'third_party/aiter' 2025-12-04T08:54:07.9606839Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T08:54:07.9616834Z Entering 
'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:07.9628118Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T08:54:07.9647562Z Entering 'third_party/benchmark' 2025-12-04T08:54:07.9658172Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:07.9667827Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:07.9678455Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T08:54:07.9695165Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:07.9708566Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T08:54:07.9718235Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:07.9728514Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T08:54:07.9741456Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:07.9752844Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T08:54:07.9763182Z Entering 'third_party/cutlass' 2025-12-04T08:54:07.9773505Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T08:54:07.9786700Z Entering 'third_party/fbgemm' 2025-12-04T08:54:07.9798124Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T08:54:07.9808524Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:07.9818328Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T08:54:07.9827422Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:07.9837073Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T08:54:07.9849614Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:07.9861092Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T08:54:07.9870006Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:07.9881086Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T08:54:07.9894271Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:07.9904175Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T08:54:07.9914303Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:07.9928057Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T08:54:07.9940843Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:07.9951320Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T08:54:07.9962856Z Entering 'third_party/flash-attention' 2025-12-04T08:54:07.9977094Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T08:54:07.9986580Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:08.0000806Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T08:54:08.0012096Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:08.0023746Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 
2025-12-04T08:54:08.0039265Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:08.0049511Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T08:54:08.0064152Z Entering 'third_party/fmt' 2025-12-04T08:54:08.0074998Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:08.0084810Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:08.0096249Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T08:54:08.0106692Z Entering 'third_party/gloo' 2025-12-04T08:54:08.0117529Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T08:54:08.0126587Z Entering 'third_party/googletest' 2025-12-04T08:54:08.0140024Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:08.0149227Z Entering 'third_party/ideep' 2025-12-04T08:54:08.0160011Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T08:54:08.0168705Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:08.0180181Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T08:54:08.0198071Z Entering 'third_party/ittapi' 2025-12-04T08:54:08.0208221Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T08:54:08.0216624Z Entering 'third_party/kineto' 2025-12-04T08:54:08.0229935Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T08:54:08.0238112Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:08.0251617Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T08:54:08.0262199Z 
Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:08.0271673Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T08:54:08.0286862Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:08.0299841Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T08:54:08.0310035Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:08.0324088Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:08.0334218Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:08.0344222Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T08:54:08.0352958Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:08.0363164Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T08:54:08.0373255Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:08.0382589Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T08:54:08.0390956Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:08.0399988Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:08.0408163Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:08.0417121Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T08:54:08.0425552Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:08.0437464Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T08:54:08.0444798Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:08.0462600Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:08.0469870Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:08.0479221Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:08.0489099Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:08.0500047Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:08.0512037Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:08.0521410Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config 
remote.origin.url 2025-12-04T08:54:08.0529721Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:08.0539517Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T08:54:08.0549936Z Entering 'third_party/kleidiai' 2025-12-04T08:54:08.0561547Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T08:54:08.0571098Z Entering 'third_party/mimalloc' 2025-12-04T08:54:08.0580514Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T08:54:08.0591191Z Entering 'third_party/nlohmann' 2025-12-04T08:54:08.0600806Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T08:54:08.0609919Z Entering 'third_party/onnx' 2025-12-04T08:54:08.0619394Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T08:54:08.0635285Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:08.0645793Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:08.0658235Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:08.0669166Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T08:54:08.0679175Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:08.0688953Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:08.0698113Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:08.0707958Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 
2025-12-04T08:54:08.0716215Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:08.0728107Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T08:54:08.0736144Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:08.0745545Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T08:54:08.0756838Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:08.0770781Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T08:54:08.0779369Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:08.0793291Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T08:54:08.0801800Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:08.0811877Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:08.0820801Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:08.0831011Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:08.0840752Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:08.0852710Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:08.0863046Z Entering 
'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:08.0877285Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T08:54:08.0895741Z Entering 'third_party/pocketfft' 2025-12-04T08:54:08.0905551Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T08:54:08.0914653Z Entering 'third_party/protobuf' 2025-12-04T08:54:08.0926601Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T08:54:08.0936945Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:08.0946895Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:08.0955144Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:08.0964001Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:08.0974356Z Entering 'third_party/psimd' 2025-12-04T08:54:08.0985170Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T08:54:08.0994004Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:08.1004725Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T08:54:08.1015891Z Entering 'third_party/pybind11' 2025-12-04T08:54:08.1026085Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:08.1038486Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:08.1048510Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T08:54:08.1058128Z Entering 'third_party/sleef' 2025-12-04T08:54:08.1068489Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T08:54:08.1080236Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:08.1090504Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T08:54:08.1104546Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:08.1118285Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:08.1127087Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:08.1137085Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T08:54:08.1145453Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:08.1156042Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T08:54:08.1164362Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:08.1177603Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:08.1185451Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:08.1198545Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T08:54:08.1225327Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1244348Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1258949Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1275325Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1290764Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1310979Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1325472Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1339677Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1353502Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1367701Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1387900Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1410273Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1429718Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config 
--name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1444426Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1459098Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1472763Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1491592Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1505691Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1520266Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1534450Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1548068Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1561883Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1575506Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1589382Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1605654Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1619787Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1633530Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1647831Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1661148Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1673862Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1688126Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1701948Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1714772Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1730295Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1750290Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1766612Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1783777Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1798130Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1813084Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1827828Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1841159Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 
2025-12-04T08:54:08.1855512Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1869418Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1883260Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1896108Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1908857Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1923255Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1937169Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1951691Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp 
^includeIf\.gitdir: 2025-12-04T08:54:08.1965791Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1979703Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.1992421Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2006952Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2021262Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2036060Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2051142Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2069327Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2084075Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2105489Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2120264Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2133877Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2146750Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2161719Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2176242Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2190728Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2205123Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2218896Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 
2025-12-04T08:54:08.2233601Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2249173Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2263898Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2279489Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2294746Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2309620Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2322658Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2337104Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2352817Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2368089Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 
2025-12-04T08:54:08.2381974Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2396065Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2411159Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2425895Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:08.2442845Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T08:54:08.2462681Z ##[endgroup] 2025-12-04T08:54:08.2462945Z ##[group]Fetching the repository 2025-12-04T08:54:08.2466319Z [command]/usr/bin/git -c protocol.version=2 fetch --prune --no-recurse-submodules origin +refs/heads/*:refs/remotes/origin/* +refs/tags/*:refs/tags/* 2025-12-04T08:54:09.4888857Z [command]/usr/bin/git rev-parse --verify --quiet ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32^{object} 2025-12-04T08:54:09.4992179Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:09.4995756Z ##[endgroup] 2025-12-04T08:54:09.4996137Z ##[group]Determining the checkout info 2025-12-04T08:54:09.4997599Z ##[endgroup] 2025-12-04T08:54:09.5002580Z [command]/usr/bin/git sparse-checkout disable 2025-12-04T08:54:09.5088085Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-12-04T08:54:09.5112267Z ##[group]Checking out the ref 2025-12-04T08:54:09.5114048Z [command]/usr/bin/git checkout --progress --force 
ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:09.5361092Z HEAD is now at ffd9b0fb4355 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T08:54:09.5366272Z ##[endgroup] 2025-12-04T08:54:09.5366562Z ##[group]Setting up auth for fetching submodules 2025-12-04T08:54:09.5369893Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T08:54:09.5393276Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-12-04T08:54:09.5410573Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-12-04T08:54:09.5432191Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-12-04T08:54:09.5447094Z ##[endgroup] 2025-12-04T08:54:09.5447291Z ##[group]Fetching submodules 2025-12-04T08:54:09.5449014Z [command]/usr/bin/git submodule sync --recursive 2025-12-04T08:54:09.5648537Z Synchronizing submodule url for 'android/libs/fbjni' 2025-12-04T08:54:09.5660205Z Synchronizing submodule url for 'third_party/FP16' 2025-12-04T08:54:09.5676607Z Synchronizing submodule url for 'third_party/FXdiv' 2025-12-04T08:54:09.5688555Z Synchronizing submodule url for 'third_party/NNPACK' 2025-12-04T08:54:09.5702588Z Synchronizing submodule url for 'third_party/NVTX' 2025-12-04T08:54:09.5715403Z Synchronizing submodule url for 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:09.5726690Z Synchronizing submodule url for 'third_party/XNNPACK' 2025-12-04T08:54:09.5743753Z Synchronizing submodule url for 'third_party/aiter' 2025-12-04T08:54:09.5757513Z Synchronizing submodule url for 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:09.5773734Z Synchronizing submodule url for 'third_party/benchmark' 2025-12-04T08:54:09.5787020Z Synchronizing submodule url for 'third_party/composable_kernel' 2025-12-04T08:54:09.5801755Z Synchronizing submodule url for 'third_party/cpp-httplib' 
2025-12-04T08:54:09.5811230Z Synchronizing submodule url for 'third_party/cpuinfo' 2025-12-04T08:54:09.5822355Z Synchronizing submodule url for 'third_party/cudnn_frontend' 2025-12-04T08:54:09.5832960Z Synchronizing submodule url for 'third_party/cutlass' 2025-12-04T08:54:09.5845902Z Synchronizing submodule url for 'third_party/fbgemm' 2025-12-04T08:54:09.5858800Z Synchronizing submodule url for 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:09.5868356Z Synchronizing submodule url for 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:09.5881905Z Synchronizing submodule url for 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:09.5891012Z Synchronizing submodule url for 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:09.5907553Z Synchronizing submodule url for 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:09.5917961Z Synchronizing submodule url for 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:09.5928175Z Synchronizing submodule url for 'third_party/fbgemm/external/json' 2025-12-04T08:54:09.5942924Z Synchronizing submodule url for 'third_party/flash-attention' 2025-12-04T08:54:09.5955054Z Synchronizing submodule url for 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:09.5968482Z Synchronizing submodule url for 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:09.5985212Z Synchronizing submodule url for 'third_party/flatbuffers' 2025-12-04T08:54:09.5996231Z Synchronizing submodule url for 'third_party/fmt' 2025-12-04T08:54:09.6007519Z Synchronizing submodule url for 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:09.6018456Z Synchronizing submodule url for 'third_party/gloo' 2025-12-04T08:54:09.6029265Z Synchronizing submodule url for 'third_party/googletest' 2025-12-04T08:54:09.6041640Z Synchronizing submodule url for 'third_party/ideep' 2025-12-04T08:54:09.6051978Z Synchronizing submodule url for 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:09.6066350Z Synchronizing 
submodule url for 'third_party/ittapi' 2025-12-04T08:54:09.6076922Z Synchronizing submodule url for 'third_party/kineto' 2025-12-04T08:54:09.6088870Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:09.6100838Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:09.6120641Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:09.6143573Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:09.6159313Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:09.6178764Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:09.6191146Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:09.6201745Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:09.6211151Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:09.6223085Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:09.6233765Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:09.6245348Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:09.6256348Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:09.6271603Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:09.6287235Z 
Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:09.6303305Z Synchronizing submodule url for 'third_party/kleidiai' 2025-12-04T08:54:09.6314968Z Synchronizing submodule url for 'third_party/mimalloc' 2025-12-04T08:54:09.6326040Z Synchronizing submodule url for 'third_party/nlohmann' 2025-12-04T08:54:09.6340233Z Synchronizing submodule url for 'third_party/onnx' 2025-12-04T08:54:09.6357853Z Synchronizing submodule url for 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:09.6371843Z Synchronizing submodule url for 'third_party/opentelemetry-cpp' 2025-12-04T08:54:09.6383115Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:09.6393637Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:09.6404345Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:09.6414931Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:09.6425824Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:09.6436367Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:09.6451001Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:09.6466497Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:09.6488629Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:09.6506809Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:09.6527155Z Synchronizing submodule url for 'third_party/pocketfft' 2025-12-04T08:54:09.6537867Z Synchronizing submodule url for 'third_party/protobuf' 
2025-12-04T08:54:09.6567299Z Synchronizing submodule url for 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:09.6583689Z Synchronizing submodule url for 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:09.6607356Z Synchronizing submodule url for 'third_party/psimd' 2025-12-04T08:54:09.6623294Z Synchronizing submodule url for 'third_party/pthreadpool' 2025-12-04T08:54:09.6636589Z Synchronizing submodule url for 'third_party/pybind11' 2025-12-04T08:54:09.6648296Z Synchronizing submodule url for 'third_party/python-peachpy' 2025-12-04T08:54:09.6659054Z Synchronizing submodule url for 'third_party/sleef' 2025-12-04T08:54:09.6675038Z Synchronizing submodule url for 'third_party/tensorpipe' 2025-12-04T08:54:09.6688196Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:09.6699606Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:09.6710299Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:09.6726559Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:09.6737299Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:09.6759728Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --recursive 2025-12-04T08:54:09.6976387Z Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2025-12-04T08:54:09.7027801Z Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2025-12-04T08:54:09.7074181Z Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2025-12-04T08:54:09.7120426Z Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2025-12-04T08:54:09.7183357Z Submodule path 'third_party/NVTX': checked out '3ebbc93ded7285963bff932c678fa367eb393ba6' 
2025-12-04T08:54:09.7243504Z Submodule path 'third_party/VulkanMemoryAllocator': checked out '1d8f600fd424278486eade7ed3e877c99f0846b1' 2025-12-04T08:54:09.7385160Z Submodule path 'third_party/XNNPACK': checked out '51a0103656eff6fc9bfd39a4597923c4b542c883' 2025-12-04T08:54:09.7533675Z Submodule path 'third_party/aiter': checked out '01aae101b9e5e94d6c16a9514c9fb8df99c93150' 2025-12-04T08:54:09.7709233Z Submodule path 'third_party/aiter/3rdparty/composable_kernel': checked out 'cffe8fa2a442ac8e80dd236a1a5d24fe3d7e0cbf' 2025-12-04T08:54:09.7764158Z Submodule path 'third_party/benchmark': checked out '299e5928955cc62af9968370293b916f5130916f' 2025-12-04T08:54:09.7948147Z Submodule path 'third_party/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T08:54:09.8030898Z Submodule path 'third_party/cpp-httplib': checked out '89c932f313c6437c38f2982869beacc89c2f2246' 2025-12-04T08:54:09.8089795Z Submodule path 'third_party/cpuinfo': checked out 'f858c30bcb16f8effd5ff46996f0514539e17abc' 2025-12-04T08:54:09.8174726Z Submodule path 'third_party/cudnn_frontend': checked out '0b1577c8c83401237d601d0d0db5210506705396' 2025-12-04T08:54:09.8293840Z Submodule path 'third_party/cutlass': checked out 'f88806b1e31dfa579842638740216dd41fc6c588' 2025-12-04T08:54:09.8423730Z Submodule path 'third_party/fbgemm': checked out 'c0b988d39a9e47c794d699f29930ed4d7c7e13a4' 2025-12-04T08:54:09.8501578Z Submodule path 'third_party/fbgemm/external/asmjit': checked out 'a3199e8857792cd10b7589ff5d58343d2c9008ea' 2025-12-04T08:54:09.8689931Z Submodule path 'third_party/fbgemm/external/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T08:54:09.8758594Z Submodule path 'third_party/fbgemm/external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-12-04T08:54:09.8874813Z Submodule path 'third_party/fbgemm/external/cutlass': checked out '98125ce499b0fdf7ffbe0e3052f5b8709f4840f8' 2025-12-04T08:54:09.8947901Z 
Submodule path 'third_party/fbgemm/external/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:09.9019330Z Submodule path 'third_party/fbgemm/external/hipify_torch': checked out '63b6a7b541fa7f08f8475ca7d74054db36ff2691' 2025-12-04T08:54:09.9105861Z Submodule path 'third_party/fbgemm/external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-12-04T08:54:09.9194082Z Submodule path 'third_party/flash-attention': checked out '979702c87a8713a8e0a5e9fee122b90d2ef13be5' 2025-12-04T08:54:09.9369102Z Submodule path 'third_party/flash-attention/csrc/composable_kernel': checked out '888317e698e9803c62bd38568abc9e05d7709f33' 2025-12-04T08:54:09.9473323Z Submodule path 'third_party/flash-attention/csrc/cutlass': checked out 'c506e16788cb08416a4a57e11a9067beeee29420' 2025-12-04T08:54:09.9569251Z Submodule path 'third_party/flatbuffers': checked out 'a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757' 2025-12-04T08:54:09.9629746Z Submodule path 'third_party/fmt': checked out '407c905e45ad75fc29bf0f9bb7c5c2fd3475976f' 2025-12-04T08:54:09.9682774Z Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2025-12-04T08:54:09.9742706Z Submodule path 'third_party/gloo': checked out '54cbae0d3a67fa890b4c3d9ee162b7860315e341' 2025-12-04T08:54:09.9802180Z Submodule path 'third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:09.9862783Z Submodule path 'third_party/ideep': checked out '719d8e6cd7f7a0e01b155657526d693acf97c2b3' 2025-12-04T08:54:10.0037259Z Submodule path 'third_party/ideep/mkl-dnn': checked out '8d263e693366ef8db40acc569cc7d8edf644556d' 2025-12-04T08:54:10.0099783Z Submodule path 'third_party/ittapi': checked out 'dec1d23ca65ab069d225dfe40dea14f455170959' 2025-12-04T08:54:10.0171478Z Submodule path 'third_party/kineto': checked out '31f85df8fbd89c188f14ef10f1ec65379786b943' 2025-12-04T08:54:10.0247285Z Submodule path 
'third_party/kineto/libkineto/third_party/dynolog': checked out 'd2ffe0a4e3acace628db49974246b66fc3e85fb1' 2025-12-04T08:54:10.0339260Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2025-12-04T08:54:10.0394553Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2025-12-04T08:54:10.0467471Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2025-12-04T08:54:10.0528493Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2025-12-04T08:54:10.0593248Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2025-12-04T08:54:10.0664648Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2025-12-04T08:54:10.0726487Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:10.0807815Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 2025-12-04T08:54:10.0858170Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2025-12-04T08:54:10.0923678Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp': checked out 'b1234816facfdda29845c46696a02998a4af115a' 2025-12-04T08:54:10.1009144Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb': checked out 
'd7ba35bbb649209c66e582d5a0244ba988a15159' 2025-12-04T08:54:10.1073198Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T08:54:10.1161214Z Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out '40626af88bd7df9a5fb80be7b25ac85b122d6c21' 2025-12-04T08:54:10.1221475Z Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:10.1300648Z Submodule path 'third_party/kleidiai': checked out 'd7770c89632329a9914ef1a90289917597639cbe' 2025-12-04T08:54:10.1380922Z Submodule path 'third_party/mimalloc': checked out 'fbd8b99c2b828428947d70fdc046bb55609be93e' 2025-12-04T08:54:10.1475742Z Submodule path 'third_party/nlohmann': checked out '55f93686c01528224f448c19128836e7df245f72' 2025-12-04T08:54:10.1621594Z Submodule path 'third_party/onnx': checked out 'e709452ef2bbc1d113faf678c24e6d3467696e83' 2025-12-04T08:54:10.1711473Z Submodule path 'third_party/onnx/third_party/pybind11': checked out 'a2e59f0e7065404b44dfe92a28aca47ba1378dc4' 2025-12-04T08:54:10.1810803Z Submodule path 'third_party/opentelemetry-cpp': checked out 'a799f4aed9c94b765dcdaabaeab7d5e7e2310878' 2025-12-04T08:54:10.1882665Z Submodule path 'third_party/opentelemetry-cpp/third_party/benchmark': checked out 'd572f4777349d43653b21d6c2fc63020ab326db2' 2025-12-04T08:54:10.1949911Z Submodule path 'third_party/opentelemetry-cpp/third_party/googletest': checked out 'b796f7d44681514f58a683a3a71ff17c94edb0c1' 2025-12-04T08:54:10.2006959Z Submodule path 'third_party/opentelemetry-cpp/third_party/ms-gsl': checked out '6f4529395c5b7c2d661812257cd6780c67e54afa' 2025-12-04T08:54:10.2092862Z Submodule path 'third_party/opentelemetry-cpp/third_party/nlohmann-json': checked out 'bc889afb4c5bf1c0d8ee29ef35eaaf4c8bef8a5d' 2025-12-04T08:54:10.2155431Z Submodule path 
'third_party/opentelemetry-cpp/third_party/opentelemetry-proto': checked out '4ca4f0335c63cda7ab31ea7ed70d6553aee14dce' 2025-12-04T08:54:10.2211187Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp': checked out '06b57f48ded1fa3bdd3d4346f6ef29e40e08eaf5' 2025-12-04T08:54:10.2267684Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp': checked out 'c9ffcdda9086ffd9e1283ea7a0276d831f3c8a8d' 2025-12-04T08:54:10.2346581Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'eefb26f82b233268fc98577d265352720d477ba4' 2025-12-04T08:54:10.2407169Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T08:54:10.2555591Z Submodule path 'third_party/opentelemetry-cpp/tools/vcpkg': checked out '8eb57355a4ffb410a2e94c07b4dca2dffbee8e50' 2025-12-04T08:54:10.2619463Z Submodule path 'third_party/pocketfft': checked out '0fa0ef591e38c2758e3184c6c23e497b9f732ffa' 2025-12-04T08:54:10.2768567Z Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2025-12-04T08:54:10.2833792Z Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2025-12-04T08:54:10.2894007Z Submodule path 'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2025-12-04T08:54:10.2950450Z Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2025-12-04T08:54:10.2997951Z Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2025-12-04T08:54:10.3078931Z Submodule path 'third_party/pybind11': checked out 'f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8' 2025-12-04T08:54:10.3136014Z Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 
2025-12-04T08:54:10.3195572Z Submodule path 'third_party/sleef': checked out '5a1d179df9cf652951b59010a2d2075372d67f68' 2025-12-04T08:54:10.3261971Z Submodule path 'third_party/tensorpipe': checked out '2b4cd91092d335a697416b2a3cb398283246849d' 2025-12-04T08:54:10.3326423Z Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2025-12-04T08:54:10.3388052Z Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2025-12-04T08:54:10.3540085Z Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '5152db2cbfeb5582e9c27c5ea1dba2cd9e10759b' 2025-12-04T08:54:10.3611861Z Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2025-12-04T08:54:10.3675929Z Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2025-12-04T08:54:10.3702530Z [command]/usr/bin/git submodule foreach --recursive git config --local gc.auto 0 2025-12-04T08:54:10.3893153Z Entering 'android/libs/fbjni' 2025-12-04T08:54:10.3913284Z Entering 'third_party/FP16' 2025-12-04T08:54:10.3933182Z Entering 'third_party/FXdiv' 2025-12-04T08:54:10.3952646Z Entering 'third_party/NNPACK' 2025-12-04T08:54:10.3975831Z Entering 'third_party/NVTX' 2025-12-04T08:54:10.3995983Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:10.4017436Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:10.4042774Z Entering 'third_party/aiter' 2025-12-04T08:54:10.4067611Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:10.4096789Z Entering 'third_party/benchmark' 2025-12-04T08:54:10.4124878Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:10.4148938Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:10.4169440Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:10.4189868Z Entering 'third_party/cudnn_frontend' 
2025-12-04T08:54:10.4215334Z Entering 'third_party/cutlass' 2025-12-04T08:54:10.4245155Z Entering 'third_party/fbgemm' 2025-12-04T08:54:10.4266909Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:10.4287954Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:10.4313724Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:10.4341459Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:10.4371179Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:10.4397828Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:10.4426648Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:10.4449972Z Entering 'third_party/flash-attention' 2025-12-04T08:54:10.4474660Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:10.4501131Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:10.4527137Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:10.4548499Z Entering 'third_party/fmt' 2025-12-04T08:54:10.4568361Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:10.4591613Z Entering 'third_party/gloo' 2025-12-04T08:54:10.4610125Z Entering 'third_party/googletest' 2025-12-04T08:54:10.4632197Z Entering 'third_party/ideep' 2025-12-04T08:54:10.4657632Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:10.4680770Z Entering 'third_party/ittapi' 2025-12-04T08:54:10.4709578Z Entering 'third_party/kineto' 2025-12-04T08:54:10.4739624Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:10.4759099Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:10.4779692Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:10.4803343Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:10.4821993Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:10.4842216Z 
Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:10.4862390Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:10.4881881Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:10.4905094Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:10.4925881Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:10.4954003Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:10.4975472Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:10.5002452Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:10.5032219Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:10.5058483Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:10.5080047Z Entering 'third_party/kleidiai' 2025-12-04T08:54:10.5113938Z Entering 'third_party/mimalloc' 2025-12-04T08:54:10.5136380Z Entering 'third_party/nlohmann' 2025-12-04T08:54:10.5161994Z Entering 'third_party/onnx' 2025-12-04T08:54:10.5191401Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:10.5215183Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:10.5243578Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:10.5265517Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:10.5290798Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:10.5309818Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:10.5329103Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:10.5353945Z Entering 
'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:10.5374700Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:10.5394708Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:10.5415228Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:10.5436347Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:10.5462228Z Entering 'third_party/pocketfft' 2025-12-04T08:54:10.5483255Z Entering 'third_party/protobuf' 2025-12-04T08:54:10.5503845Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:10.5522962Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:10.5542670Z Entering 'third_party/psimd' 2025-12-04T08:54:10.5565271Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:10.5586991Z Entering 'third_party/pybind11' 2025-12-04T08:54:10.5612879Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:10.5633336Z Entering 'third_party/sleef' 2025-12-04T08:54:10.5653630Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:10.5672331Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:10.5690148Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:10.5708064Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:10.5736323Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:10.5758186Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:10.5789720Z ##[endgroup] 2025-12-04T08:54:10.5790054Z ##[group]Persisting credentials for submodules 2025-12-04T08:54:10.5797497Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-12-04T08:54:10.5963379Z Entering 'android/libs/fbjni' 
2025-12-04T08:54:10.5976994Z url.https://github.com/.insteadof 2025-12-04T08:54:10.5977204Z url.https://github.com/.insteadof 2025-12-04T08:54:10.5998748Z Entering 'third_party/FP16' 2025-12-04T08:54:10.6015312Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6015441Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6038709Z Entering 'third_party/FXdiv' 2025-12-04T08:54:10.6054089Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6054220Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6070885Z Entering 'third_party/NNPACK' 2025-12-04T08:54:10.6089222Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6089344Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6104796Z Entering 'third_party/NVTX' 2025-12-04T08:54:10.6117027Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6117150Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6139774Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:10.6152474Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6152590Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6169107Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:10.6181412Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6181535Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6212473Z Entering 'third_party/aiter' 2025-12-04T08:54:10.6226880Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6226999Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6250829Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:10.6263577Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6263698Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6288948Z Entering 'third_party/benchmark' 2025-12-04T08:54:10.6303404Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6303527Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6326871Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:10.6339995Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6340153Z 
url.https://github.com/.insteadof 2025-12-04T08:54:10.6363857Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:10.6377382Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6377502Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6396966Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:10.6409591Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6409715Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6428073Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:10.6440372Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6440498Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6457403Z Entering 'third_party/cutlass' 2025-12-04T08:54:10.6471294Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6471413Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6493756Z Entering 'third_party/fbgemm' 2025-12-04T08:54:10.6507206Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6507329Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6527998Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:10.6543721Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6543844Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6558646Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:10.6572138Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6572764Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6593881Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:10.6614437Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6614655Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6636780Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:10.6649594Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6649796Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6673907Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:10.6689733Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6689926Z url.https://github.com/.insteadof 
2025-12-04T08:54:10.6707182Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:10.6725575Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6725764Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6744817Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:10.6767945Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6768097Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6793242Z Entering 'third_party/flash-attention' 2025-12-04T08:54:10.6807704Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6807831Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6828976Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:10.6843682Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6844020Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6863996Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:10.6887141Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6887283Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6912744Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:10.6929620Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6929981Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6951126Z Entering 'third_party/fmt' 2025-12-04T08:54:10.6966045Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6966308Z url.https://github.com/.insteadof 2025-12-04T08:54:10.6994828Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:10.7016843Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7017083Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7040211Z Entering 'third_party/gloo' 2025-12-04T08:54:10.7055039Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7055274Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7077232Z Entering 'third_party/googletest' 2025-12-04T08:54:10.7091077Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7091319Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7108521Z 
Entering 'third_party/ideep' 2025-12-04T08:54:10.7121541Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7121753Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7138242Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:10.7154936Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7155419Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7177458Z Entering 'third_party/ittapi' 2025-12-04T08:54:10.7190609Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7190802Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7206196Z Entering 'third_party/kineto' 2025-12-04T08:54:10.7218067Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7218245Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7241140Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:10.7264421Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7264574Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7285160Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:10.7297847Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7298012Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7315769Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:10.7334139Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7334314Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7359263Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:10.7372250Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7372414Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7392302Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:10.7407224Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7407385Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7423861Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 
2025-12-04T08:54:10.7437938Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7438087Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7458548Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:10.7470312Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7470565Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7486587Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:10.7499386Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7499527Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7514837Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:10.7531943Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7532088Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7549976Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:10.7563324Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7563646Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7586131Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:10.7598654Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7598863Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7614333Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:10.7634312Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7653772Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7654014Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:10.7668116Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7668274Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7690411Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:10.7708944Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7709099Z 
url.https://github.com/.insteadof 2025-12-04T08:54:10.7727464Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:10.7740021Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7740203Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7758140Z Entering 'third_party/kleidiai' 2025-12-04T08:54:10.7775444Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7775596Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7799550Z Entering 'third_party/mimalloc' 2025-12-04T08:54:10.7812690Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7812835Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7828989Z Entering 'third_party/nlohmann' 2025-12-04T08:54:10.7841207Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7841342Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7858480Z Entering 'third_party/onnx' 2025-12-04T08:54:10.7870137Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7870266Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7896625Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:10.7913484Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7913634Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7934464Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:10.7948221Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7948373Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7971157Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:10.7984644Z url.https://github.com/.insteadof 2025-12-04T08:54:10.7984796Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8003186Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:10.8015715Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8015871Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8042172Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:10.8051087Z url.https://github.com/.insteadof 
2025-12-04T08:54:10.8051243Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8067000Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:10.8079083Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8079218Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8097115Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:10.8110818Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8110959Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8139854Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:10.8161381Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8161533Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8190144Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:10.8209674Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8209816Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8230670Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:10.8249019Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8249167Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8276782Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:10.8297418Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8297567Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8324431Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:10.8341289Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8341427Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8370715Z Entering 'third_party/pocketfft' 2025-12-04T08:54:10.8385931Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8386057Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8404201Z Entering 'third_party/protobuf' 2025-12-04T08:54:10.8417581Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8417704Z 
url.https://github.com/.insteadof 2025-12-04T08:54:10.8434992Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:10.8447164Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8447288Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8468198Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:10.8480449Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8480574Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8521576Z Entering 'third_party/psimd' 2025-12-04T08:54:10.8537600Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8538822Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8561165Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:10.8584218Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8584418Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8605916Z Entering 'third_party/pybind11' 2025-12-04T08:54:10.8632749Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8632950Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8648050Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:10.8669740Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8670459Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8690163Z Entering 'third_party/sleef' 2025-12-04T08:54:10.8707641Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8707813Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8725975Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:10.8746314Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8746527Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8773339Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:10.8788233Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8788417Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8806466Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:10.8818599Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8818775Z url.https://github.com/.insteadof 
2025-12-04T08:54:10.8835954Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:10.8847666Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8847836Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8865085Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:10.8877477Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8877639Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8894706Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:10.8906664Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8906827Z url.https://github.com/.insteadof 2025-12-04T08:54:10.8937391Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-12-04T08:54:10.9097571Z Entering 'android/libs/fbjni' 2025-12-04T08:54:10.9120604Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T08:54:10.9131232Z Entering 'third_party/FP16' 2025-12-04T08:54:10.9153283Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T08:54:10.9162693Z Entering 'third_party/FXdiv' 2025-12-04T08:54:10.9183283Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T08:54:10.9193104Z Entering 'third_party/NNPACK' 2025-12-04T08:54:10.9214989Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T08:54:10.9225280Z Entering 'third_party/NVTX' 2025-12-04T08:54:10.9247715Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T08:54:10.9258251Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:10.9276706Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T08:54:10.9285661Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:10.9304274Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T08:54:10.9327165Z Entering 'third_party/aiter' 2025-12-04T08:54:10.9348155Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T08:54:10.9358919Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:10.9380566Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T08:54:10.9394991Z Entering 'third_party/benchmark' 2025-12-04T08:54:10.9415118Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:10.9424765Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:10.9448070Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T08:54:10.9461223Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:10.9481441Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T08:54:10.9491936Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:10.9512188Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T08:54:10.9523528Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:10.9543357Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T08:54:10.9554516Z Entering 'third_party/cutlass' 2025-12-04T08:54:10.9579088Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T08:54:10.9593052Z Entering 'third_party/fbgemm' 2025-12-04T08:54:10.9613517Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T08:54:10.9625437Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:10.9646064Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T08:54:10.9655410Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:10.9676762Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T08:54:10.9691215Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:10.9711522Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T08:54:10.9723214Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:10.9746692Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T08:54:10.9768589Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:10.9792202Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T08:54:10.9801537Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:10.9825185Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T08:54:10.9834666Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:10.9853352Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T08:54:10.9867018Z Entering 'third_party/flash-attention' 2025-12-04T08:54:10.9885439Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T08:54:10.9895917Z Entering 
'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:10.9921319Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T08:54:10.9934708Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:10.9954639Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T08:54:10.9974879Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:10.9996346Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T08:54:11.0007948Z Entering 'third_party/fmt' 2025-12-04T08:54:11.0027849Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:11.0037799Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:11.0059579Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T08:54:11.0068931Z Entering 'third_party/gloo' 2025-12-04T08:54:11.0093100Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T08:54:11.0103567Z Entering 'third_party/googletest' 2025-12-04T08:54:11.0122517Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:11.0132153Z Entering 'third_party/ideep' 2025-12-04T08:54:11.0149150Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T08:54:11.0156767Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:11.0175792Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T08:54:11.0191889Z Entering 'third_party/ittapi' 2025-12-04T08:54:11.0212277Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 
2025-12-04T08:54:11.0223483Z Entering 'third_party/kineto' 2025-12-04T08:54:11.0243460Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T08:54:11.0253493Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:11.0273934Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T08:54:11.0287756Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:11.0312508Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T08:54:11.0322570Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:11.0341392Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T08:54:11.0350639Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:11.0372040Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:11.0382495Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:11.0405398Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T08:54:11.0415098Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:11.0439745Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T08:54:11.0451478Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:11.0471837Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T08:54:11.0481020Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:11.0503376Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:11.0513294Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:11.0533467Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T08:54:11.0545933Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:11.0570640Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T08:54:11.0581555Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:11.0600374Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:11.0610893Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:11.0632742Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:11.0647545Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:11.0669581Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:11.0684074Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:11.0703237Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T08:54:11.0712940Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:11.0733227Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T08:54:11.0744653Z Entering 'third_party/kleidiai' 2025-12-04T08:54:11.0765106Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T08:54:11.0775769Z Entering 'third_party/mimalloc' 2025-12-04T08:54:11.0795127Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T08:54:11.0806330Z Entering 'third_party/nlohmann' 2025-12-04T08:54:11.0827539Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T08:54:11.0839222Z Entering 'third_party/onnx' 2025-12-04T08:54:11.0861781Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T08:54:11.0877953Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:11.0900905Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:11.0920375Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:11.0939748Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T08:54:11.0950590Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:11.0979175Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:11.0994443Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:11.1016872Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:11.1027438Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:11.1046589Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T08:54:11.1055431Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:11.1073773Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T08:54:11.1083310Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:11.1104767Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T08:54:11.1114053Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:11.1136947Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T08:54:11.1146138Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:11.1164393Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:11.1179006Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:11.1199946Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:11.1210657Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:11.1230918Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:11.1246823Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:11.1266754Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T08:54:11.1292463Z Entering 'third_party/pocketfft' 2025-12-04T08:54:11.1314249Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T08:54:11.1324181Z Entering 'third_party/protobuf' 2025-12-04T08:54:11.1343925Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T08:54:11.1357780Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:11.1376016Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:11.1391218Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:11.1414804Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:11.1427078Z Entering 'third_party/psimd' 2025-12-04T08:54:11.1446283Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T08:54:11.1456685Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:11.1479281Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 
2025-12-04T08:54:11.1489493Z Entering 'third_party/pybind11' 2025-12-04T08:54:11.1512341Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:11.1523077Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:11.1543536Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T08:54:11.1553534Z Entering 'third_party/sleef' 2025-12-04T08:54:11.1577643Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T08:54:11.1587556Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:11.1611179Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T08:54:11.1620892Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:11.1640704Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:11.1652755Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:11.1676456Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T08:54:11.1686247Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:11.1709064Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T08:54:11.1718213Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:11.1735805Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:11.1744695Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:11.1763833Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config 
remote.origin.url 2025-12-04T08:54:11.1977319Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-12-04T08:54:11.2157049Z Entering 'android/libs/fbjni' 2025-12-04T08:54:11.2179003Z Entering 'third_party/FP16' 2025-12-04T08:54:11.2200865Z Entering 'third_party/FXdiv' 2025-12-04T08:54:11.2220288Z Entering 'third_party/NNPACK' 2025-12-04T08:54:11.2242012Z Entering 'third_party/NVTX' 2025-12-04T08:54:11.2263148Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:11.2282063Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:11.2308450Z Entering 'third_party/aiter' 2025-12-04T08:54:11.2333130Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:11.2359558Z Entering 'third_party/benchmark' 2025-12-04T08:54:11.2383475Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:11.2416159Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:11.2437806Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:11.2462797Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:11.2488310Z Entering 'third_party/cutlass' 2025-12-04T08:54:11.2510790Z Entering 'third_party/fbgemm' 2025-12-04T08:54:11.2531088Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:11.2550264Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:11.2572416Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:11.2591662Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:11.2614818Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:11.2634202Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:11.2653287Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:11.2674205Z Entering 'third_party/flash-attention' 2025-12-04T08:54:11.2700629Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:11.2724348Z Entering 'third_party/flash-attention/csrc/cutlass' 
2025-12-04T08:54:11.2749651Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:11.2775684Z Entering 'third_party/fmt' 2025-12-04T08:54:11.2801552Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:11.2824234Z Entering 'third_party/gloo' 2025-12-04T08:54:11.2855036Z Entering 'third_party/googletest' 2025-12-04T08:54:11.2880297Z Entering 'third_party/ideep' 2025-12-04T08:54:11.2902356Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:11.2924883Z Entering 'third_party/ittapi' 2025-12-04T08:54:11.2946130Z Entering 'third_party/kineto' 2025-12-04T08:54:11.2969811Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:11.2991304Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:11.3018171Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:11.3040043Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:11.3059492Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:11.3088612Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:11.3113220Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:11.3136008Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:11.3155304Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:11.3174581Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:11.3192304Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:11.3210025Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:11.3230819Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 
2025-12-04T08:54:11.3259058Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:11.3279284Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:11.3303880Z Entering 'third_party/kleidiai' 2025-12-04T08:54:11.3327727Z Entering 'third_party/mimalloc' 2025-12-04T08:54:11.3347317Z Entering 'third_party/nlohmann' 2025-12-04T08:54:11.3368645Z Entering 'third_party/onnx' 2025-12-04T08:54:11.3399185Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:11.3427336Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:11.3450391Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:11.3468244Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:11.3487997Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:11.3505812Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:11.3523830Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:11.3545553Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:11.3571411Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:11.3595199Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:11.3617441Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:11.3641049Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:11.3668099Z Entering 'third_party/pocketfft' 2025-12-04T08:54:11.3687881Z Entering 'third_party/protobuf' 2025-12-04T08:54:11.3707261Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:11.3727192Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:11.3747845Z Entering 'third_party/psimd' 2025-12-04T08:54:11.3767612Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:11.3787070Z Entering 
'third_party/pybind11' 2025-12-04T08:54:11.3807192Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:11.3831394Z Entering 'third_party/sleef' 2025-12-04T08:54:11.3851377Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:11.3874235Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:11.3894716Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:11.3917115Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:11.3940577Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:11.3960969Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:11.3995450Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-12-04T08:54:11.4153016Z Entering 'android/libs/fbjni' 2025-12-04T08:54:11.4175517Z Entering 'third_party/FP16' 2025-12-04T08:54:11.4195559Z Entering 'third_party/FXdiv' 2025-12-04T08:54:11.4215188Z Entering 'third_party/NNPACK' 2025-12-04T08:54:11.4237070Z Entering 'third_party/NVTX' 2025-12-04T08:54:11.4259385Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:11.4280025Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:11.4323147Z Entering 'third_party/aiter' 2025-12-04T08:54:11.4349203Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:11.4376154Z Entering 'third_party/benchmark' 2025-12-04T08:54:11.4416513Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:11.4444913Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:11.4469517Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:11.4490977Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:11.4512849Z Entering 'third_party/cutlass' 2025-12-04T08:54:11.4536779Z Entering 'third_party/fbgemm' 2025-12-04T08:54:11.4563990Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:11.4583570Z Entering 'third_party/fbgemm/external/composable_kernel' 
2025-12-04T08:54:11.4611988Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:11.4631275Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:11.4654204Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:11.4678536Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:11.4698979Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:11.4728952Z Entering 'third_party/flash-attention' 2025-12-04T08:54:11.4754704Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:11.4777718Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:11.4802222Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:11.4826828Z Entering 'third_party/fmt' 2025-12-04T08:54:11.4858564Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:11.4881323Z Entering 'third_party/gloo' 2025-12-04T08:54:11.4903854Z Entering 'third_party/googletest' 2025-12-04T08:54:11.4927646Z Entering 'third_party/ideep' 2025-12-04T08:54:11.4948158Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:11.4970254Z Entering 'third_party/ittapi' 2025-12-04T08:54:11.4992691Z Entering 'third_party/kineto' 2025-12-04T08:54:11.5026677Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:11.5050617Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:11.5074968Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:11.5097930Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:11.5122455Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:11.5143294Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:11.5165558Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:11.5191692Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:11.5218721Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:11.5242562Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:11.5264628Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:11.5285730Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:11.5309518Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:11.5333153Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:11.5353998Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:11.5377839Z Entering 'third_party/kleidiai' 2025-12-04T08:54:11.5410174Z Entering 'third_party/mimalloc' 2025-12-04T08:54:11.5431325Z Entering 'third_party/nlohmann' 2025-12-04T08:54:11.5458126Z Entering 'third_party/onnx' 2025-12-04T08:54:11.5484967Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:11.5516874Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:11.5549494Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:11.5570176Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:11.5594503Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:11.5621149Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:11.5643256Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:11.5665233Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:11.5690932Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:11.5711772Z Entering 
'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:11.5732472Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:11.5754692Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:11.5786720Z Entering 'third_party/pocketfft' 2025-12-04T08:54:11.5807480Z Entering 'third_party/protobuf' 2025-12-04T08:54:11.5828903Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:11.5848770Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:11.5869965Z Entering 'third_party/psimd' 2025-12-04T08:54:11.5889884Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:11.5910741Z Entering 'third_party/pybind11' 2025-12-04T08:54:11.5930246Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:11.5948320Z Entering 'third_party/sleef' 2025-12-04T08:54:11.5967205Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:11.5985628Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:11.6006977Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:11.6026712Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:11.6047696Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:11.6068721Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:11.6101925Z ##[endgroup] 2025-12-04T08:54:11.6242063Z [command]/usr/bin/git log -1 --format=%H 2025-12-04T08:54:11.6316342Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:11.6441016Z Prepare all required actions 2025-12-04T08:54:11.6441298Z Getting action download info 2025-12-04T08:54:11.9074751Z Download action repository 'aws-actions/amazon-ecr-login@062b18b96a7aff071d4dc91bc00c4c1a7945b076' (SHA:062b18b96a7aff071d4dc91bc00c4c1a7945b076) 2025-12-04T08:54:12.8336376Z ##[group]Run ./.github/actions/setup-rocm 2025-12-04T08:54:12.8336509Z env: 2025-12-04T08:54:12.8336594Z GIT_DEFAULT_BRANCH: main 
2025-12-04T08:54:12.8336695Z ##[endgroup] 2025-12-04T08:54:12.8350771Z ##[group]Run dpkg -l | grep -E " rocm" 2025-12-04T08:54:12.8350916Z dpkg -l | grep -E " rocm" 2025-12-04T08:54:12.8355163Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:12.8355303Z env: 2025-12-04T08:54:12.8355385Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:12.8355488Z ##[endgroup] 2025-12-04T08:54:12.8418642Z ii rocm-cmake 0.14.0.60401-83~22.04 amd64 rocm-cmake built using CMake 2025-12-04T08:54:12.8419663Z ii rocm-core 6.4.1.60401-83~22.04 amd64 ROCm Runtime software stack 2025-12-04T08:54:12.8420317Z ii rocm-dbgapi 0.77.2.60401-83~22.04 amd64 Library to provide AMD GPU debugger API 2025-12-04T08:54:12.8420906Z ii rocm-debug-agent 2.0.4.60401-83~22.04 amd64 Radeon Open Compute Debug Agent (ROCdebug-agent) 2025-12-04T08:54:12.8421369Z ii rocm-dev 6.4.1.60401-83~22.04 amd64 Radeon Open Compute (ROCm) Runtime software stack 2025-12-04T08:54:12.8421830Z ii rocm-device-libs 1.0.0.60401-83~22.04 amd64 Radeon Open Compute - device libraries 2025-12-04T08:54:12.8422220Z ii rocm-gdb 15.2.60401-83~22.04 amd64 ROCgdb 2025-12-04T08:54:12.8422600Z ii rocm-llvm 19.0.0.25184.60401-83~22.04 amd64 ROCm core compiler 2025-12-04T08:54:12.8422991Z ii rocm-opencl 2.0.0.60401-83~22.04 amd64 clr built using CMake 2025-12-04T08:54:12.8423367Z ii rocm-opencl-dev 2.0.0.60401-83~22.04 amd64 clr built using CMake 2025-12-04T08:54:12.8423767Z ii rocm-smi-lib 7.5.0.60401-83~22.04 amd64 AMD System Management libraries 2025-12-04T08:54:12.8424182Z ii rocm-utils 6.4.1.60401-83~22.04 amd64 Radeon Open Compute (ROCm) Runtime software stack 2025-12-04T08:54:12.8424615Z ii rocminfo 1.0.0.60401-83~22.04 amd64 Radeon Open Compute (ROCm) Runtime rocminfo tool 2025-12-04T08:54:12.8442614Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2025-12-04T08:54:12.8442902Z # ignore expansion of "docker ps -q" since it could be empty 2025-12-04T08:54:12.8443082Z # shellcheck disable=SC2046 
2025-12-04T08:54:12.8443223Z docker stop $(docker ps -q) || true 2025-12-04T08:54:12.8443367Z # Prune all stopped containers. 2025-12-04T08:54:12.8443506Z docker container prune -f 2025-12-04T08:54:12.8448309Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:12.8448467Z env: 2025-12-04T08:54:12.8448568Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:12.8448863Z ##[endgroup] 2025-12-04T08:54:12.8642372Z docker: 'docker stop' requires at least 1 argument 2025-12-04T08:54:12.8642532Z 2025-12-04T08:54:12.8642610Z Usage: docker stop [OPTIONS] CONTAINER [CONTAINER...] 2025-12-04T08:54:12.8642713Z 2025-12-04T08:54:12.8642783Z See 'docker stop --help' for more information 2025-12-04T08:54:12.8722220Z Total reclaimed space: 0B 2025-12-04T08:54:12.8747357Z ##[group]Run cat /etc/os-release || true 2025-12-04T08:54:12.8747581Z cat /etc/os-release || true 2025-12-04T08:54:12.8747774Z cat /etc/apt/sources.list.d/rocm.list || true 2025-12-04T08:54:12.8748148Z cat /opt/rocm/.info/version || true 2025-12-04T08:54:12.8748304Z whoami 2025-12-04T08:54:12.8753117Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:12.8753277Z env: 2025-12-04T08:54:12.8753371Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:12.8753485Z ##[endgroup] 2025-12-04T08:54:12.8772025Z PRETTY_NAME="Ubuntu 22.04.5 LTS" 2025-12-04T08:54:12.8772140Z NAME="Ubuntu" 2025-12-04T08:54:12.8772239Z VERSION_ID="22.04" 2025-12-04T08:54:12.8772341Z VERSION="22.04.5 LTS (Jammy Jellyfish)" 2025-12-04T08:54:12.8772467Z VERSION_CODENAME=jammy 2025-12-04T08:54:12.8772570Z ID=ubuntu 2025-12-04T08:54:12.8772656Z ID_LIKE=debian 2025-12-04T08:54:12.8772785Z HOME_URL="https://www.ubuntu.com/" 2025-12-04T08:54:12.8772920Z SUPPORT_URL="https://help.ubuntu.com/" 2025-12-04T08:54:12.8773078Z BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/" 2025-12-04T08:54:12.8773294Z PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy" 2025-12-04T08:54:12.8773634Z 
UBUNTU_CODENAME=jammy 2025-12-04T08:54:12.8778449Z deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/6.4.1 jammy main 2025-12-04T08:54:12.8785756Z 6.4.1-83 2025-12-04T08:54:12.8791795Z runner 2025-12-04T08:54:12.8811702Z ##[group]Run dpkg -l | grep -E " amdgpu" 2025-12-04T08:54:12.8811908Z dpkg -l | grep -E " amdgpu" 2025-12-04T08:54:12.8816843Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:12.8817039Z env: 2025-12-04T08:54:12.8817153Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:12.8817293Z ##[endgroup] 2025-12-04T08:54:12.8867123Z ii amdgpu-core 1:6.4.60401-2164967.22.04 all Core meta package for unified amdgpu driver. 2025-12-04T08:54:12.8867380Z ii amdgpu-install 6.4.60401-2164967.22.04 all AMDGPU driver repository and installer 2025-12-04T08:54:12.8887650Z ##[group]Run rocm-smi 2025-12-04T08:54:12.8887854Z rocm-smi 2025-12-04T08:54:12.8892742Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:12.8892942Z env: 2025-12-04T08:54:12.8893054Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:12.8893200Z ##[endgroup] 2025-12-04T08:54:12.9442391Z 2025-12-04T08:54:12.9442516Z 2025-12-04T08:54:12.9442685Z ============================================ ROCm System Management Interface ============================================ 2025-12-04T08:54:12.9442939Z ====================================================== Concise Info ====================================================== 2025-12-04T08:54:12.9443195Z Device Node IDs Temp Power Partitions SCLK MCLK Fan Perf PwrCap VRAM% GPU% 2025-12-04T08:54:12.9443787Z (DID, GUID) (Junction) (Socket) (Mem, Compute, ID) 2025-12-04T08:54:12.9444008Z ========================================================================================================================== 2025-12-04T08:54:12.9444541Z 0 3 0x74a5, 51110 26.0°C 119.0W NPS1, SPX, 0 N/A 900Mhz 0% manual 1000.0W 0% 0% 2025-12-04T08:54:12.9444779Z 
========================================================================================================================== 2025-12-04T08:54:12.9444968Z ================================================== End of ROCm SMI Log =================================================== 2025-12-04T08:54:12.9505410Z ##[group]Run rocminfo 2025-12-04T08:54:12.9505585Z rocminfo 2025-12-04T08:54:12.9510067Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:12.9510277Z env: 2025-12-04T08:54:12.9510374Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:12.9510488Z ##[endgroup] 2025-12-04T08:54:13.0047831Z ROCk module version 6.12.12 is loaded 2025-12-04T08:54:13.0048076Z ===================== 2025-12-04T08:54:13.0048247Z HSA System Attributes 2025-12-04T08:54:13.0048391Z ===================== 2025-12-04T08:54:13.0048732Z Runtime Version: 1.15 2025-12-04T08:54:13.0048895Z Runtime Ext Version: 1.7 2025-12-04T08:54:13.0049057Z System Timestamp Freq.: 1000.000000MHz 2025-12-04T08:54:13.0049294Z Sig. 
Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) 2025-12-04T08:54:13.0049547Z Machine Model: LARGE 2025-12-04T08:54:13.0049790Z System Endianness: LITTLE 2025-12-04T08:54:13.0049972Z Mwaitx: DISABLED 2025-12-04T08:54:13.0050183Z XNACK enabled: NO 2025-12-04T08:54:13.0050326Z DMAbuf Support: YES 2025-12-04T08:54:13.0050479Z VMM Support: YES 2025-12-04T08:54:13.0050567Z 2025-12-04T08:54:13.0050618Z ========== 2025-12-04T08:54:13.0050771Z HSA Agents 2025-12-04T08:54:13.0050932Z ========== 2025-12-04T08:54:13.0051176Z ******* 2025-12-04T08:54:13.0051299Z Agent 1 2025-12-04T08:54:13.0051462Z ******* 2025-12-04T08:54:13.0051632Z Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T08:54:13.0051846Z Uuid: CPU-XX 2025-12-04T08:54:13.0052044Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T08:54:13.0052244Z Vendor Name: CPU 2025-12-04T08:54:13.0052440Z Feature: None specified 2025-12-04T08:54:13.0052624Z Profile: FULL_PROFILE 2025-12-04T08:54:13.0052820Z Float Round Mode: NEAR 2025-12-04T08:54:13.0053023Z Max Queue Number: 0(0x0) 2025-12-04T08:54:13.0053223Z Queue Min Size: 0(0x0) 2025-12-04T08:54:13.0053414Z Queue Max Size: 0(0x0) 2025-12-04T08:54:13.0053618Z Queue Type: MULTI 2025-12-04T08:54:13.0053788Z Node: 0 2025-12-04T08:54:13.0053979Z Device Type: CPU 2025-12-04T08:54:13.0054148Z Cache Info: 2025-12-04T08:54:13.0054295Z L1: 49152(0xc000) KB 2025-12-04T08:54:13.0054484Z Chip ID: 0(0x0) 2025-12-04T08:54:13.0054664Z ASIC Revision: 0(0x0) 2025-12-04T08:54:13.0054856Z Cacheline Size: 64(0x40) 2025-12-04T08:54:13.0055051Z Max Clock Freq. (MHz): 3300 2025-12-04T08:54:13.0055225Z BDFID: 0 2025-12-04T08:54:13.0055417Z Internal Node ID: 0 2025-12-04T08:54:13.0055611Z Compute Unit: 64 2025-12-04T08:54:13.0055805Z SIMDs per CU: 0 2025-12-04T08:54:13.0055996Z Shader Engines: 0 2025-12-04T08:54:13.0056200Z Shader Arrs. per Eng.: 0 2025-12-04T08:54:13.0056400Z WatchPts on Addr. 
Ranges:1 2025-12-04T08:54:13.0056584Z Memory Properties: 2025-12-04T08:54:13.0056721Z Features: None 2025-12-04T08:54:13.0056865Z Pool Info: 2025-12-04T08:54:13.0057007Z Pool 1 2025-12-04T08:54:13.0057169Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T08:54:13.0057367Z Size: 1584734456(0x5e7520f8) KB 2025-12-04T08:54:13.0057560Z Allocatable: TRUE 2025-12-04T08:54:13.0057749Z Alloc Granule: 4KB 2025-12-04T08:54:13.0058018Z Alloc Recommended Granule:4KB 2025-12-04T08:54:13.0058234Z Alloc Alignment: 4KB 2025-12-04T08:54:13.0058503Z Accessible by all: TRUE 2025-12-04T08:54:13.0058682Z Pool 2 2025-12-04T08:54:13.0058850Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T08:54:13.0059044Z Size: 1584734456(0x5e7520f8) KB 2025-12-04T08:54:13.0059218Z Allocatable: TRUE 2025-12-04T08:54:13.0059414Z Alloc Granule: 4KB 2025-12-04T08:54:13.0059617Z Alloc Recommended Granule:4KB 2025-12-04T08:54:13.0059823Z Alloc Alignment: 4KB 2025-12-04T08:54:13.0060022Z Accessible by all: TRUE 2025-12-04T08:54:13.0060312Z Pool 3 2025-12-04T08:54:13.0060484Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2025-12-04T08:54:13.0060662Z Size: 1584734456(0x5e7520f8) KB 2025-12-04T08:54:13.0060848Z Allocatable: TRUE 2025-12-04T08:54:13.0061043Z Alloc Granule: 4KB 2025-12-04T08:54:13.0061242Z Alloc Recommended Granule:4KB 2025-12-04T08:54:13.0061447Z Alloc Alignment: 4KB 2025-12-04T08:54:13.0061650Z Accessible by all: TRUE 2025-12-04T08:54:13.0061853Z Pool 4 2025-12-04T08:54:13.0062010Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T08:54:13.0062162Z Size: 1584734456(0x5e7520f8) KB 2025-12-04T08:54:13.0062317Z Allocatable: TRUE 2025-12-04T08:54:13.0062479Z Alloc Granule: 4KB 2025-12-04T08:54:13.0062646Z Alloc Recommended Granule:4KB 2025-12-04T08:54:13.0062808Z Alloc Alignment: 4KB 2025-12-04T08:54:13.0062975Z Accessible by all: TRUE 2025-12-04T08:54:13.0063115Z ISA Info: 2025-12-04T08:54:13.0063225Z ******* 2025-12-04T08:54:13.0063330Z Agent 2 2025-12-04T08:54:13.0063429Z ******* 
2025-12-04T08:54:13.0063554Z Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T08:54:13.0063708Z Uuid: CPU-XX 2025-12-04T08:54:13.0063864Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T08:54:13.0064025Z Vendor Name: CPU 2025-12-04T08:54:13.0064187Z Feature: None specified 2025-12-04T08:54:13.0064342Z Profile: FULL_PROFILE 2025-12-04T08:54:13.0064499Z Float Round Mode: NEAR 2025-12-04T08:54:13.0064654Z Max Queue Number: 0(0x0) 2025-12-04T08:54:13.0064815Z Queue Min Size: 0(0x0) 2025-12-04T08:54:13.0064968Z Queue Max Size: 0(0x0) 2025-12-04T08:54:13.0065117Z Queue Type: MULTI 2025-12-04T08:54:13.0065264Z Node: 1 2025-12-04T08:54:13.0065407Z Device Type: CPU 2025-12-04T08:54:13.0065543Z Cache Info: 2025-12-04T08:54:13.0065663Z L1: 49152(0xc000) KB 2025-12-04T08:54:13.0065873Z Chip ID: 0(0x0) 2025-12-04T08:54:13.0066024Z ASIC Revision: 0(0x0) 2025-12-04T08:54:13.0066186Z Cacheline Size: 64(0x40) 2025-12-04T08:54:13.0066340Z Max Clock Freq. (MHz): 3300 2025-12-04T08:54:13.0066495Z BDFID: 0 2025-12-04T08:54:13.0066645Z Internal Node ID: 1 2025-12-04T08:54:13.0066803Z Compute Unit: 64 2025-12-04T08:54:13.0066957Z SIMDs per CU: 0 2025-12-04T08:54:13.0067106Z Shader Engines: 0 2025-12-04T08:54:13.0067269Z Shader Arrs. per Eng.: 0 2025-12-04T08:54:13.0067433Z WatchPts on Addr. 
Ranges:1 2025-12-04T08:54:13.0067633Z Memory Properties: 2025-12-04T08:54:13.0067743Z Features: None 2025-12-04T08:54:13.0067855Z Pool Info: 2025-12-04T08:54:13.0067956Z Pool 1 2025-12-04T08:54:13.0068085Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T08:54:13.0068239Z Size: 1585355628(0x5e7e9b6c) KB 2025-12-04T08:54:13.0068385Z Allocatable: TRUE 2025-12-04T08:54:13.0068545Z Alloc Granule: 4KB 2025-12-04T08:54:13.0068707Z Alloc Recommended Granule:4KB 2025-12-04T08:54:13.0068870Z Alloc Alignment: 4KB 2025-12-04T08:54:13.0069029Z Accessible by all: TRUE 2025-12-04T08:54:13.0069163Z Pool 2 2025-12-04T08:54:13.0069294Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T08:54:13.0069450Z Size: 1585355628(0x5e7e9b6c) KB 2025-12-04T08:54:13.0069590Z Allocatable: TRUE 2025-12-04T08:54:13.0069743Z Alloc Granule: 4KB 2025-12-04T08:54:13.0069905Z Alloc Recommended Granule:4KB 2025-12-04T08:54:13.0070062Z Alloc Alignment: 4KB 2025-12-04T08:54:13.0070264Z Accessible by all: TRUE 2025-12-04T08:54:13.0070398Z Pool 3 2025-12-04T08:54:13.0070527Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2025-12-04T08:54:13.0070676Z Size: 1585355628(0x5e7e9b6c) KB 2025-12-04T08:54:13.0070816Z Allocatable: TRUE 2025-12-04T08:54:13.0070969Z Alloc Granule: 4KB 2025-12-04T08:54:13.0071137Z Alloc Recommended Granule:4KB 2025-12-04T08:54:13.0071295Z Alloc Alignment: 4KB 2025-12-04T08:54:13.0071449Z Accessible by all: TRUE 2025-12-04T08:54:13.0071584Z Pool 4 2025-12-04T08:54:13.0071707Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T08:54:13.0071855Z Size: 1585355628(0x5e7e9b6c) KB 2025-12-04T08:54:13.0071996Z Allocatable: TRUE 2025-12-04T08:54:13.0072149Z Alloc Granule: 4KB 2025-12-04T08:54:13.0072305Z Alloc Recommended Granule:4KB 2025-12-04T08:54:13.0072457Z Alloc Alignment: 4KB 2025-12-04T08:54:13.0072610Z Accessible by all: TRUE 2025-12-04T08:54:13.0072743Z ISA Info: 2025-12-04T08:54:13.0072881Z ******* 2025-12-04T08:54:13.0072981Z Agent 3 2025-12-04T08:54:13.0073071Z ******* 
2025-12-04T08:54:13.0073180Z Name: gfx942 2025-12-04T08:54:13.0073325Z Uuid: GPU-c59e59538c4aacf0 2025-12-04T08:54:13.0073470Z Marketing Name: AMD Instinct MI325X 2025-12-04T08:54:13.0073620Z Vendor Name: AMD 2025-12-04T08:54:13.0073765Z Feature: KERNEL_DISPATCH 2025-12-04T08:54:13.0073906Z Profile: BASE_PROFILE 2025-12-04T08:54:13.0074053Z Float Round Mode: NEAR 2025-12-04T08:54:13.0074199Z Max Queue Number: 128(0x80) 2025-12-04T08:54:13.0074344Z Queue Min Size: 64(0x40) 2025-12-04T08:54:13.0074532Z Queue Max Size: 131072(0x20000) 2025-12-04T08:54:13.0074673Z Queue Type: MULTI 2025-12-04T08:54:13.0074809Z Node: 2 2025-12-04T08:54:13.0074944Z Device Type: GPU 2025-12-04T08:54:13.0075072Z Cache Info: 2025-12-04T08:54:13.0075180Z L1: 32(0x20) KB 2025-12-04T08:54:13.0075304Z L2: 4096(0x1000) KB 2025-12-04T08:54:13.0075430Z L3: 262144(0x40000) KB 2025-12-04T08:54:13.0075558Z Chip ID: 29861(0x74a5) 2025-12-04T08:54:13.0075697Z ASIC Revision: 1(0x1) 2025-12-04T08:54:13.0075844Z Cacheline Size: 128(0x80) 2025-12-04T08:54:13.0076004Z Max Clock Freq. (MHz): 2100 2025-12-04T08:54:13.0076140Z BDFID: 1280 2025-12-04T08:54:13.0076281Z Internal Node ID: 2 2025-12-04T08:54:13.0076426Z Compute Unit: 304 2025-12-04T08:54:13.0076565Z SIMDs per CU: 4 2025-12-04T08:54:13.0076708Z Shader Engines: 32 2025-12-04T08:54:13.0076856Z Shader Arrs. per Eng.: 1 2025-12-04T08:54:13.0077009Z WatchPts on Addr. 
Ranges:4 2025-12-04T08:54:13.0077164Z Coherent Host Access: FALSE 2025-12-04T08:54:13.0077296Z Memory Properties: 2025-12-04T08:54:13.0077410Z Features: KERNEL_DISPATCH 2025-12-04T08:54:13.0077547Z Fast F16 Operation: TRUE 2025-12-04T08:54:13.0077698Z Wavefront Size: 64(0x40) 2025-12-04T08:54:13.0077851Z Workgroup Max Size: 1024(0x400) 2025-12-04T08:54:13.0077988Z Workgroup Max Size per Dimension: 2025-12-04T08:54:13.0078112Z x 1024(0x400) 2025-12-04T08:54:13.0078238Z y 1024(0x400) 2025-12-04T08:54:13.0078363Z z 1024(0x400) 2025-12-04T08:54:13.0078499Z Max Waves Per CU: 32(0x20) 2025-12-04T08:54:13.0078648Z Max Work-item Per CU: 2048(0x800) 2025-12-04T08:54:13.0078797Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T08:54:13.0078936Z Grid Max Size per Dimension: 2025-12-04T08:54:13.0079047Z x 4294967295(0xffffffff) 2025-12-04T08:54:13.0079174Z y 4294967295(0xffffffff) 2025-12-04T08:54:13.0080474Z z 4294967295(0xffffffff) 2025-12-04T08:54:13.0080620Z Max fbarriers/Workgrp: 32 2025-12-04T08:54:13.0085221Z Packet Processor uCode:: 185 2025-12-04T08:54:13.0085387Z SDMA engine uCode:: 24 2025-12-04T08:54:13.0085539Z IOMMU Support:: None 2025-12-04T08:54:13.0085672Z Pool Info: 2025-12-04T08:54:13.0085775Z Pool 1 2025-12-04T08:54:13.0085900Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T08:54:13.0086049Z Size: 268419072(0xfffc000) KB 2025-12-04T08:54:13.0086193Z Allocatable: TRUE 2025-12-04T08:54:13.0086343Z Alloc Granule: 4KB 2025-12-04T08:54:13.0086585Z Alloc Recommended Granule:2048KB 2025-12-04T08:54:13.0086740Z Alloc Alignment: 4KB 2025-12-04T08:54:13.0086897Z Accessible by all: FALSE 2025-12-04T08:54:13.0087039Z Pool 2 2025-12-04T08:54:13.0087166Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T08:54:13.0087317Z Size: 268419072(0xfffc000) KB 2025-12-04T08:54:13.0087460Z Allocatable: TRUE 2025-12-04T08:54:13.0087617Z Alloc Granule: 4KB 2025-12-04T08:54:13.0087783Z Alloc Recommended Granule:2048KB 2025-12-04T08:54:13.0087940Z Alloc Alignment: 4KB 
2025-12-04T08:54:13.0088101Z Accessible by all: FALSE 2025-12-04T08:54:13.0088243Z Pool 3 2025-12-04T08:54:13.0088369Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T08:54:13.0088517Z Size: 268419072(0xfffc000) KB 2025-12-04T08:54:13.0088659Z Allocatable: TRUE 2025-12-04T08:54:13.0088814Z Alloc Granule: 4KB 2025-12-04T08:54:13.0088971Z Alloc Recommended Granule:2048KB 2025-12-04T08:54:13.0089126Z Alloc Alignment: 4KB 2025-12-04T08:54:13.0089278Z Accessible by all: FALSE 2025-12-04T08:54:13.0089413Z Pool 4 2025-12-04T08:54:13.0089529Z Segment: GROUP 2025-12-04T08:54:13.0089665Z Size: 64(0x40) KB 2025-12-04T08:54:13.0089804Z Allocatable: FALSE 2025-12-04T08:54:13.0089958Z Alloc Granule: 0KB 2025-12-04T08:54:13.0090160Z Alloc Recommended Granule:0KB 2025-12-04T08:54:13.0090314Z Alloc Alignment: 0KB 2025-12-04T08:54:13.0090468Z Accessible by all: FALSE 2025-12-04T08:54:13.0090604Z ISA Info: 2025-12-04T08:54:13.0090700Z ISA 1 2025-12-04T08:54:13.0090830Z Name: amdgcn-amd-amdhsa--gfx942:sramecc+:xnack- 2025-12-04T08:54:13.0090993Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T08:54:13.0091145Z Profiles: HSA_PROFILE_BASE 2025-12-04T08:54:13.0091307Z Default Rounding Mode: NEAR 2025-12-04T08:54:13.0091465Z Default Rounding Mode: NEAR 2025-12-04T08:54:13.0091665Z Fast f16: TRUE 2025-12-04T08:54:13.0091814Z Workgroup Max Size: 1024(0x400) 2025-12-04T08:54:13.0091951Z Workgroup Max Size per Dimension: 2025-12-04T08:54:13.0092078Z x 1024(0x400) 2025-12-04T08:54:13.0092207Z y 1024(0x400) 2025-12-04T08:54:13.0092331Z z 1024(0x400) 2025-12-04T08:54:13.0092474Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T08:54:13.0092611Z Grid Max Size per Dimension: 2025-12-04T08:54:13.0092730Z x 4294967295(0xffffffff) 2025-12-04T08:54:13.0092856Z y 4294967295(0xffffffff) 2025-12-04T08:54:13.0092982Z z 4294967295(0xffffffff) 2025-12-04T08:54:13.0093174Z FBarrier Max Size: 32 2025-12-04T08:54:13.0093307Z ISA 2 2025-12-04T08:54:13.0093441Z Name: 
amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- 2025-12-04T08:54:13.0093611Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T08:54:13.0093771Z Profiles: HSA_PROFILE_BASE 2025-12-04T08:54:13.0093924Z Default Rounding Mode: NEAR 2025-12-04T08:54:13.0094082Z Default Rounding Mode: NEAR 2025-12-04T08:54:13.0094229Z Fast f16: TRUE 2025-12-04T08:54:13.0094379Z Workgroup Max Size: 1024(0x400) 2025-12-04T08:54:13.0094628Z Workgroup Max Size per Dimension: 2025-12-04T08:54:13.0094903Z x 1024(0x400) 2025-12-04T08:54:13.0095088Z y 1024(0x400) 2025-12-04T08:54:13.0095215Z z 1024(0x400) 2025-12-04T08:54:13.0095351Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T08:54:13.0095485Z Grid Max Size per Dimension: 2025-12-04T08:54:13.0095604Z x 4294967295(0xffffffff) 2025-12-04T08:54:13.0095730Z y 4294967295(0xffffffff) 2025-12-04T08:54:13.0095856Z z 4294967295(0xffffffff) 2025-12-04T08:54:13.0095996Z FBarrier Max Size: 32 2025-12-04T08:54:13.0096124Z *** Done *** 2025-12-04T08:54:13.0106415Z ##[group]Run ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx') 2025-12-04T08:54:13.0106752Z ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx') 2025-12-04T08:54:13.0107023Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. 
Include a link to the runner logs so the runner can be identified" 2025-12-04T08:54:13.0107288Z if [[ $ngpu -eq 0 ]]; then 2025-12-04T08:54:13.0107427Z  echo "Error: Failed to detect any GPUs on the runner" 2025-12-04T08:54:13.0117866Z  echo "$msg" 2025-12-04T08:54:13.0117989Z  exit 1 2025-12-04T08:54:13.0118094Z fi 2025-12-04T08:54:13.0121818Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:13.0121972Z env: 2025-12-04T08:54:13.0122064Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:13.0122174Z ##[endgroup] 2025-12-04T08:54:13.0798147Z ##[group]Run pytorch/pytorch/.github/actions/diskspace-cleanup@main 2025-12-04T08:54:13.0798330Z with: 2025-12-04T08:54:13.0798433Z diskspace-cutoff: 70 2025-12-04T08:54:13.0798540Z env: 2025-12-04T08:54:13.0798638Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:13.0798755Z ##[endgroup] 2025-12-04T08:54:13.0819979Z ##[group]Run set -ex 2025-12-04T08:54:13.0820191Z set -ex 2025-12-04T08:54:13.0820450Z diskspace_cutoff=70 2025-12-04T08:54:13.0820602Z docker_root_dir=$(docker info -f '{{.DockerRootDir}}') 2025-12-04T08:54:13.0820764Z if [ ! -d "$docker_root_dir" ]; then 2025-12-04T08:54:13.0820965Z  echo "Docker root directory ($docker_root_dir) does not exist. Skipping disk space check." 2025-12-04T08:54:13.0821153Z  exit 0 2025-12-04T08:54:13.0821246Z fi 2025-12-04T08:54:13.0821409Z diskspace=$(df -H --output=pcent ${docker_root_dir} | sed -n 2p | sed 's/%//' | sed 's/ //') 2025-12-04T08:54:13.0821736Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. 
Include a link to the runner logs so the runner can be identified" 2025-12-04T08:54:13.0822011Z if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2025-12-04T08:54:13.0822163Z  docker system prune -af 2025-12-04T08:54:13.0822358Z  diskspace_new=$(df -H --output=pcent ${docker_root_dir} | sed -n 2p | sed 's/%//' | sed 's/ //') 2025-12-04T08:54:13.0822694Z  if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2025-12-04T08:54:13.0822859Z  diskspace_cutoff_int=$((diskspace_cutoff + 0)) 2025-12-04T08:54:13.0823015Z  difference=$((100 - diskspace_cutoff_int)) 2025-12-04T08:54:13.0823222Z  echo "Error: Available diskspace is less than $difference percent. Not enough diskspace." 2025-12-04T08:54:13.0823418Z  echo "$msg" 2025-12-04T08:54:13.0823520Z  exit 1 2025-12-04T08:54:13.0823618Z  else 2025-12-04T08:54:13.0823733Z  difference=$((diskspace - diskspace_new)) 2025-12-04T08:54:13.0823887Z  echo "Diskspace saved: $difference percent" 2025-12-04T08:54:13.0824016Z  fi 2025-12-04T08:54:13.0824107Z fi 2025-12-04T08:54:13.0828476Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:13.0828626Z env: 2025-12-04T08:54:13.0828723Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:13.0828838Z ##[endgroup] 2025-12-04T08:54:13.0846671Z + diskspace_cutoff=70 2025-12-04T08:54:13.0851013Z ++ docker info -f '{{.DockerRootDir}}' 2025-12-04T08:54:13.1183254Z + docker_root_dir=/home/runner/docker-data 2025-12-04T08:54:13.1183570Z + '[' '!' -d /home/runner/docker-data ']' 2025-12-04T08:54:13.1189096Z ++ df -H --output=pcent /home/runner/docker-data 2025-12-04T08:54:13.1190083Z ++ sed -n 2p 2025-12-04T08:54:13.1190738Z ++ sed s/%// 2025-12-04T08:54:13.1191725Z ++ sed 's/ //' 2025-12-04T08:54:13.1205133Z + diskspace=' 5' 2025-12-04T08:54:13.1205592Z + msg='Please file an issue on pytorch/pytorch reporting the faulty runner. 
Include a link to the runner logs so the runner can be identified' 2025-12-04T08:54:13.1206028Z + [[ 5 -ge 70 ]] 2025-12-04T08:54:13.1226987Z ##[group]Run RUNNER_ARTIFACT_DIR="${RUNNER_TEMP}/artifacts" 2025-12-04T08:54:13.1227219Z RUNNER_ARTIFACT_DIR="${RUNNER_TEMP}/artifacts" 2025-12-04T08:54:13.1227423Z rm -rf "${RUNNER_ARTIFACT_DIR}" 2025-12-04T08:54:13.1227582Z mkdir -p "${RUNNER_ARTIFACT_DIR}" 2025-12-04T08:54:13.1227786Z echo "RUNNER_ARTIFACT_DIR=${RUNNER_ARTIFACT_DIR}" >> "${GITHUB_ENV}" 2025-12-04T08:54:13.1227977Z  2025-12-04T08:54:13.1228116Z RUNNER_TEST_RESULTS_DIR="${RUNNER_TEMP}/test-results" 2025-12-04T08:54:13.1228293Z rm -rf "${RUNNER_TEST_RESULTS_DIR}" 2025-12-04T08:54:13.1228447Z mkdir -p "${RUNNER_TEST_RESULTS_DIR}" 2025-12-04T08:54:13.1228656Z echo "RUNNER_TEST_RESULTS_DIR=${RUNNER_TEST_RESULTS_DIR}" >> "${GITHUB_ENV}" 2025-12-04T08:54:13.1228852Z  2025-12-04T08:54:13.1228960Z RUNNER_DOCS_DIR="${RUNNER_TEMP}/docs" 2025-12-04T08:54:13.1229115Z rm -rf "${RUNNER_DOCS_DIR}" 2025-12-04T08:54:13.1229257Z mkdir -p "${RUNNER_DOCS_DIR}" 2025-12-04T08:54:13.1229428Z echo "RUNNER_DOCS_DIR=${RUNNER_DOCS_DIR}" >> "${GITHUB_ENV}" 2025-12-04T08:54:13.1233130Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:13.1233266Z env: 2025-12-04T08:54:13.1233350Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:13.1233447Z ##[endgroup] 2025-12-04T08:54:13.1302032Z ##[group]Run env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:13.1302273Z env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:13.1302484Z env | grep '^CI' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:13.1305819Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:13.1305982Z env: 2025-12-04T08:54:13.1306090Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:13.1306241Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:13.1306441Z RUNNER_TEST_RESULTS_DIR: 
/home/runner/_work/_temp/test-results 2025-12-04T08:54:13.1306617Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:13.1306892Z ##[endgroup] 2025-12-04T08:54:13.1347476Z ##[group]Run # All GPUs are visible to the runner; visibility, if needed, will be set by run_test.py. 2025-12-04T08:54:13.1347761Z # All GPUs are visible to the runner; visibility, if needed, will be set by run_test.py. 2025-12-04T08:54:13.1347955Z # Add render group for container creation. 2025-12-04T08:54:13.1348122Z render_gid=`cat /etc/group | grep render | cut -d: -f3` 2025-12-04T08:54:13.1348320Z # Ensure GPU isolation if pod is part of kubernetes setup with DEVICE_FLAG. 2025-12-04T08:54:13.1348515Z if [ -f "/etc/podinfo/gha-render-devices" ]; then 2025-12-04T08:54:13.1348677Z  DEVICE_FLAG=$(cat /etc/podinfo/gha-render-devices) 2025-12-04T08:54:13.1348815Z else 2025-12-04T08:54:13.1348912Z  DEVICE_FLAG="--device /dev/dri" 2025-12-04T08:54:13.1349026Z fi 2025-12-04T08:54:13.1349202Z # The --group-add daemon and --group-add bin are needed in the Ubuntu 24.04 and Almalinux OSs respectively. 2025-12-04T08:54:13.1349483Z # This is due to the device files (/dev/kfd & /dev/dri) being owned by video group on bare metal. 2025-12-04T08:54:13.1349730Z # This video group ID maps to subgid 1 inside the docker image due to the /etc/subgid entries. 2025-12-04T08:54:13.1349989Z # The group name corresponding to group ID 1 can change depending on the OS, so both are necessary. 
2025-12-04T08:54:13.1350500Z echo "GPU_FLAG=--device=/dev/mem --device=/dev/kfd $DEVICE_FLAG --group-add video --group-add $render_gid --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host" >> "${GITHUB_ENV}" 2025-12-04T08:54:13.1353654Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:13.1353790Z env: 2025-12-04T08:54:13.1353875Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:13.1354001Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:13.1354174Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:13.1354331Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:13.1354449Z ##[endgroup] 2025-12-04T08:54:13.1423852Z ##[group]Run aws-actions/configure-aws-credentials@ececac1a45f3b08a01d2dd070d28d111c5fe6722 2025-12-04T08:54:13.1424059Z with: 2025-12-04T08:54:13.1424209Z role-to-assume: arn:aws:iam::308535385114:role/gha_workflow_s3_and_ecr_read_only 2025-12-04T08:54:13.1424380Z aws-region: us-east-1 2025-12-04T08:54:13.1424497Z role-duration-seconds: 18000 2025-12-04T08:54:13.1424623Z audience: sts.amazonaws.com 2025-12-04T08:54:13.1424734Z env: 2025-12-04T08:54:13.1424828Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:13.1424959Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:13.1425138Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:13.1425306Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:13.1425811Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:13.1426187Z ##[endgroup] 2025-12-04T08:54:13.5033969Z Assuming role with OIDC 2025-12-04T08:54:13.8514948Z Authenticated as assumedRoleId AROAUPVRELQNLLCOPFEJR:GitHubActions 2025-12-04T08:54:13.9501326Z ##[group]Run 
aws-actions/amazon-ecr-login@062b18b96a7aff071d4dc91bc00c4c1a7945b076 2025-12-04T08:54:13.9501533Z with: 2025-12-04T08:54:13.9501640Z mask-password: true 2025-12-04T08:54:13.9501757Z registry-type: private 2025-12-04T08:54:13.9501870Z skip-logout: false 2025-12-04T08:54:13.9501976Z env: 2025-12-04T08:54:13.9502073Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:13.9502214Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:13.9502391Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:13.9502560Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:13.9503114Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:13.9503496Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:13.9503618Z AWS_REGION: us-east-1 2025-12-04T08:54:13.9504193Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:13.9504357Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:13.9506466Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:13.9506578Z ##[endgroup] 2025-12-04T08:54:14.3605018Z Logging into registry 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:14.9882544Z ##[group]Run env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:14.9882877Z env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:14.9883150Z env | grep '^CI' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:14.9883452Z env | grep '^RUNNER' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:14.9888219Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:14.9888436Z env: 2025-12-04T08:54:14.9888568Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:14.9888768Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:14.9889023Z RUNNER_TEST_RESULTS_DIR: 
/home/runner/_work/_temp/test-results 2025-12-04T08:54:14.9889266Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:14.9889812Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:14.9890360Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:14.9890489Z AWS_REGION: us-east-1 2025-12-04T08:54:14.9890704Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:14.9890879Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:14.9893279Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:14.9893401Z ##[endgroup] 2025-12-04T08:54:15.0041757Z ##[group]Run pytorch/test-infra/.github/actions/calculate-docker-image@main 2025-12-04T08:54:15.0041944Z with: 2025-12-04T08:54:15.0042222Z docker-image-name: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:15.0042531Z use-custom-docker-registry: true 2025-12-04T08:54:15.0042660Z docker-build-dir: .ci/docker 2025-12-04T08:54:15.0042785Z docker-build-script: ./build.sh 2025-12-04T08:54:15.0042906Z working-directory: . 
2025-12-04T08:54:15.0043051Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:15.0043206Z force-push: false 2025-12-04T08:54:15.0043302Z env: 2025-12-04T08:54:15.0043393Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:15.0043531Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:15.0043720Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:15.0043895Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:15.0044279Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:15.0044657Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:15.0044772Z AWS_REGION: us-east-1 2025-12-04T08:54:15.0044966Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:15.0045121Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:15.0047194Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:15.0047299Z ##[endgroup] 2025-12-04T08:54:15.0055588Z ##[group]Run set -ex 2025-12-04T08:54:15.0055726Z set -ex 2025-12-04T08:54:15.0055822Z  2025-12-04T08:54:15.0055979Z # If the docker build directory or the build script doesn't exist, the action will 2025-12-04T08:54:15.0056350Z # gracefully return the docker image name as it is. 
Pulling docker image in Linux 2025-12-04T08:54:15.0056565Z # job could then download the pre-built image as usual 2025-12-04T08:54:15.0056825Z if [[ -d "${DOCKER_BUILD_DIR}" ]] && [[ -f "${DOCKER_BUILD_DIR}/${DOCKER_BUILD_SCRIPT}" ]] && [[ "${USE_CUSTOM_DOCKER_REGISTRY}" == "true" ]]; then 2025-12-04T08:54:15.0057063Z  echo "skip=false" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:15.0057195Z else 2025-12-04T08:54:15.0057308Z  echo "skip=true" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:15.0057485Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:15.0057639Z  2025-12-04T08:54:15.0057845Z  echo "Not using custom ECR registry. Either it was not requested or there is no Docker build script in the ${REPO_NAME} repo..." 2025-12-04T08:54:15.0058077Z  exit 0 2025-12-04T08:54:15.0058171Z fi 2025-12-04T08:54:15.0058270Z  2025-12-04T08:54:15.0058406Z if [[ "${DOCKER_IMAGE_NAME}" == *"${DOCKER_REGISTRY}/${REPO_NAME}"* ]]; then 2025-12-04T08:54:15.0058634Z  # The docker image name already includes the ECR prefix and tag, so we can just 2025-12-04T08:54:15.0058834Z  # use it as it is, but first let's extract the tag 2025-12-04T08:54:15.0059021Z  DOCKER_TAG=$(echo "${DOCKER_IMAGE_NAME}" | awk -F '[:,]' '{print $2}') 2025-12-04T08:54:15.0059219Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:15.0059412Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:15.0059568Z else 2025-12-04T08:54:15.0059683Z  if [[ "${DOCKER_IMAGE_NAME}" == *:* ]]; then 2025-12-04T08:54:15.0059836Z  CUSTOM_TAG_PREFIX=${DOCKER_IMAGE_NAME#*:} 2025-12-04T08:54:15.0059991Z  DOCKER_IMAGE_NAME=${DOCKER_IMAGE_NAME%%:*} 2025-12-04T08:54:15.0060166Z  fi 2025-12-04T08:54:15.0060422Z  DOCKER_TAG=${CUSTOM_TAG_PREFIX:+${CUSTOM_TAG_PREFIX}-}$(git rev-parse HEAD:"${DOCKER_BUILD_DIR}") 2025-12-04T08:54:15.0060654Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:15.0060896Z  echo 
"docker-image=${DOCKER_REGISTRY}/${REPO_NAME}/${DOCKER_IMAGE_NAME}:${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:15.0061154Z  echo "custom-tag-prefix=${CUSTOM_TAG_PREFIX}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:15.0061317Z fi 2025-12-04T08:54:15.0065359Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:15.0065506Z env: 2025-12-04T08:54:15.0065603Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:15.0065741Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:15.0065922Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:15.0066091Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:15.0066477Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:15.0066855Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:15.0066974Z AWS_REGION: us-east-1 2025-12-04T08:54:15.0067114Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:15.0067267Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:15.0069334Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:15.0069445Z REPO_NAME: pytorch 2025-12-04T08:54:15.0069726Z DOCKER_IMAGE_NAME: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:15.0070027Z DOCKER_BUILD_DIR: .ci/docker 2025-12-04T08:54:15.0070207Z DOCKER_BUILD_SCRIPT: ./build.sh 2025-12-04T08:54:15.0070364Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:15.0070525Z USE_CUSTOM_DOCKER_REGISTRY: true 2025-12-04T08:54:15.0070699Z CUSTOM_TAG_PREFIX: 2025-12-04T08:54:15.0070805Z ##[endgroup] 2025-12-04T08:54:15.0085858Z + [[ -d .ci/docker ]] 2025-12-04T08:54:15.0085992Z + [[ -f .ci/docker/./build.sh ]] 2025-12-04T08:54:15.0086123Z + [[ true == \t\r\u\e ]] 2025-12-04T08:54:15.0086229Z + echo skip=false 
2025-12-04T08:54:15.0086603Z + [[ 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a == *\3\0\8\5\3\5\3\8\5\1\1\4\.\d\k\r\.\e\c\r\.\u\s\-\e\a\s\t\-\1\.\a\m\a\z\o\n\a\w\s\.\c\o\m\/\p\y\t\o\r\c\h* ]] 2025-12-04T08:54:15.0091018Z ++ echo 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:15.0091980Z ++ awk -F '[:,]' '{print $2}' 2025-12-04T08:54:15.0102988Z + DOCKER_TAG=pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:15.0103274Z + echo docker-tag=pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:15.0103917Z + echo docker-image=308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:15.0133758Z ##[group]Run set +e 2025-12-04T08:54:15.0133934Z set +e 2025-12-04T08:54:15.0134052Z set -x 2025-12-04T08:54:15.0134163Z  2025-12-04T08:54:15.0134277Z login() { 2025-12-04T08:54:15.0134511Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2025-12-04T08:54:15.0134750Z } 2025-12-04T08:54:15.0134856Z  2025-12-04T08:54:15.0134965Z retry () { 2025-12-04T08:54:15.0135102Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2025-12-04T08:54:15.0135254Z } 2025-12-04T08:54:15.0135358Z  2025-12-04T08:54:15.0135481Z retry login "${DOCKER_REGISTRY}" 2025-12-04T08:54:15.0135627Z  2025-12-04T08:54:15.0135741Z START_TIME=$(date +%s) 2025-12-04T08:54:15.0135906Z # Wait up to 120 minutes 2025-12-04T08:54:15.0136235Z while [[ $(( $(date +%s) - 7200 )) -lt $START_TIME ]]; do 2025-12-04T08:54:15.0136471Z  # Check if image already exists, if it does then skip building it 2025-12-04T08:54:15.0136707Z  if docker manifest inspect "${DOCKER_IMAGE}"; then 2025-12-04T08:54:15.0136884Z  exit 0 2025-12-04T08:54:15.0137011Z  
fi 2025-12-04T08:54:15.0137129Z  2025-12-04T08:54:15.0137318Z  # NB: This flag is used by Docker build workflow to push the image to ECR, so we can 2025-12-04T08:54:15.0137621Z  # use this to differentiate between the Docker build and regular build jobs. For the 2025-12-04T08:54:15.0137925Z  # latter, it will wait for the Docker images to become available before continuing 2025-12-04T08:54:15.0138173Z  if [ "${DOCKER_PUSH:-false}" == "true" ]; then 2025-12-04T08:54:15.0138372Z  # It's a Docker build job, let's build the image 2025-12-04T08:54:15.0138544Z  break 2025-12-04T08:54:15.0138675Z  else 2025-12-04T08:54:15.0138844Z  # It's a regular build job, wait for the image to become available 2025-12-04T08:54:15.0139043Z  sleep 300 2025-12-04T08:54:15.0139172Z  fi 2025-12-04T08:54:15.0139286Z done 2025-12-04T08:54:15.0139400Z  2025-12-04T08:54:15.0139576Z # NB: This part requires a full checkout. Otherwise, the merge base will 2025-12-04T08:54:15.0139837Z # be empty. The default action would be to continue rebuild the image 2025-12-04T08:54:15.0140082Z if [[ "$BASE_REVISION" = "$(git rev-parse HEAD)" ]]; then 2025-12-04T08:54:15.0140477Z  # if we're on the base branch then use the parent commit 2025-12-04T08:54:15.0140675Z  MERGE_BASE=$(git rev-parse HEAD~) 2025-12-04T08:54:15.0140832Z else 2025-12-04T08:54:15.0140998Z  # otherwise we're on a PR, so use the most recent base commit 2025-12-04T08:54:15.0141328Z  MERGE_BASE=$(git merge-base HEAD "$BASE_REVISION") 2025-12-04T08:54:15.0141478Z fi 2025-12-04T08:54:15.0141576Z  2025-12-04T08:54:15.0141684Z if [[ -z "${MERGE_BASE}" ]]; then 2025-12-04T08:54:15.0141837Z  echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:15.0141974Z  2025-12-04T08:54:15.0142160Z  echo "Finding merge base only works with full checkout, please set fetch-depth to 0, continuing ..." 2025-12-04T08:54:15.0142373Z  exit 0 2025-12-04T08:54:15.0142476Z fi 2025-12-04T08:54:15.0142568Z  2025-12-04T08:54:15.0142700Z if ! 
git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}"; then 2025-12-04T08:54:15.0142961Z  echo "Directory '${DOCKER_BUILD_DIR}' not found in commit $MERGE_BASE, you should rebase onto a more recent commit" 2025-12-04T08:54:15.0143187Z  exit 1 2025-12-04T08:54:15.0143287Z fi 2025-12-04T08:54:15.0143383Z  2025-12-04T08:54:15.0143539Z PREVIOUS_DOCKER_TAG=$(git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}") 2025-12-04T08:54:15.0143790Z # If no image exists but the hash is the same as the previous hash then we should error out here 2025-12-04T08:54:15.0144016Z if [[ "${PREVIOUS_DOCKER_TAG}" == "${DOCKER_TAG}" ]]; then 2025-12-04T08:54:15.0144275Z  echo "WARNING: Something has gone wrong and the previous image isn't available for the merge-base of your branch" 2025-12-04T08:54:15.0144566Z  echo " Will re-build docker image to store in local cache, TTS may be longer" 2025-12-04T08:54:15.0144746Z fi 2025-12-04T08:54:15.0144857Z  2025-12-04T08:54:15.0144964Z echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:15.0148884Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:15.0149030Z env: 2025-12-04T08:54:15.0149131Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:15.0149271Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:15.0149494Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:15.0149662Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:15.0150047Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:15.0150496Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:15.0150614Z AWS_REGION: us-east-1 2025-12-04T08:54:15.0150807Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:15.0150962Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:15.0153052Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:15.0153164Z 
DOCKER_BUILD_DIR: .ci/docker 2025-12-04T08:54:15.0153306Z BASE_REVISION: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:15.0153767Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:15.0154135Z DOCKER_TAG: pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:15.0154371Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:15.0154523Z DOCKER_PUSH: 2025-12-04T08:54:15.0154621Z ##[endgroup] 2025-12-04T08:54:15.0174179Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:15.0174361Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:15.0177514Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:15.0177718Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:15.0179048Z /home/runner/_work/_temp/916b31e4-8942-4430-b424-7f590c060857.sh: line 5: aws: command not found 2025-12-04T08:54:15.0243727Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:15.0251503Z + sleep 1 2025-12-04T08:54:16.0262839Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:16.0265859Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:16.0266329Z /home/runner/_work/_temp/916b31e4-8942-4430-b424-7f590c060857.sh: line 5: aws: command not found 2025-12-04T08:54:16.0268088Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:16.0349921Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:16.0358378Z + sleep 2 2025-12-04T08:54:18.0368228Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:18.0371983Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:18.0372699Z + docker login -u AWS --password-stdin 
308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:18.0373192Z /home/runner/_work/_temp/916b31e4-8942-4430-b424-7f590c060857.sh: line 5: aws: command not found 2025-12-04T08:54:18.0464426Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:18.0477133Z ++ date +%s 2025-12-04T08:54:18.0486276Z + START_TIME=1764838458 2025-12-04T08:54:18.0490327Z ++ date +%s 2025-12-04T08:54:18.0498321Z + [[ 1764831258 -lt 1764838458 ]] 2025-12-04T08:54:18.0498942Z + docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:19.4571047Z { 2025-12-04T08:54:19.4571274Z "schemaVersion": 2, 2025-12-04T08:54:19.4571553Z "mediaType": "application/vnd.docker.distribution.manifest.v2+json", 2025-12-04T08:54:19.4571786Z "config": { 2025-12-04T08:54:19.4571951Z "mediaType": "application/vnd.docker.container.image.v1+json", 2025-12-04T08:54:19.4572143Z "size": 30520, 2025-12-04T08:54:19.4572335Z "digest": "sha256:45252333063339f104d56e41f20304e9511ab21c7768e8d156b95ddf24a9dbe5" 2025-12-04T08:54:19.4572544Z }, 2025-12-04T08:54:19.4572645Z "layers": [ 2025-12-04T08:54:19.4572748Z { 2025-12-04T08:54:19.4572941Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4573134Z "size": 30447951, 2025-12-04T08:54:19.4573806Z "digest": "sha256:63e5bc7682b85ae57a1221210f64d62e7a90b0a30f19af4ca734b8242ae49d63" 2025-12-04T08:54:19.4574025Z }, 2025-12-04T08:54:19.4574118Z { 2025-12-04T08:54:19.4574272Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4574461Z "size": 1554, 2025-12-04T08:54:19.4574653Z "digest": "sha256:835841cca3b7e1464290cdb78e48773e03583413fbed852c3cc5165a392ea44d" 2025-12-04T08:54:19.4574926Z }, 2025-12-04T08:54:19.4575021Z { 2025-12-04T08:54:19.4575173Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4575360Z "size": 313275691, 
2025-12-04T08:54:19.4575551Z "digest": "sha256:aac69780afc8611a5f94a235792d39ae055249c8319ef43b78675998a9b2f825" 2025-12-04T08:54:19.4575756Z }, 2025-12-04T08:54:19.4575849Z { 2025-12-04T08:54:19.4576001Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4576194Z "size": 704, 2025-12-04T08:54:19.4576388Z "digest": "sha256:029495b23122c840ca0e52d487afa8d2c4dbf1991cd7f204ec3e434dcf947bf4" 2025-12-04T08:54:19.4576595Z }, 2025-12-04T08:54:19.4576684Z { 2025-12-04T08:54:19.4576834Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4577021Z "size": 1218, 2025-12-04T08:54:19.4577210Z "digest": "sha256:d0fb85b008332051a3f7c052721ef68bde404b46c23fa43ad040373bd367826c" 2025-12-04T08:54:19.4577413Z }, 2025-12-04T08:54:19.4577499Z { 2025-12-04T08:54:19.4577650Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4577837Z "size": 484, 2025-12-04T08:54:19.4578018Z "digest": "sha256:59b63930883363c7d2aaab27cc61555d9f3e119dc18247a8624c98ebdaa354a5" 2025-12-04T08:54:19.4578223Z }, 2025-12-04T08:54:19.4578313Z { 2025-12-04T08:54:19.4578464Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4578776Z "size": 110363202, 2025-12-04T08:54:19.4578977Z "digest": "sha256:dc112c89d57aa1e85082e40a56e5bc743d64f834ae2f98afe91f60c248354d38" 2025-12-04T08:54:19.4579185Z }, 2025-12-04T08:54:19.4579279Z { 2025-12-04T08:54:19.4579426Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4579606Z "size": 4436, 2025-12-04T08:54:19.4579794Z "digest": "sha256:522eab2402e5001810155ef7eb56940b7c01a4fef62ac588886981c3b8ee8e1e" 2025-12-04T08:54:19.4579996Z }, 2025-12-04T08:54:19.4580166Z { 2025-12-04T08:54:19.4580322Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4580485Z "size": 1755, 2025-12-04T08:54:19.4580646Z "digest": "sha256:2b5a11b41761d8ea3b829e4772e4064cb6c4e4989126af324d0057661e4493a1" 
2025-12-04T08:54:19.4580822Z }, 2025-12-04T08:54:19.4580902Z { 2025-12-04T08:54:19.4581030Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4581187Z "size": 724, 2025-12-04T08:54:19.4581349Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T08:54:19.4581531Z }, 2025-12-04T08:54:19.4581613Z { 2025-12-04T08:54:19.4581742Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4581901Z "size": 3185588166, 2025-12-04T08:54:19.4582073Z "digest": "sha256:73e33534e9eb94cf29418d65944168962b65fe21f55e9b8bad18c76e9b3a37b8" 2025-12-04T08:54:19.4582248Z }, 2025-12-04T08:54:19.4582329Z { 2025-12-04T08:54:19.4582456Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4582612Z "size": 396, 2025-12-04T08:54:19.4582781Z "digest": "sha256:5bfdaeb5578d6ffcd7db29c48303cbceb13c591210feaa216a8daa7a6d445b4b" 2025-12-04T08:54:19.4582967Z }, 2025-12-04T08:54:19.4583047Z { 2025-12-04T08:54:19.4583172Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4583330Z "size": 236863, 2025-12-04T08:54:19.4583499Z "digest": "sha256:c07d27e4d3a5ba4ad5325bb785b2e4f058fe5e10ec1aeeb413a1e152b073f203" 2025-12-04T08:54:19.4583686Z }, 2025-12-04T08:54:19.4583870Z { 2025-12-04T08:54:19.4583998Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4584155Z "size": 787, 2025-12-04T08:54:19.4584318Z "digest": "sha256:b21856d1bf420da6fa8ec7331b82ab355d4f4178644e7d3a3d3d0fbc3610109a" 2025-12-04T08:54:19.4584533Z }, 2025-12-04T08:54:19.4584619Z { 2025-12-04T08:54:19.4584756Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4584920Z "size": 106, 2025-12-04T08:54:19.4585095Z "digest": "sha256:cb19d84867e4063f55db9459c28c50a2abc37c06d3c1ca82ba95fa8427cc438a" 2025-12-04T08:54:19.4585281Z }, 2025-12-04T08:54:19.4585369Z { 2025-12-04T08:54:19.4585498Z "mediaType": 
"application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4585664Z "size": 1496, 2025-12-04T08:54:19.4585832Z "digest": "sha256:8165374f8dccf88a7791a5d31afbe29e4d4542b4f1cf1904945e07f9af6bf8ba" 2025-12-04T08:54:19.4586021Z }, 2025-12-04T08:54:19.4586106Z { 2025-12-04T08:54:19.4586245Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4586410Z "size": 458789560, 2025-12-04T08:54:19.4586587Z "digest": "sha256:1aecc77354ceba59ec6f0d37a558f2dbb6d5c0854553ee8505ac8707b422da6d" 2025-12-04T08:54:19.4586774Z }, 2025-12-04T08:54:19.4586862Z { 2025-12-04T08:54:19.4587000Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4587441Z "size": 164, 2025-12-04T08:54:19.4587671Z "digest": "sha256:465d3fd643aa2ea0ad07335cda66f12f1d7e5e800c4e9385ec466bc8a1ceabda" 2025-12-04T08:54:19.4587862Z }, 2025-12-04T08:54:19.4587950Z { 2025-12-04T08:54:19.4588085Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4588249Z "size": 104, 2025-12-04T08:54:19.4588411Z "digest": "sha256:6c503e779d6f41ca7f51309875df2b725c171926aece7009c4b8a64d1ba3f58e" 2025-12-04T08:54:19.4588655Z }, 2025-12-04T08:54:19.4588743Z { 2025-12-04T08:54:19.4588882Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4589046Z "size": 724, 2025-12-04T08:54:19.4589209Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T08:54:19.4589387Z }, 2025-12-04T08:54:19.4589472Z { 2025-12-04T08:54:19.4589610Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4589773Z "size": 196, 2025-12-04T08:54:19.4589939Z "digest": "sha256:f7e9a021f0ee3d11a50dcb96378af8103a21f6c3c142f54529207648f3ed00b2" 2025-12-04T08:54:19.4590165Z }, 2025-12-04T08:54:19.4590248Z { 2025-12-04T08:54:19.4590382Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4590545Z "size": 2583, 
2025-12-04T08:54:19.4590710Z "digest": "sha256:8e023b349080fb11ee55491bc9b842b30e9e3a90246d05b303a73dc62038caf2" 2025-12-04T08:54:19.4590893Z }, 2025-12-04T08:54:19.4590986Z { 2025-12-04T08:54:19.4591117Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4591280Z "size": 7577171420, 2025-12-04T08:54:19.4591449Z "digest": "sha256:8188df80e595a3dbcf84623c6a58a655269898cbb60029435f136d7f9d34ccaa" 2025-12-04T08:54:19.4591625Z }, 2025-12-04T08:54:19.4591709Z { 2025-12-04T08:54:19.4591840Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4591999Z "size": 135, 2025-12-04T08:54:19.4592165Z "digest": "sha256:3c2c2f8c74bfa16c4bf9a832c97bbb1d55205b2b4a2cead02cf74301ca1001fb" 2025-12-04T08:54:19.4592348Z }, 2025-12-04T08:54:19.4592432Z { 2025-12-04T08:54:19.4592563Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4592721Z "size": 104, 2025-12-04T08:54:19.4592888Z "digest": "sha256:2aa7784fbe3300f8bbfb6bb51cff3b01fd091e829c2bc7ab9e25261a0dd9b3bd" 2025-12-04T08:54:19.4593073Z }, 2025-12-04T08:54:19.4593155Z { 2025-12-04T08:54:19.4593287Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4593445Z "size": 612, 2025-12-04T08:54:19.4593661Z "digest": "sha256:2b3b5215d3ebe8789f0444457bfd5a6e218289b64aa07653ac3d03ddda5e6708" 2025-12-04T08:54:19.4593840Z }, 2025-12-04T08:54:19.4593923Z { 2025-12-04T08:54:19.4594055Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4594215Z "size": 838191945, 2025-12-04T08:54:19.4594388Z "digest": "sha256:99b1f1ea3e857834cebd01763d90fbd700aeb9c2d2ef23eda2cfff5652c9708b" 2025-12-04T08:54:19.4594569Z }, 2025-12-04T08:54:19.4594651Z { 2025-12-04T08:54:19.4594780Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4594939Z "size": 111, 2025-12-04T08:54:19.4595097Z "digest": 
"sha256:18d6daba0a5768a37ad106b57974f6b7efd35c43a87c246bcd3f43fea88f2d2b" 2025-12-04T08:54:19.4595272Z }, 2025-12-04T08:54:19.4595354Z { 2025-12-04T08:54:19.4595479Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4595635Z "size": 1555, 2025-12-04T08:54:19.4595800Z "digest": "sha256:5277f2a503ebd17ba9d9b86cc9bac86265504adeb449c0647616ddaacd3cbc41" 2025-12-04T08:54:19.4595980Z }, 2025-12-04T08:54:19.4596061Z { 2025-12-04T08:54:19.4596188Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4596345Z "size": 107, 2025-12-04T08:54:19.4596504Z "digest": "sha256:3198a9717aace920fd5de085319adf75091af05fc4318ce4b16a8a5b0e8d449e" 2025-12-04T08:54:19.4596683Z }, 2025-12-04T08:54:19.4596764Z { 2025-12-04T08:54:19.4596892Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4597050Z "size": 166, 2025-12-04T08:54:19.4597206Z "digest": "sha256:99a4918e5808277879449e97ccd7190db6b9aa2d742b57a3b831ce0198522bdd" 2025-12-04T08:54:19.4597377Z }, 2025-12-04T08:54:19.4597454Z { 2025-12-04T08:54:19.4597580Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4597770Z "size": 3526081, 2025-12-04T08:54:19.4597935Z "digest": "sha256:15bb11dfc6acc3537d527d6771c8e711e5605e99f82ec41e805d4600b8a97516" 2025-12-04T08:54:19.4598107Z }, 2025-12-04T08:54:19.4598181Z { 2025-12-04T08:54:19.4598306Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4598460Z "size": 107, 2025-12-04T08:54:19.4598617Z "digest": "sha256:bd87c8766e90e33db17514558ac591cc3f4149afd7abeaef4dd5770bbfa14210" 2025-12-04T08:54:19.4598794Z }, 2025-12-04T08:54:19.4598874Z { 2025-12-04T08:54:19.4599002Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4599161Z "size": 829, 2025-12-04T08:54:19.4599320Z "digest": "sha256:1969e15d0c13874ea5883ed829235a19ef6dc21c8aa6172032b78a8ffa6ff262" 2025-12-04T08:54:19.4599498Z }, 
2025-12-04T08:54:19.4599582Z { 2025-12-04T08:54:19.4599712Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4599872Z "size": 26973054, 2025-12-04T08:54:19.4600043Z "digest": "sha256:24a03847d382b73c11969f8f73916a6bedf5ccea12f6f4290b3880f29ceda32a" 2025-12-04T08:54:19.4600258Z }, 2025-12-04T08:54:19.4600342Z { 2025-12-04T08:54:19.4600473Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4600633Z "size": 104, 2025-12-04T08:54:19.4600795Z "digest": "sha256:816e2e34e01839a35d624dbf4bd9ac9bea4c975104af47a0e6b6b6dee6c6f98d" 2025-12-04T08:54:19.4600974Z }, 2025-12-04T08:54:19.4601057Z { 2025-12-04T08:54:19.4601190Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4601349Z "size": 424, 2025-12-04T08:54:19.4601511Z "digest": "sha256:b168858b85373f8ddca549d79267a06de4fa945d04bf791c55c9ddc93957fa3c" 2025-12-04T08:54:19.4601689Z }, 2025-12-04T08:54:19.4601775Z { 2025-12-04T08:54:19.4601906Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4602065Z "size": 19309386, 2025-12-04T08:54:19.4602235Z "digest": "sha256:6b8d5ff02e267e38322afbb8a58ed63ce9d75b10e9e73255e6affcbc6b6539bf" 2025-12-04T08:54:19.4602425Z }, 2025-12-04T08:54:19.4602549Z { 2025-12-04T08:54:19.4602679Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4602837Z "size": 826, 2025-12-04T08:54:19.4602997Z "digest": "sha256:4e3b10a5dd6aed29f238d604925e2a4f873141c1087c8dd4fdde5c61e7560893" 2025-12-04T08:54:19.4603177Z }, 2025-12-04T08:54:19.4603260Z { 2025-12-04T08:54:19.4603389Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4603550Z "size": 724, 2025-12-04T08:54:19.4603707Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T08:54:19.4603883Z }, 2025-12-04T08:54:19.4603966Z { 2025-12-04T08:54:19.4604095Z "mediaType": 
"application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4604253Z "size": 149, 2025-12-04T08:54:19.4604418Z "digest": "sha256:3092fab73b59190b9facfc49bf18f58612172bc2fd68dfa339a1118632616939" 2025-12-04T08:54:19.4604601Z }, 2025-12-04T08:54:19.4604686Z { 2025-12-04T08:54:19.4604823Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4604980Z "size": 136, 2025-12-04T08:54:19.4605146Z "digest": "sha256:20020dd28a15ba092fcbfe906ee39cdddfcc9d0b7eb42fdd6f4c08a984fa9c00" 2025-12-04T08:54:19.4605330Z }, 2025-12-04T08:54:19.4605413Z { 2025-12-04T08:54:19.4605543Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4605700Z "size": 140, 2025-12-04T08:54:19.4605863Z "digest": "sha256:ae5280ce969dcff08c091e9a5f7641f13561b2b0ee44d78b7c3f81d8fe8e6d32" 2025-12-04T08:54:19.4606044Z }, 2025-12-04T08:54:19.4606127Z { 2025-12-04T08:54:19.4606258Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4606419Z "size": 32, 2025-12-04T08:54:19.4606585Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T08:54:19.4606808Z }, 2025-12-04T08:54:19.4606891Z { 2025-12-04T08:54:19.4607026Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4607187Z "size": 222, 2025-12-04T08:54:19.4607352Z "digest": "sha256:fe17d9eb0fd26d3af4c724bf570d833978b131cedb7dc17a800aa388a246b3cd" 2025-12-04T08:54:19.4607533Z }, 2025-12-04T08:54:19.4607620Z { 2025-12-04T08:54:19.4607746Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4607904Z "size": 346, 2025-12-04T08:54:19.4608064Z "digest": "sha256:a51e0dab2d596e6563483f27c12660007160847d177ba4c31812a8f44ada5754" 2025-12-04T08:54:19.4608241Z }, 2025-12-04T08:54:19.4608324Z { 2025-12-04T08:54:19.4608453Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4608611Z "size": 88300, 
2025-12-04T08:54:19.4608779Z "digest": "sha256:6eb176cefd72d37ecbcdf074289a8f1de732d8816cc695ece7e4709d098094d6" 2025-12-04T08:54:19.4608960Z }, 2025-12-04T08:54:19.4609047Z { 2025-12-04T08:54:19.4609175Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4609334Z "size": 106, 2025-12-04T08:54:19.4609495Z "digest": "sha256:e7b8cf2e8d5a4c56db9726ce62c1176032408b3b1c25a000592361cb4245e2b5" 2025-12-04T08:54:19.4609674Z }, 2025-12-04T08:54:19.4609755Z { 2025-12-04T08:54:19.4609881Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4610035Z "size": 1671, 2025-12-04T08:54:19.4610231Z "digest": "sha256:ef3a5060abce88884bc8bd815aa41c46427f34eeb132fe0ddd85a3f86e6dc83d" 2025-12-04T08:54:19.4610410Z }, 2025-12-04T08:54:19.4610486Z { 2025-12-04T08:54:19.4610617Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4610770Z "size": 724, 2025-12-04T08:54:19.4610923Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T08:54:19.4611095Z }, 2025-12-04T08:54:19.4611172Z { 2025-12-04T08:54:19.4621236Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4621422Z "size": 138, 2025-12-04T08:54:19.4621656Z "digest": "sha256:a6f4ec14b42b8f0a83d20aa6a985ddb6a1bf64e0ed3d44afd3484b87d4ed5ad3" 2025-12-04T08:54:19.4621838Z }, 2025-12-04T08:54:19.4621916Z { 2025-12-04T08:54:19.4622046Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4622202Z "size": 119, 2025-12-04T08:54:19.4622362Z "digest": "sha256:7e5a0c956cfbd6f8074fbfd3b1d416e6635d632835ec00c8dd4c015a21da19b4" 2025-12-04T08:54:19.4622540Z }, 2025-12-04T08:54:19.4622617Z { 2025-12-04T08:54:19.4622746Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4622902Z "size": 6238423049, 2025-12-04T08:54:19.4623072Z "digest": "sha256:b4f78730cfe76ce091b78b2e2e3d52be03f1097b3e4c3de5bd79f8d13a853132" 
2025-12-04T08:54:19.4623248Z }, 2025-12-04T08:54:19.4623326Z { 2025-12-04T08:54:19.4623453Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4623609Z "size": 174, 2025-12-04T08:54:19.4623769Z "digest": "sha256:081028f24389b112683689fd362e8c0d6f358082710e72feab91cea6383feb4d" 2025-12-04T08:54:19.4623938Z }, 2025-12-04T08:54:19.4624016Z { 2025-12-04T08:54:19.4624142Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4624295Z "size": 1896, 2025-12-04T08:54:19.4624459Z "digest": "sha256:a534dcf4b9a9e5fabed742c8a8fc43c9cfe7346ea88ab3c177c3b14fd3afe00a" 2025-12-04T08:54:19.4624637Z }, 2025-12-04T08:54:19.4624711Z { 2025-12-04T08:54:19.4624836Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4624992Z "size": 197577597, 2025-12-04T08:54:19.4625154Z "digest": "sha256:2e77500302cc13224427e1d74e471bd79d5109ba6a5099a83df1d10b786f71ba" 2025-12-04T08:54:19.4625323Z }, 2025-12-04T08:54:19.4625400Z { 2025-12-04T08:54:19.4625524Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4625717Z "size": 304, 2025-12-04T08:54:19.4625881Z "digest": "sha256:bc08246bb4ba18c3ec5bc69e16b6b4e929c5bd0f3fae10eeb0b1a622a63d6fa2" 2025-12-04T08:54:19.4626059Z }, 2025-12-04T08:54:19.4626136Z { 2025-12-04T08:54:19.4626262Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4626416Z "size": 32, 2025-12-04T08:54:19.4626578Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T08:54:19.4626753Z }, 2025-12-04T08:54:19.4626829Z { 2025-12-04T08:54:19.4626957Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4627112Z "size": 106, 2025-12-04T08:54:19.4627273Z "digest": "sha256:ff0c473ca120ebdcaa2ba10b3274e82032edd5196019e76d4e7584553704ae81" 2025-12-04T08:54:19.4627447Z }, 2025-12-04T08:54:19.4627523Z { 2025-12-04T08:54:19.4627649Z "mediaType": 
"application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:19.4627803Z "size": 54145662, 2025-12-04T08:54:19.4627978Z "digest": "sha256:6bbc14b250efb3cdaad12c91573c6bb9129ad3e3432f0ed1a7eaebc9958d162f" 2025-12-04T08:54:19.4628155Z } 2025-12-04T08:54:19.4628234Z ] 2025-12-04T08:54:19.4628315Z } 2025-12-04T08:54:19.4628401Z + exit 0 2025-12-04T08:54:19.4644029Z ##[group]Run set -eux 2025-12-04T08:54:19.4644149Z set -eux 2025-12-04T08:54:19.4644311Z # It's ok if this steps fails, it would then be an anonymous user like what we used to have 2025-12-04T08:54:19.4644727Z aws secretsmanager get-secret-value --secret-id docker_hub_readonly_token | jq --raw-output '.SecretString' | jq -r .docker_hub_readonly_token | docker login --username pytorchbot --password-stdin || true 2025-12-04T08:54:19.4649092Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:19.4649237Z env: 2025-12-04T08:54:19.4649329Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:19.4649464Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:19.4649638Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:19.4649808Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:19.4650304Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:19.4650674Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:19.4650789Z AWS_REGION: us-east-1 2025-12-04T08:54:19.4650981Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:19.4651132Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:19.4653598Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:19.4653707Z ##[endgroup] 2025-12-04T08:54:19.4676984Z + aws secretsmanager get-secret-value --secret-id docker_hub_readonly_token 2025-12-04T08:54:19.4677357Z /home/runner/_work/_temp/c8168e12-34bf-453d-9f4c-04bfd4cd87f4.sh: line 3: 
aws: command not found 2025-12-04T08:54:19.4678178Z + jq --raw-output .SecretString 2025-12-04T08:54:19.4678361Z + jq -r .docker_hub_readonly_token 2025-12-04T08:54:19.4680287Z + docker login --username pytorchbot --password-stdin 2025-12-04T08:54:19.4768678Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:19.4776403Z + true 2025-12-04T08:54:19.4836854Z ##[group]Run pytorch/test-infra/.github/actions/pull-docker-image@main 2025-12-04T08:54:19.4837044Z with: 2025-12-04T08:54:19.4837325Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:19.4837657Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:19.4837819Z env: 2025-12-04T08:54:19.4837919Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:19.4838067Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:19.4838255Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:19.4838432Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:19.4838952Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:19.4839358Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:19.4839484Z AWS_REGION: us-east-1 2025-12-04T08:54:19.4839684Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:19.4839849Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:19.4842015Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:19.4842131Z ##[endgroup] 2025-12-04T08:54:19.4849060Z ##[group]Run set -x 2025-12-04T08:54:19.4849185Z set -x 2025-12-04T08:54:19.4849289Z set +e 2025-12-04T08:54:19.4849389Z  2025-12-04T08:54:19.4849488Z login() { 2025-12-04T08:54:19.4849686Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 
2025-12-04T08:54:19.4849882Z } 2025-12-04T08:54:19.4849969Z  2025-12-04T08:54:19.4850066Z retry () { 2025-12-04T08:54:19.4850228Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2025-12-04T08:54:19.4850354Z } 2025-12-04T08:54:19.4850440Z  2025-12-04T08:54:19.4850540Z retry login "${DOCKER_REGISTRY}" 2025-12-04T08:54:19.4850659Z  2025-12-04T08:54:19.4850845Z IMAGE_SIZE=$(docker manifest inspect "${DOCKER_IMAGE}" | jq '[.layers[].size, .config.size] | add / 1024 / 1024') 2025-12-04T08:54:19.4851089Z echo "Compressed size of image in MB: ${IMAGE_SIZE}" 2025-12-04T08:54:19.4851230Z  2025-12-04T08:54:19.4851314Z set -e 2025-12-04T08:54:19.4851450Z # ignore output since only exit code is used for conditional 2025-12-04T08:54:19.4851635Z # only pull docker image if it's not available locally 2025-12-04T08:54:19.4851841Z if ! docker inspect --type=image "${DOCKER_IMAGE}" >/dev/null 2>/dev/null; then 2025-12-04T08:54:19.4852028Z  retry docker pull "${DOCKER_IMAGE}" 2025-12-04T08:54:19.4852156Z fi 2025-12-04T08:54:19.4856131Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:19.4856276Z env: 2025-12-04T08:54:19.4856368Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:19.4856510Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:19.4856686Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:19.4856851Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:19.4857227Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:19.4857597Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:19.4857715Z AWS_REGION: us-east-1 2025-12-04T08:54:19.4857849Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:19.4857999Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:19.4860082Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:19.4860411Z DOCKER_IMAGE: 
308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:19.4860810Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:19.4860962Z ##[endgroup] 2025-12-04T08:54:19.4875230Z + set +e 2025-12-04T08:54:19.4876015Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:19.4876240Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:19.4878693Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:19.4879230Z /home/runner/_work/_temp/96f1cde1-4f16-45be-a263-47ee086ea417.sh: line 5: aws: command not found 2025-12-04T08:54:19.4879801Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:19.4942080Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:19.4948017Z + sleep 1 2025-12-04T08:54:20.4957962Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:20.4961860Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:20.4962475Z /home/runner/_work/_temp/96f1cde1-4f16-45be-a263-47ee086ea417.sh: line 5: aws: command not found 2025-12-04T08:54:20.4963102Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:20.5049846Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:20.5058973Z + sleep 2 2025-12-04T08:54:22.5072614Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:22.5076126Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:22.5076692Z /home/runner/_work/_temp/96f1cde1-4f16-45be-a263-47ee086ea417.sh: line 5: aws: command not found 2025-12-04T08:54:22.5078589Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:22.5142825Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:22.5155427Z ++ docker manifest 
inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:22.5156145Z ++ jq '[.layers[].size, .config.size] | add / 1024 / 1024' 2025-12-04T08:54:23.8905853Z + IMAGE_SIZE=18171.470620155334 2025-12-04T08:54:23.8906245Z + echo 'Compressed size of image in MB: 18171.470620155334' 2025-12-04T08:54:23.8906544Z + set -e 2025-12-04T08:54:23.8907108Z + docker inspect --type=image 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:23.8907717Z Compressed size of image in MB: 18171.470620155334 2025-12-04T08:54:23.9014584Z + retry docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:23.9015348Z + docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:24.9915568Z pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a: Pulling from pytorch/ci-image 2025-12-04T08:54:24.9916176Z 63e5bc7682b8: Pulling fs layer 2025-12-04T08:54:24.9916476Z 835841cca3b7: Pulling fs layer 2025-12-04T08:54:24.9916720Z aac69780afc8: Pulling fs layer 2025-12-04T08:54:24.9916963Z 029495b23122: Pulling fs layer 2025-12-04T08:54:24.9917203Z d0fb85b00833: Pulling fs layer 2025-12-04T08:54:24.9917439Z 59b639308833: Pulling fs layer 2025-12-04T08:54:24.9917677Z dc112c89d57a: Pulling fs layer 2025-12-04T08:54:24.9917910Z 522eab2402e5: Pulling fs layer 2025-12-04T08:54:24.9918145Z 2b5a11b41761: Pulling fs layer 2025-12-04T08:54:24.9927960Z 9681563a88ff: Pulling fs layer 2025-12-04T08:54:24.9928302Z 73e33534e9eb: Pulling fs layer 2025-12-04T08:54:24.9928581Z 5bfdaeb5578d: Pulling fs layer 2025-12-04T08:54:24.9928837Z c07d27e4d3a5: Pulling fs layer 2025-12-04T08:54:24.9929084Z 
b21856d1bf42: Pulling fs layer 2025-12-04T08:54:24.9929357Z cb19d84867e4: Pulling fs layer 2025-12-04T08:54:24.9930052Z 8165374f8dcc: Pulling fs layer 2025-12-04T08:54:24.9931558Z 1aecc77354ce: Pulling fs layer 2025-12-04T08:54:24.9932137Z 465d3fd643aa: Pulling fs layer 2025-12-04T08:54:24.9933049Z 6c503e779d6f: Pulling fs layer 2025-12-04T08:54:24.9933357Z f7e9a021f0ee: Pulling fs layer 2025-12-04T08:54:24.9933656Z 8e023b349080: Pulling fs layer 2025-12-04T08:54:24.9933941Z 8188df80e595: Pulling fs layer 2025-12-04T08:54:24.9934231Z 3c2c2f8c74bf: Pulling fs layer 2025-12-04T08:54:24.9934524Z 2aa7784fbe33: Pulling fs layer 2025-12-04T08:54:24.9934818Z 2b3b5215d3eb: Pulling fs layer 2025-12-04T08:54:24.9935107Z 99b1f1ea3e85: Pulling fs layer 2025-12-04T08:54:24.9935402Z 18d6daba0a57: Pulling fs layer 2025-12-04T08:54:24.9935693Z 5277f2a503eb: Pulling fs layer 2025-12-04T08:54:24.9935977Z 3198a9717aac: Pulling fs layer 2025-12-04T08:54:24.9936267Z 99a4918e5808: Pulling fs layer 2025-12-04T08:54:24.9936553Z 15bb11dfc6ac: Pulling fs layer 2025-12-04T08:54:24.9937019Z bd87c8766e90: Pulling fs layer 2025-12-04T08:54:24.9937306Z 1969e15d0c13: Pulling fs layer 2025-12-04T08:54:24.9937590Z 24a03847d382: Pulling fs layer 2025-12-04T08:54:24.9937870Z 816e2e34e018: Pulling fs layer 2025-12-04T08:54:24.9938164Z b168858b8537: Pulling fs layer 2025-12-04T08:54:24.9938454Z 6b8d5ff02e26: Pulling fs layer 2025-12-04T08:54:24.9938743Z 4e3b10a5dd6a: Pulling fs layer 2025-12-04T08:54:24.9939025Z 73e33534e9eb: Waiting 2025-12-04T08:54:24.9939293Z 5bfdaeb5578d: Waiting 2025-12-04T08:54:24.9939549Z d0fb85b00833: Waiting 2025-12-04T08:54:24.9939789Z 029495b23122: Waiting 2025-12-04T08:54:24.9940028Z dc112c89d57a: Waiting 2025-12-04T08:54:24.9940352Z 522eab2402e5: Waiting 2025-12-04T08:54:24.9940588Z 2b5a11b41761: Waiting 2025-12-04T08:54:24.9940832Z 9681563a88ff: Waiting 2025-12-04T08:54:24.9941070Z 6c503e779d6f: Waiting 2025-12-04T08:54:24.9941307Z 8165374f8dcc: Waiting 
2025-12-04T08:54:24.9941550Z 8188df80e595: Waiting 2025-12-04T08:54:24.9941789Z 2aa7784fbe33: Waiting 2025-12-04T08:54:24.9941967Z 465d3fd643aa: Waiting 2025-12-04T08:54:24.9942141Z 18d6daba0a57: Waiting 2025-12-04T08:54:24.9942309Z bd87c8766e90: Waiting 2025-12-04T08:54:24.9942479Z 99a4918e5808: Waiting 2025-12-04T08:54:24.9942656Z 1969e15d0c13: Waiting 2025-12-04T08:54:24.9942822Z 59b639308833: Waiting 2025-12-04T08:54:24.9943000Z c07d27e4d3a5: Waiting 2025-12-04T08:54:24.9943174Z b21856d1bf42: Waiting 2025-12-04T08:54:24.9943349Z 4e3b10a5dd6a: Waiting 2025-12-04T08:54:24.9943526Z 2b3b5215d3eb: Waiting 2025-12-04T08:54:24.9943697Z 8e023b349080: Waiting 2025-12-04T08:54:24.9943869Z 1aecc77354ce: Waiting 2025-12-04T08:54:24.9944073Z 24a03847d382: Waiting 2025-12-04T08:54:24.9944249Z 15bb11dfc6ac: Waiting 2025-12-04T08:54:24.9944425Z 3198a9717aac: Waiting 2025-12-04T08:54:24.9944603Z 6b8d5ff02e26: Waiting 2025-12-04T08:54:24.9944772Z cb19d84867e4: Waiting 2025-12-04T08:54:24.9944950Z 816e2e34e018: Waiting 2025-12-04T08:54:24.9945123Z 3c2c2f8c74bf: Waiting 2025-12-04T08:54:24.9945294Z 99b1f1ea3e85: Waiting 2025-12-04T08:54:24.9945472Z b168858b8537: Waiting 2025-12-04T08:54:24.9945639Z 5277f2a503eb: Waiting 2025-12-04T08:54:24.9945828Z 3092fab73b59: Pulling fs layer 2025-12-04T08:54:24.9946028Z f7e9a021f0ee: Waiting 2025-12-04T08:54:24.9946213Z 20020dd28a15: Pulling fs layer 2025-12-04T08:54:24.9946418Z ae5280ce969d: Pulling fs layer 2025-12-04T08:54:24.9946614Z 20020dd28a15: Waiting 2025-12-04T08:54:24.9946799Z 4f4fb700ef54: Pulling fs layer 2025-12-04T08:54:24.9946994Z ae5280ce969d: Waiting 2025-12-04T08:54:24.9947176Z fe17d9eb0fd2: Pulling fs layer 2025-12-04T08:54:24.9947377Z 4f4fb700ef54: Waiting 2025-12-04T08:54:24.9947556Z a51e0dab2d59: Pulling fs layer 2025-12-04T08:54:24.9947756Z fe17d9eb0fd2: Waiting 2025-12-04T08:54:24.9947942Z 6eb176cefd72: Pulling fs layer 2025-12-04T08:54:24.9948148Z e7b8cf2e8d5a: Pulling fs layer 2025-12-04T08:54:24.9948371Z 
ef3a5060abce: Pulling fs layer 2025-12-04T08:54:24.9948567Z a51e0dab2d59: Waiting 2025-12-04T08:54:24.9948745Z 6eb176cefd72: Waiting 2025-12-04T08:54:24.9948919Z ef3a5060abce: Waiting 2025-12-04T08:54:24.9949101Z e7b8cf2e8d5a: Waiting 2025-12-04T08:54:24.9949284Z a6f4ec14b42b: Pulling fs layer 2025-12-04T08:54:24.9949490Z 7e5a0c956cfb: Pulling fs layer 2025-12-04T08:54:24.9949694Z b4f78730cfe7: Pulling fs layer 2025-12-04T08:54:24.9949970Z 081028f24389: Pulling fs layer 2025-12-04T08:54:24.9950219Z 7e5a0c956cfb: Waiting 2025-12-04T08:54:24.9950397Z a6f4ec14b42b: Waiting 2025-12-04T08:54:24.9950582Z a534dcf4b9a9: Pulling fs layer 2025-12-04T08:54:24.9950780Z 081028f24389: Waiting 2025-12-04T08:54:24.9950899Z b4f78730cfe7: Waiting 2025-12-04T08:54:24.9951064Z 2e77500302cc: Pulling fs layer 2025-12-04T08:54:24.9951216Z bc08246bb4ba: Pulling fs layer 2025-12-04T08:54:24.9951354Z 2e77500302cc: Waiting 2025-12-04T08:54:24.9951482Z a534dcf4b9a9: Waiting 2025-12-04T08:54:24.9951608Z bc08246bb4ba: Waiting 2025-12-04T08:54:24.9951746Z ff0c473ca120: Pulling fs layer 2025-12-04T08:54:24.9951896Z 6bbc14b250ef: Pulling fs layer 2025-12-04T08:54:24.9952041Z ff0c473ca120: Waiting 2025-12-04T08:54:24.9952219Z 6bbc14b250ef: Waiting 2025-12-04T08:54:26.6586070Z 835841cca3b7: Verifying Checksum 2025-12-04T08:54:26.6586514Z 835841cca3b7: Download complete 2025-12-04T08:54:27.2284013Z 029495b23122: Verifying Checksum 2025-12-04T08:54:27.2285281Z 029495b23122: Download complete 2025-12-04T08:54:27.7309460Z 63e5bc7682b8: Verifying Checksum 2025-12-04T08:54:27.7309860Z 63e5bc7682b8: Download complete 2025-12-04T08:54:27.8099860Z d0fb85b00833: Verifying Checksum 2025-12-04T08:54:27.8100237Z d0fb85b00833: Download complete 2025-12-04T08:54:28.2551320Z 63e5bc7682b8: Pull complete 2025-12-04T08:54:28.2639237Z 835841cca3b7: Pull complete 2025-12-04T08:54:28.3238328Z 59b639308833: Verifying Checksum 2025-12-04T08:54:28.3239738Z 59b639308833: Download complete 2025-12-04T08:54:28.8996159Z 
522eab2402e5: Verifying Checksum 2025-12-04T08:54:28.8996564Z 522eab2402e5: Download complete 2025-12-04T08:54:29.4675578Z 2b5a11b41761: Download complete 2025-12-04T08:54:30.0382684Z 9681563a88ff: Download complete 2025-12-04T08:54:31.2972289Z dc112c89d57a: Verifying Checksum 2025-12-04T08:54:31.2972703Z dc112c89d57a: Download complete 2025-12-04T08:54:31.8884700Z 5bfdaeb5578d: Verifying Checksum 2025-12-04T08:54:31.8885127Z 5bfdaeb5578d: Download complete 2025-12-04T08:54:32.7576573Z c07d27e4d3a5: Verifying Checksum 2025-12-04T08:54:32.7576901Z c07d27e4d3a5: Download complete 2025-12-04T08:54:33.3914755Z b21856d1bf42: Verifying Checksum 2025-12-04T08:54:33.3915156Z b21856d1bf42: Download complete 2025-12-04T08:54:33.9236623Z aac69780afc8: Verifying Checksum 2025-12-04T08:54:33.9236783Z aac69780afc8: Download complete 2025-12-04T08:54:34.0539899Z cb19d84867e4: Verifying Checksum 2025-12-04T08:54:34.0540059Z cb19d84867e4: Download complete 2025-12-04T08:54:34.5277804Z 8165374f8dcc: Verifying Checksum 2025-12-04T08:54:34.5278038Z 8165374f8dcc: Download complete 2025-12-04T08:54:35.1287361Z 465d3fd643aa: Download complete 2025-12-04T08:54:35.7425888Z 6c503e779d6f: Verifying Checksum 2025-12-04T08:54:35.7426232Z 6c503e779d6f: Download complete 2025-12-04T08:54:36.3528925Z f7e9a021f0ee: Verifying Checksum 2025-12-04T08:54:36.3529255Z f7e9a021f0ee: Download complete 2025-12-04T08:54:36.9941423Z 8e023b349080: Verifying Checksum 2025-12-04T08:54:36.9941666Z 8e023b349080: Download complete 2025-12-04T08:54:37.9553255Z aac69780afc8: Pull complete 2025-12-04T08:54:37.9604684Z 029495b23122: Pull complete 2025-12-04T08:54:37.9655216Z d0fb85b00833: Pull complete 2025-12-04T08:54:37.9696065Z 59b639308833: Pull complete 2025-12-04T08:54:39.0513315Z dc112c89d57a: Pull complete 2025-12-04T08:54:39.0557528Z 522eab2402e5: Pull complete 2025-12-04T08:54:39.0606529Z 2b5a11b41761: Pull complete 2025-12-04T08:54:39.0666482Z 9681563a88ff: Pull complete 2025-12-04T08:54:45.7751958Z 
1aecc77354ce: Verifying Checksum 2025-12-04T08:54:45.7752429Z 1aecc77354ce: Download complete 2025-12-04T08:54:46.4057104Z 3c2c2f8c74bf: Verifying Checksum 2025-12-04T08:54:46.4057452Z 3c2c2f8c74bf: Download complete 2025-12-04T08:54:47.0712739Z 2aa7784fbe33: Verifying Checksum 2025-12-04T08:54:47.0713234Z 2aa7784fbe33: Download complete 2025-12-04T08:54:47.7286530Z 2b3b5215d3eb: Verifying Checksum 2025-12-04T08:54:47.7286930Z 2b3b5215d3eb: Download complete 2025-12-04T08:55:34.1993266Z 99b1f1ea3e85: Verifying Checksum 2025-12-04T08:55:34.1994351Z 99b1f1ea3e85: Download complete 2025-12-04T08:55:34.7769671Z 18d6daba0a57: Download complete 2025-12-04T08:55:35.3732730Z 5277f2a503eb: Verifying Checksum 2025-12-04T08:55:35.3733110Z 5277f2a503eb: Download complete 2025-12-04T08:55:35.9628826Z 3198a9717aac: Verifying Checksum 2025-12-04T08:55:35.9629158Z 3198a9717aac: Download complete 2025-12-04T08:55:36.5555619Z 99a4918e5808: Verifying Checksum 2025-12-04T08:55:36.5556033Z 99a4918e5808: Download complete 2025-12-04T08:55:37.6735003Z 15bb11dfc6ac: Verifying Checksum 2025-12-04T08:55:37.6735229Z 15bb11dfc6ac: Download complete 2025-12-04T08:55:38.2515189Z bd87c8766e90: Download complete 2025-12-04T08:55:38.8160833Z 1969e15d0c13: Verifying Checksum 2025-12-04T08:55:38.8164429Z 1969e15d0c13: Download complete 2025-12-04T08:55:40.5571495Z 24a03847d382: Verifying Checksum 2025-12-04T08:55:40.5571813Z 24a03847d382: Download complete 2025-12-04T08:55:41.1400815Z 816e2e34e018: Verifying Checksum 2025-12-04T08:55:41.1402901Z 816e2e34e018: Download complete 2025-12-04T08:55:41.7396006Z b168858b8537: Verifying Checksum 2025-12-04T08:55:41.7396295Z b168858b8537: Download complete 2025-12-04T08:55:43.3314072Z 6b8d5ff02e26: Verifying Checksum 2025-12-04T08:55:43.3314491Z 6b8d5ff02e26: Download complete 2025-12-04T08:55:43.9152521Z 4e3b10a5dd6a: Verifying Checksum 2025-12-04T08:55:43.9152856Z 4e3b10a5dd6a: Download complete 2025-12-04T08:55:44.4899222Z 3092fab73b59: Verifying Checksum 
2025-12-04T08:55:44.4899600Z 3092fab73b59: Download complete 2025-12-04T08:55:45.0840418Z 20020dd28a15: Verifying Checksum 2025-12-04T08:55:45.0840790Z 20020dd28a15: Download complete 2025-12-04T08:55:45.6854807Z ae5280ce969d: Verifying Checksum 2025-12-04T08:55:45.6855217Z ae5280ce969d: Download complete 2025-12-04T08:55:45.9902084Z 4f4fb700ef54: Verifying Checksum 2025-12-04T08:55:46.5846729Z fe17d9eb0fd2: Verifying Checksum 2025-12-04T08:55:46.5847195Z fe17d9eb0fd2: Download complete 2025-12-04T08:55:47.3311781Z a51e0dab2d59: Verifying Checksum 2025-12-04T08:55:47.3312027Z a51e0dab2d59: Download complete 2025-12-04T08:55:48.3034684Z 6eb176cefd72: Verifying Checksum 2025-12-04T08:55:48.3047896Z 6eb176cefd72: Download complete 2025-12-04T08:55:49.0412827Z e7b8cf2e8d5a: Verifying Checksum 2025-12-04T08:55:49.0413244Z e7b8cf2e8d5a: Download complete 2025-12-04T08:55:49.7556982Z ef3a5060abce: Verifying Checksum 2025-12-04T08:55:49.7557249Z ef3a5060abce: Download complete 2025-12-04T08:55:50.4777200Z a6f4ec14b42b: Verifying Checksum 2025-12-04T08:55:51.1874270Z 7e5a0c956cfb: Verifying Checksum 2025-12-04T08:55:51.1874640Z 7e5a0c956cfb: Download complete 2025-12-04T09:08:02.4548316Z 73e33534e9eb: Download complete 2025-12-04T09:08:03.0751678Z 081028f24389: Download complete 2025-12-04T09:08:03.6749107Z a534dcf4b9a9: Verifying Checksum 2025-12-04T09:08:03.6749553Z a534dcf4b9a9: Download complete 2025-12-04T09:08:19.2578291Z 2e77500302cc: Verifying Checksum 2025-12-04T09:08:19.2578753Z 2e77500302cc: Download complete 2025-12-04T09:08:19.9682894Z bc08246bb4ba: Download complete 2025-12-04T09:08:20.6490949Z ff0c473ca120: Verifying Checksum 2025-12-04T09:08:20.6491287Z ff0c473ca120: Download complete 2025-12-04T09:08:23.6451337Z 6bbc14b250ef: Verifying Checksum 2025-12-04T09:08:23.6451657Z 6bbc14b250ef: Download complete 2025-12-04T09:08:25.5356156Z 73e33534e9eb: Pull complete 2025-12-04T09:08:25.5505102Z 5bfdaeb5578d: Pull complete 2025-12-04T09:08:25.5684989Z 
c07d27e4d3a5: Pull complete 2025-12-04T09:08:25.5760212Z b21856d1bf42: Pull complete 2025-12-04T09:08:25.5812079Z cb19d84867e4: Pull complete 2025-12-04T09:08:25.5860205Z 8165374f8dcc: Pull complete 2025-12-04T09:08:29.4048472Z 1aecc77354ce: Pull complete 2025-12-04T09:08:29.4182406Z 465d3fd643aa: Pull complete 2025-12-04T09:08:29.4242171Z 6c503e779d6f: Pull complete 2025-12-04T09:08:29.4482513Z f7e9a021f0ee: Pull complete 2025-12-04T09:08:29.4544414Z 8e023b349080: Pull complete 2025-12-04T09:15:08.4411136Z 8188df80e595: Verifying Checksum 2025-12-04T09:15:08.4411515Z 8188df80e595: Download complete 2025-12-04T09:15:56.4780885Z 8188df80e595: Pull complete 2025-12-04T09:15:56.4848382Z 3c2c2f8c74bf: Pull complete 2025-12-04T09:15:56.4905823Z 2aa7784fbe33: Pull complete 2025-12-04T09:15:56.4965740Z 2b3b5215d3eb: Pull complete 2025-12-04T09:16:02.0195925Z 99b1f1ea3e85: Pull complete 2025-12-04T09:16:02.0279491Z 18d6daba0a57: Pull complete 2025-12-04T09:16:02.0359715Z 5277f2a503eb: Pull complete 2025-12-04T09:16:02.0437188Z 3198a9717aac: Pull complete 2025-12-04T09:16:02.0660380Z 99a4918e5808: Pull complete 2025-12-04T09:16:02.0956848Z 15bb11dfc6ac: Pull complete 2025-12-04T09:16:02.0995041Z bd87c8766e90: Pull complete 2025-12-04T09:16:02.1268720Z 1969e15d0c13: Pull complete 2025-12-04T09:16:02.3372837Z 24a03847d382: Pull complete 2025-12-04T09:16:02.3449668Z 816e2e34e018: Pull complete 2025-12-04T09:16:02.3513269Z b168858b8537: Pull complete 2025-12-04T09:16:02.4507940Z 6b8d5ff02e26: Pull complete 2025-12-04T09:16:02.4644792Z 4e3b10a5dd6a: Pull complete 2025-12-04T09:16:02.4759570Z 3092fab73b59: Pull complete 2025-12-04T09:16:02.4809764Z 20020dd28a15: Pull complete 2025-12-04T09:16:02.4846294Z ae5280ce969d: Pull complete 2025-12-04T09:16:02.4886431Z 4f4fb700ef54: Pull complete 2025-12-04T09:16:02.4924777Z fe17d9eb0fd2: Pull complete 2025-12-04T09:16:02.4982132Z a51e0dab2d59: Pull complete 2025-12-04T09:16:02.5231468Z 6eb176cefd72: Pull complete 
2025-12-04T09:16:02.5284102Z e7b8cf2e8d5a: Pull complete 2025-12-04T09:16:02.5310490Z ef3a5060abce: Pull complete 2025-12-04T09:16:02.5398224Z a6f4ec14b42b: Pull complete 2025-12-04T09:16:02.5427351Z 7e5a0c956cfb: Pull complete 2025-12-04T09:17:12.3535832Z b4f78730cfe7: Verifying Checksum 2025-12-04T09:17:12.3536002Z b4f78730cfe7: Download complete 2025-12-04T09:17:51.6294719Z b4f78730cfe7: Pull complete 2025-12-04T09:17:51.6335717Z 081028f24389: Pull complete 2025-12-04T09:17:51.6408568Z a534dcf4b9a9: Pull complete 2025-12-04T09:17:54.3053817Z 2e77500302cc: Pull complete 2025-12-04T09:17:54.3109266Z bc08246bb4ba: Pull complete 2025-12-04T09:17:54.3187179Z ff0c473ca120: Pull complete 2025-12-04T09:17:55.0176095Z 6bbc14b250ef: Pull complete 2025-12-04T09:17:55.0203119Z Digest: sha256:5e190224966743059cf8506170eaec525eada34e38cf646e02d1dbeadfe5a366 2025-12-04T09:17:55.0211967Z Status: Downloaded newer image for 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:17:55.0220383Z 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:17:55.0274490Z Prepare all required actions 2025-12-04T09:17:55.0289139Z ##[group]Run ./.github/actions/get-workflow-job-id 2025-12-04T09:17:55.0289279Z with: 2025-12-04T09:17:55.0289588Z github-token: *** 2025-12-04T09:17:55.0289685Z env: 2025-12-04T09:17:55.0289795Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:17:55.0289938Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:17:55.0290167Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:17:55.0290335Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:17:55.0290724Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE 
--security-opt seccomp=unconfined --network=host 2025-12-04T09:17:55.0291102Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:17:55.0291222Z AWS_REGION: us-east-1 2025-12-04T09:17:55.0291346Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:17:55.0291508Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:17:55.0293586Z AWS_SESSION_TOKEN: *** 2025-12-04T09:17:55.0293694Z ##[endgroup] 2025-12-04T09:17:55.0300243Z ##[group]Run set -eux 2025-12-04T09:17:55.0300376Z set -eux 2025-12-04T09:17:55.0300554Z python3 .github/scripts/get_workflow_job_id.py "${GITHUB_RUN_ID}" "${RUNNER_NAME}" 2025-12-04T09:17:55.0304686Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:17:55.0304833Z env: 2025-12-04T09:17:55.0304926Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:17:55.0305062Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:17:55.0305237Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:17:55.0305403Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:17:55.0305782Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:17:55.0306149Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:17:55.0306264Z AWS_REGION: us-east-1 2025-12-04T09:17:55.0306436Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:17:55.0306598Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:17:55.0308799Z AWS_SESSION_TOKEN: *** 2025-12-04T09:17:55.0308953Z GITHUB_TOKEN: *** 2025-12-04T09:17:55.0309055Z ##[endgroup] 2025-12-04T09:17:55.0327569Z + python3 .github/scripts/get_workflow_job_id.py 19922849170 linux.rocm.gpu.gfx942.1.b-gwk9b-runner-kfwnw 2025-12-04T09:17:56.1183496Z Setting output job-id=57116213140 2025-12-04T09:17:56.1184328Z Setting output job-name=linux-jammy-rocm-py3.10 / test (default, 2, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:17:56.1324309Z Prepare 
all required actions 2025-12-04T09:17:56.1324554Z Getting action download info 2025-12-04T09:17:56.5108067Z Download action repository 'seemethere/download-artifact-s3@v4' (SHA:1da556a7aa0a088e3153970611f6c432d58e80e6) 2025-12-04T09:17:57.6551008Z Download action repository 'actions/download-artifact@v4' (SHA:d3f86a106a0bac45b974a628896c90dbdf5c8093) 2025-12-04T09:17:58.7084367Z ##[group]Run ./.github/actions/download-build-artifacts 2025-12-04T09:17:58.7084537Z with: 2025-12-04T09:17:58.7084662Z name: linux-jammy-rocm-py3.10 2025-12-04T09:17:58.7084790Z s3-bucket: gha-artifacts 2025-12-04T09:17:58.7084902Z env: 2025-12-04T09:17:58.7084999Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:17:58.7085141Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:17:58.7085322Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:17:58.7085492Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:17:58.7085901Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:17:58.7086278Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:17:58.7086398Z AWS_REGION: us-east-1 2025-12-04T09:17:58.7086561Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:17:58.7086716Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:17:58.7088778Z AWS_SESSION_TOKEN: *** 2025-12-04T09:17:58.7088889Z ##[endgroup] 2025-12-04T09:17:58.7102628Z ##[group]Run seemethere/download-artifact-s3@v4 2025-12-04T09:17:58.7102762Z with: 2025-12-04T09:17:58.7102862Z name: linux-jammy-rocm-py3.10 2025-12-04T09:17:58.7102983Z s3-bucket: gha-artifacts 2025-12-04T09:17:58.7103089Z region: us-east-1 2025-12-04T09:17:58.7103183Z env: 2025-12-04T09:17:58.7103273Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:17:58.7103405Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:17:58.7103582Z RUNNER_TEST_RESULTS_DIR: 
/home/runner/_work/_temp/test-results 2025-12-04T09:17:58.7103744Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:17:58.7104126Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:17:58.7104493Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:17:58.7104605Z AWS_REGION: us-east-1 2025-12-04T09:17:58.7104743Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:17:58.7104894Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:17:58.7106956Z AWS_SESSION_TOKEN: *** 2025-12-04T09:17:58.7107058Z ##[endgroup] 2025-12-04T09:17:58.9710883Z (node:17164) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2025-12-04T09:17:58.9712472Z 2025-12-04T09:17:58.9713215Z Please migrate your code to use AWS SDK for JavaScript (v3). 2025-12-04T09:17:58.9713532Z For more information, check the migration guide at https://a.co/7PzMCcy 2025-12-04T09:17:58.9713771Z (Use `node --trace-warnings ...` to show where the warning was created) 2025-12-04T09:17:59.2399804Z Found 1 objects with prefix pytorch/pytorch/19922849170/linux-jammy-rocm-py3.10/ 2025-12-04T09:17:59.2400070Z Starting download (1/1): /home/runner/_work/pytorch/pytorch/artifacts.zip 2025-12-04T09:18:46.2708186Z Finished download (1/1): /home/runner/_work/pytorch/pytorch/artifacts.zip 2025-12-04T09:18:46.2712046Z Artifact download has finished successfully 2025-12-04T09:18:46.3043029Z ##[group]Run unzip -o artifacts.zip 2025-12-04T09:18:46.3043200Z unzip -o artifacts.zip 2025-12-04T09:18:46.3047277Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:18:46.3047438Z env: 2025-12-04T09:18:46.3047539Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:46.3047690Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:46.3048073Z RUNNER_TEST_RESULTS_DIR: 
/home/runner/_work/_temp/test-results 2025-12-04T09:18:46.3048251Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:46.3048643Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:46.3049024Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:46.3049146Z AWS_REGION: us-east-1 2025-12-04T09:18:46.3049317Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:46.3049487Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:46.3051611Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:46.3051722Z ##[endgroup] 2025-12-04T09:18:46.3092831Z Archive: artifacts.zip 2025-12-04T09:18:46.3093528Z creating: dist/ 2025-12-04T09:18:46.3176994Z inflating: dist/.ninja_log 2025-12-04T09:18:49.4936916Z inflating: dist/torch-2.10.0a0+gitffd9b0f-cp310-cp310-linux_x86_64.whl 2025-12-04T09:18:49.4940240Z creating: build/ 2025-12-04T09:18:49.4940654Z creating: build/custom_test_artifacts/ 2025-12-04T09:18:49.4941005Z creating: build/custom_test_artifacts/custom-op-build/ 2025-12-04T09:18:49.4941384Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/ 2025-12-04T09:18:49.4941849Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:18:49.4942312Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:18:49.4942773Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/ 2025-12-04T09:18:49.4943226Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:18:49.4943696Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:18:49.4944153Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:18:49.4944681Z inflating: 
build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:18:49.4945208Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:18:49.4945702Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:18:49.4946184Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:18:49.4946651Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:18:49.4947200Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:18:49.4947749Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:18:49.4948262Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:18:49.4948809Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:18:49.4949400Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:18:49.4949893Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:18:49.4950343Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:18:49.4950761Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/cmake.check_cache 2025-12-04T09:18:49.4954448Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/ 2025-12-04T09:18:49.4954808Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.ts 2025-12-04T09:18:49.4955213Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.make 2025-12-04T09:18:49.4955809Z inflating: 
build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/depend.make 2025-12-04T09:18:49.4956176Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/link.txt 2025-12-04T09:18:49.4956557Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/cmake_clean.cmake 2025-12-04T09:18:49.4956934Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/build.make 2025-12-04T09:18:49.4957311Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/DependInfo.cmake 2025-12-04T09:18:49.4957689Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/flags.make 2025-12-04T09:18:49.4958101Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/progress.make 2025-12-04T09:18:49.4962541Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o.d 2025-12-04T09:18:49.5098111Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o 2025-12-04T09:18:49.5101365Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/ 2025-12-04T09:18:49.5101888Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.ts 2025-12-04T09:18:49.5102451Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.make 2025-12-04T09:18:49.5102971Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/depend.make 2025-12-04T09:18:49.5103465Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/link.txt 2025-12-04T09:18:49.5103979Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/cmake_clean.cmake 2025-12-04T09:18:49.5104486Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/build.make 2025-12-04T09:18:49.5104997Z inflating: 
build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/DependInfo.cmake 2025-12-04T09:18:49.5105503Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/flags.make 2025-12-04T09:18:49.5106008Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/progress.make 2025-12-04T09:18:49.5119267Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o.d 2025-12-04T09:18:49.5165384Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o 2025-12-04T09:18:49.5165830Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:18:49.5166217Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:18:49.5166565Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/progress.marks 2025-12-04T09:18:49.5166910Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile2 2025-12-04T09:18:49.5167229Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile.cmake 2025-12-04T09:18:49.5167571Z inflating: build/custom_test_artifacts/custom-op-build/hipblaslt_test_outer_vec.cc 2025-12-04T09:18:49.5167901Z inflating: build/custom_test_artifacts/custom-op-build/hipblaslt_test_vec_ext.cc 2025-12-04T09:18:49.5168234Z inflating: build/custom_test_artifacts/custom-op-build/CMakeCache.txt 2025-12-04T09:18:49.5168603Z inflating: build/custom_test_artifacts/custom-op-build/Makefile 2025-12-04T09:18:49.5169923Z inflating: build/custom_test_artifacts/custom-op-build/cmake_install.cmake 2025-12-04T09:18:49.5266503Z inflating: build/custom_test_artifacts/custom-op-build/libcustom_ops.so 2025-12-04T09:18:49.5297453Z inflating: build/custom_test_artifacts/custom-op-build/test_custom_ops 2025-12-04T09:18:49.5297708Z creating: build/custom_test_artifacts/jit-hook-build/ 
2025-12-04T09:18:49.5298111Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/ 2025-12-04T09:18:49.5298380Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:18:49.5300321Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:18:49.5300620Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/ 2025-12-04T09:18:49.5300909Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:18:49.5301226Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:18:49.5301552Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:18:49.5302114Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:18:49.5302945Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:18:49.5303303Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:18:49.5303631Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:18:49.5303941Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:18:49.5305001Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:18:49.5305521Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:18:49.5305944Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:18:49.5307224Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:18:49.5307753Z inflating: 
build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:18:49.5308115Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:18:49.5308385Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:18:49.5308683Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/cmake.check_cache 2025-12-04T09:18:49.5308985Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/ 2025-12-04T09:18:49.5309335Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.ts 2025-12-04T09:18:49.5309729Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.make 2025-12-04T09:18:49.5310153Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/depend.make 2025-12-04T09:18:49.5310440Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/link.txt 2025-12-04T09:18:49.5310742Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/cmake_clean.cmake 2025-12-04T09:18:49.5311047Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/build.make 2025-12-04T09:18:49.5311353Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/DependInfo.cmake 2025-12-04T09:18:49.5311651Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/flags.make 2025-12-04T09:18:49.5311944Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/progress.make 2025-12-04T09:18:49.5324782Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o.d 2025-12-04T09:18:49.5358518Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o 2025-12-04T09:18:49.5358834Z inflating: 
build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:18:49.5359197Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:18:49.5359453Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/progress.marks 2025-12-04T09:18:49.5359691Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile2 2025-12-04T09:18:49.5360183Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile.cmake 2025-12-04T09:18:49.5360430Z inflating: build/custom_test_artifacts/jit-hook-build/hipblaslt_test_outer_vec.cc 2025-12-04T09:18:49.5360686Z inflating: build/custom_test_artifacts/jit-hook-build/hipblaslt_test_vec_ext.cc 2025-12-04T09:18:49.5361585Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeCache.txt 2025-12-04T09:18:49.5362054Z inflating: build/custom_test_artifacts/jit-hook-build/Makefile 2025-12-04T09:18:49.5362272Z inflating: build/custom_test_artifacts/jit-hook-build/cmake_install.cmake 2025-12-04T09:18:49.5382822Z inflating: build/custom_test_artifacts/jit-hook-build/test_jit_hooks 2025-12-04T09:18:49.5383024Z creating: build/custom_test_artifacts/custom-backend-build/ 2025-12-04T09:18:49.5383219Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/ 2025-12-04T09:18:49.5383451Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:18:49.5386150Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:18:49.5386418Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/ 2025-12-04T09:18:49.5386676Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:18:49.5386951Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:18:49.5387221Z creating: 
build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:18:49.5388146Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:18:49.5388827Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:18:49.5389125Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:18:49.5389416Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:18:49.5389691Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:18:49.5390827Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:18:49.5391397Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:18:49.5391809Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:18:49.5392849Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:18:49.5393709Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:18:49.5394020Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:18:49.5394262Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:18:49.5394509Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/cmake.check_cache 2025-12-04T09:18:49.5394830Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/ 2025-12-04T09:18:49.5395122Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.ts 
2025-12-04T09:18:49.5395539Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.make 2025-12-04T09:18:49.5395855Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/depend.make 2025-12-04T09:18:49.5396154Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/link.txt 2025-12-04T09:18:49.5396474Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/cmake_clean.cmake 2025-12-04T09:18:49.5396784Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/build.make 2025-12-04T09:18:49.5397098Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/DependInfo.cmake 2025-12-04T09:18:49.5397407Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/flags.make 2025-12-04T09:18:49.5397707Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/progress.make 2025-12-04T09:18:49.5398504Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o.d 2025-12-04T09:18:49.5463722Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o 2025-12-04T09:18:49.5464037Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/ 2025-12-04T09:18:49.5464371Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.ts 2025-12-04T09:18:49.5464718Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.make 2025-12-04T09:18:49.5465061Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/depend.make 2025-12-04T09:18:49.5465373Z inflating: 
build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/link.txt 2025-12-04T09:18:49.5465697Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/cmake_clean.cmake 2025-12-04T09:18:49.5466023Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/build.make 2025-12-04T09:18:49.5466344Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/DependInfo.cmake 2025-12-04T09:18:49.5466669Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/flags.make 2025-12-04T09:18:49.5466987Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/progress.make 2025-12-04T09:18:49.5477550Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o.d 2025-12-04T09:18:49.5507657Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o 2025-12-04T09:18:49.5508008Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:18:49.5508310Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:18:49.5508583Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/progress.marks 2025-12-04T09:18:49.5508841Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile2 2025-12-04T09:18:49.5509188Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile.cmake 2025-12-04T09:18:49.5509517Z inflating: build/custom_test_artifacts/custom-backend-build/hipblaslt_test_outer_vec.cc 2025-12-04T09:18:49.5509786Z inflating: build/custom_test_artifacts/custom-backend-build/hipblaslt_test_vec_ext.cc 2025-12-04T09:18:49.5510746Z inflating: 
build/custom_test_artifacts/custom-backend-build/CMakeCache.txt 2025-12-04T09:18:49.5510972Z inflating: build/custom_test_artifacts/custom-backend-build/Makefile 2025-12-04T09:18:49.5511265Z inflating: build/custom_test_artifacts/custom-backend-build/cmake_install.cmake 2025-12-04T09:18:49.5566590Z inflating: build/custom_test_artifacts/custom-backend-build/libcustom_backend.so 2025-12-04T09:18:49.5587574Z inflating: build/custom_test_artifacts/custom-backend-build/test_custom_backend 2025-12-04T09:18:49.5587764Z creating: build/lib/ 2025-12-04T09:18:49.5632873Z inflating: build/lib/libprotobuf-lite.a 2025-12-04T09:18:49.5903741Z inflating: build/lib/libprotobuf.a 2025-12-04T09:18:49.6192400Z inflating: build/lib/libprotoc.a 2025-12-04T09:18:49.6197643Z inflating: build/lib/libpthreadpool.a 2025-12-04T09:18:49.6201935Z inflating: build/lib/libcpuinfo.a 2025-12-04T09:18:49.6206035Z inflating: build/lib/libcpuinfo_internals.a 2025-12-04T09:18:49.6207898Z inflating: build/lib/libclog.a 2025-12-04T09:18:49.6226271Z inflating: build/lib/libpytorch_qnnpack.a 2025-12-04T09:18:49.6229527Z inflating: build/lib/libnnpack_reference_layers.a 2025-12-04T09:18:49.6243222Z inflating: build/lib/libnnpack.a 2025-12-04T09:18:49.6357798Z inflating: build/lib/libmicrokernels-prod.a 2025-12-04T09:18:49.6859347Z inflating: build/lib/libmicrokernels-all.a 2025-12-04T09:18:49.6903591Z inflating: build/lib/libgtest.a 2025-12-04T09:18:49.6912865Z inflating: build/lib/libgmock.a 2025-12-04T09:18:49.6913053Z inflating: build/lib/libgtest_main.a 2025-12-04T09:18:49.6913212Z inflating: build/lib/libgmock_main.a 2025-12-04T09:18:49.6967001Z inflating: build/lib/libXNNPACK.a 2025-12-04T09:18:49.7008532Z inflating: build/lib/libbenchmark.a 2025-12-04T09:18:49.7008733Z inflating: build/lib/libbenchmark_main.a 2025-12-04T09:18:49.7008900Z inflating: build/lib/libjitprofiling.a 2025-12-04T09:18:49.7015689Z inflating: build/lib/libittnotify.a 2025-12-04T09:18:49.7054015Z inflating: 
build/lib/libasmjit.a 2025-12-04T09:18:49.7729909Z inflating: build/lib/libfbgemm.a 2025-12-04T09:18:49.7746521Z inflating: build/lib/libtensorpipe_uv.a 2025-12-04T09:18:49.8068083Z inflating: build/lib/libtensorpipe.a 2025-12-04T09:18:49.8140757Z inflating: build/lib/libgloo.a 2025-12-04T09:18:49.8175036Z inflating: build/lib/libonnx_proto.a 2025-12-04T09:18:49.8402611Z inflating: build/lib/libgloo_hip.a 2025-12-04T09:18:49.8863790Z inflating: build/lib/libonnx.a 2025-12-04T09:18:50.4863288Z inflating: build/lib/libdnnl.a 2025-12-04T09:18:50.4873011Z inflating: build/lib/libfmt.a 2025-12-04T09:18:50.5055502Z inflating: build/lib/libkineto.a 2025-12-04T09:18:50.5125955Z inflating: build/lib/libc10.so 2025-12-04T09:18:50.5127206Z inflating: build/lib/libtorch_global_deps.so 2025-12-04T09:18:50.5127539Z inflating: build/lib/libcaffe2_nvrtc.so 2025-12-04T09:18:50.5152605Z inflating: build/lib/libc10_hip.so 2025-12-04T09:18:50.5436620Z inflating: build/lib/libfbgemm_genai.a 2025-12-04T09:18:52.3588155Z inflating: build/lib/libtorch_cpu.so 2025-12-04T09:18:52.3590740Z inflating: build/lib/libshm.so 2025-12-04T09:18:53.2581601Z inflating: build/lib/libtorch_hip.so 2025-12-04T09:18:53.2581989Z inflating: build/lib/libtorch.so 2025-12-04T09:18:53.2592657Z inflating: build/lib/libjitbackend_test.so 2025-12-04T09:18:53.2606549Z inflating: build/lib/libbackend_with_compiler.so 2025-12-04T09:18:53.2657093Z inflating: build/lib/libtorchbind_test.so 2025-12-04T09:18:53.2679967Z inflating: build/lib/libaoti_custom_ops.so 2025-12-04T09:18:53.4101867Z inflating: build/lib/libtorch_python.so 2025-12-04T09:18:53.4123331Z inflating: build/lib/libnnapi_backend.so 2025-12-04T09:18:53.4123558Z creating: build/bin/ 2025-12-04T09:18:53.4123762Z creating: build/bin/CMakeFiles/ 2025-12-04T09:18:53.4123999Z inflating: build/bin/cmake_install.cmake 2025-12-04T09:18:53.4124231Z inflating: build/bin/CTestTestfile.cmake 2025-12-04T09:18:53.4382385Z inflating: build/bin/protoc-3.13.0.0 
2025-12-04T09:18:53.4669918Z inflating: build/bin/protoc 2025-12-04T09:18:53.4710792Z inflating: build/bin/c10_AllocatorConfig_test 2025-12-04T09:18:53.4739177Z inflating: build/bin/c10_CompileTimeFunctionPointer_test 2025-12-04T09:18:53.4771902Z inflating: build/bin/c10_DeviceGuard_test 2025-12-04T09:18:53.4803500Z inflating: build/bin/c10_Device_test 2025-12-04T09:18:53.4840235Z inflating: build/bin/c10_DispatchKeySet_test 2025-12-04T09:18:53.4872998Z inflating: build/bin/c10_Scalar_test 2025-12-04T09:18:53.4903105Z inflating: build/bin/c10_StreamGuard_test 2025-12-04T09:18:53.4937456Z inflating: build/bin/c10_SymInt_test 2025-12-04T09:18:53.4992380Z inflating: build/bin/c10_SizesAndStrides_test 2025-12-04T09:18:53.5033875Z inflating: build/bin/c10_Bitset_test 2025-12-04T09:18:53.5079032Z inflating: build/bin/c10_cow_test 2025-12-04T09:18:53.5116365Z inflating: build/bin/c10_InlineDeviceGuard_test 2025-12-04T09:18:53.5155620Z inflating: build/bin/c10_InlineStreamGuard_test 2025-12-04T09:18:53.5185862Z inflating: build/bin/c10_ArrayRef_test 2025-12-04T09:18:53.5217259Z inflating: build/bin/c10_ConstexprCrc_test 2025-12-04T09:18:53.5247789Z inflating: build/bin/c10_DeadlockDetection_test 2025-12-04T09:18:53.5284745Z inflating: build/bin/c10_IntrusiveList_test 2025-12-04T09:18:53.5317194Z inflating: build/bin/c10_Half_test 2025-12-04T09:18:53.5352151Z inflating: build/bin/c10_Enumerate_test 2025-12-04T09:18:53.5389626Z inflating: build/bin/c10_LeftRight_test 2025-12-04T09:18:53.5424420Z inflating: build/bin/c10_NetworkFlow_test 2025-12-04T09:18:53.5455369Z inflating: build/bin/c10_Semaphore_test 2025-12-04T09:18:53.5486200Z inflating: build/bin/c10_Synchronized_test 2025-12-04T09:18:53.5518004Z inflating: build/bin/c10_TypeIndex_test 2025-12-04T09:18:53.5552920Z inflating: build/bin/c10_ThreadLocal_test 2025-12-04T09:18:53.5602151Z inflating: build/bin/c10_accumulate_test 2025-12-04T09:18:53.5644326Z inflating: build/bin/c10_bfloat16_test 
2025-12-04T09:18:53.5676478Z inflating: build/bin/c10_error_test 2025-12-04T09:18:53.5707509Z inflating: build/bin/c10_bit_cast_test 2025-12-04T09:18:53.5747515Z inflating: build/bin/c10_complex_test 2025-12-04T09:18:53.5780021Z inflating: build/bin/c10_exception_test 2025-12-04T09:18:53.5816748Z inflating: build/bin/c10_complex_math_test 2025-12-04T09:18:53.5851820Z inflating: build/bin/c10_flags_test 2025-12-04T09:18:53.5885478Z inflating: build/bin/c10_irange_test 2025-12-04T09:18:53.5919657Z inflating: build/bin/c10_generic_math_test 2025-12-04T09:18:53.6016249Z inflating: build/bin/c10_intrusive_ptr_test 2025-12-04T09:18:53.6050725Z inflating: build/bin/c10_logging_test 2025-12-04T09:18:53.6082522Z inflating: build/bin/c10_nofatal_test 2025-12-04T09:18:53.6115123Z inflating: build/bin/c10_lazy_test 2025-12-04T09:18:53.6155410Z inflating: build/bin/c10_ordered_preserving_dict_test 2025-12-04T09:18:53.6187890Z inflating: build/bin/c10_registry_test 2025-12-04T09:18:53.6221538Z inflating: build/bin/c10_ssize_test 2025-12-04T09:18:53.6283380Z inflating: build/bin/c10_optional_test 2025-12-04T09:18:53.6373901Z inflating: build/bin/c10_small_vector_test 2025-12-04T09:18:53.6413097Z inflating: build/bin/c10_string_util_test 2025-12-04T09:18:53.6443729Z inflating: build/bin/c10_tempfile_test 2025-12-04T09:18:53.6477230Z inflating: build/bin/c10_string_view_test 2025-12-04T09:18:53.6507466Z inflating: build/bin/c10_intrusive_ptr_benchmark 2025-12-04T09:18:53.6546127Z inflating: build/bin/c10_typeid_test 2025-12-04T09:18:53.6580400Z inflating: build/bin/c10_hip_HIPAssertionsTest_1_var_test 2025-12-04T09:18:53.6613186Z inflating: build/bin/c10_hip_HIPAssertionsTest_catches_stream 2025-12-04T09:18:53.6643327Z inflating: build/bin/c10_hip_HIPAssertionsTest_catches_thread_and_block_and_device 2025-12-04T09:18:53.6673343Z inflating: build/bin/c10_hip_HIPAssertionsTest_from_2_processes 2025-12-04T09:18:53.6703589Z inflating: 
build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_blocks_and_threads 2025-12-04T09:18:53.6733652Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_multiple_blocks 2025-12-04T09:18:53.6763708Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_same_block 2025-12-04T09:18:53.6795617Z inflating: build/bin/c10_hip_HIPTest 2025-12-04T09:18:53.7181454Z inflating: build/bin/vec_test_all_types_DEFAULT 2025-12-04T09:18:53.7561023Z inflating: build/bin/vec_test_all_types_AVX512 2025-12-04T09:18:53.7913036Z inflating: build/bin/vec_test_all_types_AVX2 2025-12-04T09:18:53.8013642Z inflating: build/bin/test_aoti_abi_check 2025-12-04T09:18:53.8045745Z inflating: build/bin/test_vec_half_DEFAULT 2025-12-04T09:18:53.8079537Z inflating: build/bin/test_vec_half_AVX2 2025-12-04T09:18:53.8110412Z inflating: build/bin/test_vec_half_AVX512 2025-12-04T09:18:53.8150175Z inflating: build/bin/BackoffTest 2025-12-04T09:18:53.8182721Z inflating: build/bin/FileStoreTest 2025-12-04T09:18:53.8217816Z inflating: build/bin/TCPStoreTest 2025-12-04T09:18:53.8250878Z inflating: build/bin/HashStoreTest 2025-12-04T09:18:53.8291736Z inflating: build/bin/ProcessGroupGlooTest 2025-12-04T09:18:53.8293279Z inflating: build/bin/example_allreduce 2025-12-04T09:18:53.8295201Z inflating: build/bin/torch_shm_manager 2025-12-04T09:18:53.8328424Z inflating: build/bin/static_runtime_bench 2025-12-04T09:18:53.8477232Z inflating: build/bin/static_runtime_test 2025-12-04T09:18:53.8523579Z inflating: build/bin/Dict_test 2025-12-04T09:18:53.8577024Z inflating: build/bin/Dimname_test 2025-12-04T09:18:53.8643052Z inflating: build/bin/MaybeOwned_test 2025-12-04T09:18:53.8680426Z inflating: build/bin/NamedTensor_test 2025-12-04T09:18:53.8716378Z inflating: build/bin/apply_utils_test 2025-12-04T09:18:53.8752227Z inflating: build/bin/atest 2025-12-04T09:18:53.8791030Z inflating: build/bin/basic 2025-12-04T09:18:53.8824155Z inflating: build/bin/broadcast_test 
2025-12-04T09:18:53.8857110Z inflating: build/bin/cpu_allocator_test 2025-12-04T09:18:53.8892244Z inflating: build/bin/cpu_generator_test 2025-12-04T09:18:53.8924542Z inflating: build/bin/cpu_profiling_allocator_test 2025-12-04T09:18:53.8979799Z inflating: build/bin/cpu_rng_test 2025-12-04T09:18:53.9014227Z inflating: build/bin/dlconvertor_test 2025-12-04T09:18:53.9049279Z inflating: build/bin/extension_backend_test 2025-12-04T09:18:53.9096881Z inflating: build/bin/half_test 2025-12-04T09:18:53.9184544Z inflating: build/bin/ivalue_test 2025-12-04T09:18:53.9219692Z inflating: build/bin/lazy_tensor_test 2025-12-04T09:18:53.9254504Z inflating: build/bin/math_kernel_test 2025-12-04T09:18:53.9286748Z inflating: build/bin/memory_format_test 2025-12-04T09:18:53.9321869Z inflating: build/bin/memory_overlapping_test 2025-12-04T09:18:53.9354380Z inflating: build/bin/mobile_memory_cleanup 2025-12-04T09:18:53.9391488Z inflating: build/bin/native_test 2025-12-04T09:18:53.9422981Z inflating: build/bin/operator_name_test 2025-12-04T09:18:53.9454058Z inflating: build/bin/operators_test 2025-12-04T09:18:53.9488785Z inflating: build/bin/packedtensoraccessor_test 2025-12-04T09:18:53.9529314Z inflating: build/bin/pow_test 2025-12-04T09:18:53.9563698Z inflating: build/bin/quantized_test 2025-12-04T09:18:53.9594324Z inflating: build/bin/reduce_ops_test 2025-12-04T09:18:53.9625488Z inflating: build/bin/reportMemoryUsage_test 2025-12-04T09:18:53.9671981Z inflating: build/bin/scalar_tensor_test 2025-12-04T09:18:53.9718918Z inflating: build/bin/scalar_test 2025-12-04T09:18:53.9755555Z inflating: build/bin/StorageUtils_test 2025-12-04T09:18:53.9800024Z inflating: build/bin/stride_properties_test 2025-12-04T09:18:53.9847319Z inflating: build/bin/tensor_iterator_test 2025-12-04T09:18:53.9880439Z inflating: build/bin/test_parallel 2025-12-04T09:18:53.9911786Z inflating: build/bin/thread_init_test 2025-12-04T09:18:53.9949396Z inflating: build/bin/type_ptr_test 2025-12-04T09:18:53.9983231Z 
inflating: build/bin/type_test 2025-12-04T09:18:54.0015218Z inflating: build/bin/undefined_tensor_test 2025-12-04T09:18:54.0045578Z inflating: build/bin/verify_api_visibility 2025-12-04T09:18:54.0088138Z inflating: build/bin/legacy_vmap_test 2025-12-04T09:18:54.0122001Z inflating: build/bin/weakref_test 2025-12-04T09:18:54.0156141Z inflating: build/bin/wrapdim_test 2025-12-04T09:18:54.0234438Z inflating: build/bin/List_test 2025-12-04T09:18:54.0269175Z inflating: build/bin/xla_tensor_test 2025-12-04T09:18:54.0305066Z inflating: build/bin/IListRef_test 2025-12-04T09:18:54.0376919Z inflating: build/bin/kernel_function_legacy_test 2025-12-04T09:18:54.0429928Z inflating: build/bin/KernelFunction_test 2025-12-04T09:18:54.0501213Z inflating: build/bin/kernel_function_test 2025-12-04T09:18:54.0574962Z inflating: build/bin/kernel_lambda_legacy_test 2025-12-04T09:18:54.0636962Z inflating: build/bin/kernel_lambda_test 2025-12-04T09:18:54.0673559Z inflating: build/bin/kernel_stackbased_test 2025-12-04T09:18:54.0729665Z inflating: build/bin/make_boxed_from_unboxed_functor_test 2025-12-04T09:18:54.0770067Z inflating: build/bin/CppSignature_test 2025-12-04T09:18:54.0808140Z inflating: build/bin/op_allowlist_test 2025-12-04T09:18:54.1004179Z inflating: build/bin/op_registration_test 2025-12-04T09:18:54.1034865Z inflating: build/bin/hip_complex_math_test 2025-12-04T09:18:54.1069631Z inflating: build/bin/backend_fallback_test 2025-12-04T09:18:54.1099888Z inflating: build/bin/hip_complex_test 2025-12-04T09:18:54.1143138Z inflating: build/bin/inline_container_test 2025-12-04T09:18:54.1175410Z inflating: build/bin/hip_apply_test 2025-12-04T09:18:54.1205571Z inflating: build/bin/hip_distributions_test 2025-12-04T09:18:54.1235630Z inflating: build/bin/hip_generator_test 2025-12-04T09:18:54.1265593Z inflating: build/bin/hip_half_test 2025-12-04T09:18:54.1297540Z inflating: build/bin/hip_integer_divider_test 2025-12-04T09:18:54.1327439Z inflating: build/bin/hip_optional_test 
2025-12-04T09:18:54.1363687Z inflating: build/bin/hip_packedtensoraccessor_test 2025-12-04T09:18:54.1408765Z inflating: build/bin/hip_vectorized_test 2025-12-04T09:18:54.1444909Z inflating: build/bin/hip_dlconvertor_test 2025-12-04T09:18:54.2146491Z inflating: build/bin/test_jit 2025-12-04T09:18:54.2347171Z inflating: build/bin/test_lazy 2025-12-04T09:18:54.2380949Z inflating: build/bin/test_dist_autograd 2025-12-04T09:18:54.2422475Z inflating: build/bin/test_cpp_rpc 2025-12-04T09:18:54.2423722Z inflating: build/bin/parallel_benchmark 2025-12-04T09:18:54.3161145Z inflating: build/bin/test_api 2025-12-04T09:18:54.3161611Z creating: .additional_ci_files/ 2025-12-04T09:18:54.3197008Z inflating: .additional_ci_files/test-times.json 2025-12-04T09:18:54.3341487Z inflating: .additional_ci_files/test-class-times.json 2025-12-04T09:18:54.3369588Z ##[group]Run rm artifacts.zip 2025-12-04T09:18:54.3369743Z rm artifacts.zip 2025-12-04T09:18:54.3374062Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:18:54.3374217Z env: 2025-12-04T09:18:54.3374313Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:54.3374454Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:54.3374632Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:54.3374802Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:54.3375334Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:54.3375712Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:54.3375830Z AWS_REGION: us-east-1 2025-12-04T09:18:54.3375994Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:54.3376285Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:54.3378363Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:54.3378470Z ##[endgroup] 2025-12-04T09:18:54.4652699Z ##[group]Run df -H 
2025-12-04T09:18:54.4652808Z df -H 2025-12-04T09:18:54.4657751Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:18:54.4657908Z env: 2025-12-04T09:18:54.4658009Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:54.4658150Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:54.4658337Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:54.4658521Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:54.4658919Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:54.4659307Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:54.4659431Z AWS_REGION: us-east-1 2025-12-04T09:18:54.4659584Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:54.4659750Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:54.4661875Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:54.4661979Z ##[endgroup] 2025-12-04T09:18:54.4959943Z Filesystem Size Used Avail Use% Mounted on 2025-12-04T09:18:54.4960334Z overlay 16T 612G 15T 4% / 2025-12-04T09:18:54.4960573Z tmpfs 68M 0 68M 0% /dev 2025-12-04T09:18:54.4960806Z /dev/md0 16T 612G 15T 4% /run 2025-12-04T09:18:54.4961047Z shm 68M 4.1k 68M 1% /dev/shm 2025-12-04T09:18:54.4961510Z amdprj2-k8s_2 5.5T 120G 5.4T 3% /home/runner/pytorch-data 2025-12-04T09:18:54.4961935Z tmpfs 3.3T 13k 3.3T 1% /run/secrets/kubernetes.io/serviceaccount 2025-12-04T09:18:54.4962248Z tmpfs 1.7T 0 1.7T 0% /proc/acpi 2025-12-04T09:18:54.4962500Z tmpfs 1.7T 0 1.7T 0% /proc/scsi 2025-12-04T09:18:54.4962751Z tmpfs 1.7T 0 1.7T 0% /sys/firmware 2025-12-04T09:18:54.4963049Z tmpfs 1.7T 0 1.7T 0% /sys/devices/virtual/powercap 2025-12-04T09:18:54.4993883Z Prepare all required actions 2025-12-04T09:18:54.4994111Z Getting action download info 2025-12-04T09:18:54.7596503Z ##[group]Run ./.github/actions/download-td-artifacts 
2025-12-04T09:18:54.7596656Z with: 2025-12-04T09:18:54.7596752Z env: 2025-12-04T09:18:54.7596850Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:54.7596993Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:54.7597174Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:54.7597359Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:54.7597751Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:54.7598132Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:54.7598254Z AWS_REGION: us-east-1 2025-12-04T09:18:54.7598437Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:54.7598622Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:54.7600747Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:54.7600851Z ##[endgroup] 2025-12-04T09:18:54.7620543Z ##[group]Run seemethere/download-artifact-s3@v4 2025-12-04T09:18:54.7620729Z with: 2025-12-04T09:18:54.7620852Z name: td_results 2025-12-04T09:18:54.7620980Z s3-bucket: gha-artifacts 2025-12-04T09:18:54.7621119Z region: us-east-1 2025-12-04T09:18:54.7621245Z env: 2025-12-04T09:18:54.7621512Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:54.7621683Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:54.7621905Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:54.7622121Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:54.7622580Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:54.7622972Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:54.7623098Z AWS_REGION: us-east-1 2025-12-04T09:18:54.7623295Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:54.7623460Z 
AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:54.7625547Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:54.7625654Z ##[endgroup] 2025-12-04T09:18:55.6444658Z (node:17202) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2025-12-04T09:18:55.6445081Z 2025-12-04T09:18:55.6445239Z Please migrate your code to use AWS SDK for JavaScript (v3). 2025-12-04T09:18:55.6445631Z For more information, check the migration guide at https://a.co/7PzMCcy 2025-12-04T09:18:55.6446030Z (Use `node --trace-warnings ...` to show where the warning was created) 2025-12-04T09:18:55.9221220Z Found 1 objects with prefix pytorch/pytorch/19922849170/td_results/ 2025-12-04T09:18:55.9221647Z Starting download (1/1): /home/runner/_work/pytorch/pytorch/td_results.json 2025-12-04T09:18:56.3383421Z Finished download (1/1): /home/runner/_work/pytorch/pytorch/td_results.json 2025-12-04T09:18:56.3387230Z Artifact download has finished successfully 2025-12-04T09:18:56.3602571Z ##[group]Run mkdir -p .additional_ci_files 2025-12-04T09:18:56.3602759Z mkdir -p .additional_ci_files 2025-12-04T09:18:56.3602946Z mv td_results.json .additional_ci_files/td_results.json || true 2025-12-04T09:18:56.3608048Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:18:56.3608230Z env: 2025-12-04T09:18:56.3608337Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:56.3608489Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:56.3608685Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:56.3608871Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:56.3609463Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:56.3609885Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:56.3610017Z AWS_REGION: us-east-1 
2025-12-04T09:18:56.3610516Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:56.3610670Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:56.3612743Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:56.3612879Z ##[endgroup] 2025-12-04T09:18:56.3713382Z ##[group]Run .github/scripts/parse_ref.py 2025-12-04T09:18:56.3713593Z .github/scripts/parse_ref.py 2025-12-04T09:18:56.3720838Z shell: /usr/bin/bash -e {0} 2025-12-04T09:18:56.3720962Z env: 2025-12-04T09:18:56.3721062Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:56.3721208Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:56.3721394Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:56.3721567Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:56.3721952Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:56.3722336Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:56.3722462Z AWS_REGION: us-east-1 2025-12-04T09:18:56.3722643Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:56.3722823Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:56.3724902Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:56.3725164Z ##[endgroup] 2025-12-04T09:18:56.3830687Z Setting output branch=main 2025-12-04T09:18:56.3897313Z Prepare all required actions 2025-12-04T09:18:56.3897536Z Getting action download info 2025-12-04T09:18:56.5820376Z ##[group]Run ./.github/actions/filter-test-configs 2025-12-04T09:18:56.5820535Z with: 2025-12-04T09:18:56.5820760Z github-token: *** 2025-12-04T09:18:56.5823773Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": 
"unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", 
"mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]} 2025-12-04T09:18:56.5826986Z job-name: linux-jammy-rocm-py3.10 / test (default, 2, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:18:56.5827204Z env: 2025-12-04T09:18:56.5827306Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:56.5827454Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:56.5827640Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:56.5827812Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:56.5828202Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:56.5828578Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:56.5828832Z AWS_REGION: us-east-1 2025-12-04T09:18:56.5829084Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:56.5829244Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:56.5831576Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:56.5831686Z ##[endgroup] 2025-12-04T09:18:56.5847151Z ##[group]Run nick-fields/retry@v3.0.0 2025-12-04T09:18:56.5847276Z with: 2025-12-04T09:18:56.5847365Z shell: bash 2025-12-04T09:18:56.5847462Z timeout_minutes: 10 2025-12-04T09:18:56.5847564Z max_attempts: 5 2025-12-04T09:18:56.5847663Z retry_wait_seconds: 30 2025-12-04T09:18:56.5847956Z command: set -eux # PyYAML 6.0 doesn't work with MacOS x86 anymore # This must 
run on Python-3.7 (AmazonLinux2) so can't use request=3.32.2 python3 -m pip install requests==2.27.1 pyyaml==6.0.2 2025-12-04T09:18:56.5848272Z polling_interval_seconds: 1 2025-12-04T09:18:56.5848387Z warning_on_retry: true 2025-12-04T09:18:56.5848495Z continue_on_error: false 2025-12-04T09:18:56.5848602Z env: 2025-12-04T09:18:56.5848692Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:56.5848835Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:56.5849013Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:56.5849178Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:56.5849556Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:56.5849926Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:56.5850042Z AWS_REGION: us-east-1 2025-12-04T09:18:56.5850217Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:56.5850367Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:56.5852444Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:56.5852590Z GITHUB_TOKEN: *** 2025-12-04T09:18:56.5852687Z ##[endgroup] 2025-12-04T09:18:56.6275931Z + python3 -m pip install requests==2.27.1 pyyaml==6.0.2 2025-12-04T09:18:56.7715012Z Defaulting to user installation because normal site-packages is not writeable 2025-12-04T09:18:56.9062259Z Collecting requests==2.27.1 2025-12-04T09:18:56.9428761Z Downloading requests-2.27.1-py2.py3-none-any.whl (63 kB) 2025-12-04T09:18:56.9535222Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 63.1/63.1 KB 5.9 MB/s eta 0:00:00 2025-12-04T09:18:57.0016466Z Collecting pyyaml==6.0.2 2025-12-04T09:18:57.0114074Z Downloading PyYAML-6.0.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (751 kB) 2025-12-04T09:18:57.0518867Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 751.2/751.2 KB 19.1 MB/s eta 0:00:00 2025-12-04T09:18:57.0841282Z Collecting 
urllib3<1.27,>=1.21.1 2025-12-04T09:18:57.0893627Z Downloading urllib3-1.26.20-py2.py3-none-any.whl (144 kB) 2025-12-04T09:18:57.0950568Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 144.2/144.2 KB 28.7 MB/s eta 0:00:00 2025-12-04T09:18:57.1139691Z Collecting certifi>=2017.4.17 2025-12-04T09:18:57.1194983Z Downloading certifi-2025.11.12-py3-none-any.whl (159 kB) 2025-12-04T09:18:57.1257801Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 159.4/159.4 KB 28.0 MB/s eta 0:00:00 2025-12-04T09:18:57.1428277Z Collecting idna<4,>=2.5 2025-12-04T09:18:57.1486857Z Downloading idna-3.11-py3-none-any.whl (71 kB) 2025-12-04T09:18:57.1511414Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 71.0/71.0 KB 68.1 MB/s eta 0:00:00 2025-12-04T09:18:57.2417380Z Collecting charset-normalizer~=2.0.0 2025-12-04T09:18:57.2470504Z Downloading charset_normalizer-2.0.12-py3-none-any.whl (39 kB) 2025-12-04T09:18:57.3067332Z Installing collected packages: urllib3, pyyaml, idna, charset-normalizer, certifi, requests 2025-12-04T09:18:57.4013021Z WARNING: The script normalizer is installed in '/home/runner/.local/bin' which is not on PATH. 2025-12-04T09:18:57.4013544Z Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location. 2025-12-04T09:18:57.4183117Z Successfully installed certifi-2025.11.12 charset-normalizer-2.0.12 idna-3.11 pyyaml-6.0.2 requests-2.27.1 urllib3-1.26.20 2025-12-04T09:18:57.6288489Z Command completed after 1 attempt(s). 
2025-12-04T09:18:57.6332383Z ##[group]Run set -x 2025-12-04T09:18:57.6332522Z set -x 2025-12-04T09:18:57.6332624Z  2025-12-04T09:18:57.6332786Z # Use relative path here as this could be checked out anywhere, not necessarily 2025-12-04T09:18:57.6332983Z # in runner workspace 2025-12-04T09:18:57.6333152Z python3 "${GITHUB_ACTION_PATH}/../../scripts/parse_ref.py" 2025-12-04T09:18:57.6339655Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:18:57.6339815Z env: 2025-12-04T09:18:57.6339921Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:57.6340074Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:57.6340346Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:57.6340524Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:57.6340929Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:57.6341324Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:57.6341450Z AWS_REGION: us-east-1 2025-12-04T09:18:57.6341631Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:57.6341791Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:57.6343875Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:57.6343990Z ##[endgroup] 2025-12-04T09:18:57.6371334Z + python3 /home/runner/_work/pytorch/pytorch/./.github/actions/filter-test-configs/../../scripts/parse_ref.py 2025-12-04T09:18:57.6461394Z Setting output branch=main 2025-12-04T09:18:57.6496973Z ##[group]Run echo "Workflow: ${GITHUB_WORKFLOW}" 2025-12-04T09:18:57.6497180Z echo "Workflow: ${GITHUB_WORKFLOW}" 2025-12-04T09:18:57.6497331Z echo "Job name: ${JOB_NAME}" 2025-12-04T09:18:57.6497464Z  2025-12-04T09:18:57.6497630Z # Use relative path here as this could be checked out anywhere, not necessarily 2025-12-04T09:18:57.6497853Z # in runner workspace 2025-12-04T09:18:57.6498038Z python3 
"${GITHUB_ACTION_PATH}/../../scripts/filter_test_configs.py" \ 2025-12-04T09:18:57.6498281Z  --workflow "${GITHUB_WORKFLOW}" \ 2025-12-04T09:18:57.6498478Z  --job-name "${JOB_NAME}" \ 2025-12-04T09:18:57.6501874Z  --test-matrix "{"include": [{"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 6, 
"num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]}" \ 2025-12-04T09:18:57.6505124Z  --selected-test-configs "" \ 2025-12-04T09:18:57.6505261Z  --pr-number "${PR_NUMBER}" \ 2025-12-04T09:18:57.6505389Z  --tag "${TAG}" \ 2025-12-04T09:18:57.6505510Z  --event-name "${EVENT_NAME}" \ 2025-12-04T09:18:57.6505634Z  --schedule "${SCHEDULE}" \ 2025-12-04T09:18:57.6505755Z  --branch "${HEAD_BRANCH}" 2025-12-04T09:18:57.6509965Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:18:57.6510294Z env: 2025-12-04T09:18:57.6510390Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:57.6510528Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:57.6510702Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:57.6510865Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:57.6511249Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add 
video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:57.6511609Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:57.6511726Z AWS_REGION: us-east-1 2025-12-04T09:18:57.6511896Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:57.6512048Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:57.6514114Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:57.6514326Z GITHUB_TOKEN: *** 2025-12-04T09:18:57.6514515Z JOB_NAME: linux-jammy-rocm-py3.10 / test (default, 2, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:18:57.6514720Z PR_NUMBER: 2025-12-04T09:18:57.6514811Z TAG: 2025-12-04T09:18:57.6514901Z EVENT_NAME: schedule 2025-12-04T09:18:57.6515001Z SCHEDULE: 29 8 * * * 2025-12-04T09:18:57.6515100Z HEAD_BRANCH: main 2025-12-04T09:18:57.6515198Z ##[endgroup] 2025-12-04T09:18:57.6538922Z Workflow: trunk-rocm-mi300 2025-12-04T09:18:57.6539341Z Job name: linux-jammy-rocm-py3.10 / test (default, 2, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:18:58.2139321Z INFO:root:Issue https://github.com/pytorch/pytorch/issues/167616 created by jithunnair-amd has unstable all the test jobs for trunk-rocm-mi300 / linux-jammy-rocm-py3.10 / test (default, 2, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:18:58.6125206Z Setting output keep-going=True 2025-12-04T09:18:58.6125482Z Setting output ci-verbose-test-logs=False 2025-12-04T09:18:58.6125674Z Setting output ci-test-showlocals=False 2025-12-04T09:18:58.6125839Z Setting output ci-no-test-timeout=False 2025-12-04T09:18:58.6126004Z Setting output ci-no-td=False 2025-12-04T09:18:58.6126163Z Setting output ci-td-distributed=False 2025-12-04T09:18:58.6126336Z Setting output is-unstable=True 2025-12-04T09:18:58.6126493Z Setting output reenabled-issues= 2025-12-04T09:18:58.6136526Z Setting output test-matrix={"include": [{"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", 
"mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": 
"rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", 
"mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": 
"rerun_disabled_tests"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]} 2025-12-04T09:18:58.6143972Z Setting output is-test-matrix-empty=False 2025-12-04T09:18:58.6250909Z ##[group]Run echo "Filtered matrix:" 2025-12-04T09:18:58.6251112Z echo "Filtered matrix:" 2025-12-04T09:18:58.6258071Z echo "{"include": [{"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", 
"rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": 
"unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": 
"rerun_disabled_tests"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]}" 2025-12-04T09:18:58.6265115Z  2025-12-04T09:18:58.6265211Z echo 2025-12-04T09:18:58.6265330Z echo "Is the current job unstable? True" 2025-12-04T09:18:58.6265459Z  2025-12-04T09:18:58.6265552Z echo 2025-12-04T09:18:58.6265665Z echo "Is keep-going label set? True" 2025-12-04T09:18:58.6265793Z  2025-12-04T09:18:58.6265880Z echo 2025-12-04T09:18:58.6265984Z echo "Reenabled issues? 
" 2025-12-04T09:18:58.6270264Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:18:58.6270421Z env: 2025-12-04T09:18:58.6270522Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:58.6270665Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:58.6270849Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:58.6271024Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:58.6271418Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:58.6271798Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:58.6271922Z AWS_REGION: us-east-1 2025-12-04T09:18:58.6272101Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:58.6272262Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:58.6274348Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:58.6274462Z ##[endgroup] 2025-12-04T09:18:58.6293525Z Filtered matrix: 2025-12-04T09:18:58.6304589Z {include: [{config: default, shard: 1, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 1, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 1, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 1, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: default, shard: 2, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 2, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: 
rerun_disabled_tests}, {config: default, shard: 2, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 2, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: default, shard: 3, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 3, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 3, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 3, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: default, shard: 4, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 4, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 4, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 4, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: default, shard: 5, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 5, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 5, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: 
rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 5, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: default, shard: 6, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 6, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 6, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 6, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: distributed, shard: 1, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: distributed, shard: 1, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: distributed, shard: 1, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: distributed, shard: 1, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: distributed, shard: 2, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: distributed, shard: 2, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: distributed, shard: 2, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: distributed, 
shard: 2, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: distributed, shard: 3, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: distributed, shard: 3, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: distributed, shard: 3, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: distributed, shard: 3, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}]} 2025-12-04T09:18:58.6313388Z 2025-12-04T09:18:58.6313481Z Is the current job unstable? True 2025-12-04T09:18:58.6313579Z 2025-12-04T09:18:58.6313653Z Is keep-going label set? True 2025-12-04T09:18:58.6313744Z 2025-12-04T09:18:58.6313789Z Reenabled issues? 
2025-12-04T09:18:58.6343586Z ##[group]Run echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2025-12-04T09:18:58.6343806Z echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2025-12-04T09:18:58.6348196Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:18:58.6348344Z env: 2025-12-04T09:18:58.6348440Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:58.6348580Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:58.6348756Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:58.6348921Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:58.6349301Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:58.6349686Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:58.6349803Z AWS_REGION: us-east-1 2025-12-04T09:18:58.6349989Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:58.6350274Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:58.6352347Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:58.6352452Z JOB_TIMEOUT: 600 2025-12-04T09:18:58.6352550Z ##[endgroup] 2025-12-04T09:18:58.6396875Z ##[group]Run env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:18:58.6397167Z env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:18:58.6397403Z env | grep '^CI' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:18:58.6402453Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:18:58.6402619Z env: 2025-12-04T09:18:58.6402725Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:58.6402872Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:58.6403080Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:58.6403256Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:58.6403659Z GPU_FLAG: --device=/dev/mem 
--device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:58.6404074Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:58.6404205Z AWS_REGION: us-east-1 2025-12-04T09:18:58.6404381Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:58.6404547Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:58.6406728Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:58.6406845Z ##[endgroup] 2025-12-04T09:18:58.6494959Z ##[group]Run set -x 2025-12-04T09:18:58.6495115Z set -x 2025-12-04T09:18:58.6495214Z  2025-12-04T09:18:58.6495328Z if [[ $TEST_CONFIG == 'multigpu' ]]; then 2025-12-04T09:18:58.6495504Z  TEST_COMMAND=.ci/pytorch/multigpu-test.sh 2025-12-04T09:18:58.6495663Z elif [[ $BUILD_ENVIRONMENT == *onnx* ]]; then 2025-12-04T09:18:58.6495807Z  TEST_COMMAND=.ci/caffe2/test.sh 2025-12-04T09:18:58.6495930Z else 2025-12-04T09:18:58.6496034Z  TEST_COMMAND=.ci/pytorch/test.sh 2025-12-04T09:18:58.6496156Z fi 2025-12-04T09:18:58.6496242Z  2025-12-04T09:18:58.6496378Z # detached container should get cleaned up by teardown_ec2_linux 2025-12-04T09:18:58.6496581Z # TODO: Stop building test binaries as part of the build phase 2025-12-04T09:18:58.6496764Z # Used for GPU_FLAG since that doesn't play nice 2025-12-04T09:18:58.6496934Z # shellcheck disable=SC2086,SC2090 2025-12-04T09:18:58.6497068Z container_name=$(docker run \ 2025-12-04T09:18:58.6497195Z  ${GPU_FLAG:-} \ 2025-12-04T09:18:58.6497316Z  -e BUILD_ENVIRONMENT \ 2025-12-04T09:18:58.6497534Z  -e PR_NUMBER \ 2025-12-04T09:18:58.6497648Z  -e GITHUB_ACTIONS \ 2025-12-04T09:18:58.6497766Z  -e GITHUB_REPOSITORY \ 2025-12-04T09:18:58.6497887Z  -e GITHUB_WORKFLOW \ 2025-12-04T09:18:58.6498001Z  -e GITHUB_JOB \ 2025-12-04T09:18:58.6498111Z  -e GITHUB_RUN_ID \ 2025-12-04T09:18:58.6498223Z  -e GITHUB_RUN_NUMBER \ 2025-12-04T09:18:58.6498346Z  -e GITHUB_RUN_ATTEMPT \ 2025-12-04T09:18:58.6498464Z  -e JOB_ID \ 
2025-12-04T09:18:58.6498570Z  -e JOB_NAME \ 2025-12-04T09:18:58.6498678Z  -e BASE_SHA \ 2025-12-04T09:18:58.6498786Z  -e BRANCH \ 2025-12-04T09:18:58.6498887Z  -e SHA1 \ 2025-12-04T09:18:58.6498993Z  -e AWS_DEFAULT_REGION \ 2025-12-04T09:18:58.6499111Z  -e IN_WHEEL_TEST \ 2025-12-04T09:18:58.6499228Z  -e SHARD_NUMBER \ 2025-12-04T09:18:58.6499340Z  -e TEST_CONFIG \ 2025-12-04T09:18:58.6499456Z  -e NUM_TEST_SHARDS \ 2025-12-04T09:18:58.6499575Z  -e REENABLED_ISSUES \ 2025-12-04T09:18:58.6499703Z  -e CONTINUE_THROUGH_ERROR \ 2025-12-04T09:18:58.6499832Z  -e VERBOSE_TEST_LOGS \ 2025-12-04T09:18:58.6499953Z  -e TEST_SHOWLOCALS \ 2025-12-04T09:18:58.6500070Z  -e NO_TEST_TIMEOUT \ 2025-12-04T09:18:58.6500378Z  -e NO_TD \ 2025-12-04T09:18:58.6500497Z  -e MAX_JOBS="$(nproc --ignore=2)" \ 2025-12-04T09:18:58.6500642Z  -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK \ 2025-12-04T09:18:58.6500786Z  -e PYTORCH_TEST_RERUN_DISABLED_TESTS \ 2025-12-04T09:18:58.6500923Z  -e TESTS_TO_INCLUDE \ 2025-12-04T09:18:58.6501046Z  -e HUGGING_FACE_HUB_TOKEN \ 2025-12-04T09:18:58.6501174Z  -e DASHBOARD_TAG \ 2025-12-04T09:18:58.6501324Z  --env-file="${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" \ 2025-12-04T09:18:58.6501491Z  --ulimit stack=10485760:83886080 \ 2025-12-04T09:18:58.6501648Z  --ulimit core=0 \ 2025-12-04T09:18:58.6501795Z  --env-file="/tmp/github_env_${GITHUB_RUN_ID}" \ 2025-12-04T09:18:58.6501961Z  --security-opt seccomp=unconfined \ 2025-12-04T09:18:58.6502107Z  --cap-add=SYS_PTRACE \ 2025-12-04T09:18:58.6502232Z  --shm-size="8g" \ 2025-12-04T09:18:58.6502345Z  --tty \ 2025-12-04T09:18:58.6502452Z  --detach \ 2025-12-04T09:18:58.6502571Z  --name="${container_name}" \ 2025-12-04T09:18:58.6502701Z  --user jenkins \ 2025-12-04T09:18:58.6502846Z  -v "${GITHUB_WORKSPACE}:/var/lib/jenkins/workspace" \ 2025-12-04T09:18:58.6503005Z  -w /var/lib/jenkins/workspace \ 2025-12-04T09:18:58.6503203Z  "${DOCKER_IMAGE}" 2025-12-04T09:18:58.6503315Z ) 2025-12-04T09:18:58.6503427Z # save container name for 
later step 2025-12-04T09:18:58.6503594Z echo "CONTAINER_NAME=${container_name}" >> "$GITHUB_ENV" 2025-12-04T09:18:58.6503871Z # jenkins user does not have write permission to mounted workspace; work-around by copying within container to jenkins home 2025-12-04T09:18:58.6504221Z docker exec -t "${container_name}" sh -c "cd .. && cp -R workspace pytorch && cd pytorch && pip install dist/*.whl && ${TEST_COMMAND}" 2025-12-04T09:18:58.6508908Z shell: /usr/bin/bash -e {0} 2025-12-04T09:18:58.6509031Z env: 2025-12-04T09:18:58.6509135Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:18:58.6509280Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:18:58.6509465Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:18:58.6509638Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:18:58.6510025Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:18:58.6510496Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:18:58.6510618Z AWS_REGION: us-east-1 2025-12-04T09:18:58.6510794Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:18:58.6510953Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:18:58.6513023Z AWS_SESSION_TOKEN: *** 2025-12-04T09:18:58.6513153Z BUILD_ENVIRONMENT: linux-jammy-rocm-py3.10 2025-12-04T09:18:58.6513287Z PR_NUMBER: 2025-12-04T09:18:58.6513400Z GITHUB_REPOSITORY: pytorch/pytorch 2025-12-04T09:18:58.6513533Z GITHUB_WORKFLOW: trunk-rocm-mi300 2025-12-04T09:18:58.6513656Z GITHUB_JOB: test 2025-12-04T09:18:58.6513760Z GITHUB_RUN_ID: 19922849170 2025-12-04T09:18:58.6513875Z GITHUB_RUN_NUMBER: 689 2025-12-04T09:18:58.6513986Z GITHUB_RUN_ATTEMPT: 1 2025-12-04T09:18:58.6514087Z JOB_ID: 57116213140 2025-12-04T09:18:58.6514297Z JOB_NAME: linux-jammy-rocm-py3.10 / test (default, 2, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 
2025-12-04T09:18:58.6514506Z BRANCH: main 2025-12-04T09:18:58.6514619Z SHA1: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:18:58.6514786Z BASE_SHA: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:18:58.6514928Z TEST_CONFIG: default 2025-12-04T09:18:58.6515034Z SHARD_NUMBER: 2 2025-12-04T09:18:58.6515136Z NUM_TEST_SHARDS: 6 2025-12-04T09:18:58.6515240Z REENABLED_ISSUES: 2025-12-04T09:18:58.6515349Z CONTINUE_THROUGH_ERROR: True 2025-12-04T09:18:58.6515469Z VERBOSE_TEST_LOGS: False 2025-12-04T09:18:58.6515584Z TEST_SHOWLOCALS: False 2025-12-04T09:18:58.6515695Z NO_TEST_TIMEOUT: False 2025-12-04T09:18:58.6515802Z NO_TD: False 2025-12-04T09:18:58.6516071Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:18:58.6516373Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK: 1 2025-12-04T09:18:58.6516508Z PYTORCH_TEST_RERUN_DISABLED_TESTS: 0 2025-12-04T09:18:58.6516634Z TESTS_TO_INCLUDE: 2025-12-04T09:18:58.6516739Z DASHBOARD_TAG: 2025-12-04T09:18:58.6516887Z HUGGING_FACE_HUB_TOKEN: *** 2025-12-04T09:18:58.6517009Z ##[endgroup] 2025-12-04T09:18:58.6530227Z + [[ default == \m\u\l\t\i\g\p\u ]] 2025-12-04T09:18:58.6530375Z + [[ linux-jammy-rocm-py3.10 == *onnx* ]] 2025-12-04T09:18:58.6530530Z + TEST_COMMAND=.ci/pytorch/test.sh 2025-12-04T09:18:58.6537087Z +++ nproc --ignore=2 2025-12-04T09:18:58.6547712Z ++ docker run --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host -e BUILD_ENVIRONMENT -e PR_NUMBER -e GITHUB_ACTIONS -e GITHUB_REPOSITORY -e GITHUB_WORKFLOW -e GITHUB_JOB -e GITHUB_RUN_ID -e GITHUB_RUN_NUMBER -e GITHUB_RUN_ATTEMPT -e JOB_ID -e JOB_NAME -e BASE_SHA -e BRANCH -e SHA1 -e AWS_DEFAULT_REGION -e IN_WHEEL_TEST -e SHARD_NUMBER -e TEST_CONFIG -e NUM_TEST_SHARDS -e REENABLED_ISSUES -e 
CONTINUE_THROUGH_ERROR -e VERBOSE_TEST_LOGS -e TEST_SHOWLOCALS -e NO_TEST_TIMEOUT -e NO_TD -e MAX_JOBS=126 -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK -e PYTORCH_TEST_RERUN_DISABLED_TESTS -e TESTS_TO_INCLUDE -e HUGGING_FACE_HUB_TOKEN -e DASHBOARD_TAG --env-file=/home/runner/_work/_temp/github_env_19922849170 --ulimit stack=10485760:83886080 --ulimit core=0 --env-file=/tmp/github_env_19922849170 --security-opt seccomp=unconfined --cap-add=SYS_PTRACE --shm-size=8g --tty --detach --name= --user jenkins -v /home/runner/_work/pytorch/pytorch:/var/lib/jenkins/workspace -w /var/lib/jenkins/workspace 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:18:58.8813829Z + container_name=f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T09:18:58.8814335Z + echo CONTAINER_NAME=f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T09:18:58.8814950Z + docker exec -t f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e sh -c 'cd .. 
&& cp -R workspace pytorch && cd pytorch && pip install dist/*.whl && .ci/pytorch/test.sh' 2025-12-04T09:19:02.5951007Z Processing ./dist/torch-2.10.0a0+gitffd9b0f-cp310-cp310-linux_x86_64.whl 2025-12-04T09:19:03.1533214Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (3.18.0) 2025-12-04T09:19:03.1534214Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (4.12.2) 2025-12-04T09:19:03.1535104Z Requirement already satisfied: sympy>=1.13.3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (1.13.3) 2025-12-04T09:19:03.1537027Z Requirement already satisfied: networkx>=2.5.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (2.8.8) 2025-12-04T09:19:03.1538189Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (3.1.6) 2025-12-04T09:19:03.1540167Z Requirement already satisfied: fsspec>=0.8.5 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (2025.10.0) 2025-12-04T09:19:03.1707379Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy>=1.13.3->torch==2.10.0a0+gitffd9b0f) (1.3.0) 2025-12-04T09:19:03.1730764Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch==2.10.0a0+gitffd9b0f) (3.0.3) 2025-12-04T09:19:03.3732720Z Installing collected packages: torch 2025-12-04T09:19:09.5292506Z Successfully installed torch-2.10.0a0+gitffd9b0f 2025-12-04T09:19:09.5780983Z + export TERM=vt100 2025-12-04T09:19:09.5781277Z + TERM=vt100 2025-12-04T09:19:09.5796309Z ++ dirname .ci/pytorch/test.sh 2025-12-04T09:19:09.5826908Z + source .ci/pytorch/common.sh 2025-12-04T09:19:09.5855472Z +++ 
dirname .ci/pytorch/common.sh 2025-12-04T09:19:09.5864801Z ++ source .ci/pytorch/common_utils.sh 2025-12-04T09:19:09.5866979Z +++ declare -f -t trap_add 2025-12-04T09:19:09.5872385Z ++ set -ex -o pipefail 2025-12-04T09:19:09.5872622Z ++ [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:19:09.5872875Z ++ unset HIP_PLATFORM 2025-12-04T09:19:09.5873081Z ++ export PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:19:09.5873313Z ++ PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:19:09.5873521Z ++ BUILD_TEST_LIBTORCH=0 2025-12-04T09:19:09.5887308Z ++ dirname .ci/pytorch/test.sh 2025-12-04T09:19:09.5899000Z + source .ci/pytorch/common-build.sh 2025-12-04T09:19:09.5900409Z ++ [[ linux-jammy-rocm-py3.10 != *win-* ]] 2025-12-04T09:19:09.5927901Z ++++ dirname .ci/pytorch/common-build.sh 2025-12-04T09:19:09.5939039Z +++ cd .ci/pytorch 2025-12-04T09:19:09.5939283Z +++ pwd -P 2025-12-04T09:19:09.5952648Z ++ script_dir=/var/lib/jenkins/pytorch/.ci/pytorch 2025-12-04T09:19:09.5952977Z ++ [[ linux-jammy-rocm-py3.10 == *-pch* ]] 2025-12-04T09:19:09.5953202Z ++ which sccache 2025-12-04T09:19:09.5962077Z ++ [[ -z '' ]] 2025-12-04T09:19:09.5962231Z ++ unset SCCACHE_BUCKET 2025-12-04T09:19:09.5962417Z ++ unset SCCACHE_REGION 2025-12-04T09:19:09.5962577Z ++ sccache --stop-server 2025-12-04T09:19:09.5986567Z ++ true 2025-12-04T09:19:09.5986728Z ++ rm -f /var/lib/jenkins/sccache_error.log 2025-12-04T09:19:09.5993826Z ++ trap_add sccache_epilogue EXIT 2025-12-04T09:19:09.5994029Z ++ trap_add_cmd=sccache_epilogue 2025-12-04T09:19:09.5994208Z ++ shift 2025-12-04T09:19:09.5994365Z ++ for trap_add_name in "$@" 2025-12-04T09:19:09.6020968Z ++++ trap -p EXIT 2025-12-04T09:19:09.6023751Z +++ eval 'extract_trap_cmd ' 2025-12-04T09:19:09.6023929Z ++++ extract_trap_cmd 2025-12-04T09:19:09.6024791Z ++++ printf '%s\n' '' 2025-12-04T09:19:09.6025058Z +++ printf '%s\n' sccache_epilogue 2025-12-04T09:19:09.6026709Z ++ trap -- ' 2025-12-04T09:19:09.6026914Z sccache_epilogue' EXIT 2025-12-04T09:19:09.6027050Z ++ [[ -n '' 
]] 2025-12-04T09:19:09.6027204Z ++ [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:19:09.6027410Z ++ SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-12-04T09:19:09.6031849Z ++ SCCACHE_IDLE_TIMEOUT=0 2025-12-04T09:19:09.6031992Z ++ sccache --start-server 2025-12-04T09:19:09.6056700Z sccache: Starting the server... 2025-12-04T09:19:09.6300851Z sccache: Listening on address 127.0.0.1:4226 2025-12-04T09:19:09.6310652Z ++ sccache --zero-stats 2025-12-04T09:19:09.6330966Z Statistics zeroed. 2025-12-04T09:19:09.6335082Z ++ which ccache 2025-12-04T09:19:09.6343329Z + [[ linux-jammy-rocm-py3.10 != *rocm* ]] 2025-12-04T09:19:09.6343498Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:19:09.6343630Z + echo 'Environment variables:' 2025-12-04T09:19:09.6343754Z Environment variables: 2025-12-04T09:19:09.6343853Z + env 2025-12-04T09:19:09.6355380Z GITHUB_WORKSPACE=/home/runner/_work/pytorch/pytorch 2025-12-04T09:19:09.6355535Z CONTINUE_THROUGH_ERROR=True 2025-12-04T09:19:09.6355681Z BUILD_ENVIRONMENT=linux-jammy-rocm-py3.10 2025-12-04T09:19:09.6355848Z HOSTNAME=linux.rocm.gpu.gfx942.1.b-gwk9b-runner-kfwnw 2025-12-04T09:19:09.6356095Z GITHUB_PATH=/home/runner/_work/_temp/_runner_file_commands/add_path_99f7893b-e35d-4054-baa7-fa18075b3219 2025-12-04T09:19:09.6356310Z GITHUB_ACTION=__run_2 2025-12-04T09:19:09.6356422Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 2025-12-04T09:19:09.6356541Z GITHUB_RUN_NUMBER=689 2025-12-04T09:19:09.6356640Z TEST_CONFIG=default 2025-12-04T09:19:09.6356774Z RUNNER_NAME=linux.rocm.gpu.gfx942.1.b-gwk9b-runner-kfwnw 2025-12-04T09:19:09.6356930Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-12-04T09:19:09.6357054Z AWS_DEFAULT_REGION=us-east-1 2025-12-04T09:19:09.6357192Z RUNNER_ARTIFACT_DIR=/home/runner/_work/_temp/artifacts 2025-12-04T09:19:09.6357340Z GITHUB_TRIGGERING_ACTOR=pytorchmergebot 2025-12-04T09:19:09.6357463Z GITHUB_REF_TYPE=branch 2025-12-04T09:19:09.6357585Z BASE_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 
2025-12-04T09:19:09.6357863Z HUGGING_FACE_HUB_TOKEN=*** 2025-12-04T09:19:09.6359910Z *** 2025-12-04T09:19:09.6360003Z GITHUB_REPOSITORY_ID=65600975 2025-12-04T09:19:09.6360161Z GITHUB_ACTIONS=true 2025-12-04T09:19:09.6360277Z SHA1=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:19:09.6360432Z GITHUB_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:19:09.6360656Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/trunk-rocm-mi300.yml@refs/heads/main 2025-12-04T09:19:09.6360852Z UCC_HOME=/usr 2025-12-04T09:19:09.6360953Z RUNNER_ENVIRONMENT=self-hosted 2025-12-04T09:19:09.6361070Z VERBOSE_TEST_LOGS=False 2025-12-04T09:19:09.6361265Z GITHUB_REF=refs/heads/main 2025-12-04T09:19:09.6361371Z RUNNER_OS=Linux 2025-12-04T09:19:09.6361462Z SHARD_NUMBER=2 2025-12-04T09:19:09.6361560Z GITHUB_REF_PROTECTED=true 2025-12-04T09:19:09.6361672Z RUNNER_MANUALLY_TRAP_SIG=1 2025-12-04T09:19:09.6361776Z HOME=/var/lib/jenkins 2025-12-04T09:19:09.6361895Z GITHUB_API_URL=https://api.github.com 2025-12-04T09:19:09.6362223Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2025-12-04T09:19:09.6362358Z RUNNER_DOCS_DIR=/home/runner/_work/_temp/docs 2025-12-04T09:19:09.6362492Z LANG=C.UTF-8 2025-12-04T09:19:09.6362608Z UCX_COMMIT=29831d319e6be55cb8c768ca61de335c934ca39e 2025-12-04T09:19:09.6362753Z PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:19:09.6362899Z RUNNER_TRACKING_ID=github_0adbb7dc-8002-4bda-aa57-cf62aa530fc9 2025-12-04T09:19:09.6363056Z RUNNER_ARCH=X64 2025-12-04T09:19:09.6363165Z RUNNER_TEMP=/home/runner/_work/_temp 2025-12-04T09:19:09.6363284Z NUM_TEST_SHARDS=6 2025-12-04T09:19:09.6363382Z UCX_HOME=/usr 2025-12-04T09:19:09.6363572Z GITHUB_STATE=/home/runner/_work/_temp/_runner_file_commands/save_state_99f7893b-e35d-4054-baa7-fa18075b3219 2025-12-04T09:19:09.6363877Z JOB_NAME=linux-jammy-rocm-py3.10 / test (default, 2, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:19:09.6364085Z MAGMA_HOME=/opt/rocm/magma 2025-12-04T09:19:09.6364279Z 
GITHUB_ENV=/home/runner/_work/_temp/_runner_file_commands/set_env_99f7893b-e35d-4054-baa7-fa18075b3219 2025-12-04T09:19:09.6364520Z GITHUB_EVENT_PATH=/home/runner/_work/_temp/_github_workflow/event.json 2025-12-04T09:19:09.6364681Z GITHUB_EVENT_NAME=schedule 2025-12-04T09:19:09.6364838Z GITHUB_ACTIONS_RUNNER_EXTRA_USER_AGENT=actions-runner-controller/0.12.1 2025-12-04T09:19:09.6365052Z DASHBOARD_TAG= 2025-12-04T09:19:09.6365149Z GITHUB_RUN_ID=19922849170 2025-12-04T09:19:09.6365355Z GITHUB_STEP_SUMMARY=/home/runner/_work/_temp/_runner_file_commands/step_summary_99f7893b-e35d-4054-baa7-fa18075b3219 2025-12-04T09:19:09.6365579Z GITHUB_ACTOR=pytorchmergebot 2025-12-04T09:19:09.6365693Z PR_NUMBER= 2025-12-04T09:19:09.6365792Z GITHUB_RUN_ATTEMPT=1 2025-12-04T09:19:09.6365908Z ANACONDA_PYTHON_VERSION=3.10 2025-12-04T09:19:09.6366052Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-12-04T09:19:09.6366194Z TERM=vt100 2025-12-04T09:19:09.6366290Z INSTALLED_VISION=yes 2025-12-04T09:19:09.6366398Z BRANCH=main 2025-12-04T09:19:09.6366499Z OPENSSL_ROOT_DIR=/opt/openssl 2025-12-04T09:19:09.6366617Z TESTS_TO_INCLUDE= 2025-12-04T09:19:09.6366777Z GITHUB_ACTION_PATH=/home/runner/_work/pytorch/pytorch/./.github/actions/setup-rocm 2025-12-04T09:19:09.6366971Z GITHUB_SERVER_URL=https://github.com 2025-12-04T09:19:09.6367114Z PYTORCH_ROCM_ARCH=gfx90a;gfx942;gfx950;gfx1100 2025-12-04T09:19:09.6367274Z UCC_COMMIT=9f4b242cbbd8b1462cbc732eb29316cdfa124b77 2025-12-04T09:19:09.6367410Z REENABLED_ISSUES= 2025-12-04T09:19:09.6367503Z SHLVL=1 2025-12-04T09:19:09.6367590Z MAX_JOBS=126 2025-12-04T09:19:09.6367719Z RUNNER_TEST_RESULTS_DIR=/home/runner/_work/_temp/test-results 2025-12-04T09:19:09.6367871Z GITHUB_ACTOR_ID=97764156 2025-12-04T09:19:09.6367987Z RUNNER_TOOL_CACHE=/home/runner/_work/_tool 2025-12-04T09:19:09.6368149Z GITHUB_WORKFLOW_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:19:09.6368298Z GITHUB_REF_NAME=main 2025-12-04T09:19:09.6368399Z ROCM_PATH=/opt/rocm 
2025-12-04T09:19:09.6368496Z GITHUB_JOB=test 2025-12-04T09:19:09.6368591Z NO_TEST_TIMEOUT=False 2025-12-04T09:19:09.6368704Z GITHUB_REPOSITORY=pytorch/pytorch 2025-12-04T09:19:09.6368819Z LC_ALL=C.UTF-8 2025-12-04T09:19:09.6368915Z GITHUB_RETENTION_DAYS=90 2025-12-04T09:19:09.6369032Z RUNNER_WORKSPACE=/home/runner/_work/pytorch 2025-12-04T09:19:09.6369160Z OPENSSL_DIR=/opt/openssl 2025-12-04T09:19:09.6369273Z GITHUB_ACTION_REPOSITORY= 2025-12-04T09:19:09.6369627Z PATH=/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:19:09.6369975Z GITHUB_BASE_REF= 2025-12-04T09:19:09.6370068Z CI=true 2025-12-04T09:19:09.6370206Z GITHUB_REPOSITORY_OWNER=pytorch 2025-12-04T09:19:09.6370318Z JOB_ID=57116213140 2025-12-04T09:19:09.6370412Z GITHUB_HEAD_REF= 2025-12-04T09:19:09.6370506Z GITHUB_ACTION_REF= 2025-12-04T09:19:09.6370603Z TEST_SHOWLOCALS=False 2025-12-04T09:19:09.6370713Z GITHUB_WORKFLOW=trunk-rocm-mi300 2025-12-04T09:19:09.6370834Z DEBIAN_FRONTEND=noninteractive 2025-12-04T09:19:09.6371111Z GITHUB_OUTPUT=/home/runner/_work/_temp/_runner_file_commands/set_output_99f7893b-e35d-4054-baa7-fa18075b3219 2025-12-04T09:19:09.6371317Z NO_TD=False 2025-12-04T09:19:09.6371409Z OLDPWD=/var/lib/jenkins 2025-12-04T09:19:09.6371509Z _=/usr/bin/env 2025-12-04T09:19:09.6371640Z ++ python -c 'import site; print(site.getsitepackages()[0])' 2025-12-04T09:19:09.6420817Z + TORCH_INSTALL_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch 2025-12-04T09:19:09.6421051Z + TORCH_BIN_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2025-12-04T09:19:09.6421264Z + TORCH_LIB_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib 2025-12-04T09:19:09.6421478Z + TORCH_TEST_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/test 2025-12-04T09:19:09.6421646Z + BUILD_DIR=build 
2025-12-04T09:19:09.6421747Z + BUILD_RENAMED_DIR=build_renamed 2025-12-04T09:19:09.6421866Z + BUILD_BIN_DIR=build/bin 2025-12-04T09:19:09.6421980Z + SHARD_NUMBER=2 2025-12-04T09:19:09.6422077Z + NUM_TEST_SHARDS=6 2025-12-04T09:19:09.6422183Z + export TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:19:09.6422307Z + TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:19:09.6422417Z + export VALGRIND=ON 2025-12-04T09:19:09.6422516Z + VALGRIND=ON 2025-12-04T09:19:09.6422669Z + [[ linux-jammy-rocm-py3.10 == *clang9* ]] 2025-12-04T09:19:09.6422805Z + [[ linux-jammy-rocm-py3.10 == *xpu* ]] 2025-12-04T09:19:09.6422927Z + detect_cuda_arch 2025-12-04T09:19:09.6423033Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:19:09.6423169Z + [[ linux-jammy-rocm-py3.10 == *s390x* ]] 2025-12-04T09:19:09.6423290Z + [[ 0 == \1 ]] 2025-12-04T09:19:09.6423383Z + [[ True == \1 ]] 2025-12-04T09:19:09.6423496Z + [[ linux-jammy-rocm-py3.10 != *bazel* ]] 2025-12-04T09:19:09.6426412Z ++ realpath build/custom_test_artifacts 2025-12-04T09:19:09.6437547Z + CUSTOM_TEST_ARTIFACT_BUILD_DIR=/var/lib/jenkins/pytorch/build/custom_test_artifacts 2025-12-04T09:19:09.6437850Z + [[ -n '' ]] 2025-12-04T09:19:09.6438009Z + echo 'Environment variables' 2025-12-04T09:19:09.6438195Z Environment variables 2025-12-04T09:19:09.6438342Z + env 2025-12-04T09:19:09.6454524Z GITHUB_WORKSPACE=/home/runner/_work/pytorch/pytorch 2025-12-04T09:19:09.6454777Z CONTINUE_THROUGH_ERROR=True 2025-12-04T09:19:09.6454980Z BUILD_ENVIRONMENT=linux-jammy-rocm-py3.10 2025-12-04T09:19:09.6455215Z HOSTNAME=linux.rocm.gpu.gfx942.1.b-gwk9b-runner-kfwnw 2025-12-04T09:19:09.6455550Z GITHUB_PATH=/home/runner/_work/_temp/_runner_file_commands/add_path_99f7893b-e35d-4054-baa7-fa18075b3219 2025-12-04T09:19:09.6455840Z GITHUB_ACTION=__run_2 2025-12-04T09:19:09.6455996Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 2025-12-04T09:19:09.6456162Z GITHUB_RUN_NUMBER=689 2025-12-04T09:19:09.6456300Z TEST_CONFIG=default 2025-12-04T09:19:09.6456482Z 
RUNNER_NAME=linux.rocm.gpu.gfx942.1.b-gwk9b-runner-kfwnw 2025-12-04T09:19:09.6456700Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-12-04T09:19:09.6456880Z AWS_DEFAULT_REGION=us-east-1 2025-12-04T09:19:09.6457067Z RUNNER_ARTIFACT_DIR=/home/runner/_work/_temp/artifacts 2025-12-04T09:19:09.6457271Z GITHUB_TRIGGERING_ACTOR=pytorchmergebot 2025-12-04T09:19:09.6467771Z GITHUB_REF_TYPE=branch 2025-12-04T09:19:09.6467958Z BASE_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:19:09.6468329Z HUGGING_FACE_HUB_TOKEN=*** 2025-12-04T09:19:09.6468568Z *** 2025-12-04T09:19:09.6468745Z GITHUB_REPOSITORY_ID=65600975 2025-12-04T09:19:09.6468904Z GITHUB_ACTIONS=true 2025-12-04T09:19:09.6469065Z SHA1=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:19:09.6469264Z GITHUB_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:19:09.6469511Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/trunk-rocm-mi300.yml@refs/heads/main 2025-12-04T09:19:09.6469727Z UCC_HOME=/usr 2025-12-04T09:19:09.6469844Z TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:19:09.6469983Z RUNNER_ENVIRONMENT=self-hosted 2025-12-04T09:19:09.6470150Z VERBOSE_TEST_LOGS=False 2025-12-04T09:19:09.6470276Z GITHUB_REF=refs/heads/main 2025-12-04T09:19:09.6470400Z RUNNER_OS=Linux 2025-12-04T09:19:09.6470507Z SHARD_NUMBER=2 2025-12-04T09:19:09.6473888Z GITHUB_REF_PROTECTED=true 2025-12-04T09:19:09.6474016Z RUNNER_MANUALLY_TRAP_SIG=1 2025-12-04T09:19:09.6474137Z HOME=/var/lib/jenkins 2025-12-04T09:19:09.6474264Z GITHUB_API_URL=https://api.github.com 2025-12-04T09:19:09.6474420Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2025-12-04T09:19:09.6474574Z RUNNER_DOCS_DIR=/home/runner/_work/_temp/docs 2025-12-04T09:19:09.6474718Z LANG=C.UTF-8 2025-12-04T09:19:09.6474851Z UCX_COMMIT=29831d319e6be55cb8c768ca61de335c934ca39e 2025-12-04T09:19:09.6475007Z PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:19:09.6475170Z RUNNER_TRACKING_ID=github_0adbb7dc-8002-4bda-aa57-cf62aa530fc9 2025-12-04T09:19:09.6475337Z RUNNER_ARCH=X64 
2025-12-04T09:19:09.6475457Z RUNNER_TEMP=/home/runner/_work/_temp 2025-12-04T09:19:09.6475590Z NUM_TEST_SHARDS=6 2025-12-04T09:19:09.6475700Z UCX_HOME=/usr 2025-12-04T09:19:09.6475910Z GITHUB_STATE=/home/runner/_work/_temp/_runner_file_commands/save_state_99f7893b-e35d-4054-baa7-fa18075b3219 2025-12-04T09:19:09.6476247Z JOB_NAME=linux-jammy-rocm-py3.10 / test (default, 2, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:19:09.6476477Z MAGMA_HOME=/opt/rocm/magma 2025-12-04T09:19:09.6476693Z GITHUB_ENV=/home/runner/_work/_temp/_runner_file_commands/set_env_99f7893b-e35d-4054-baa7-fa18075b3219 2025-12-04T09:19:09.6477085Z GITHUB_EVENT_PATH=/home/runner/_work/_temp/_github_workflow/event.json 2025-12-04T09:19:09.6477266Z GITHUB_EVENT_NAME=schedule 2025-12-04T09:19:09.6477441Z GITHUB_ACTIONS_RUNNER_EXTRA_USER_AGENT=actions-runner-controller/0.12.1 2025-12-04T09:19:09.6477622Z DASHBOARD_TAG= 2025-12-04T09:19:09.6477734Z GITHUB_RUN_ID=19922849170 2025-12-04T09:19:09.6477965Z GITHUB_STEP_SUMMARY=/home/runner/_work/_temp/_runner_file_commands/step_summary_99f7893b-e35d-4054-baa7-fa18075b3219 2025-12-04T09:19:09.6478209Z GITHUB_ACTOR=pytorchmergebot 2025-12-04T09:19:09.6478336Z PR_NUMBER= 2025-12-04T09:19:09.6478444Z GITHUB_RUN_ATTEMPT=1 2025-12-04T09:19:09.6478560Z VALGRIND=ON 2025-12-04T09:19:09.6478672Z ANACONDA_PYTHON_VERSION=3.10 2025-12-04T09:19:09.6478829Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-12-04T09:19:09.6478983Z TERM=vt100 2025-12-04T09:19:09.6479088Z INSTALLED_VISION=yes 2025-12-04T09:19:09.6479206Z BRANCH=main 2025-12-04T09:19:09.6479317Z OPENSSL_ROOT_DIR=/opt/openssl 2025-12-04T09:19:09.6479437Z TESTS_TO_INCLUDE= 2025-12-04T09:19:09.6479606Z GITHUB_ACTION_PATH=/home/runner/_work/pytorch/pytorch/./.github/actions/setup-rocm 2025-12-04T09:19:09.6479808Z GITHUB_SERVER_URL=https://github.com 2025-12-04T09:19:09.6479958Z PYTORCH_ROCM_ARCH=gfx90a;gfx942;gfx950;gfx1100 2025-12-04T09:19:09.6480159Z 
UCC_COMMIT=9f4b242cbbd8b1462cbc732eb29316cdfa124b77 2025-12-04T09:19:09.6480304Z REENABLED_ISSUES= 2025-12-04T09:19:09.6480406Z SHLVL=1 2025-12-04T09:19:09.6480497Z MAX_JOBS=126 2025-12-04T09:19:09.6480635Z RUNNER_TEST_RESULTS_DIR=/home/runner/_work/_temp/test-results 2025-12-04T09:19:09.6480797Z GITHUB_ACTOR_ID=97764156 2025-12-04T09:19:09.6480922Z RUNNER_TOOL_CACHE=/home/runner/_work/_tool 2025-12-04T09:19:09.6481094Z GITHUB_WORKFLOW_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:19:09.6481252Z GITHUB_REF_NAME=main 2025-12-04T09:19:09.6481361Z ROCM_PATH=/opt/rocm 2025-12-04T09:19:09.6481462Z GITHUB_JOB=test 2025-12-04T09:19:09.6481578Z NO_TEST_TIMEOUT=False 2025-12-04T09:19:09.6481700Z GITHUB_REPOSITORY=pytorch/pytorch 2025-12-04T09:19:09.6481825Z LC_ALL=C.UTF-8 2025-12-04T09:19:09.6481931Z GITHUB_RETENTION_DAYS=90 2025-12-04T09:19:09.6482060Z RUNNER_WORKSPACE=/home/runner/_work/pytorch 2025-12-04T09:19:09.6482196Z OPENSSL_DIR=/opt/openssl 2025-12-04T09:19:09.6482309Z GITHUB_ACTION_REPOSITORY= 2025-12-04T09:19:09.6482669Z PATH=/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:19:09.6483023Z GITHUB_BASE_REF= 2025-12-04T09:19:09.6483117Z CI=true 2025-12-04T09:19:09.6483215Z GITHUB_REPOSITORY_OWNER=pytorch 2025-12-04T09:19:09.6483375Z JOB_ID=57116213140 2025-12-04T09:19:09.6483476Z GITHUB_HEAD_REF= 2025-12-04T09:19:09.6483576Z GITHUB_ACTION_REF= 2025-12-04T09:19:09.6483679Z TEST_SHOWLOCALS=False 2025-12-04T09:19:09.6483796Z GITHUB_WORKFLOW=trunk-rocm-mi300 2025-12-04T09:19:09.6483926Z DEBIAN_FRONTEND=noninteractive 2025-12-04T09:19:09.6484137Z GITHUB_OUTPUT=/home/runner/_work/_temp/_runner_file_commands/set_output_99f7893b-e35d-4054-baa7-fa18075b3219 2025-12-04T09:19:09.6484348Z NO_TD=False 2025-12-04T09:19:09.6484442Z OLDPWD=/var/lib/jenkins 2025-12-04T09:19:09.6484548Z 
_=/usr/bin/env 2025-12-04T09:19:09.6484650Z + echo 'Testing pytorch' 2025-12-04T09:19:09.6484758Z Testing pytorch 2025-12-04T09:19:09.6484859Z + export LANG=C.UTF-8 2025-12-04T09:19:09.6484963Z + LANG=C.UTF-8 2025-12-04T09:19:09.6485060Z + PR_NUMBER= 2025-12-04T09:19:09.6485162Z + [[ default == \d\e\f\a\u\l\t ]] 2025-12-04T09:19:09.6485287Z + export CUDA_VISIBLE_DEVICES=0 2025-12-04T09:19:09.6485407Z + CUDA_VISIBLE_DEVICES=0 2025-12-04T09:19:09.6485521Z + export HIP_VISIBLE_DEVICES=0 2025-12-04T09:19:09.6485634Z + HIP_VISIBLE_DEVICES=0 2025-12-04T09:19:09.6485749Z + [[ default == \d\i\s\t\r\i\b\u\t\e\d ]] 2025-12-04T09:19:09.6485878Z + [[ default == \s\l\o\w ]] 2025-12-04T09:19:09.6486013Z + [[ linux-jammy-rocm-py3.10 == *slow-gradcheck* ]] 2025-12-04T09:19:09.6486249Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:19:09.6486387Z + [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:19:09.6486527Z + export PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2025-12-04T09:19:09.6486667Z + PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2025-12-04T09:19:09.6486795Z + [[ default == *crossref* ]] 2025-12-04T09:19:09.6486918Z + [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:19:09.6487043Z + export VALGRIND=OFF 2025-12-04T09:19:09.6487148Z + VALGRIND=OFF 2025-12-04T09:19:09.6487242Z + rocminfo 2025-12-04T09:19:09.6590616Z ROCk module version 6.12.12 is loaded 2025-12-04T09:19:09.6975645Z ===================== 2025-12-04T09:19:09.6975820Z HSA System Attributes 2025-12-04T09:19:09.6975943Z ===================== 2025-12-04T09:19:09.6976067Z Runtime Version: 1.18 2025-12-04T09:19:09.6976192Z Runtime Ext Version: 1.14 2025-12-04T09:19:09.6976324Z System Timestamp Freq.: 1000.000000MHz 2025-12-04T09:19:09.6976532Z Sig. 
Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) 2025-12-04T09:19:09.6976746Z Machine Model: LARGE 2025-12-04T09:19:09.6976933Z System Endianness: LITTLE 2025-12-04T09:19:09.6977082Z Mwaitx: DISABLED 2025-12-04T09:19:09.6977205Z XNACK enabled: NO 2025-12-04T09:19:09.6977330Z DMAbuf Support: YES 2025-12-04T09:19:09.6977451Z VMM Support: YES 2025-12-04T09:19:09.6977521Z 2025-12-04T09:19:09.6977567Z ========== 2025-12-04T09:19:09.6977677Z HSA Agents 2025-12-04T09:19:09.6977786Z ========== 2025-12-04T09:19:09.6977896Z ******* 2025-12-04T09:19:09.6978012Z Agent 1 2025-12-04T09:19:09.6978118Z ******* 2025-12-04T09:19:09.6978251Z Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:19:09.6978411Z Uuid: CPU-XX 2025-12-04T09:19:09.6978579Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:19:09.6978751Z Vendor Name: CPU 2025-12-04T09:19:09.6978911Z Feature: None specified 2025-12-04T09:19:09.6979084Z Profile: FULL_PROFILE 2025-12-04T09:19:09.6979247Z Float Round Mode: NEAR 2025-12-04T09:19:09.6979413Z Max Queue Number: 0(0x0) 2025-12-04T09:19:09.6979573Z Queue Min Size: 0(0x0) 2025-12-04T09:19:09.6979749Z Queue Max Size: 0(0x0) 2025-12-04T09:19:09.6982933Z Queue Type: MULTI 2025-12-04T09:19:09.6983088Z Node: 0 2025-12-04T09:19:09.6983268Z Device Type: CPU 2025-12-04T09:19:09.6983413Z Cache Info: 2025-12-04T09:19:09.6983548Z L1: 49152(0xc000) KB 2025-12-04T09:19:09.6983695Z Chip ID: 0(0x0) 2025-12-04T09:19:09.6983849Z ASIC Revision: 0(0x0) 2025-12-04T09:19:09.6984009Z Cacheline Size: 64(0x40) 2025-12-04T09:19:09.6984171Z Max Clock Freq. (MHz): 3300 2025-12-04T09:19:09.6984325Z BDFID: 0 2025-12-04T09:19:09.6984480Z Internal Node ID: 0 2025-12-04T09:19:09.6984696Z Compute Unit: 64 2025-12-04T09:19:09.6984856Z SIMDs per CU: 0 2025-12-04T09:19:09.6985017Z Shader Engines: 0 2025-12-04T09:19:09.6985254Z Shader Arrs. per Eng.: 0 2025-12-04T09:19:09.6985423Z WatchPts on Addr. 
Ranges:1 2025-12-04T09:19:09.6985574Z Memory Properties: 2025-12-04T09:19:09.6985691Z Features: None 2025-12-04T09:19:09.6985812Z Pool Info: 2025-12-04T09:19:09.6985924Z Pool 1 2025-12-04T09:19:09.6986065Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T09:19:09.6986228Z Size: 1584734456(0x5e7520f8) KB 2025-12-04T09:19:09.6986385Z Allocatable: TRUE 2025-12-04T09:19:09.6986546Z Alloc Granule: 4KB 2025-12-04T09:19:09.6986725Z Alloc Recommended Granule:4KB 2025-12-04T09:19:09.6986901Z Alloc Alignment: 4KB 2025-12-04T09:19:09.6987068Z Accessible by all: TRUE 2025-12-04T09:19:09.6987216Z Pool 2 2025-12-04T09:19:09.6987355Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T09:19:09.6987515Z Size: 1584734456(0x5e7520f8) KB 2025-12-04T09:19:09.6987671Z Allocatable: TRUE 2025-12-04T09:19:09.6987836Z Alloc Granule: 4KB 2025-12-04T09:19:09.6988006Z Alloc Recommended Granule:4KB 2025-12-04T09:19:09.6988177Z Alloc Alignment: 4KB 2025-12-04T09:19:09.6988344Z Accessible by all: TRUE 2025-12-04T09:19:09.6988492Z Pool 3 2025-12-04T09:19:09.6988631Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2025-12-04T09:19:09.6988787Z Size: 1584734456(0x5e7520f8) KB 2025-12-04T09:19:09.6988946Z Allocatable: TRUE 2025-12-04T09:19:09.6989110Z Alloc Granule: 4KB 2025-12-04T09:19:09.6989281Z Alloc Recommended Granule:4KB 2025-12-04T09:19:09.6989451Z Alloc Alignment: 4KB 2025-12-04T09:19:09.6989618Z Accessible by all: TRUE 2025-12-04T09:19:09.6989760Z Pool 4 2025-12-04T09:19:09.6989896Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T09:19:09.6990050Z Size: 1584734456(0x5e7520f8) KB 2025-12-04T09:19:09.6990288Z Allocatable: TRUE 2025-12-04T09:19:09.6990450Z Alloc Granule: 4KB 2025-12-04T09:19:09.6990620Z Alloc Recommended Granule:4KB 2025-12-04T09:19:09.6990792Z Alloc Alignment: 4KB 2025-12-04T09:19:09.6990959Z Accessible by all: TRUE 2025-12-04T09:19:09.6991102Z ISA Info: 2025-12-04T09:19:09.6991211Z ******* 2025-12-04T09:19:09.6991321Z Agent 2 2025-12-04T09:19:09.6991427Z ******* 
2025-12-04T09:19:09.6991556Z Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:19:09.6991711Z Uuid: CPU-XX 2025-12-04T09:19:09.6991875Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:19:09.6992044Z Vendor Name: CPU 2025-12-04T09:19:09.6992208Z Feature: None specified 2025-12-04T09:19:09.6992369Z Profile: FULL_PROFILE 2025-12-04T09:19:09.6992531Z Float Round Mode: NEAR 2025-12-04T09:19:09.6992729Z Max Queue Number: 0(0x0) 2025-12-04T09:19:09.6992888Z Queue Min Size: 0(0x0) 2025-12-04T09:19:09.6993045Z Queue Max Size: 0(0x0) 2025-12-04T09:19:09.6993203Z Queue Type: MULTI 2025-12-04T09:19:09.6993353Z Node: 1 2025-12-04T09:19:09.6993506Z Device Type: CPU 2025-12-04T09:19:09.6993651Z Cache Info: 2025-12-04T09:19:09.6993776Z L1: 49152(0xc000) KB 2025-12-04T09:19:09.6993926Z Chip ID: 0(0x0) 2025-12-04T09:19:09.6994080Z ASIC Revision: 0(0x0) 2025-12-04T09:19:09.6994240Z Cacheline Size: 64(0x40) 2025-12-04T09:19:09.6994408Z Max Clock Freq. (MHz): 3300 2025-12-04T09:19:09.6994562Z BDFID: 0 2025-12-04T09:19:09.6994718Z Internal Node ID: 1 2025-12-04T09:19:09.6994880Z Compute Unit: 64 2025-12-04T09:19:09.6995035Z SIMDs per CU: 0 2025-12-04T09:19:09.6995194Z Shader Engines: 0 2025-12-04T09:19:09.6995360Z Shader Arrs. per Eng.: 0 2025-12-04T09:19:09.6995528Z WatchPts on Addr. 
Ranges:1 2025-12-04T09:19:09.6995678Z Memory Properties: 2025-12-04T09:19:09.6995799Z Features: None 2025-12-04T09:19:09.6995921Z Pool Info: 2025-12-04T09:19:09.6996035Z Pool 1 2025-12-04T09:19:09.6996174Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T09:19:09.6996336Z Size: 1585355628(0x5e7e9b6c) KB 2025-12-04T09:19:09.6996502Z Allocatable: TRUE 2025-12-04T09:19:09.6996666Z Alloc Granule: 4KB 2025-12-04T09:19:09.6996839Z Alloc Recommended Granule:4KB 2025-12-04T09:19:09.6997010Z Alloc Alignment: 4KB 2025-12-04T09:19:09.6997177Z Accessible by all: TRUE 2025-12-04T09:19:09.6997321Z Pool 2 2025-12-04T09:19:09.6997458Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T09:19:09.6997643Z Size: 1585355628(0x5e7e9b6c) KB 2025-12-04T09:19:09.6997799Z Allocatable: TRUE 2025-12-04T09:19:09.6997964Z Alloc Granule: 4KB 2025-12-04T09:19:09.6998132Z Alloc Recommended Granule:4KB 2025-12-04T09:19:09.6998304Z Alloc Alignment: 4KB 2025-12-04T09:19:09.6998472Z Accessible by all: TRUE 2025-12-04T09:19:09.6998617Z Pool 3 2025-12-04T09:19:09.6998756Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2025-12-04T09:19:09.6998917Z Size: 1585355628(0x5e7e9b6c) KB 2025-12-04T09:19:09.6999077Z Allocatable: TRUE 2025-12-04T09:19:09.6999242Z Alloc Granule: 4KB 2025-12-04T09:19:09.6999412Z Alloc Recommended Granule:4KB 2025-12-04T09:19:09.6999583Z Alloc Alignment: 4KB 2025-12-04T09:19:09.6999746Z Accessible by all: TRUE 2025-12-04T09:19:09.6999915Z Pool 4 2025-12-04T09:19:09.7000052Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T09:19:09.7000262Z Size: 1585355628(0x5e7e9b6c) KB 2025-12-04T09:19:09.7000417Z Allocatable: TRUE 2025-12-04T09:19:09.7000584Z Alloc Granule: 4KB 2025-12-04T09:19:09.7000753Z Alloc Recommended Granule:4KB 2025-12-04T09:19:09.7000924Z Alloc Alignment: 4KB 2025-12-04T09:19:09.7001092Z Accessible by all: TRUE 2025-12-04T09:19:09.7001241Z ISA Info: 2025-12-04T09:19:09.7001352Z ******* 2025-12-04T09:19:09.7001460Z Agent 3 2025-12-04T09:19:09.7001568Z ******* 
2025-12-04T09:19:09.7001690Z Name: gfx942 2025-12-04T09:19:09.7001848Z Uuid: GPU-c59e59538c4aacf0 2025-12-04T09:19:09.7002008Z Marketing Name: 2025-12-04T09:19:09.7002172Z Vendor Name: AMD 2025-12-04T09:19:09.7002333Z Feature: KERNEL_DISPATCH 2025-12-04T09:19:09.7002493Z Profile: BASE_PROFILE 2025-12-04T09:19:09.7002657Z Float Round Mode: NEAR 2025-12-04T09:19:09.7002819Z Max Queue Number: 128(0x80) 2025-12-04T09:19:09.7002981Z Queue Min Size: 64(0x40) 2025-12-04T09:19:09.7003139Z Queue Max Size: 131072(0x20000) 2025-12-04T09:19:09.7003297Z Queue Type: MULTI 2025-12-04T09:19:09.7003447Z Node: 2 2025-12-04T09:19:09.7003598Z Device Type: GPU 2025-12-04T09:19:09.7003738Z Cache Info: 2025-12-04T09:19:09.7003863Z L1: 32(0x20) KB 2025-12-04T09:19:09.7004003Z L2: 4096(0x1000) KB 2025-12-04T09:19:09.7004140Z L3: 262144(0x40000) KB 2025-12-04T09:19:09.7004281Z Chip ID: 29861(0x74a5) 2025-12-04T09:19:09.7004438Z ASIC Revision: 1(0x1) 2025-12-04T09:19:09.7004598Z Cacheline Size: 128(0x80) 2025-12-04T09:19:09.7004803Z Max Clock Freq. (MHz): 2100 2025-12-04T09:19:09.7004959Z BDFID: 1280 2025-12-04T09:19:09.7005119Z Internal Node ID: 2 2025-12-04T09:19:09.7005281Z Compute Unit: 304 2025-12-04T09:19:09.7005439Z SIMDs per CU: 4 2025-12-04T09:19:09.7005600Z Shader Engines: 32 2025-12-04T09:19:09.7005767Z Shader Arrs. per Eng.: 1 2025-12-04T09:19:09.7005934Z WatchPts on Addr. 
Ranges:4 2025-12-04T09:19:09.7006103Z Coherent Host Access: FALSE 2025-12-04T09:19:09.7006251Z Memory Properties: 2025-12-04T09:19:09.7006379Z Features: KERNEL_DISPATCH 2025-12-04T09:19:09.7006531Z Fast F16 Operation: TRUE 2025-12-04T09:19:09.7006699Z Wavefront Size: 64(0x40) 2025-12-04T09:19:09.7006865Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:19:09.7007023Z Workgroup Max Size per Dimension: 2025-12-04T09:19:09.7007186Z x 1024(0x400) 2025-12-04T09:19:09.7007324Z y 1024(0x400) 2025-12-04T09:19:09.7007462Z z 1024(0x400) 2025-12-04T09:19:09.7007611Z Max Waves Per CU: 32(0x20) 2025-12-04T09:19:09.7007777Z Max Work-item Per CU: 2048(0x800) 2025-12-04T09:19:09.7007943Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:19:09.7008090Z Grid Max Size per Dimension: 2025-12-04T09:19:09.7008218Z x 2147483647(0x7fffffff) 2025-12-04T09:19:09.7008359Z y 65535(0xffff) 2025-12-04T09:19:09.7008498Z z 65535(0xffff) 2025-12-04T09:19:09.7008654Z Max fbarriers/Workgrp: 32 2025-12-04T09:19:09.7008878Z Packet Processor uCode:: 185 2025-12-04T09:19:09.7009054Z SDMA engine uCode:: 24 2025-12-04T09:19:09.7009223Z IOMMU Support:: None 2025-12-04T09:19:09.7009368Z Pool Info: 2025-12-04T09:19:09.7009482Z Pool 1 2025-12-04T09:19:09.7009624Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T09:19:09.7009786Z Size: 268419072(0xfffc000) KB 2025-12-04T09:19:09.7009939Z Allocatable: TRUE 2025-12-04T09:19:09.7010131Z Alloc Granule: 4KB 2025-12-04T09:19:09.7010299Z Alloc Recommended Granule:2048KB 2025-12-04T09:19:09.7010466Z Alloc Alignment: 4KB 2025-12-04T09:19:09.7010630Z Accessible by all: FALSE 2025-12-04T09:19:09.7010773Z Pool 2 2025-12-04T09:19:09.7010907Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T09:19:09.7011061Z Size: 268419072(0xfffc000) KB 2025-12-04T09:19:09.7011215Z Allocatable: TRUE 2025-12-04T09:19:09.7011374Z Alloc Granule: 4KB 2025-12-04T09:19:09.7011541Z Alloc Recommended Granule:2048KB 2025-12-04T09:19:09.7011705Z Alloc Alignment: 4KB 
2025-12-04T09:19:09.7011866Z Accessible by all: FALSE 2025-12-04T09:19:09.7012007Z Pool 3 2025-12-04T09:19:09.7012191Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T09:19:09.7012343Z Size: 268419072(0xfffc000) KB 2025-12-04T09:19:09.7012502Z Allocatable: TRUE 2025-12-04T09:19:09.7012669Z Alloc Granule: 4KB 2025-12-04T09:19:09.7012845Z Alloc Recommended Granule:2048KB 2025-12-04T09:19:09.7013020Z Alloc Alignment: 4KB 2025-12-04T09:19:09.7013191Z Accessible by all: FALSE 2025-12-04T09:19:09.7013338Z Pool 4 2025-12-04T09:19:09.7013473Z Segment: GROUP 2025-12-04T09:19:09.7013627Z Size: 64(0x40) KB 2025-12-04T09:19:09.7013785Z Allocatable: FALSE 2025-12-04T09:19:09.7013956Z Alloc Granule: 0KB 2025-12-04T09:19:09.7014132Z Alloc Recommended Granule:0KB 2025-12-04T09:19:09.7014318Z Alloc Alignment: 0KB 2025-12-04T09:19:09.7014521Z Accessible by all: FALSE 2025-12-04T09:19:09.7014669Z ISA Info: 2025-12-04T09:19:09.7014785Z ISA 1 2025-12-04T09:19:09.7014925Z Name: amdgcn-amd-amdhsa--gfx942:sramecc+:xnack- 2025-12-04T09:19:09.7015103Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T09:19:09.7015277Z Profiles: HSA_PROFILE_BASE 2025-12-04T09:19:09.7015449Z Default Rounding Mode: NEAR 2025-12-04T09:19:09.7015628Z Default Rounding Mode: NEAR 2025-12-04T09:19:09.7015800Z Fast f16: TRUE 2025-12-04T09:19:09.7015959Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:19:09.7016111Z Workgroup Max Size per Dimension: 2025-12-04T09:19:09.7016251Z x 1024(0x400) 2025-12-04T09:19:09.7016391Z y 1024(0x400) 2025-12-04T09:19:09.7016526Z z 1024(0x400) 2025-12-04T09:19:09.7016673Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:19:09.7016817Z Grid Max Size per Dimension: 2025-12-04T09:19:09.7016943Z x 2147483647(0x7fffffff) 2025-12-04T09:19:09.7017082Z y 65535(0xffff) 2025-12-04T09:19:09.7017219Z z 65535(0xffff) 2025-12-04T09:19:09.7017370Z FBarrier Max Size: 32 2025-12-04T09:19:09.7017516Z ISA 2 2025-12-04T09:19:09.7017669Z Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- 
2025-12-04T09:19:09.7017855Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T09:19:09.7018022Z Profiles: HSA_PROFILE_BASE 2025-12-04T09:19:09.7018191Z Default Rounding Mode: NEAR 2025-12-04T09:19:09.7018361Z Default Rounding Mode: NEAR 2025-12-04T09:19:09.7018518Z Fast f16: TRUE 2025-12-04T09:19:09.7018676Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:19:09.7018824Z Workgroup Max Size per Dimension: 2025-12-04T09:19:09.7018957Z x 1024(0x400) 2025-12-04T09:19:09.7019122Z y 1024(0x400) 2025-12-04T09:19:09.7019256Z z 1024(0x400) 2025-12-04T09:19:09.7019404Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:19:09.7019551Z Grid Max Size per Dimension: 2025-12-04T09:19:09.7019677Z x 2147483647(0x7fffffff) 2025-12-04T09:19:09.7019812Z y 65535(0xffff) 2025-12-04T09:19:09.7019944Z z 65535(0xffff) 2025-12-04T09:19:09.7020132Z FBarrier Max Size: 32 2025-12-04T09:19:09.7020270Z *** Done *** 2025-12-04T09:19:09.7059715Z + rocminfo 2025-12-04T09:19:09.7069608Z + grep -E 'Name:.*\sgfx|Marketing' 2025-12-04T09:19:09.7624457Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:19:09.7624978Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:19:09.7625415Z Name: gfx942 2025-12-04T09:19:09.7625825Z Marketing Name: 2025-12-04T09:19:09.7665529Z + MAYBE_ROCM=rocm/ 2025-12-04T09:19:09.7666427Z + [[ linux-jammy-rocm-py3.10 == *xpu* ]] 2025-12-04T09:19:09.7666579Z + [[ linux-jammy-rocm-py3.10 != *-bazel-* ]] 2025-12-04T09:19:09.7666713Z + pip_install ninja==1.10.2 2025-12-04T09:19:09.7666868Z + pip_install_pkg='python3 -m pip install --progress-bar off' 2025-12-04T09:19:09.7667051Z + python3 -m pip install --progress-bar off ninja==1.10.2 2025-12-04T09:19:09.9868243Z Collecting ninja==1.10.2 2025-12-04T09:19:10.0120243Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl.metadata (5.0 kB) 2025-12-04T09:19:10.0281319Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl (108 kB) 
2025-12-04T09:19:10.1961851Z Installing collected packages: ninja 2025-12-04T09:19:10.1962130Z Attempting uninstall: ninja 2025-12-04T09:19:10.1965339Z Found existing installation: ninja 1.11.1.4 2025-12-04T09:19:10.1974918Z Uninstalling ninja-1.11.1.4: 2025-12-04T09:19:10.2266449Z Successfully uninstalled ninja-1.11.1.4 2025-12-04T09:19:10.2384994Z Successfully installed ninja-1.10.2 2025-12-04T09:19:10.2824474Z + export PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:19:10.2825787Z + PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:19:10.2826550Z + [[ linux-jammy-rocm-py3.10 == *aarch64* ]] 2025-12-04T09:19:10.2826813Z + [[ linux-jammy-rocm-py3.10 == *asan* ]] 2025-12-04T09:19:10.2827077Z + [[ linux-jammy-rocm-py3.10 == *-debug* ]] 2025-12-04T09:19:10.2827326Z + [[ linux-jammy-rocm-py3.10 != *-bazel-* ]] 2025-12-04T09:19:10.2827673Z + echo 'We are not in debug mode: linux-jammy-rocm-py3.10. Expect the assertion to pass' 2025-12-04T09:19:10.2828106Z We are not in debug mode: linux-jammy-rocm-py3.10. 
Expect the assertion to pass 2025-12-04T09:19:10.2880774Z + cd test 2025-12-04T09:19:10.2881042Z + python -c 'import torch; torch._C._crash_if_debug_asserts_fail(424242)' 2025-12-04T09:19:11.2585504Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X\2 ]] 2025-12-04T09:19:11.2585831Z + [[ default == \n\o\g\p\u\_\A\V\X\5\1\2 ]] 2025-12-04T09:19:11.2586113Z + [[ default == \l\e\g\a\c\y\_\n\v\i\d\i\a\_\d\r\i\v\e\r ]] 2025-12-04T09:19:11.2590996Z + DYNAMO_BENCHMARK_FLAGS=() 2025-12-04T09:19:11.2591253Z + [[ default == *pr_time_benchmarks* ]] 2025-12-04T09:19:11.2591494Z + [[ default == *dynamo_eager* ]] 2025-12-04T09:19:11.2591704Z + [[ default == *aot_eager* ]] 2025-12-04T09:19:11.2595954Z + [[ default == *aot_inductor* ]] 2025-12-04T09:19:11.2596170Z + [[ default == *max_autotune_inductor* ]] 2025-12-04T09:19:11.2596393Z + [[ default == *inductor* ]] 2025-12-04T09:19:11.2596603Z + [[ default == *dynamic* ]] 2025-12-04T09:19:11.2596807Z + [[ default == *cpu* ]] 2025-12-04T09:19:11.2597001Z + [[ default == *xpu* ]] 2025-12-04T09:19:11.2597224Z + DYNAMO_BENCHMARK_FLAGS+=(--device cuda) 2025-12-04T09:19:11.2601268Z + [[ linux-jammy-rocm-py3.10 == *libtorch* ]] 2025-12-04T09:19:11.2601501Z + [[ linux-jammy-rocm-py3.10 == *-bazel-* ]] 2025-12-04T09:19:11.2604822Z + cd test 2025-12-04T09:19:11.2605343Z + python -c 'import torch; print(torch.__config__.show())' 2025-12-04T09:19:12.0199481Z PyTorch built with: 2025-12-04T09:19:12.0199852Z - GCC 11.4 2025-12-04T09:19:12.0200259Z - C++ Version: 201703 2025-12-04T09:19:12.0200818Z - Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications 2025-12-04T09:19:12.0201547Z - Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d) 2025-12-04T09:19:12.0201957Z - OpenMP 201511 (a.k.a. 
OpenMP 4.5) 2025-12-04T09:19:12.0202276Z - LAPACK is enabled (usually provided by MKL) 2025-12-04T09:19:12.0202581Z - NNPACK is enabled 2025-12-04T09:19:12.0207600Z - CPU capability usage: AVX512 2025-12-04T09:19:12.0207878Z - HIP Runtime 7.1.25424 2025-12-04T09:19:12.0208061Z - MIOpen 3.5.1 2025-12-04T09:19:12.0208219Z - Magma 2.9.0 2025-12-04T09:19:12.0211168Z - Build settings: BLAS_INFO=mkl, BUILD_TYPE=Release, COMMIT_SHA=35b7a9a26c5923d98aebaa41a031dae21788a9ee, CXX_COMPILER=/opt/cache/bin/c++, CXX_FLAGS= -fvisibility-inlines-hidden -DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO -DLIBKINETO_NOCUPTI -DLIBKINETO_NOXPUPTI=ON -DUSE_FBGEMM -DUSE_FBGEMM_GENAI -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE -O2 -fPIC -DC10_NODEPRECATED -Wall -Wextra -Werror=return-type -Werror=non-virtual-dtor -Werror=range-loop-construct -Werror=bool-operation -Wnarrowing -Wno-missing-field-initializers -Wno-unknown-pragmas -Wno-unused-parameter -Wno-strict-overflow -Wno-strict-aliasing -Wno-stringop-overflow -Wsuggest-override -Wno-psabi -Wno-error=old-style-cast -faligned-new -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, LAPACK_INFO=mkl, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, TORCH_VERSION=2.10.0, USE_CUDA=OFF, USE_CUDNN=OFF, USE_CUSPARSELT=OFF, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_GLOO=ON, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=OFF, USE_NCCL=ON, USE_NNPACK=ON, USE_OPENMP=ON, USE_ROCM=ON, USE_ROCM_KERNEL_ASSERT=OFF, USE_XCCL=OFF, USE_XPU=OFF, 2025-12-04T09:19:12.0214138Z 2025-12-04T09:19:12.2940697Z + cd test 2025-12-04T09:19:12.2941148Z + python -c 'import torch; print(torch.__config__.parallel_info())' 2025-12-04T09:19:12.9681578Z ATen/Parallel: 2025-12-04T09:19:12.9681930Z at::get_num_threads() : 128 2025-12-04T09:19:12.9682119Z at::get_num_interop_threads() : 128 2025-12-04T09:19:12.9682309Z OpenMP 201511 (a.k.a. 
OpenMP 4.5) 2025-12-04T09:19:12.9682522Z omp_get_max_threads() : 128 2025-12-04T09:19:12.9682853Z Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications 2025-12-04T09:19:12.9683169Z mkl_get_max_threads() : 128 2025-12-04T09:19:12.9683401Z Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d) 2025-12-04T09:19:12.9683654Z std::thread::hardware_concurrency() : 128 2025-12-04T09:19:12.9683839Z Environment variables: 2025-12-04T09:19:12.9683990Z OMP_NUM_THREADS : [not set] 2025-12-04T09:19:12.9684147Z MKL_NUM_THREADS : [not set] 2025-12-04T09:19:12.9684308Z ATen parallel backend: OpenMP 2025-12-04T09:19:12.9684416Z 2025-12-04T09:19:13.2108505Z + [[ default == *numpy_2* ]] 2025-12-04T09:19:13.2108771Z + [[ linux-jammy-rocm-py3.10 == *aarch64* ]] 2025-12-04T09:19:13.2109036Z + [[ default == *backward* ]] 2025-12-04T09:19:13.2109268Z + [[ default == *libtorch_agnostic_targetting* ]] 2025-12-04T09:19:13.2109506Z + [[ default == *xla* ]] 2025-12-04T09:19:13.2109691Z + [[ default == *vllm* ]] 2025-12-04T09:19:13.2114213Z + [[ default == *executorch* ]] 2025-12-04T09:19:13.2114421Z + [[ default == \j\i\t\_\l\e\g\a\c\y ]] 2025-12-04T09:19:13.2114650Z + [[ default == \q\u\a\n\t\i\z\a\t\i\o\n ]] 2025-12-04T09:19:13.2114891Z + [[ linux-jammy-rocm-py3.10 == *libtorch* ]] 2025-12-04T09:19:13.2115123Z + [[ default == distributed ]] 2025-12-04T09:19:13.2115328Z + [[ default == *operator_benchmark* ]] 2025-12-04T09:19:13.2115557Z + [[ default == *operator_microbenchmark* ]] 2025-12-04T09:19:13.2115792Z + [[ default == *attention_microbenchmark* ]] 2025-12-04T09:19:13.2116021Z + [[ default == *inductor_distributed* ]] 2025-12-04T09:19:13.2116242Z + [[ default == *inductor-halide* ]] 2025-12-04T09:19:13.2116455Z + [[ default == *inductor-pallas* ]] 2025-12-04T09:19:13.2116676Z + [[ default == *inductor-triton-cpu* ]] 2025-12-04T09:19:13.2116915Z + [[ default == *inductor-micro-benchmark* ]] 
2025-12-04T09:19:13.2117161Z + [[ default == *aoti_cross_compile_for_windows* ]] 2025-12-04T09:19:13.2117398Z + [[ default == *huggingface* ]] 2025-12-04T09:19:13.2117592Z + [[ default == *timm* ]] 2025-12-04T09:19:13.2117784Z + [[ default == cachebench ]] 2025-12-04T09:19:13.2117987Z + [[ default == verify_cachebench ]] 2025-12-04T09:19:13.2118349Z + [[ default == *torchbench* ]] 2025-12-04T09:19:13.2118557Z + [[ default == *inductor_cpp_wrapper* ]] 2025-12-04T09:19:13.2118774Z + [[ default == *inductor_core* ]] 2025-12-04T09:19:13.2118994Z + [[ default == *inductor* ]] 2025-12-04T09:19:13.2119232Z + [[ default == *einops* ]] 2025-12-04T09:19:13.2119415Z + [[ default == *dynamo_core* ]] 2025-12-04T09:19:13.2119616Z + [[ default == *dynamo_wrapped* ]] 2025-12-04T09:19:13.2119791Z + [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:19:13.2119957Z + [[ -n '' ]] 2025-12-04T09:19:13.2120147Z + [[ 2 == 1 ]] 2025-12-04T09:19:13.2120331Z + [[ 2 == 2 ]] 2025-12-04T09:19:13.2120469Z + [[ 6 -gt 1 ]] 2025-12-04T09:19:13.2120607Z + install_torchvision 2025-12-04T09:19:13.2120752Z + local orig_preload 2025-12-04T09:19:13.2120897Z + local commit 2025-12-04T09:19:13.2166222Z ++ get_pinned_commit vision 2025-12-04T09:19:13.2166398Z ++ cat .github/ci_commit_pins/vision.txt 2025-12-04T09:19:13.2179578Z + commit=617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:19:13.2179795Z + orig_preload= 2025-12-04T09:19:13.2179917Z + '[' -n '' ']' 2025-12-04T09:19:13.2180056Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:19:13.2180460Z + pip_build_and_install git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e dist/vision 2025-12-04T09:19:13.2180861Z + local build_target=git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:19:13.2181125Z + local wheel_dir=dist/vision 2025-12-04T09:19:13.2181271Z + local found_whl=0 2025-12-04T09:19:13.2181414Z + for file in "${wheel_dir}"/*.whl 
2025-12-04T09:19:13.2181572Z + [[ -f dist/vision/*.whl ]] 2025-12-04T09:19:13.2181707Z + '[' 0 == 0 ']' 2025-12-04T09:19:13.2182022Z + python3 -m pip wheel --no-build-isolation --no-deps -w dist/vision git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:19:13.3732119Z Collecting git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:19:13.3733372Z Cloning https://github.com/pytorch/vision.git (to revision 617079d944b0e72632311c30ae2bbdf1168b901e) to /tmp/pip-req-build-7yyxq6rw 2025-12-04T09:19:13.3762485Z Running command git clone --filter=blob:none --quiet https://github.com/pytorch/vision.git /tmp/pip-req-build-7yyxq6rw 2025-12-04T09:19:16.7617861Z Running command git rev-parse -q --verify 'sha^617079d944b0e72632311c30ae2bbdf1168b901e' 2025-12-04T09:19:16.7623269Z Running command git fetch -q https://github.com/pytorch/vision.git 617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:19:17.4194592Z Resolved https://github.com/pytorch/vision.git to commit 617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:19:19.0921621Z Preparing metadata (pyproject.toml) ... done 2025-12-04T09:19:19.0958778Z Building wheels for collected packages: torchvision 2025-12-04T09:20:06.8172533Z Building wheel for torchvision (pyproject.toml) ...
done 2025-12-04T09:20:06.8193363Z Created wheel for torchvision: filename=torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl size=1808990 sha256=05bfb1df7c96e86986f14e55dd28f6289c377846116fcaae9ba53fc3caff0491 2025-12-04T09:20:06.8193846Z Stored in directory: /var/lib/jenkins/.cache/pip/wheels/12/b2/29/1f82685c5b5173629e1f36a9b93989ce92ce563e5fb91d27ac 2025-12-04T09:20:06.8223233Z Successfully built torchvision 2025-12-04T09:20:06.8799641Z + for file in "${wheel_dir}"/*.whl 2025-12-04T09:20:06.8811156Z + pip_install_whl dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:20:06.8811485Z + args=('dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl') 2025-12-04T09:20:06.8811706Z + local args 2025-12-04T09:20:06.8811886Z + [[ dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl == *\ * ]] 2025-12-04T09:20:06.8812134Z + for path in "${args[@]}" 2025-12-04T09:20:06.8812763Z + echo 'Installing dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl' 2025-12-04T09:20:06.8813057Z Installing dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:20:06.8813396Z + python3 -mpip install --no-index --no-deps dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:20:07.0311058Z Processing ./dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:20:07.0359251Z Installing collected packages: torchvision 2025-12-04T09:20:07.2378163Z Successfully installed torchvision-0.25.0a0+617079d 2025-12-04T09:20:07.2585910Z + '[' -n '' ']' 2025-12-04T09:20:07.2586070Z + test_python_shard 2 2025-12-04T09:20:07.2586184Z + [[ -z 6 ]] 2025-12-04T09:20:07.2586506Z + python test/run_test.py --exclude-jit-executor --exclude-distributed-tests --exclude-quantization-tests --shard 2 6 --verbose --upload-artifacts-while-running
2025-12-04T09:20:09.0393013Z Excluding inductor/test_max_autotune on ROCm 2025-12-04T09:20:09.0393282Z Excluding test_cuda_nvml_based_avail on ROCm 2025-12-04T09:20:10.0260752Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/pytorch/test/.pytorch-disabled-tests.json 2025-12-04T09:20:10.4055686Z Ignoring disabled issues: [''] 2025-12-04T09:20:10.4101102Z Found test times from artifacts 2025-12-04T09:20:10.4265686Z Found test times from artifacts 2025-12-04T09:20:10.4271097Z Running all tests 2025-12-04T09:20:10.4556972Z Running parallel tests on 1 processes 2025-12-04T09:20:10.4559827Z Name: tests to run (est. time: 181.89min) 2025-12-04T09:20:10.4559980Z Serial tests (94): 2025-12-04T09:20:10.4560387Z inductor/test_aot_inductor 1/3 2025-12-04T09:20:10.4560600Z inductor/test_torchinductor_dynamic_shapes 4/4 2025-12-04T09:20:10.4561070Z inductor/test_torchinductor_opinfo 4/12 2025-12-04T09:20:10.4561266Z inductor/test_torchinductor_opinfo 10/12 2025-12-04T09:20:10.4561457Z inductor/test_cpu_repro 2/5 2025-12-04T09:20:10.4561649Z inductor/test_smoke 1/1 2025-12-04T09:20:10.4561832Z inductor/test_compiled_autograd 2/2 2025-12-04T09:20:10.4561992Z inductor/test_mmdecomp 1/1 2025-12-04T09:20:10.4562168Z dynamo/test_ctx_manager 1/1 2025-12-04T09:20:10.4562337Z dynamo/test_exc 1/1 2025-12-04T09:20:10.4562479Z dynamo/test_misc 1/1 2025-12-04T09:20:10.4562598Z inductor/test_flex_attention 4/4 2025-12-04T09:20:10.4562729Z inductor/test_flex_decoding 2/2 2025-12-04T09:20:10.4562866Z inductor/test_triton_extension_backend 1/1 2025-12-04T09:20:10.4563063Z inductor/test_cutedsl_grouped_mm 1/1 2025-12-04T09:20:10.4563256Z inductor/test_cpp_wrapper_hipify 1/1 2025-12-04T09:20:10.4563449Z export/test_retraceability 1/1 2025-12-04T09:20:10.4564204Z dynamo/test_deque_reconstruct 1/1 2025-12-04T09:20:10.4564391Z inductor/test_utils 1/1 2025-12-04T09:20:10.4564564Z inductor/test_indexing 1/1 2025-12-04T09:20:10.4564747Z 
inductor/test_inductor_annotations 1/1 2025-12-04T09:20:10.4564942Z inductor/test_compile_worker 1/1 2025-12-04T09:20:10.4565122Z dynamo/test_einops 1/1 2025-12-04T09:20:10.4565295Z inductor/test_external_callables 1/1 2025-12-04T09:20:10.4565481Z dynamo/test_fx_passes_pre_grad 1/1 2025-12-04T09:20:10.4565665Z inductor/test_fp8 1/1 2025-12-04T09:20:10.4565839Z inductor/test_flex_flash 1/1 2025-12-04T09:20:10.4566014Z dynamo/test_model_output 1/1 2025-12-04T09:20:10.4566189Z inductor/test_metrics 1/1 2025-12-04T09:20:10.4566364Z export/test_unflatten_training_ir 1/1 2025-12-04T09:20:10.4566554Z inductor/test_triton_kernels 1/1 2025-12-04T09:20:10.4566730Z dynamo/test_modules 1/1 2025-12-04T09:20:10.4566910Z inductor/test_cudacodecache 1/1 2025-12-04T09:20:10.4567092Z dynamo/test_fx_graph_runnable 1/1 2025-12-04T09:20:10.4567273Z inductor/test_codegen_triton 1/1 2025-12-04T09:20:10.4567451Z dynamo/test_frame_init 1/1 2025-12-04T09:20:10.4567627Z inductor/test_device_assert 1/1 2025-12-04T09:20:10.4567944Z dynamo/test_skip_non_tensor 1/1 2025-12-04T09:20:10.4568123Z dynamo/test_skip_guard_eval_unsafe 1/1 2025-12-04T09:20:10.4568318Z inductor/test_decompose_mem_bound_mm 1/1 2025-12-04T09:20:10.4568507Z inductor/test_op_dtype_prop 1/1 2025-12-04T09:20:10.4568730Z inductor/test_control_flow 4/4 2025-12-04T09:20:10.4569173Z dynamo/test_structured_trace 1/1 2025-12-04T09:20:10.4569400Z export/test_hop 1/1 2025-12-04T09:20:10.4569610Z export/test_experimental 1/1 2025-12-04T09:20:10.4570761Z export/test_export 1/1 2025-12-04T09:20:10.4570985Z dynamo/test_comptime 1/1 2025-12-04T09:20:10.4571109Z test_mkl_verbose 1/1 2025-12-04T09:20:10.4571235Z test_comparison_utils 1/1 2025-12-04T09:20:10.4571408Z functorch/test_ac_logging 1/1 2025-12-04T09:20:10.4571541Z test_mkldnn_verbose 1/1 2025-12-04T09:20:10.4571663Z test_cpp_api_parity 1/1 2025-12-04T09:20:10.4571790Z nn/attention/test_open_registry 1/1 2025-12-04T09:20:10.4571928Z test_as_strided 1/1 2025-12-04T09:20:10.4572057Z 
test_proxy_tensor 1/1 2025-12-04T09:20:10.4572171Z test_matmul_cuda 1/1 2025-12-04T09:20:10.4572281Z xpu/test_gemm 1/1 2025-12-04T09:20:10.4572384Z test_fx_passes 1/1 2025-12-04T09:20:10.4572503Z functorch/test_logging 1/1 2025-12-04T09:20:10.4572626Z higher_order_ops/test_local_map 1/1 2025-12-04T09:20:10.4572760Z test_tensorexpr 1/1 2025-12-04T09:20:10.4572867Z test_jiterator 1/1 2025-12-04T09:20:10.4572977Z test_native_functions 1/1 2025-12-04T09:20:10.4573088Z test_typing 1/1 2025-12-04T09:20:10.4573203Z higher_order_ops/test_invoke_subgraph 1/1 2025-12-04T09:20:10.4573334Z test_decomp 3/12 2025-12-04T09:20:10.4573437Z test_decomp 9/12 2025-12-04T09:20:10.4573551Z test_legacy_vmap 1/1 2025-12-04T09:20:10.4573670Z higher_order_ops/test_print 1/1 2025-12-04T09:20:10.4573797Z test_per_overload_api 1/1 2025-12-04T09:20:10.4573918Z test_multiprocessing 1/1 2025-12-04T09:20:10.4574031Z test_meta 2/3 2025-12-04T09:20:10.4574139Z test_numpy_interop 1/1 2025-12-04T09:20:10.4574258Z profiler/test_cpp_thread 1/1 2025-12-04T09:20:10.4574382Z test_ops_gradients 1/2 2025-12-04T09:20:10.4574504Z distributions/test_constraints 1/1 2025-12-04T09:20:10.4574630Z test_linalg 1/2 2025-12-04T09:20:10.4574729Z test_modules 2/2 2025-12-04T09:20:10.4574839Z optim/test_swa_utils 1/1 2025-12-04T09:20:10.4575005Z cpp_extensions/python_agnostic_extension/test/test_python_agnostic 1/1 2025-12-04T09:20:10.4575194Z functorch/test_memory_efficient_fusion 1/1 2025-12-04T09:20:10.4575341Z torch_np/numpy_tests/lib/test_histograms 1/1 2025-12-04T09:20:10.4575482Z torch_np/test_indexing 1/1 2025-12-04T09:20:10.4575600Z test_tensorboard 1/1 2025-12-04T09:20:10.4576161Z test_numba_integration 1/1 2025-12-04T09:20:10.4576279Z test_functional_optim 1/1 2025-12-04T09:20:10.4576401Z test_maskedtensor 1/1 2025-12-04T09:20:10.4576517Z test_ops 2/5 2025-12-04T09:20:10.4576636Z torch_np/numpy_tests/core/test_dtype 1/1 2025-12-04T09:20:10.4576769Z lazy/test_debug_util 1/1 2025-12-04T09:20:10.4584429Z 
nn/test_load_state_dict 1/1 2025-12-04T09:20:10.4584613Z test_shape_ops 1/1 2025-12-04T09:20:10.4584723Z functorch/test_ops 1/4 2025-12-04T09:20:10.4584832Z test_nn 2/2 2025-12-04T09:20:10.4584932Z Parallel tests (0): 2025-12-04T09:20:10.4585045Z Name: excluded (est. time: 0.0min) 2025-12-04T09:20:10.4585163Z Serial tests (0): 2025-12-04T09:20:10.4585262Z Parallel tests (0): 2025-12-04T09:20:10.4585433Z Running inductor/test_aot_inductor 1/3 ... [2025-12-04 09:20:10.456733][2189472.917711404] 2025-12-04T09:20:10.4585628Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T09:20:10.4586054Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '--shard-id=1', '--num-shards=3', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:20:10.457199] 2025-12-04T09:27:32.4297228Z 2025-12-04T09:27:32.4297778Z inductor/test_aot_inductor 1/3 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_aot_inductor_1.3_2050bad61886c9f9_.log 2025-12-04T09:27:32.4355521Z Running 306 items in this shard: test/inductor/test_aot_inductor.py::TestAOTInductorConfig::test_compile_standalone_sets_package_cpp, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_64_num_groups_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_add_complex_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_addmm_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aot_inductor_consts_cpp_build_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_constant_tensor_name_collision_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_cpp_kernel_cpu, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_backward_no_op_logging_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_bmm_multiple_dynamic_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_boolean_indexing_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_buffer_mutation_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_mismatched_branch_output_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_non_tensor_predicates_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_non_tensor_predicates_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_predicate_on_cpu_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_share_predicate_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_simple_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_symint_input_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_unbacked_symint_closure_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_use_buffers_from_outer_scope_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_reinterpret_view_inputs_outputs_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_replace_view_ops_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_constant_folding_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_device_moved_constant_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_duplicated_params_cpu, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fake_tensor_device_validation_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fft_c2c_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_foreach_multiple_dynamic_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fp8_view_of_param_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_free_inactive_buffer_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_freezing_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fx_gm_return_tuple_validation_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_input_codegen_with_sympy_expr_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_int_list_input_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_issue_140766_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_grid_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_mmaped_weights_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_weight_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_linear_dynamic_maxautotune_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_masked_select_dynamic_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misaligned_input_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_multi_device_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_nested_tensor_from_jagged_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_non_default_gpu_device_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_normal_functional_cpu, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_output_path_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_pad_non_zero_memory_leak_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_poi_multiple_dynamic_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_proxy_executor_abs_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_proxy_executor_squeeze_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_pytree_inputs_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_quanatized_int8_linear_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_quantized_linear_bias_none_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_repeat_output_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_repeated_user_defined_triton_kernel_embed_kernel_binary_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_replace_unbacked_symbol_with_backed_expr_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_return_view_constant_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_reuse_kernel_dynamic_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_runtime_checks_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scatter_fallback_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_simple_dynamic_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_simple_multi_arch_embed_kernel_binary_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_simple_split_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_stft_cpu, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_subclasses_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_symbool_item_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_equal_to_1_float_arg_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_equal_to_1_float_arg_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_True_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_False_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_False_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_False_cpu, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_on_device_tma_dynamic_False_tma_version_new_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_on_device_tma_dynamic_False_tma_version_old_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_on_device_tma_dynamic_True_tma_version_new_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_reinterpret_view_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_1d_dynamic_False_tma_version_old_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_1d_dynamic_True_tma_version_new_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_1d_dynamic_True_tma_version_old_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_new_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_old_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_with_none_input_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_mutated_autotuning_cpu, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_0_use_static_size_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_0_use_static_size_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_1_use_static_size_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_3_use_static_size_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbounded_expr_substitutions_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_update_constant_buffer_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_using_model_name_for_files_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_weight_on_disk_legacy_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_simple_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_conv_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_outer_code_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_sym_expr_cond_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_sym_expr_cond_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_unbacked_symint_closure_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_with_cudagraphs_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_add_complex_cuda, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_addmm_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_amp_fallback_random_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_constant_tensor_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_debug_printer_codegen_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_assert_async_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_autotune_with_constant_folding_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_buffer_mutation_3_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_clamp_decomposition_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_composed_dynamic_size_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_nested_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_non_tensor_predicates_dynamic_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_share_predicate_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_simple_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_symint_input_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_unbacked_symint_closure_dynamic_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_use_buffers_from_outer_scope_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_constant_folding_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_constant_original_fqn_and_dtype_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_conv_freezing_cuda, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_d2h_copy_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_device_moved_constant_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_dynamic_smem_above_default_limit_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_empty_cat_dtype_promotion_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fill__fallback_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fqn_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_free_inactive_buffer_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_freezing_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_large_grid_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_large_mmaped_weights_on_disk_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_large_weight_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_linear_dynamic_maxautotune_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_nan_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_no_args_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_non_default_gpu_device_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_on_gpu_device1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_proxy_executor_hann_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_proxy_executor_permute_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_pytree_inputs_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_quantized_linear_cuda, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_repeated_user_defined_triton_kernel_embed_kernel_binary_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_return_constant_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_reuse_kernel_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_complex_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_same_backing_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_scaled_grouped_mm_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sdpa_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_multi_arch_embed_kernel_binary_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_from_multi_output_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_and_mul_expr_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_small_constant_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_stft_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_symint_item_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sympy_cpp_printer_min_max_minmax0_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sympy_cpp_printer_min_max_minmax1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_extern_kernel_arg_cuda, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_True_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_False_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_False_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_on_device_tma_dynamic_True_tma_version_new_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_on_device_tma_dynamic_True_tma_version_old_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_reinterpret_view_mem_leak_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_sympy_expr_arg_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_tma_descriptor_1d_dynamic_True_tma_version_new_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_new_cuda, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_old_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_tma_descriptor_2d_dynamic_True_tma_version_old_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_expr_replacements_shift_k_0_use_static_size_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_expr_replacements_shift_k_2_use_static_size_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_expr_replacements_shift_k_2_use_static_size_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_simple_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_conv_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_mixed_device_dynamic_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_sym_expr_cond_dynamic_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_unbacked_symint_closure_dynamic_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_unbacked_symint_closure_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_zero_grid_with_backed_symbols_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_zero_size_weight_cuda, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test__weight_int4pack_mm_m_32_n_64_q_group_64_num_groups_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_64_num_groups_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_add_complex_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aliased_buffer_reuse_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_amp_fallback_random_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aot_inductor_consts_cpp_build_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_debug_printer_codegen_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_debug_printer_fp8_dtype_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_user_defined_triton_kernel_profiling_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_autotune_with_constant_folding_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_autotuning_args_reuse_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_bmm_multiple_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_buffer_mutation_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_buffer_mutation_4_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_codegen_int_array_var_fix_memory_leak_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_composed_dynamic_size_mps, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_cpu_predicate_cuda_operands_max_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_mismatched_branch_output_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_nested_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_symint_input_disable_one_pass_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_with_multiple_outputs_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_with_replace_view_ops_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_constant_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_constant_original_fqn_and_dtype_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_constant_type_propagation_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_convolution_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_device_moved_constant_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_dynamic_scalar_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_dynamic_smem_above_default_limit_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_embedding_bag_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_empty_graph_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fake_tensor_device_validation_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fallback_kernel_with_symexpr_output_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fallback_mem_leak_fix_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fill__fallback_mps, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_foreach_multiple_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fp8_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fqn_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_free_inactive_buffer_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_inf_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_issue_140766_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_large_grid_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_large_weight_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_masked_select_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_missing_output_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_mixed_device_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_nan_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_narrow_fallback_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_nested_tensor_from_jagged_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_non_contiguous_output_alias_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_none_args_aot_codegen_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_output_misaligned_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_output_path_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_pad_non_zero_memory_leak_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_proxy_executor_permute_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_pytree_inputs_mps, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_quantized_linear_bias_none_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_repeated_user_defined_triton_kernel_embed_kernel_binary_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_replace_unbacked_symbol_with_backed_expr_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_return_view_constant_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_reuse_kernel_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_run_with_grad_enabled_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_dtype_failed_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_fp8_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_large_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_same_backing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_scatter_reduce_fallback_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_seq_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_simple_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_simple_embed_kernel_binary_False_max_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_size_with_unbacked_add_expr_transitive_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_small_constant_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_stride_with_unbacked_expr_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_sym_expr_indexing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_sym_i64_input_codegen_mps, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_sympy_cpp_printer_min_max_minmax1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_autotuning_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_multi_output_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_on_device_tma_dynamic_False_tma_version_old_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_reinterpret_view_mem_leak_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_with_none_input_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_with_none_inputs_and_equal_to_1_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_next_power_of_2_mps, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_0_use_static_size_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_1_use_static_size_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_2_use_static_size_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbounded_expr_substitutions_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_update_inactive_constant_buffer_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_update_user_managed_buffer_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_using_model_name_for_files_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_nested_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_simple_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_conv_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_mixed_device_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_mixed_device_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_outer_buffers_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_parameters_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_pytree_inputs_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_with_no_triton_profiler_mps, 
test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_with_profiler_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_zero_grid_with_backed_symbols_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_zero_grid_with_unbacked_symbols_mps 2025-12-04T09:27:32.4402106Z 2025-12-04T09:27:32.4402233Z Finished inductor/test_aot_inductor 1/3 ... [2025-12-04 09:27:32.429996][2189914.890980575], took 7.37min 2025-12-04T09:27:32.4402635Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T09:27:34.5986978Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T09:27:34.5987594Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T09:27:34.5988069Z Uploading artifacts took 0.00 seconds 2025-12-04T09:27:34.5988577Z Running inductor/test_torchinductor_dynamic_shapes 4/4 ... [2025-12-04 09:27:34.598574][2189917.059557164] 2025-12-04T09:27:34.5989076Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T09:27:34.5990641Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_dynamic_shapes.py', '--shard-id=4', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 09:27:34.598892] 2025-12-04T09:45:49.5850572Z 2025-12-04T09:45:49.5852303Z inductor/test_torchinductor_dynamic_shapes 4/4 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_dynamic_shapes_4.4_9bda75a700774265_.log 2025-12-04T09:45:49.5948788Z Running 518 items in this shard: test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test__dyn_quant_matmul_4bit_bf16_input_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test__dyn_quant_pack_4bit_weight_bf16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test__unsafe_masked_index_put_accumulate_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_abs_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_adaptive_avg_pool2d2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_adaptive_avg_pool_errors_with_long_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_adding_tensor_offsets_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_aoti_eager_override_registration_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_aoti_eager_support_out_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_arange3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_as_strided_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_assert_size_stride_op_name_pass_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d2_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d7_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d_backward3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_batch_norm_2d_2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bfloat16_to_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bitwise3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bmm1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_both_scalars_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_add_autotune_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int16_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int32_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int64_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_uint8_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_buffer_copied_in_graph_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_builtins_round_float_ndigits_pos_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cat_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cat_extern_kernel_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cat_unbacked_legacy_empty_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cauchy_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_clamp_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_clamp_type_promotion_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_complex_from_real_imag_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_concat_add_inplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_config_option_dont_assume_alignment_cudagraphs_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_const_int32_to_float_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv1d_depthwise_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv2d_channels_last_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv_functional_bn_fuse_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv_inference_heuristics_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_convolution1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_convolution2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_convolution3_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_convolution4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_copy_non_blocking_is_pinned_use_cat_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cumprod_zero_dim_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cumsum_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cumsum_pattern_matcher_issue_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cumsum_zero_dim_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_custom_op_fixed_layout_channels_last_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div6_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div_by_zero_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div_presicion_accuracy_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div_prim_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout_trivial_0_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout_trivial_1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_bfloat16_bfloat16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_bfloat16_float64_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_bfloat16_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float16_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float32_float32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float32_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float32_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float32_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float64_bfloat16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float64_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float64_int8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int16_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int16_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int32_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int32_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_float32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_int8_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int8_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_uint8_bfloat16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_uint8_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_uint8_int8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_elu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_empty2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_exact_stride_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_expand_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_expm1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fft_real_input_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fft_real_input_real_output_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fill1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_flexible_layout_immutable_free_symbols_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_flip_cat_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_float16_to_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_float_repr_dynamic_shapes_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fmod_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_forced_buffer_realize_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fractional_max_pool2d3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_full_like_transposed_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fuse_large_params_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_generate_rand_fp8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_getitem_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_glu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_gpu_scalar_with_gpu_tensor_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_graph_partition_arange2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_grid_sampler_2d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_horizonal_fusion1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_float_zero_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_propagation_abs_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_reinplace_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_indirect_load_broadcast_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inductor_assert_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inf_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inner_reduction_detection_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inplace_add_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inplace_where_pointwise_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_input_mutation5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_insignificant_strides_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_isinf2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_kernel_names_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_broadcast_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_grid_use_block_ptr_False_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_strided_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_like_rands2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linear1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linear2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linspace4_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_list_clearing_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_lite_mode_not_decompose_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_lite_regional_compile_flex_attention_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_lite_regional_compile_invoke_subgraph_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_low_memory_max_pool_dilation_1_dim_3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mark_dynamic_with_hint_override_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_masked_fill_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_matmul_layer_norm_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mix_device_index_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mul_index_expr_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multi_device_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multilayer_any_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mutable_custom_op_fixed_layout2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_nan_assert_inside_triton_kernel_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_nan_to_num_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_narrow_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_new_empty_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_new_ones_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_no_op_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_no_specization_over_symbolic_value_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_output_strides_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pad_view_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_permute1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pixel_shuffle_channels_last_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_chebyshev_polynomial_u_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_expm1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_gammaln_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_i0_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_i0e_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_i1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_i1e_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_modified_bessel_k0_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_psi_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_shifted_chebyshev_polynomial_v_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_spherical_bessel_j0_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pow1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pow2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pow_by_natural_log2_dynamic_shapes_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_randint_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_randint_int64_mod_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_randn_generator_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_reduction5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_reflection_pad2d_backward_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_reflection_pad2d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_relu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_remove_noop_slice_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_remove_noop_slice_scatter_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_remove_noop_view_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_repeat_interleave_Tensor_decomp_int64_nd_1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_roi_align_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_rsqrt_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_rsqrt_dynamic_shapes_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter6_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter_add3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter_bf16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scheduler_vertical_fusion1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_False_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sdpa_unaligned_mask_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_select_scatter_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_setitem_with_int_parameter_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sigmoid_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_silu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_simplify_loops_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sizehint_issue1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_scatter2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_scatter4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_scatter_dtype_consistency_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_scatter_reinplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_view_with_graph_break_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_softmax_one_kernel_loop_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_softmax_one_kernel_persist_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_cumsum_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_reduction_dynamic_shape_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum_int_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tensor_index_put_slice_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tmp_not_defined_issue2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_to_memory_format_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_triu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unbacked_float_item_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unfold_zero_dimension_tensor_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unroll_small_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unspec_inputs_bfloat16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unspec_inputs_float16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unspec_inputs_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_var_mean_tile_reduction_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_view_as_complex_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_view_as_real_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_views3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_weight_norm_bwd_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_weight_norm_conv2d_dynamic_shapes_cpu, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_where_with_logical_op_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_zero_element_mutation_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test__dyn_quant_pack_4bit_weight_bf16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test__unsafe_masked_index_put_accumulate_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_abs_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_avg_pool_with_output_size_0_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_max_pool2d2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_max_pool2d3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_add_complex3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_add_complex_strided_fallback_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_add_inplace_permuted_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adding_tensor_offsets_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_alexnet_prefix_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_aliased_buffer_reuse_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_aoti_eager_with_scalar_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_arange1_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_argmin2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_argmin_with_nan_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_to_float_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_as_strided_scatter_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_assert_alignment_op_name_fail_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_assert_size_stride_op_name_pass_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d_backward_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool_errors_with_uint_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_baddbmm_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_batch_norm_2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bmm2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_default_kwargs_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int32_int64_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int64_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int8_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int8_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int8_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_uint8_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_uint8_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_nd_tiling_False_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_builtins_round_float_ndigits_neg_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_builtins_round_float_ndigits_pos_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_builtins_round_int_ndigits_pos_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_builtins_round_int_ndigits_zero_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_inplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_clamp_type_promotion_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_compar_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_complex_from_real_imag_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_computed_buffer_inlining_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_config_option_dont_assume_alignment_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_config_option_dont_assume_alignment_recompiles_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_consecutive_split_cumprod_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_consecutive_split_cumsum_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_const_int32_to_float_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv1d_depthwise_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv1d_with_permute_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv2d_backward_channels_last_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv3d_channels_last_use_block_ptr_False_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv_bn_fuse_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv_functional_bn_fuse_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv_inference_heuristics_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cpu_scalar_with_gpu_tensor_dynamic_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cudnn_rnn_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cumsum_inf_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cumsum_zero_dim_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_op_1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_op_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_op_default_layout_constraint_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_op_fixed_layout_channels_last_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_scan_op_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_deterministic_codegen_on_graph_break_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div9_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div_by_zero_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div_precision_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div_softmax_symfloat_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dropout2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dropout_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtype_mismatch_issue_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_bfloat16_float16_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_bfloat16_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float16_bfloat16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float16_float16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float32_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float32_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float32_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float32_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float32_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float64_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float64_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int16_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int16_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int16_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int32_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int32_float64_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int64_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int64_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int64_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_bfloat16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_uint8_bfloat16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_uint8_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_uint8_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_uint8_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_elu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_empty1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_empty2_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_emulate_precision_triton_fp_fusion_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_erfinv_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_exact_stride_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_exp2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_expand_as_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_expand_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_expanded_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fallback_mutable_op_basic_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fallback_mutable_op_list_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fill1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fill2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_flexible_layout_immutable_free_symbols_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_flip_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_float_index_expression_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_float_index_expression_type_promotion_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_float_repr_dynamic_shapes_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_floordiv_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fmod_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_forced_buffer_realize_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_full_truncation_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_generate_rand_fp8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_glu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_graph_partition_arange1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_graph_partition_constant_tensor1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_graph_partition_refcount_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_graph_partition_unbacked_symint_as_output_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_grid_sampler_2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_propagation_remainder_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_failed_reinplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_fallback2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_index_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_remainder_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_select_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_tensor_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_indirect_load_broadcast_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inf_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inplace_add_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_input_mutation2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_input_mutation5_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_isinf_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_kwargs_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_large_broadcast_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_large_pointwise_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_leaky_relu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_like_rands_sliced_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linear_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linear_mixed_dtype_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linspace4_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_list_clearing_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_lite_dynamic_shape_assertion_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_log_fp64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_logaddexp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_logcumsumexp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_logsumexp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_low_memory_max_pool_dilation_1_dim_3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mark_unbacked_with_hint_override_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_masked_fill_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d6_dilation_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d_with_indices_backward5_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mix_device_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_move_arange_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_multi_gpu_device_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_multilayer_prime_size_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mutable_custom_op_fixed_layout2_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nan_sort_stable_False_descending_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_narrow_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_needs_contiguous_strides_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_new_ones_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nll_loss_backward_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nll_loss_forward_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_norm_constant_overflow_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_output_strides_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pad_single_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pattern_matcher_unbacked_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_permute1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pixel_shuffle_channels_last_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_chebyshev_polynomial_v_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_chebyshev_polynomial_w_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_digamma_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erfc_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erfcx_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erfinv_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_exp2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_gammaincc_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_i0e_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_log1p_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_modified_bessel_i0_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_modified_bessel_i1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_modified_bessel_k0_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_multigammaln_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_round_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_shifted_chebyshev_polynomial_u_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_shifted_chebyshev_polynomial_w_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_xlog1py_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_zeta_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_randint_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reduction3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reduction4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reduction_config_limit_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_relu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_remove_noop_clone_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_remove_noop_view_default_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_as_strided_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_Tensor_decomp_int32_nd_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_decomposition_has_clamp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_require_stride_expanded_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_round_correctness_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_rsqrt_dynamic_shapes_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scalar_cpu_tensor_arg_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scalar_output_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scaled_dot_product_attention_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter5_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_add2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_reduce1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_reduce2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sdpa_unaligned_mask_freezing_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_select_scatter_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_shape_prop_torch_ones_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_single_elem_indirect_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_mutation2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_mutation3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter4_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter_reinplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_softmax_one_kernel_loop_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sort_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sort_stable_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_failed_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_with_integer_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_with_list_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_with_sizes_with_unbacked_symints_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_stride_preservation_with_stride_modifying_fx_pass_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sum2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sum_dtype_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tanh_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tensor3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tensor_index_slice_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tmp_not_defined_issue1_use_block_ptr_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_to_device_constant_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_to_device_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_topk_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_transpose_add_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_transpose_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_triton_argmin_argmax_transpose_logical_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_triton_kernel_bool_param_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unspec_inputs_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unsqueeze_inplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_upsample_cat_conv_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_upsample_nearest3d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_var_correction_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_as_complex_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_on_aliased_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_uint8_through_differing_bitwidths_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views4_dynamic_shapes_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views7_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_weight_norm_bwd_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_weight_norm_conv2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_where_with_logical_op_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_zero_dim_reductions_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_zero_element_mutation_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_zeros_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_cat_unbacked_duplicate_size_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_constant_fold_uniform_value_dynamic_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_float_item_inf_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_float_item_return_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_return_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_unbacked_stride_nobreak_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_mark_unbacked_slice_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_math_ops_op5_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_math_ops_op9_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_multi_output_unbacked_custom_op_cuda, 
test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_non_persistent_dynamic_rblock_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_noops_tensor_repropagate_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_slice_index_changing_sign_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_sym_stride_lowering_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_unbacked_cat_backwards_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_unbacked_index_select_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_unwrap_storage_didnt_work_repro_cuda
2025-12-04T09:45:49.6032520Z
2025-12-04T09:45:49.6032676Z Finished inductor/test_torchinductor_dynamic_shapes 4/4 ... [2025-12-04 09:45:49.585438][2191012.046420974], took 18.25min
2025-12-04T09:45:49.6033097Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T09:45:49.6033457Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T09:45:49.6033750Z Running inductor/test_torchinductor_opinfo 4/12 ... [2025-12-04 09:45:49.591171][2191012.052161019]
2025-12-04T09:45:49.6033957Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T09:45:49.6034365Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_opinfo.py', '--shard-id=4', '--num-shards=12', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:45:49.591389]
2025-12-04T09:53:42.4373235Z
2025-12-04T09:53:42.4373969Z inductor/test_torchinductor_opinfo 4/12 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_opinfo_4.12_39a47bbc96d9aec5_.log
2025-12-04T09:53:42.4434186Z Running 299 items in this shard: test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_H_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_T_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rmul___cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rpow___cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rxor___cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_acos_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addmm_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addmv_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_all_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_amax_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_aminmax_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_arange_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argmax_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argmax_cuda_uint8,
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argsort_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_partial_views_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_asin_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atanh_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_1d_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_1d_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_3d_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bfloat16_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bfloat16_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cartesian_prod_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cat_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cdouble_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ceil_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_chalf_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_chunk_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_cuda_float16, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_max_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clone_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_column_stack_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_combinations_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_combinations_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_copysign_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_copysign_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cos_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cross_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummax_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummax_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummax_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummin_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_deg2rad_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diag_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diag_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_scatter_cuda_uint8, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diff_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_dist_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_floor_rounding_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_dsplit_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_dstack_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_strided_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erfinv_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erfinv_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expand_as_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expand_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fftshift_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fftshift_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fftshift_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifft2_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifft2_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifft_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifft_cuda_uint8, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftn_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftn_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfft2_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfftn_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfftn_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fill_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flatten_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flatten_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flatten_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fmax_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fmin_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fmod_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_full_like_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gt_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_histc_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_copy_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_amin_cuda_float32, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_amin_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_amin_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_select_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_int_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isclose_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isneginf_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isposinf_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isposinf_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isreal_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_binary_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_unary_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_unary_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_kron_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_kthvalue_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_le_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lerp_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lgamma_cuda_bool, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_diagonal_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_inv_ex_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_lstsq_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_norm_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_qr_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_solve_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_vecdot_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_vecdot_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_vecdot_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log10_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log1p_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log2_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logcumsumexp_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_and_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_xor_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logspace_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lt_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mH_cuda_float16, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_amax_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_amin_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_amin_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_argmin_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_cumsum_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_prod_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_scatter_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_softmin_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_std_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_std_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_std_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_std_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_var_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_binary_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_reduction_with_dim_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_maximum_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_median_cuda_uint8, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_meshgrid_variadic_tensors_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_reduction_no_dim_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mode_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mode_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_msort_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_msort_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mul_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_multinomial_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_1_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nanquantile_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nansum_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_narrow_copy_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ne_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_neg_cuda_float64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_neg_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_empty_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_empty_strided_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_full_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_ones_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_adaptive_avg_pool3d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_avg_pool1d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_batch_norm_without_cudnn_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_bilinear_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_channel_shuffle_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv3d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv_transpose1d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv_transpose2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv_transpose2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_cosine_similarity_cuda_float16, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_cross_entropy_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_fractional_max_pool3d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_glu_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_group_norm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_kl_div_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_pool1d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_pool3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_unpool3d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_multilabel_soft_margin_loss_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_normalize_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_circular_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_circular_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_float32, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pixel_unshuffle_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pixel_unshuffle_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_relu6_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_relu6_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_silu_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_silu_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softshrink_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softsign_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_threshold_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_triplet_margin_loss_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_unfold_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_upsample_bilinear_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nonzero_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_cuda_float16, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_nuc_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_outer_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_permute_copy_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_0_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_4_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_4_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_4_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_prod_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_randn_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ravel_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ravel_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_real_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_real_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reciprocal_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_remainder_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_remainder_cuda_int32, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_repeat_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reshape_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resize_as__cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resolve_neg_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resolve_neg_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rot90_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rsqrt_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rsub_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_add_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_prod_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_prod_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_select_scatter_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sgn_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_short_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sigmoid_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sign_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signbit_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signbit_cuda_float16, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sinc_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sinh_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_softmax_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_airy_ai_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_j0_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_y1_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_u_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_entr_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_erfcx_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_erfcx_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_laguerre_polynomial_l_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_log_ndtr_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_v_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_v_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_xlog1py_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_with_sizes_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sqrt_cuda_float64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_square_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_square_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_squeeze_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sum_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sum_to_size_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_t_copy_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_t_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_take_along_dim_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tan_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tanh_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tanh_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tanh_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tensor_split_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tensordot_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tile_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_to_sparse_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_to_sparse_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_topk_cuda_int64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_torch_ops_aten__flash_attention_forward_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_transpose_copy_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_transpose_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_transpose_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trapezoid_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trapz_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_triangular_solve_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_copy_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unfold_copy_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unfold_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unfold_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unique_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unique_cuda_uint16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_split_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vdot_cuda_float32, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_as_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_as_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_copy_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vstack_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_xlogy_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_zero__cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_zeros_like_cuda_int64
2025-12-04T09:53:42.4480151Z
2025-12-04T09:53:42.4480291Z Finished inductor/test_torchinductor_opinfo 4/12 ... [2025-12-04 09:53:42.437346][2191484.898331597], took 7.88min
2025-12-04T09:53:42.4480691Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T09:53:42.4481047Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T09:53:42.4481265Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading
2025-12-04T09:53:42.4481442Z Uploading artifacts took 0.00 seconds
2025-12-04T09:53:42.4481637Z Running inductor/test_torchinductor_opinfo 10/12 ... [2025-12-04 09:53:42.443966][2191484.90495555]
2025-12-04T09:53:42.4481838Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T09:53:42.4482292Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_opinfo.py', '--shard-id=10', '--num-shards=12', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:53:42.444157]
2025-12-04T10:03:56.6229561Z
2025-12-04T10:03:56.6231066Z inductor/test_torchinductor_opinfo 10/12 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_opinfo_10.12_e000a6faaf248245_.log
2025-12-04T10:03:56.6286358Z Running 308 items in this shard: test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_H_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___getitem___cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___getitem___cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___radd___cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rdiv___cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___ror___cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rxor___cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__unsafe_masked_index_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_abs_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_alias_copy_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_amin_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_arange_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argmin_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argsort_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_copy_cuda_int64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_asinh_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_1d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_2d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_3d_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_and_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_left_shift_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_or_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_right_shift_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_xor_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_block_diag_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bmm_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_broadcast_tensors_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_broadcast_to_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cat_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cat_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cauchy_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_char_cuda_int32, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_char_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cholesky_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_chunk_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clone_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clone_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_combinations_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_copysign_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_copysign_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_count_nonzero_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_count_nonzero_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cross_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummax_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummin_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cumprod_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cumulative_trapezoid_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_deg2rad_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diag_cuda_bool, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diag_embed_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_copy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_scatter_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diff_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diff_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_floor_rounding_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_floor_rounding_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_no_rounding_mode_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_double_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_double_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_like_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_strided_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_eq_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_eq_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_eq_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_equal_cuda_float64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expand_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft2_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft2_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft2_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fftshift_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfft_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftn_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfft2_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfft_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfftn_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfftn_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfftn_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fill_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flatten_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flip_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flip_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fliplr_cuda_float64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flipud_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_power_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_floor_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fmax_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fmin_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_full_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_full_like_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ge_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ge_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gradient_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gradient_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_grid_sampler_2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_grid_sampler_3d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hash_tensor_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_heaviside_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hsplit_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hsplit_cuda_int64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hstack_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_igammac_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_fill_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_mean_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_prod_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_prod_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_inner_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isclose_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isfinite_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isneginf_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_kthvalue_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lcm_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_le_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lerp_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_cond_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_eigh_cuda_float64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_inv_ex_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_ldl_factor_ex_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_lu_factor_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_lu_solve_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_pinv_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_vander_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_vander_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linspace_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_normal_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_not_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_not_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_xor_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_xor_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logsumexp_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_long_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_amin_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_amin_cuda_uint8, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_cumprod_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_cumsum_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_cumsum_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_fill_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_fill_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_median_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_normalize_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_prod_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_scatter_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_scatter_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_var_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_pool2d_with_indices_backward_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_reduction_no_dim_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_maximum_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_reduction_with_dim_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_reduction_with_dim_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mode_cuda_bool, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mode_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mul_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mv_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nan_to_num_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nanmedian_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_narrow_copy_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_zeros_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_zeros_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_adaptive_avg_pool2d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_alpha_dropout_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_channel_shuffle_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv1d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv_transpose3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_dropout_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_gelu_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_group_norm_cuda_float64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_huber_loss_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_interpolate_trilinear_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_layer_norm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_linear_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_unpool3d_grad_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_unpool3d_grad_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_normalize_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_normalize_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_constant_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_constant_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_constant_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_constant_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_float32, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_rrelu_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_rrelu_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_silu_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softmin_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softmin_with_dtype_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_tanhshrink_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_triplet_margin_loss_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nonzero_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_fro_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_normal_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ones_like_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ones_like_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_outer_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_outer_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_permute_copy_cuda_float64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_4_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_positive_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_prod_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_put_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rad2deg_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_randint_like_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ravel_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_real_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reciprocal_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_remainder_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_repeat_interleave_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reshape_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rot90_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_round_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_round_decimals_3_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_add_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_cuda_float32, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_amin_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_prod_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_select_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_select_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sigmoid_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_cosine_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_general_hamming_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signbit_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sin_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sin_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sin_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sin_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_slice_scatter_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_softmax_with_dtype_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_j1_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_y0_cuda_uint8, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_y1_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_t_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_t_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_u_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_v_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_v_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_entr_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_entr_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_entr_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_hermite_polynomial_h_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i1_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_legendre_polynomial_p_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_i0_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_i1_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_k0_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtr_cuda_float16, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtr_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtri_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtri_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_polygamma_special_polygamma_n_0_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_scaled_modified_bessel_k1_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_spherical_bessel_j0_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_xlog1py_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_xlog1py_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_zeta_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_list_args_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sqrt_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_squeeze_copy_cuda_float64, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_squeeze_copy_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_squeeze_multiple_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_stack_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_std_mean_unbiased_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_std_unbiased_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sub_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sum_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sum_to_size_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_svd_lowrank_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_t_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tan_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tile_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_to_sparse_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trace_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trace_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_transpose_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trapz_cuda_float32, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trapz_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tril_indices_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_copy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_copy_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unflatten_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unflatten_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unique_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_chunk_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_split_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsqueeze_copy_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vdot_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_as_complex_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_copy_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vstack_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_where_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_zeros_cuda_int32, 
test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_zeros_cuda_int64
2025-12-04T10:03:56.6334490Z
2025-12-04T10:03:56.6334629Z Finished inductor/test_torchinductor_opinfo 10/12 ... [2025-12-04 10:03:56.622790][2192099.083776856], took 10.24min
2025-12-04T10:03:56.6335044Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T10:03:56.6335400Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T10:03:56.6335630Z Running inductor/test_cpu_repro 2/5 ... [2025-12-04 10:03:56.628600][2192099.089589121]
2025-12-04T10:03:56.6335818Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T10:03:56.6336206Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cpu_repro.py', '--shard-id=2', '--num-shards=5', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:03:56.628790]
2025-12-04T10:15:00.8434582Z
2025-12-04T10:15:00.8435812Z inductor/test_cpu_repro 2/5 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cpu_repro_2.5_de016b027f95812d_.log
2025-12-04T10:15:00.8481195Z Running 178 items in this shard: test/inductor/test_cpu_repro.py::CPUReproTests::test_ModularIndexing_range_issue_103133, test/inductor/test_cpu_repro.py::CPUReproTests::test_acosh_with_negative_large_input, test/inductor/test_cpu_repro.py::CPUReproTests::test_attention_size_mismatch, test/inductor/test_cpu_repro.py::CPUReproTests::test_auto_zvec_vsx_simd, test/inductor/test_cpu_repro.py::CPUReproTests::test_complex_memory_overlap, test/inductor/test_cpu_repro.py::CPUReproTests::test_constant_store, test/inductor/test_cpu_repro.py::CPUReproTests::test_conv1d_strided_weight_torch_compile, test/inductor/test_cpu_repro.py::CPUReproTests::test_conv2d_packed, test/inductor/test_cpu_repro.py::CPUReproTests::test_conv_transpose2d_packed_cpu, test/inductor/test_cpu_repro.py::CPUReproTests::test_convert_fp32_int64_oob_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_convert_int8_to_half_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_dequant_maxpool2d_lowering_int8, test/inductor/test_cpu_repro.py::CPUReproTests::test_dequant_quant_lowering_fp8_e4m3, test/inductor/test_cpu_repro.py::CPUReproTests::test_dequant_quant_lowering_int8, test/inductor/test_cpu_repro.py::CPUReproTests::test_disabled_amp_is_inference_False, test/inductor/test_cpu_repro.py::CPUReproTests::test_double_pointwise_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_eliminate_meaningless_copy, test/inductor/test_cpu_repro.py::CPUReproTests::test_embedding_vec_bf16, test/inductor/test_cpu_repro.py::CPUReproTests::test_frexp, test/inductor/test_cpu_repro.py::CPUReproTests::test_full_bits_lowp, test/inductor/test_cpu_repro.py::CPUReproTests::test_fused_attention_conv,
test/inductor/test_cpu_repro.py::CPUReproTests::test_group_norm_backward_symint_divisible_channels, test/inductor/test_cpu_repro.py::CPUReproTests::test_highp_to_lowp_cse_var_cache_with_store, test/inductor/test_cpu_repro.py::CPUReproTests::test_horizontal_fusion, test/inductor/test_cpu_repro.py::CPUReproTests::test_in_out_buffer, test/inductor/test_cpu_repro.py::CPUReproTests::test_insert_to_dtype_count, test/inductor/test_cpu_repro.py::CPUReproTests::test_int64_reduction_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_int_div_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_invalid_index_of_empty_tensor, test/inductor/test_cpu_repro.py::CPUReproTests::test_linear_buffer_reuse, test/inductor/test_cpu_repro.py::CPUReproTests::test_linear_with_no_default_contiguous_input, test/inductor/test_cpu_repro.py::CPUReproTests::test_load_same_bool_tensor_twice, test/inductor/test_cpu_repro.py::CPUReproTests::test_local_buffer_in_outer_loop_fusion, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_7, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_7, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_nn_param_assign, test/inductor/test_cpu_repro.py::CPUReproTests::test_nn_param_assign_wrapped, test/inductor/test_cpu_repro.py::CPUReproTests::test_outer_looop_fusion_with_local_buf, test/inductor/test_cpu_repro.py::CPUReproTests::test_pad_with_nan_value, test/inductor/test_cpu_repro.py::CPUReproTests::test_scatter_using_atomic_add, test/inductor/test_cpu_repro.py::CPUReproTests::test_scatter_using_atomic_add_vec, 
test/inductor/test_cpu_repro.py::CPUReproTests::test_select_tiliing_with_index_expr, test/inductor/test_cpu_repro.py::CPUReproTests::test_sigmoid_with_reduction, test/inductor/test_cpu_repro.py::CPUReproTests::test_tile2d_load_decomposed_dequant_add_relu_quant_int8, test/inductor/test_cpu_repro.py::CPUReproTests::test_to_dtype_bool_float, test/inductor/test_cpu_repro.py::CPUReproTests::test_transpose_copy, test/inductor/test_cpu_repro.py::CPUReproTests::test_transpose_mxn_16_16_bf16_fp16, test/inductor/test_cpu_repro.py::CPUReproTests::test_transpose_mxn_32_32_bf16_fp16, test/inductor/test_cpu_repro.py::CPUReproTests::test_uint32_reduction_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_vec_indirect_load_cse_cache, test/inductor/test_cpu_repro.py::CPUReproTests::test_vec_logical, test/inductor/test_cpu_repro.py::CPUReproTests::test_vec_remainder 2025-12-04T10:15:00.8518868Z 2025-12-04T10:15:00.8518985Z Finished inductor/test_cpu_repro 2/5 ... [2025-12-04 10:15:00.843315][2192763.304300998], took 11.07min 2025-12-04T10:15:00.8519378Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:15:00.8519732Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:15:00.8519954Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T10:15:00.8520177Z Uploading artifacts took 0.00 seconds 2025-12-04T10:15:00.8520349Z Running inductor/test_smoke 1/1 ... [2025-12-04 10:15:00.849394][2192763.310382944] 2025-12-04T10:15:00.8520525Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:15:00.8520903Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_smoke.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:15:00.849590] 2025-12-04T10:15:06.1210279Z 2025-12-04T10:15:06.1211260Z inductor/test_smoke 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_smoke_1.1_becebdf3efc74d55_.log 2025-12-04T10:15:06.1211827Z 2025-12-04T10:15:06.1212067Z Finished inductor/test_smoke 1/1 ... [2025-12-04 10:15:06.120685][2192768.581671019], took 0.09min 2025-12-04T10:15:06.1214084Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:15:06.1268793Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:15:06.1272776Z Running inductor/test_compiled_autograd 2/2 ... [2025-12-04 10:15:06.126983][2192768.587972563] 2025-12-04T10:15:06.1273182Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:15:06.1273942Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_compiled_autograd.py', '--shard-id=2', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:15:06.127184] 2025-12-04T10:20:43.6144189Z 2025-12-04T10:20:43.6146160Z inductor/test_compiled_autograd 2/2 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_compiled_autograd_2.2_7b669a18eae58299_.log 2025-12-04T10:20:43.6218438Z Running 423 items in this shard: test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_accumulate_grad_accuracy, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_accumulate_grad_polyfill_case_1_5_1, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_accumulate_grad_polyfill_case_2_1, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_accumulate_grad_polyfill_case_2_3_1, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_accumulate_grad_polyfill_case_2_3_2, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_accumulate_grad_polyfill_case_2_3_3, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_accumulate_without_zero, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_aot_bwd_gm_runnable, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_autograd_cpp_node_basic_is_traceable_False, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_autograd_cpp_node_data_dependent_is_traceable_False, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_autograd_cpp_node_id_is_traceable_False, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_autograd_cpp_node_saved_basic_is_traceable_False, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_autograd_cpp_node_saved_basic_is_traceable_True, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_autograd_cpp_node_saved_dynamic_is_traceable_False, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_autograd_cpp_node_saved_float_is_traceable_False, 
test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_basic, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_callback_graph_break_throws_error, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_compile_api_api_optimize_backend_eager, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_compile_api_api_optimize_backend_inductor, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_compile_api_disable_api_compile_backend_aot_eager, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_compile_api_disable_api_optimize_backend_aot_eager, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_compile_api_disable_api_optimize_backend_eager, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_compile_api_disable_api_optimize_backend_inductor, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_cudagraphs_cpu_division, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_custom_fn_compiled_fw_graph_break, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_custom_fn_non_variable_input, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_custom_fn_output_metadata, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_custom_fn_saved_multiple_tensors_dedup, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_custom_fn_saved_shape_tensor, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_custom_fn_with_same_graph, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_dont_dce_side_effects, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_dynamic_shapes, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_dynamic_shapes_from_forward, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_dynamo_flaky_segfault, 
test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_free_activation_memory, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_graph_break_custom_op, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_implicit_add, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_inputs_aliasing_bytecode_attr_mutations, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_keep_graph_simple, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_logs, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_logs_aot_bwd_reuse, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_mismatch_fake_tensor_mode, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_mismatch_fake_tensor_mode_dynamic_shape, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_multiple_torch_compile, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_nested_compile, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_nested_context_manager, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_no_nested_compiled_autograd, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_no_output_nodes_all_leaves, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_no_output_nodes_different_leaves_will_recompile, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_no_output_nodes_some_leaves, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_output_nodes_some_leaves, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_reorder_acc_grad, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_reorder_all_bwd_hooks, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_reorder_multi_post_hooks, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_reorder_post_hook1, 
test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_reorder_post_hook2, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_reorder_post_hook3, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_tensor_grad_hook3, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_tensor_subclass_basic, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_torch_compile, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_torch_compile_api_dynamic_shapes, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_torch_compile_graph_break, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_torch_compile_graph_break2, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_torch_dispatch_mode, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_trace_auto_functionalized, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_trace_auto_functionalized_v2, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_verbose_logs_aot_id, test/inductor/test_compiled_autograd.py::TestCompiledAutograd::test_verbose_logs_graph, test/inductor/test_compiled_autograd.py::WrapTestClassTests::test_wrap_preserves_inheritance_and_super, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_accumulate_grad_tensor_reference, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_anomaly_grad_warnings, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_attribute_deletion, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_autograd_node_isinstance, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_autograd_print_tensor, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_autograd_python_custom_function_inplace, 
test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_backward, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_backward_to_node, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_backward_twice_retained_graph_without_saved_values, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_backward_twice_without_saved_values, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_backward_with_nonleaf_inputs, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_backward_with_scalar_input, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_callback_propagates_errors_from_device_thread, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_checkpoint_sequential_warns_if_use_reentrant_not_passed_explcitly, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_checkpoint_warns_if_use_reentrant_not_passed_explcitly, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_checkpointing_non_reentrant_autocast_cpu, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_checkpointing_non_reentrant_autocast_gpu, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_checkpointing_without_reentrant_arbitrary_input_output, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_checkpointing_without_reentrant_detached_tensor_use_reentrant_True, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_checkpointing_without_reentrant_parameter_used_in_an_out, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_checkpointing_without_reentrant_saved_object_identity, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_checkpointing_without_reentrant_with_context_fn, 
test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_copy_slices_graph_task_updates, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_current_graph_task_id, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_current_node, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_forward_mode_forward_is_no_op, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_forward_mode_inplace_checks, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_forward_mode_view_checks, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_inplace_on_non_default_view, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_inplace_on_view_of_leaf, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_local_inplace, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_mark_output_view_of_intermediate, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_no_tensors, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_non_tensor_inputs_outputs, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_return_view_in_nograd, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_save_for_forward, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_setup_context_multi_input, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_custom_function_setup_context_multi_output, 
test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_default_saved_tensors_hooks_double_backward, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_detach, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_diagonal_expanded_v, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_dir, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_dont_materialize_grads, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_free_deep_graph, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_full_backward_hook_double_backward, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_function_returns_input, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_function_returns_undefined_tensor, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gc_in_destructor, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_badcalls, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_dtype, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_fn_attr_bindings, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_fn_prehooks_remove_hooks, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_materialize_grads, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_mode_class_decoration, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_mode_restored_reentrant, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_nonleaf_many_outputs, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_to_node_multi, 
test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_to_node_set, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_grad_unreachable, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_backward_mul_by_grad_output, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_check_no_differentiable_outputs, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_default_device_placement_context, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_forward_ad_batched_grad, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_get_analytical_jacobian, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_get_numerical_jacobian, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_input_layout0, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_input_layout1, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_input_layout3, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_jacobian_mismatch, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_multiple_mkldnn_inputs, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_nondeterministic, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_single_input, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_undefined_grad, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradcheck_validates_input_mkldnn, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradient_edge_graph_ownership, 
test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_gradient_edge_output, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_graph_save_on_cpu_cuda, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_hessian_vector, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_hook_closure_cycle_use_custom_function_False_use_tensor_hook_False, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_hook_closure_cycle_use_custom_function_False_use_tensor_hook_True, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_hook_closure_cycle_use_custom_function_True_use_tensor_hook_False, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_hook_closure_cycle_use_custom_function_True_use_tensor_hook_True, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_hook_with_no_name, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_hooks, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_increment_version, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_index_backward_does_not_save_tensor, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_indexing_duplicates, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_inplace, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_inplace_on_view_saved_output, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_input_buffer_accum, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_invalid_gradients, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_isolated_node, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_leaf_assignment, 
test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_mark_non_differentiable_mixed, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_mark_non_differentiable_none, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_multi_grad_all_hooks, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_multi_grad_any_hooks, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_multi_grad_hooks_invalid_mode, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_multiple_insert_removal_caching, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_naughty_autograd_function_attribute_access, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_nested_anomaly_detect_nan, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_no_grad, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_no_grad_assignment, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_no_grad_copy, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_no_grad_copy_sparse, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_no_grad_input, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_no_grad_modifies_version, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_no_unnecessary_unwrapping, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_node_ordering_when_none_returned, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_node_post_hook_registered_during_unpack_hook, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_not_implemented_grad, 
test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_numpy_requires_grad, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_once_differentiable, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_out_variant_raises_when_inputs_require_grad, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_pack_hook_with_inplace_modification_should_fail, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_post_accumulate_grad_hook_e2e, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_post_accumulate_grad_hook_multiple_hooks, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_post_accumulate_grad_hook_multiple_tensors, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_post_accumulate_grad_hook_on_non_leaf, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_post_accumulate_grad_hook_ordering, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_profiler_aggregation_fake, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_profiler_aggregation_lstm, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_profiler_propagation, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_profiler_unboxed_only, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_pynode_destruction_deadlock, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_record_function_callbacks, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_record_function_legacy, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_record_function_multithreaded, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_reentrant_priority, 
test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_reentrant_with_callbacks_both_depths, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_reentrant_with_callbacks_depth_1, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_reentrant_with_non_leaf_variable_hook, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_requires_grad, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_requires_grad_inplace, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_retain_grad_inplace, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_retain_grad_inplace_over_view, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_retains_grad_can_always_observe_tensor_prehook, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_return_leaf_inplace, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_saved_tensor_hooks_custom_error_propagation, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_saved_tensor_hooks_extra_exit_during_bw_no_crash, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_saved_tensors_hook_version_counter_not_shared, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_saved_variable_packing_unpacking_did_not_save_original_with_default_hooks, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_saved_variable_packing_unpacking_saved_original_with_hooks, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_saved_variable_saved_original_inplace_detach, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_saved_variables_deprecated, 
test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_saving_variable_to_disk, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_select_sum, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_set_data_preserve_pyobj, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_set_data_self_requires_grad, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_set_grad_coroutines, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_set_grad_coroutines_critical_exceptions, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_set_grad_coroutines_exit, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_set_grad_enabled, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_set_grad_generator_functions_recursive, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_setitem, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_setitem_mask, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_setting_default_saved_variable_hooks_twice_should_not_fail, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_setting_default_saved_variable_hooks_twice_should_use_inner, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_setup_context_when_forward_has_default_args, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_simple_reentrant, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_slice_expanded_v, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_sparse_gather_dim0, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_sparse_gather_dim1, 
test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_sparse_gather_x_scalar, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_sparse_mm_backward, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_tensor_hooks_inplace, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_tensor_hooks_inplace_over_view, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_to_sparse_backward, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_type_conversions, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_unpack_hooks_exec_count, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_unsafe_set_version_counter, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_unused_grad_requires_grad_with_materialize, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_variable_traverse, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_view_func_replay, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_view_replay_enabled, test/inductor/test_compiled_autograd.py::TestAutogradWithCompiledAutograd::test_wrapped_number_saved_tensors_hooks, test/inductor/test_compiled_autograd.py::TestNestedCheckpointWithCompiledAutograd::test_nested_checkpoint_kwargs_early_stop_False, test/inductor/test_compiled_autograd.py::TestNestedCheckpointWithCompiledAutograd::test_nested_checkpoint_non_tensor_inputs_and_outputs_early_stop_False, test/inductor/test_compiled_autograd.py::TestNestedCheckpointWithCompiledAutograd::test_nested_checkpoint_same_graph_early_stop_False, test/inductor/test_compiled_autograd.py::TestNestedCheckpointWithCompiledAutograd::test_nested_checkpoint_set_early_stop, 
test/inductor/test_compiled_autograd.py::TestNestedCheckpointWithCompiledAutograd::test_nested_checkpoint_set_early_stop_no_recompution_needed, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_abstract_impl_on_existing_op_with_CompositeImplicitAutograd, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_abstract_impl_on_existing_op_with_meta, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_autogen_aten_ops_are_pt2_compliant, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_autograd_function_backed_op, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_autograd_notimplemented, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_autograd_notimplemented_gradmode, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_dict_invalid_keys, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_dict_requires_keys_for_input_optional_tensors, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_dict_requires_keys_for_input_tensors, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_grads_are_tensor_or_none, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_impl_on_existing_op, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_impl_on_existing_op_CompositeImplicitAutograd, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_impl_on_existing_op_incorrect_schema_views, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_impl_on_existing_op_with_key_key_Autograd, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_impl_on_existing_op_with_key_key_AutogradCPU, 
test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_output_differentiability_non_tensor, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_output_differentiability_numel, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_output_differentiability_type, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_partially_registered, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_returns_dict, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_tensorlist_input_requires_list_grads, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_backward_tensorlist_input_requires_list_grads_none_or_Tensor, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_builtin_aten_ops_are_pt2_compliant, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_builtin_torchscript_ops, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_data_dependent_compile, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_data_dependent_fake_tracing, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_define_and_impl, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_define_bad_schema, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_define_validation, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_define_with_tags_list, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_define_with_tags_single, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_define_with_tags_tuple, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_functionalize_error, 
test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_impl_cpu, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_impl_device_cuda, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_impl_device_function, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_impl_device_invalid, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_impl_function, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_impl_meta, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_impl_on_existing_op, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_impl_on_existing_op_with_cpu_registration_key_CUDA, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_impl_on_existing_op_with_cpu_registration_key_CompositeExplicitAutograd, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_incorrect_schema_types, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_infer_schema_no_return, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_lifetime, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_load_library, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_not_implemented_error, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_override_cea, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_override_fake, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_reserved_ns, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_resolve_packet, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_save_for_backward_inputs_are_namedtuple, 
test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_schema_matches_signature, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_sequences, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_supported_return_types_multi_return, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_supported_return_types_single_return, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_supported_schemas, test/inductor/test_compiled_autograd.py::TestCustomOpWithCompiledAutograd::test_unsupported_param_types, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_access_module_attr, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_capture_global_num, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_capture_global_num_adds_guard, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_capture_tracked_nested, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_capture_untracked_global, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_capture_untracked_nonlocal, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_capture_value_created_in_subgraph, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_concat_unbacked_shape_tensor, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_cond_branches_no_arguments_no_closure, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_cond_pytree_operands_with_non_tensor_leaves, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_cond_subgraph_name_is_valid, 
test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_cond_with_empty_operands, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_dynamic_shapes_over_vmap_batch_size, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_enum_arg, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_error_message_sane, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_fallback_on_graph_break_complicated, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_flat_list_output, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_fn_with_kwargs_in_torch_ops, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_hints_wrapper, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_hints_wrapper_incorrect_type, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_hints_wrapper_pytree_inputs, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_hooks, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_inlined_functions, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_lift_tensor_constant, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_lift_tensors_with_shared_symbols, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_make_closure, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_map_example_value_metadata_consistent_with_eager, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_map_graph_break, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_map_side_effect, 
test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_map_symint_input, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_modules, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_register_mode, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_same_freevar_twice, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_side_effect_del_existing_attr_global_module, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_side_effect_del_existing_attr_nonlocal_module, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_side_effect_in_body, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_side_effect_mutate_global_tensor_builtin, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_side_effect_mutate_nonlocal_tensor, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_side_effect_set_existing_attr_nonlocal_obj, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_side_effect_set_new_attr_global_obj, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_side_effect_set_new_attr_nonlocal_module, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_side_effect_set_new_attr_nonlocal_obj, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_support_float_in_output, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_symint_input, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_tensor_and_unbacked_symbol_closure, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_tensor_to_list_closure, 
test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_tensor_with_unbacked_shape_closure, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_wrap_all_kwarg, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_wrap_inductor_compiled_regions_with_backward, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_wrap_kwarg_default, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_wrap_kwarg_default_if_branch, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_wrap_kwarg_int, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_wrap_pytree_args_nested, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_wrap_pytree_args_not_const_symint_tensor, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_wrap_pytree_args_with_symint_constant, test/inductor/test_compiled_autograd.py::HigherOrderOpTestsWithCompiledAutograd::test_wrap_subgraph_name_is_valid, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_functional_call, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_functional_call_disable_inline_nn_module, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_grad, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_grad_capture_tensor, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_grad_closure_scalar, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_grad_non_tensor_input, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_grad_over_grad, 
test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_grad_two_tensor_all_grad_has_aux, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_grad_two_tensor_has_aux, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_jacfwd_randomness, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_jacfwd_two_tensors_argnums, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_jacrev, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_jvp_freevar_python_scalar, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_jvp_jvp, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_jvp_two_tensors_disable_enable_disable_grad, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_jvp_two_tensors_disable_grad, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_linearize_jvp_fn, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vjp, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vjp_has_aux, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_call_compiled_backward_fn, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_free_tensor, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_get_wrapped, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_kwargs, 
test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_multiple_outputs, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_multiple_outputs_out_dims_tuple, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_new_tensor_implicit_via_op, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_new_tensor_in_body, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_new_tensor_unused_in_body, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_out_dims_None, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_over_vmap_two_inputs, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_previous_illegal_op_no_graph_break, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_recompile_with_randomness, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_with_graph_break_2, test/inductor/test_compiled_autograd.py::FuncTorchHigherOrderOpTestsWithCompiledAutograd::test_vmap_with_graph_break_lambda, test/inductor/test_compiled_autograd.py::ActivationCheckpointingTestsWithCompiledAutograd::test_cond_with_kwargs, test/inductor/test_compiled_autograd.py::ActivationCheckpointingTestsWithCompiledAutograd::test_cond_with_mismatched_output, test/inductor/test_compiled_autograd.py::ActivationCheckpointingTestsWithCompiledAutograd::test_dropout, test/inductor/test_compiled_autograd.py::ActivationCheckpointingTestsWithCompiledAutograd::test_fallback, test/inductor/test_compiled_autograd.py::ActivationCheckpointingTestsWithCompiledAutograd::test_flop_counter_for_nested_cond, 
test/inductor/test_compiled_autograd.py::ActivationCheckpointingTestsWithCompiledAutograd::test_function_with_kwargs, test/inductor/test_compiled_autograd.py::ActivationCheckpointingTestsWithCompiledAutograd::test_override_fallthrough_dispatch_key, test/inductor/test_compiled_autograd.py::TestDTensorCompileWithCompiledAutograd::test_dtensor_attribute_access_on_intermediate, test/inductor/test_compiled_autograd.py::TestDTensorCompileWithCompiledAutograd::test_dtensor_basic, test/inductor/test_compiled_autograd.py::TestDTensorCompileWithCompiledAutograd::test_dtensor_contiguous_dtensor_noncontiguous_local_as_tangent, test/inductor/test_compiled_autograd.py::TestDTensorCompileWithCompiledAutograd::test_dtensor_dynamic_cat, test/inductor/test_compiled_autograd.py::TestDTensorCompileWithCompiledAutograd::test_dtensor_dynamic_recompiles, test/inductor/test_compiled_autograd.py::TestDTensorCompileWithCompiledAutograd::test_dtensor_noncontiguous_output, test/inductor/test_compiled_autograd.py::TestDTensorCompileWithCompiledAutograd::test_dynamo_dtensor_from_local, test/inductor/test_compiled_autograd.py::TestDTensorCompileWithCompiledAutograd::test_get_local_rank_compile, test/inductor/test_compiled_autograd.py::TestDTensorCompileWithCompiledAutograd::test_tp_compile_comm_reordering, test/inductor/test_compiled_autograd.py::TestDTensorCompileWithCompiledAutograd::test_tp_compile_comm_reordering_graph_partition, test/inductor/test_compiled_autograd.py::TestCompiledAutogradOpInfoCUDA::test_hops_in_bwd_auto_functionalize_simple_cuda_float32, test/inductor/test_compiled_autograd.py::TestCompiledAutogradOpInfoCUDA::test_hops_in_bwd_flex_attention_backward_simple_cuda_float32, test/inductor/test_compiled_autograd.py::TestCompiledAutogradOpInfoCUDA::test_hops_in_bwd_flex_attention_simple_cuda_float32, test/inductor/test_compiled_autograd.py::TestCompiledAutogradOpInfoCUDA::test_hops_in_bwd_invoke_quant_simple_cuda_float32, 
test/inductor/test_compiled_autograd.py::TestCompiledAutogradOpInfoCUDA::test_hops_in_bwd_map_triple_nested_cuda_float32, test/inductor/test_compiled_autograd.py::TestCompiledAutogradOpInfoCUDA::test_hops_in_bwd_scan_simple_cuda_float32, test/inductor/test_compiled_autograd.py::TestCompiledAutogradOpInfoCUDA::test_hops_in_bwd_while_loop_stack_output_simple_cuda_float32
2025-12-04T10:20:43.6285218Z
2025-12-04T10:20:43.6285356Z Finished inductor/test_compiled_autograd 2/2 ... [2025-12-04 10:20:43.614370][2193106.075355352], took 5.62min
2025-12-04T10:20:43.6285768Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T10:20:43.6286134Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T10:20:43.6286355Z Running inductor/test_mmdecomp 1/1 ... [2025-12-04 10:20:43.619809][2193106.08079822]
2025-12-04T10:20:43.6286539Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T10:20:43.6286932Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_mmdecomp.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ...
[2025-12-04 10:20:43.620028]
2025-12-04T10:20:56.3179055Z
2025-12-04T10:20:56.3179920Z inductor/test_mmdecomp 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_mmdecomp_1.1_3b15fe227b685fd2_.log
2025-12-04T10:20:56.3183998Z Running 28 items in this shard: test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_bfloat16_bs_10_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_bfloat16_bs_1_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_bfloat16_bs_2_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_bfloat16_bs_4_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_float32_bs_10_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_float32_bs_1_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_float32_bs_2_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_float32_bs_4_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_bmm_batch2_last_dim_size_is_one_cuda, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_dynamic_shape_mm_bfloat16_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_dynamic_shape_mm_float32_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_simple_mm_bfloat16_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_simple_mm_float32_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_bfloat16_bs_10_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_bfloat16_bs_1_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_bfloat16_bs_2_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_bfloat16_bs_4_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_float32_bs_10_cuda_float32, 
test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_float32_bs_1_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_float32_bs_2_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_float32_bs_4_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_int32_bs_10_cuda_int32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_int32_bs_1_cuda_int32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_int32_bs_2_cuda_int32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_int32_bs_4_cuda_int32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_bfloat16_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_float32_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_int32_cuda_int32
2025-12-04T10:20:56.3187910Z
2025-12-04T10:20:56.3188028Z Finished inductor/test_mmdecomp 1/1 ... [2025-12-04 10:20:56.317721][2193118.778707661], took 0.21min
2025-12-04T10:20:56.3188431Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T10:20:56.3234564Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T10:20:56.3236996Z Running dynamo/test_ctx_manager 1/1 ... [2025-12-04 10:20:56.323640][2193118.784627071]
2025-12-04T10:20:56.3237185Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T10:20:56.3239493Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_ctx_manager.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ...
[2025-12-04 10:20:56.323876]
2025-12-04T10:21:16.0448211Z
2025-12-04T10:21:16.0448880Z dynamo/test_ctx_manager 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_ctx_manager_1.1_1da63d81ab26ce6a_.log
2025-12-04T10:21:16.0470804Z Running 104 items in this shard: test/dynamo/test_ctx_manager.py::CtxManagerTests::test_311_resume_block_keyerror, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_311_resume_block_keyerror2, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_arguments_binding, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_cpu, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_cpu_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_cpu_graph_break_2, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_cpu_graph_break_inner_fn, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_decorator, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_device, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_float64, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_graph_break_method, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_sdpa, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autograd_profiler, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autograd_profiler_enabled, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_context_wrapping_grad_mode_decorator, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_context_wrapping_grad_mode_nested_function_decorator, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_context_wrapping_set_grad_enabled_nested_function, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_amp_autocast, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_device, 
test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_event_across_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_event_created_outside_of_graph, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_event_method, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_event_method_create_stream_outside_of_compile, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_event_reconstruct, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_across_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_compared_with_constant, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_compared_with_stream, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_context_manager1, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_context_manager2, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_method, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_disable_saved_tensors_hooks, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_disable_saved_tensors_hooks_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_disable_saved_tensors_hooks_prev_disabled, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_disable_saved_tensors_hooks_prev_disabled_nested, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_generic_context_manager_CustomizedCtxManager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_generic_context_manager_customized_ctx_manager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_generic_context_manager_with_graph_break_CustomizedCtxManager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_generic_context_manager_with_graph_break_customized_ctx_manager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_generic_ctx_manager_with_graph_break_CustomizedCtxManagerWithGraphBreak, 
test/dynamo/test_ctx_manager.py::CtxManagerTests::test_generic_ctx_manager_with_graph_break_customized_ctx_manager_with_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_grad_mode_guard, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_graph_break_inlining_autocast, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_graph_break_inlining_grad, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_inactive_context_graph_break_local, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_inactive_context_graph_break_local_nullctx, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_inactive_context_graph_break_local_nullctx2, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_inactive_context_graph_break_stack, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_inactive_context_graph_break_stack2, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_is_autocast_cpu_enabled, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_nested_generic_context_manager_CustomizedCtxManager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_nested_generic_context_manager_customized_ctx_manager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_nested_generic_context_manager_with_graph_break_CustomizedCtxManager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_nested_generic_context_manager_with_graph_break_customized_ctx_manager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_nested_grad_mode_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_no_grad, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_return_context_manager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_return_context_manager_with_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_sdpa_kernel_ctx_manager1, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_sdpa_kernel_ctx_manager2, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_sdpa_kernel_ctx_manager3, 
test/dynamo/test_ctx_manager.py::CtxManagerTests::test_sdpa_kernel_ctx_manager_as_decorator, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_sdpa_kernel_ctx_manager_kwargs, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_sdpa_kernel_ctx_manager_set_priority, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_store_attr_graph_break_key_error, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_torch_profiler, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_torch_profiler_use_after_with_block, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_WITH_EXCEPT_START, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_advanced_contextmanager_as_argument, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_advanced_contextmanager_as_argument_error, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_change_parent_0, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_change_parent_1, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_change_parent_global_0, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_change_parent_global_1, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_change_parent_nonlocal_0, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_change_parent_nonlocal_1, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_contextlib_nullcontext, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_contextlib_suppress_name_stderr, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_contextlib_suppress_name_stdout, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_contextlib_suppress_name_suppress, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_contextmanager_as_argument, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_contextmanager_as_argument_only___enter__, 
test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_contextmanager_as_argument_only___exit__, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_ctx_basic0, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_ctx_basic1, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_disable___enter__, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_disable___exit__, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_disable_ctx_manager, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_disable_trace_contextmanager, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_dynamo_disable_ctx, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_globals_change_in_other_file, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_after___enter__, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_and_disable___enter__, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_before___enter__, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_before___enter___and_disable___exit__, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_before_and_after___enter__, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_in_finally, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_inside___enter__, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_inside_ctx, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_inside_ctx_1, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_inside_ctx_2, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_graph_break_inside_ctx_with_side_effects, 
test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_return_advanced_contextmanager, test/dynamo/test_ctx_manager.py::ContextlibContextManagerTests::test_return_new_contextmanager 2025-12-04T10:21:16.0483719Z 2025-12-04T10:21:16.0483832Z Finished dynamo/test_ctx_manager 1/1 ... [2025-12-04 10:21:16.044619][2193138.50560658], took 0.33min 2025-12-04T10:21:16.0484221Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:21:16.0503590Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:21:16.0505455Z Running dynamo/test_exc 1/1 ... [2025-12-04 10:21:16.050450][2193138.511439283] 2025-12-04T10:21:16.0505637Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:21:16.0507420Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_exc.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:21:16.050647] 2025-12-04T10:21:20.2875291Z 2025-12-04T10:21:20.2876849Z dynamo/test_exc 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_exc_1.1_cf6ff2eb17412fa5_.log 2025-12-04T10:21:20.2912516Z Running 10 items in this shard: test/dynamo/test_exc.py::ExcTests::test_backend_suppress_line, test/dynamo/test_exc.py::ExcTests::test_graph_break_log, test/dynamo/test_exc.py::ExcTests::test_graph_break_log_generic_jump, test/dynamo/test_exc.py::ExcTests::test_internal_error_no_suppress, test/dynamo/test_exc.py::ExcTests::test_internal_error_suppress_errors, test/dynamo/test_exc.py::ExcTests::test_not_implemented_error, test/dynamo/test_exc.py::ExcTests::test_trigger_bisect_on_error, test/dynamo/test_exc.py::ExcTests::test_trigger_on_error, test/dynamo/test_exc.py::ExcTests::test_unsupported_error, test/dynamo/test_exc.py::ExcTests::test_unsupported_real_stack 2025-12-04T10:21:20.2913778Z 2025-12-04T10:21:20.2913894Z Finished dynamo/test_exc 1/1 ... [2025-12-04 10:21:20.287260][2193142.748246746], took 0.07min 2025-12-04T10:21:20.2914290Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:21:20.2932673Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:21:20.2939141Z Running dynamo/test_misc 1/1 ... [2025-12-04 10:21:20.293403][2193142.754389753] 2025-12-04T10:21:20.2940297Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:21:20.2940809Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_misc.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:21:20.293653] 2025-12-04T10:23:15.0985628Z 2025-12-04T10:23:15.0986736Z PRINTING LOG FILE of dynamo/test_misc 1/1 (test/test-reports/dynamo.test_misc_1.1_37cc06447c4694c0_.log) 2025-12-04T10:23:15.0988481Z Test results will be stored in test-reports/python-pytest/dynamo.test_misc/dynamo.test_misc-f29e33abba5bdd90.xml 2025-12-04T10:23:15.0989721Z ============================= test session starts ============================== 2025-12-04T10:23:15.0990075Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:23:15.0990388Z cachedir: .pytest_cache 2025-12-04T10:23:15.0990625Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:23:15.0990884Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:23:15.0991011Z configfile: pytest.ini 2025-12-04T10:23:15.0991938Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:23:15.0992207Z collecting ... 
collected 664 items 2025-12-04T10:23:15.0992358Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T10:23:15.1051255Z Running 664 items in this shard: test/dynamo/test_misc.py::MiscTests::test_312_binary_slice_with_graph_break1, test/dynamo/test_misc.py::MiscTests::test_312_binary_slice_with_graph_break2, test/dynamo/test_misc.py::MiscTests::test_RAISE_VARARGS_0, test/dynamo/test_misc.py::MiscTests::test_T_tensor_attribute, test/dynamo/test_misc.py::MiscTests::test_add_sizes, test/dynamo/test_misc.py::MiscTests::test_add_to_set, test/dynamo/test_misc.py::MiscTests::test_anomaly_aot_autograd, test/dynamo/test_misc.py::MiscTests::test_any_all_symnode, test/dynamo/test_misc.py::MiscTests::test_aot_autograd_propagate_unbacked_symints_shape, test/dynamo/test_misc.py::MiscTests::test_arange_length_with_float32_dtype, test/dynamo/test_misc.py::MiscTests::test_argwhere_with_dynamic_shapes, test/dynamo/test_misc.py::MiscTests::test_assert, test/dynamo/test_misc.py::MiscTests::test_assert_size_stride, test/dynamo/test_misc.py::MiscTests::test_assigning_function_to_class_attribute, test/dynamo/test_misc.py::MiscTests::test_assigning_function_to_object_attribute, test/dynamo/test_misc.py::MiscTests::test_assume_32_bit_indexing, test/dynamo/test_misc.py::MiscTests::test_backend_match_guard, test/dynamo/test_misc.py::MiscTests::test_backend_match_guard_multi_threads, test/dynamo/test_misc.py::MiscTests::test_backward_deterministic_mode_mismatch_warning, test/dynamo/test_misc.py::MiscTests::test_boolarg, test/dynamo/test_misc.py::MiscTests::test_bound_shape_checks, test/dynamo/test_misc.py::MiscTests::test_build_tuple_unpack, test/dynamo/test_misc.py::MiscTests::test_builder_for_class_with_metaclass, test/dynamo/test_misc.py::MiscTests::test_builtin_abs, test/dynamo/test_misc.py::MiscTests::test_builtin_bool_on_symbool, test/dynamo/test_misc.py::MiscTests::test_builtin_bool_on_symfloat, test/dynamo/test_misc.py::MiscTests::test_builtin_bool_on_symint, 
test/dynamo/test_misc.py::MiscTests::test_builtin_complex, test/dynamo/test_misc.py::MiscTests::test_builtin_complex_args, test/dynamo/test_misc.py::MiscTests::test_builtin_isinstance, test/dynamo/test_misc.py::MiscTests::test_builtin_str_on_user_defined_function, test/dynamo/test_misc.py::MiscTests::test_builtin_subclasses_as_method_on_class_type, test/dynamo/test_misc.py::MiscTests::test_builtin_subclasses_as_method_on_var, test/dynamo/test_misc.py::MiscTests::test_call_parent_non_class_methods_from_child, test/dynamo/test_misc.py::MiscTests::test_callpacked, test/dynamo/test_misc.py::MiscTests::test_cannot_trace_mark_dynamic, test/dynamo/test_misc.py::MiscTests::test_cannot_trace_mark_dynamic_safe_unreached, test/dynamo/test_misc.py::MiscTests::test_cast, test/dynamo/test_misc.py::MiscTests::test_cat_unbacked, test/dynamo/test_misc.py::MiscTests::test_catch_watchings1, test/dynamo/test_misc.py::MiscTests::test_catch_watchings2, test/dynamo/test_misc.py::MiscTests::test_cell_captured_by_existing_func_but_not_root_frame, test/dynamo/test_misc.py::MiscTests::test_cell_output1, test/dynamo/test_misc.py::MiscTests::test_cell_output2, test/dynamo/test_misc.py::MiscTests::test_check_assert_error_at_runtime_when_predicate_false_and_message_has_closure, test/dynamo/test_misc.py::MiscTests::test_check_assert_error_at_runtime_when_predicate_true_and_message_has_closure, test/dynamo/test_misc.py::MiscTests::test_check_compiles_when_predicate_true_and_message_None, test/dynamo/test_misc.py::MiscTests::test_check_compiles_when_predicate_true_and_message_has_global, test/dynamo/test_misc.py::MiscTests::test_check_compiles_when_predicate_true_and_message_has_no_closure, test/dynamo/test_misc.py::MiscTests::test_check_compiles_when_predicate_true_constant_and_message_None, test/dynamo/test_misc.py::MiscTests::test_check_compiles_when_predicate_true_constant_and_message_has_no_closure, 
test/dynamo/test_misc.py::MiscTests::test_check_raises_at_runtime_when_predicate_false_and_message_None, test/dynamo/test_misc.py::MiscTests::test_check_raises_at_runtime_when_predicate_false_and_message_has_global, test/dynamo/test_misc.py::MiscTests::test_check_raises_at_runtime_when_predicate_false_and_message_has_no_closure, test/dynamo/test_misc.py::MiscTests::test_check_raises_at_runtime_when_predicate_false_constant_and_message_None, test/dynamo/test_misc.py::MiscTests::test_check_raises_at_runtime_when_predicate_false_constant_and_message_has_no_closure, test/dynamo/test_misc.py::MiscTests::test_check_simplification, test/dynamo/test_misc.py::MiscTests::test_class_binop, test/dynamo/test_misc.py::MiscTests::test_class_duner_flags, test/dynamo/test_misc.py::MiscTests::test_class_duner_mro, test/dynamo/test_misc.py::MiscTests::test_class_has_instancecheck_method, test/dynamo/test_misc.py::MiscTests::test_clone_sparse_input, test/dynamo/test_misc.py::MiscTests::test_closure_out_of_scope_cell, test/dynamo/test_misc.py::MiscTests::test_closure_out_of_scope_cell_with_cond, test/dynamo/test_misc.py::MiscTests::test_closure_out_of_scope_cell_with_mutation, test/dynamo/test_misc.py::MiscTests::test_closure_recompiles, test/dynamo/test_misc.py::MiscTests::test_closure_with_mutation_and_graph_break, test/dynamo/test_misc.py::MiscTests::test_closure_write_across_functions, test/dynamo/test_misc.py::MiscTests::test_compare_shapes_eq, test/dynamo/test_misc.py::MiscTests::test_compare_shapes_neq, test/dynamo/test_misc.py::MiscTests::test_compare_shapes_tuple_eq, test/dynamo/test_misc.py::MiscTests::test_compare_shapes_tuple_neq, test/dynamo/test_misc.py::MiscTests::test_compare_shapes_with_constant, test/dynamo/test_misc.py::MiscTests::test_compare_tensor_with_none, test/dynamo/test_misc.py::MiscTests::test_compilation_metrics_size_limit, test/dynamo/test_misc.py::MiscTests::test_compiled_class_graph_break, test/dynamo/test_misc.py::MiscTests::test_cond, 
test/dynamo/test_misc.py::MiscTests::test_cond_export, test/dynamo/test_misc.py::MiscTests::test_cond_export_single_arg, test/dynamo/test_misc.py::MiscTests::test_cond_nested, test/dynamo/test_misc.py::MiscTests::test_cond_runtime_assert_generation, test/dynamo/test_misc.py::MiscTests::test_cond_side_effects, test/dynamo/test_misc.py::MiscTests::test_cond_with_quantization, test/dynamo/test_misc.py::MiscTests::test_conditional_list_comp_in_context, test/dynamo/test_misc.py::MiscTests::test_config_getattr_default, test/dynamo/test_misc.py::MiscTests::test_config_obj, test/dynamo/test_misc.py::MiscTests::test_const_dict_variable_python_type, test/dynamo/test_misc.py::MiscTests::test_constant_getattr, test/dynamo/test_misc.py::MiscTests::test_constant_hasattr_returns_bool, test/dynamo/test_misc.py::MiscTests::test_cross_entropy_loss_fancy_ctor1, test/dynamo/test_misc.py::MiscTests::test_cross_entropy_loss_fancy_ctor2, test/dynamo/test_misc.py::MiscTests::test_cross_entropy_loss_simple_ctor, test/dynamo/test_misc.py::MiscTests::test_custom_dict, test/dynamo/test_misc.py::MiscTests::test_custom_module_free, test/dynamo/test_misc.py::MiscTests::test_data_access_in_inference_mode, test/dynamo/test_misc.py::MiscTests::test_data_ptr_graph_break_aten, test/dynamo/test_misc.py::MiscTests::test_data_ptr_graph_break_builtin, test/dynamo/test_misc.py::MiscTests::test_dataclass, test/dynamo/test_misc.py::MiscTests::test_dataclass_fields, test/dynamo/test_misc.py::MiscTests::test_dataclass_local_hasattr, test/dynamo/test_misc.py::MiscTests::test_default_args_device_dtype, test/dynamo/test_misc.py::MiscTests::test_default_dtype_change, test/dynamo/test_misc.py::MiscTests::test_defaultdict, test/dynamo/test_misc.py::MiscTests::test_deque_append_left, test/dynamo/test_misc.py::MiscTests::test_deque_input, test/dynamo/test_misc.py::MiscTests::test_derpy_nn_module_usage, test/dynamo/test_misc.py::MiscTests::test_descriptor, 
test/dynamo/test_misc.py::MiscTests::test_descriptor_side_effect, test/dynamo/test_misc.py::MiscTests::test_deterministic_algorithms_mutated, test/dynamo/test_misc.py::MiscTests::test_dictcomp, test/dynamo/test_misc.py::MiscTests::test_dim_order, test/dynamo/test_misc.py::MiscTests::test_disable_flag, test/dynamo/test_misc.py::MiscTests::test_dtypes_no_graphbreaks, test/dynamo/test_misc.py::MiscTests::test_dunder_methods, test/dynamo/test_misc.py::MiscTests::test_dunder_new_function_inlining, test/dynamo/test_misc.py::MiscTests::test_dunder_new_function_inlining1, test/dynamo/test_misc.py::MiscTests::test_dunder_new_function_inlining2, test/dynamo/test_misc.py::MiscTests::test_dunder_new_function_inlining3, test/dynamo/test_misc.py::MiscTests::test_dunder_new_function_inlining4, test/dynamo/test_misc.py::MiscTests::test_dunder_weakref, test/dynamo/test_misc.py::MiscTests::test_duplicate_graph_break_log, test/dynamo/test_misc.py::MiscTests::test_dynamic_one_hot, test/dynamo/test_misc.py::MiscTests::test_dynamic_shapes_as_strided, test/dynamo/test_misc.py::MiscTests::test_dynamic_sources_dynamic_override, test/dynamo/test_misc.py::MiscTests::test_dynamic_sources_dynamic_override_regex, test/dynamo/test_misc.py::MiscTests::test_dynamic_sources_force_parameter_static_shapes_and_property_static_shapes_override, test/dynamo/test_misc.py::MiscTests::test_dynamic_sources_graph_break, test/dynamo/test_misc.py::MiscTests::test_dynamic_sources_int, test/dynamo/test_misc.py::MiscTests::test_dynamic_sources_precedence_over_int_specialization, test/dynamo/test_misc.py::MiscTests::test_dynamic_sources_tensor, test/dynamo/test_misc.py::MiscTests::test_dynamo_cache_invalidate, test/dynamo/test_misc.py::MiscTests::test_dynamo_cache_move_to_front, test/dynamo/test_misc.py::MiscTests::test_dynamo_compiling_fake_tensor_to_vararg_int, test/dynamo/test_misc.py::MiscTests::test_dynamo_disabled_in_custom_op_kernels, test/dynamo/test_misc.py::MiscTests::test_dynamo_inside_custom_op, 
test/dynamo/test_misc.py::MiscTests::test_dynamo_min_operator_with_shape, test/dynamo/test_misc.py::MiscTests::test_dynamo_reset_clears_cache, test/dynamo/test_misc.py::MiscTests::test_empty_list, test/dynamo/test_misc.py::MiscTests::test_enum_as_dict_key, test/dynamo/test_misc.py::MiscTests::test_enum_as_dict_key_with_overloaded_str, test/dynamo/test_misc.py::MiscTests::test_enum_guards, test/dynamo/test_misc.py::MiscTests::test_enum_method, test/dynamo/test_misc.py::MiscTests::test_enum_no_graphbreaks, test/dynamo/test_misc.py::MiscTests::test_enum_subclass, test/dynamo/test_misc.py::MiscTests::test_error_on_nested_fx_trace, test/dynamo/test_misc.py::MiscTests::test_error_on_recompile, test/dynamo/test_misc.py::MiscTests::test_escaping_closure_var_with_backward_hook, test/dynamo/test_misc.py::MiscTests::test_escaping_closure_var_with_nonlocal_var, test/dynamo/test_misc.py::MiscTests::test_existing_func_that_creates_capturing_nested_func, test/dynamo/test_misc.py::MiscTests::test_fail_on_recompile_error_message, test/dynamo/test_misc.py::MiscTests::test_flat_name_to_original_fqn, test/dynamo/test_misc.py::MiscTests::test_float_speculation_log_divergence, test/dynamo/test_misc.py::MiscTests::test_fn_hasattr__name__1, test/dynamo/test_misc.py::MiscTests::test_fn_hasattr__name__2, test/dynamo/test_misc.py::MiscTests::test_fn_hasattr__name__3, test/dynamo/test_misc.py::MiscTests::test_fold, test/dynamo/test_misc.py::MiscTests::test_free_var_and_local_name_collision, test/dynamo/test_misc.py::MiscTests::test_frozen_dataclass_attr_access, test/dynamo/test_misc.py::MiscTests::test_frozen_dataclass_default_factory, test/dynamo/test_misc.py::MiscTests::test_frozen_dataclass_default_value, test/dynamo/test_misc.py::MiscTests::test_frozen_dataclass_hashable, test/dynamo/test_misc.py::MiscTests::test_frozen_dataclass_kw_only, test/dynamo/test_misc.py::MiscTests::test_frozen_dict, test/dynamo/test_misc.py::MiscTests::test_frozenset_of_non_literals, 
test/dynamo/test_misc.py::MiscTests::test_frozenset_torch_func_contains, test/dynamo/test_misc.py::MiscTests::test_fullgraph_capture, test/dynamo/test_misc.py::MiscTests::test_funcname_cache, test/dynamo/test_misc.py::MiscTests::test_function_annotation, test/dynamo/test_misc.py::MiscTests::test_function_generic_alias_annotation, test/dynamo/test_misc.py::MiscTests::test_generate_tensor_from_list_of_numpy_primitive_type, test/dynamo/test_misc.py::MiscTests::test_generate_trivial_abstract_impl, test/dynamo/test_misc.py::MiscTests::test_get_attr_function, test/dynamo/test_misc.py::MiscTests::test_get_cache_entry, test/dynamo/test_misc.py::MiscTests::test_get_custom_tensor_attribute, test/dynamo/test_misc.py::MiscTests::test_get_instruction_source_311, test/dynamo/test_misc.py::MiscTests::test_getattr_dict, test/dynamo/test_misc.py::MiscTests::test_getattrvariable_as_python_constant, test/dynamo/test_misc.py::MiscTests::test_getset_descriptor, test/dynamo/test_misc.py::MiscTests::test_global_state_guard_serialization, test/dynamo/test_misc.py::MiscTests::test_grad, test/dynamo/test_misc.py::MiscTests::test_grad_non_none, test/dynamo/test_misc.py::MiscTests::test_grad_none, test/dynamo/test_misc.py::MiscTests::test_grad_state_mutated, test/dynamo/test_misc.py::MiscTests::test_graph_break_compilation_metrics, test/dynamo/test_misc.py::MiscTests::test_graph_break_compilation_metrics_on_failure, test/dynamo/test_misc.py::MiscTests::test_graph_break_correctly_when_passing_numpy_ndarray_to_torch_function, test/dynamo/test_misc.py::MiscTests::test_guard_failure_fn, test/dynamo/test_misc.py::MiscTests::test_guard_failure_fn2, test/dynamo/test_misc.py::MiscTests::test_guard_failure_fn_shape_control, test/dynamo/test_misc.py::MiscTests::test_guard_failure_fn_tensor_iter, test/dynamo/test_misc.py::MiscTests::test_guard_filter_fn_by_id, test/dynamo/test_misc.py::MiscTests::test_guard_filter_fn_by_is_global, 
test/dynamo/test_misc.py::MiscTests::test_guard_filter_fn_by_name_and_value, test/dynamo/test_misc.py::MiscTests::test_guard_filter_globals, test/dynamo/test_misc.py::MiscTests::test_guard_filter_inbuilt_nn_modules, test/dynamo/test_misc.py::MiscTests::test_guard_filter_nn_modules, test/dynamo/test_misc.py::MiscTests::test_guard_filter_tensors, test/dynamo/test_misc.py::MiscTests::test_guard_function_builder_with_cse, test/dynamo/test_misc.py::MiscTests::test_guard_size_oblivious_backed, test/dynamo/test_misc.py::MiscTests::test_guard_string_escaped, test/dynamo/test_misc.py::MiscTests::test_guard_sym_node_fstring_when_used, test/dynamo/test_misc.py::MiscTests::test_guards_cse_pass_multiple, test/dynamo/test_misc.py::MiscTests::test_guards_cse_pass_single, test/dynamo/test_misc.py::MiscTests::test_guards_strip_function_call, test/dynamo/test_misc.py::MiscTests::test_hasattr_nn_module_guard, test/dynamo/test_misc.py::MiscTests::test_hash_getitem_slice, test/dynamo/test_misc.py::MiscTests::test_hash_hop, test/dynamo/test_misc.py::MiscTests::test_id_guarded_class, test/dynamo/test_misc.py::MiscTests::test_id_guarded_module, test/dynamo/test_misc.py::MiscTests::test_id_guarded_object, test/dynamo/test_misc.py::MiscTests::test_id_of_nn_module, test/dynamo/test_misc.py::MiscTests::test_id_tensor, test/dynamo/test_misc.py::MiscTests::test_if_cond_nn_mod1, test/dynamo/test_misc.py::MiscTests::test_if_cond_nn_mod2, test/dynamo/test_misc.py::MiscTests::test_if_cond_nn_mod3, test/dynamo/test_misc.py::MiscTests::test_if_cond_user_defined_object, test/dynamo/test_misc.py::MiscTests::test_if_cond_user_defined_object2, test/dynamo/test_misc.py::MiscTests::test_if_cond_user_defined_object3, test/dynamo/test_misc.py::MiscTests::test_infer_unbacked_size_gt_zero, test/dynamo/test_misc.py::MiscTests::test_inference_mode, test/dynamo/test_misc.py::MiscTests::test_inference_mode_param, test/dynamo/test_misc.py::MiscTests::test_inline_closure_not_loaded_by_parent, 
test/dynamo/test_misc.py::MiscTests::test_inline_closure_returned_by_another_function_and_captures, test/dynamo/test_misc.py::MiscTests::test_inline_dict_function, test/dynamo/test_misc.py::MiscTests::test_inline_dict_function_passed_as_arg, test/dynamo/test_misc.py::MiscTests::test_inline_dict_mutation, test/dynamo/test_misc.py::MiscTests::test_inline_func_jump_on_tensor_condition, test/dynamo/test_misc.py::MiscTests::test_inline_list_mutation, test/dynamo/test_misc.py::MiscTests::test_inline_local_dict_clear, test/dynamo/test_misc.py::MiscTests::test_inline_module_attr_dict_clear, test/dynamo/test_misc.py::MiscTests::test_inline_user_defined_dict_attr_clear, test/dynamo/test_misc.py::MiscTests::test_inplace, test/dynamo/test_misc.py::MiscTests::test_inplace_desugaring, test/dynamo/test_misc.py::MiscTests::test_inplace_param_update, test/dynamo/test_misc.py::MiscTests::test_inplace_view_on_graph_input, test/dynamo/test_misc.py::MiscTests::test_input_cell_mutation, test/dynamo/test_misc.py::MiscTests::test_inspect_signature_bind, test/dynamo/test_misc.py::MiscTests::test_inspect_signature_bind_non_user_function, test/dynamo/test_misc.py::MiscTests::test_inspect_signature_parameters, test/dynamo/test_misc.py::MiscTests::test_int_int_comparisons, test/dynamo/test_misc.py::MiscTests::test_int_list, test/dynamo/test_misc.py::MiscTests::test_int_neg, test/dynamo/test_misc.py::MiscTests::test_int_shape_binops, test/dynamo/test_misc.py::MiscTests::test_int_shape_comparisons, test/dynamo/test_misc.py::MiscTests::test_int_shape_inplace_binops, test/dynamo/test_misc.py::MiscTests::test_intermediary_tensor_grad_access, test/dynamo/test_misc.py::MiscTests::test_invalid_args_builtin, test/dynamo/test_misc.py::MiscTests::test_is_compiling, test/dynamo/test_misc.py::MiscTests::test_is_floating_point, test/dynamo/test_misc.py::MiscTests::test_is_floating_point2, test/dynamo/test_misc.py::MiscTests::test_is_tensor, test/dynamo/test_misc.py::MiscTests::test_is_tensor2, 
test/dynamo/test_misc.py::MiscTests::test_is_tensor_like, test/dynamo/test_misc.py::MiscTests::test_is_tensor_like2, test/dynamo/test_misc.py::MiscTests::test_item, test/dynamo/test_misc.py::MiscTests::test_item_changes, test/dynamo/test_misc.py::MiscTests::test_item_changes_new_shape, test/dynamo/test_misc.py::MiscTests::test_iter_set, test/dynamo/test_misc.py::MiscTests::test_iter_type, test/dynamo/test_misc.py::MiscTests::test_iterator_limit, test/dynamo/test_misc.py::MiscTests::test_itertools_accumulate_symint_default_sum, test/dynamo/test_misc.py::MiscTests::test_itertools_accumulate_tensors_builtins, test/dynamo/test_misc.py::MiscTests::test_itertools_accumulate_tensors_default_sum, test/dynamo/test_misc.py::MiscTests::test_itertools_accumulate_tensors_kwargs, test/dynamo/test_misc.py::MiscTests::test_itertools_accumulate_tensors_user_defined, test/dynamo/test_misc.py::MiscTests::test_itertools_groupby_pure_python_default_identify_func, test/dynamo/test_misc.py::MiscTests::test_itertools_groupby_pure_python_key_func, test/dynamo/test_misc.py::MiscTests::test_itertools_infinite_count, test/dynamo/test_misc.py::MiscTests::test_itertools_infinite_cycle, test/dynamo/test_misc.py::MiscTests::test_itertools_infinite_repeat, test/dynamo/test_misc.py::MiscTests::test_itertools_infinite_repeat_mutation, test/dynamo/test_misc.py::MiscTests::test_itertools_islice, test/dynamo/test_misc.py::MiscTests::test_itertools_islice_default_end, test/dynamo/test_misc.py::MiscTests::test_itertools_islice_default_step, test/dynamo/test_misc.py::MiscTests::test_itertools_repeat, test/dynamo/test_misc.py::MiscTests::test_itertools_tee, test/dynamo/test_misc.py::MiscTests::test_jacfwd_one_hot_dynamic_compile, test/dynamo/test_misc.py::MiscTests::test_large_reduction_list, test/dynamo/test_misc.py::MiscTests::test_linear_module_free, test/dynamo/test_misc.py::MiscTests::test_list_append_return_none, test/dynamo/test_misc.py::MiscTests::test_list_class, 
test/dynamo/test_misc.py::MiscTests::test_list_hasattr1, test/dynamo/test_misc.py::MiscTests::test_list_hasattr2, test/dynamo/test_misc.py::MiscTests::test_list_iadd_side_effect, test/dynamo/test_misc.py::MiscTests::test_list_iadd_with_shape, test/dynamo/test_misc.py::MiscTests::test_list_iterator_contains, test/dynamo/test_misc.py::MiscTests::test_list_mul, test/dynamo/test_misc.py::MiscTests::test_list_slice_mul, test/dynamo/test_misc.py::MiscTests::test_listcomp, test/dynamo/test_misc.py::MiscTests::test_load_fast_and_clear_graph_break, test/dynamo/test_misc.py::MiscTests::test_mandelbrot_numpy, test/dynamo/test_misc.py::MiscTests::test_map_side_effects, test/dynamo/test_misc.py::MiscTests::test_map_with_quantization, test/dynamo/test_misc.py::MiscTests::test_mark_dynamic_with_ranges, test/dynamo/test_misc.py::MiscTests::test_mark_static, test/dynamo/test_misc.py::MiscTests::test_mark_unbacked_strict, test/dynamo/test_misc.py::MiscTests::test_matmul1, test/dynamo/test_misc.py::MiscTests::test_min_max_over_iterable, test/dynamo/test_misc.py::MiscTests::test_module_complex_iter, test/dynamo/test_misc.py::MiscTests::test_module_deepcopy, test/dynamo/test_misc.py::MiscTests::test_module_not_callable, test/dynamo/test_misc.py::MiscTests::test_mro_type_tensor_no_source, test/dynamo/test_misc.py::MiscTests::test_multiple_inheritance, test/dynamo/test_misc.py::MiscTests::test_mutable_mapping_multiple_inheritance, test/dynamo/test_misc.py::MiscTests::test_named_parameters, test/dynamo/test_misc.py::MiscTests::test_namedtuple1, test/dynamo/test_misc.py::MiscTests::test_namedtuple2, test/dynamo/test_misc.py::MiscTests::test_namedtuple3, test/dynamo/test_misc.py::MiscTests::test_namedtuple_class, test/dynamo/test_misc.py::MiscTests::test_namedtuple_source_dynamic_attributes, test/dynamo/test_misc.py::MiscTests::test_namedtuple_sourceless_dynamic_attributes, test/dynamo/test_misc.py::MiscTests::test_namedtuple_with_custom_getitem, 
test/dynamo/test_misc.py::MiscTests::test_nan, test/dynamo/test_misc.py::MiscTests::test_ne_operator_with_custom_eq, test/dynamo/test_misc.py::MiscTests::test_ne_operator_with_custom_graphbreak_eq, test/dynamo/test_misc.py::MiscTests::test_ne_operator_with_custom_ne, test/dynamo/test_misc.py::MiscTests::test_nested_closure, test/dynamo/test_misc.py::MiscTests::test_nested_closure_mutation, test/dynamo/test_misc.py::MiscTests::test_nested_dataclass_reconstruct, test/dynamo/test_misc.py::MiscTests::test_nested_frozen_dataclass_hashable, test/dynamo/test_misc.py::MiscTests::test_nested_function_resuming_with_correct_globals, test/dynamo/test_misc.py::MiscTests::test_nested_optimize, test/dynamo/test_misc.py::MiscTests::test_nested_optimize_decorator, test/dynamo/test_misc.py::MiscTests::test_nested_optimize_run, test/dynamo/test_misc.py::MiscTests::test_nested_sequential_try, test/dynamo/test_misc.py::MiscTests::test_nested_sequential_try_with, test/dynamo/test_misc.py::MiscTests::test_nested_sequential_try_with_graph_break, test/dynamo/test_misc.py::MiscTests::test_nested_sequential_with, test/dynamo/test_misc.py::MiscTests::test_nested_wraps, test/dynamo/test_misc.py::MiscTests::test_nesteduserfunction_setattr, test/dynamo/test_misc.py::MiscTests::test_new_with_int_list, test/dynamo/test_misc.py::MiscTests::test_newly_constructed_tensor_attr_mutation, test/dynamo/test_misc.py::MiscTests::test_nn_functional_reduction, test/dynamo/test_misc.py::MiscTests::test_nn_module_getattr, test/dynamo/test_misc.py::MiscTests::test_nn_module_getattribute, test/dynamo/test_misc.py::MiscTests::test_nn_sequential_invocation, test/dynamo/test_misc.py::MiscTests::test_nn_sequential_invocation_reposition_indices, test/dynamo/test_misc.py::MiscTests::test_no_error_on_nested_fx_trace, test/dynamo/test_misc.py::MiscTests::test_no_guard_for_unused_sym_node_fstring, test/dynamo/test_misc.py::MiscTests::test_no_raise_guard_partial_constraint, 
test/dynamo/test_misc.py::MiscTests::test_no_raise_guard_partial_constraint_across_break, test/dynamo/test_misc.py::MiscTests::test_non_pt2_compliant_ops_graph_break, test/dynamo/test_misc.py::MiscTests::test_not_dynamic_scope, test/dynamo/test_misc.py::MiscTests::test_numel, test/dynamo/test_misc.py::MiscTests::test_numpy_array_of_arrays, test/dynamo/test_misc.py::MiscTests::test_numpy_as_global, test/dynamo/test_misc.py::MiscTests::test_numpy_fallback_on_eager, test/dynamo/test_misc.py::MiscTests::test_numpy_force, test/dynamo/test_misc.py::MiscTests::test_numpy_gt, test/dynamo/test_misc.py::MiscTests::test_numpy_int_constant, test/dynamo/test_misc.py::MiscTests::test_numpy_iter, test/dynamo/test_misc.py::MiscTests::test_numpy_min, test/dynamo/test_misc.py::MiscTests::test_numpy_ndarray_graph_break, test/dynamo/test_misc.py::MiscTests::test_numpy_ndarray_graph_break_with_multiple_outputs, test/dynamo/test_misc.py::MiscTests::test_numpy_ndarray_works_with_builtin_function, test/dynamo/test_misc.py::MiscTests::test_numpy_no_raise, test/dynamo/test_misc.py::MiscTests::test_numpy_non_torch_dtype, test/dynamo/test_misc.py::MiscTests::test_numpy_random_config_to_numpy, test/dynamo/test_misc.py::MiscTests::test_numpy_readonly, test/dynamo/test_misc.py::MiscTests::test_numpy_recompilation_scalar, test/dynamo/test_misc.py::MiscTests::test_numpy_size_attr, test/dynamo/test_misc.py::MiscTests::test_numpy_subdtype, test/dynamo/test_misc.py::MiscTests::test_numpy_take_along_axis, test/dynamo/test_misc.py::MiscTests::test_numpy_tolist, test/dynamo/test_misc.py::MiscTests::test_numpy_torch_operators, test/dynamo/test_misc.py::MiscTests::test_numpy_ufunc_out, test/dynamo/test_misc.py::MiscTests::test_numpy_ufunc_out_graph_break, test/dynamo/test_misc.py::MiscTests::test_numpy_unique_f16, test/dynamo/test_misc.py::MiscTests::test_numpy_variable_isinstance, test/dynamo/test_misc.py::MiscTests::test_numpy_with_builtin_type, 
test/dynamo/test_misc.py::MiscTests::test_object_classmethod, test/dynamo/test_misc.py::MiscTests::test_object_setattr, test/dynamo/test_misc.py::MiscTests::test_object_staticmethod, test/dynamo/test_misc.py::MiscTests::test_onnx_shape_as_tensor, test/dynamo/test_misc.py::MiscTests::test_optimize_on_module, test/dynamo/test_misc.py::MiscTests::test_ordered_dict_alias_reconstruct, test/dynamo/test_misc.py::MiscTests::test_ordered_dict_move_to_end, test/dynamo/test_misc.py::MiscTests::test_os_environ_get, test/dynamo/test_misc.py::MiscTests::test_os_environ_set_graph_break, test/dynamo/test_misc.py::MiscTests::test_out_variant_custom_op, test/dynamo/test_misc.py::MiscTests::test_out_variants_with_resizing_on_graph_inputs, test/dynamo/test_misc.py::MiscTests::test_out_variants_with_resizing_on_graph_inputs_with_dynamic, test/dynamo/test_misc.py::MiscTests::test_out_variants_with_resizing_on_graph_inputs_with_dynamic1, test/dynamo/test_misc.py::MiscTests::test_outside_linear_module_free, test/dynamo/test_misc.py::MiscTests::test_overridden_getattribute, test/dynamo/test_misc.py::MiscTests::test_packaging_version_parse, test/dynamo/test_misc.py::MiscTests::test_pair, test/dynamo/test_misc.py::MiscTests::test_param_shape_binops, test/dynamo/test_misc.py::MiscTests::test_parameter_free, test/dynamo/test_misc.py::MiscTests::test_patched_builtin_functions, test/dynamo/test_misc.py::MiscTests::test_pep0479_convert_stopiteration, test/dynamo/test_misc.py::MiscTests::test_precompile_entries, test/dynamo/test_misc.py::MiscTests::test_precompile_entry_hit, test/dynamo/test_misc.py::MiscTests::test_precompile_entry_miss, test/dynamo/test_misc.py::MiscTests::test_precompile_fail_on_recompile, test/dynamo/test_misc.py::MiscTests::test_proxy_frozen_dataclass, test/dynamo/test_misc.py::MiscTests::test_pt2_compliant_ops_are_allowed, test/dynamo/test_misc.py::MiscTests::test_pt2_compliant_overload, test/dynamo/test_misc.py::MiscTests::test_pure_python_accumulate, 
test/dynamo/test_misc.py::MiscTests::test_py_guards_mark_dynamic, test/dynamo/test_misc.py::MiscTests::test_python_slice, test/dynamo/test_misc.py::MiscTests::test_raise_guard_full_constraint, test/dynamo/test_misc.py::MiscTests::test_raise_guard_indirect_full_constraint, test/dynamo/test_misc.py::MiscTests::test_raise_guard_partial_constraint_across_break, test/dynamo/test_misc.py::MiscTests::test_raise_guard_partial_constraint_no_graph_break, test/dynamo/test_misc.py::MiscTests::test_raise_on_backend_error, test/dynamo/test_misc.py::MiscTests::test_raises, test/dynamo/test_misc.py::MiscTests::test_raises_importerror1, test/dynamo/test_misc.py::MiscTests::test_raises_importerror2, test/dynamo/test_misc.py::MiscTests::test_range___iter__, test/dynamo/test_misc.py::MiscTests::test_range_input, test/dynamo/test_misc.py::MiscTests::test_range_iter_guards, test/dynamo/test_misc.py::MiscTests::test_range_iter_side_effects, test/dynamo/test_misc.py::MiscTests::test_range_with_shape, test/dynamo/test_misc.py::MiscTests::test_real_imag_tensor_attribute, test/dynamo/test_misc.py::MiscTests::test_recompile_message_on_parameter, test/dynamo/test_misc.py::MiscTests::test_recompile_on_disable_1, test/dynamo/test_misc.py::MiscTests::test_recompile_on_disable_2, test/dynamo/test_misc.py::MiscTests::test_recompile_on_global_state_change, test/dynamo/test_misc.py::MiscTests::test_reconstruct_frozen_dataclass, test/dynamo/test_misc.py::MiscTests::test_reconstruct_set_across_graph_break, test/dynamo/test_misc.py::MiscTests::test_recursion_depth_guards, test/dynamo/test_misc.py::MiscTests::test_recursive_inline_list_mutation, test/dynamo/test_misc.py::MiscTests::test_recursive_tensor_attribute, test/dynamo/test_misc.py::MiscTests::test_release_input_memory, test/dynamo/test_misc.py::MiscTests::test_release_module_memory, test/dynamo/test_misc.py::MiscTests::test_release_scope_memory, test/dynamo/test_misc.py::MiscTests::test_remove_set, 
test/dynamo/test_misc.py::MiscTests::test_repeat_interleave_graphbreaks, test/dynamo/test_misc.py::MiscTests::test_replay_side_effects_config, test/dynamo/test_misc.py::MiscTests::test_replay_side_effects_input_mut, test/dynamo/test_misc.py::MiscTests::test_replay_side_effects_model_attr, test/dynamo/test_misc.py::MiscTests::test_repr, test/dynamo/test_misc.py::MiscTests::test_repro_graph_breaks_in__get_item_by_idx, test/dynamo/test_misc.py::MiscTests::test_restore_graphstate, test/dynamo/test_misc.py::MiscTests::test_return_dict_with_graph_break_and_update, test/dynamo/test_misc.py::MiscTests::test_return_nested_function, test/dynamo/test_misc.py::MiscTests::test_returning_func_with_captured_func_and_tensor, test/dynamo/test_misc.py::MiscTests::test_returning_nested_func_with_captured_tensor, test/dynamo/test_misc.py::MiscTests::test_running_func_with_captured_func_and_tensor, test/dynamo/test_misc.py::MiscTests::test_running_nested_func_with_captured_tensor, test/dynamo/test_misc.py::MiscTests::test_runtime_assert_replacement, test/dynamo/test_misc.py::MiscTests::test_sample_input, test/dynamo/test_misc.py::MiscTests::test_scalar_device_movement, test/dynamo/test_misc.py::MiscTests::test_scalar_tensor_is_equivalent_to_int_list_argument, test/dynamo/test_misc.py::MiscTests::test_scalar_tensor_is_equivalent_to_symint_argument, test/dynamo/test_misc.py::MiscTests::test_scalar_tensor_is_equivalent_to_symint_list_argument, test/dynamo/test_misc.py::MiscTests::test_sequential_module_free, test/dynamo/test_misc.py::MiscTests::test_set_aliasing_recompiles, test/dynamo/test_misc.py::MiscTests::test_set_custom_tensor_attribute, test/dynamo/test_misc.py::MiscTests::test_set_descriptor, test/dynamo/test_misc.py::MiscTests::test_set_discard, test/dynamo/test_misc.py::MiscTests::test_set_update, test/dynamo/test_misc.py::MiscTests::test_setattr_mutation1, test/dynamo/test_misc.py::MiscTests::test_setattr_mutation2, test/dynamo/test_misc.py::MiscTests::test_setattr_mutation3, 
test/dynamo/test_misc.py::MiscTests::test_shape_and_tuple_equality, test/dynamo/test_misc.py::MiscTests::test_shape_env_equal_constructor, test/dynamo/test_misc.py::MiscTests::test_shape_env_equal_create_symbolic_sizes_strides_storage_offset, test/dynamo/test_misc.py::MiscTests::test_shape_env_equal_empty, test/dynamo/test_misc.py::MiscTests::test_shape_env_equal_evaluate_expr_divisible, test/dynamo/test_misc.py::MiscTests::test_shape_env_equal_evaluate_expr_refinement, test/dynamo/test_misc.py::MiscTests::test_shape_env_equal_evaluate_expr_replacement, test/dynamo/test_misc.py::MiscTests::test_shape_env_equal_runtime_assert, test/dynamo/test_misc.py::MiscTests::test_shape_env_equal_unbacked, test/dynamo/test_misc.py::MiscTests::test_shape_env_no_recording, test/dynamo/test_misc.py::MiscTests::test_shape_env_recorded_function_fallback, test/dynamo/test_misc.py::MiscTests::test_shape_int_comparisons, test/dynamo/test_misc.py::MiscTests::test_shape_int_inplace_binops, test/dynamo/test_misc.py::MiscTests::test_shape_type, test/dynamo/test_misc.py::MiscTests::test_shape_unpack, test/dynamo/test_misc.py::MiscTests::test_side_effects_codegen_update_mutated, test/dynamo/test_misc.py::MiscTests::test_simple_set_usage, test/dynamo/test_misc.py::MiscTests::test_size_dim, test/dynamo/test_misc.py::MiscTests::test_size_input, test/dynamo/test_misc.py::MiscTests::test_slice_input, test/dynamo/test_misc.py::MiscTests::test_source_non_input_grad_access, test/dynamo/test_misc.py::MiscTests::test_sourceless_namedtuple, test/dynamo/test_misc.py::MiscTests::test_sparse_output_inductor_should_break, test/dynamo/test_misc.py::MiscTests::test_storage_return, test/dynamo/test_misc.py::MiscTests::test_str___iter__, test/dynamo/test_misc.py::MiscTests::test_str_format_assert1, test/dynamo/test_misc.py::MiscTests::test_str_format_assert2, test/dynamo/test_misc.py::MiscTests::test_str_format_return1, test/dynamo/test_misc.py::MiscTests::test_str_format_return2, 
test/dynamo/test_misc.py::MiscTests::test_stride_dim, test/dynamo/test_misc.py::MiscTests::test_structseq1, test/dynamo/test_misc.py::MiscTests::test_structseq2, test/dynamo/test_misc.py::MiscTests::test_super_after_graph_break, test/dynamo/test_misc.py::MiscTests::test_super_calling_with_metaclass, test/dynamo/test_misc.py::MiscTests::test_sym_and_terms, test/dynamo/test_misc.py::MiscTests::test_sym_constrain_range_on_replaced_unbacked_symbol, test/dynamo/test_misc.py::MiscTests::test_symint_as_device_kwarg_multi_gpu, test/dynamo/test_misc.py::MiscTests::test_symint_as_device_kwarg_non_strict_export, test/dynamo/test_misc.py::MiscTests::test_symint_copy_into_unbacked_slice, test/dynamo/test_misc.py::MiscTests::test_symint_fold_nontrivial_product_modulo, test/dynamo/test_misc.py::MiscTests::test_sys_modules, test/dynamo/test_misc.py::MiscTests::test_tagging_tensors_mix_used_unused_structure, test/dynamo/test_misc.py::MiscTests::test_tagging_tensors_simple, test/dynamo/test_misc.py::MiscTests::test_tensor__iter__, test/dynamo/test_misc.py::MiscTests::test_tensor_build_list_unpack, test/dynamo/test_misc.py::MiscTests::test_tensor_ctor_list_of_tensor, test/dynamo/test_misc.py::MiscTests::test_tensor_data, test/dynamo/test_misc.py::MiscTests::test_tensor_dict1, test/dynamo/test_misc.py::MiscTests::test_tensor_dict2, test/dynamo/test_misc.py::MiscTests::test_tensor_dict3, test/dynamo/test_misc.py::MiscTests::test_tensor_dot_grad_no_graph_break, test/dynamo/test_misc.py::MiscTests::test_tensor_dynamic_method, test/dynamo/test_misc.py::MiscTests::test_tensor_hasattr, test/dynamo/test_misc.py::MiscTests::test_tensor_interacts_with_numpy_ndarray, test/dynamo/test_misc.py::MiscTests::test_tensor_is_contiguous, test/dynamo/test_misc.py::MiscTests::test_tensor_item_capture, test/dynamo/test_misc.py::MiscTests::test_tensor_item_no_capture, test/dynamo/test_misc.py::MiscTests::test_tensor_iter, test/dynamo/test_misc.py::MiscTests::test_tensor_layout, 
test/dynamo/test_misc.py::MiscTests::test_tensor_setattr_getset_descriptor, test/dynamo/test_misc.py::MiscTests::test_tensor_types, test/dynamo/test_misc.py::MiscTests::test_thread_local_setattr, test/dynamo/test_misc.py::MiscTests::test_tolist, test/dynamo/test_misc.py::MiscTests::test_tolist_0d, test/dynamo/test_misc.py::MiscTests::test_tolist_1d, test/dynamo/test_misc.py::MiscTests::test_tolist_float, test/dynamo/test_misc.py::MiscTests::test_tolist_kd, test/dynamo/test_misc.py::MiscTests::test_tolist_kd_dynamic, test/dynamo/test_misc.py::MiscTests::test_tolist_scalar, test/dynamo/test_misc.py::MiscTests::test_top_package_import, test/dynamo/test_misc.py::MiscTests::test_torch_check, test/dynamo/test_misc.py::MiscTests::test_torch_check_nonnegative, test/dynamo/test_misc.py::MiscTests::test_torch_check_symbolic_shape_rel, test/dynamo/test_misc.py::MiscTests::test_torch_compile_ctx_on_forward_and_training_step, test/dynamo/test_misc.py::MiscTests::test_torch_distributions_lazy_property, test/dynamo/test_misc.py::MiscTests::test_torch_dtype_python_type, test/dynamo/test_misc.py::MiscTests::test_torch_dynamo_codegen_pow, test/dynamo/test_misc.py::MiscTests::test_torch_generator_set_state, test/dynamo/test_misc.py::MiscTests::test_torch_guards_stack_frame_register_inlining, test/dynamo/test_misc.py::MiscTests::test_torch_guards_stack_frame_register_inlining_deep, test/dynamo/test_misc.py::MiscTests::test_torch_nn_parameter_isinstance, test/dynamo/test_misc.py::MiscTests::test_torch_objects_as_keys, test/dynamo/test_misc.py::MiscTests::test_torch_package_working_with_trace, test/dynamo/test_misc.py::MiscTests::test_torch_seed, test/dynamo/test_misc.py::MiscTests::test_torch_size, test/dynamo/test_misc.py::MiscTests::test_torch_size_numel, test/dynamo/test_misc.py::MiscTests::test_torch_size_numel_dynamic, test/dynamo/test_misc.py::MiscTests::test_torch_variable_hasattr, test/dynamo/test_misc.py::MiscTests::test_trace_ndarray_frame, 
test/dynamo/test_misc.py::MiscTests::test_trace_ndarray_frame_2, test/dynamo/test_misc.py::MiscTests::test_tracing_nested_py_tree_mixed_all, test/dynamo/test_misc.py::MiscTests::test_tuple_class, test/dynamo/test_misc.py::MiscTests::test_tuple_from_tuple_iter, test/dynamo/test_misc.py::MiscTests::test_tuple_hasattr, test/dynamo/test_misc.py::MiscTests::test_tuple_iadd_with_shape, test/dynamo/test_misc.py::MiscTests::test_tuple_mul, test/dynamo/test_misc.py::MiscTests::test_tuple_mul_with_shape, test/dynamo/test_misc.py::MiscTests::test_tying_union_new_syntax, test/dynamo/test_misc.py::MiscTests::test_type_copy, test/dynamo/test_misc.py::MiscTests::test_typing_dict, test/dynamo/test_misc.py::MiscTests::test_typing_typevar, test/dynamo/test_misc.py::MiscTests::test_typing_union_and_optional, test/dynamo/test_misc.py::MiscTests::test_typing_union_new_syntax_reconstruct, test/dynamo/test_misc.py::MiscTests::test_typing_variable_isinstance, test/dynamo/test_misc.py::MiscTests::test_unbacked_2d_expand, test/dynamo/test_misc.py::MiscTests::test_unbacked_empty_tensor, test/dynamo/test_misc.py::MiscTests::test_unbacked_repeat_cat, test/dynamo/test_misc.py::MiscTests::test_unbacked_sources_scalar, test/dynamo/test_misc.py::MiscTests::test_unbacked_sources_tensor, test/dynamo/test_misc.py::MiscTests::test_unbacked_strict_mode, test/dynamo/test_misc.py::MiscTests::test_unbacked_symint_split, test/dynamo/test_misc.py::MiscTests::test_unhandled_exception_in_dynamo, test/dynamo/test_misc.py::MiscTests::test_unhandled_exception_in_dynamo2, test/dynamo/test_misc.py::MiscTests::test_unique_consecutive, test/dynamo/test_misc.py::MiscTests::test_unpack4, test/dynamo/test_misc.py::MiscTests::test_unpack5, test/dynamo/test_misc.py::MiscTests::test_unpack_tensor_shape_mismatch, test/dynamo/test_misc.py::MiscTests::test_update_locals_and_stack_uses_shared_cache, test/dynamo/test_misc.py::MiscTests::test_user_code_statically_known, 
test/dynamo/test_misc.py::MiscTests::test_user_defined_binop, test/dynamo/test_misc.py::MiscTests::test_user_defined_class_name, test/dynamo/test_misc.py::MiscTests::test_user_defined_class_python_type, test/dynamo/test_misc.py::MiscTests::test_user_defined_iter, test/dynamo/test_misc.py::MiscTests::test_user_defined_object_class_interaction, test/dynamo/test_misc.py::MiscTests::test_user_defined_setattr1, test/dynamo/test_misc.py::MiscTests::test_user_defined_setattr2, test/dynamo/test_misc.py::MiscTests::test_user_function_variable_supports_enum_argument, test/dynamo/test_misc.py::MiscTests::test_user_function_variable_supports_function_argument, test/dynamo/test_misc.py::MiscTests::test_user_function_variable_supports_type_abcmeta_argument, test/dynamo/test_misc.py::MiscTests::test_user_getattr1, test/dynamo/test_misc.py::MiscTests::test_user_getattr2, test/dynamo/test_misc.py::MiscTests::test_user_getattribute, test/dynamo/test_misc.py::MiscTests::test_user_property, test/dynamo/test_misc.py::MiscTests::test_usr_cls_classmethod, test/dynamo/test_misc.py::MiscTests::test_usr_cls_staticmethod, test/dynamo/test_misc.py::MiscTests::test_validate_outputs_unbacked, test/dynamo/test_misc.py::MiscTests::test_validate_outputs_unbacked_by_custom_op, test/dynamo/test_misc.py::MiscTests::test_variable_access_in_exception, test/dynamo/test_misc.py::MiscTests::test_variable_tracker_recursively_contains, test/dynamo/test_misc.py::MiscTests::test_version_ci, test/dynamo/test_misc.py::MiscTests::test_with_builtin_type, test/dynamo/test_misc.py::MiscTests::test_write_to_cells_with_name_shadowing, test/dynamo/test_misc.py::MiscTests::test_write_to_closures_in_inlining, test/dynamo/test_misc.py::MiscTests::test_writes_to_cells_across_frames1, test/dynamo/test_misc.py::MiscTests::test_writes_to_cells_across_frames2, test/dynamo/test_misc.py::MiscTests::test_yield_from, test/dynamo/test_misc.py::MiscTests::test_yield_from_in_a_loop, 
test/dynamo/test_misc.py::MiscTests::test_yield_from_user_stop_iteration, test/dynamo/test_misc.py::MiscTests::test_yield_gen_and_from, test/dynamo/test_misc.py::MiscTests::test_yield_send_to_subgenerator_graph_break, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_register_constant_with_side_effect, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_flatten_unflatten_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_flatten_unflatten_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_flatten_unflatten_python, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_leaves_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_leaves_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_leaves_python, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_dict_order_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_dict_order_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_dict_order_python, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_only_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_only_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_only_python, test/dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_python, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_dicts_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_dicts_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_dicts_python, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_mixed_all_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_mixed_all_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_mixed_all_python, 
test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_pytree_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_pytree_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_pytree_python, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tensor_subclass_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tensor_subclass_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tensor_subclass_python, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tuples_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tuples_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tuples_python, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_pytree_cxx, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_pytree_native_optree, test/dynamo/test_misc.py::MiscTestsPyTree::test_tracing_pytree_python, test/dynamo/test_misc.py::TestTracer::test_jit_save, test/dynamo/test_misc.py::TestCustomFunction::test_autograd_function_with_matmul_folding_at_output, test/dynamo/test_misc.py::TestCustomFunction::test_retain_grad, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_dynamic_fill_diagonal__cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_dynamic_float_scalar_tensor_coersion_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_full_graph_capture_dynamic_output_shape_ops_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_full_graph_capture_scalar_outputs_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_get_device_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_gpu_set_device_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_legacy_cuda_tensor_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_parsing_sdpa_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_rand_cuda, 
test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_randint_no_graphbreak_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_scalar_isin_decomposition_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_symint_as_device_kwarg_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_torch_cudnn_is_acceptable_bad_inputs_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_torch_cudnn_is_acceptable_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_torch_device_is_available_cuda, test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_torch_device_python_type_cuda, test/dynamo/test_misc.py::DynamoOpPromotionTests::test_symbool_guard_or_false, test/dynamo/test_misc.py::DynamoOpPromotionTests::test_symbool_tensor_mul, test/dynamo/test_misc.py::DynamoOpPromotionTests::test_symbool_tensor_mul_does_not_fail, test/dynamo/test_misc.py::DynamoOpPromotionTests::test_tensorify_track_item_symint 2025-12-04T10:23:15.1109545Z 2025-12-04T10:23:15.1109806Z dynamo/test_misc.py::MiscTests::test_312_binary_slice_with_graph_break1 PASSED [0.1192s] [ 0%] 2025-12-04T10:23:15.1110184Z dynamo/test_misc.py::MiscTests::test_312_binary_slice_with_graph_break2 PASSED [0.0432s] [ 0%] 2025-12-04T10:23:15.1110441Z dynamo/test_misc.py::MiscTests::test_RAISE_VARARGS_0 SKIPPED [0.0002s] (Python 3.11+) [ 0%] 2025-12-04T10:23:15.1110679Z dynamo/test_misc.py::MiscTests::test_T_tensor_attribute PASSED [0.0170s] [ 0%] 2025-12-04T10:23:15.1110898Z dynamo/test_misc.py::MiscTests::test_add_sizes PASSED [0.0070s] [ 0%] 2025-12-04T10:23:15.1111114Z dynamo/test_misc.py::MiscTests::test_add_to_set PASSED [0.0162s] [ 0%] 2025-12-04T10:23:15.1111334Z dynamo/test_misc.py::MiscTests::test_anomaly_aot_autograd PASSED [0.0449s] [ 1%] 2025-12-04T10:23:15.1111551Z dynamo/test_misc.py::MiscTests::test_any_all_symnode PASSED [0.0620s] [ 1%] 2025-12-04T10:23:15.1111805Z dynamo/test_misc.py::MiscTests::test_aot_autograd_propagate_unbacked_symints_shape PASSED [0.0385s] [ 1%] 
2025-12-04T10:23:15.1112081Z dynamo/test_misc.py::MiscTests::test_arange_length_with_float32_dtype PASSED [5.9648s] [ 1%] 2025-12-04T10:23:15.1112336Z dynamo/test_misc.py::MiscTests::test_argwhere_with_dynamic_shapes PASSED [0.5548s] [ 1%] 2025-12-04T10:23:15.1112566Z dynamo/test_misc.py::MiscTests::test_assert PASSED [0.0271s] [ 1%] 2025-12-04T10:23:15.1112780Z dynamo/test_misc.py::MiscTests::test_assert_size_stride PASSED [0.0019s] [ 1%] 2025-12-04T10:23:15.1115466Z dynamo/test_misc.py::MiscTests::test_assigning_function_to_class_attribute PASSED [0.0164s] [ 2%] 2025-12-04T10:23:15.1115740Z dynamo/test_misc.py::MiscTests::test_assigning_function_to_object_attribute PASSED [0.0155s] [ 2%] 2025-12-04T10:23:15.1115990Z dynamo/test_misc.py::MiscTests::test_assume_32_bit_indexing PASSED [3.0456s] [ 2%] 2025-12-04T10:23:15.1116217Z dynamo/test_misc.py::MiscTests::test_backend_match_guard PASSED [0.1007s] [ 2%] 2025-12-04T10:23:15.1116453Z dynamo/test_misc.py::MiscTests::test_backend_match_guard_multi_threads PASSED [0.0401s] [ 2%] 2025-12-04T10:23:15.1116728Z dynamo/test_misc.py::MiscTests::test_backward_deterministic_mode_mismatch_warning PASSED [0.4364s] [ 2%] 2025-12-04T10:23:15.1116984Z dynamo/test_misc.py::MiscTests::test_boolarg PASSED [0.0451s] [ 3%] 2025-12-04T10:23:15.1117195Z dynamo/test_misc.py::MiscTests::test_bound_shape_checks PASSED [0.0536s] [ 3%] 2025-12-04T10:23:15.1117407Z dynamo/test_misc.py::MiscTests::test_build_tuple_unpack PASSED [0.0364s] [ 3%] 2025-12-04T10:23:15.1117645Z dynamo/test_misc.py::MiscTests::test_builder_for_class_with_metaclass PASSED [0.0198s] [ 3%] 2025-12-04T10:23:15.1117874Z dynamo/test_misc.py::MiscTests::test_builtin_abs PASSED [0.0624s] [ 3%] 2025-12-04T10:23:15.1118093Z dynamo/test_misc.py::MiscTests::test_builtin_bool_on_symbool PASSED [0.0177s] [ 3%] 2025-12-04T10:23:15.1118374Z dynamo/test_misc.py::MiscTests::test_builtin_bool_on_symfloat PASSED [0.0176s] [ 3%] 2025-12-04T10:23:15.1118607Z 
dynamo/test_misc.py::MiscTests::test_builtin_bool_on_symint PASSED [0.0166s] [ 4%] 2025-12-04T10:23:15.1118828Z dynamo/test_misc.py::MiscTests::test_builtin_complex PASSED [0.0224s] [ 4%] 2025-12-04T10:23:15.1119047Z dynamo/test_misc.py::MiscTests::test_builtin_complex_args PASSED [0.0206s] [ 4%] 2025-12-04T10:23:15.1119264Z dynamo/test_misc.py::MiscTests::test_builtin_isinstance PASSED [0.0160s] [ 4%] 2025-12-04T10:23:15.1119503Z dynamo/test_misc.py::MiscTests::test_builtin_str_on_user_defined_function PASSED [0.0119s] [ 4%] 2025-12-04T10:23:15.1119775Z dynamo/test_misc.py::MiscTests::test_builtin_subclasses_as_method_on_class_type PASSED [0.0126s] [ 4%] 2025-12-04T10:23:15.1120048Z dynamo/test_misc.py::MiscTests::test_builtin_subclasses_as_method_on_var PASSED [0.0220s] [ 4%] 2025-12-04T10:23:15.1120368Z dynamo/test_misc.py::MiscTests::test_call_parent_non_class_methods_from_child PASSED [0.0461s] [ 5%] 2025-12-04T10:23:15.1120615Z dynamo/test_misc.py::MiscTests::test_callpacked PASSED [0.0331s] [ 5%] 2025-12-04T10:23:15.1120840Z dynamo/test_misc.py::MiscTests::test_cannot_trace_mark_dynamic PASSED [0.0070s] [ 5%] 2025-12-04T10:23:15.1121142Z dynamo/test_misc.py::MiscTests::test_cannot_trace_mark_dynamic_safe_unreached PASSED [0.0059s] [ 5%] 2025-12-04T10:23:15.1121388Z dynamo/test_misc.py::MiscTests::test_cast PASSED [0.0157s] [ 5%] 2025-12-04T10:23:15.1121598Z dynamo/test_misc.py::MiscTests::test_cat_unbacked PASSED [0.0220s] [ 5%] 2025-12-04T10:23:15.1121809Z dynamo/test_misc.py::MiscTests::test_catch_watchings1 PASSED [0.0138s] [ 6%] 2025-12-04T10:23:15.1122023Z dynamo/test_misc.py::MiscTests::test_catch_watchings2 PASSED [0.0156s] [ 6%] 2025-12-04T10:23:15.1122276Z dynamo/test_misc.py::MiscTests::test_cell_captured_by_existing_func_but_not_root_frame PASSED [0.3402s] [ 6%] 2025-12-04T10:23:15.1122536Z dynamo/test_misc.py::MiscTests::test_cell_output1 PASSED [0.0177s] [ 6%] 2025-12-04T10:23:15.1122747Z dynamo/test_misc.py::MiscTests::test_cell_output2 PASSED 
[0.4763s] [ 6%] 2025-12-04T10:23:15.1123029Z dynamo/test_misc.py::MiscTests::test_check_assert_error_at_runtime_when_predicate_false_and_message_has_closure PASSED [0.0123s] [ 6%] 2025-12-04T10:23:15.1123386Z dynamo/test_misc.py::MiscTests::test_check_assert_error_at_runtime_when_predicate_true_and_message_has_closure PASSED [0.0072s] [ 6%] 2025-12-04T10:23:15.1123710Z dynamo/test_misc.py::MiscTests::test_check_compiles_when_predicate_true_and_message_None PASSED [0.0188s] [ 7%] 2025-12-04T10:23:15.1124015Z dynamo/test_misc.py::MiscTests::test_check_compiles_when_predicate_true_and_message_has_global PASSED [0.0202s] [ 7%] 2025-12-04T10:23:15.1124334Z dynamo/test_misc.py::MiscTests::test_check_compiles_when_predicate_true_and_message_has_no_closure PASSED [0.0201s] [ 7%] 2025-12-04T10:23:15.1124656Z dynamo/test_misc.py::MiscTests::test_check_compiles_when_predicate_true_constant_and_message_None PASSED [0.0164s] [ 7%] 2025-12-04T10:23:15.1124992Z dynamo/test_misc.py::MiscTests::test_check_compiles_when_predicate_true_constant_and_message_has_no_closure PASSED [0.0160s] [ 7%] 2025-12-04T10:23:15.1125327Z dynamo/test_misc.py::MiscTests::test_check_raises_at_runtime_when_predicate_false_and_message_None PASSED [0.0103s] [ 7%] 2025-12-04T10:23:15.1125658Z dynamo/test_misc.py::MiscTests::test_check_raises_at_runtime_when_predicate_false_and_message_has_global PASSED [0.0067s] [ 7%] 2025-12-04T10:23:15.1125998Z dynamo/test_misc.py::MiscTests::test_check_raises_at_runtime_when_predicate_false_and_message_has_no_closure PASSED [0.0065s] [ 8%] 2025-12-04T10:23:15.1126343Z dynamo/test_misc.py::MiscTests::test_check_raises_at_runtime_when_predicate_false_constant_and_message_None PASSED [0.0047s] [ 8%] 2025-12-04T10:23:15.1126702Z dynamo/test_misc.py::MiscTests::test_check_raises_at_runtime_when_predicate_false_constant_and_message_has_no_closure PASSED [0.0055s] [ 8%] 2025-12-04T10:23:15.1127095Z dynamo/test_misc.py::MiscTests::test_check_simplification PASSED [0.0232s] [ 
8%] 2025-12-04T10:23:15.1127311Z dynamo/test_misc.py::MiscTests::test_class_binop PASSED [0.0237s] [ 8%] 2025-12-04T10:23:15.1127520Z dynamo/test_misc.py::MiscTests::test_class_duner_flags PASSED [0.0293s] [ 8%] 2025-12-04T10:23:15.1127732Z dynamo/test_misc.py::MiscTests::test_class_duner_mro PASSED [0.0181s] [ 9%] 2025-12-04T10:23:15.1127960Z dynamo/test_misc.py::MiscTests::test_class_has_instancecheck_method PASSED [0.0171s] [ 9%] 2025-12-04T10:23:15.1128188Z dynamo/test_misc.py::MiscTests::test_clone_sparse_input PASSED [3.6118s] [ 9%] 2025-12-04T10:23:15.1128409Z dynamo/test_misc.py::MiscTests::test_closure_out_of_scope_cell PASSED [0.0197s] [ 9%] 2025-12-04T10:23:15.1128653Z dynamo/test_misc.py::MiscTests::test_closure_out_of_scope_cell_with_cond PASSED [0.0796s] [ 9%] 2025-12-04T10:23:15.1128940Z dynamo/test_misc.py::MiscTests::test_closure_out_of_scope_cell_with_mutation PASSED [0.0518s] [ 9%] 2025-12-04T10:23:15.1129181Z dynamo/test_misc.py::MiscTests::test_closure_recompiles PASSED [0.0449s] [ 9%] 2025-12-04T10:23:15.1129419Z dynamo/test_misc.py::MiscTests::test_closure_with_mutation_and_graph_break PASSED [0.0348s] [ 10%] 2025-12-04T10:23:15.1129705Z dynamo/test_misc.py::MiscTests::test_closure_write_across_functions PASSED [0.0188s] [ 10%] 2025-12-04T10:23:15.1129934Z dynamo/test_misc.py::MiscTests::test_compare_shapes_eq PASSED [0.0415s] [ 10%] 2025-12-04T10:23:15.1130179Z dynamo/test_misc.py::MiscTests::test_compare_shapes_neq PASSED [0.0424s] [ 10%] 2025-12-04T10:23:15.1130397Z dynamo/test_misc.py::MiscTests::test_compare_shapes_tuple_eq PASSED [0.0222s] [ 10%] 2025-12-04T10:23:15.1130626Z dynamo/test_misc.py::MiscTests::test_compare_shapes_tuple_neq PASSED [0.0229s] [ 10%] 2025-12-04T10:23:15.1130862Z dynamo/test_misc.py::MiscTests::test_compare_shapes_with_constant PASSED [0.0441s] [ 10%] 2025-12-04T10:23:15.1131095Z dynamo/test_misc.py::MiscTests::test_compare_tensor_with_none PASSED [0.0096s] [ 11%] 2025-12-04T10:23:15.1131333Z 
dynamo/test_misc.py::MiscTests::test_compilation_metrics_size_limit PASSED [0.1142s] [ 11%] 2025-12-04T10:23:15.1131574Z dynamo/test_misc.py::MiscTests::test_compiled_class_graph_break PASSED [0.0347s] [ 11%] 2025-12-04T10:23:15.1131800Z dynamo/test_misc.py::MiscTests::test_cond PASSED [0.0309s] [ 11%] 2025-12-04T10:23:15.1132010Z dynamo/test_misc.py::MiscTests::test_cond_export PASSED [0.0471s] [ 11%] 2025-12-04T10:23:15.1132225Z dynamo/test_misc.py::MiscTests::test_cond_export_single_arg PASSED [0.0270s] [ 11%] 2025-12-04T10:23:15.1132440Z dynamo/test_misc.py::MiscTests::test_cond_nested PASSED [0.0309s] [ 12%] 2025-12-04T10:23:15.1132665Z dynamo/test_misc.py::MiscTests::test_cond_runtime_assert_generation PASSED [0.1276s] [ 12%] 2025-12-04T10:23:15.1132892Z dynamo/test_misc.py::MiscTests::test_cond_side_effects PASSED [0.0293s] [ 12%] 2025-12-04T10:23:15.1133107Z dynamo/test_misc.py::MiscTests::test_cond_with_quantization PASSED [0.1278s] [ 12%] 2025-12-04T10:23:15.1133348Z dynamo/test_misc.py::MiscTests::test_conditional_list_comp_in_context PASSED [0.0191s] [ 12%] 2025-12-04T10:23:15.1133585Z dynamo/test_misc.py::MiscTests::test_config_getattr_default PASSED [0.0529s] [ 12%] 2025-12-04T10:23:15.1133802Z dynamo/test_misc.py::MiscTests::test_config_obj PASSED [0.0685s] [ 12%] 2025-12-04T10:23:15.1134030Z dynamo/test_misc.py::MiscTests::test_const_dict_variable_python_type PASSED [0.0020s] [ 13%] 2025-12-04T10:23:15.1134260Z dynamo/test_misc.py::MiscTests::test_constant_getattr PASSED [0.0057s] [ 13%] 2025-12-04T10:23:15.1134487Z dynamo/test_misc.py::MiscTests::test_constant_hasattr_returns_bool PASSED [0.0110s] [ 13%] 2025-12-04T10:23:15.1134819Z dynamo/test_misc.py::MiscTests::test_cross_entropy_loss_fancy_ctor1 SKIPPED [0.0002s] (https://github.com/pytorch/pytorch/issues/99726) [ 13%] 2025-12-04T10:23:15.1135131Z dynamo/test_misc.py::MiscTests::test_cross_entropy_loss_fancy_ctor2 PASSED [0.0835s] [ 13%] 2025-12-04T10:23:15.1135418Z 
dynamo/test_misc.py::MiscTests::test_cross_entropy_loss_simple_ctor PASSED [0.0506s] [ 13%] 2025-12-04T10:23:15.1135645Z dynamo/test_misc.py::MiscTests::test_custom_dict PASSED [0.0188s] [ 14%] 2025-12-04T10:23:15.1135855Z dynamo/test_misc.py::MiscTests::test_custom_module_free PASSED [0.4574s] [ 14%] 2025-12-04T10:23:15.1136084Z dynamo/test_misc.py::MiscTests::test_data_access_in_inference_mode PASSED [0.0318s] [ 14%] 2025-12-04T10:23:15.1136320Z dynamo/test_misc.py::MiscTests::test_data_ptr_graph_break_aten PASSED [0.0168s] [ 14%] 2025-12-04T10:23:15.1136551Z dynamo/test_misc.py::MiscTests::test_data_ptr_graph_break_builtin PASSED [0.0134s] [ 14%] 2025-12-04T10:23:15.1136772Z dynamo/test_misc.py::MiscTests::test_dataclass PASSED [0.0292s] [ 14%] 2025-12-04T10:23:15.1136981Z dynamo/test_misc.py::MiscTests::test_dataclass_fields PASSED [0.0576s] [ 14%] 2025-12-04T10:23:15.1137198Z dynamo/test_misc.py::MiscTests::test_dataclass_local_hasattr PASSED [0.0176s] [ 15%] 2025-12-04T10:23:15.1137432Z dynamo/test_misc.py::MiscTests::test_default_args_device_dtype PASSED [0.0180s] [ 15%] 2025-12-04T10:23:15.1137655Z dynamo/test_misc.py::MiscTests::test_default_dtype_change PASSED [0.4861s] [ 15%] 2025-12-04T10:23:15.1137868Z dynamo/test_misc.py::MiscTests::test_defaultdict PASSED [0.0175s] [ 15%] 2025-12-04T10:23:15.1138127Z dynamo/test_misc.py::MiscTests::test_deque_append_left PASSED [0.0190s] [ 15%] 2025-12-04T10:23:15.1138336Z dynamo/test_misc.py::MiscTests::test_deque_input PASSED [0.0185s] [ 15%] 2025-12-04T10:23:15.1138548Z dynamo/test_misc.py::MiscTests::test_derpy_nn_module_usage PASSED [0.0283s] [ 15%] 2025-12-04T10:23:15.1138762Z dynamo/test_misc.py::MiscTests::test_descriptor PASSED [0.0341s] [ 16%] 2025-12-04T10:23:15.1138979Z dynamo/test_misc.py::MiscTests::test_descriptor_side_effect PASSED [0.0187s] [ 16%] 2025-12-04T10:23:15.1139218Z dynamo/test_misc.py::MiscTests::test_deterministic_algorithms_mutated PASSED [0.0174s] [ 16%] 2025-12-04T10:23:15.1139450Z 
dynamo/test_misc.py::MiscTests::test_dictcomp PASSED [0.0200s] [ 16%] 2025-12-04T10:23:15.1139657Z dynamo/test_misc.py::MiscTests::test_dim_order PASSED [0.1411s] [ 16%] 2025-12-04T10:23:15.1139862Z dynamo/test_misc.py::MiscTests::test_disable_flag PASSED [0.0019s] [ 16%] 2025-12-04T10:23:15.1140078Z dynamo/test_misc.py::MiscTests::test_dtypes_no_graphbreaks PASSED [0.1393s] [ 17%] 2025-12-04T10:23:15.1140340Z dynamo/test_misc.py::MiscTests::test_dunder_methods PASSED [0.0264s] [ 17%] 2025-12-04T10:23:15.1140562Z dynamo/test_misc.py::MiscTests::test_dunder_new_function_inlining PASSED [0.0393s] [ 17%] 2025-12-04T10:23:15.1140801Z dynamo/test_misc.py::MiscTests::test_dunder_new_function_inlining1 PASSED [0.0186s] [ 17%] 2025-12-04T10:23:15.1141041Z dynamo/test_misc.py::MiscTests::test_dunder_new_function_inlining2 PASSED [0.0213s] [ 17%] 2025-12-04T10:23:15.1141277Z dynamo/test_misc.py::MiscTests::test_dunder_new_function_inlining3 PASSED [0.0196s] [ 17%] 2025-12-04T10:23:15.1141514Z dynamo/test_misc.py::MiscTests::test_dunder_new_function_inlining4 PASSED [0.0202s] [ 17%] 2025-12-04T10:23:15.1141740Z dynamo/test_misc.py::MiscTests::test_dunder_weakref PASSED [0.0177s] [ 18%] 2025-12-04T10:23:15.1141963Z dynamo/test_misc.py::MiscTests::test_duplicate_graph_break_log PASSED [0.2559s] [ 18%] 2025-12-04T10:23:15.1142187Z dynamo/test_misc.py::MiscTests::test_dynamic_one_hot PASSED [0.4389s] [ 18%] 2025-12-04T10:23:15.1142408Z dynamo/test_misc.py::MiscTests::test_dynamic_shapes_as_strided PASSED [0.0393s] [ 18%] 2025-12-04T10:23:15.1142650Z dynamo/test_misc.py::MiscTests::test_dynamic_sources_dynamic_override PASSED [0.0188s] [ 18%] 2025-12-04T10:23:15.1142910Z dynamo/test_misc.py::MiscTests::test_dynamic_sources_dynamic_override_regex PASSED [0.0212s] [ 18%] 2025-12-04T10:23:15.1143233Z dynamo/test_misc.py::MiscTests::test_dynamic_sources_force_parameter_static_shapes_and_property_static_shapes_override PASSED [0.0509s] [ 18%] 2025-12-04T10:23:15.1143584Z 
dynamo/test_misc.py::MiscTests::test_dynamic_sources_graph_break PASSED [0.0402s] [ 19%] 2025-12-04T10:23:15.1143811Z dynamo/test_misc.py::MiscTests::test_dynamic_sources_int PASSED [0.0168s] [ 19%] 2025-12-04T10:23:15.1144066Z dynamo/test_misc.py::MiscTests::test_dynamic_sources_precedence_over_int_specialization PASSED [0.0199s] [ 19%] 2025-12-04T10:23:15.1144329Z dynamo/test_misc.py::MiscTests::test_dynamic_sources_tensor PASSED [0.0182s] [ 19%] 2025-12-04T10:23:15.1144578Z dynamo/test_misc.py::MiscTests::test_dynamo_cache_invalidate PASSED [0.0585s] [ 19%] 2025-12-04T10:23:15.1144816Z dynamo/test_misc.py::MiscTests::test_dynamo_cache_move_to_front PASSED [0.0445s] [ 19%] 2025-12-04T10:23:15.1145078Z dynamo/test_misc.py::MiscTests::test_dynamo_compiling_fake_tensor_to_vararg_int PASSED [0.0178s] [ 20%] 2025-12-04T10:23:15.1145354Z dynamo/test_misc.py::MiscTests::test_dynamo_disabled_in_custom_op_kernels PASSED [0.0391s] [ 20%] 2025-12-04T10:23:15.1145604Z dynamo/test_misc.py::MiscTests::test_dynamo_inside_custom_op PASSED [0.4880s] [ 20%] 2025-12-04T10:23:15.1145850Z dynamo/test_misc.py::MiscTests::test_dynamo_min_operator_with_shape PASSED [0.0140s] [ 20%] 2025-12-04T10:23:15.1146095Z dynamo/test_misc.py::MiscTests::test_dynamo_reset_clears_cache PASSED [0.0168s] [ 20%] 2025-12-04T10:23:15.1154278Z dynamo/test_misc.py::MiscTests::test_empty_list PASSED [0.0315s] [ 20%] 2025-12-04T10:23:15.1154505Z dynamo/test_misc.py::MiscTests::test_enum_as_dict_key PASSED [0.0450s] [ 20%] 2025-12-04T10:23:15.1154791Z dynamo/test_misc.py::MiscTests::test_enum_as_dict_key_with_overloaded_str PASSED [0.0428s] [ 21%] 2025-12-04T10:23:15.1155042Z dynamo/test_misc.py::MiscTests::test_enum_guards PASSED [0.0197s] [ 21%] 2025-12-04T10:23:15.1155258Z dynamo/test_misc.py::MiscTests::test_enum_method PASSED [0.3690s] [ 21%] 2025-12-04T10:23:15.1155479Z dynamo/test_misc.py::MiscTests::test_enum_no_graphbreaks PASSED [0.0328s] [ 21%] 2025-12-04T10:23:15.1155698Z 
dynamo/test_misc.py::MiscTests::test_enum_subclass PASSED [0.0194s] [ 21%] 2025-12-04T10:23:15.1155932Z dynamo/test_misc.py::MiscTests::test_error_on_nested_fx_trace PASSED [0.0171s] [ 21%] 2025-12-04T10:23:15.1156162Z dynamo/test_misc.py::MiscTests::test_error_on_recompile PASSED [0.0180s] [ 21%] 2025-12-04T10:23:15.1156416Z dynamo/test_misc.py::MiscTests::test_escaping_closure_var_with_backward_hook PASSED [0.0205s] [ 22%] 2025-12-04T10:23:15.1156699Z dynamo/test_misc.py::MiscTests::test_escaping_closure_var_with_nonlocal_var PASSED [0.0164s] [ 22%] 2025-12-04T10:23:15.1156987Z dynamo/test_misc.py::MiscTests::test_existing_func_that_creates_capturing_nested_func PASSED [0.0175s] [ 22%] 2025-12-04T10:23:15.1157267Z dynamo/test_misc.py::MiscTests::test_fail_on_recompile_error_message PASSED [0.0029s] [ 22%] 2025-12-04T10:23:15.1157513Z dynamo/test_misc.py::MiscTests::test_flat_name_to_original_fqn PASSED [0.0360s] [ 22%] 2025-12-04T10:23:15.1157761Z dynamo/test_misc.py::MiscTests::test_float_speculation_log_divergence PASSED [1.9042s] [ 22%] 2025-12-04T10:23:15.1158003Z dynamo/test_misc.py::MiscTests::test_fn_hasattr__name__1 PASSED [0.0124s] [ 23%] 2025-12-04T10:23:15.1158224Z dynamo/test_misc.py::MiscTests::test_fn_hasattr__name__2 PASSED [0.0125s] [ 23%] 2025-12-04T10:23:15.1158442Z dynamo/test_misc.py::MiscTests::test_fn_hasattr__name__3 PASSED [0.0130s] [ 23%] 2025-12-04T10:23:15.1158661Z dynamo/test_misc.py::MiscTests::test_fold PASSED [0.0176s] [ 23%] 2025-12-04T10:23:15.1158897Z dynamo/test_misc.py::MiscTests::test_free_var_and_local_name_collision PASSED [0.0177s] [ 23%] 2025-12-04T10:23:15.1159147Z dynamo/test_misc.py::MiscTests::test_frozen_dataclass_attr_access PASSED [0.3264s] [ 23%] 2025-12-04T10:23:15.1159400Z dynamo/test_misc.py::MiscTests::test_frozen_dataclass_default_factory PASSED [0.3434s] [ 23%] 2025-12-04T10:23:15.1159654Z dynamo/test_misc.py::MiscTests::test_frozen_dataclass_default_value PASSED [0.3551s] [ 24%] 2025-12-04T10:23:15.1159903Z 
dynamo/test_misc.py::MiscTests::test_frozen_dataclass_hashable PASSED [0.4141s] [ 24%] 2025-12-04T10:23:15.1160235Z dynamo/test_misc.py::MiscTests::test_frozen_dataclass_kw_only PASSED [0.3272s] [ 24%] 2025-12-04T10:23:15.1160463Z dynamo/test_misc.py::MiscTests::test_frozen_dict PASSED [0.0195s] [ 24%] 2025-12-04T10:23:15.1160691Z dynamo/test_misc.py::MiscTests::test_frozenset_of_non_literals PASSED [0.0078s] [ 24%] 2025-12-04T10:23:15.1160936Z dynamo/test_misc.py::MiscTests::test_frozenset_torch_func_contains PASSED [0.0300s] [ 24%] 2025-12-04T10:23:15.1161169Z dynamo/test_misc.py::MiscTests::test_fullgraph_capture PASSED [0.0158s] [ 25%] 2025-12-04T10:23:15.1161388Z dynamo/test_misc.py::MiscTests::test_funcname_cache PASSED [0.0021s] [ 25%] 2025-12-04T10:23:15.1161607Z dynamo/test_misc.py::MiscTests::test_function_annotation PASSED [0.0304s] [ 25%] 2025-12-04T10:23:15.1161850Z dynamo/test_misc.py::MiscTests::test_function_generic_alias_annotation PASSED [0.0298s] [ 25%] 2025-12-04T10:23:15.1162132Z dynamo/test_misc.py::MiscTests::test_generate_tensor_from_list_of_numpy_primitive_type PASSED [0.3351s] [ 25%] 2025-12-04T10:23:15.1162414Z dynamo/test_misc.py::MiscTests::test_generate_trivial_abstract_impl PASSED [0.0203s] [ 25%] 2025-12-04T10:23:15.1162650Z dynamo/test_misc.py::MiscTests::test_get_attr_function PASSED [0.0075s] [ 25%] 2025-12-04T10:23:15.1162903Z dynamo/test_misc.py::MiscTests::test_get_cache_entry PASSED [0.7334s] [ 26%] 2025-12-04T10:23:15.1163132Z dynamo/test_misc.py::MiscTests::test_get_custom_tensor_attribute PASSED [0.0165s] [ 26%] 2025-12-04T10:23:15.1163374Z dynamo/test_misc.py::MiscTests::test_get_instruction_source_311 SKIPPED [0.0002s] [ 26%] 2025-12-04T10:23:15.1163603Z dynamo/test_misc.py::MiscTests::test_getattr_dict PASSED [0.0193s] [ 26%] 2025-12-04T10:23:15.1163848Z dynamo/test_misc.py::MiscTests::test_getattrvariable_as_python_constant PASSED [0.1127s] [ 26%] 2025-12-04T10:23:15.1164094Z 
dynamo/test_misc.py::MiscTests::test_getset_descriptor PASSED [0.0171s] [ 26%] 2025-12-04T10:23:15.1164334Z dynamo/test_misc.py::MiscTests::test_global_state_guard_serialization PASSED [0.0022s] [ 26%] 2025-12-04T10:23:15.1164573Z dynamo/test_misc.py::MiscTests::test_grad PASSED [0.0396s] [ 27%] 2025-12-04T10:23:15.1164788Z dynamo/test_misc.py::MiscTests::test_grad_non_none PASSED [0.0190s] [ 27%] 2025-12-04T10:23:15.1165006Z dynamo/test_misc.py::MiscTests::test_grad_none PASSED [0.0181s] [ 27%] 2025-12-04T10:23:15.1165222Z dynamo/test_misc.py::MiscTests::test_grad_state_mutated PASSED [0.0165s] [ 27%] 2025-12-04T10:23:15.1165459Z dynamo/test_misc.py::MiscTests::test_graph_break_compilation_metrics PASSED [0.0497s] [ 27%] 2025-12-04T10:23:15.1165728Z dynamo/test_misc.py::MiscTests::test_graph_break_compilation_metrics_on_failure PASSED [0.0095s] [ 27%] 2025-12-04T10:23:15.1166044Z dynamo/test_misc.py::MiscTests::test_graph_break_correctly_when_passing_numpy_ndarray_to_torch_function PASSED [0.0683s] [ 28%] 2025-12-04T10:23:15.1166327Z dynamo/test_misc.py::MiscTests::test_guard_failure_fn PASSED [0.0423s] [ 28%] 2025-12-04T10:23:15.1166545Z dynamo/test_misc.py::MiscTests::test_guard_failure_fn2 PASSED [0.0368s] [ 28%] 2025-12-04T10:23:15.1166781Z dynamo/test_misc.py::MiscTests::test_guard_failure_fn_shape_control PASSED [0.0352s] [ 28%] 2025-12-04T10:23:15.1167029Z dynamo/test_misc.py::MiscTests::test_guard_failure_fn_tensor_iter PASSED [0.0494s] [ 28%] 2025-12-04T10:23:15.1167268Z dynamo/test_misc.py::MiscTests::test_guard_filter_fn_by_id PASSED [0.0140s] [ 28%] 2025-12-04T10:23:15.1167502Z dynamo/test_misc.py::MiscTests::test_guard_filter_fn_by_is_global PASSED [0.6924s] [ 28%] 2025-12-04T10:23:15.1167752Z dynamo/test_misc.py::MiscTests::test_guard_filter_fn_by_name_and_value PASSED [0.0268s] [ 29%] 2025-12-04T10:23:15.1168015Z dynamo/test_misc.py::MiscTests::test_guard_filter_globals PASSED [0.4874s] [ 29%] 2025-12-04T10:23:15.1168257Z 
dynamo/test_misc.py::MiscTests::test_guard_filter_inbuilt_nn_modules PASSED [0.4841s] [ 29%] 2025-12-04T10:23:15.1168503Z dynamo/test_misc.py::MiscTests::test_guard_filter_nn_modules PASSED [0.5171s] [ 29%] 2025-12-04T10:23:15.1168759Z dynamo/test_misc.py::MiscTests::test_guard_filter_tensors PASSED [0.4744s] [ 29%] 2025-12-04T10:23:15.1169001Z dynamo/test_misc.py::MiscTests::test_guard_function_builder_with_cse PASSED [0.0031s] [ 29%] 2025-12-04T10:23:15.1169248Z dynamo/test_misc.py::MiscTests::test_guard_size_oblivious_backed PASSED [0.0335s] [ 29%] 2025-12-04T10:23:15.1169482Z dynamo/test_misc.py::MiscTests::test_guard_string_escaped PASSED [0.0174s] [ 30%] 2025-12-04T10:23:15.1169722Z dynamo/test_misc.py::MiscTests::test_guard_sym_node_fstring_when_used PASSED [0.0457s] [ 30%] 2025-12-04T10:23:15.1169966Z dynamo/test_misc.py::MiscTests::test_guards_cse_pass_multiple PASSED [0.0032s] [ 30%] 2025-12-04T10:23:15.1170238Z dynamo/test_misc.py::MiscTests::test_guards_cse_pass_single PASSED [0.0027s] [ 30%] 2025-12-04T10:23:15.1170474Z dynamo/test_misc.py::MiscTests::test_guards_strip_function_call PASSED [0.0013s] [ 30%] 2025-12-04T10:23:15.1170712Z dynamo/test_misc.py::MiscTests::test_hasattr_nn_module_guard PASSED [0.0210s] [ 30%] 2025-12-04T10:23:15.1170945Z dynamo/test_misc.py::MiscTests::test_hash_getitem_slice PASSED [0.0016s] [ 31%] 2025-12-04T10:23:15.1171164Z dynamo/test_misc.py::MiscTests::test_hash_hop PASSED [0.3050s] [ 31%] 2025-12-04T10:23:15.1171380Z dynamo/test_misc.py::MiscTests::test_id_guarded_class PASSED [0.0317s] [ 31%] 2025-12-04T10:23:15.1171625Z dynamo/test_misc.py::MiscTests::test_id_guarded_module PASSED [0.0339s] [ 31%] 2025-12-04T10:23:15.1171845Z dynamo/test_misc.py::MiscTests::test_id_guarded_object PASSED [0.0327s] [ 31%] 2025-12-04T10:23:15.1172058Z dynamo/test_misc.py::MiscTests::test_id_of_nn_module PASSED [0.0321s] [ 31%] 2025-12-04T10:23:15.1172265Z dynamo/test_misc.py::MiscTests::test_id_tensor PASSED [0.0371s] [ 31%] 
2025-12-04T10:23:15.1172473Z dynamo/test_misc.py::MiscTests::test_if_cond_nn_mod1 PASSED [0.0356s] [ 32%] 2025-12-04T10:23:15.1172684Z dynamo/test_misc.py::MiscTests::test_if_cond_nn_mod2 PASSED [0.0187s] [ 32%] 2025-12-04T10:23:15.1172895Z dynamo/test_misc.py::MiscTests::test_if_cond_nn_mod3 PASSED [0.0285s] [ 32%] 2025-12-04T10:23:15.1173117Z dynamo/test_misc.py::MiscTests::test_if_cond_user_defined_object PASSED [0.0636s] [ 32%] 2025-12-04T10:23:15.1173351Z dynamo/test_misc.py::MiscTests::test_if_cond_user_defined_object2 PASSED [0.0076s] [ 32%] 2025-12-04T10:23:15.1173594Z dynamo/test_misc.py::MiscTests::test_if_cond_user_defined_object3 PASSED [0.0432s] [ 32%] 2025-12-04T10:23:15.1173827Z dynamo/test_misc.py::MiscTests::test_infer_unbacked_size_gt_zero PASSED [0.0203s] [ 32%] 2025-12-04T10:23:15.1174050Z dynamo/test_misc.py::MiscTests::test_inference_mode PASSED [0.0195s] [ 33%] 2025-12-04T10:23:15.1174266Z dynamo/test_misc.py::MiscTests::test_inference_mode_param PASSED [0.0171s] [ 33%] 2025-12-04T10:23:15.1174506Z dynamo/test_misc.py::MiscTests::test_inline_closure_not_loaded_by_parent PASSED [0.0173s] [ 33%] 2025-12-04T10:23:15.1174795Z dynamo/test_misc.py::MiscTests::test_inline_closure_returned_by_another_function_and_captures PASSED [0.3045s] [ 33%] 2025-12-04T10:23:15.1175063Z dynamo/test_misc.py::MiscTests::test_inline_dict_function PASSED [0.3025s] [ 33%] 2025-12-04T10:23:15.1175300Z dynamo/test_misc.py::MiscTests::test_inline_dict_function_passed_as_arg PASSED [0.7529s] [ 33%] 2025-12-04T10:23:15.1175540Z dynamo/test_misc.py::MiscTests::test_inline_dict_mutation PASSED [0.0184s] [ 34%] 2025-12-04T10:23:15.1175779Z dynamo/test_misc.py::MiscTests::test_inline_func_jump_on_tensor_condition PASSED [0.0575s] [ 34%] 2025-12-04T10:23:15.1176020Z dynamo/test_misc.py::MiscTests::test_inline_list_mutation PASSED [0.0168s] [ 34%] 2025-12-04T10:23:15.1176243Z dynamo/test_misc.py::MiscTests::test_inline_local_dict_clear PASSED [0.0142s] [ 34%] 
2025-12-04T10:23:15.1176480Z dynamo/test_misc.py::MiscTests::test_inline_module_attr_dict_clear PASSED [0.0151s] [ 34%] 2025-12-04T10:23:15.1176734Z dynamo/test_misc.py::MiscTests::test_inline_user_defined_dict_attr_clear PASSED [0.0185s] [ 34%] 2025-12-04T10:23:15.1177011Z dynamo/test_misc.py::MiscTests::test_inplace PASSED [0.0187s] [ 34%] 2025-12-04T10:23:15.1177226Z dynamo/test_misc.py::MiscTests::test_inplace_desugaring PASSED [0.0169s] [ 35%] 2025-12-04T10:23:15.1177444Z dynamo/test_misc.py::MiscTests::test_inplace_param_update PASSED [0.0172s] [ 35%] 2025-12-04T10:23:15.1177673Z dynamo/test_misc.py::MiscTests::test_inplace_view_on_graph_input PASSED [0.2832s] [ 35%] 2025-12-04T10:23:15.1177901Z dynamo/test_misc.py::MiscTests::test_input_cell_mutation PASSED [0.0171s] [ 35%] 2025-12-04T10:23:15.1178121Z dynamo/test_misc.py::MiscTests::test_inspect_signature_bind PASSED [0.1135s] [ 35%] 2025-12-04T10:23:15.1178372Z dynamo/test_misc.py::MiscTests::test_inspect_signature_bind_non_user_function PASSED [0.1057s] [ 35%] 2025-12-04T10:23:15.1178631Z dynamo/test_misc.py::MiscTests::test_inspect_signature_parameters PASSED [0.0340s] [ 35%] 2025-12-04T10:23:15.1178861Z dynamo/test_misc.py::MiscTests::test_int_int_comparisons PASSED [0.0158s] [ 36%] 2025-12-04T10:23:15.1179077Z dynamo/test_misc.py::MiscTests::test_int_list PASSED [0.0352s] [ 36%] 2025-12-04T10:23:15.1179285Z dynamo/test_misc.py::MiscTests::test_int_neg PASSED [0.0176s] [ 36%] 2025-12-04T10:23:15.1179495Z dynamo/test_misc.py::MiscTests::test_int_shape_binops PASSED [0.0171s] [ 36%] 2025-12-04T10:23:15.1179742Z dynamo/test_misc.py::MiscTests::test_int_shape_comparisons PASSED [0.0175s] [ 36%] 2025-12-04T10:23:15.1179967Z dynamo/test_misc.py::MiscTests::test_int_shape_inplace_binops PASSED [0.0166s] [ 36%] 2025-12-04T10:23:15.1180249Z dynamo/test_misc.py::MiscTests::test_intermediary_tensor_grad_access PASSED [0.0173s] [ 37%] 2025-12-04T10:23:15.1180487Z 
dynamo/test_misc.py::MiscTests::test_invalid_args_builtin PASSED [0.0200s] [ 37%] 2025-12-04T10:23:15.1180701Z dynamo/test_misc.py::MiscTests::test_is_compiling PASSED [0.0554s] [ 37%] 2025-12-04T10:23:15.1180912Z dynamo/test_misc.py::MiscTests::test_is_floating_point PASSED [0.0189s] [ 37%] 2025-12-04T10:23:15.1181124Z dynamo/test_misc.py::MiscTests::test_is_floating_point2 PASSED [0.0174s] [ 37%] 2025-12-04T10:23:15.1181335Z dynamo/test_misc.py::MiscTests::test_is_tensor PASSED [0.0178s] [ 37%] 2025-12-04T10:23:15.1181545Z dynamo/test_misc.py::MiscTests::test_is_tensor2 PASSED [0.0296s] [ 37%] 2025-12-04T10:23:15.1181759Z dynamo/test_misc.py::MiscTests::test_is_tensor_like PASSED [0.0308s] [ 38%] 2025-12-04T10:23:15.1181967Z dynamo/test_misc.py::MiscTests::test_is_tensor_like2 PASSED [0.0292s] [ 38%] 2025-12-04T10:23:15.1182174Z dynamo/test_misc.py::MiscTests::test_item PASSED [0.0190s] [ 38%] 2025-12-04T10:23:15.1182381Z dynamo/test_misc.py::MiscTests::test_item_changes PASSED [0.0325s] [ 38%] 2025-12-04T10:23:15.1182597Z dynamo/test_misc.py::MiscTests::test_item_changes_new_shape PASSED [0.0351s] [ 38%] 2025-12-04T10:23:15.1182814Z dynamo/test_misc.py::MiscTests::test_iter_set PASSED [0.0189s] [ 38%] 2025-12-04T10:23:15.1183022Z dynamo/test_misc.py::MiscTests::test_iter_type PASSED [0.3543s] [ 39%] 2025-12-04T10:23:15.1183234Z dynamo/test_misc.py::MiscTests::test_iterator_limit PASSED [1.2142s] [ 39%] 2025-12-04T10:23:15.1183475Z dynamo/test_misc.py::MiscTests::test_itertools_accumulate_symint_default_sum PASSED [0.0199s] [ 39%] 2025-12-04T10:23:15.1183754Z dynamo/test_misc.py::MiscTests::test_itertools_accumulate_tensors_builtins PASSED [0.0714s] [ 39%] 2025-12-04T10:23:15.1184028Z dynamo/test_misc.py::MiscTests::test_itertools_accumulate_tensors_default_sum PASSED [0.0251s] [ 39%] 2025-12-04T10:23:15.1184298Z dynamo/test_misc.py::MiscTests::test_itertools_accumulate_tensors_kwargs PASSED [0.0716s] [ 39%] 2025-12-04T10:23:15.1184567Z 
dynamo/test_misc.py::MiscTests::test_itertools_accumulate_tensors_user_defined PASSED [0.0733s] [ 39%] 2025-12-04T10:23:15.1184854Z dynamo/test_misc.py::MiscTests::test_itertools_groupby_pure_python_default_identify_func PASSED [0.0151s] [ 40%] 2025-12-04T10:23:15.1185139Z dynamo/test_misc.py::MiscTests::test_itertools_groupby_pure_python_key_func PASSED [0.0154s] [ 40%] 2025-12-04T10:23:15.1185423Z dynamo/test_misc.py::MiscTests::test_itertools_infinite_count PASSED [0.0618s] [ 40%] 2025-12-04T10:23:15.1185656Z dynamo/test_misc.py::MiscTests::test_itertools_infinite_cycle PASSED [0.0333s] [ 40%] 2025-12-04T10:23:15.1185888Z dynamo/test_misc.py::MiscTests::test_itertools_infinite_repeat PASSED [0.0189s] [ 40%] 2025-12-04T10:23:15.1186138Z dynamo/test_misc.py::MiscTests::test_itertools_infinite_repeat_mutation PASSED [0.0216s] [ 40%] 2025-12-04T10:23:15.1186376Z dynamo/test_misc.py::MiscTests::test_itertools_islice PASSED [0.0150s] [ 40%] 2025-12-04T10:23:15.1186602Z dynamo/test_misc.py::MiscTests::test_itertools_islice_default_end PASSED [0.0147s] [ 41%] 2025-12-04T10:23:15.1186844Z dynamo/test_misc.py::MiscTests::test_itertools_islice_default_step PASSED [0.0147s] [ 41%] 2025-12-04T10:23:15.1187073Z dynamo/test_misc.py::MiscTests::test_itertools_repeat PASSED [0.0184s] [ 41%] 2025-12-04T10:23:15.1187286Z dynamo/test_misc.py::MiscTests::test_itertools_tee PASSED [0.0172s] [ 41%] 2025-12-04T10:23:15.1187521Z dynamo/test_misc.py::MiscTests::test_jacfwd_one_hot_dynamic_compile PASSED [0.6609s] [ 41%] 2025-12-04T10:23:15.1187754Z dynamo/test_misc.py::MiscTests::test_large_reduction_list PASSED [0.0151s] [ 41%] 2025-12-04T10:23:15.1188001Z dynamo/test_misc.py::MiscTests::test_linear_module_free PASSED [0.3774s] [ 42%] 2025-12-04T10:23:15.1188221Z dynamo/test_misc.py::MiscTests::test_list_append_return_none PASSED [0.0174s] [ 42%] 2025-12-04T10:23:15.1188440Z dynamo/test_misc.py::MiscTests::test_list_class PASSED [0.0180s] [ 42%] 2025-12-04T10:23:15.1188651Z 
dynamo/test_misc.py::MiscTests::test_list_hasattr1 PASSED [0.0167s] [ 42%] 2025-12-04T10:23:15.1188861Z dynamo/test_misc.py::MiscTests::test_list_hasattr2 PASSED [0.0163s] [ 42%] 2025-12-04T10:23:15.1189078Z dynamo/test_misc.py::MiscTests::test_list_iadd_side_effect PASSED [0.0209s] [ 42%] 2025-12-04T10:23:15.1189297Z dynamo/test_misc.py::MiscTests::test_list_iadd_with_shape PASSED [0.0184s] [ 42%] 2025-12-04T10:23:15.1189519Z dynamo/test_misc.py::MiscTests::test_list_iterator_contains PASSED [0.0175s] [ 43%] 2025-12-04T10:23:15.1189736Z dynamo/test_misc.py::MiscTests::test_list_mul PASSED [0.0062s] [ 43%] 2025-12-04T10:23:15.1189944Z dynamo/test_misc.py::MiscTests::test_list_slice_mul PASSED [0.0048s] [ 43%] 2025-12-04T10:23:15.1190200Z dynamo/test_misc.py::MiscTests::test_listcomp PASSED [0.0184s] [ 43%] 2025-12-04T10:23:15.1190431Z dynamo/test_misc.py::MiscTests::test_load_fast_and_clear_graph_break PASSED [0.0362s] [ 43%] 2025-12-04T10:23:15.1190661Z dynamo/test_misc.py::MiscTests::test_mandelbrot_numpy PASSED [10.0829s] [ 43%] 2025-12-04T10:23:15.1190874Z dynamo/test_misc.py::MiscTests::test_map_side_effects PASSED [0.0403s] [ 43%] 2025-12-04T10:23:15.1191093Z dynamo/test_misc.py::MiscTests::test_map_with_quantization PASSED [0.1015s] [ 44%] 2025-12-04T10:23:15.1191321Z dynamo/test_misc.py::MiscTests::test_mark_dynamic_with_ranges PASSED [0.0078s] [ 44%] 2025-12-04T10:23:15.1191541Z dynamo/test_misc.py::MiscTests::test_mark_static PASSED [0.0337s] [ 44%] 2025-12-04T10:23:15.1191754Z dynamo/test_misc.py::MiscTests::test_mark_unbacked_strict PASSED [0.4293s] [ 44%] 2025-12-04T10:23:15.1191967Z dynamo/test_misc.py::MiscTests::test_matmul1 PASSED [0.0162s] [ 44%] 2025-12-04T10:23:15.1192183Z dynamo/test_misc.py::MiscTests::test_min_max_over_iterable PASSED [0.0347s] [ 44%] 2025-12-04T10:23:15.1192399Z dynamo/test_misc.py::MiscTests::test_module_complex_iter PASSED [0.0303s] [ 45%] 2025-12-04T10:23:15.1192615Z dynamo/test_misc.py::MiscTests::test_module_deepcopy 
PASSED [0.0792s] [ 45%] 2025-12-04T10:23:15.1192829Z dynamo/test_misc.py::MiscTests::test_module_not_callable PASSED [0.0149s] [ 45%] 2025-12-04T10:23:15.1193051Z dynamo/test_misc.py::MiscTests::test_mro_type_tensor_no_source PASSED [0.0312s] [ 45%] 2025-12-04T10:23:15.1193276Z dynamo/test_misc.py::MiscTests::test_multiple_inheritance PASSED [0.0210s] [ 45%] 2025-12-04T10:23:15.1193552Z dynamo/test_misc.py::MiscTests::test_mutable_mapping_multiple_inheritance PASSED [0.0443s] [ 45%] 2025-12-04T10:23:15.1193795Z dynamo/test_misc.py::MiscTests::test_named_parameters PASSED [1.3611s] [ 45%] 2025-12-04T10:23:15.1194007Z dynamo/test_misc.py::MiscTests::test_namedtuple1 PASSED [0.0201s] [ 46%] 2025-12-04T10:23:15.1194219Z dynamo/test_misc.py::MiscTests::test_namedtuple2 PASSED [0.0201s] [ 46%] 2025-12-04T10:23:15.1194427Z dynamo/test_misc.py::MiscTests::test_namedtuple3 PASSED [0.0164s] [ 46%] 2025-12-04T10:23:15.1194637Z dynamo/test_misc.py::MiscTests::test_namedtuple_class PASSED [0.0191s] [ 46%] 2025-12-04T10:23:15.1194876Z dynamo/test_misc.py::MiscTests::test_namedtuple_source_dynamic_attributes PASSED [0.0174s] [ 46%] 2025-12-04T10:23:15.1195145Z dynamo/test_misc.py::MiscTests::test_namedtuple_sourceless_dynamic_attributes PASSED [0.0172s] [ 46%] 2025-12-04T10:23:15.1195408Z dynamo/test_misc.py::MiscTests::test_namedtuple_with_custom_getitem PASSED [0.0160s] [ 46%] 2025-12-04T10:23:15.1195643Z dynamo/test_misc.py::MiscTests::test_nan PASSED [0.0169s] [ 47%] 2025-12-04T10:23:15.1195864Z dynamo/test_misc.py::MiscTests::test_ne_operator_with_custom_eq PASSED [0.0191s] [ 47%] 2025-12-04T10:23:15.1196114Z dynamo/test_misc.py::MiscTests::test_ne_operator_with_custom_graphbreak_eq PASSED [0.0270s] [ 47%] 2025-12-04T10:23:15.1196402Z dynamo/test_misc.py::MiscTests::test_ne_operator_with_custom_ne PASSED [0.0178s] [ 47%] 2025-12-04T10:23:15.1196628Z dynamo/test_misc.py::MiscTests::test_nested_closure PASSED [0.0375s] [ 47%] 2025-12-04T10:23:15.1196852Z 
dynamo/test_misc.py::MiscTests::test_nested_closure_mutation PASSED [0.0334s] [ 47%] 2025-12-04T10:23:15.1197088Z dynamo/test_misc.py::MiscTests::test_nested_dataclass_reconstruct PASSED [0.0242s] [ 48%] 2025-12-04T10:23:15.1197335Z dynamo/test_misc.py::MiscTests::test_nested_frozen_dataclass_hashable PASSED [0.3960s] [ 48%] 2025-12-04T10:23:15.1197604Z dynamo/test_misc.py::MiscTests::test_nested_function_resuming_with_correct_globals PASSED [0.0550s] [ 48%] 2025-12-04T10:23:15.1197858Z dynamo/test_misc.py::MiscTests::test_nested_optimize PASSED [0.0332s] [ 48%] 2025-12-04T10:23:15.1198081Z dynamo/test_misc.py::MiscTests::test_nested_optimize_decorator PASSED [0.0213s] [ 48%] 2025-12-04T10:23:15.1198311Z dynamo/test_misc.py::MiscTests::test_nested_optimize_run PASSED [0.0447s] [ 48%] 2025-12-04T10:23:15.1198530Z dynamo/test_misc.py::MiscTests::test_nested_sequential_try PASSED [0.0162s] [ 48%] 2025-12-04T10:23:15.1198756Z dynamo/test_misc.py::MiscTests::test_nested_sequential_try_with PASSED [0.0179s] [ 49%] 2025-12-04T10:23:15.1199010Z dynamo/test_misc.py::MiscTests::test_nested_sequential_try_with_graph_break PASSED [0.0847s] [ 49%] 2025-12-04T10:23:15.1199259Z dynamo/test_misc.py::MiscTests::test_nested_sequential_with PASSED [0.0173s] [ 49%] 2025-12-04T10:23:15.1199479Z dynamo/test_misc.py::MiscTests::test_nested_wraps PASSED [0.0327s] [ 49%] 2025-12-04T10:23:15.1199704Z dynamo/test_misc.py::MiscTests::test_nesteduserfunction_setattr PASSED [0.0180s] [ 49%] 2025-12-04T10:23:15.1199930Z dynamo/test_misc.py::MiscTests::test_new_with_int_list PASSED [0.0183s] [ 49%] 2025-12-04T10:23:15.1200225Z dynamo/test_misc.py::MiscTests::test_newly_constructed_tensor_attr_mutation PASSED [0.0168s] [ 50%] 2025-12-04T10:23:15.1200481Z dynamo/test_misc.py::MiscTests::test_nn_functional_reduction PASSED [0.0178s] [ 50%] 2025-12-04T10:23:15.1200707Z dynamo/test_misc.py::MiscTests::test_nn_module_getattr PASSED [0.0205s] [ 50%] 2025-12-04T10:23:15.1200926Z 
dynamo/test_misc.py::MiscTests::test_nn_module_getattribute PASSED [0.0211s] [ 50%] 2025-12-04T10:23:15.1201154Z dynamo/test_misc.py::MiscTests::test_nn_sequential_invocation PASSED [0.0536s] [ 50%] 2025-12-04T10:23:15.1201408Z dynamo/test_misc.py::MiscTests::test_nn_sequential_invocation_reposition_indices PASSED [0.0472s] [ 50%] 2025-12-04T10:23:15.1201668Z dynamo/test_misc.py::MiscTests::test_no_error_on_nested_fx_trace PASSED [0.0150s] [ 50%] 2025-12-04T10:23:15.1201949Z dynamo/test_misc.py::MiscTests::test_no_guard_for_unused_sym_node_fstring PASSED [0.0215s] [ 51%] 2025-12-04T10:23:15.1202207Z dynamo/test_misc.py::MiscTests::test_no_raise_guard_partial_constraint PASSED [0.0382s] [ 51%] 2025-12-04T10:23:15.1202476Z dynamo/test_misc.py::MiscTests::test_no_raise_guard_partial_constraint_across_break PASSED [0.0712s] [ 51%] 2025-12-04T10:23:15.1202749Z dynamo/test_misc.py::MiscTests::test_non_pt2_compliant_ops_graph_break PASSED [0.0130s] [ 51%] 2025-12-04T10:23:15.1202986Z dynamo/test_misc.py::MiscTests::test_not_dynamic_scope PASSED [0.0141s] [ 51%] 2025-12-04T10:23:15.1203199Z dynamo/test_misc.py::MiscTests::test_numel PASSED [0.0178s] [ 51%] 2025-12-04T10:23:15.1203415Z dynamo/test_misc.py::MiscTests::test_numpy_array_of_arrays PASSED [0.0511s] [ 51%] 2025-12-04T10:23:15.1203631Z dynamo/test_misc.py::MiscTests::test_numpy_as_global PASSED [0.3287s] [ 52%] 2025-12-04T10:23:15.1203850Z dynamo/test_misc.py::MiscTests::test_numpy_fallback_on_eager PASSED [0.0148s] [ 52%] 2025-12-04T10:23:15.1204076Z dynamo/test_misc.py::MiscTests::test_numpy_force PASSED [0.0328s] [ 52%] 2025-12-04T10:23:15.1204285Z dynamo/test_misc.py::MiscTests::test_numpy_gt PASSED [0.3327s] [ 52%] 2025-12-04T10:23:15.1204495Z dynamo/test_misc.py::MiscTests::test_numpy_int_constant PASSED [0.0268s] [ 52%] 2025-12-04T10:23:15.1204745Z dynamo/test_misc.py::MiscTests::test_numpy_iter PASSED [0.0183s] [ 52%] 2025-12-04T10:23:15.1204954Z dynamo/test_misc.py::MiscTests::test_numpy_min PASSED 
[0.3375s] [ 53%] 2025-12-04T10:23:15.1205174Z dynamo/test_misc.py::MiscTests::test_numpy_ndarray_graph_break PASSED [0.0366s] [ 53%] 2025-12-04T10:23:15.1205434Z dynamo/test_misc.py::MiscTests::test_numpy_ndarray_graph_break_with_multiple_outputs PASSED [0.0410s] [ 53%] 2025-12-04T10:23:15.1205717Z dynamo/test_misc.py::MiscTests::test_numpy_ndarray_works_with_builtin_function PASSED [0.0191s] [ 53%] 2025-12-04T10:23:15.1205960Z dynamo/test_misc.py::MiscTests::test_numpy_no_raise PASSED [0.8354s] [ 53%] 2025-12-04T10:23:15.1206183Z dynamo/test_misc.py::MiscTests::test_numpy_non_torch_dtype PASSED [0.0074s] [ 53%] 2025-12-04T10:23:15.1206414Z dynamo/test_misc.py::MiscTests::test_numpy_random_config_to_numpy PASSED [0.0131s] [ 53%] 2025-12-04T10:23:15.1206647Z dynamo/test_misc.py::MiscTests::test_numpy_readonly PASSED [0.0132s] [ 54%] 2025-12-04T10:23:15.1206870Z dynamo/test_misc.py::MiscTests::test_numpy_recompilation_scalar PASSED [0.0256s] [ 54%] 2025-12-04T10:23:15.1207101Z dynamo/test_misc.py::MiscTests::test_numpy_size_attr PASSED [0.0167s] [ 54%] 2025-12-04T10:23:15.1207316Z dynamo/test_misc.py::MiscTests::test_numpy_subdtype PASSED [0.0170s] [ 54%] 2025-12-04T10:23:15.1207534Z dynamo/test_misc.py::MiscTests::test_numpy_take_along_axis PASSED [0.1479s] [ 54%] 2025-12-04T10:23:15.1207750Z dynamo/test_misc.py::MiscTests::test_numpy_tolist PASSED [0.0213s] [ 54%] 2025-12-04T10:23:15.1207970Z dynamo/test_misc.py::MiscTests::test_numpy_torch_operators PASSED [1.2542s] [ 54%] 2025-12-04T10:23:15.1208188Z dynamo/test_misc.py::MiscTests::test_numpy_ufunc_out PASSED [0.0184s] [ 55%] 2025-12-04T10:23:15.1208410Z dynamo/test_misc.py::MiscTests::test_numpy_ufunc_out_graph_break XFAIL [0.0306s] [ 55%] 2025-12-04T10:23:15.1208632Z dynamo/test_misc.py::MiscTests::test_numpy_unique_f16 PASSED [0.0182s] [ 55%] 2025-12-04T10:23:15.1208855Z dynamo/test_misc.py::MiscTests::test_numpy_variable_isinstance PASSED [0.0313s] [ 55%] 2025-12-04T10:23:15.1209085Z 
dynamo/test_misc.py::MiscTests::test_numpy_with_builtin_type PASSED [0.0211s] [ 55%] 2025-12-04T10:23:15.1209305Z dynamo/test_misc.py::MiscTests::test_object_classmethod PASSED [0.0171s] [ 55%] 2025-12-04T10:23:15.1209518Z dynamo/test_misc.py::MiscTests::test_object_setattr PASSED [0.0621s] [ 56%] 2025-12-04T10:23:15.1209735Z dynamo/test_misc.py::MiscTests::test_object_staticmethod PASSED [0.0167s] [ 56%] 2025-12-04T10:23:15.1209956Z dynamo/test_misc.py::MiscTests::test_onnx_shape_as_tensor PASSED [0.0718s] [ 56%] 2025-12-04T10:23:15.1210271Z dynamo/test_misc.py::MiscTests::test_optimize_on_module PASSED [0.0202s] [ 56%] 2025-12-04T10:23:15.1210506Z dynamo/test_misc.py::MiscTests::test_ordered_dict_alias_reconstruct PASSED [0.0249s] [ 56%] 2025-12-04T10:23:15.1210747Z dynamo/test_misc.py::MiscTests::test_ordered_dict_move_to_end PASSED [0.0156s] [ 56%] 2025-12-04T10:23:15.1210970Z dynamo/test_misc.py::MiscTests::test_os_environ_get PASSED [0.0381s] [ 56%] 2025-12-04T10:23:15.1211195Z dynamo/test_misc.py::MiscTests::test_os_environ_set_graph_break PASSED [0.0354s] [ 57%] 2025-12-04T10:23:15.1211422Z dynamo/test_misc.py::MiscTests::test_out_variant_custom_op PASSED [0.0394s] [ 57%] 2025-12-04T10:23:15.1211671Z dynamo/test_misc.py::MiscTests::test_out_variants_with_resizing_on_graph_inputs PASSED [0.0183s] [ 57%] 2025-12-04T10:23:15.1211967Z dynamo/test_misc.py::MiscTests::test_out_variants_with_resizing_on_graph_inputs_with_dynamic PASSED [0.0712s] [ 57%] 2025-12-04T10:23:15.1212282Z dynamo/test_misc.py::MiscTests::test_out_variants_with_resizing_on_graph_inputs_with_dynamic1 PASSED [0.0731s] [ 57%] 2025-12-04T10:23:15.1212555Z dynamo/test_misc.py::MiscTests::test_outside_linear_module_free PASSED [0.4278s] [ 57%] 2025-12-04T10:23:15.1212787Z dynamo/test_misc.py::MiscTests::test_overridden_getattribute PASSED [0.0227s] [ 57%] 2025-12-04T10:23:15.1213046Z dynamo/test_misc.py::MiscTests::test_packaging_version_parse PASSED [0.0348s] [ 58%] 2025-12-04T10:23:15.1213268Z 
dynamo/test_misc.py::MiscTests::test_pair PASSED [0.0210s] [ 58%] 2025-12-04T10:23:15.1213485Z dynamo/test_misc.py::MiscTests::test_param_shape_binops PASSED [0.0187s] [ 58%] 2025-12-04T10:23:15.1213698Z dynamo/test_misc.py::MiscTests::test_parameter_free PASSED [0.3780s] [ 58%] 2025-12-04T10:23:15.1213921Z dynamo/test_misc.py::MiscTests::test_patched_builtin_functions PASSED [0.0331s] [ 58%] 2025-12-04T10:23:15.1214163Z dynamo/test_misc.py::MiscTests::test_pep0479_convert_stopiteration PASSED [0.0145s] [ 58%] 2025-12-04T10:23:15.1214396Z dynamo/test_misc.py::MiscTests::test_precompile_entries PASSED [0.3446s] [ 59%] 2025-12-04T10:23:15.1214613Z dynamo/test_misc.py::MiscTests::test_precompile_entry_hit PASSED [0.3664s] [ 59%] 2025-12-04T10:23:15.1214833Z dynamo/test_misc.py::MiscTests::test_precompile_entry_miss PASSED [0.2962s] [ 59%] 2025-12-04T10:23:15.1215067Z dynamo/test_misc.py::MiscTests::test_precompile_fail_on_recompile PASSED [0.0028s] [ 59%] 2025-12-04T10:23:15.1215302Z dynamo/test_misc.py::MiscTests::test_proxy_frozen_dataclass PASSED [0.4217s] [ 59%] 2025-12-04T10:23:15.1215538Z dynamo/test_misc.py::MiscTests::test_pt2_compliant_ops_are_allowed PASSED [0.0170s] [ 59%] 2025-12-04T10:23:15.1215771Z dynamo/test_misc.py::MiscTests::test_pt2_compliant_overload PASSED [0.0267s] [ 59%] 2025-12-04T10:23:15.1215994Z dynamo/test_misc.py::MiscTests::test_pure_python_accumulate PASSED [0.0190s] [ 60%] 2025-12-04T10:23:15.1216216Z dynamo/test_misc.py::MiscTests::test_py_guards_mark_dynamic PASSED [0.1132s] [ 60%] 2025-12-04T10:23:15.1216438Z dynamo/test_misc.py::MiscTests::test_python_slice PASSED [0.0134s] [ 60%] 2025-12-04T10:23:15.1216662Z dynamo/test_misc.py::MiscTests::test_raise_guard_full_constraint PASSED [0.0134s] [ 60%] 2025-12-04T10:23:15.1216913Z dynamo/test_misc.py::MiscTests::test_raise_guard_indirect_full_constraint PASSED [0.0131s] [ 60%] 2025-12-04T10:23:15.1217187Z dynamo/test_misc.py::MiscTests::test_raise_guard_partial_constraint_across_break 
XFAIL [0.0296s] [ 60%] 2025-12-04T10:23:15.1217469Z dynamo/test_misc.py::MiscTests::test_raise_guard_partial_constraint_no_graph_break PASSED [0.0163s] [ 60%] 2025-12-04T10:23:15.1217723Z dynamo/test_misc.py::MiscTests::test_raise_on_backend_error PASSED [0.0081s] [ 61%] 2025-12-04T10:23:15.1217942Z dynamo/test_misc.py::MiscTests::test_raises PASSED [0.0214s] [ 61%] 2025-12-04T10:23:15.1218156Z dynamo/test_misc.py::MiscTests::test_raises_importerror1 PASSED [0.0066s] [ 61%] 2025-12-04T10:23:15.1218376Z dynamo/test_misc.py::MiscTests::test_raises_importerror2 PASSED [0.0050s] [ 61%] 2025-12-04T10:23:15.1218620Z dynamo/test_misc.py::MiscTests::test_range___iter__ PASSED [0.0143s] [ 61%] 2025-12-04T10:23:15.1218834Z dynamo/test_misc.py::MiscTests::test_range_input PASSED [0.0173s] [ 61%] 2025-12-04T10:23:15.1219048Z dynamo/test_misc.py::MiscTests::test_range_iter_guards PASSED [0.7198s] [ 62%] 2025-12-04T10:23:15.1219272Z dynamo/test_misc.py::MiscTests::test_range_iter_side_effects PASSED [0.0149s] [ 62%] 2025-12-04T10:23:15.1219493Z dynamo/test_misc.py::MiscTests::test_range_with_shape PASSED [0.0187s] [ 62%] 2025-12-04T10:23:15.1219718Z dynamo/test_misc.py::MiscTests::test_real_imag_tensor_attribute PASSED [0.0202s] [ 62%] 2025-12-04T10:23:15.1219963Z dynamo/test_misc.py::MiscTests::test_recompile_message_on_parameter PASSED [0.0467s] [ 62%] 2025-12-04T10:23:15.1220248Z dynamo/test_misc.py::MiscTests::test_recompile_on_disable_1 PASSED [0.0433s] [ 62%] 2025-12-04T10:23:15.1220473Z dynamo/test_misc.py::MiscTests::test_recompile_on_disable_2 PASSED [0.0018s] [ 62%] 2025-12-04T10:23:15.1220718Z dynamo/test_misc.py::MiscTests::test_recompile_on_global_state_change PASSED [0.0431s] [ 63%] 2025-12-04T10:23:15.1220968Z dynamo/test_misc.py::MiscTests::test_reconstruct_frozen_dataclass PASSED [0.3370s] [ 63%] 2025-12-04T10:23:15.1221221Z dynamo/test_misc.py::MiscTests::test_reconstruct_set_across_graph_break PASSED [0.0274s] [ 63%] 2025-12-04T10:23:15.1221498Z 
dynamo/test_misc.py::MiscTests::test_recursion_depth_guards PASSED [2.8386s] [ 63%] 2025-12-04T10:23:15.1221733Z dynamo/test_misc.py::MiscTests::test_recursive_inline_list_mutation PASSED [0.0214s] [ 63%] 2025-12-04T10:23:15.1221976Z dynamo/test_misc.py::MiscTests::test_recursive_tensor_attribute PASSED [0.0209s] [ 63%] 2025-12-04T10:23:15.1222204Z dynamo/test_misc.py::MiscTests::test_release_input_memory PASSED [0.0162s] [ 64%] 2025-12-04T10:23:15.1222423Z dynamo/test_misc.py::MiscTests::test_release_module_memory PASSED [0.0222s] [ 64%] 2025-12-04T10:23:15.1222643Z dynamo/test_misc.py::MiscTests::test_release_scope_memory PASSED [0.0071s] [ 64%] 2025-12-04T10:23:15.1222858Z dynamo/test_misc.py::MiscTests::test_remove_set PASSED [0.0157s] [ 64%] 2025-12-04T10:23:15.1223086Z dynamo/test_misc.py::MiscTests::test_repeat_interleave_graphbreaks PASSED [0.0618s] [ 64%] 2025-12-04T10:23:15.1223326Z dynamo/test_misc.py::MiscTests::test_replay_side_effects_config PASSED [0.0417s] [ 64%] 2025-12-04T10:23:15.1223565Z dynamo/test_misc.py::MiscTests::test_replay_side_effects_input_mut PASSED [0.5116s] [ 64%] 2025-12-04T10:23:15.1223808Z dynamo/test_misc.py::MiscTests::test_replay_side_effects_model_attr PASSED [0.4653s] [ 65%] 2025-12-04T10:23:15.1224034Z dynamo/test_misc.py::MiscTests::test_repr PASSED [0.3708s] [ 65%] 2025-12-04T10:23:15.1224272Z dynamo/test_misc.py::MiscTests::test_repro_graph_breaks_in__get_item_by_idx PASSED [0.0346s] [ 65%] 2025-12-04T10:23:15.1224515Z dynamo/test_misc.py::MiscTests::test_restore_graphstate PASSED [0.0662s] [ 65%] 2025-12-04T10:23:15.1224758Z dynamo/test_misc.py::MiscTests::test_return_dict_with_graph_break_and_update PASSED [0.0452s] [ 65%] 2025-12-04T10:23:15.1225008Z dynamo/test_misc.py::MiscTests::test_return_nested_function PASSED [0.0368s] [ 65%] 2025-12-04T10:23:15.1225264Z dynamo/test_misc.py::MiscTests::test_returning_func_with_captured_func_and_tensor PASSED [0.0177s] [ 65%] 2025-12-04T10:23:15.1225548Z 
dynamo/test_misc.py::MiscTests::test_returning_nested_func_with_captured_tensor PASSED [0.0170s] [ 66%] 2025-12-04T10:23:15.1225825Z dynamo/test_misc.py::MiscTests::test_running_func_with_captured_func_and_tensor PASSED [0.0173s] [ 66%] 2025-12-04T10:23:15.1226105Z dynamo/test_misc.py::MiscTests::test_running_nested_func_with_captured_tensor PASSED [0.0169s] [ 66%] 2025-12-04T10:23:15.1226482Z dynamo/test_misc.py::MiscTests::test_runtime_assert_replacement PASSED [0.0233s] [ 66%] 2025-12-04T10:23:15.1226708Z dynamo/test_misc.py::MiscTests::test_sample_input PASSED [0.0182s] [ 66%] 2025-12-04T10:23:15.1226926Z dynamo/test_misc.py::MiscTests::test_scalar_device_movement PASSED [0.0179s] [ 66%] 2025-12-04T10:23:15.1227224Z dynamo/test_misc.py::MiscTests::test_scalar_tensor_is_equivalent_to_int_list_argument PASSED [0.0305s] [ 67%] 2025-12-04T10:23:15.1227515Z dynamo/test_misc.py::MiscTests::test_scalar_tensor_is_equivalent_to_symint_argument PASSED [0.0617s] [ 67%] 2025-12-04T10:23:15.1227809Z dynamo/test_misc.py::MiscTests::test_scalar_tensor_is_equivalent_to_symint_list_argument PASSED [0.0186s] [ 67%] 2025-12-04T10:23:15.1228071Z dynamo/test_misc.py::MiscTests::test_sequential_module_free PASSED [0.5880s] [ 67%] 2025-12-04T10:23:15.1228299Z dynamo/test_misc.py::MiscTests::test_set_aliasing_recompiles PASSED [0.0934s] [ 67%] 2025-12-04T10:23:15.1228531Z dynamo/test_misc.py::MiscTests::test_set_custom_tensor_attribute PASSED [0.0164s] [ 67%] 2025-12-04T10:23:15.1228756Z dynamo/test_misc.py::MiscTests::test_set_descriptor PASSED [0.0195s] [ 67%] 2025-12-04T10:23:15.1228967Z dynamo/test_misc.py::MiscTests::test_set_discard PASSED [0.0174s] [ 68%] 2025-12-04T10:23:15.1229182Z dynamo/test_misc.py::MiscTests::test_set_update PASSED [0.0201s] [ 68%] 2025-12-04T10:23:15.1229392Z dynamo/test_misc.py::MiscTests::test_setattr_mutation1 PASSED [0.0229s] [ 68%] 2025-12-04T10:23:15.1229603Z dynamo/test_misc.py::MiscTests::test_setattr_mutation2 PASSED [0.0218s] [ 68%] 
2025-12-04T10:23:15.1229846Z dynamo/test_misc.py::MiscTests::test_setattr_mutation3 PASSED [0.0219s] [ 68%] 2025-12-04T10:23:15.1230074Z dynamo/test_misc.py::MiscTests::test_shape_and_tuple_equality PASSED [0.0214s] [ 68%] 2025-12-04T10:23:15.1230416Z dynamo/test_misc.py::MiscTests::test_shape_env_equal_constructor SKIPPED [0.0019s] (only works when TV is True.) [ 68%] 2025-12-04T10:23:15.1230774Z dynamo/test_misc.py::MiscTests::test_shape_env_equal_create_symbolic_sizes_strides_storage_offset SKIPPED [0.0014s] (only works when TV is True.) [ 69%] 2025-12-04T10:23:15.1231089Z dynamo/test_misc.py::MiscTests::test_shape_env_equal_empty PASSED [0.0014s] [ 69%] 2025-12-04T10:23:15.1231381Z dynamo/test_misc.py::MiscTests::test_shape_env_equal_evaluate_expr_divisible SKIPPED [0.0012s] (only works when TV is True.) [ 69%] 2025-12-04T10:23:15.1231732Z dynamo/test_misc.py::MiscTests::test_shape_env_equal_evaluate_expr_refinement SKIPPED [0.0012s] (only works when TV is True.) [ 69%] 2025-12-04T10:23:15.1232090Z dynamo/test_misc.py::MiscTests::test_shape_env_equal_evaluate_expr_replacement SKIPPED [0.0013s] (only works when TV is True.) [ 69%] 2025-12-04T10:23:15.1232433Z dynamo/test_misc.py::MiscTests::test_shape_env_equal_runtime_assert SKIPPED [0.0011s] (only works when TV is True.) [ 69%] 2025-12-04T10:23:15.1232749Z dynamo/test_misc.py::MiscTests::test_shape_env_equal_unbacked SKIPPED [0.0011s] (only works when TV is True.) 
[ 70%] 2025-12-04T10:23:15.1233017Z dynamo/test_misc.py::MiscTests::test_shape_env_no_recording PASSED [0.0078s] [ 70%] 2025-12-04T10:23:15.1233264Z dynamo/test_misc.py::MiscTests::test_shape_env_recorded_function_fallback PASSED [0.0011s] [ 70%] 2025-12-04T10:23:15.1233508Z dynamo/test_misc.py::MiscTests::test_shape_int_comparisons PASSED [0.0161s] [ 70%] 2025-12-04T10:23:15.1233738Z dynamo/test_misc.py::MiscTests::test_shape_int_inplace_binops PASSED [0.0172s] [ 70%] 2025-12-04T10:23:15.1233963Z dynamo/test_misc.py::MiscTests::test_shape_type PASSED [0.0174s] [ 70%] 2025-12-04T10:23:15.1234181Z dynamo/test_misc.py::MiscTests::test_shape_unpack PASSED [0.0172s] [ 70%] 2025-12-04T10:23:15.1234420Z dynamo/test_misc.py::MiscTests::test_side_effects_codegen_update_mutated PASSED [0.0867s] [ 71%] 2025-12-04T10:23:15.1234658Z dynamo/test_misc.py::MiscTests::test_simple_set_usage PASSED [0.0147s] [ 71%] 2025-12-04T10:23:15.1234868Z dynamo/test_misc.py::MiscTests::test_size_dim PASSED [0.0282s] [ 71%] 2025-12-04T10:23:15.1235075Z dynamo/test_misc.py::MiscTests::test_size_input PASSED [0.0456s] [ 71%] 2025-12-04T10:23:15.1235282Z dynamo/test_misc.py::MiscTests::test_slice_input PASSED [0.0517s] [ 71%] 2025-12-04T10:23:15.1235504Z dynamo/test_misc.py::MiscTests::test_source_non_input_grad_access PASSED [0.0299s] [ 71%] 2025-12-04T10:23:15.1235770Z dynamo/test_misc.py::MiscTests::test_sourceless_namedtuple PASSED [0.0173s] [ 71%] 2025-12-04T10:23:15.1236011Z dynamo/test_misc.py::MiscTests::test_sparse_output_inductor_should_break PASSED [0.1593s] [ 72%] 2025-12-04T10:23:15.1236248Z dynamo/test_misc.py::MiscTests::test_storage_return PASSED [0.0247s] [ 72%] 2025-12-04T10:23:15.1236463Z dynamo/test_misc.py::MiscTests::test_str___iter__ PASSED [0.0166s] [ 72%] 2025-12-04T10:23:15.1236678Z dynamo/test_misc.py::MiscTests::test_str_format_assert1 PASSED [0.0186s] [ 72%] 2025-12-04T10:23:15.1236894Z dynamo/test_misc.py::MiscTests::test_str_format_assert2 PASSED [0.0538s] [ 72%] 
2025-12-04T10:23:15.1237109Z dynamo/test_misc.py::MiscTests::test_str_format_return1 PASSED [0.0171s] [ 72%] 2025-12-04T10:23:15.1237323Z dynamo/test_misc.py::MiscTests::test_str_format_return2 PASSED [0.0182s] [ 73%] 2025-12-04T10:23:15.1237536Z dynamo/test_misc.py::MiscTests::test_stride_dim PASSED [0.0268s] [ 73%] 2025-12-04T10:23:15.1237753Z dynamo/test_misc.py::MiscTests::test_structseq1 PASSED [0.0154s] [ 73%] 2025-12-04T10:23:15.1237965Z dynamo/test_misc.py::MiscTests::test_structseq2 PASSED [0.3583s] [ 73%] 2025-12-04T10:23:15.1238188Z dynamo/test_misc.py::MiscTests::test_super_after_graph_break PASSED [0.1040s] [ 73%] 2025-12-04T10:23:15.1238469Z dynamo/test_misc.py::MiscTests::test_super_calling_with_metaclass PASSED [0.0590s] [ 73%] 2025-12-04T10:23:15.1238697Z dynamo/test_misc.py::MiscTests::test_sym_and_terms PASSED [0.0359s] [ 73%] 2025-12-04T10:23:15.1238951Z dynamo/test_misc.py::MiscTests::test_sym_constrain_range_on_replaced_unbacked_symbol PASSED [0.3551s] [ 74%] 2025-12-04T10:23:15.1239252Z dynamo/test_misc.py::MiscTests::test_symint_as_device_kwarg_multi_gpu SKIPPED [0.0003s] (need multiple GPU) [ 74%] 2025-12-04T10:23:15.1239542Z dynamo/test_misc.py::MiscTests::test_symint_as_device_kwarg_non_strict_export PASSED [0.0150s] [ 74%] 2025-12-04T10:23:15.1239803Z dynamo/test_misc.py::MiscTests::test_symint_copy_into_unbacked_slice PASSED [0.7680s] [ 74%] 2025-12-04T10:23:15.1240060Z dynamo/test_misc.py::MiscTests::test_symint_fold_nontrivial_product_modulo PASSED [0.3669s] [ 74%] 2025-12-04T10:23:15.1240329Z dynamo/test_misc.py::MiscTests::test_sys_modules PASSED [0.1231s] [ 74%] 2025-12-04T10:23:15.1240576Z dynamo/test_misc.py::MiscTests::test_tagging_tensors_mix_used_unused_structure PASSED [0.0471s] [ 75%] 2025-12-04T10:23:15.1240823Z dynamo/test_misc.py::MiscTests::test_tagging_tensors_simple PASSED [0.0149s] [ 75%] 2025-12-04T10:23:15.1241039Z dynamo/test_misc.py::MiscTests::test_tensor__iter__ PASSED [0.0174s] [ 75%] 
2025-12-04T10:23:15.1241258Z dynamo/test_misc.py::MiscTests::test_tensor_build_list_unpack PASSED [0.2379s] [ 75%] 2025-12-04T10:23:15.1241488Z dynamo/test_misc.py::MiscTests::test_tensor_ctor_list_of_tensor PASSED [0.0202s] [ 75%] 2025-12-04T10:23:15.1241708Z dynamo/test_misc.py::MiscTests::test_tensor_data PASSED [0.0182s] [ 75%] 2025-12-04T10:23:15.1241919Z dynamo/test_misc.py::MiscTests::test_tensor_dict1 PASSED [0.0180s] [ 75%] 2025-12-04T10:23:15.1242127Z dynamo/test_misc.py::MiscTests::test_tensor_dict2 PASSED [0.0516s] [ 76%] 2025-12-04T10:23:15.1242333Z dynamo/test_misc.py::MiscTests::test_tensor_dict3 PASSED [0.0201s] [ 76%] 2025-12-04T10:23:15.1242562Z dynamo/test_misc.py::MiscTests::test_tensor_dot_grad_no_graph_break PASSED [0.0437s] [ 76%] 2025-12-04T10:23:15.1242794Z dynamo/test_misc.py::MiscTests::test_tensor_dynamic_method PASSED [0.3293s] [ 76%] 2025-12-04T10:23:15.1243007Z dynamo/test_misc.py::MiscTests::test_tensor_hasattr PASSED [0.6427s] [ 76%] 2025-12-04T10:23:15.1243240Z dynamo/test_misc.py::MiscTests::test_tensor_interacts_with_numpy_ndarray PASSED [0.0473s] [ 76%] 2025-12-04T10:23:15.1243478Z dynamo/test_misc.py::MiscTests::test_tensor_is_contiguous PASSED [0.0562s] [ 76%] 2025-12-04T10:23:15.1243692Z dynamo/test_misc.py::MiscTests::test_tensor_item_capture PASSED [0.0192s] [ 77%] 2025-12-04T10:23:15.1243942Z dynamo/test_misc.py::MiscTests::test_tensor_item_no_capture PASSED [0.0197s] [ 77%] 2025-12-04T10:23:15.1244158Z dynamo/test_misc.py::MiscTests::test_tensor_iter PASSED [0.0204s] [ 77%] 2025-12-04T10:23:15.1244367Z dynamo/test_misc.py::MiscTests::test_tensor_layout PASSED [0.0166s] [ 77%] 2025-12-04T10:23:15.1244599Z dynamo/test_misc.py::MiscTests::test_tensor_setattr_getset_descriptor PASSED [0.0289s] [ 77%] 2025-12-04T10:23:15.1244829Z dynamo/test_misc.py::MiscTests::test_tensor_types PASSED [0.1086s] [ 77%] 2025-12-04T10:23:15.1245041Z dynamo/test_misc.py::MiscTests::test_thread_local_setattr PASSED [0.3359s] [ 78%] 
2025-12-04T10:23:15.1245252Z dynamo/test_misc.py::MiscTests::test_tolist PASSED [0.6600s] [ 78%] 2025-12-04T10:23:15.1245457Z dynamo/test_misc.py::MiscTests::test_tolist_0d PASSED [0.0166s] [ 78%] 2025-12-04T10:23:15.1245662Z dynamo/test_misc.py::MiscTests::test_tolist_1d PASSED [0.0190s] [ 78%] 2025-12-04T10:23:15.1245870Z dynamo/test_misc.py::MiscTests::test_tolist_float PASSED [0.0171s] [ 78%] 2025-12-04T10:23:15.1246077Z dynamo/test_misc.py::MiscTests::test_tolist_kd PASSED [0.0368s] [ 78%] 2025-12-04T10:23:15.1246281Z dynamo/test_misc.py::MiscTests::test_tolist_kd_dynamic PASSED [0.0809s] [ 78%] 2025-12-04T10:23:15.1246523Z dynamo/test_misc.py::MiscTests::test_tolist_scalar PASSED [0.0179s] [ 79%] 2025-12-04T10:23:15.1246732Z dynamo/test_misc.py::MiscTests::test_top_package_import PASSED [0.0166s] [ 79%] 2025-12-04T10:23:15.1246939Z dynamo/test_misc.py::MiscTests::test_torch_check PASSED [0.0226s] [ 79%] 2025-12-04T10:23:15.1247155Z dynamo/test_misc.py::MiscTests::test_torch_check_nonnegative PASSED [0.0110s] [ 79%] 2025-12-04T10:23:15.1247392Z dynamo/test_misc.py::MiscTests::test_torch_check_symbolic_shape_rel PASSED [0.0196s] [ 79%] 2025-12-04T10:23:15.1247654Z dynamo/test_misc.py::MiscTests::test_torch_compile_ctx_on_forward_and_training_step PASSED [0.4997s] [ 79%] 2025-12-04T10:23:15.1247926Z dynamo/test_misc.py::MiscTests::test_torch_distributions_lazy_property PASSED [0.0285s] [ 79%] 2025-12-04T10:23:15.1248169Z dynamo/test_misc.py::MiscTests::test_torch_dtype_python_type PASSED [0.0185s] [ 80%] 2025-12-04T10:23:15.1248396Z dynamo/test_misc.py::MiscTests::test_torch_dynamo_codegen_pow PASSED [0.3218s] [ 80%] 2025-12-04T10:23:15.1248628Z dynamo/test_misc.py::MiscTests::test_torch_generator_set_state PASSED [0.0632s] [ 80%] 2025-12-04T10:23:15.1248879Z dynamo/test_misc.py::MiscTests::test_torch_guards_stack_frame_register_inlining PASSED [0.0224s] [ 80%] 2025-12-04T10:23:15.1249160Z 
dynamo/test_misc.py::MiscTests::test_torch_guards_stack_frame_register_inlining_deep PASSED [0.0215s] [ 80%] 2025-12-04T10:23:15.1249424Z dynamo/test_misc.py::MiscTests::test_torch_nn_parameter_isinstance PASSED [0.0330s] [ 80%] 2025-12-04T10:23:15.1249655Z dynamo/test_misc.py::MiscTests::test_torch_objects_as_keys PASSED [0.0156s] [ 81%] 2025-12-04T10:23:15.1249888Z dynamo/test_misc.py::MiscTests::test_torch_package_working_with_trace PASSED [0.0214s] [ 81%] 2025-12-04T10:23:15.1250160Z dynamo/test_misc.py::MiscTests::test_torch_seed PASSED [0.0242s] [ 81%] 2025-12-04T10:23:15.1250367Z dynamo/test_misc.py::MiscTests::test_torch_size PASSED [0.0165s] [ 81%] 2025-12-04T10:23:15.1250576Z dynamo/test_misc.py::MiscTests::test_torch_size_numel PASSED [0.0121s] [ 81%] 2025-12-04T10:23:15.1250791Z dynamo/test_misc.py::MiscTests::test_torch_size_numel_dynamic PASSED [0.0141s] [ 81%] 2025-12-04T10:23:15.1251014Z dynamo/test_misc.py::MiscTests::test_torch_variable_hasattr PASSED [0.0164s] [ 81%] 2025-12-04T10:23:15.1251230Z dynamo/test_misc.py::MiscTests::test_trace_ndarray_frame PASSED [0.0347s] [ 82%] 2025-12-04T10:23:15.1251443Z dynamo/test_misc.py::MiscTests::test_trace_ndarray_frame_2 PASSED [0.0286s] [ 82%] 2025-12-04T10:23:15.1251674Z dynamo/test_misc.py::MiscTests::test_tracing_nested_py_tree_mixed_all PASSED [0.0761s] [ 82%] 2025-12-04T10:23:15.1251901Z dynamo/test_misc.py::MiscTests::test_tuple_class PASSED [0.0189s] [ 82%] 2025-12-04T10:23:15.1252144Z dynamo/test_misc.py::MiscTests::test_tuple_from_tuple_iter PASSED [0.0607s] [ 82%] 2025-12-04T10:23:15.1252354Z dynamo/test_misc.py::MiscTests::test_tuple_hasattr PASSED [0.0179s] [ 82%] 2025-12-04T10:23:15.1252565Z dynamo/test_misc.py::MiscTests::test_tuple_iadd_with_shape PASSED [0.0173s] [ 82%] 2025-12-04T10:23:15.1252776Z dynamo/test_misc.py::MiscTests::test_tuple_mul PASSED [0.0060s] [ 83%] 2025-12-04T10:23:15.1252988Z dynamo/test_misc.py::MiscTests::test_tuple_mul_with_shape PASSED [0.0144s] [ 83%] 
2025-12-04T10:23:15.1253204Z dynamo/test_misc.py::MiscTests::test_tying_union_new_syntax PASSED [0.0165s] [ 83%] 2025-12-04T10:23:15.1253416Z dynamo/test_misc.py::MiscTests::test_type_copy PASSED [0.0330s] [ 83%] 2025-12-04T10:23:15.1253620Z dynamo/test_misc.py::MiscTests::test_typing_dict PASSED [0.0158s] [ 83%] 2025-12-04T10:23:15.1253827Z dynamo/test_misc.py::MiscTests::test_typing_typevar PASSED [0.0172s] [ 83%] 2025-12-04T10:23:15.1254049Z dynamo/test_misc.py::MiscTests::test_typing_union_and_optional PASSED [0.0174s] [ 84%] 2025-12-04T10:23:15.1254291Z dynamo/test_misc.py::MiscTests::test_typing_union_new_syntax_reconstruct XFAIL [0.0088s] [ 84%] 2025-12-04T10:23:15.1254532Z dynamo/test_misc.py::MiscTests::test_typing_variable_isinstance PASSED [0.0148s] [ 84%] 2025-12-04T10:23:15.1254789Z dynamo/test_misc.py::MiscTests::test_unbacked_2d_expand PASSED [0.7758s] [ 84%] 2025-12-04T10:23:15.1255000Z dynamo/test_misc.py::MiscTests::test_unbacked_empty_tensor PASSED [0.0237s] [ 84%] 2025-12-04T10:23:15.1255214Z dynamo/test_misc.py::MiscTests::test_unbacked_repeat_cat PASSED [0.0368s] [ 84%] 2025-12-04T10:23:15.1255429Z dynamo/test_misc.py::MiscTests::test_unbacked_sources_scalar PASSED [0.0155s] [ 84%] 2025-12-04T10:23:15.1255652Z dynamo/test_misc.py::MiscTests::test_unbacked_sources_tensor PASSED [0.0200s] [ 85%] 2025-12-04T10:23:15.1255870Z dynamo/test_misc.py::MiscTests::test_unbacked_strict_mode PASSED [0.0120s] [ 85%] 2025-12-04T10:23:15.1256086Z dynamo/test_misc.py::MiscTests::test_unbacked_symint_split PASSED [0.0339s] [ 85%] 2025-12-04T10:23:15.1256315Z dynamo/test_misc.py::MiscTests::test_unhandled_exception_in_dynamo PASSED [0.0080s] [ 85%] 2025-12-04T10:23:15.1256555Z dynamo/test_misc.py::MiscTests::test_unhandled_exception_in_dynamo2 PASSED [0.0398s] [ 85%] 2025-12-04T10:23:15.1256784Z dynamo/test_misc.py::MiscTests::test_unique_consecutive PASSED [0.0198s] [ 85%] 2025-12-04T10:23:15.1256989Z dynamo/test_misc.py::MiscTests::test_unpack4 PASSED 
[0.0203s] [ 85%] 2025-12-04T10:23:15.1257192Z dynamo/test_misc.py::MiscTests::test_unpack5 PASSED [0.0187s] [ 86%] 2025-12-04T10:23:15.1257410Z dynamo/test_misc.py::MiscTests::test_unpack_tensor_shape_mismatch PASSED [0.0280s] [ 86%] 2025-12-04T10:23:15.1257661Z dynamo/test_misc.py::MiscTests::test_update_locals_and_stack_uses_shared_cache PASSED [0.0087s] [ 86%] 2025-12-04T10:23:15.1257910Z dynamo/test_misc.py::MiscTests::test_user_code_statically_known PASSED [0.0402s] [ 86%] 2025-12-04T10:23:15.1258131Z dynamo/test_misc.py::MiscTests::test_user_defined_binop PASSED [0.0176s] [ 86%] 2025-12-04T10:23:15.1258346Z dynamo/test_misc.py::MiscTests::test_user_defined_class_name PASSED [0.0186s] [ 86%] 2025-12-04T10:23:15.1258577Z dynamo/test_misc.py::MiscTests::test_user_defined_class_python_type PASSED [0.0306s] [ 87%] 2025-12-04T10:23:15.1258804Z dynamo/test_misc.py::MiscTests::test_user_defined_iter PASSED [0.0369s] [ 87%] 2025-12-04T10:23:15.1259039Z dynamo/test_misc.py::MiscTests::test_user_defined_object_class_interaction PASSED [0.0179s] [ 87%] 2025-12-04T10:23:15.1259278Z dynamo/test_misc.py::MiscTests::test_user_defined_setattr1 PASSED [0.0192s] [ 87%] 2025-12-04T10:23:15.1259493Z dynamo/test_misc.py::MiscTests::test_user_defined_setattr2 PASSED [0.0196s] [ 87%] 2025-12-04T10:23:15.1259740Z dynamo/test_misc.py::MiscTests::test_user_function_variable_supports_enum_argument PASSED [0.0157s] [ 87%] 2025-12-04T10:23:15.1260061Z dynamo/test_misc.py::MiscTests::test_user_function_variable_supports_function_argument PASSED [0.0181s] [ 87%] 2025-12-04T10:23:15.1260414Z dynamo/test_misc.py::MiscTests::test_user_function_variable_supports_type_abcmeta_argument PASSED [0.0167s] [ 88%] 2025-12-04T10:23:15.1260670Z dynamo/test_misc.py::MiscTests::test_user_getattr1 PASSED [0.0174s] [ 88%] 2025-12-04T10:23:15.1260877Z dynamo/test_misc.py::MiscTests::test_user_getattr2 PASSED [0.0180s] [ 88%] 2025-12-04T10:23:15.1261083Z dynamo/test_misc.py::MiscTests::test_user_getattribute 
PASSED [0.0205s] [ 88%] 2025-12-04T10:23:15.1261291Z dynamo/test_misc.py::MiscTests::test_user_property PASSED [0.0173s] [ 88%] 2025-12-04T10:23:15.1261501Z dynamo/test_misc.py::MiscTests::test_usr_cls_classmethod PASSED [0.0181s] [ 88%] 2025-12-04T10:23:15.1261716Z dynamo/test_misc.py::MiscTests::test_usr_cls_staticmethod PASSED [0.0176s] [ 89%] 2025-12-04T10:23:15.1261938Z dynamo/test_misc.py::MiscTests::test_validate_outputs_unbacked XFAIL [0.0379s] [ 89%] 2025-12-04T10:23:15.1262187Z dynamo/test_misc.py::MiscTests::test_validate_outputs_unbacked_by_custom_op PASSED [0.0642s] [ 89%] 2025-12-04T10:23:15.1262436Z dynamo/test_misc.py::MiscTests::test_variable_access_in_exception PASSED [0.0153s] [ 89%] 2025-12-04T10:23:15.1262684Z dynamo/test_misc.py::MiscTests::test_variable_tracker_recursively_contains PASSED [0.0164s] [ 89%] 2025-12-04T10:23:15.1262948Z dynamo/test_misc.py::MiscTests::test_version_ci PASSED [0.0017s] [ 89%] 2025-12-04T10:23:15.1263153Z dynamo/test_misc.py::MiscTests::test_with_builtin_type PASSED [0.0174s] [ 89%] 2025-12-04T10:23:15.1263384Z dynamo/test_misc.py::MiscTests::test_write_to_cells_with_name_shadowing PASSED [0.3182s] [ 90%] 2025-12-04T10:23:15.1263627Z dynamo/test_misc.py::MiscTests::test_write_to_closures_in_inlining PASSED [0.0178s] [ 90%] 2025-12-04T10:23:15.1263863Z dynamo/test_misc.py::MiscTests::test_writes_to_cells_across_frames1 PASSED [0.0103s] [ 90%] 2025-12-04T10:23:15.1264100Z dynamo/test_misc.py::MiscTests::test_writes_to_cells_across_frames2 PASSED [0.0156s] [ 90%] 2025-12-04T10:23:15.1264324Z dynamo/test_misc.py::MiscTests::test_yield_from PASSED [0.0210s] [ 90%] 2025-12-04T10:23:15.1264535Z dynamo/test_misc.py::MiscTests::test_yield_from_in_a_loop PASSED [0.0198s] [ 90%] 2025-12-04T10:23:15.1264760Z dynamo/test_misc.py::MiscTests::test_yield_from_user_stop_iteration PASSED [0.0143s] [ 90%] 2025-12-04T10:23:15.1264988Z dynamo/test_misc.py::MiscTests::test_yield_gen_and_from PASSED [0.0204s] [ 91%] 
2025-12-04T10:23:15.1265221Z dynamo/test_misc.py::MiscTests::test_yield_send_to_subgenerator_graph_break PASSED [0.0023s] [ 91%] 2025-12-04T10:23:15.1265497Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_register_constant_with_side_effect PASSED [0.0167s] [ 91%] 2025-12-04T10:23:15.1265776Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_flatten_unflatten_cxx PASSED [0.5554s] [ 91%] 2025-12-04T10:23:15.1266057Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_flatten_unflatten_native_optree PASSED [0.4777s] [ 91%] 2025-12-04T10:23:15.1266343Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_flatten_unflatten_python PASSED [0.4137s] [ 91%] 2025-12-04T10:23:15.1266602Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_leaves_cxx PASSED [0.3754s] [ 92%] 2025-12-04T10:23:15.1266854Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_leaves_native_optree PASSED [0.3603s] [ 92%] 2025-12-04T10:23:15.1267112Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_leaves_python PASSED [0.3591s] [ 92%] 2025-12-04T10:23:15.1267351Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_cxx PASSED [0.9115s] [ 92%] 2025-12-04T10:23:15.1267597Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_dict_order_cxx PASSED [0.0458s] [ 92%] 2025-12-04T10:23:15.1267867Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_dict_order_native_optree PASSED [0.0464s] [ 92%] 2025-12-04T10:23:15.1268142Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_dict_order_python PASSED [0.0453s] [ 92%] 2025-12-04T10:23:15.1268451Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_native_optree PASSED [0.4384s] [ 93%] 2025-12-04T10:23:15.1268703Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_only_cxx PASSED [0.0246s] [ 93%] 2025-12-04T10:23:15.1268960Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_only_native_optree PASSED [0.0017s] [ 93%] 2025-12-04T10:23:15.1269222Z 
dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_only_python PASSED [0.0237s] [ 93%] 2025-12-04T10:23:15.1269469Z dynamo/test_misc.py::MiscTestsPyTree::test_pytree_tree_map_python PASSED [0.3916s] [ 93%] 2025-12-04T10:23:15.1269710Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_dicts_cxx PASSED [0.1230s] [ 93%] 2025-12-04T10:23:15.1269966Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_dicts_native_optree PASSED [0.0765s] [ 93%] 2025-12-04T10:23:15.1270305Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_dicts_python PASSED [0.0497s] [ 94%] 2025-12-04T10:23:15.1270562Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_mixed_all_cxx PASSED [0.1830s] [ 94%] 2025-12-04T10:23:15.1270834Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_mixed_all_native_optree PASSED [0.1303s] [ 94%] 2025-12-04T10:23:15.1271109Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_mixed_all_python PASSED [0.0729s] [ 94%] 2025-12-04T10:23:15.1271404Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_pytree_cxx PASSED [0.1322s] [ 94%] 2025-12-04T10:23:15.1271667Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_pytree_native_optree PASSED [0.0849s] [ 94%] 2025-12-04T10:23:15.1271933Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_pytree_python PASSED [0.0509s] [ 95%] 2025-12-04T10:23:15.1272197Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tensor_subclass_cxx PASSED [0.0302s] [ 95%] 2025-12-04T10:23:15.1272481Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tensor_subclass_native_optree PASSED [0.0277s] [ 95%] 2025-12-04T10:23:15.1272772Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tensor_subclass_python PASSED [0.0230s] [ 95%] 2025-12-04T10:23:15.1273039Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tuples_cxx PASSED [0.1307s] [ 95%] 2025-12-04T10:23:15.1273300Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tuples_native_optree PASSED 
[0.0858s] [ 95%] 2025-12-04T10:23:15.1273568Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_nested_tuples_python PASSED [0.0541s] [ 95%] 2025-12-04T10:23:15.1273811Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_pytree_cxx PASSED [0.6589s] [ 96%] 2025-12-04T10:23:15.1274056Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_pytree_native_optree PASSED [0.0443s] [ 96%] 2025-12-04T10:23:15.1274304Z dynamo/test_misc.py::MiscTestsPyTree::test_tracing_pytree_python PASSED [0.0331s] [ 96%] 2025-12-04T10:23:15.1274529Z dynamo/test_misc.py::TestTracer::test_jit_save PASSED [0.0808s] [ 96%] 2025-12-04T10:23:15.1274797Z dynamo/test_misc.py::TestCustomFunction::test_autograd_function_with_matmul_folding_at_output PASSED [0.7905s] [ 96%] 2025-12-04T10:23:15.1275070Z dynamo/test_misc.py::TestCustomFunction::test_retain_grad PASSED [0.0060s] [ 96%] 2025-12-04T10:23:15.1275316Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_dynamic_fill_diagonal__cuda PASSED [0.3946s] [ 96%] 2025-12-04T10:23:15.1275604Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_dynamic_float_scalar_tensor_coersion_cuda PASSED [0.4090s] [ 97%] 2025-12-04T10:23:15.1275918Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_full_graph_capture_dynamic_output_shape_ops_cuda PASSED [0.4756s] [ 97%] 2025-12-04T10:23:15.1276229Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_full_graph_capture_scalar_outputs_cuda PASSED [0.3646s] [ 97%] 2025-12-04T10:23:15.1276495Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_get_device_cuda PASSED [0.0357s] [ 97%] 2025-12-04T10:23:15.1276763Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_gpu_set_device_cuda SKIPPED [0.0002s] (need multiple GPU) [ 97%] 2025-12-04T10:23:15.1277117Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda ('RERUN', {'yellow': True}) [0.6821s] [ 97%] 2025-12-04T10:23:15.1277471Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda ('RERUN', {'yellow': True}) [1.0222s] [ 97%] 
2025-12-04T10:23:15.1277801Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda FAILED [1.2948s] [ 97%] 2025-12-04T10:23:15.1277971Z 2025-12-04T10:23:15.1278030Z ==================================== RERUNS ==================================== 2025-12-04T10:23:15.1278219Z _______ MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda _______ 2025-12-04T10:23:15.1278403Z Traceback (most recent call last): 2025-12-04T10:23:15.1278653Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1278890Z method(*args, **kwargs) 2025-12-04T10:23:15.1279119Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1279348Z method(*args, **kwargs) 2025-12-04T10:23:15.1279567Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T10:23:15.1279823Z with policy(): 2025-12-04T10:23:15.1280036Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T10:23:15.1280309Z raise RuntimeError(msg) 2025-12-04T10:23:15.1280752Z RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! Caching allocator allocated memory was 0 and is now reported as 13312 on device 0. CUDA driver allocated memory was 899678208 and is now 916455424. 
2025-12-04T10:23:15.1281135Z 2025-12-04T10:23:15.1281211Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1281521Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1281756Z 2025-12-04T10:23:15.1281845Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1282049Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1282224Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1282414Z _______ MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda _______ 2025-12-04T10:23:15.1282593Z Traceback (most recent call last): 2025-12-04T10:23:15.1282830Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1283062Z method(*args, **kwargs) 2025-12-04T10:23:15.1283279Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1283506Z method(*args, **kwargs) 2025-12-04T10:23:15.1283726Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T10:23:15.1283949Z with policy(): 2025-12-04T10:23:15.1284161Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T10:23:15.1284389Z raise RuntimeError(msg) 2025-12-04T10:23:15.1284797Z RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! Caching allocator allocated memory was 512 and is now reported as 13824 on device 0. CUDA driver allocated memory was 916455424 and is now 920649728. 
2025-12-04T10:23:15.1285172Z 2025-12-04T10:23:15.1285245Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1285549Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1285818Z 2025-12-04T10:23:15.1285909Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1286106Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1286280Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1286446Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1286613Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1286757Z =================================== FAILURES =================================== 2025-12-04T10:23:15.1286943Z _______ MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda _______ 2025-12-04T10:23:15.1287121Z Traceback (most recent call last): 2025-12-04T10:23:15.1287353Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1287581Z method(*args, **kwargs) 2025-12-04T10:23:15.1287803Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1288030Z method(*args, **kwargs) 2025-12-04T10:23:15.1288284Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T10:23:15.1288507Z with policy(): 2025-12-04T10:23:15.1288717Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T10:23:15.1288947Z raise RuntimeError(msg) 2025-12-04T10:23:15.1289356Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! Caching allocator allocated memory was 1024 and is now reported as 14336 on device 0. CUDA driver allocated memory was 920649728 and is now 924844032. 2025-12-04T10:23:15.1289733Z 2025-12-04T10:23:15.1289812Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1290159Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1290393Z 2025-12-04T10:23:15.1290481Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1290677Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1290847Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1291011Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1291178Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1291342Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1291509Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1291791Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/dynamo.test_misc/dynamo.test_misc-f29e33abba5bdd90.xml - 2025-12-04T10:23:15.1292070Z =========================== short test summary info ============================ 2025-12-04T10:23:15.1292651Z FAILED [1.2948s] dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! Caching allocator allocated memory was 1024 and is now reported as 14336 on device 0. CUDA driver allocated memory was 920649728 and is now 924844032. 
2025-12-04T10:23:15.1293161Z 2025-12-04T10:23:15.1293236Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1293536Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1293766Z 2025-12-04T10:23:15.1293883Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1294068Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:23:15.1294249Z === 1 failed, 633 passed, 12 skipped, 4 xfailed, 2 rerun in 86.26s (0:01:26) === 2025-12-04T10:23:15.1294407Z Got exit code 1 2025-12-04T10:23:15.1294504Z Retrying single test... 2025-12-04T10:23:15.1294712Z Test results will be stored in test-reports/python-pytest/dynamo.test_misc/dynamo.test_misc-cab74a09f842642a.xml 2025-12-04T10:23:15.1294945Z ============================= test session starts ============================== 2025-12-04T10:23:15.1295156Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:23:15.1295347Z cachedir: .pytest_cache 2025-12-04T10:23:15.1295571Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:23:15.1295807Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:23:15.1295929Z configfile: pytest.ini 2025-12-04T10:23:15.1296155Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:23:15.1296427Z collecting ... collected 664 items / 663 deselected / 1 selected 2025-12-04T10:23:15.1296759Z stepcurrent: skipping 649 already run items. 
Running only test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1297025Z Running 1 items in this shard 2025-12-04T10:23:15.1297098Z 2025-12-04T10:23:15.1297262Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda ('RERUN', {'yellow': True}) [0.5436s] [100%] 2025-12-04T10:23:15.1297617Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda ('RERUN', {'yellow': True}) [0.5031s] [100%] 2025-12-04T10:23:15.1297947Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda FAILED [0.4851s] [100%] 2025-12-04T10:23:15.1298120Z 2025-12-04T10:23:15.1298174Z ==================================== RERUNS ==================================== 2025-12-04T10:23:15.1298359Z _______ MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda _______ 2025-12-04T10:23:15.1298541Z Traceback (most recent call last): 2025-12-04T10:23:15.1298777Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1299009Z method(*args, **kwargs) 2025-12-04T10:23:15.1299229Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1299456Z method(*args, **kwargs) 2025-12-04T10:23:15.1299674Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T10:23:15.1299897Z with policy(): 2025-12-04T10:23:15.1300147Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T10:23:15.1300377Z raise RuntimeError(msg) 2025-12-04T10:23:15.1300778Z RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! 
Caching allocator allocated memory was 0 and is now reported as 13312 on device 0. CUDA driver allocated memory was 807403520 and is now 853540864. 2025-12-04T10:23:15.1301148Z 2025-12-04T10:23:15.1301220Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1301521Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1301752Z 2025-12-04T10:23:15.1301840Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1302037Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1302249Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1302436Z _______ MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda _______ 2025-12-04T10:23:15.1302615Z Traceback (most recent call last): 2025-12-04T10:23:15.1302845Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1303078Z method(*args, **kwargs) 2025-12-04T10:23:15.1303301Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1303526Z method(*args, **kwargs) 2025-12-04T10:23:15.1303739Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T10:23:15.1303960Z with policy(): 2025-12-04T10:23:15.1304170Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T10:23:15.1304403Z raise RuntimeError(msg) 2025-12-04T10:23:15.1304804Z RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! 
Caching allocator allocated memory was 512 and is now reported as 13824 on device 0. CUDA driver allocated memory was 853540864 and is now 857735168. 2025-12-04T10:23:15.1305207Z 2025-12-04T10:23:15.1305282Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1305582Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1305810Z 2025-12-04T10:23:15.1305899Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1306092Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1306261Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1306428Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1306595Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1306738Z =================================== FAILURES =================================== 2025-12-04T10:23:15.1306925Z _______ MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda _______ 2025-12-04T10:23:15.1307100Z Traceback (most recent call last): 2025-12-04T10:23:15.1307329Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1307555Z method(*args, **kwargs) 2025-12-04T10:23:15.1307772Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1308000Z method(*args, **kwargs) 2025-12-04T10:23:15.1308215Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T10:23:15.1308437Z with policy(): 2025-12-04T10:23:15.1308647Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
2705, in __exit__ 2025-12-04T10:23:15.1308876Z raise RuntimeError(msg) 2025-12-04T10:23:15.1309282Z RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! Caching allocator allocated memory was 1024 and is now reported as 14336 on device 0. CUDA driver allocated memory was 857735168 and is now 861929472. 2025-12-04T10:23:15.1309654Z 2025-12-04T10:23:15.1309729Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1310030Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1310297Z 2025-12-04T10:23:15.1310385Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1310614Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1310784Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1310948Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1311117Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1311281Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1311447Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1311721Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/dynamo.test_misc/dynamo.test_misc-cab74a09f842642a.xml - 2025-12-04T10:23:15.1311997Z =========================== short test summary info ============================ 2025-12-04T10:23:15.1312579Z FAILED [0.4851s] dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! 
Caching allocator allocated memory was 1024 and is now reported as 14336 on device 0. CUDA driver allocated memory was 857735168 and is now 861929472. 2025-12-04T10:23:15.1313090Z 2025-12-04T10:23:15.1313195Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1313497Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1313727Z 2025-12-04T10:23:15.1313814Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1313998Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:23:15.1314167Z ================== 1 failed, 663 deselected, 2 rerun in 1.74s ================== 2025-12-04T10:23:15.1314314Z Got exit code 1 2025-12-04T10:23:15.1314414Z Retrying single test... 2025-12-04T10:23:15.1314620Z Test results will be stored in test-reports/python-pytest/dynamo.test_misc/dynamo.test_misc-0845acdcac3548d0.xml 2025-12-04T10:23:15.1314852Z ============================= test session starts ============================== 2025-12-04T10:23:15.1315059Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:23:15.1315254Z cachedir: .pytest_cache 2025-12-04T10:23:15.1315481Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:23:15.1315725Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:23:15.1315851Z configfile: pytest.ini 2025-12-04T10:23:15.1316081Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:23:15.1316359Z collecting ... collected 664 items / 663 deselected / 1 selected 2025-12-04T10:23:15.1316666Z stepcurrent: skipping 649 already run items. 
Running only test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1316938Z Running 1 items in this shard 2025-12-04T10:23:15.1317016Z 2025-12-04T10:23:15.1317181Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda ('RERUN', {'yellow': True}) [0.5339s] [100%] 2025-12-04T10:23:15.1317545Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda ('RERUN', {'yellow': True}) [0.4634s] [100%] 2025-12-04T10:23:15.1317882Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda FAILED [0.4620s] [100%] 2025-12-04T10:23:15.1318060Z 2025-12-04T10:23:15.1318115Z ==================================== RERUNS ==================================== 2025-12-04T10:23:15.1318307Z _______ MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda _______ 2025-12-04T10:23:15.1322462Z Traceback (most recent call last): 2025-12-04T10:23:15.1322777Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1323018Z method(*args, **kwargs) 2025-12-04T10:23:15.1323242Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1323474Z method(*args, **kwargs) 2025-12-04T10:23:15.1323692Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T10:23:15.1323918Z with policy(): 2025-12-04T10:23:15.1324130Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T10:23:15.1324358Z raise RuntimeError(msg) 2025-12-04T10:23:15.1324762Z RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! 
Caching allocator allocated memory was 0 and is now reported as 13312 on device 0. CUDA driver allocated memory was 807403520 and is now 853540864. 2025-12-04T10:23:15.1325141Z 2025-12-04T10:23:15.1325216Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1325521Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1325795Z 2025-12-04T10:23:15.1325884Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1326086Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1326258Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1326447Z _______ MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda _______ 2025-12-04T10:23:15.1326625Z Traceback (most recent call last): 2025-12-04T10:23:15.1326859Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1327089Z method(*args, **kwargs) 2025-12-04T10:23:15.1327310Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1327538Z method(*args, **kwargs) 2025-12-04T10:23:15.1327755Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T10:23:15.1327975Z with policy(): 2025-12-04T10:23:15.1328185Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T10:23:15.1328412Z raise RuntimeError(msg) 2025-12-04T10:23:15.1328818Z RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! 
Caching allocator allocated memory was 512 and is now reported as 13824 on device 0. CUDA driver allocated memory was 853540864 and is now 857735168. 2025-12-04T10:23:15.1329188Z 2025-12-04T10:23:15.1329265Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1329568Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1329804Z 2025-12-04T10:23:15.1329892Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1330128Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1330299Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1330465Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1330632Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1330778Z =================================== FAILURES =================================== 2025-12-04T10:23:15.1330964Z _______ MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda _______ 2025-12-04T10:23:15.1331140Z Traceback (most recent call last): 2025-12-04T10:23:15.1331408Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1331641Z method(*args, **kwargs) 2025-12-04T10:23:15.1331858Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T10:23:15.1332087Z method(*args, **kwargs) 2025-12-04T10:23:15.1332301Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T10:23:15.1332524Z with policy(): 2025-12-04T10:23:15.1332732Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
2705, in __exit__ 2025-12-04T10:23:15.1332960Z raise RuntimeError(msg) 2025-12-04T10:23:15.1333374Z RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! Caching allocator allocated memory was 1024 and is now reported as 14336 on device 0. CUDA driver allocated memory was 857735168 and is now 861929472. 2025-12-04T10:23:15.1333744Z 2025-12-04T10:23:15.1333820Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1334157Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1334386Z 2025-12-04T10:23:15.1334475Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1334802Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1334971Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1335138Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1335306Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1335476Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:23:15.1335642Z stats [('calls_captured', 10), ('unique_graphs', 1)] 2025-12-04T10:23:15.1335921Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/dynamo.test_misc/dynamo.test_misc-0845acdcac3548d0.xml - 2025-12-04T10:23:15.1336205Z =========================== short test summary info ============================ 2025-12-04T10:23:15.1336788Z FAILED [0.4620s] dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda! 
Caching allocator allocated memory was 1024 and is now reported as 14336 on device 0. CUDA driver allocated memory was 857735168 and is now 861929472. 2025-12-04T10:23:15.1337298Z 2025-12-04T10:23:15.1337371Z To execute this test, run the following from the base repo dir: 2025-12-04T10:23:15.1337671Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_misc.py MiscTestsDeviceCUDA.test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1337900Z 2025-12-04T10:23:15.1337986Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:23:15.1338173Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:23:15.1338339Z ================== 1 failed, 663 deselected, 2 rerun in 1.66s ================== 2025-12-04T10:23:15.1338480Z Got exit code 1 2025-12-04T10:23:15.1338678Z FAILED CONSISTENTLY: test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda 2025-12-04T10:23:15.1338983Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T10:23:15.1339285Z Test results will be stored in test-reports/python-pytest/dynamo.test_misc/dynamo.test_misc-a5467beb7e9dded1.xml 2025-12-04T10:23:15.1339520Z ============================= test session starts ============================== 2025-12-04T10:23:15.1339766Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:23:15.1339956Z cachedir: .pytest_cache 2025-12-04T10:23:15.1340218Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:23:15.1340457Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:23:15.1340577Z configfile: pytest.ini 2025-12-04T10:23:15.1340803Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, 
typeguard-4.3.0 2025-12-04T10:23:15.1341079Z collecting ... collected 664 items / 650 deselected / 14 selected 2025-12-04T10:23:15.1341246Z stepcurrent: skipping 650 already run items. 2025-12-04T10:23:15.1341378Z Running 14 items in this shard 2025-12-04T10:23:15.1341448Z 2025-12-04T10:23:15.1341566Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_legacy_cuda_tensor_cuda PASSED [0.2891s] [ 7%] 2025-12-04T10:23:15.1341829Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_parsing_sdpa_cuda PASSED [1.2832s] [ 14%] 2025-12-04T10:23:15.1342063Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_rand_cuda PASSED [0.0515s] [ 21%] 2025-12-04T10:23:15.1342343Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_randint_no_graphbreak_cuda PASSED [0.0292s] [ 28%] 2025-12-04T10:23:15.1342615Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_scalar_isin_decomposition_cuda PASSED [4.5259s] [ 35%] 2025-12-04T10:23:15.1342888Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_symint_as_device_kwarg_cuda PASSED [0.0385s] [ 42%] 2025-12-04T10:23:15.1343170Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_torch_cudnn_is_acceptable_bad_inputs_cuda PASSED [0.0120s] [ 50%] 2025-12-04T10:23:15.1343457Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_torch_cudnn_is_acceptable_cuda PASSED [0.0163s] [ 57%] 2025-12-04T10:23:15.1343729Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_torch_device_is_available_cuda PASSED [0.0183s] [ 64%] 2025-12-04T10:23:15.1344004Z dynamo/test_misc.py::MiscTestsDeviceCUDA::test_torch_device_python_type_cuda PASSED [0.0386s] [ 71%] 2025-12-04T10:23:15.1344419Z dynamo/test_misc.py::DynamoOpPromotionTests::test_symbool_guard_or_false W1204 10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] Graph break from `Tensor.item()`, consider setting: 2025-12-04T10:23:15.1344883Z W1204 10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] torch._dynamo.config.capture_scalar_outputs = True 2025-12-04T10:23:15.1345184Z W1204 
10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] or: 2025-12-04T10:23:15.1345472Z W1204 10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] env TORCHDYNAMO_CAPTURE_SCALAR_OUTPUTS=1 2025-12-04T10:23:15.1345820Z W1204 10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] to include these operations in the captured graph. 2025-12-04T10:23:15.1346113Z W1204 10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] 2025-12-04T10:23:15.1346386Z W1204 10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] Graph break: from user code at: 2025-12-04T10:23:15.1346779Z W1204 10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] File "/var/lib/jenkins/pytorch/test/dynamo/test_misc.py", line 14094, in symbool_guard_fn 2025-12-04T10:23:15.1347166Z W1204 10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] u0 = a_bool_tensor.item() 2025-12-04T10:23:15.1347435Z W1204 10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] 2025-12-04T10:23:15.1347661Z W1204 10:23:11.334000 431162 site-packages/torch/_dynamo/variables/tensor.py:1073] [0/0] 2025-12-04T10:23:15.1347831Z PASSED [0.0547s] [ 78%] 2025-12-04T10:23:15.1348004Z dynamo/test_misc.py::DynamoOpPromotionTests::test_symbool_tensor_mul PASSED [0.2325s] [ 85%] 2025-12-04T10:23:15.1348528Z dynamo/test_misc.py::DynamoOpPromotionTests::test_symbool_tensor_mul_does_not_fail W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] failed to eagerly compile backwards for dynamic, suppressing in case backwards not needed 2025-12-04T10:23:15.1349064Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] Traceback (most recent call last): 2025-12-04T10:23:15.1349566Z W1204 10:23:11.772000 431162 
site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/graph_compile.py", line 1912, in _aot_stage2b_bw_compile 2025-12-04T10:23:15.1350073Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] compiled_bw_func = aot_config.bw_compiler( 2025-12-04T10:23:15.1350596Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T10:23:15.1351074Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] return self.compiler_fn(gm, example_inputs) 2025-12-04T10:23:15.1351663Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/backends/common.py", line 83, in _wrapped_bw_compiler 2025-12-04T10:23:15.1352108Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] disable( 2025-12-04T10:23:15.1352521Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 1154, in _fn 2025-12-04T10:23:15.1352956Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] return fn(*args, **kwargs) 2025-12-04T10:23:15.1353404Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_utils_internal.py", line 97, in wrapper_function 2025-12-04T10:23:15.1353859Z W1204 10:23:11.772000 431162 
site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] return function(*args, **kwargs) 2025-12-04T10:23:15.1354311Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2707, in bw_compiler 2025-12-04T10:23:15.1354764Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] return compile_fx_backward( 2025-12-04T10:23:15.1355224Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2387, in compile_fx_backward 2025-12-04T10:23:15.1355679Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] return inner_compile( 2025-12-04T10:23:15.1356127Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T10:23:15.1356634Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T10:23:15.1357167Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T10:23:15.1357644Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T10:23:15.1358128Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1019, in _compile_fx_inner 2025-12-04T10:23:15.1358611Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] raise InductorError(e, currentframe()).with_traceback( 2025-12-04T10:23:15.1359096Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1003, in _compile_fx_inner 2025-12-04T10:23:15.1359569Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T10:23:15.1360055Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T10:23:15.1360655Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T10:23:15.1361180Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1452, in codegen_and_compile 2025-12-04T10:23:15.1361641Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] graph.run(*example_inputs) 2025-12-04T10:23:15.1362072Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/graph.py", line 987, in run 2025-12-04T10:23:15.1362501Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] 
return super().run(*args) 2025-12-04T10:23:15.1362930Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/interpreter.py", line 200, in run 2025-12-04T10:23:15.1363428Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] self.env[node] = self.run_node(node) 2025-12-04T10:23:15.1363874Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/graph.py", line 1726, in run_node 2025-12-04T10:23:15.1364314Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] result = super().run_node(n) 2025-12-04T10:23:15.1364754Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/interpreter.py", line 295, in run_node 2025-12-04T10:23:15.1365218Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] return getattr(self, n.op)(n.target, args, kwargs) 2025-12-04T10:23:15.1365674Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/graph.py", line 1503, in output 2025-12-04T10:23:15.1366099Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] assert isinstance( 2025-12-04T10:23:15.1366613Z W1204 10:23:11.772000 431162 site-packages/torch/_functorch/_aot_autograd/graph_compile.py:1932] [0/0] torch._inductor.exc.InductorError: AssertionError: Unsupported inductor graph input type: 2025-12-04T10:23:15.1366986Z PASSED [0.1740s] [ 92%] 2025-12-04T10:23:15.1367176Z 
dynamo/test_misc.py::DynamoOpPromotionTests::test_tensorify_track_item_symint PASSED [1.1217s] [100%] 2025-12-04T10:23:15.1367336Z 2025-12-04T10:23:15.1367522Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/dynamo.test_misc/dynamo.test_misc-a5467beb7e9dded1.xml - 2025-12-04T10:23:15.1367807Z ====================== 14 passed, 650 deselected in 8.18s ====================== 2025-12-04T10:23:15.1368083Z The following tests failed consistently: ['test/dynamo/test_misc.py::MiscTestsDeviceCUDA::test_interpolate_propagate_real_tensors_cuda'] 2025-12-04T10:23:15.1368288Z 2025-12-04T10:23:15.1368425Z FINISHED PRINTING LOG FILE of dynamo/test_misc 1/1 (test/test-reports/dynamo.test_misc_1.1_37cc06447c4694c0_.log) 2025-12-04T10:23:15.1368596Z 2025-12-04T10:23:15.1368696Z Finished dynamo/test_misc 1/1 ... [2025-12-04 10:23:15.099221][2193257.5602064], took 1.91min 2025-12-04T10:23:15.1369056Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:23:15.1369440Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:23:15.1369655Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T10:23:15.1369834Z Uploading artifacts took 0.00 seconds 2025-12-04T10:23:15.1369960Z dynamo/test_misc 1/1 failed! 2025-12-04T10:23:15.1370167Z Running inductor/test_flex_attention 4/4 ... [2025-12-04 10:23:15.105264][2193257.566253738] 2025-12-04T10:23:15.1370354Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:23:15.1370746Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_flex_attention.py', '--shard-id=4', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:23:15.105444] 2025-12-04T10:31:40.3001298Z 2025-12-04T10:31:40.3002147Z inductor/test_flex_attention 4/4 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_flex_attention_4.4_4a3e1bca626dd2f8_.log 2025-12-04T10:31:40.3036925Z Running 196 items in this shard: test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_aot_eager_gradcheck_score_mod2_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_aot_eager_gradcheck_score_mod3_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_autograd_function_in_score_mod_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_automatic_dynamic_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE_256_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE3_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE_256_cuda_float16, 
test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE_256_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE3_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE_256_cuda_bfloat16, 
test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod4_BLOCK_SIZE_256_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE_256_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE_256_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_seqlen_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_dynamic_score_mask_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_dynamic_score_mask_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod0_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod2_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod5_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod6_cuda_bfloat16, 
test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod7_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod7_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_custom_sparse_block_size_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_custom_sparse_block_size_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_default_sparse_block_size_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_default_sparse_block_size_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_captured_buffers_all_dims_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_captured_buffers_all_dims_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_captured_score_mod_aot_eager_gradcheck_score_mod_name__head_offset_mode_eager_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_captured_wrong_device_error_message_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_causal_block_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_causal_block_non_divisible_with_captured_buffer_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_custom_score_mod_layout_freeze_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_epilogue_fused_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_eager_permute_order0_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_eager_permute_order1_shape0_cuda, 
test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_eager_permute_order3_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_inductor_permute_order0_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_inductor_permute_order3_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_eager_permute_order0_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_eager_permute_order3_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_inductor_permute_order0_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_inductor_permute_order4_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_paged_attention_permute_order0_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_paged_attention_permute_order2_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_paged_attention_permute_order3_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_paged_attention_permute_order4_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_fully_masked_out_rows_0_check_compile_True_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_function_composition_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_index_weird1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kernel_options_argument_is_respected_cuda, 
test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims0_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims0_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims1_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims1_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims1_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims0_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims0_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims0_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims0_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims1_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims0_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims0_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims0_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims1_score_mod2_cuda_float16, 
test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims1_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims1_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims0_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims0_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims1_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims1_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims1_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims1_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims0_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims0_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims1_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_load_from_view_buffer_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_logsumexp_correctness_score_mod1_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_logsumexp_correctness_score_mod1_cuda_float32, 
test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_lse_masked_output_backend_eager_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_multiple_score_mod_calls2_paged_attention_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_multiple_score_mod_calls_paged_attention_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_njt_causal_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod0_head_dims0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod1_head_dims0_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod1_head_dims1_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod2_head_dims0_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod2_head_dims1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod3_head_dims0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod3_head_dims1_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod4_head_dims1_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod5_head_dims0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod5_head_dims1_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod6_head_dims0_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod6_head_dims1_cuda_bfloat16, 
test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod7_head_dims0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod7_head_dims1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_pow_2_headdim_head_dim_24_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_num_warps_8_error_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_aux__alibi_bias_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_max__causal_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_max__squared_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_selective_ac_with_max_autotune_short_query_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_silu_on_score_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_small_block_mask_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s0_v_s0_do_s2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s2_v_s2_do_s2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s0_v_s0_do_s1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s1_v_s1_do_s1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s1_v_s1_do_s2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s3_v_s3_do_s2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_subgraph_respect_decompostion_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_symbol_closure_in_score_mod_cuda, 
test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_convert_logical_block_mask_cuda, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod1_cuda_bfloat16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod1_cuda_float32, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod5_cuda_float32, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod7_cuda_float32, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_update_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_mask_operations_with_none_q_indices_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_mask_vs_sequence_lengths_compile_True_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_size_changes_BLOCK_SIZE4_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_compiling_create_block_mask_no_recompile_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_doc_mask_clamped_repro_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_eager_tracing_correctness_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_from_kv_blocks_without_q_computation_full_indices_False_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_pytree_flatten_with_keys_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_upcast_appropriately_cuda, 
test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_absolute_2d_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_absolute_2d_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_batch_head_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_batch_head_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_distinct_biases_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_distinct_biases_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_distinct_biases_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flipped_indexed_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flipped_indexed_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_global_tokens_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_global_tokens_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_global_tokens_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_global_tokens_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, 
test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_global_tokens_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_indirect_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_local_window_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_multiplicative_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_multiplicative_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, 
test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_multiplicative_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_only_grad_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_only_grad_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_only_grad_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_only_grad_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_mode_max-autotune-no-cudagraphs_cuda, 
test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_weird_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_weird_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda 2025-12-04T10:31:40.3070137Z 2025-12-04T10:31:40.3070268Z Finished inductor/test_flex_attention 4/4 ... [2025-12-04 10:31:40.300069][2193762.761050307], took 8.42min 2025-12-04T10:31:40.3070676Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:31:40.3071036Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:31:40.3071268Z Running inductor/test_flex_decoding 2/2 ... [2025-12-04 10:31:40.306185][2193762.767168068] 2025-12-04T10:31:40.3071508Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:31:40.3071912Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_flex_decoding.py', '--shard-id=2', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:31:40.306434] 2025-12-04T10:42:33.3116403Z 2025-12-04T10:42:33.3117278Z inductor/test_flex_decoding 2/2 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_flex_decoding_2.2_f5e2a3ff89296243_.log 2025-12-04T10:42:33.3178783Z Running 294 items in this shard: test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod1_head_dims0_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod1_head_dims1_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod2_head_dims0_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod2_head_dims2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod3_head_dims2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod4_head_dims0_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod4_head_dims2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod5_head_dims0_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod5_head_dims2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod6_head_dims2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod7_head_dims0_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod7_head_dims1_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod7_head_dims2_cuda_bfloat16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod8_head_dims1_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_bfloat16_score_mod8_head_dims2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod0_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod0_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod1_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod2_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod2_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod3_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod3_BLOCK_SIZE_64_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod4_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod4_BLOCK_SIZE3_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod4_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod4_BLOCK_SIZE_64_cuda_bfloat16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod5_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod5_BLOCK_SIZE_64_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod6_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod6_BLOCK_SIZE3_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod6_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod6_BLOCK_SIZE_64_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod7_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod7_BLOCK_SIZE_64_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_bfloat16_score_mod8_BLOCK_SIZE_64_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod0_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod0_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod0_BLOCK_SIZE_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod1_BLOCK_SIZE2_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod1_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod2_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod3_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod3_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod3_BLOCK_SIZE_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod4_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod4_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod5_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod5_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod5_BLOCK_SIZE_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod6_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod6_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod7_BLOCK_SIZE2_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod7_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod7_BLOCK_SIZE_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod8_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float16_score_mod8_BLOCK_SIZE_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod0_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod0_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod0_BLOCK_SIZE_64_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod1_BLOCK_SIZE_64_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod2_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod2_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod2_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod2_BLOCK_SIZE_64_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod3_BLOCK_SIZE2_cuda_float32, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod3_BLOCK_SIZE_64_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod4_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod4_BLOCK_SIZE_64_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod5_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod5_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod5_BLOCK_SIZE_64_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod6_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod6_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod6_BLOCK_SIZE_64_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod8_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod8_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_different_block_size_float32_score_mod8_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float16_score_mod1_head_dims0_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float16_score_mod1_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float16_score_mod2_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float16_score_mod2_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float16_score_mod3_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float16_score_mod3_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float16_score_mod4_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float16_score_mod5_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float16_score_mod6_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float16_score_mod7_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod0_head_dims2_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod1_head_dims0_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod1_head_dims1_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod2_head_dims2_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod3_head_dims0_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod3_head_dims2_cuda_float32, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod4_head_dims0_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod4_head_dims1_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod4_head_dims2_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod5_head_dims0_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod5_head_dims2_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod6_head_dims0_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod6_head_dims1_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod7_head_dims0_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod7_head_dims1_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_builtin_score_mods_float32_score_mod8_head_dims2_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_bw_decoding_fails_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_captured_buffers_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_captured_reduction_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_captured_scale_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_decode_at_different_input_position_float16_score_mod0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_decode_at_different_input_position_float16_score_mod4_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_decode_at_different_input_position_float16_score_mod5_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_decode_at_different_input_position_float16_score_mod6_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_decode_at_different_input_position_float16_score_mod7_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_decode_at_different_input_position_float16_score_mod8_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_function_composition_bfloat16_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_function_composition_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod0_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod0_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod1_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod1_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod3_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod3_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod4_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod5_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod6_head_dims0_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod6_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod7_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod8_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod8_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_head_dependent_mask_mod_float16_score_mod8_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims0_score_mod1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims0_score_mod3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims0_score_mod6_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims0_score_mod8_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims1_score_mod1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims1_score_mod2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims1_score_mod6_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims2_score_mod1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims2_score_mod5_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims2_score_mod6_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims2_score_mod7_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims0_batch_dims3_score_mod3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims0_score_mod0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims0_score_mod2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims0_score_mod3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims0_score_mod4_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims0_score_mod6_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims0_score_mod7_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims0_score_mod8_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims1_score_mod0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims1_score_mod2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims1_score_mod5_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims1_score_mod6_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims1_score_mod8_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims2_score_mod0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims2_score_mod3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims2_score_mod4_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims2_score_mod5_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims2_score_mod8_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims3_score_mod4_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims3_score_mod5_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims1_batch_dims3_score_mod6_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims0_score_mod2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims0_score_mod3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims0_score_mod7_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims0_score_mod8_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims1_score_mod0_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims1_score_mod1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims1_score_mod2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims1_score_mod5_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims1_score_mod8_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims2_score_mod0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims2_score_mod1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims2_score_mod2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims2_score_mod3_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims2_score_mod5_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims2_score_mod6_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims2_score_mod7_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims2_score_mod8_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims3_score_mod2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims3_score_mod3_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims3_score_mod4_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_kv_batch_broadcast_float16_head_dims2_batch_dims3_score_mod5_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_load_from_bias_head_seq_batch_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_load_from_bias_seq_batch_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_logsumexp_correctness_bfloat16_score_mod1_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_logsumexp_only_return_cuda, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_max_autotune_cuda, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_max_autotune_with_captured_cuda, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_mixed_dtypes_fails_cuda, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_multiple_score_mod_calls_cuda, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_njt_causal_bfloat16_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_njt_causal_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_divisible_multi_token_offset_mask_with_captured_buffer_cuda, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod0_bfloat16_head_dims1_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod0_float16_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod0_float16_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod1_bfloat16_head_dims0_cuda_bfloat16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod1_bfloat16_head_dims1_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod1_float16_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod1_float32_head_dims0_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod1_float32_head_dims1_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod2_bfloat16_head_dims1_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod2_float16_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod2_float32_head_dims1_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod3_bfloat16_head_dims0_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod3_bfloat16_head_dims1_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod3_float16_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod3_float32_head_dims1_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod4_float16_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod4_float16_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod4_float32_head_dims0_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod5_float16_head_dims1_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod6_bfloat16_head_dims1_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod6_float16_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod6_float32_head_dims0_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod6_float32_head_dims1_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod7_bfloat16_head_dims0_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod7_bfloat16_head_dims1_cuda_bfloat16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod7_float16_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod7_float16_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod7_float32_head_dims1_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_equal_head_dims_score_mod8_float32_head_dims0_cuda_float32, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_pow_2_headdim_head_dim_17_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_pow_2_headdim_head_dim_94_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_non_sparse_mulitple_block_size_cuda, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod0_head_dims0_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod0_head_dims0_page_size_64_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod0_head_dims2_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod0_head_dims2_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod1_head_dims0_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod1_head_dims0_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod1_head_dims1_page_size_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod1_head_dims2_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod1_head_dims2_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod2_head_dims0_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod2_head_dims0_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod2_head_dims1_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod2_head_dims1_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod2_head_dims2_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod3_head_dims0_page_size_128_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod3_head_dims0_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod3_head_dims1_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod3_head_dims2_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod4_head_dims0_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod4_head_dims1_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod4_head_dims1_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod4_head_dims2_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod5_head_dims0_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod5_head_dims0_page_size_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod5_head_dims1_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod5_head_dims2_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod5_head_dims2_page_size_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod6_head_dims0_page_size_64_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod6_head_dims1_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod6_head_dims1_page_size_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod6_head_dims2_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod7_head_dims0_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod7_head_dims0_page_size_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod7_head_dims1_page_size_64_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod7_head_dims2_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod7_head_dims2_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod8_head_dims1_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod8_head_dims2_page_size_128_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_paged_attention_page_size_float16_score_mod8_head_dims2_page_size_256_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_seq_masking_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_silu_on_score_float16_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_skip_odd_keys_float16_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s0_v_s0_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s0_v_s1_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s0_v_s1_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s0_v_s1_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s0_v_s2_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s0_v_s2_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s0_v_s3_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s1_v_s0_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s1_v_s1_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s1_v_s1_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s1_v_s2_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s1_v_s3_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s2_v_s0_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s2_v_s0_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s2_v_s0_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s2_v_s2_head_dims1_cuda_float16, 
test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s3_v_s0_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s3_v_s0_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s3_v_s1_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s3_v_s2_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s3_v_s2_head_dims2_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s3_v_s3_head_dims0_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_strided_inputs_float16_k_s3_v_s3_head_dims1_cuda_float16, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_windowed_full_mask_vs_sdpa_paged_attention_cuda, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_windowed_no_mask_vs_sdpa_cuda, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_windowed_no_mask_vs_sdpa_paged_attention_cuda, test/inductor/test_flex_decoding.py::TestFlexDecodingCUDA::test_windowed_partial_block_vs_sdpa_cuda 2025-12-04T10:42:33.3229737Z 2025-12-04T10:42:33.3229866Z Finished inductor/test_flex_decoding 2/2 ... [2025-12-04 10:42:33.311726][2194415.772710261], took 10.88min 2025-12-04T10:42:33.3230324Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:42:33.3230681Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:42:33.3230923Z Running inductor/test_triton_extension_backend 1/1 ... 
[2025-12-04 10:42:33.317618][2194415.778607065] 2025-12-04T10:42:33.3231178Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:42:33.3231592Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_triton_extension_backend.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:42:33.317824] 2025-12-04T10:42:38.9913076Z 2025-12-04T10:42:38.9914046Z inductor/test_triton_extension_backend 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_triton_extension_backend_1.1_67a55ee7377541af_.log 2025-12-04T10:42:38.9914739Z Running 0 items in this shard: 2025-12-04T10:42:38.9914895Z 2025-12-04T10:42:38.9915165Z Finished inductor/test_triton_extension_backend 1/1 ... [2025-12-04 10:42:38.990928][2194421.451914268], took 0.09min 2025-12-04T10:42:38.9918629Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:42:38.9963909Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:42:38.9966606Z Running inductor/test_cutedsl_grouped_mm 1/1 ... [2025-12-04 10:42:38.996408][2194421.457397709] 2025-12-04T10:42:38.9967612Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:42:38.9968285Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cutedsl_grouped_mm.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:42:38.996626] 2025-12-04T10:42:40.7641981Z 2025-12-04T10:42:40.7642846Z inductor/test_cutedsl_grouped_mm 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cutedsl_grouped_mm_1.1_cc2e6babc150a33b_.log 2025-12-04T10:42:40.7650715Z Running 24 items in this shard: test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_assorted_layouts_layout_A_contiguous_layout_B_broadcasted, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_assorted_layouts_layout_A_contiguous_layout_B_contiguous, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_assorted_layouts_layout_A_offset_layout_B_broadcasted, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_assorted_layouts_layout_A_offset_layout_B_contiguous, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_assorted_layouts_layout_A_padded_layout_B_broadcasted, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_assorted_layouts_layout_A_padded_layout_B_contiguous, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_assorted_layouts_layout_A_view_layout_B_broadcasted, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_assorted_layouts_layout_A_view_layout_B_contiguous, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_2_M_hint_1024_K_128_N_128, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_2_M_hint_1024_K_128_N_256, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_2_M_hint_1024_K_64_N_128, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_2_M_hint_1024_K_64_N_256, 
test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_2_M_hint_256_K_128_N_128, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_2_M_hint_256_K_128_N_256, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_2_M_hint_256_K_64_N_128, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_2_M_hint_256_K_64_N_256, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_8_M_hint_1024_K_128_N_128, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_8_M_hint_1024_K_128_N_256, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_8_M_hint_1024_K_64_N_128, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_8_M_hint_1024_K_64_N_256, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_8_M_hint_256_K_128_N_128, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_8_M_hint_256_K_128_N_256, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_8_M_hint_256_K_64_N_128, test/inductor/test_cutedsl_grouped_mm.py::TestCuTeDSLGroupedGemm::test_grouped_gemm_basic_group_size_8_M_hint_256_K_64_N_256 2025-12-04T10:42:40.7657325Z 2025-12-04T10:42:40.7657497Z Finished inductor/test_cutedsl_grouped_mm 1/1 ... 
[2025-12-04 10:42:40.763880][2194423.224863287], took 0.03min 2025-12-04T10:42:40.7658055Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:42:40.7701901Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:42:40.7703155Z Running inductor/test_cpp_wrapper_hipify 1/1 ... [2025-12-04 10:42:40.770114][2194423.231103987] 2025-12-04T10:42:40.7703379Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:42:40.7705331Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cpp_wrapper_hipify.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:42:40.770329] 2025-12-04T10:42:43.3891762Z 2025-12-04T10:42:43.3892922Z inductor/test_cpp_wrapper_hipify 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cpp_wrapper_hipify_1.1_1194df3009605526_.log 2025-12-04T10:42:43.3894911Z Running 3 items in this shard: test/inductor/test_cpp_wrapper_hipify.py::TestCppWrapperHipify::test_hipify_aoti_driver_header, test/inductor/test_cpp_wrapper_hipify.py::TestCppWrapperHipify::test_hipify_basic_declaration, test/inductor/test_cpp_wrapper_hipify.py::TestCppWrapperHipify::test_hipify_cross_platform 2025-12-04T10:42:43.3896191Z 2025-12-04T10:42:43.3896551Z Finished inductor/test_cpp_wrapper_hipify 1/1 ... [2025-12-04 10:42:43.388842][2194425.849825601], took 0.04min 2025-12-04T10:42:43.3899086Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:42:43.3950832Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:42:43.3951247Z Running export/test_retraceability 1/1 ... 
[2025-12-04 10:42:43.394952][2194425.855941462] 2025-12-04T10:42:43.3951583Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:42:43.3953632Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_retraceability.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:42:43.395179] 2025-12-04T10:45:41.9074127Z 2025-12-04T10:45:41.9075024Z export/test_retraceability 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_retraceability_1.1_261a2802ab0dcc9c_.log 2025-12-04T10:45:41.9240042Z Running 880 items in this shard: test/export/test_retraceability.py::RetraceExportTestDynamismExpression::test_export_assume_static_by_default_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestDynamismExpression::test_export_constraints_error_not_in_range_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestDynamismExpression::test_export_constraints_error_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestDynamismExpression::test_export_inline_constraints_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestDynamismExpression::test_export_slice_maxsize_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestDynamismExpression::test_export_slice_unbacked_dim1_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestDynamismExpression::test_export_strict_narrow_unbacked_expr_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestDynamismExpression::test_no_grad_param_inplace_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestDynamismExpression::test_reshape_view_backed_size_oblivious_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestDynamismExpression::test_export_assume_static_by_default_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestDynamismExpression::test_export_constraints_error_not_in_range_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestDynamismExpression::test_export_constraints_error_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestDynamismExpression::test_export_inline_constraints_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestDynamismExpression::test_export_slice_maxsize_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestDynamismExpression::test_export_slice_unbacked_dim1_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestDynamismExpression::test_export_strict_narrow_unbacked_expr_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestDynamismExpression::test_no_grad_param_inplace_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestDynamismExpression::test_reshape_view_backed_size_oblivious_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportTestExport::test__scaled_dot_product_flash_attention_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_additional_inputs_constants_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_allow_explicit_guards_as_runtime_asserts_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_annotate_on_assert_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_args_type_checked_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_aten_lift_fresh_copy_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_attention_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_attr_assignment_extra_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_automatic_constrain_size_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_automatic_dynamic_shapes_constant_relation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_automatic_dynamic_shapes_linear_relation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_automatic_dynamic_shapes_simple_equality_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_baddbmm_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_basic_non_strict_fake_tensor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_basic_non_strict_real_tensor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_basic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_bincount_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_buffer_util_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_capture_subclass_constructor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_capture_subclass_constructor_torch_ir_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_capture_subclass_wrong_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_ccode_python_mod_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cdist_forward_compute_mode_zero_export_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_check_specialized_int_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_checks_to_constrain_range_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cleanup_dynamic_markers_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_colin_unbacked_backed_vr_sub_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_colon_parameter_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_compiling_state_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cond_access_identical_symint_closure_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cond_branches_return_constant_int_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cond_branches_return_same_int_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cond_buffers_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cond_contains_unbacked_no_escape_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cond_int_closure_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cond_unflatten_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cond_with_module_stack_export_with_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cond_with_module_stack_export_with_unflatten_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constant_aliasing_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constant_input_naming_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_constant_no_user_inp_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constant_output_dup_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constant_output_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constant_requires_grad_const_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constant_return_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constant_tensor_mutation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constant_tensor_with_non_functional_nested_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constant_tensor_with_non_functional_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constrain_decomp_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constrain_size_in_eager_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constrain_size_with_constrain_value_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_constrain_size_with_various_cases_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_conv_dynamic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_crop_like_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_cse_for_symint_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_custom_op_auto_functionalize_pre_dispatch_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_custom_op_auto_functionalize_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_custom_op_auto_warn_pre_dispatch_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_custom_op_preserve_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_custom_pytree_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_custom_tag_metadata_re_export_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_decomp_batch_norm_functional_predispatch_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_decomp_item_in_prim_after_decomposition_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_decomp_item_in_prim_before_decomposition_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_default_decomposition_core_cia_ops_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_derived_dim_1_2_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_derived_dim_basic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_derived_dim_integer_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_derived_dim_nested_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_derived_dim_out_of_order_repeat_derived_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_derived_dim_out_of_order_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_derived_dim_out_of_order_simplified_repeat_non_derived_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_derived_dim_out_of_order_simplified_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_derived_dim_repeat_derived_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_detect_leak_nonstrict_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_detect_leak_nonstrict_with_stacktrace_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_detect_leak_strict_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_device_to_dynamic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_device_to_gpu_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_device_to_mutation_float_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_device_to_mutation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_device_to_static_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dim_1_2_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dim_auto_and_dim_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dim_dynamic_divisibility_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dim_dynamic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dim_dynamic_specialization_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dim_hint_range_violations_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dim_hint_ranges_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_disable_forced_specializations_errors_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_disable_forced_specializations_ok_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_distributed_all_gather_into_tensor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_distributed_all_gather_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_distributed_all_reduce_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_distributed_all_to_all_single_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_distributed_reduce_scatter_tensor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dont_duck_size_for_auto_dynamic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_double_lifted_constants_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_draft_export_checks_aliasing_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_draft_export_checks_mutation_list_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_draft_export_checks_mutation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_draft_export_checks_mutation_with_nan_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_draft_export_fake_kernel_inference_errors_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_draft_export_infers_fake_kernel_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_duplicate_modules_with_non_persistent_buffers_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_lr_shift_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_bounds_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_builder_basic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_builder_kwargs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_builder_pytree_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_dataclass_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_inferred_basic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_serdes_generic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_serdes_user_errors_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_serdes_various_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_spec_with_pytree_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_shapes_wrapped_with_shape_guards_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_dynamic_sym_round_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_ends_of_bounds_oblivious_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_enum_str_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_error_does_not_reference_eager_fallback_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_error_when_passing_mutating_primitive_op_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_exception_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_expand_copy_export_handles_implicit_true_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_api_with_dynamic_shapes_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_as_backend_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_associative_scan_lifted_buffers_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_associative_scan_symbol_dim_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_associative_scan_symbol_scandim_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_aten_to_unflatten_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_aten_to_unflatten_subclass_pre_dispatch_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_aten_to_unflatten_subclass_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_cond_preserve_torch_fn_for_subgraphs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_cond_symbool_pred_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_cond_warns_constant_pred_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_custom_decomp_table_basic_pop_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_custom_decomp_table_container_methods_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_custom_op_lib_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_custom_triton_kernel_mutable_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_export_custom_triton_kernel_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_cyclic_reference_leak_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_decomp_torture_case_1_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_decomp_torture_case_2_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_decomps_dynamic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_decomps_simple_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_dynamo_config_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_for_training_run_decomp_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_for_training_with_container_type_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_for_training_with_dynamic_shapes_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_for_training_with_mutation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_for_training_with_state_dict_hooks_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_func_with_default_kwargs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_func_with_keyword_only_args_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_func_with_kwargs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_func_with_pytree_kwargs_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_export_func_with_var_keyword_args_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_func_with_var_keyword_pytree_args_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_func_with_var_postional_args_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_function_schema_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_graph_with_no_inputs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_input_mutation_bug_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_input_mutation_dynamic_shape_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_input_mutation_static_shape_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_leak_compile_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_linear_preserve_dynamic_shape_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_max_nonstrict_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_max_onnx_reported_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_method_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_mod_constraints_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_module_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_preserve_linear_at_aot_level_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_export_preserve_linear_but_not_custom_op_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_rnn_variants_with_warning_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_scan_pytree_output_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_script_module_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_statically_known_true_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_then_compile_tensor_ctor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_with_autocast_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_with_fake_tensor_inputs_on_cuda_devices_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_with_fake_tensor_inputs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_with_inline_constraints_complex_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_with_inline_constraints_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_with_set_grad_enabled_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_export_with_wrong_inputs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_external_call_non_strict_real_tensor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_fake_inputs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_fake_weights_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_filter_traceback_frames_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_flex_attention_export_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_float_conversion_from_int_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_float_conversion_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_fqn_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_from_node_metadata_export_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_full_on_scalar_tensor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_function_holding_tensor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_hints_wrapper_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_hoo_inline_users_issue_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_if_functional_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_if_post_autograd_op_preserved_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_inductor_backend_inside_nonstrict_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_inline_script_class_method_recursive_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_inline_script_class_method_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_inline_script_function_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_inline_script_method_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_int_shape_specialization_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_intermediate_shape_comp_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_invalid_pytree_dynamo_graph_capture_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_is_exporting_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_is_nonzero_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_isnonzero_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_issue_113041_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_issue_157289_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_issue_161902_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_istft_op_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_keep_composite_ops_invalid_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_keep_composite_ops_linear_convd_for_training_ir_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_keep_composite_ops_linear_convd_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_kwarg_dynamic_shapes_diff_order_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_kwargs_reorder_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_layer_norm_unbacked_normalized_shape_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_layer_sharing_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_lazy_module_kwargs_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_lifted_constants_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_linear_conv_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_malformed_fqn_from_source_name_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_map_buffers_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_map_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_mask_nonzero_static_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_masked_select_dynamic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_math_pow_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_mismatched_dynamic_shapes_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_mixed_input_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_module_dict_key_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_module_input_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_module_input_subclasses_parameterization_nested_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_module_list_slice_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_module_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_module_with_dict_container_inp_out_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_modules_access_for_deleted_submodule_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_more_multidimensional_slicing_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_multidimensional_slicing_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_multinomial_dynamic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_multiple_definitions_same_name_dim_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_namedtuple_input_export_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_native_multi_attention_head_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nested_dynamic_shapes_spec_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nested_module_fake_tensor_leak_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nested_module_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nested_module_with_constant_buffer_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nested_module_with_init_buffer_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nested_module_with_parameter_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nn_module_stack_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nn_module_stack_shared_submodule_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_no_check_is_size_error_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_no_suggested_fixes_for_data_dependent_errors_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_no_tensor_computation_2_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_no_tensor_computation_3_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_no_tensor_computation_4_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_no_tensor_computation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_non_arg_name_dynamic_shapes_api_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_non_arg_name_dynamic_shapes_api_with_container_type_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_non_arg_name_dynamic_shapes_api_with_kwarg_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_non_persistent_buffer_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_non_strict_dynamic_shapes_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_non_strict_dynamic_shapes_suggested_fixes_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_none_buffers_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nonstrict_retrace_preserves_metadata_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nonzero_2_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_nonzero_dynamic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_not_registered_parameter_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_operator_aten_tensor_mode_variant_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_output_node_name_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_pad_sequence_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_param_util_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_partial_patched_forward_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_placeholder_naming_collisions_hoo_subgraphs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_placeholder_naming_collisions_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_placeholder_naming_order_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_placeholder_naming_order_variadic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_placeholder_update_preserving_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_predispatch_cond_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_predispatch_grad_wrappers_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_preserve_annotation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_preserve_module_call_signature_unflatten_specialization_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_preserve_requires_grad_placeholders_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_preserve_shape_dynamism_for_unused_inputs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_profiling_code_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_python_asserts_with_sym_int_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_pytree_register_data_class_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_pytree_register_nested_data_class_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_raise_user_error_when_guard_on_data_dependent_operation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_range_constraints_with_replacement_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_real_tensor_alias_dtype_mismatch_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_real_tensor_bool_cast_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_real_tensor_errors_on_aliasing_custom_op_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_real_tensor_for_max_op_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_real_tensor_size_mismatch_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_redundant_assert_max_upper_bound_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_redundant_asserts_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_refine_dynamic_shapes_from_suggested_fixes_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_register_constant_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_repeat_interleave_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_replace_unbacked_with_very_large_upperbound_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_replaced_unbacked_bindings_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_reshape_view_helper_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_retracable_ep_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_retrace_pre_autograd_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_run_decomposition_supports_user_input_mutation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_run_decompositions_keep_metadata_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_run_decompositions_keep_tensor_constant_metadata_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_runtime_assert_for_prim_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_runtime_assert_for_prm_str_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_runtime_assert_with_size_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_sdpa_gqa_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_sequential_slicing_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_set_example_inputs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_set_grad_as_side_effect_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_set_grad_empty_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_set_grad_unflatten_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_setgrad_lifted_tensor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_shared_submodule_nn_module_stack_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_simple_export_for_training_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_simple_unbacked_view_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_size_input_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_slice_nn_module_stack_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_solver_unsupported_sympy_function_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_specialize_derived_dim_roots_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_split_const_gm_with_lifted_constants_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_stack_trace_make_fx_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_stack_trace_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_state_primitives_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_state_shape_attribute_assignment_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_state_tensors_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_static_dim_constraints_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_subclass_context_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_subclass_nested_attr_access_complicated_metadata_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_subclass_nested_attr_access_const_metadata_not_top_level_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_subclass_nested_attr_access_const_metadata_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_subclass_nested_attr_access_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_subclass_nested_attr_access_submodule_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_subclasses_parameterization_nested_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_subclasses_parameterization_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_suggest_torch_checks_with_non_negative_check_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_suggest_torch_checks_with_regular_check_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_suggested_fixes_for_data_dependent_errors_basic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_suggested_fixes_for_data_dependent_errors_puzzlers_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_suggested_fixes_new_roots_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_sym_float_operators_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_sym_or_sym_and_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_sym_sqrt_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_symbool_item_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_symfloat_item_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_symint_input_additional_inputs_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_symint_input_basic_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_symint_input_ranges_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_symint_input_shapes_collection_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_symint_input_specialization_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_symint_item_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_symint_output_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_symint_tensor_return_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_tag_ac_export_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_tensor_attribute_zero_args_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_tensor_constant_aten_to_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_tensor_constant_with_wrapped_method_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_to_module_with_mutated_buffer_multiple_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_to_module_with_mutated_buffer_multiple_update_sub_later_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_to_module_with_mutated_buffer_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_tolist_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_torch_check_eq_commutativity_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_torch_fn_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_trace_under_fake_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_train_eval_on_exported_preautograd_module_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_tril_dynamic_diagonal_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_triu_dynamic_diagonal_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_3d_matmul_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_bincount_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_bindings_for_divisible_u_symint_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_deferred_runtime_retrace_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_expand_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_infer_size_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_kth_value_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_linear_layer_norm_input_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_noncontig_lin_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_pad_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_scalar_constructor_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_slice_forward_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_slice_simple_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_stack_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_to_cond_passthrough_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_to_cond_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unbacked_unsqueeze_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_asserts_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_buffer_update_child2parent_swap_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_closure_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_isinstance_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_multiple_graphs_dispatch_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_multiple_graphs_preserve_signature_no_error_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_multiple_graphs_shared_submodule_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_multiple_graphs_state_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_no_unroll_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_placeholder_update_child2parent_swap_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_placeholder_update_grandchild2cousin_swap_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_5_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_6_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_buf_8_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_const_preserving_3_1_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_const_preserving_3_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_mutating_buf_4_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_mutating_buf_6_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_mutating_buf_9_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_mutating_buf_preserving_10_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_mutating_buf_preserving_4_1_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_mutating_buf_preserving_4_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_mutating_buf_preserving_5_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_mutating_buf_preserving_7_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unflatten_random_dag_preserving_4_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unused_aliases_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_unused_constant_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_uplift_common_custom_meta_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_uplift_common_custom_meta_with_multiple_calls_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_use_embedding_twice_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_user_input_and_buffer_mutation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_vmap_custom_autograd_function_retraceability_strict, 
test/export/test_retraceability.py::RetraceExportTestExport::test_vmap_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_vmap_to_assert_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_where_decomp_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_while_loop_assert_separation_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_while_loop_index_assertions_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_while_loop_simple_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_while_loop_tensor_constant_idx_retraceability_strict, test/export/test_retraceability.py::RetraceExportTestExport::test_wrapper_module_retraceability_strict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test__scaled_dot_product_flash_attention_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_additional_inputs_constants_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_allow_explicit_guards_as_runtime_asserts_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_annotate_on_assert_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_args_type_checked_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_aten_lift_fresh_copy_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_attention_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_attr_assignment_extra_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_automatic_constrain_size_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_automatic_dynamic_shapes_constant_relation_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_automatic_dynamic_shapes_linear_relation_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_automatic_dynamic_shapes_simple_equality_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_baddbmm_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_basic_non_strict_fake_tensor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_basic_non_strict_real_tensor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_basic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_bincount_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_buffer_util_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_capture_subclass_constructor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_capture_subclass_constructor_torch_ir_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_capture_subclass_wrong_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_ccode_python_mod_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cdist_forward_compute_mode_zero_export_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_check_specialized_int_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_checks_to_constrain_range_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cleanup_dynamic_markers_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_colin_unbacked_backed_vr_sub_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_colon_parameter_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_compiling_state_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cond_access_identical_symint_closure_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cond_branches_return_constant_int_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cond_branches_return_same_int_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cond_buffers_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cond_contains_unbacked_no_escape_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cond_int_closure_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cond_unflatten_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cond_with_module_stack_export_with_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cond_with_module_stack_export_with_unflatten_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constant_aliasing_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constant_input_naming_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constant_no_user_inp_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constant_output_dup_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constant_output_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constant_requires_grad_const_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constant_return_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constant_tensor_mutation_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constant_tensor_with_non_functional_nested_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constant_tensor_with_non_functional_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constrain_decomp_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constrain_size_in_eager_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constrain_size_with_constrain_value_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_constrain_size_with_various_cases_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_conv_dynamic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_crop_like_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_cse_for_symint_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_custom_op_auto_functionalize_pre_dispatch_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_custom_op_auto_functionalize_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_custom_op_auto_warn_pre_dispatch_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_custom_op_preserve_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_custom_pytree_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_custom_tag_metadata_re_export_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_decomp_batch_norm_functional_predispatch_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_decomp_item_in_prim_after_decomposition_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_decomp_item_in_prim_before_decomposition_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_default_decomposition_core_cia_ops_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_derived_dim_1_2_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_derived_dim_basic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_derived_dim_integer_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_derived_dim_nested_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_derived_dim_out_of_order_repeat_derived_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_derived_dim_out_of_order_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_derived_dim_out_of_order_simplified_repeat_non_derived_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_derived_dim_out_of_order_simplified_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_derived_dim_repeat_derived_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_detect_leak_nonstrict_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_detect_leak_nonstrict_with_stacktrace_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_detect_leak_strict_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_device_to_dynamic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_device_to_gpu_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_device_to_mutation_float_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_device_to_mutation_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_device_to_static_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dim_1_2_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dim_auto_and_dim_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dim_dynamic_divisibility_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dim_dynamic_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dim_dynamic_specialization_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dim_hint_range_violations_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dim_hint_ranges_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_disable_forced_specializations_errors_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_disable_forced_specializations_ok_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_distributed_all_gather_into_tensor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_distributed_all_gather_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_distributed_all_reduce_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_distributed_all_to_all_single_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_distributed_reduce_scatter_tensor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dont_duck_size_for_auto_dynamic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_double_lifted_constants_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_draft_export_checks_aliasing_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_draft_export_checks_mutation_list_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_draft_export_checks_mutation_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_draft_export_checks_mutation_with_nan_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_draft_export_fake_kernel_inference_errors_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_draft_export_infers_fake_kernel_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_duplicate_modules_with_non_persistent_buffers_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_lr_shift_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_bounds_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_builder_basic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_builder_kwargs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_builder_pytree_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_dataclass_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_inferred_basic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_serdes_generic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_serdes_user_errors_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_serdes_various_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_spec_with_pytree_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_shapes_wrapped_with_shape_guards_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_dynamic_sym_round_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_ends_of_bounds_oblivious_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_enum_str_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_error_does_not_reference_eager_fallback_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_error_when_passing_mutating_primitive_op_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_exception_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_expand_copy_export_handles_implicit_true_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_api_with_dynamic_shapes_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_as_backend_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_associative_scan_lifted_buffers_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_associative_scan_symbol_dim_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_associative_scan_symbol_scandim_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_aten_to_unflatten_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_aten_to_unflatten_subclass_pre_dispatch_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_aten_to_unflatten_subclass_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_cond_preserve_torch_fn_for_subgraphs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_cond_symbool_pred_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_cond_warns_constant_pred_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_custom_decomp_table_basic_pop_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_custom_decomp_table_container_methods_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_custom_op_lib_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_custom_triton_kernel_mutable_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_custom_triton_kernel_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_cyclic_reference_leak_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_decomp_torture_case_1_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_decomp_torture_case_2_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_decomps_dynamic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_decomps_simple_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_dynamo_config_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_for_training_run_decomp_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_for_training_with_container_type_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_for_training_with_dynamic_shapes_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_for_training_with_mutation_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_for_training_with_state_dict_hooks_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_func_with_default_kwargs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_func_with_keyword_only_args_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_func_with_kwargs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_func_with_pytree_kwargs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_func_with_var_keyword_args_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_func_with_var_keyword_pytree_args_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_func_with_var_postional_args_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_function_schema_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_graph_with_no_inputs_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_input_mutation_bug_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_input_mutation_dynamic_shape_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_input_mutation_static_shape_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_leak_compile_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_linear_preserve_dynamic_shape_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_max_nonstrict_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_max_onnx_reported_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_method_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_mod_constraints_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_module_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_preserve_linear_at_aot_level_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_preserve_linear_but_not_custom_op_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_rnn_variants_with_warning_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_scan_pytree_output_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_script_module_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_statically_known_true_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_then_compile_tensor_ctor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_with_autocast_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_with_fake_tensor_inputs_on_cuda_devices_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_with_fake_tensor_inputs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_with_inline_constraints_complex_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_with_inline_constraints_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_with_set_grad_enabled_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_export_with_wrong_inputs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_external_call_non_strict_real_tensor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_fake_inputs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_fake_weights_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_filter_traceback_frames_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_flex_attention_export_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_float_conversion_from_int_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_float_conversion_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_fqn_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_from_node_metadata_export_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_full_on_scalar_tensor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_function_holding_tensor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_hints_wrapper_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_hoo_inline_users_issue_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_if_functional_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_if_post_autograd_op_preserved_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_inductor_backend_inside_nonstrict_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_inline_script_class_method_recursive_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_inline_script_class_method_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_inline_script_function_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_inline_script_method_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_int_shape_specialization_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_intermediate_shape_comp_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_invalid_pytree_dynamo_graph_capture_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_is_exporting_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_is_nonzero_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_isnonzero_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_issue_113041_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_issue_157289_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_issue_161902_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_istft_op_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_keep_composite_ops_invalid_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_keep_composite_ops_linear_convd_for_training_ir_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_keep_composite_ops_linear_convd_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_kwarg_dynamic_shapes_diff_order_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_kwargs_reorder_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_layer_norm_unbacked_normalized_shape_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_layer_sharing_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_lazy_module_kwargs_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_lifted_constants_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_linear_conv_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_malformed_fqn_from_source_name_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_map_buffers_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_map_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_mask_nonzero_static_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_masked_select_dynamic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_math_pow_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_mismatched_dynamic_shapes_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_mixed_input_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_module_dict_key_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_module_input_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_module_input_subclasses_parameterization_nested_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_module_list_slice_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_module_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_module_with_dict_container_inp_out_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_modules_access_for_deleted_submodule_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_more_multidimensional_slicing_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_multidimensional_slicing_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_multinomial_dynamic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_multiple_definitions_same_name_dim_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_namedtuple_input_export_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_native_multi_attention_head_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nested_dynamic_shapes_spec_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nested_module_fake_tensor_leak_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nested_module_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nested_module_with_constant_buffer_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nested_module_with_init_buffer_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nested_module_with_parameter_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nn_module_stack_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nn_module_stack_shared_submodule_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_no_check_is_size_error_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_no_suggested_fixes_for_data_dependent_errors_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_no_tensor_computation_2_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_no_tensor_computation_3_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_no_tensor_computation_4_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_no_tensor_computation_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_non_arg_name_dynamic_shapes_api_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_non_arg_name_dynamic_shapes_api_with_container_type_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_non_arg_name_dynamic_shapes_api_with_kwarg_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_non_persistent_buffer_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_non_strict_dynamic_shapes_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_non_strict_dynamic_shapes_suggested_fixes_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_none_buffers_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nonstrict_retrace_preserves_metadata_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nonzero_2_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_nonzero_dynamic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_not_registered_parameter_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_operator_aten_tensor_mode_variant_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_output_node_name_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_pad_sequence_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_param_util_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_partial_patched_forward_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_placeholder_naming_collisions_hoo_subgraphs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_placeholder_naming_collisions_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_placeholder_naming_order_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_placeholder_naming_order_variadic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_placeholder_update_preserving_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_predispatch_cond_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_predispatch_grad_wrappers_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_preserve_annotation_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_preserve_module_call_signature_unflatten_specialization_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_preserve_requires_grad_placeholders_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_preserve_shape_dynamism_for_unused_inputs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_profiling_code_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_python_asserts_with_sym_int_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_pytree_register_data_class_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_pytree_register_nested_data_class_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_raise_user_error_when_guard_on_data_dependent_operation_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_range_constraints_with_replacement_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_real_tensor_alias_dtype_mismatch_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_real_tensor_bool_cast_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_real_tensor_errors_on_aliasing_custom_op_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_real_tensor_for_max_op_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_real_tensor_size_mismatch_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_redundant_assert_max_upper_bound_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_redundant_asserts_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_refine_dynamic_shapes_from_suggested_fixes_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_register_constant_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_repeat_interleave_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_replace_unbacked_with_very_large_upperbound_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_replaced_unbacked_bindings_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_reshape_view_helper_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_retracable_ep_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_retrace_pre_autograd_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_run_decomposition_supports_user_input_mutation_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_run_decompositions_keep_metadata_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_run_decompositions_keep_tensor_constant_metadata_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_runtime_assert_for_prim_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_runtime_assert_for_prm_str_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_runtime_assert_with_size_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_sdpa_gqa_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_sequential_slicing_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_set_example_inputs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_set_grad_as_side_effect_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_set_grad_empty_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_set_grad_unflatten_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_setgrad_lifted_tensor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_shared_submodule_nn_module_stack_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_simple_export_for_training_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_simple_unbacked_view_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_size_input_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_slice_nn_module_stack_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_solver_unsupported_sympy_function_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_specialize_derived_dim_roots_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_split_const_gm_with_lifted_constants_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_stack_trace_make_fx_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_stack_trace_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_state_primitives_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_state_shape_attribute_assignment_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_state_tensors_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_static_dim_constraints_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_subclass_context_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_subclass_nested_attr_access_complicated_metadata_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_subclass_nested_attr_access_const_metadata_not_top_level_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_subclass_nested_attr_access_const_metadata_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_subclass_nested_attr_access_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_subclass_nested_attr_access_submodule_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_subclasses_parameterization_nested_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_subclasses_parameterization_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_suggest_torch_checks_with_non_negative_check_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_suggest_torch_checks_with_regular_check_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_suggested_fixes_for_data_dependent_errors_basic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_suggested_fixes_for_data_dependent_errors_puzzlers_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_suggested_fixes_new_roots_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_sym_float_operators_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_sym_or_sym_and_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_sym_sqrt_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_symbool_item_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_symfloat_item_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_symint_input_additional_inputs_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_symint_input_basic_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_symint_input_ranges_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_symint_input_shapes_collection_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_symint_input_specialization_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_symint_item_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_symint_output_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_symint_tensor_return_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_tag_ac_export_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_tensor_attribute_zero_args_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_tensor_constant_aten_to_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_tensor_constant_with_wrapped_method_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_to_module_with_mutated_buffer_multiple_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_to_module_with_mutated_buffer_multiple_update_sub_later_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_to_module_with_mutated_buffer_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_tolist_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_torch_check_eq_commutativity_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_torch_fn_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_trace_under_fake_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_train_eval_on_exported_preautograd_module_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_tril_dynamic_diagonal_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_triu_dynamic_diagonal_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_3d_matmul_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_bincount_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_bindings_for_divisible_u_symint_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_deferred_runtime_retrace_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_expand_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_infer_size_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_kth_value_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_linear_layer_norm_input_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_noncontig_lin_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_pad_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_scalar_constructor_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_slice_forward_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_slice_simple_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_stack_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_to_cond_passthrough_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_to_cond_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unbacked_unsqueeze_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_asserts_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_buffer_update_child2parent_swap_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_closure_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_isinstance_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_multiple_graphs_dispatch_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_multiple_graphs_preserve_signature_no_error_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_multiple_graphs_shared_submodule_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_multiple_graphs_state_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_no_unroll_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_placeholder_update_child2parent_swap_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_placeholder_update_grandchild2cousin_swap_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_5_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_6_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_buf_8_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_const_preserving_3_1_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_const_preserving_3_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_mutating_buf_4_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_mutating_buf_6_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_mutating_buf_9_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_mutating_buf_preserving_10_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_mutating_buf_preserving_4_1_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_mutating_buf_preserving_4_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_mutating_buf_preserving_5_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_mutating_buf_preserving_7_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unflatten_random_dag_preserving_4_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unused_aliases_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_unused_constant_retraceability_nonstrict, 
test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_uplift_common_custom_meta_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_uplift_common_custom_meta_with_multiple_calls_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_use_embedding_twice_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_user_input_and_buffer_mutation_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_vmap_custom_autograd_function_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_vmap_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_vmap_to_assert_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_where_decomp_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_while_loop_assert_separation_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_while_loop_index_assertions_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_while_loop_simple_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_while_loop_tensor_constant_idx_retraceability_nonstrict, test/export/test_retraceability.py::RetraceExportNonStrictTestExport::test_wrapper_module_retraceability_nonstrict 2025-12-04T10:45:41.9425776Z 2025-12-04T10:45:41.9425926Z Finished export/test_retraceability 1/1 ... 
[2025-12-04 10:45:41.908421][2194604.369407847], took 2.98min 2025-12-04T10:45:41.9426327Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:45:41.9426741Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:45:41.9426960Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T10:45:41.9427139Z Uploading artifacts took 0.00 seconds 2025-12-04T10:45:41.9427327Z Running dynamo/test_deque_reconstruct 1/1 ... [2025-12-04 10:45:41.914315][2194604.375304813] 2025-12-04T10:45:41.9427516Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:45:41.9427913Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_deque_reconstruct.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:45:41.914509] 2025-12-04T10:45:44.3826981Z 2025-12-04T10:45:44.3827976Z dynamo/test_deque_reconstruct 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_deque_reconstruct_1.1_97a48b48fed60745_.log 2025-12-04T10:45:44.3829980Z Running 3 items in this shard: test/dynamo/test_deque_reconstruct.py::TestDequeReconstruct::test_deque_reconstruct_in_globals, test/dynamo/test_deque_reconstruct.py::TestDequeReconstruct::test_deque_reconstruct_not_in_globals, test/dynamo/test_deque_reconstruct.py::TestDequeReconstruct::test_deque_reconstruct_shallows_globals 2025-12-04T10:45:44.3831406Z 2025-12-04T10:45:44.3831745Z Finished dynamo/test_deque_reconstruct 1/1 ... 
[2025-12-04 10:45:44.382348][2194606.843334387], took 0.04min 2025-12-04T10:45:44.3832854Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:45:44.3882521Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:45:44.3884775Z Running inductor/test_utils 1/1 ... [2025-12-04 10:45:44.388359][2194606.849346521] 2025-12-04T10:45:44.3885110Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:45:44.3887135Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_utils.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:45:44.388585] 2025-12-04T10:45:46.9089253Z 2025-12-04T10:45:46.9089953Z inductor/test_utils 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_utils_1.1_f2a8d267e9b20bf6_.log 2025-12-04T10:45:46.9091917Z Running 7 items in this shard: test/inductor/test_utils.py::TestUtilsCUDA::testSympySubs_cuda, test/inductor/test_utils.py::TestUtilsCUDA::test_flops_fx_cuda, test/inductor/test_utils.py::TestUtilsCUDA::test_get_device_tflops_cuda_bfloat16, test/inductor/test_utils.py::TestUtilsCUDA::test_get_device_tflops_cuda_float16, test/inductor/test_utils.py::TestUtilsCUDA::test_get_device_tflops_cuda_float32, test/inductor/test_utils.py::TestUtilsCUDA::test_sympy_str_cuda, test/inductor/test_utils.py::TestUtilsCUDA::test_zip_schema_cuda 2025-12-04T10:45:46.9093046Z 2025-12-04T10:45:46.9093216Z Finished inductor/test_utils 1/1 ... 
[2025-12-04 10:45:46.908678][2194609.369664705], took 0.04min 2025-12-04T10:45:46.9095519Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:45:46.9147481Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:45:46.9150345Z Running inductor/test_indexing 1/1 ... [2025-12-04 10:45:46.914797][2194609.375786738] 2025-12-04T10:45:46.9150758Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:45:46.9151509Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_indexing.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:45:46.914980] 2025-12-04T10:47:09.6604814Z 2025-12-04T10:47:09.6609835Z inductor/test_indexing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_indexing_1.1_2b8c01d39cfa5104_.log 2025-12-04T10:47:09.6617026Z Running 22 items in this shard: test/inductor/test_indexing.py::TestIndexingSimplification::test_expand_floor_div_applied, test/inductor/test_indexing.py::TestIndexingSimplification::test_expand_floor_div_skipped, test/inductor/test_indexing.py::TestIndexingSimplification::test_floordiv_div_sympy_is_integer_bug, test/inductor/test_indexing.py::TestIndexingSimplification::test_indexing_join, test/inductor/test_indexing.py::TestIndexingSimplification::test_indexing_simplification, test/inductor/test_indexing.py::TestIndexingSimplification::test_int8_unpack, test/inductor/test_indexing.py::TestIndexingSimplification::test_modular_indexing_pairs_merged, test/inductor/test_indexing.py::TestIndexingSimplification::test_modular_indexing_pairs_not_merged, test/inductor/test_indexing.py::TestIndexingSimplification::test_modular_indexing_positive, 
test/inductor/test_indexing.py::ExprPrinterTests::test_print_Min_Max, test/inductor/test_indexing.py::ExprPrinterTests::test_print_ceil, test/inductor/test_indexing.py::ExprPrinterTests::test_print_floor, test/inductor/test_indexing.py::ExprPrinterTests::test_print_floor_div, test/inductor/test_indexing.py::ExprPrinterTests::test_print_integer, test/inductor/test_indexing.py::ExprPrinterTests::test_print_mod, test/inductor/test_indexing.py::ExprPrinterTests::test_print_mod_index, test/inductor/test_indexing.py::ExprPrinterTests::test_print_pow, test/inductor/test_indexing.py::ExprPrinterTests::test_print_python_mod, test/inductor/test_indexing.py::ExprPrinterTests::test_print_round, test/inductor/test_indexing.py::ExprPrinterTests::test_print_round_decimal_ndigits_-1, test/inductor/test_indexing.py::ExprPrinterTests::test_print_round_decimal_ndigits_0, test/inductor/test_indexing.py::ExprPrinterTests::test_print_round_decimal_ndigits_1 2025-12-04T10:47:09.6621978Z 2025-12-04T10:47:09.6622201Z Finished inductor/test_indexing 1/1 ... [2025-12-04 10:47:09.660090][2194692.121074834], took 1.38min 2025-12-04T10:47:09.6622946Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:47:09.6666141Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:47:09.6669204Z Running inductor/test_inductor_annotations 1/1 ... [2025-12-04 10:47:09.666593][2194692.127582722] 2025-12-04T10:47:09.6669487Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:47:09.6670853Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_inductor_annotations.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:47:09.666784] 2025-12-04T10:47:17.5426050Z 2025-12-04T10:47:17.5427134Z inductor/test_inductor_annotations 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_inductor_annotations_1.1_3372cccf39dbb290_.log 2025-12-04T10:47:17.5428610Z Running 2 items in this shard: test/inductor/test_inductor_annotations.py::InductorAnnotationTestCase::test_no_annotations, test/inductor/test_inductor_annotations.py::InductorAnnotationTestCase::test_training_annotation 2025-12-04T10:47:17.5429402Z 2025-12-04T10:47:17.5429716Z Finished inductor/test_inductor_annotations 1/1 ... [2025-12-04 10:47:17.542238][2194700.003224831], took 0.13min 2025-12-04T10:47:17.5432316Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:47:17.5480985Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:47:17.5482684Z Running inductor/test_compile_worker 1/1 ... [2025-12-04 10:47:17.548082][2194700.009071688] 2025-12-04T10:47:17.5483834Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:47:17.5484552Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_compile_worker.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:47:17.548275] 2025-12-04T10:47:54.2177971Z 2025-12-04T10:47:54.2178808Z inductor/test_compile_worker 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_compile_worker_1.1_d54b0538625a7b81_.log 2025-12-04T10:47:54.2182652Z Running 16 items in this shard: test/inductor/test_compile_worker.py::TestCompileWorker::test_basic_jobs, test/inductor/test_compile_worker.py::TestCompileWorker::test_crash, test/inductor/test_compile_worker.py::TestCompileWorker::test_exception, test/inductor/test_compile_worker.py::TestCompileWorker::test_logging, test/inductor/test_compile_worker.py::TestCompileWorker::test_quiesce, test/inductor/test_compile_worker.py::TestCompileWorker::test_quiesce_repeatedly, test/inductor/test_compile_worker.py::TestCompileWorkerWithTimer::test_basic_jobs, test/inductor/test_compile_worker.py::TestCompileWorkerWithTimer::test_crash, test/inductor/test_compile_worker.py::TestCompileWorkerWithTimer::test_exception, test/inductor/test_compile_worker.py::TestCompileWorkerWithTimer::test_logging, test/inductor/test_compile_worker.py::TestCompileWorkerWithTimer::test_quiesce, test/inductor/test_compile_worker.py::TestCompileWorkerWithTimer::test_quiesce_repeatedly, test/inductor/test_compile_worker.py::TestTimer::test_basics, test/inductor/test_compile_worker.py::TestTimer::test_never_fires, test/inductor/test_compile_worker.py::TestTimer::test_repeated_calls, test/inductor/test_compile_worker.py::TestTimer::test_spammy_calls 2025-12-04T10:47:54.2185802Z 2025-12-04T10:47:54.2186019Z Finished inductor/test_compile_worker 1/1 ... 
[2025-12-04 10:47:54.217438][2194736.678425622], took 0.61min 2025-12-04T10:47:54.2186716Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:47:54.2235171Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:47:54.2235739Z Running dynamo/test_einops 1/1 ... [2025-12-04 10:47:54.223444][2194736.684433347] 2025-12-04T10:47:54.2235994Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:47:54.2239024Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_einops.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:47:54.223642] 2025-12-04T10:47:56.0911716Z 2025-12-04T10:47:56.0912682Z dynamo/test_einops 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_einops_1.1_d7f49ecc4c178a49_.log 2025-12-04T10:47:56.0913957Z Running 3 items in this shard: test/dynamo/test_einops.py::TestEinops::test_functions_version_none, test/dynamo/test_einops.py::TestEinops::test_layers_version_none, test/dynamo/test_einops.py::TestEinops::test_no_recompile_on_lazy_state_version_none 2025-12-04T10:47:56.0914842Z 2025-12-04T10:47:56.0915087Z Finished dynamo/test_einops 1/1 ... [2025-12-04 10:47:56.090813][2194738.551797713], took 0.03min 2025-12-04T10:47:56.0920014Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:47:56.0973091Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:47:56.0976649Z Running inductor/test_external_callables 1/1 ... 
[2025-12-04 10:47:56.097340][2194738.558329261] 2025-12-04T10:47:56.0976983Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:47:56.0977652Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_external_callables.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:47:56.097528] 2025-12-04T10:48:08.5353185Z 2025-12-04T10:48:08.5354227Z inductor/test_external_callables 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_external_callables_1.1_3bb1cba84783a111_.log 2025-12-04T10:48:08.5355495Z Running 3 items in this shard: test/inductor/test_external_callables.py::TestInductorExternalCallable::test_matmul_cpu, test/inductor/test_external_callables.py::TestInductorExternalCallable::test_matmul_cuda, test/inductor/test_external_callables.py::TestInductorExternalCallable::test_matmul_dup 2025-12-04T10:48:08.5356335Z 2025-12-04T10:48:08.5356570Z Finished inductor/test_external_callables 1/1 ... [2025-12-04 10:48:08.535086][2194750.996070483], took 0.21min 2025-12-04T10:48:08.5361292Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:48:08.5411545Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:48:08.5411920Z Running dynamo/test_fx_passes_pre_grad 1/1 ... [2025-12-04 10:48:08.541000][2194751.001990099] 2025-12-04T10:48:08.5412197Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:48:08.5414283Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_fx_passes_pre_grad.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:48:08.541224] 2025-12-04T10:48:11.3108807Z 2025-12-04T10:48:11.3110323Z dynamo/test_fx_passes_pre_grad 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_fx_passes_pre_grad_1.1_102a6c5963938e36_.log 2025-12-04T10:48:11.3111565Z Running 1 items in this shard: test/dynamo/test_fx_passes_pre_grad.py::FxPassesPreGradTests::test_pass_execution_and_save 2025-12-04T10:48:11.3112097Z 2025-12-04T10:48:11.3112438Z Finished dynamo/test_fx_passes_pre_grad 1/1 ... [2025-12-04 10:48:11.310586][2194753.771568681], took 0.05min 2025-12-04T10:48:11.3120538Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T10:48:11.3167405Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:48:11.3170442Z Running inductor/test_fp8 1/1 ... [2025-12-04 10:48:11.316741][2194753.777730394] 2025-12-04T10:48:11.3170736Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:48:11.3171942Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_fp8.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 10:48:11.316969] 2025-12-04T12:10:19.4698138Z 2025-12-04T12:10:19.4698708Z PRINTING LOG FILE of inductor/test_fp8 1/1 (test/test-reports/inductor.test_fp8_1.1_79611dc7575145fb_.log) 2025-12-04T12:10:19.4707962Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-88c1daaaf6fe01fc.xml 2025-12-04T12:10:19.4708416Z ============================= test session starts ============================== 2025-12-04T12:10:19.4708758Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.4709054Z cachedir: .pytest_cache 2025-12-04T12:10:19.4709415Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.4709843Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.4710028Z configfile: pytest.ini 2025-12-04T12:10:19.4710434Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.4711516Z collecting ... 
collected 188 items 2025-12-04T12:10:19.4711762Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T12:10:19.4747640Z Running 188 items in this shard: test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e4m3fn_shape_1,1,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e4m3fn_shape_1,10,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e4m3fn_shape_1,10,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e4m3fn_shape_1,10,512_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e4m3fn_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e5m2_shape_1,1,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e5m2_shape_1,10,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e5m2_shape_1,10,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e5m2_shape_1,10,512_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e5m2_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e4m3fn_shape_1,1,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e4m3fn_shape_1,10,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e4m3fn_shape_1,10,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e4m3fn_shape_1,10,512_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e4m3fn_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e5m2_shape_1,1,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e5m2_shape_1,10,15_cuda, 
test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e5m2_shape_1,10,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e5m2_shape_1,10,512_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e5m2_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_bad_cast_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_bfloat16_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_benchmark_float8_e4m3fn_shape_4,2048,4096_keepdim_False_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_benchmark_float8_e4m3fn_shape_4,2048,4096_keepdim_True_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_benchmark_float8_e5m2_shape_4,2048,4096_keepdim_False_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_benchmark_float8_e5m2_shape_4,2048,4096_keepdim_True_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_False_shape_1,1,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_False_shape_1,10,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_False_shape_1,10,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_False_shape_1,10,512_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_False_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_True_shape_1,1,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_True_shape_1,10,15_cuda, 
test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_True_shape_1,10,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_True_shape_1,10,512_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_True_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_False_shape_1,1,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_False_shape_1,10,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_False_shape_1,10,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_False_shape_1,10,512_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_False_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_True_shape_1,1,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_True_shape_1,10,15_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_True_shape_1,10,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_True_shape_1,10,512_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_True_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_bfloat16_float8_e4m3fn_shape_16,16,16_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_bfloat16_float8_e4m3fn_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_bfloat16_float8_e5m2_shape_16,16,16_cuda, 
test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_bfloat16_float8_e5m2_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float16_float8_e4m3fn_shape_16,16,16_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float16_float8_e4m3fn_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float16_float8_e5m2_shape_16,16,16_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float16_float8_e5m2_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float32_float8_e4m3fn_shape_16,16,16_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float32_float8_e4m3fn_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float32_float8_e5m2_shape_16,16,16_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float32_float8_e5m2_shape_4,2048,4096_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_bfloat16_shape_15,3,13_dst_types0_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_bfloat16_shape_4,2048,4096_dst_types0_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_float16_shape_15,3,13_dst_types0_cuda_float16, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_float16_shape_4,2048,4096_dst_types0_cuda_float16, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_float32_shape_15,3,13_dst_types0_cuda_float32, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_float32_shape_4,2048,4096_dst_types0_cuda_float32, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_xblock_for_small_numel_float8_e4m3fn_cuda, test/inductor/test_fp8.py::TestFP8TypesCUDA::test_xblock_for_small_numel_float8_e5m2_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape0_use_fast_accum_False_scaling_block_sizes0_cuda, 
test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape0_use_fast_accum_False_scaling_block_sizes1_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape0_use_fast_accum_True_scaling_block_sizes0_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape0_use_fast_accum_True_scaling_block_sizes1_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape1_use_fast_accum_False_scaling_block_sizes0_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape1_use_fast_accum_False_scaling_block_sizes1_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape1_use_fast_accum_True_scaling_block_sizes0_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape1_use_fast_accum_True_scaling_block_sizes1_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_mx_fp8_max_autotune_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_mx_fusion_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda, 
test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda, 
test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda, 
test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_tma_template_shape_1024,1024,512_use_fast_accum_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_tma_template_shape_1024,1024,512_use_fast_accum_True_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_tma_template_shape_16,32,32_use_fast_accum_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_tma_template_shape_16,32,32_use_fast_accum_True_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_scaled_mm_preserves_strides_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda, 
test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda, 
test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda, 
test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16, 
test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_float32, 
test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_bfloat16_shape_1024,1024,512_use_fast_accum_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_bfloat16_shape_1024,1024,512_use_fast_accum_True_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_bfloat16_shape_16,32,32_use_fast_accum_False_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_bfloat16_shape_16,32,32_use_fast_accum_True_cuda_bfloat16, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_float32_shape_1024,1024,512_use_fast_accum_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_float32_shape_1024,1024,512_use_fast_accum_True_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_float32_shape_16,32,32_use_fast_accum_False_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_float32_shape_16,32,32_use_fast_accum_True_cuda_float32, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_unacceptable_input_dims_cuda, test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_unacceptable_scale_dims_rowwise_scaling_cuda 2025-12-04T12:10:19.4778927Z 2025-12-04T12:10:19.4779096Z 
inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e4m3fn_shape_1,1,15_cuda PASSED [0.9117s] [ 0%] 2025-12-04T12:10:19.4779448Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e4m3fn_shape_1,10,15_cuda PASSED [0.3756s] [ 1%] 2025-12-04T12:10:19.4779789Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e4m3fn_shape_1,10,4096_cuda PASSED [0.4159s] [ 1%] 2025-12-04T12:10:19.4780237Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e4m3fn_shape_1,10,512_cuda PASSED [0.1341s] [ 2%] 2025-12-04T12:10:19.4780650Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e4m3fn_shape_4,2048,4096_cuda PASSED [0.2954s] [ 2%] 2025-12-04T12:10:19.4781003Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e5m2_shape_1,1,15_cuda PASSED [0.2204s] [ 3%] 2025-12-04T12:10:19.4781344Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e5m2_shape_1,10,15_cuda PASSED [0.2256s] [ 3%] 2025-12-04T12:10:19.4781730Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e5m2_shape_1,10,4096_cuda PASSED [0.3414s] [ 4%] 2025-12-04T12:10:19.4782073Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e5m2_shape_1,10,512_cuda PASSED [0.2390s] [ 4%] 2025-12-04T12:10:19.4782457Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_along_with_fp8_quant_float8_e5m2_shape_4,2048,4096_cuda PASSED [0.2577s] [ 5%] 2025-12-04T12:10:19.4782785Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e4m3fn_shape_1,1,15_cuda PASSED [0.2157s] [ 5%] 2025-12-04T12:10:19.4783171Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e4m3fn_shape_1,10,15_cuda PASSED [0.2122s] [ 6%] 2025-12-04T12:10:19.4783490Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e4m3fn_shape_1,10,4096_cuda PASSED [0.2379s] [ 6%] 2025-12-04T12:10:19.4783807Z 
inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e4m3fn_shape_1,10,512_cuda PASSED [0.2240s] [ 7%] 2025-12-04T12:10:19.4784180Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e4m3fn_shape_4,2048,4096_cuda PASSED [0.2384s] [ 7%] 2025-12-04T12:10:19.4784493Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e5m2_shape_1,1,15_cuda PASSED [0.2055s] [ 8%] 2025-12-04T12:10:19.4784796Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e5m2_shape_1,10,15_cuda PASSED [0.2086s] [ 9%] 2025-12-04T12:10:19.4785145Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e5m2_shape_1,10,4096_cuda PASSED [0.2400s] [ 9%] 2025-12-04T12:10:19.4785462Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e5m2_shape_1,10,512_cuda PASSED [0.2355s] [ 10%] 2025-12-04T12:10:19.4785848Z inductor/test_fp8.py::TestFP8TypesCUDA::test_amax_fp8_quant_float8_e5m2_shape_4,2048,4096_cuda PASSED [0.2634s] [ 10%] 2025-12-04T12:10:19.4786409Z inductor/test_fp8.py::TestFP8TypesCUDA::test_bad_cast_cuda C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] Error in codegen for ComputedBuffer(name='buf0', layout=FixedLayout('cuda:0', torch.float8_e5m2, size=[s77, s77, s77], stride=[s77**2, s77, 1]), data=Pointwise( 2025-12-04T12:10:19.4786931Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] 'cuda', 2025-12-04T12:10:19.4787265Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] torch.float8_e5m2, 2025-12-04T12:10:19.4787583Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] def inner_fn(index): 2025-12-04T12:10:19.4789203Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] i0, i1, i2 = index 2025-12-04T12:10:19.4789528Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] tmp0 = ops.load(arg1_1, i2 + i0 * s77**2 + i1 
* s77) 2025-12-04T12:10:19.4789907Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] tmp1 = ops.to_dtype(tmp0, torch.float8_e5m2, src_dtype=torch.float8_e4m3fn) 2025-12-04T12:10:19.4790310Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] return tmp1 2025-12-04T12:10:19.4790556Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] , 2025-12-04T12:10:19.4790814Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] ranges=[s77, s77, s77], 2025-12-04T12:10:19.4791117Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] origin_node=convert_element_type, 2025-12-04T12:10:19.4791440Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] origins=OrderedSet([convert_element_type]), 2025-12-04T12:10:19.4791740Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] stack_traces = {, 2025-12-04T12:10:19.4792097Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] File "/var/lib/jenkins/pytorch/test/inductor/test_fp8.py", line 164, in fp8_cast, 2025-12-04T12:10:19.4792466Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] return x.to(dtype=dtype), 2025-12-04T12:10:19.4792731Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] , 2025-12-04T12:10:19.4792955Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] } 2025-12-04T12:10:19.4793295Z C1204 10:48:22.054000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/0] ), _split_size=None, _original_inner_fn=None, _original_ranges=None, _original_reduction_ranges=None) 2025-12-04T12:10:19.4793827Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] Error in codegen for ComputedBuffer(name='buf0', layout=FixedLayout('cuda:0', torch.float8_e4m3fn, 
size=[s77, s77, s77], stride=[s77**2, s77, 1]), data=Pointwise( 2025-12-04T12:10:19.4794257Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] 'cuda', 2025-12-04T12:10:19.4794573Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] torch.float8_e4m3fn, 2025-12-04T12:10:19.4794850Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] def inner_fn(index): 2025-12-04T12:10:19.4795123Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] i0, i1, i2 = index 2025-12-04T12:10:19.4795431Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] tmp0 = ops.load(arg1_1, i2 + i0 * s77**2 + i1 * s77) 2025-12-04T12:10:19.4795798Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] tmp1 = ops.to_dtype(tmp0, torch.float8_e4m3fn, src_dtype=torch.float8_e5m2) 2025-12-04T12:10:19.4796123Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] return tmp1 2025-12-04T12:10:19.4796366Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] , 2025-12-04T12:10:19.4796622Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] ranges=[s77, s77, s77], 2025-12-04T12:10:19.4796918Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] origin_node=convert_element_type, 2025-12-04T12:10:19.4797287Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] origins=OrderedSet([convert_element_type]), 2025-12-04T12:10:19.4797582Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] stack_traces = {, 2025-12-04T12:10:19.4797930Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] File "/var/lib/jenkins/pytorch/test/inductor/test_fp8.py", line 164, in fp8_cast, 2025-12-04T12:10:19.4798294Z C1204 10:48:22.093000 
544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] return x.to(dtype=dtype), 2025-12-04T12:10:19.4798555Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] , 2025-12-04T12:10:19.4798779Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] } 2025-12-04T12:10:19.4799118Z C1204 10:48:22.093000 544390 site-packages/torch/_inductor/scheduler.py:1683] [0/1] ), _split_size=None, _original_inner_fn=None, _original_ranges=None, _original_reduction_ranges=None) 2025-12-04T12:10:19.4799406Z PASSED [0.1644s] [ 11%] 2025-12-04T12:10:19.4799608Z inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_bfloat16_cuda_bfloat16 PASSED [46.3630s] [ 11%] 2025-12-04T12:10:19.4799922Z inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 ('RERUN', {'yellow': True}) [1.8318s] [ 12%] 2025-12-04T12:10:19.4800339Z inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 ('RERUN', {'yellow': True}) [1.3540s] [ 12%] 2025-12-04T12:10:19.4800646Z inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 FAILED [1.3850s] [ 12%] 2025-12-04T12:10:19.4800810Z 2025-12-04T12:10:19.4800868Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.4801054Z __________ TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 ___________ 2025-12-04T12:10:19.4801229Z Traceback (most recent call last): 2025-12-04T12:10:19.4801479Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4801715Z method(*args, **kwargs) 2025-12-04T12:10:19.4801939Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4802168Z method(*args, **kwargs) 2025-12-04T12:10:19.4802384Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.4802608Z with policy(): 2025-12-04T12:10:19.4802821Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.4803053Z raise RuntimeError(msg) 2025-12-04T12:10:19.4803497Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 1205862400 and is now 1317011456. 2025-12-04T12:10:19.4803862Z 2025-12-04T12:10:19.4803938Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.4804240Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.4804467Z 2025-12-04T12:10:19.4804556Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.4804760Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.4804923Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.4805057Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.4805260Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.4805840Z inductor [('triton_bundler_save_kernel', 144), ('benchmarking.InductorBenchmarker.benchmark', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 16), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.4806379Z graph_break [] 2025-12-04T12:10:19.4806544Z aten_mm_info 
[('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.4806776Z __________ TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 ___________ 2025-12-04T12:10:19.4806950Z Traceback (most recent call last): 2025-12-04T12:10:19.4807189Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4807423Z method(*args, **kwargs) 2025-12-04T12:10:19.4807647Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4807875Z method(*args, **kwargs) 2025-12-04T12:10:19.4808095Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.4808320Z with policy(): 2025-12-04T12:10:19.4808529Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.4808759Z raise RuntimeError(msg) 2025-12-04T12:10:19.4809150Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 1314914304 and is now 1352663040. 
2025-12-04T12:10:19.4809505Z 2025-12-04T12:10:19.4809581Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.4809877Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.4810173Z 2025-12-04T12:10:19.4810265Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.4810464Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.4810622Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.4810755Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.4810950Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.4811563Z inductor [('triton_bundler_save_kernel', 144), ('benchmarking.InductorBenchmarker.benchmark', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 16), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.4812069Z graph_break [] 2025-12-04T12:10:19.4812230Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.4812448Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.4812603Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.4812735Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.4812926Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.4813500Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), 
('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.4814005Z graph_break [] 2025-12-04T12:10:19.4814162Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.4814385Z =================================== FAILURES =================================== 2025-12-04T12:10:19.4814569Z __________ TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 ___________ 2025-12-04T12:10:19.4814741Z Traceback (most recent call last): 2025-12-04T12:10:19.4814977Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4815208Z method(*args, **kwargs) 2025-12-04T12:10:19.4815428Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4815656Z method(*args, **kwargs) 2025-12-04T12:10:19.4815875Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.4816118Z with policy(): 2025-12-04T12:10:19.4816329Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.4816564Z raise RuntimeError(msg) 2025-12-04T12:10:19.4816956Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 1350565888 and is now 1396703232. 
2025-12-04T12:10:19.4817314Z 2025-12-04T12:10:19.4817387Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.4817682Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.4817906Z 2025-12-04T12:10:19.4817995Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.4818193Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.4818350Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.4818485Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.4818678Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.4819248Z inductor [('triton_bundler_save_kernel', 144), ('benchmarking.InductorBenchmarker.benchmark', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 16), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.4819753Z graph_break [] 2025-12-04T12:10:19.4819910Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.4820201Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.4820358Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.4820486Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.4820680Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.4821248Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), 
('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.4821752Z graph_break [] 2025-12-04T12:10:19.4821910Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.4822129Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.4822286Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.4822416Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.4822607Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.4823214Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark', 19), ('benchmarking.InductorBenchmarker.benchmark_gpu', 19), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.4823716Z graph_break [] 2025-12-04T12:10:19.4823872Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.4824204Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-88c1daaaf6fe01fc.xml - 2025-12-04T12:10:19.4824491Z =========================== short test summary info ============================ 2025-12-04T12:10:19.4825053Z FAILED [1.3850s] inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 1350565888 and is now 1396703232. 
2025-12-04T12:10:19.4825546Z 2025-12-04T12:10:19.4825620Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.4825913Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.4826138Z 2025-12-04T12:10:19.4826229Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.4826419Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.4826587Z ==================== 1 failed, 22 passed, 2 rerun in 56.84s ==================== 2025-12-04T12:10:19.4826729Z Got exit code 1 2025-12-04T12:10:19.4826829Z Retrying single test... 2025-12-04T12:10:19.4827043Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-bcb1456ec0efb80d.xml 2025-12-04T12:10:19.4827287Z ============================= test session starts ============================== 2025-12-04T12:10:19.4827496Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.4827685Z cachedir: .pytest_cache 2025-12-04T12:10:19.4827911Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.4828151Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.4828272Z configfile: pytest.ini 2025-12-04T12:10:19.4828526Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.4828800Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.4829089Z stepcurrent: skipping 22 already run items. 
Running only test/inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.4829347Z Running 1 items in this shard 2025-12-04T12:10:19.4829422Z 2025-12-04T12:10:19.4829694Z inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 [W1204 10:49:20.406987525 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4830001Z 2025-12-04T12:10:19.4830204Z [W1204 10:49:28.886027841 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4830401Z 2025-12-04T12:10:19.4830562Z [W1204 10:49:28.886192888 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4830756Z 2025-12-04T12:10:19.4830915Z [W1204 10:49:28.888103921 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4831368Z 2025-12-04T12:10:19.4831536Z [W1204 10:49:28.888190380 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4831729Z 2025-12-04T12:10:19.4831915Z [W1204 10:49:28.888652423 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4832109Z 2025-12-04T12:10:19.4832267Z [W1204 10:49:28.888794541 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4832461Z 2025-12-04T12:10:19.4832624Z [W1204 10:49:28.888862380 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4832827Z 2025-12-04T12:10:19.4833006Z [W1204 10:49:28.889075047 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4833229Z 2025-12-04T12:10:19.4833414Z [W1204 10:49:28.889141847 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4833641Z 2025-12-04T12:10:19.4833798Z [W1204 10:49:28.889397043 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4834032Z 2025-12-04T12:10:19.4834190Z [W1204 10:49:28.889467072 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4834408Z 2025-12-04T12:10:19.4834567Z [W1204 10:49:28.889637020 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4834794Z 2025-12-04T12:10:19.4834958Z [W1204 10:49:28.889697259 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4835180Z 2025-12-04T12:10:19.4835361Z [W1204 10:49:28.889827367 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4835574Z 2025-12-04T12:10:19.4835757Z [W1204 10:49:28.889886626 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4835956Z 2025-12-04T12:10:19.4836111Z [W1204 10:49:28.890022004 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4836299Z 2025-12-04T12:10:19.4836450Z [W1204 10:49:28.890083183 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4836669Z 2025-12-04T12:10:19.4836826Z [W1204 10:49:29.362783136 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4837017Z 2025-12-04T12:10:19.4837169Z [W1204 10:49:29.362986803 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4837358Z 2025-12-04T12:10:19.4837508Z [W1204 10:49:29.363061012 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4837698Z 2025-12-04T12:10:19.4837846Z [W1204 10:49:29.363319459 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4838036Z 2025-12-04T12:10:19.4838185Z [W1204 10:49:29.363387468 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4838396Z 2025-12-04T12:10:19.4838553Z [W1204 10:49:29.363527746 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4838794Z 2025-12-04T12:10:19.4838964Z [W1204 10:49:29.363613775 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4884298Z 2025-12-04T12:10:19.4884532Z [W1204 10:49:29.363671734 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4884735Z 2025-12-04T12:10:19.4884913Z [W1204 10:49:29.363818432 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4885145Z 2025-12-04T12:10:19.4885338Z [W1204 10:49:29.363878311 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4885544Z 2025-12-04T12:10:19.4885770Z [W1204 10:49:29.364037159 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4885966Z 2025-12-04T12:10:19.4886199Z [W1204 10:49:29.364099948 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4886419Z 2025-12-04T12:10:19.4886618Z [W1204 10:49:29.364216696 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4886824Z 2025-12-04T12:10:19.4886984Z [W1204 10:49:29.364275425 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4887204Z 2025-12-04T12:10:19.4887370Z [W1204 10:49:29.364391994 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4887580Z 2025-12-04T12:10:19.4887744Z [W1204 10:49:29.364449833 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4887955Z 2025-12-04T12:10:19.4888153Z [W1204 10:49:29.364565331 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4888377Z 2025-12-04T12:10:19.4888557Z [W1204 10:49:29.364621980 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4888793Z 2025-12-04T12:10:19.4888858Z ('RERUN', {'yellow': True}) [10.0140s] [100%] 2025-12-04T12:10:19.4889237Z inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 [W1204 10:49:30.938536314 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4889541Z 2025-12-04T12:10:19.4889766Z [W1204 10:49:30.938743631 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4889970Z 2025-12-04T12:10:19.4890219Z [W1204 10:49:30.938825720 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4890418Z 2025-12-04T12:10:19.4890614Z [W1204 10:49:30.939083776 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4890830Z 2025-12-04T12:10:19.4890991Z [W1204 10:49:30.939154605 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4891243Z 2025-12-04T12:10:19.4891401Z [W1204 10:49:30.939296593 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4891610Z 2025-12-04T12:10:19.4891812Z [W1204 10:49:30.939384802 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4892086Z 2025-12-04T12:10:19.4892247Z [W1204 10:49:30.939442151 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4892477Z 2025-12-04T12:10:19.4892709Z [W1204 10:49:30.939591739 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4892925Z 2025-12-04T12:10:19.4893092Z [W1204 10:49:30.939650768 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4893326Z 2025-12-04T12:10:19.4893509Z [W1204 10:49:30.939804626 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4893708Z 2025-12-04T12:10:19.4893868Z [W1204 10:49:30.939863635 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4894060Z 2025-12-04T12:10:19.4894228Z [W1204 10:49:30.939992323 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4894418Z 2025-12-04T12:10:19.4894574Z [W1204 10:49:30.940053632 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4894772Z 2025-12-04T12:10:19.4894925Z [W1204 10:49:30.940167021 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4895119Z 2025-12-04T12:10:19.4895271Z [W1204 10:49:30.940223310 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4895468Z 2025-12-04T12:10:19.4895619Z [W1204 10:49:30.940340318 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4895818Z 2025-12-04T12:10:19.4895973Z [W1204 10:49:30.940396438 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4896175Z 2025-12-04T12:10:19.4896333Z [W1204 10:49:31.645545120 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4896534Z 2025-12-04T12:10:19.4896702Z [W1204 10:49:31.645768557 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4896914Z 2025-12-04T12:10:19.4897072Z [W1204 10:49:31.645841736 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4897266Z 2025-12-04T12:10:19.4897420Z [W1204 10:49:31.646098222 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4897610Z 2025-12-04T12:10:19.4897795Z [W1204 10:49:31.646171431 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4897987Z 2025-12-04T12:10:19.4898159Z [W1204 10:49:31.646316539 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4898353Z 2025-12-04T12:10:19.4898510Z [W1204 10:49:31.646404348 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4898704Z 2025-12-04T12:10:19.4898857Z [W1204 10:49:31.646462017 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4899055Z 2025-12-04T12:10:19.4899209Z [W1204 10:49:31.646609215 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4899401Z 2025-12-04T12:10:19.4899555Z [W1204 10:49:31.646668804 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4899751Z 2025-12-04T12:10:19.4899911Z [W1204 10:49:31.646823112 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4900198Z 2025-12-04T12:10:19.4900350Z [W1204 10:49:31.646883531 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4900545Z 2025-12-04T12:10:19.4900710Z [W1204 10:49:31.647014829 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4900938Z 2025-12-04T12:10:19.4901099Z [W1204 10:49:31.647082748 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4901309Z 2025-12-04T12:10:19.4901472Z [W1204 10:49:31.647199697 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4901662Z 2025-12-04T12:10:19.4901823Z [W1204 10:49:31.647257026 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4902018Z 2025-12-04T12:10:19.4902199Z [W1204 10:49:31.647376274 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4902394Z 2025-12-04T12:10:19.4902547Z [W1204 10:49:31.647433093 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4902746Z 2025-12-04T12:10:19.4902803Z ('RERUN', {'yellow': True}) [1.2606s] [100%] 2025-12-04T12:10:19.4903162Z inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 [W1204 10:49:31.202723210 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4903469Z 2025-12-04T12:10:19.4903625Z [W1204 10:49:31.202929427 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4903823Z 2025-12-04T12:10:19.4903986Z [W1204 10:49:31.202998296 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4904182Z 2025-12-04T12:10:19.4904337Z [W1204 10:49:31.203252622 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4904534Z 2025-12-04T12:10:19.4904691Z [W1204 10:49:31.203319312 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4904882Z 2025-12-04T12:10:19.4905043Z [W1204 10:49:31.203459700 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4905233Z 2025-12-04T12:10:19.4905426Z [W1204 10:49:31.203546368 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4905615Z 2025-12-04T12:10:19.4905793Z [W1204 10:49:31.203602908 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4905987Z 2025-12-04T12:10:19.4906144Z [W1204 10:49:31.203754805 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4906336Z 2025-12-04T12:10:19.4906501Z [W1204 10:49:31.203813665 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4906698Z 2025-12-04T12:10:19.4906854Z [W1204 10:49:31.203969682 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4907054Z 2025-12-04T12:10:19.4907223Z [W1204 10:49:31.204031522 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4907419Z 2025-12-04T12:10:19.4907572Z [W1204 10:49:31.204159430 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4907841Z 2025-12-04T12:10:19.4907996Z [W1204 10:49:31.204216749 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4908188Z 2025-12-04T12:10:19.4908341Z [W1204 10:49:31.204333317 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4908548Z 2025-12-04T12:10:19.4908704Z [W1204 10:49:31.204389487 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4908895Z 2025-12-04T12:10:19.4909057Z [W1204 10:49:31.204507485 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4909251Z 2025-12-04T12:10:19.4909407Z [W1204 10:49:31.204563234 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4909598Z 2025-12-04T12:10:19.4909756Z [W1204 10:49:32.936423390 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4909947Z 2025-12-04T12:10:19.4910170Z [W1204 10:49:32.936612417 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4910363Z 2025-12-04T12:10:19.4910518Z [W1204 10:49:32.936681677 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4910713Z 2025-12-04T12:10:19.4910869Z [W1204 10:49:32.936923263 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4911065Z 2025-12-04T12:10:19.4911219Z [W1204 10:49:32.936989692 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4911440Z 2025-12-04T12:10:19.4911591Z [W1204 10:49:32.937139180 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4911787Z 2025-12-04T12:10:19.4911939Z [W1204 10:49:32.937230309 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4912138Z 2025-12-04T12:10:19.4912293Z [W1204 10:49:32.937287818 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4912487Z 2025-12-04T12:10:19.4912676Z [W1204 10:49:32.937437486 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4912868Z 2025-12-04T12:10:19.4913024Z [W1204 10:49:32.937497435 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4913218Z 2025-12-04T12:10:19.4913374Z [W1204 10:49:32.937649303 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4913571Z 2025-12-04T12:10:19.4913727Z [W1204 10:49:32.937708042 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4913964Z 2025-12-04T12:10:19.4914156Z [W1204 10:49:32.937834080 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4914353Z 2025-12-04T12:10:19.4914551Z [W1204 10:49:32.937891949 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4914748Z 2025-12-04T12:10:19.4914900Z [W1204 10:49:32.938011868 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4915130Z 2025-12-04T12:10:19.4915288Z [W1204 10:49:32.938072427 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4915480Z 2025-12-04T12:10:19.4915631Z [W1204 10:49:32.938191975 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4915867Z 2025-12-04T12:10:19.4916022Z [W1204 10:49:32.938249304 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4916217Z 2025-12-04T12:10:19.4916264Z FAILED [1.3031s] [100%] 2025-12-04T12:10:19.4916330Z 2025-12-04T12:10:19.4916391Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.4916586Z __________ TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 ___________ 2025-12-04T12:10:19.4916772Z Traceback (most recent call last): 2025-12-04T12:10:19.4917021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4917271Z method(*args, **kwargs) 2025-12-04T12:10:19.4917500Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4917738Z method(*args, **kwargs) 2025-12-04T12:10:19.4917961Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.4918251Z with policy(): 2025-12-04T12:10:19.4918476Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.4918718Z raise RuntimeError(msg) 2025-12-04T12:10:19.4919125Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 807403520 and is now 970981376. 
2025-12-04T12:10:19.4919523Z
2025-12-04T12:10:19.4919611Z To execute this test, run the following from the base repo dir:
2025-12-04T12:10:19.4919920Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16
2025-12-04T12:10:19.4920202Z
2025-12-04T12:10:19.4920303Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0
2025-12-04T12:10:19.4920519Z ----------------------------- Captured stdout call -----------------------------
2025-12-04T12:10:19.4920737Z frames [('total', 2), ('ok', 2)]
2025-12-04T12:10:19.4920886Z stats [('calls_captured', 22), ('unique_graphs', 2)]
2025-12-04T12:10:19.4921467Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)]
2025-12-04T12:10:19.4922055Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)]
2025-12-04T12:10:19.4922248Z graph_break []
2025-12-04T12:10:19.4922453Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)]
2025-12-04T12:10:19.4922682Z ----------------------------- Captured stderr call -----------------------------
2025-12-04T12:10:19.4923164Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.)
2025-12-04T12:10:19.4923619Z if out == self.unknown_value: 2025-12-04T12:10:19.4923796Z __________ TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 ___________ 2025-12-04T12:10:19.4923996Z Traceback (most recent call last): 2025-12-04T12:10:19.4924283Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4924530Z method(*args, **kwargs) 2025-12-04T12:10:19.4924772Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4925015Z method(*args, **kwargs) 2025-12-04T12:10:19.4925284Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.4925526Z with policy(): 2025-12-04T12:10:19.4925761Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.4926032Z raise RuntimeError(msg) 2025-12-04T12:10:19.4926436Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 968884224 and is now 1002438656. 
2025-12-04T12:10:19.4926810Z
2025-12-04T12:10:19.4926929Z To execute this test, run the following from the base repo dir:
2025-12-04T12:10:19.4927268Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16
2025-12-04T12:10:19.4927558Z
2025-12-04T12:10:19.4927650Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0
2025-12-04T12:10:19.4927863Z ----------------------------- Captured stdout call -----------------------------
2025-12-04T12:10:19.4928033Z frames [('total', 2), ('ok', 2)]
2025-12-04T12:10:19.4928177Z stats [('calls_captured', 22), ('unique_graphs', 2)]
2025-12-04T12:10:19.4928713Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)]
2025-12-04T12:10:19.4929300Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)]
2025-12-04T12:10:19.4929490Z graph_break []
2025-12-04T12:10:19.4929658Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)]
2025-12-04T12:10:19.4929884Z ----------------------------- Captured stderr call -----------------------------
2025-12-04T12:10:19.4930450Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.)
2025-12-04T12:10:19.4930890Z if out == self.unknown_value: 2025-12-04T12:10:19.4931042Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.4931203Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.4931362Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.4931563Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.4932146Z inductor [('triton_bundler_save_kernel', 112), ('async_compile_cache_miss', 12), ('benchmarking.InductorBenchmarker.benchmark', 11), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.4932661Z graph_break [] 2025-12-04T12:10:19.4932829Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.4933026Z =================================== FAILURES =================================== 2025-12-04T12:10:19.4933240Z __________ TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 ___________ 2025-12-04T12:10:19.4933417Z Traceback (most recent call last): 2025-12-04T12:10:19.4933661Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4933922Z method(*args, **kwargs) 2025-12-04T12:10:19.4934156Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.4934401Z method(*args, **kwargs) 2025-12-04T12:10:19.4934621Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.4934860Z with policy(): 2025-12-04T12:10:19.4935082Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.4935319Z raise RuntimeError(msg) 2025-12-04T12:10:19.4935757Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 1000341504 and is now 1033895936. 2025-12-04T12:10:19.4936140Z 2025-12-04T12:10:19.4936226Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.4936537Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.4936780Z 2025-12-04T12:10:19.4936868Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.4937085Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.4937247Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.4937381Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.4937912Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.4938482Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.4938660Z graph_break [] 2025-12-04T12:10:19.4938825Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.4939053Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.4939552Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:19.4939979Z if out == self.unknown_value: 2025-12-04T12:10:19.4940202Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.4940360Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.4940495Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.4940688Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.4941262Z inductor [('triton_bundler_save_kernel', 112), ('async_compile_cache_miss', 12), ('benchmarking.InductorBenchmarker.benchmark', 11), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.4941770Z graph_break [] 2025-12-04T12:10:19.4941930Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.4942180Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.4942338Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.4942471Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.4942667Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.4943235Z inductor [('triton_bundler_save_kernel', 112), ('async_compile_cache_miss', 12), ('benchmarking.InductorBenchmarker.benchmark', 11), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), 
('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.4943745Z graph_break [] 2025-12-04T12:10:19.4943913Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.4944256Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-bcb1456ec0efb80d.xml - 2025-12-04T12:10:19.4944558Z =========================== short test summary info ============================ 2025-12-04T12:10:19.4945125Z FAILED [1.3031s] inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 1000341504 and is now 1033895936. 2025-12-04T12:10:19.4945623Z 2025-12-04T12:10:19.4945697Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.4946025Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.4946257Z 2025-12-04T12:10:19.4946348Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.4946550Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.4946731Z ================= 1 failed, 187 deselected, 2 rerun in 12.60s ================== 2025-12-04T12:10:19.4946895Z Got exit code 1 2025-12-04T12:10:19.4947000Z Retrying single test... 
2025-12-04T12:10:19.4947220Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-9bec2110e28243b1.xml 2025-12-04T12:10:19.4947500Z ============================= test session starts ============================== 2025-12-04T12:10:19.4947735Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.4947936Z cachedir: .pytest_cache 2025-12-04T12:10:19.4948208Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.4948456Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.4948587Z configfile: pytest.ini 2025-12-04T12:10:19.4948822Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.4949101Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.4949416Z stepcurrent: skipping 22 already run items. Running only test/inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.4949685Z Running 1 items in this shard 2025-12-04T12:10:19.4949780Z 2025-12-04T12:10:19.4950056Z inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 [W1204 10:49:40.712189600 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4950498Z 2025-12-04T12:10:19.4950686Z [W1204 10:49:47.997549077 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4951034Z 2025-12-04T12:10:19.4951265Z [W1204 10:49:47.997729205 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4951498Z 2025-12-04T12:10:19.4951667Z [W1204 10:49:47.999377982 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4951931Z 2025-12-04T12:10:19.4952110Z [W1204 10:49:47.999470990 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4952338Z 2025-12-04T12:10:19.4952550Z [W1204 10:49:47.999964074 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4952775Z 2025-12-04T12:10:19.4952978Z [W1204 10:49:47.000107062 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4953199Z 2025-12-04T12:10:19.4953387Z [W1204 10:49:47.000175101 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4953589Z 2025-12-04T12:10:19.4953803Z [W1204 10:49:47.000387538 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4954066Z 2025-12-04T12:10:19.4954445Z [W1204 10:49:47.000450617 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4954752Z 2025-12-04T12:10:19.4954930Z [W1204 10:49:47.000713163 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4955235Z 2025-12-04T12:10:19.4955412Z [W1204 10:49:47.000782312 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4955725Z 2025-12-04T12:10:19.4955933Z [W1204 10:49:47.000949700 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4956161Z 2025-12-04T12:10:19.4956324Z [W1204 10:49:47.001014979 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4956535Z 2025-12-04T12:10:19.4957062Z [W1204 10:49:47.001150357 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4957292Z 2025-12-04T12:10:19.4957725Z [W1204 10:49:47.001209026 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4957959Z 2025-12-04T12:10:19.4958222Z [W1204 10:49:47.001337174 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4958434Z 2025-12-04T12:10:19.4958687Z [W1204 10:49:47.001397743 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4958995Z 2025-12-04T12:10:19.4959186Z [W1204 10:49:49.568573080 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4959444Z 2025-12-04T12:10:19.4959655Z [W1204 10:49:49.568777937 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4959929Z 2025-12-04T12:10:19.4960206Z [W1204 10:49:49.568857716 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4960477Z 2025-12-04T12:10:19.4960701Z [W1204 10:49:49.569129602 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4960957Z 2025-12-04T12:10:19.4961141Z [W1204 10:49:49.569203491 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4961469Z 2025-12-04T12:10:19.4961683Z [W1204 10:49:49.569352049 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4961915Z 2025-12-04T12:10:19.4962113Z [W1204 10:49:49.569441888 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4962332Z 2025-12-04T12:10:19.4962550Z [W1204 10:49:49.569498987 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4962781Z 2025-12-04T12:10:19.4962992Z [W1204 10:49:49.569650335 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4963214Z 2025-12-04T12:10:19.4963398Z [W1204 10:49:49.569708844 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4963622Z 2025-12-04T12:10:19.4963781Z [W1204 10:49:49.569866382 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4964050Z 2025-12-04T12:10:19.4964211Z [W1204 10:49:49.569925191 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4964422Z 2025-12-04T12:10:19.4964584Z [W1204 10:49:49.570054519 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4964797Z 2025-12-04T12:10:19.4965003Z [W1204 10:49:49.570118028 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4965285Z 2025-12-04T12:10:19.4965694Z [W1204 10:49:49.570235577 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4965930Z 2025-12-04T12:10:19.4966273Z [W1204 10:49:49.570292476 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4966505Z 2025-12-04T12:10:19.4966764Z [W1204 10:49:49.570409124 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4966996Z 2025-12-04T12:10:19.4967230Z [W1204 10:49:49.570466773 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4967481Z 2025-12-04T12:10:19.4967670Z ('RERUN', {'yellow': True}) [10.0029s] [100%] 2025-12-04T12:10:19.4968201Z inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 [W1204 10:49:49.272840432 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4968542Z 2025-12-04T12:10:19.4968730Z [W1204 10:49:49.273041969 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4969017Z 2025-12-04T12:10:19.4969208Z [W1204 10:49:49.273114268 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4969465Z 2025-12-04T12:10:19.4969632Z [W1204 10:49:49.273367045 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4969863Z 2025-12-04T12:10:19.4970044Z [W1204 10:49:49.273432634 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4970386Z 2025-12-04T12:10:19.4970847Z [W1204 10:49:49.273574342 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.4971056Z 2025-12-04T12:10:19.4971291Z [W1204 10:49:49.273659861 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4971560Z 2025-12-04T12:10:19.4971784Z [W1204 10:49:49.273716370 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4972000Z 2025-12-04T12:10:19.4972216Z [W1204 10:49:49.273866888 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4972411Z 2025-12-04T12:10:19.4972633Z [W1204 10:49:49.273925177 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4972846Z 2025-12-04T12:10:19.4973008Z [W1204 10:49:49.274077715 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4973235Z 2025-12-04T12:10:19.4976334Z [W1204 10:49:49.274139194 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4976651Z 2025-12-04T12:10:19.4977018Z [W1204 10:49:49.274274002 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4977261Z 2025-12-04T12:10:19.4977453Z [W1204 10:49:49.274330511 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4977716Z 2025-12-04T12:10:19.4977929Z [W1204 10:49:49.274451339 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.4978262Z 2025-12-04T12:10:19.5038607Z [W1204 10:49:49.274506939 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.5039019Z 2025-12-04T12:10:19.5039236Z [W1204 10:49:49.274619547 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5039500Z 2025-12-04T12:10:19.5039722Z [W1204 10:49:49.274674476 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5040043Z 2025-12-04T12:10:19.5040291Z [W1204 10:49:50.982721785 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5040566Z 2025-12-04T12:10:19.5040827Z [W1204 10:49:50.982904703 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5041076Z 2025-12-04T12:10:19.5041429Z [W1204 10:49:50.982972842 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5041665Z 2025-12-04T12:10:19.5041899Z [W1204 10:49:50.983227798 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5042161Z 2025-12-04T12:10:19.5042382Z [W1204 10:49:50.983294707 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5042619Z 2025-12-04T12:10:19.5042882Z [W1204 10:49:50.983431215 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5043096Z 2025-12-04T12:10:19.5043343Z [W1204 10:49:50.983516384 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5043639Z 2025-12-04T12:10:19.5043935Z [W1204 10:49:50.983573463 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.5044230Z 2025-12-04T12:10:19.5044437Z [W1204 10:49:50.983718291 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5044769Z 2025-12-04T12:10:19.5044949Z [W1204 10:49:50.983777430 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5045238Z 2025-12-04T12:10:19.5045443Z [W1204 10:49:50.983928098 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5045733Z 2025-12-04T12:10:19.5046021Z [W1204 10:49:50.983985457 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5046253Z 2025-12-04T12:10:19.5046506Z [W1204 10:49:50.984117386 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5046733Z 2025-12-04T12:10:19.5046967Z [W1204 10:49:50.984175965 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5047191Z 2025-12-04T12:10:19.5047475Z [W1204 10:49:50.984293463 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5047744Z 2025-12-04T12:10:19.5048005Z [W1204 10:49:50.984351152 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5048322Z 2025-12-04T12:10:19.5048603Z [W1204 10:49:50.984465061 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5048854Z 2025-12-04T12:10:19.5049073Z [W1204 10:49:50.984520830 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.5049354Z 2025-12-04T12:10:19.5049424Z ('RERUN', {'yellow': True}) [1.4159s] [100%] 2025-12-04T12:10:19.5049827Z inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 [W1204 10:49:51.702709996 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5050285Z 2025-12-04T12:10:19.5050456Z [W1204 10:49:51.702909353 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5050668Z 2025-12-04T12:10:19.5050841Z [W1204 10:49:51.702981912 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5051043Z 2025-12-04T12:10:19.5051324Z [W1204 10:49:51.703242649 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5051581Z 2025-12-04T12:10:19.5051898Z [W1204 10:49:51.703316878 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5052158Z 2025-12-04T12:10:19.5052464Z [W1204 10:49:51.703471226 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5052689Z 2025-12-04T12:10:19.5052907Z [W1204 10:49:51.703559894 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5053198Z 2025-12-04T12:10:19.5053519Z [W1204 10:49:51.703616564 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5053790Z 2025-12-04T12:10:19.5053984Z [W1204 10:49:51.703770641 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.5054239Z 2025-12-04T12:10:19.5054448Z [W1204 10:49:51.703830421 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5054704Z 2025-12-04T12:10:19.5054898Z [W1204 10:49:51.703985278 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5055193Z 2025-12-04T12:10:19.5055375Z [W1204 10:49:51.704061557 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5055687Z 2025-12-04T12:10:19.5055916Z [W1204 10:49:51.704200465 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5056151Z 2025-12-04T12:10:19.5056374Z [W1204 10:49:51.704257855 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5056593Z 2025-12-04T12:10:19.5056830Z [W1204 10:49:51.704377063 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5057080Z 2025-12-04T12:10:19.5057385Z [W1204 10:49:51.704433042 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5057663Z 2025-12-04T12:10:19.5057865Z [W1204 10:49:51.704548100 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5058166Z 2025-12-04T12:10:19.5058391Z [W1204 10:49:51.704604210 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5058773Z 2025-12-04T12:10:19.5058948Z [W1204 10:49:51.437603307 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.5059243Z 2025-12-04T12:10:19.5059430Z [W1204 10:49:51.437784485 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5059702Z 2025-12-04T12:10:19.5059970Z [W1204 10:49:51.437853654 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5060221Z 2025-12-04T12:10:19.5060452Z [W1204 10:49:51.438111770 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5060735Z 2025-12-04T12:10:19.5060941Z [W1204 10:49:51.438180759 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5061196Z 2025-12-04T12:10:19.5061385Z [W1204 10:49:51.438315897 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5061728Z 2025-12-04T12:10:19.5061995Z [W1204 10:49:51.438402556 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5062221Z 2025-12-04T12:10:19.5062510Z [W1204 10:49:51.438458775 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5062755Z 2025-12-04T12:10:19.5063046Z [W1204 10:49:51.438603303 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5063285Z 2025-12-04T12:10:19.5063667Z [W1204 10:49:51.438661863 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5063913Z 2025-12-04T12:10:19.5064101Z [W1204 10:49:51.438815130 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.5064388Z 2025-12-04T12:10:19.5064617Z [W1204 10:49:51.438872460 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5064918Z 2025-12-04T12:10:19.5065131Z [W1204 10:49:51.438997358 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5065442Z 2025-12-04T12:10:19.5065640Z [W1204 10:49:51.439057537 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5065977Z 2025-12-04T12:10:19.5066170Z [W1204 10:49:51.439177745 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5066434Z 2025-12-04T12:10:19.5066714Z [W1204 10:49:51.439233794 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5066939Z 2025-12-04T12:10:19.5067172Z [W1204 10:49:51.439352783 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5067392Z 2025-12-04T12:10:19.5067635Z [W1204 10:49:51.439408632 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.5067857Z 2025-12-04T12:10:19.5068015Z FAILED [1.4372s] [100%] 2025-12-04T12:10:19.5068123Z 2025-12-04T12:10:19.5068408Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.5068761Z __________ TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 ___________ 2025-12-04T12:10:19.5069074Z Traceback (most recent call last): 2025-12-04T12:10:19.5069507Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5069860Z method(*args, **kwargs) 2025-12-04T12:10:19.5070275Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5070628Z method(*args, **kwargs) 2025-12-04T12:10:19.5070974Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5071357Z with policy(): 2025-12-04T12:10:19.5071727Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5072126Z raise RuntimeError(msg) 2025-12-04T12:10:19.5072818Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 807403520 and is now 970981376. 
2025-12-04T12:10:19.5073369Z 2025-12-04T12:10:19.5073488Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5073863Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.5074191Z 2025-12-04T12:10:19.5074369Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5074687Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5074955Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.5075163Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.5075871Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.5076565Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.5076791Z graph_break [] 2025-12-04T12:10:19.5077159Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.5077503Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5078186Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:19.5078820Z if out == self.unknown_value: 2025-12-04T12:10:19.5079070Z __________ TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 ___________ 2025-12-04T12:10:19.5079318Z Traceback (most recent call last): 2025-12-04T12:10:19.5079626Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5079955Z method(*args, **kwargs) 2025-12-04T12:10:19.5080291Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5080855Z method(*args, **kwargs) 2025-12-04T12:10:19.5081222Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5081527Z with policy(): 2025-12-04T12:10:19.5081908Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5082209Z raise RuntimeError(msg) 2025-12-04T12:10:19.5082682Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 968884224 and is now 1006632960. 
2025-12-04T12:10:19.5083109Z 2025-12-04T12:10:19.5083251Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5083792Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.5084092Z 2025-12-04T12:10:19.5084193Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5084503Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5084726Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.5084921Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.5085554Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.5086230Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.5086521Z graph_break [] 2025-12-04T12:10:19.5086828Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.5087106Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5087621Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:19.5088175Z if out == self.unknown_value: 2025-12-04T12:10:19.5088426Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5088665Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.5089022Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.5089273Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.5090023Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.5090744Z graph_break [] 2025-12-04T12:10:19.5090997Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.5091300Z =================================== FAILURES =================================== 2025-12-04T12:10:19.5091591Z __________ TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 ___________ 2025-12-04T12:10:19.5091834Z Traceback (most recent call last): 2025-12-04T12:10:19.5092138Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5092475Z method(*args, **kwargs) 2025-12-04T12:10:19.5092895Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5093244Z method(*args, **kwargs) 2025-12-04T12:10:19.5093549Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5093890Z with policy(): 2025-12-04T12:10:19.5094205Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5094491Z raise RuntimeError(msg) 2025-12-04T12:10:19.5095042Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 1004535808 and is now 1042284544. 2025-12-04T12:10:19.5095457Z 2025-12-04T12:10:19.5095569Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5095973Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.5096241Z 2025-12-04T12:10:19.5096368Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5096777Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5097032Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.5097299Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.5097937Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.5098609Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.5098856Z graph_break [] 2025-12-04T12:10:19.5099077Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.5099459Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5100050Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:19.5100669Z if out == self.unknown_value: 2025-12-04T12:10:19.5101138Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5101400Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.5101682Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.5101953Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.5102612Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), ('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.5103263Z graph_break [] 2025-12-04T12:10:19.5103507Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.5103779Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5104084Z frames [('total', 2), ('ok', 2)] 2025-12-04T12:10:19.5104330Z stats [('calls_captured', 22), ('unique_graphs', 2)] 2025-12-04T12:10:19.5104695Z aot_autograd [('total', 2), ('autograd_cache_miss', 2), ('autograd_cache_saved', 2), ('ok', 2)] 2025-12-04T12:10:19.5105388Z inductor [('triton_bundler_save_kernel', 128), ('benchmarking.InductorBenchmarker.benchmark', 14), ('benchmarking.InductorBenchmarker.benchmark_gpu', 14), ('async_compile_cache_miss', 12), ('async_compile_cache_hit', 6), ('pattern_matcher_count', 4), ('pattern_matcher_nodes', 4), ('extern_calls', 4), 
('fxgraph_cache_miss', 2), ('triton_bundler_save_static_autotuner', 2)] 2025-12-04T12:10:19.5106045Z graph_break [] 2025-12-04T12:10:19.5106290Z aten_mm_info [('aten._scaled_mm.default_s77_s0_s77', 1), ('aten._scaled_mm.default_s77_s0_s27', 1)] 2025-12-04T12:10:19.5106773Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-9bec2110e28243b1.xml - 2025-12-04T12:10:19.5107144Z =========================== short test summary info ============================ 2025-12-04T12:10:19.5107820Z FAILED [1.4372s] inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16! Caching allocator allocated memory was 0 and is now reported as 4096 on device 0. CUDA driver allocated memory was 1004535808 and is now 1042284544. 2025-12-04T12:10:19.5108420Z 2025-12-04T12:10:19.5108521Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5108898Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8TypesCUDA.test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.5109152Z 2025-12-04T12:10:19.5109424Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5109823Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.5110189Z ================= 1 failed, 187 deselected, 2 rerun in 12.88s ================== 2025-12-04T12:10:19.5110383Z Got exit code 1 2025-12-04T12:10:19.5110696Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16 2025-12-04T12:10:19.5111164Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:19.5111501Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0009856d3858a02c.xml 2025-12-04T12:10:19.5111817Z ============================= test session starts ============================== 2025-12-04T12:10:19.5112184Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.5112463Z cachedir: .pytest_cache 2025-12-04T12:10:19.5112872Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.5113232Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.5113460Z configfile: pytest.ini 2025-12-04T12:10:19.5113773Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.5114196Z collecting ... collected 188 items / 23 deselected / 165 selected 2025-12-04T12:10:19.5114458Z stepcurrent: skipping 23 already run items. 
2025-12-04T12:10:19.5114651Z Running 165 items in this shard 2025-12-04T12:10:19.5114821Z 2025-12-04T12:10:19.5115176Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_benchmark_float8_e4m3fn_shape_4,2048,4096_keepdim_False_cuda [W1204 10:50:00.976974621 collection.cpp:1148] Warning: ROCTracer produced duplicate flow start: 1 (function operator()) 2025-12-04T12:10:19.5115683Z PASSED [2.8152s] [ 0%] 2025-12-04T12:10:19.5116018Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_benchmark_float8_e4m3fn_shape_4,2048,4096_keepdim_True_cuda PASSED [2.2714s] [ 1%] 2025-12-04T12:10:19.5116497Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_benchmark_float8_e5m2_shape_4,2048,4096_keepdim_False_cuda PASSED [2.2277s] [ 1%] 2025-12-04T12:10:19.5117061Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_benchmark_float8_e5m2_shape_4,2048,4096_keepdim_True_cuda PASSED [2.2500s] [ 2%] 2025-12-04T12:10:19.5117564Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_False_shape_1,1,15_cuda PASSED [0.3212s] [ 3%] 2025-12-04T12:10:19.5118099Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_False_shape_1,10,15_cuda PASSED [0.4097s] [ 3%] 2025-12-04T12:10:19.5118572Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_False_shape_1,10,4096_cuda PASSED [0.3897s] [ 4%] 2025-12-04T12:10:19.5119072Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_False_shape_1,10,512_cuda PASSED [0.3110s] [ 4%] 2025-12-04T12:10:19.5119545Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_False_shape_4,2048,4096_cuda PASSED [0.3904s] [ 5%] 2025-12-04T12:10:19.5119999Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_True_shape_1,1,15_cuda PASSED [0.2499s] [ 6%] 2025-12-04T12:10:19.5120499Z 
inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_True_shape_1,10,15_cuda PASSED [0.3929s] [ 6%] 2025-12-04T12:10:19.5120970Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_True_shape_1,10,4096_cuda PASSED [0.3925s] [ 7%] 2025-12-04T12:10:19.5121464Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_True_shape_1,10,512_cuda PASSED [0.2918s] [ 7%] 2025-12-04T12:10:19.5122093Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e4m3fn_amax_keep_dim_True_shape_4,2048,4096_cuda PASSED [0.4054s] [ 8%] 2025-12-04T12:10:19.5122587Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_False_shape_1,1,15_cuda PASSED [0.2547s] [ 9%] 2025-12-04T12:10:19.5123111Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_False_shape_1,10,15_cuda PASSED [0.3947s] [ 9%] 2025-12-04T12:10:19.5123569Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_False_shape_1,10,4096_cuda PASSED [0.3784s] [ 10%] 2025-12-04T12:10:19.5124039Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_False_shape_1,10,512_cuda PASSED [0.1830s] [ 10%] 2025-12-04T12:10:19.5124489Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_False_shape_4,2048,4096_cuda PASSED [0.2708s] [ 11%] 2025-12-04T12:10:19.5124985Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_True_shape_1,1,15_cuda PASSED [0.1410s] [ 12%] 2025-12-04T12:10:19.5125507Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_True_shape_1,10,15_cuda PASSED [0.6430s] [ 12%] 2025-12-04T12:10:19.5125975Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_True_shape_1,10,4096_cuda PASSED 
[0.2757s] [ 13%] 2025-12-04T12:10:19.5126633Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_True_shape_1,10,512_cuda PASSED [0.1778s] [ 13%] 2025-12-04T12:10:19.5127088Z inductor/test_fp8.py::TestFP8TypesCUDA::test_layernorm_fp8_quant_float8_e5m2_amax_keep_dim_True_shape_4,2048,4096_cuda PASSED [0.2761s] [ 14%] 2025-12-04T12:10:19.5127559Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_bfloat16_float8_e4m3fn_shape_16,16,16_cuda PASSED [0.5002s] [ 15%] 2025-12-04T12:10:19.5127989Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_bfloat16_float8_e4m3fn_shape_4,2048,4096_cuda PASSED [0.4530s] [ 15%] 2025-12-04T12:10:19.5128487Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_bfloat16_float8_e5m2_shape_16,16,16_cuda PASSED [0.3917s] [ 16%] 2025-12-04T12:10:19.5128947Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_bfloat16_float8_e5m2_shape_4,2048,4096_cuda PASSED [0.4152s] [ 16%] 2025-12-04T12:10:19.5129388Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float16_float8_e4m3fn_shape_16,16,16_cuda PASSED [0.3784s] [ 17%] 2025-12-04T12:10:19.5129858Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float16_float8_e4m3fn_shape_4,2048,4096_cuda PASSED [0.3934s] [ 18%] 2025-12-04T12:10:19.5130630Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float16_float8_e5m2_shape_16,16,16_cuda PASSED [0.3874s] [ 18%] 2025-12-04T12:10:19.5131019Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float16_float8_e5m2_shape_4,2048,4096_cuda PASSED [0.4044s] [ 19%] 2025-12-04T12:10:19.5131445Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float32_float8_e4m3fn_shape_16,16,16_cuda PASSED [0.3503s] [ 20%] 2025-12-04T12:10:19.5131879Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float32_float8_e4m3fn_shape_4,2048,4096_cuda PASSED [0.3871s] [ 20%] 2025-12-04T12:10:19.5132299Z 
inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float32_float8_e5m2_shape_16,16,16_cuda PASSED [0.3500s] [ 21%] 2025-12-04T12:10:19.5132724Z inductor/test_fp8.py::TestFP8TypesCUDA::test_to_fp8_saturated_float32_float8_e5m2_shape_4,2048,4096_cuda PASSED [0.3808s] [ 21%] 2025-12-04T12:10:19.5133217Z inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_bfloat16_shape_15,3,13_dst_types0_cuda_bfloat16 PASSED [0.2574s] [ 22%] 2025-12-04T12:10:19.5133685Z inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_bfloat16_shape_4,2048,4096_dst_types0_cuda_bfloat16 PASSED [0.3921s] [ 23%] 2025-12-04T12:10:19.5134132Z inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_float16_shape_15,3,13_dst_types0_cuda_float16 PASSED [0.6452s] [ 23%] 2025-12-04T12:10:19.5134559Z inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_float16_shape_4,2048,4096_dst_types0_cuda_float16 PASSED [0.3329s] [ 24%] 2025-12-04T12:10:19.5135002Z inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_float32_shape_15,3,13_dst_types0_cuda_float32 PASSED [0.2352s] [ 24%] 2025-12-04T12:10:19.5135445Z inductor/test_fp8.py::TestFP8TypesCUDA::test_valid_cast_float32_shape_4,2048,4096_dst_types0_cuda_float32 PASSED [0.3568s] [ 25%] 2025-12-04T12:10:19.5135887Z inductor/test_fp8.py::TestFP8TypesCUDA::test_xblock_for_small_numel_float8_e4m3fn_cuda PASSED [0.1196s] [ 26%] 2025-12-04T12:10:19.5136459Z inductor/test_fp8.py::TestFP8TypesCUDA::test_xblock_for_small_numel_float8_e5m2_cuda PASSED [0.1121s] [ 26%] 2025-12-04T12:10:19.5137007Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape0_use_fast_accum_False_scaling_block_sizes0_cuda SKIPPED [0.0002s] (Need device-side TMA support in Triton) [ 27%] 2025-12-04T12:10:19.5137567Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape0_use_fast_accum_False_scaling_block_sizes1_cuda SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 27%] 2025-12-04T12:10:19.5138156Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape0_use_fast_accum_True_scaling_block_sizes0_cuda SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 28%] 2025-12-04T12:10:19.5138734Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape0_use_fast_accum_True_scaling_block_sizes1_cuda SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 29%] 2025-12-04T12:10:19.5139396Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape1_use_fast_accum_False_scaling_block_sizes0_cuda SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 29%] 2025-12-04T12:10:19.5140023Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape1_use_fast_accum_False_scaling_block_sizes1_cuda SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 30%] 2025-12-04T12:10:19.5234332Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape1_use_fast_accum_True_scaling_block_sizes0_cuda SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 30%] 2025-12-04T12:10:19.5235120Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_main_loop_scaling_shape1_use_fast_accum_True_scaling_block_sizes1_cuda SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 31%] 2025-12-04T12:10:19.5235822Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_mx_fp8_max_autotune_cuda SKIPPED [0.0001s] (Not supported on non B200) [ 32%] 2025-12-04T12:10:19.5236314Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_mx_fusion_cuda PASSED [3.7189s] [ 32%] 2025-12-04T12:10:19.5236874Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.6482s] [ 33%] 2025-12-04T12:10:19.5237592Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1714s] [ 33%] 2025-12-04T12:10:19.5238188Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda FAILED [1.1872s] [ 33%] 2025-12-04T12:10:19.5238559Z 2025-12-04T12:10:19.5238711Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.5239133Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5239526Z Traceback (most recent call last): 2025-12-04T12:10:19.5239838Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5240186Z method(*args, **kwargs) 2025-12-04T12:10:19.5240651Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5240926Z method(*args, **kwargs) 2025-12-04T12:10:19.5241206Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5241486Z with policy(): 2025-12-04T12:10:19.5241936Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5242240Z raise RuntimeError(msg) 2025-12-04T12:10:19.5242766Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 276824064 and is now reported as 276825088 on device 0. CUDA driver allocated memory was 1811939328 and is now 2004877312. 
2025-12-04T12:10:19.5243254Z 2025-12-04T12:10:19.5243364Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5243780Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5244097Z 2025-12-04T12:10:19.5244223Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5244709Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5244905Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5245071Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5245321Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5246057Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5246662Z graph_break [] 2025-12-04T12:10:19.5246828Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5247040Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5247692Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5248318Z current_size = base.storage().size() 2025-12-04T12:10:19.5248475Z Autotune Choices Stats: 2025-12-04T12:10:19.5248985Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0062790000811219215, "best_triton_pos": 0} 2025-12-04T12:10:19.5249512Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5249717Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5249979Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5250459Z triton_mm_17 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5251060Z triton_mm_8 0.0069 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5251599Z triton_mm_14 0.0069 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5252193Z triton_mm_16 0.0070 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5252720Z triton_mm_12 0.0078 ms 80.1% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5253645Z triton_mm_18 0.0079 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5254186Z triton_mm_15 0.0086 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5254711Z triton_mm_9 0.0090 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5255235Z triton_mm_13 0.0091 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5255823Z triton_mm_11 0.0092 ms 68.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5256257Z SingleProcess AUTOTUNE benchmarking takes 0.1143 seconds and 0.1463 seconds precompiling for 20 choices 2025-12-04T12:10:19.5256620Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5256887Z Traceback (most recent call last): 2025-12-04T12:10:19.5257159Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5257443Z method(*args, **kwargs) 
2025-12-04T12:10:19.5257707Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5257988Z method(*args, **kwargs) 2025-12-04T12:10:19.5258254Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5279115Z with policy(): 2025-12-04T12:10:19.5279405Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5306342Z raise RuntimeError(msg) 2025-12-04T12:10:19.5306942Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 276824064 and is now reported as 276825088 on device 0. CUDA driver allocated memory was 2000683008 and is now 2057306112. 2025-12-04T12:10:19.5307422Z 2025-12-04T12:10:19.5307598Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5308057Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5308430Z 2025-12-04T12:10:19.5308575Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5308959Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5309256Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5309478Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5309807Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5310621Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 
20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5311345Z graph_break [] 2025-12-04T12:10:19.5311599Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5311913Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5312672Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5313311Z current_size = base.storage().size() 2025-12-04T12:10:19.5313478Z Autotune Choices Stats: 2025-12-04T12:10:19.5313950Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0062790000811219215, "best_triton_pos": 0} 2025-12-04T12:10:19.5314522Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5314712Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5314967Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5315413Z triton_mm_17 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:19.5315956Z triton_mm_8 0.0069 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5316511Z triton_mm_14 0.0069 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5317038Z triton_mm_16 0.0070 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5317560Z triton_mm_12 0.0078 ms 80.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5318416Z triton_mm_18 0.0079 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5318914Z triton_mm_15 0.0086 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5319404Z triton_mm_9 0.0090 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5319927Z triton_mm_13 0.0091 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5320546Z triton_mm_11 0.0092 ms 68.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5320941Z SingleProcess AUTOTUNE benchmarking takes 0.1143 seconds and 0.1463 seconds precompiling for 20 choices 2025-12-04T12:10:19.5321184Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5321343Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5321476Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5321672Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5322299Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5322887Z graph_break [] 2025-12-04T12:10:19.5323005Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5323179Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5323329Z Autotune Choices Stats: 2025-12-04T12:10:19.5323757Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006560000125318766, "best_triton_pos": 0} 2025-12-04T12:10:19.5324218Z AUTOTUNE 
scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5324365Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5324569Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5324957Z triton_mm_35 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5325444Z triton_mm_33 0.0070 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5325928Z triton_mm_36 0.0070 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5326420Z triton_mm_27 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5326906Z triton_mm_34 0.0079 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5347989Z triton_mm_28 0.0084 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5348574Z triton_mm_30 0.0085 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5349138Z triton_mm_32 0.0088 ms 74.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5349676Z triton_mm_31 0.0088 ms 74.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5350266Z triton_mm_24 0.0101 ms 65.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5350699Z SingleProcess AUTOTUNE benchmarking takes 0.1133 seconds and 0.1447 seconds precompiling for 20 choices 2025-12-04T12:10:19.5350980Z =================================== FAILURES =================================== 2025-12-04T12:10:19.5351257Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5351530Z Traceback (most recent call last): 2025-12-04T12:10:19.5352085Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5352393Z method(*args, **kwargs) 2025-12-04T12:10:19.5352659Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5352926Z method(*args, **kwargs) 2025-12-04T12:10:19.5353169Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5353443Z with policy(): 2025-12-04T12:10:19.5353697Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5353964Z raise RuntimeError(msg) 2025-12-04T12:10:19.5354482Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 276824064 and is now reported as 276825088 on device 0. CUDA driver allocated memory was 2053111808 and is now 2109734912. 2025-12-04T12:10:19.5354945Z 2025-12-04T12:10:19.5355032Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5355460Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5355784Z 2025-12-04T12:10:19.5355886Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5356112Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5356323Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5356484Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5356726Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5357389Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5357978Z graph_break [] 2025-12-04T12:10:19.5358151Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5358479Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5359823Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5360609Z current_size = base.storage().size() 2025-12-04T12:10:19.5360883Z Autotune Choices Stats: 2025-12-04T12:10:19.5361454Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0062790000811219215, "best_triton_pos": 0} 2025-12-04T12:10:19.5361992Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5362268Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5362624Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5363189Z triton_mm_17 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5363836Z triton_mm_8 0.0069 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5364510Z triton_mm_14 0.0069 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5365151Z triton_mm_16 0.0070 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5365801Z triton_mm_12 0.0078 ms 80.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5366493Z triton_mm_18 0.0079 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5367129Z triton_mm_15 0.0086 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5367777Z triton_mm_9 0.0090 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5368414Z triton_mm_13 0.0091 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5369069Z triton_mm_11 0.0092 ms 68.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5369637Z SingleProcess AUTOTUNE benchmarking takes 0.1143 seconds and 0.1463 seconds precompiling for 20 choices 
2025-12-04T12:10:19.5369978Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5370673Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5370926Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5371287Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5372064Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5372806Z graph_break [] 2025-12-04T12:10:19.5373075Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5373396Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5373688Z Autotune Choices Stats: 2025-12-04T12:10:19.5374241Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006560000125318766, "best_triton_pos": 0} 2025-12-04T12:10:19.5374880Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5375154Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5375464Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5376020Z triton_mm_35 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5376622Z triton_mm_33 0.0070 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5377263Z triton_mm_36 0.0070 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5377896Z triton_mm_27 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5378510Z triton_mm_34 0.0079 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5379238Z triton_mm_28 0.0084 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5379886Z triton_mm_30 0.0085 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5380590Z triton_mm_32 0.0088 ms 74.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5381238Z triton_mm_31 0.0088 ms 74.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5381877Z triton_mm_24 0.0101 ms 65.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5382446Z SingleProcess AUTOTUNE benchmarking takes 0.1133 seconds and 0.1447 seconds precompiling for 20 choices 2025-12-04T12:10:19.5382784Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5383224Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5383520Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5383824Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5384533Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5385260Z graph_break [] 2025-12-04T12:10:19.5385523Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5385805Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5386119Z Autotune Choices Stats: 2025-12-04T12:10:19.5386666Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_55", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006440000142902136, 
"best_triton_pos": 0} 2025-12-04T12:10:19.5387342Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5387701Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5388007Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5388539Z triton_mm_55 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5389184Z triton_mm_46 0.0065 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5389859Z triton_mm_52 0.0067 ms 96.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5390543Z triton_mm_54 0.0072 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5391151Z triton_mm_56 0.0077 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5391773Z triton_mm_50 0.0083 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5392499Z triton_mm_47 0.0084 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5393217Z triton_mm_51 0.0085 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5393838Z triton_mm_49 0.0085 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5394473Z triton_mm_53 0.0101 ms 63.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5395015Z SingleProcess AUTOTUNE benchmarking takes 0.1136 seconds and 0.1452 seconds precompiling for 20 choices 2025-12-04T12:10:19.5395520Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0009856d3858a02c.xml - 2025-12-04T12:10:19.5395952Z =========================== short test summary info ============================ 2025-12-04T12:10:19.5396755Z FAILED [1.1872s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 276824064 and is now reported as 276825088 on device 0. CUDA driver allocated memory was 2053111808 and is now 2109734912. 
2025-12-04T12:10:19.5397490Z 2025-12-04T12:10:19.5397665Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5398177Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5398573Z 2025-12-04T12:10:19.5398695Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5398968Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.5399351Z ======= 1 failed, 45 passed, 9 skipped, 23 deselected, 2 rerun in 31.17s ======= 2025-12-04T12:10:19.5399642Z Got exit code 1 2025-12-04T12:10:19.5399843Z Retrying single test... 2025-12-04T12:10:19.5400238Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0b5cb0d876751945.xml 2025-12-04T12:10:19.5400562Z ============================= test session starts ============================== 2025-12-04T12:10:19.5400882Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.5401208Z cachedir: .pytest_cache 2025-12-04T12:10:19.5401609Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.5402027Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.5402245Z configfile: pytest.ini 2025-12-04T12:10:19.5402735Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.5403170Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.5403680Z stepcurrent: skipping 77 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5404199Z Running 1 items in this shard 2025-12-04T12:10:19.5404325Z 2025-12-04T12:10:19.5404586Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.5879s] [100%] 2025-12-04T12:10:19.5405191Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0307s] [100%] 2025-12-04T12:10:19.5610930Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.8913s] [100%] 2025-12-04T12:10:19.5611212Z 2025-12-04T12:10:19.5611337Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.5611618Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5611895Z Traceback (most recent call last): 2025-12-04T12:10:19.5612155Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5612406Z method(*args, **kwargs) 2025-12-04T12:10:19.5612644Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5612901Z method(*args, **kwargs) 2025-12-04T12:10:19.5613136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5613394Z with policy(): 2025-12-04T12:10:19.5613625Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5613873Z raise 
RuntimeError(msg) 2025-12-04T12:10:19.5614346Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:19.5614811Z 2025-12-04T12:10:19.5614897Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5615277Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5615605Z 2025-12-04T12:10:19.5615716Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5615936Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5616163Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5616326Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5616947Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5617640Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5617846Z graph_break [] 2025-12-04T12:10:19.5617995Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5618185Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5618802Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5619411Z current_size = base.storage().size() 2025-12-04T12:10:19.5619561Z Autotune Choices Stats: 2025-12-04T12:10:19.5620029Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:19.5620609Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5620783Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5621041Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5621469Z triton_mm_17 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5622016Z triton_mm_14 0.0069 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5622556Z triton_mm_8 0.0069 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5623081Z triton_mm_12 0.0080 ms 
79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5623603Z triton_mm_18 0.0083 ms 76.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5624141Z triton_mm_13 0.0084 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5624646Z triton_mm_15 0.0085 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5625158Z triton_mm_9 0.0085 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5625658Z triton_mm_11 0.0086 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5626162Z triton_mm_6 0.0095 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5626571Z SingleProcess AUTOTUNE benchmarking takes 0.0920 seconds and 0.4118 seconds precompiling for 20 choices 2025-12-04T12:10:19.5626892Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5627129Z Traceback (most recent call last): 2025-12-04T12:10:19.5627377Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5627625Z method(*args, **kwargs) 2025-12-04T12:10:19.5627858Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5628101Z method(*args, **kwargs) 2025-12-04T12:10:19.5628329Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5628567Z with policy(): 2025-12-04T12:10:19.5628788Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5629028Z raise RuntimeError(msg) 2025-12-04T12:10:19.5629516Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1121976320. 
2025-12-04T12:10:19.5629948Z 2025-12-04T12:10:19.5630026Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5630449Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5630750Z 2025-12-04T12:10:19.5630843Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5631051Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5631222Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5631362Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5631952Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5632607Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5632795Z graph_break [] 2025-12-04T12:10:19.5632927Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5633112Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5633718Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5634295Z current_size = base.storage().size() 2025-12-04T12:10:19.5634429Z Autotune Choices Stats: 2025-12-04T12:10:19.5634879Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:19.5635352Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5635513Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5635729Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5636134Z triton_mm_17 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5636643Z triton_mm_14 0.0069 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5637146Z triton_mm_8 0.0069 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5637645Z triton_mm_12 0.0080 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5638182Z triton_mm_18 0.0083 ms 76.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5638686Z triton_mm_13 0.0084 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5639189Z triton_mm_15 0.0085 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5639698Z triton_mm_9 0.0085 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5640244Z triton_mm_11 0.0086 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5640745Z triton_mm_6 0.0095 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5641179Z SingleProcess AUTOTUNE benchmarking takes 0.0920 seconds and 0.4118 seconds precompiling for 20 choices 2025-12-04T12:10:19.5641432Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5641600Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5641744Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5641949Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5642585Z 
inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5643148Z graph_break [] 2025-12-04T12:10:19.5643280Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5643467Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5643633Z Autotune Choices Stats: 2025-12-04T12:10:19.5644078Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:19.5644551Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5644713Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5644928Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5645328Z triton_mm_36 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5645830Z triton_mm_35 0.0070 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5646325Z triton_mm_27 0.0071 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5646868Z triton_mm_33 0.0076 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5647374Z triton_mm_37 0.0077 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5647878Z triton_mm_30 0.0085 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5648378Z triton_mm_28 0.0086 ms 75.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5648882Z triton_mm_32 0.0086 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5649400Z triton_mm_34 0.0087 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5649901Z triton_mm_31 0.0103 ms 62.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5650333Z SingleProcess AUTOTUNE 
benchmarking takes 0.1221 seconds and 0.2857 seconds precompiling for 20 choices 2025-12-04T12:10:19.5650566Z =================================== FAILURES =================================== 2025-12-04T12:10:19.5650814Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5651053Z Traceback (most recent call last): 2025-12-04T12:10:19.5651303Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5651549Z method(*args, **kwargs) 2025-12-04T12:10:19.5651786Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5652029Z method(*args, **kwargs) 2025-12-04T12:10:19.5652258Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5652499Z with policy(): 2025-12-04T12:10:19.5652665Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5652708Z raise RuntimeError(msg) 2025-12-04T12:10:19.5653111Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 
2025-12-04T12:10:19.5653115Z 2025-12-04T12:10:19.5653200Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5653461Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5653463Z 2025-12-04T12:10:19.5653558Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5653637Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5653692Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5653781Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5654278Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5654381Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5654430Z graph_break [] 2025-12-04T12:10:19.5654498Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5654584Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5655077Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5655153Z current_size = base.storage().size() 2025-12-04T12:10:19.5655206Z Autotune Choices Stats: 2025-12-04T12:10:19.5655576Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:19.5655651Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5655708Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5655836Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5656077Z triton_mm_17 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5656315Z triton_mm_14 0.0069 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5656555Z triton_mm_8 0.0069 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5656784Z triton_mm_12 0.0080 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5657025Z triton_mm_18 0.0083 ms 76.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5657254Z triton_mm_13 0.0084 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5657483Z triton_mm_15 0.0085 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5657736Z triton_mm_9 0.0085 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5657969Z triton_mm_11 0.0086 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5658207Z triton_mm_6 0.0095 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5658339Z SingleProcess AUTOTUNE benchmarking takes 0.0920 seconds and 0.4118 seconds precompiling for 20 choices 2025-12-04T12:10:19.5658422Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5658468Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5658534Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5658640Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5659131Z 
inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5659192Z graph_break [] 2025-12-04T12:10:19.5659265Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5659349Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5659394Z Autotune Choices Stats: 2025-12-04T12:10:19.5659766Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:19.5659833Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5659893Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5660035Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5660310Z triton_mm_36 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5660543Z triton_mm_35 0.0070 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5660776Z triton_mm_27 0.0071 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5661012Z triton_mm_33 0.0076 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5661253Z triton_mm_37 0.0077 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5661478Z triton_mm_30 0.0085 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5661741Z triton_mm_28 0.0086 ms 75.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5661967Z triton_mm_32 0.0086 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5662202Z triton_mm_34 0.0087 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5662429Z triton_mm_31 0.0103 ms 62.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5662571Z SingleProcess AUTOTUNE 
benchmarking takes 0.1221 seconds and 0.2857 seconds precompiling for 20 choices 2025-12-04T12:10:19.5662655Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5662702Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5662920Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5663025Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5663515Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5663557Z graph_break [] 2025-12-04T12:10:19.5663630Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5663709Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5663758Z Autotune Choices Stats: 2025-12-04T12:10:19.5664125Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_46", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007199999876320362, "best_triton_pos": 0} 2025-12-04T12:10:19.5664200Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5664255Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5664385Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5664627Z triton_mm_46 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5664859Z triton_mm_56 0.0072 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5665096Z triton_mm_54 0.0075 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5665323Z triton_mm_52 0.0077 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5665578Z triton_mm_55 0.0077 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5665817Z triton_mm_47 0.0083 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5666047Z triton_mm_53 0.0083 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5666278Z triton_mm_49 0.0085 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5666508Z triton_mm_50 0.0088 ms 81.5% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5666742Z triton_mm_51 0.0091 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5666897Z SingleProcess AUTOTUNE benchmarking takes 0.1517 seconds and 0.2739 seconds precompiling for 20 choices 2025-12-04T12:10:19.5667098Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0b5cb0d876751945.xml - 2025-12-04T12:10:19.5667170Z =========================== short test summary info ============================ 2025-12-04T12:10:19.5667760Z FAILED [0.8913s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 2025-12-04T12:10:19.5667763Z 2025-12-04T12:10:19.5667849Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5668109Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5668111Z 2025-12-04T12:10:19.5668210Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5668276Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.5668355Z ================== 1 failed, 187 deselected, 2 rerun in 4.53s ================== 2025-12-04T12:10:19.5668397Z Got exit code 1 2025-12-04T12:10:19.5668448Z Retrying single test... 2025-12-04T12:10:19.5668605Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-8cbe9e2a7c912c39.xml 2025-12-04T12:10:19.5668667Z ============================= test session starts ============================== 2025-12-04T12:10:19.5668791Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.5668836Z cachedir: .pytest_cache 2025-12-04T12:10:19.5669005Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.5669057Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.5669109Z configfile: pytest.ini 2025-12-04T12:10:19.5669279Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.5669365Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.5669646Z stepcurrent: skipping 77 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5669703Z Running 1 items in this shard 2025-12-04T12:10:19.5669706Z 2025-12-04T12:10:19.5669924Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.5566s] [100%] 2025-12-04T12:10:19.5670171Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8610s] [100%] 2025-12-04T12:10:19.5670363Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda PASSED [0.9494s] [100%] 2025-12-04T12:10:19.5670372Z 2025-12-04T12:10:19.5670429Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.5670582Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5670633Z Traceback (most recent call last): 2025-12-04T12:10:19.5670802Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5670874Z method(*args, **kwargs) 2025-12-04T12:10:19.5671037Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5671082Z method(*args, **kwargs) 2025-12-04T12:10:19.5671241Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5671282Z with policy(): 2025-12-04T12:10:19.5671444Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5671489Z raise 
RuntimeError(msg) 2025-12-04T12:10:19.5671887Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:19.5671891Z 2025-12-04T12:10:19.5671968Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5672235Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5672238Z 2025-12-04T12:10:19.5672334Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5672411Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5672464Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5672526Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5673023Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5673126Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5673173Z graph_break [] 2025-12-04T12:10:19.5673240Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5673323Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5673833Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5673892Z current_size = base.storage().size() 2025-12-04T12:10:19.5673944Z Autotune Choices Stats: 2025-12-04T12:10:19.5674316Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:19.5674390Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5674446Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5674572Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5674807Z triton_mm_14 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5675055Z triton_mm_16 0.0072 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5675285Z triton_mm_8 0.0073 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5675519Z triton_mm_9 0.0080 ms 87.6% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5675748Z triton_mm_11 0.0086 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5675972Z triton_mm_13 0.0087 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5676202Z triton_mm_15 0.0096 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5676431Z triton_mm_17 0.0097 ms 72.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5676665Z triton_mm_6 0.0099 ms 71.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5676896Z triton_mm_5 0.0104 ms 68.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5677027Z SingleProcess AUTOTUNE benchmarking takes 0.0882 seconds and 0.4072 seconds precompiling for 20 choices 2025-12-04T12:10:19.5677176Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5677224Z Traceback (most recent call last): 2025-12-04T12:10:19.5677404Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5677447Z method(*args, **kwargs) 2025-12-04T12:10:19.5677602Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5677646Z method(*args, **kwargs) 2025-12-04T12:10:19.5677801Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5677840Z with policy(): 2025-12-04T12:10:19.5677998Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5678039Z raise RuntimeError(msg) 2025-12-04T12:10:19.5678437Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1199570944. 
2025-12-04T12:10:19.5678440Z 2025-12-04T12:10:19.5678517Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5678775Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.5678804Z 2025-12-04T12:10:19.5678895Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5678969Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5679018Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5679079Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5679571Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5679674Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5679721Z graph_break [] 2025-12-04T12:10:19.5679788Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5679870Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5680427Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5680479Z current_size = base.storage().size() 2025-12-04T12:10:19.5680531Z Autotune Choices Stats: 2025-12-04T12:10:19.5680900Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:19.5680976Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5681031Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5681161Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5681395Z triton_mm_14 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5681653Z triton_mm_16 0.0072 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5681892Z triton_mm_8 0.0073 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5682123Z triton_mm_9 0.0080 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5682356Z triton_mm_11 0.0086 ms 81.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5682585Z triton_mm_13 0.0087 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5682848Z triton_mm_15 0.0096 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5683081Z triton_mm_17 0.0097 ms 72.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5683310Z triton_mm_6 0.0099 ms 71.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5683544Z triton_mm_5 0.0104 ms 68.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5683676Z SingleProcess AUTOTUNE benchmarking takes 0.0882 seconds and 0.4072 seconds precompiling for 20 choices 2025-12-04T12:10:19.5683759Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5683805Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5683871Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5683975Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5684437Z inductor 
[('triton_bundler_save_kernel', 152), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('async_compile_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.5684478Z graph_break [] 2025-12-04T12:10:19.5684548Z aten_mm_info [('aten._scaled_mm.default_1024_16_1024', 1)] 2025-12-04T12:10:19.5684634Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5684678Z Autotune Choices Stats: 2025-12-04T12:10:19.5685154Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "_scaled_mm", "best_time": 0.007040000054985285, "best_triton_pos": 1, "best_triton_time": 0.007160000037401915, "best_triton_kernel": "triton_mm_35", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2"} 2025-12-04T12:10:19.5685220Z AUTOTUNE scaled_mm(1024x1024, 1024x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.5685298Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5685422Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5685473Z _scaled_mm 0.0070 ms 100.0% 2025-12-04T12:10:19.5685704Z triton_mm_35 0.0072 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5685936Z triton_mm_33 0.0074 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 
2025-12-04T12:10:19.5686166Z triton_mm_27 0.0075 ms 94.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5686406Z triton_mm_37 0.0078 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.5686663Z triton_mm_31 0.0081 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5686892Z triton_mm_34 0.0083 ms 84.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5687131Z triton_mm_28 0.0084 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5687361Z triton_mm_32 0.0086 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.5687596Z triton_mm_36 0.0094 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5687734Z SingleProcess AUTOTUNE benchmarking takes 0.1227 seconds and 0.2841 seconds precompiling for 20 choices 2025-12-04T12:10:19.5687927Z - generated xml file: 
/var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-8cbe9e2a7c912c39.xml - 2025-12-04T12:10:19.5688008Z ================== 1 passed, 187 deselected, 2 rerun in 4.39s ================== 2025-12-04T12:10:19.5688049Z Got exit code 0 2025-12-04T12:10:19.5688144Z Test succeeded in new process, continuing with the rest of the tests 2025-12-04T12:10:19.5688290Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7940c49142504303.xml 2025-12-04T12:10:19.5688357Z ============================= test session starts ============================== 2025-12-04T12:10:19.5688473Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.5688523Z cachedir: .pytest_cache 2025-12-04T12:10:19.5688686Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.5688742Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.5688787Z configfile: pytest.ini 2025-12-04T12:10:19.5688959Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.5689041Z collecting ... collected 188 items / 78 deselected / 110 selected 2025-12-04T12:10:19.5689124Z stepcurrent: skipping 78 already run items. 2025-12-04T12:10:19.5689174Z Running 110 items in this shard 2025-12-04T12:10:19.5689176Z 2025-12-04T12:10:19.5690132Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpxrw4ew93/6y/c6ynvaswxkvzl4is2ajbl6iame4d5qle75ci427cnjmhyffijoo5.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.5690294Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.5690518Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.5690715Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.5691006Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.5691153Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.5691423Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.5691568Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.5691833Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.5691994Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.5692273Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.5692412Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.5692697Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.5692903Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.5693223Z E1204 10:51:02.805000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.5693995Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpxrw4ew93/ts/cts7xnawxifeshgr42gxswlkp4hvueix4d7ckiglfk3rqvhm7u2t.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.5694155Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.5694372Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.5694536Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.5694825Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.5694967Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.5695226Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.5695390Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.5695645Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.5695813Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.5696093Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.5696232Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.5696517Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.5696711Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.5697035Z E1204 10:51:02.828000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.5697770Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpxrw4ew93/c5/cc5ggiuivr5snfqf5gl4meztqartblipylsohf34cpn77emtpxxo.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.5697921Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.5698163Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.5698322Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.5698619Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.5698762Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.5699019Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.5699165Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.5699422Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.5699608Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.5699878Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.5700022Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.5700339Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.5700542Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.5700866Z E1204 10:51:02.829000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.5701594Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpxrw4ew93/ue/cuepztjx5hwc6cn35lr4lupsh2ngqa5sbge7z2tciyg6wm4mh2nc.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.5701751Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.5701977Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.5702134Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.5702427Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.5702585Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.5702849Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.5702990Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.5703254Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.5703412Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.5703689Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.5703831Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.5704107Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.5704333Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.5704647Z E1204 10:51:02.832000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.5705380Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpxrw4ew93/5y/c5ycqrb7j4uapqa7ucfmmec6ba2bhj74pglv26srlibasz6l67e7.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.5705538Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.5705899Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.5706065Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.5706352Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.5706494Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.5706758Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.5706897Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.5707179Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.5707337Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.5707616Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.5707752Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.5708035Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.5708240Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.5708554Z E1204 10:51:02.836000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.5709306Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpxrw4ew93/uz/cuzyocrxsruvbc5g4v6sbp2do5rkwrogymtc32osajsldsdqu5fj.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.5709457Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.5709679Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.5709843Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.5710178Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.5710318Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.5710580Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.5710725Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.5710983Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.5711147Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.5711422Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.5711592Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.5711874Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.5712070Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.5712389Z E1204 10:51:02.837000 572523 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.5712444Z ('RERUN', {'yellow': True}) [3.4027s] [ 0%] 2025-12-04T12:10:19.5712778Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda E1204 10:51:04.720000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5713080Z E1204 10:51:04.720000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5713235Z E1204 10:51:04.720000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:19.5713388Z E1204 10:51:04.722000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5713779Z E1204 10:51:04.722000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5713918Z E1204 10:51:04.722000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.5714065Z E1204 10:51:04.724000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5714364Z E1204 10:51:04.724000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5714494Z E1204 10:51:04.724000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.5714645Z E1204 10:51:04.788000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5714947Z E1204 10:51:04.788000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5715076Z E1204 10:51:04.788000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:19.5715230Z E1204 10:51:04.790000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5715525Z E1204 10:51:04.790000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5715660Z E1204 10:51:04.790000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.5715804Z E1204 10:51:04.792000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5716122Z E1204 10:51:04.792000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5716256Z E1204 10:51:04.792000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.5716378Z ('RERUN', {'yellow': True}) [1.6504s] [ 0%] 2025-12-04T12:10:19.5716704Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda E1204 10:51:06.174000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5716999Z E1204 10:51:06.174000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5717136Z E1204 10:51:06.174000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:19.5717283Z E1204 10:51:06.176000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5717609Z E1204 10:51:06.176000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5717735Z E1204 10:51:06.176000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.5717886Z E1204 10:51:06.178000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5718187Z E1204 10:51:06.178000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5718316Z E1204 10:51:06.178000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.5718465Z E1204 10:51:06.226000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5718758Z E1204 10:51:06.226000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5718892Z E1204 10:51:06.226000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:19.5719036Z E1204 10:51:06.228000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5719335Z E1204 10:51:06.228000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5719466Z E1204 10:51:06.228000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.5719612Z E1204 10:51:06.230000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.5719914Z E1204 10:51:06.230000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.5720042Z E1204 10:51:06.230000 572523 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:19.5720130Z FAILED [1.6620s] [ 0%] 2025-12-04T12:10:19.5720133Z 2025-12-04T12:10:19.5720221Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.5720375Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5720424Z Traceback (most recent call last): 2025-12-04T12:10:19.5720589Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5720634Z method(*args, **kwargs) 2025-12-04T12:10:19.5720792Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5720835Z method(*args, **kwargs) 2025-12-04T12:10:19.5720992Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5721033Z with policy(): 2025-12-04T12:10:19.5721193Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5721237Z raise RuntimeError(msg) 2025-12-04T12:10:19.5721637Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 2051014656. 
2025-12-04T12:10:19.5721663Z 2025-12-04T12:10:19.5721749Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5722013Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.5722015Z 2025-12-04T12:10:19.5722115Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5722197Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5722249Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5722311Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5722821Z inductor [('triton_bundler_save_kernel', 304), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.5722931Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5722974Z graph_break [] 2025-12-04T12:10:19.5723049Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.5723125Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5723619Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5723674Z current_size = base.storage().size() 2025-12-04T12:10:19.5723726Z Autotune Choices Stats: 2025-12-04T12:10:19.5724200Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "_scaled_mm", "best_time": 0.013238999992609024, "best_triton_pos": 1, "best_triton_time": 0.01592000015079975, "best_triton_kernel": "triton_mm_35", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.5724277Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.5724351Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5724483Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5724530Z _scaled_mm 0.0132 ms 100.0% 2025-12-04T12:10:19.5724779Z triton_mm_35 0.0159 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5725017Z triton_mm_15 0.0160 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5725248Z triton_mm_34 0.0174 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5725479Z triton_mm_14 0.0176 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5725738Z triton_mm_13 0.0176 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5725971Z triton_mm_31 0.0183 ms 72.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5726203Z triton_mm_33 0.0191 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5726427Z triton_mm_16 0.0195 ms 67.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5726659Z triton_mm_32 0.0195 ms 67.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5726791Z SingleProcess AUTOTUNE benchmarking takes 0.1840 seconds and 1.0544 seconds precompiling for 33 choices 2025-12-04T12:10:19.5726943Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5726993Z Traceback (most recent call last): 2025-12-04T12:10:19.5727161Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5727207Z method(*args, **kwargs) 2025-12-04T12:10:19.5727366Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5727412Z method(*args, **kwargs) 2025-12-04T12:10:19.5727566Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5727607Z with policy(): 2025-12-04T12:10:19.5727769Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5727817Z raise RuntimeError(msg) 2025-12-04T12:10:19.5728452Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2051014656 and is now 3015704576. 2025-12-04T12:10:19.5728455Z 2025-12-04T12:10:19.5728540Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5728800Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.5728805Z 2025-12-04T12:10:19.5728901Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5728980Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5729033Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5729095Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5729589Z inductor [('triton_bundler_save_kernel', 304), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), 
('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.5729698Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5729762Z graph_break [] 2025-12-04T12:10:19.5729839Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.5729912Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5730428Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5730480Z current_size = base.storage().size() 2025-12-04T12:10:19.5730534Z Autotune Choices Stats: 2025-12-04T12:10:19.5731003Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "_scaled_mm", "best_time": 0.013238999992609024, "best_triton_pos": 1, "best_triton_time": 0.01592000015079975, "best_triton_kernel": "triton_mm_35", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.5731086Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.5731141Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5731273Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5731326Z _scaled_mm 0.0132 ms 100.0% 2025-12-04T12:10:19.5731565Z triton_mm_35 0.0159 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5731799Z triton_mm_15 0.0160 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5732030Z triton_mm_34 0.0174 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5732265Z triton_mm_14 0.0176 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5732515Z triton_mm_13 0.0176 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5732751Z triton_mm_31 0.0183 ms 72.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5732986Z triton_mm_33 0.0191 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5733215Z triton_mm_16 0.0195 ms 67.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5733450Z triton_mm_32 0.0195 ms 67.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5733582Z SingleProcess AUTOTUNE benchmarking takes 0.1840 seconds and 1.0544 seconds precompiling for 33 choices 2025-12-04T12:10:19.5733691Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5733737Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5733801Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5733904Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5734367Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.5734414Z graph_break [] 2025-12-04T12:10:19.5734480Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.5734563Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5734609Z Autotune Choices Stats: 2025-12-04T12:10:19.5735078Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.01384000014513731, "best_triton_pos": 1, "best_triton_time": 0.01583999954164028, "best_triton_kernel": "triton_mm_73", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.5735150Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.5735211Z strides: [1024, 1], [1, 1024], [1, 1], [1, 
1], [1] 2025-12-04T12:10:19.5735332Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5735383Z _scaled_mm 0.0138 ms 100.0% 2025-12-04T12:10:19.5735620Z triton_mm_73 0.0158 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5735858Z triton_mm_53 0.0165 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5736093Z triton_mm_72 0.0174 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5736341Z triton_mm_51 0.0176 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5736576Z triton_mm_52 0.0178 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5736807Z triton_mm_69 0.0185 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5737041Z triton_mm_71 0.0188 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:19.5737277Z triton_mm_54 0.0194 ms 71.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5737506Z triton_mm_70 0.0196 ms 70.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5737667Z SingleProcess AUTOTUNE benchmarking takes 0.2760 seconds and 0.7951 seconds precompiling for 39 choices 2025-12-04T12:10:19.5737725Z =================================== FAILURES =================================== 2025-12-04T12:10:19.5737879Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.5737930Z Traceback (most recent call last): 2025-12-04T12:10:19.5738096Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5738143Z method(*args, **kwargs) 2025-12-04T12:10:19.5738303Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.5738347Z method(*args, **kwargs) 2025-12-04T12:10:19.5738506Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.5738547Z with policy(): 2025-12-04T12:10:19.5738710Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.5738755Z raise RuntimeError(msg) 2025-12-04T12:10:19.5739160Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 3015704576 and is now 3902799872. 2025-12-04T12:10:19.5739164Z 2025-12-04T12:10:19.5739245Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5739509Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.5739513Z 2025-12-04T12:10:19.5739610Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5739689Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5739739Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5739801Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5740363Z inductor [('triton_bundler_save_kernel', 304), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.5740471Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5740514Z graph_break [] 2025-12-04T12:10:19.5740588Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.5740663Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5741152Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. 
This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.5741203Z current_size = base.storage().size() 2025-12-04T12:10:19.5741254Z Autotune Choices Stats: 2025-12-04T12:10:19.5741728Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "_scaled_mm", "best_time": 0.013238999992609024, "best_triton_pos": 1, "best_triton_time": 0.01592000015079975, "best_triton_kernel": "triton_mm_35", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.5741908Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.5741964Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5742095Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5742140Z _scaled_mm 0.0132 ms 100.0% 2025-12-04T12:10:19.5742382Z triton_mm_35 0.0159 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5742617Z triton_mm_15 0.0160 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5742847Z triton_mm_34 0.0174 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5743079Z triton_mm_14 0.0176 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, 
BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5743308Z triton_mm_13 0.0176 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5743542Z triton_mm_31 0.0183 ms 72.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5743775Z triton_mm_33 0.0191 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5744002Z triton_mm_16 0.0195 ms 67.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5744254Z triton_mm_32 0.0195 ms 67.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5744386Z SingleProcess AUTOTUNE benchmarking takes 0.1840 seconds and 1.0544 seconds precompiling for 33 choices 2025-12-04T12:10:19.5744469Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.5744515Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5744585Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5744689Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5745152Z inductor [('triton_bundler_save_kernel', 304), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.5745199Z graph_break [] 2025-12-04T12:10:19.5745268Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.5745352Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5745415Z Autotune Choices Stats: 2025-12-04T12:10:19.5745881Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.01384000014513731, "best_triton_pos": 1, "best_triton_time": 0.01583999954164028, "best_triton_kernel": "triton_mm_73", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.5745953Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.5746014Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5746137Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5746185Z _scaled_mm 0.0138 ms 100.0% 2025-12-04T12:10:19.5746419Z triton_mm_73 0.0158 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5746655Z triton_mm_53 0.0165 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:19.5746888Z triton_mm_72 0.0174 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5747117Z triton_mm_51 0.0176 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5747350Z triton_mm_52 0.0178 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5747578Z triton_mm_69 0.0185 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5747811Z triton_mm_71 0.0188 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5748063Z triton_mm_54 0.0194 ms 71.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5748295Z triton_mm_70 0.0196 ms 70.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5748429Z SingleProcess AUTOTUNE benchmarking takes 0.2760 seconds and 0.7951 seconds precompiling for 39 choices 2025-12-04T12:10:19.5748507Z ----------------------------- Captured stdout 
call ----------------------------- 2025-12-04T12:10:19.5748559Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.5748619Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.5748728Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.5749212Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.5749278Z graph_break [] 2025-12-04T12:10:19.5749347Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.5749430Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.5749474Z Autotune Choices Stats: 2025-12-04T12:10:19.5749847Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_111", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.015560000203549862, "best_triton_pos": 0} 2025-12-04T12:10:19.5749926Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.5749980Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.5750148Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.5750393Z triton_mm_111 0.0156 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:19.5750628Z triton_mm_91 0.0162 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5750856Z triton_mm_89 0.0176 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5751092Z triton_mm_110 0.0177 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5751329Z triton_mm_90 0.0177 ms 87.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5751556Z triton_mm_107 0.0185 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5751819Z triton_mm_109 0.0186 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5752047Z triton_mm_92 0.0196 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5752286Z triton_mm_108 0.0196 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.5752519Z triton_mm_106 0.0217 ms 71.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.5752649Z SingleProcess AUTOTUNE benchmarking takes 0.2854 seconds and 0.6265 seconds precompiling for 39 choices 2025-12-04T12:10:19.5752850Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7940c49142504303.xml - 2025-12-04T12:10:19.5752915Z =========================== short test summary info ============================ 2025-12-04T12:10:19.5753537Z FAILED [1.6620s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 3015704576 and is now 3902799872. 2025-12-04T12:10:19.5753540Z 2025-12-04T12:10:19.5753618Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.5753881Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.5753883Z 2025-12-04T12:10:19.5753979Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.5754045Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.5754122Z ================== 1 failed, 78 deselected, 2 rerun in 6.74s =================== 2025-12-04T12:10:19.5754162Z Got exit code 1 2025-12-04T12:10:19.5754211Z Retrying single test... 2025-12-04T12:10:19.5754358Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-5c5ceebf91026326.xml 2025-12-04T12:10:19.5754426Z ============================= test session starts ============================== 2025-12-04T12:10:19.5754543Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.5754590Z cachedir: .pytest_cache 2025-12-04T12:10:19.5754756Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.5754813Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.5754854Z configfile: pytest.ini 2025-12-04T12:10:19.5755025Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.5755108Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.5755370Z stepcurrent: skipping 78 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.5755419Z Running 1 items in this shard 2025-12-04T12:10:19.5755421Z 2025-12-04T12:10:19.5755786Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:51:16.945887080 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5755789Z 2025-12-04T12:10:19.5755952Z [W1204 10:51:24.605091265 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.5755956Z 2025-12-04T12:10:19.5756112Z [W1204 10:51:24.606631154 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5756114Z 2025-12-04T12:10:19.5756274Z [W1204 10:51:24.632053847 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5756277Z 2025-12-04T12:10:19.5756429Z [W1204 10:51:24.638168001 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.5756431Z 2025-12-04T12:10:19.5756757Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.5757068Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.5757225Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.5757746Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.5758007Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.5758246Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous 
namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.5758466Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.5758670Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.5758908Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.5759134Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5759375Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5759600Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5759836Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5760082Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5760371Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5760601Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5760830Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5761058Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5761293Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5761547Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5761784Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5762003Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5762205Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.5762427Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5762665Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5762885Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5763078Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.5763308Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5763537Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5763767Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5763998Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5764221Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5764460Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from 
/usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.5764676Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.5764848Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.5765032Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.5765566Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpvia3gzob/ts/cts7xnawxifeshgr42gxswlkp4hvueix4d7ckiglfk3rqvhm7u2t.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.5765716Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.5765961Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.5766127Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.5766420Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.5766561Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
choice.precompile() 2025-12-04T12:10:19.5766822Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.5766971Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.5767224Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.5767390Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.5767670Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.5767808Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.5768091Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.5768287Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.5768612Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.5768929Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.5769071Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.5769558Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.5769814Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.5770047Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.5770309Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.5770519Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.5770755Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.5770983Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5771221Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5771447Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5771684Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5771905Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5772144Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5772372Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5772603Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5772831Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5773099Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5773330Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5773560Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5773789Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5773989Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.5774211Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5774448Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5774704Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5774900Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.5775121Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.5775367Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5775598Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5775830Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5776055Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5776260Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.5776481Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.5776645Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.5776835Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.5776950Z E1204 10:51:24.141000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.5777262Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.5777582Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.5777720Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.5778207Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.5778467Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.5778693Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.5778930Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.5779134Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.5779370Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.5779593Z E1204 10:51:24.185000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5779829Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5780056Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5780321Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5780548Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5780780Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5781009Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5781241Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5781466Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5781726Z E1204 10:51:24.185000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5781947Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5782184Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5782408Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5782604Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.5782825Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5783061Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5783317Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5783507Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.5783733Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5783964Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5784193Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5784429Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5784653Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5784863Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.5785080Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.5785246Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.5785429Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.5785963Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpvia3gzob/uz/cuzyocrxsruvbc5g4v6sbp2do5rkwrogymtc32osajsldsdqu5fj.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.5978495Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.5978853Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.5979032Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.5979328Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.5979469Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.5979732Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.5979876Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.5980251Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.5980411Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.5980685Z E1204 10:51:24.185000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.5980821Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.5981098Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.5981294Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.5981618Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.5981919Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.5982052Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.5982547Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.5982804Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.5983062Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.5983274Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.5983477Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.5983709Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.5983930Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5984165Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5984387Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5984649Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5984871Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5985098Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5985319Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5985547Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5985768Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5985992Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5986214Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5986449Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5986671Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5986866Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.5987085Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5987337Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5987555Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5987746Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.5987965Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.5988192Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5988416Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5988642Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5989303Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5989506Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.5989725Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.5989889Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.5990069Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.5990208Z E1204 10:51:24.185000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.5990521Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.5990818Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.5990951Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.5991437Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.5991696Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.5991948Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.5992156Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.5992358Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.5992589Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.5992813Z E1204 10:51:24.189000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5993042Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5993264Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5993527Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5993747Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5993973Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5994197Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5994427Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5994643Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5994870Z E1204 10:51:24.189000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5995088Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5995317Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5995538Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5995732Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.5995951Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5996195Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5996418Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5996608Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.5996830Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5997056Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5997278Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5997509Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.5997746Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.5997952Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.5998162Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.5998326Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.5998505Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.5999038Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpvia3gzob/6y/c6ynvaswxkvzl4is2ajbl6iame4d5qle75ci427cnjmhyffijoo5.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.5999189Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.5999408Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.5999569Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.5999857Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.5999994Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.6000300Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.6000559Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.6000819Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.6000977Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.6001250Z E1204 10:51:24.189000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.6001386Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.6001665Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.6001857Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.6002202Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6002499Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6002629Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6003111Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6003364Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6003592Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6003804Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6004005Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6004237Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6004456Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6004688Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6004929Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6005156Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6005380Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6005607Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6005830Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6006055Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6006301Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6006532Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6006748Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6006980Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6007197Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6007390Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6007609Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6007840Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6008062Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6008251Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6008476Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6008705Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6008951Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6009179Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6009401Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6009608Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.6009818Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.6009981Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.6010195Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.6010330Z E1204 10:51:24.189000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.6010638Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.6010929Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6011061Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6011540Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6011795Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6012018Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6012227Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6012429Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6012657Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6012881Z E1204 10:51:24.194000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6013107Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6013351Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6013578Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6013801Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6014031Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6014250Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6014481Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6014718Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6014946Z E1204 10:51:24.194000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6015164Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6015395Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6015616Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6015806Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6016032Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6016260Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6016482Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6016671Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6016893Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6017121Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6017340Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6017590Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6017810Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6018015Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.6018227Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.6018390Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.6018574Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.6019099Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpvia3gzob/c5/cc5ggiuivr5snfqf5gl4meztqartblipylsohf34cpn77emtpxxo.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.6019269Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.6019482Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.6019643Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.6019928Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.6020062Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.6020350Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.6020490Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.6020749Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.6020907Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.6021178Z E1204 10:51:24.194000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.6021315Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.6021592Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.6021814Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.6022127Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6022424Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6022555Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6023040Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6023324Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6023549Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6023759Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6023959Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6024190Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6024411Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6024643Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6024862Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6025093Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6025316Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6025544Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6025765Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6026017Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6026236Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6026469Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6026687Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6026913Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6027132Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6027326Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6027583Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6027809Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6028031Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6028221Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6028440Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6028669Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6028888Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6029117Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6029335Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6029546Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.6029757Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.6029919Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.6030174Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.6030280Z E1204 10:51:24.194000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.6030587Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.6030878Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6031012Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6031489Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6031767Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6031994Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6032199Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6032405Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6032633Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6032857Z E1204 10:51:24.198000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6033087Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6033307Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6033535Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6033757Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6033989Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6034209Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6034458Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6034675Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6034907Z E1204 10:51:24.198000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6035127Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6035353Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6035577Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6035769Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6036007Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6036234Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6036458Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6036648Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6036867Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6037099Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6040035Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6040321Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6040541Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6040748Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.6040962Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.6041147Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.6041332Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.6041902Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpvia3gzob/ue/cuepztjx5hwc6cn35lr4lupsh2ngqa5sbge7z2tciyg6wm4mh2nc.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.6042055Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.6042271Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.6042431Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.6042720Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.6042880Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.6043141Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.6043279Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.6043536Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.6043698Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.6043968Z E1204 10:51:24.198000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.6044103Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.6044379Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.6044666Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.6044983Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6045280Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6045412Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6045921Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6046182Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6046408Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6046617Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6046820Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6047054Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6047274Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6047515Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6047739Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6047968Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6048192Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6048421Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6048642Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6048887Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6049110Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6049340Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6049559Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6049790Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6050009Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6050263Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6050484Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6050717Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6050943Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6051135Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6051356Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6051596Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6051818Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6052048Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6052267Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6052471Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.6052681Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.6052846Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.6053039Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.6053147Z E1204 10:51:24.198000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.6053306Z [W1204 10:51:24.662350401 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.6053312Z 2025-12-04T12:10:19.6053623Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.6053920Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6054053Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6054549Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6054806Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6055034Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6055242Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6055444Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6055681Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6055912Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6056147Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6056372Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6056601Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6056825Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6057055Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6057289Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6057716Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6057940Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6058172Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6058391Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6058625Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6058873Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6059069Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6059292Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6059527Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6059752Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6059945Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6060202Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6060443Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6060668Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6060897Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6061121Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6061328Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.6061540Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.6061720Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.6061902Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.6062428Z E1204 10:51:24.201000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpvia3gzob/5y/c5ycqrb7j4uapqa7ucfmmec6ba2bhj74pglv26srlibasz6l67e7.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.6062577Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.6062795Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.6062959Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.6063269Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.6063407Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.6063666Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.6063811Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.6064067Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.6064229Z E1204 10:51:24.201000 578686 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.6064501Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.6064652Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.6064932Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.6065127Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.6065446Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6065736Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6065870Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6066367Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6066619Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6066851Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6067058Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6067265Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6067516Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6067738Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6067968Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6068189Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6068422Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6068644Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6068884Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6069110Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6069338Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6069563Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6069790Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6070013Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6070305Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6070544Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6070740Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6070961Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6071193Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6071413Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6071631Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6071851Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6072082Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6072304Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6072532Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6072757Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6072963Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.6073193Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.6073356Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.6073535Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.6073643Z E1204 10:51:24.201000 578686 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.6073699Z ('RERUN', {'yellow': True}) [11.1465s] [100%] 2025-12-04T12:10:19.6074040Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:51:26.610849036 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6074044Z 2025-12-04T12:10:19.6074191Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6074490Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.6074795Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6074929Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6075411Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6075666Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6075922Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6076133Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6076337Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6076569Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6076789Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6077020Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6077239Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6077482Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6077701Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6077935Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6078156Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6078383Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6078604Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6078813Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6079028Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6079227Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6079458Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6079680Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6079873Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6080459Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6080687Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6080907Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6081097Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6081318Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6081518Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6081705Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6081938Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6082135Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6082328Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6082551Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6082782Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6083004Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6083231Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6083466Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6083665Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6083877Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6084077Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6084309Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6084552Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6084780Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6085003Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6085233Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6085456Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6085684Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6085904Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6086145Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6086363Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6086594Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6086813Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6087045Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6087267Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6087507Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6087729Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6087954Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6088176Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6088402Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6088626Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6088873Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6089092Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6089299Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6089496Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6089727Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6089946Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6090216Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6090449Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6090674Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6090898Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6091123Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6091343Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6091572Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6091812Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6092042Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6092261Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6092460Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6092650Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6092873Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6093093Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6093305Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6093510Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6093737Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6093959Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6094157Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6094349Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6094579Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6094778Z E1204 10:51:26.161000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6094970Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6095189Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6095418Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6095637Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6095867Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6096099Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6096302Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6096517Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6096716Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6096954Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6097195Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6097402Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6097603Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6097808Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6098043Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6098270Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6098507Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6098737Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6098967Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6099188Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6099421Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6099645Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6099874Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6100150Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6100353Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6100550Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6100772Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6100979Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6101182Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6101412Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6101645Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6101868Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6102100Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6102321Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6102554Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6102781Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6103025Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6103251Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6103479Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6103705Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6103913Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6104113Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6104326Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6104537Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6104744Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:19.6104976Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6105200Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6105408Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6105628Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6105833Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6106063Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6106288Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6106517Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6106741Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6106974Z E1204 10:51:26.161000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6107216Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6107447Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6107669Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6107901Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6108121Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6108354Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6108592Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6108821Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6109045Z E1204 10:51:26.161000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6109274Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6109499Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6109728Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6109973Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6110251Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6110473Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6110677Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6110870Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6111095Z E1204 10:51:26.161000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6111324Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6111563Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6111798Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6112021Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6112255Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6112476Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6112706Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6112957Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6113190Z E1204 10:51:26.161000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6113415Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6113608Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6113833Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6114062Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6114319Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6114552Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6114775Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6114995Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6115198Z E1204 10:51:26.161000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6115403Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6115615Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6115853Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6116077Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6116291Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6116496Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6116695Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6116898Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6117140Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6117363Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6117567Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6117767Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6117964Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6118116Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6118361Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6118552Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6118778Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6119010Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6119231Z E1204 10:51:26.161000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6119434Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6119625Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6119859Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6120055Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6120295Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6120520Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6120715Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6120943Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6121134Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6121371Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6121563Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6121788Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6122016Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6122239Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6122474Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6122727Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6122959Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6123180Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6123412Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6123637Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6123865Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6124103Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6124302Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6124498Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6124719Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6124942Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6125147Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:19.6125344Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6125557Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6125787Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6126010Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6126237Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6126459Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6126654Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6126892Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6127127Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6127346Z E1204 10:51:26.161000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6127581Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6127802Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6128019Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6128239Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6128437Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6128636Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6128828Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6129052Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6129283Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6129507Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6129745Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6129967Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6130203Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6130424Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6130652Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6130872Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6131130Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6131357Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6131547Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6131770Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6131998Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6132220Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6132457Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6132679Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6132896Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6133097Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6133297Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6133499Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6133730Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6133964Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6134157Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6134379Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6134609Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6134830Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6135057Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6135302Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6135514Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6135718Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6135918Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6136119Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6136347Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6136575Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6136802Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6137022Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6137249Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6137470Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6137658Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6137878Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6138114Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6138337Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6138564Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6138783Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6138998Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6139228Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6139426Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6139628Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6139858Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6140077Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6140347Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6140573Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6140816Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6141037Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6141265Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6141489Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6141718Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6141936Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6142138Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.6142354Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.6142549Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.6142747Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.6142965Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.6143175Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.6143374Z E1204 
10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.6143592Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.6143786Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.6143961Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.6144090Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:19.6144240Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.6144343Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.6144473Z E1204 10:51:26.161000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.6144630Z [W1204 10:51:26.626333458 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6144643Z 2025-12-04T12:10:19.6144789Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6145089Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6145380Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6145513Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6145990Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6146246Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6146485Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6146689Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6146893Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6147120Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6147345Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6147597Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6147814Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6148045Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6148262Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6148491Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6148709Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6148936Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6149167Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6149365Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6149577Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6149778Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6150006Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6150256Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6150464Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6150684Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6150910Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6151129Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6151318Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6151542Z E1204 
10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6151765Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6151955Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6152178Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6152374Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6152563Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6152780Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6153008Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6153238Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6153467Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6153688Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6153888Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6154098Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6154298Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6154525Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6154762Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6154989Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6155210Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6155436Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6155657Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6155902Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6156125Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6156354Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6156574Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6156806Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6157025Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6157256Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6157485Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6157713Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6157933Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6158168Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6158393Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6158623Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6158855Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6159082Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6159306Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6159513Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6159711Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6159944Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6160231Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6160462Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6160686Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6160917Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6161142Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6161369Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6161592Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6161831Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6162054Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6162282Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6162503Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6162706Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6162897Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6163132Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6163330Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6163542Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.6163743Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6163974Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6164197Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6164414Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6164608Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6164827Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6165029Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6165219Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6165444Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6165675Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6165905Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6166137Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6166356Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6166559Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6166768Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6166973Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6167211Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6167444Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6167654Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6167854Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6168061Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6168288Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6168534Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6168766Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6168987Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6169219Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6169443Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6169682Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6169902Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6170189Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6170412Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6170612Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6170810Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6171031Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6171238Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6171439Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6171660Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6171897Z 
E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6172119Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6172352Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6172574Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6172831Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6173051Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6173285Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6173509Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6173738Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6173965Z 
E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6174170Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6174387Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6174580Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6174796Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6175005Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6175233Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6175456Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6175656Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6175868Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6176070Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6176305Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6176532Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6176759Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6177005Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6177232Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6177458Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6177687Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6177912Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6178145Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6178366Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6178609Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6178830Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6179062Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6179287Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6179516Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6179739Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6179978Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6180232Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6180460Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6180685Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6180892Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6181085Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6181339Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6181568Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6181793Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6182021Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6182244Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6182479Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6182698Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6182944Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6183166Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6183401Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6183623Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6183819Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6184042Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6184285Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6184513Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6184741Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6184969Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6185183Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6185392Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6185624Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6185825Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6186061Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6186281Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6186502Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6186706Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6186907Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6187125Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6187353Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6187581Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6187785Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6187989Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.6188183Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6188347Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6188573Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6188769Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6188994Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6189224Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6189450Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6189668Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6189866Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6190123Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6190321Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6190518Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6190740Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6190933Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6191169Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6191364Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6191588Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6191782Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6192009Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6192241Z E1204 10:51:26.165000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6192466Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6192709Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6192932Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6193164Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6193385Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6193617Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6193838Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6194093Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6194323Z E1204 10:51:26.165000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6194523Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6194720Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6194941Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6195158Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6195376Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6195581Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6195784Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6196019Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6196245Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6196474Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6196699Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6196902Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6197128Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6197361Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6197580Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6197816Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6198037Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6198273Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6198479Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6198680Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6198877Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6199067Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6199294Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6199531Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6199756Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6199984Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6200249Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6200446Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6200668Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6200902Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6201136Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6201370Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6201590Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6201786Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6202012Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6202241Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6202498Z E1204 10:51:26.165000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6202727Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6202956Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6203173Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6203380Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6203586Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6203800Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6204035Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6204259Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6204455Z E1204 10:51:26.165000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6204676Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6204908Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6205136Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6205381Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6205605Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6205820Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6206027Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6206229Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6206435Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6206689Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6206910Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6207144Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6207367Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6207601Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6207821Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6208027Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6208255Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6208484Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6208711Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6208939Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6209163Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6209381Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6209597Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6209805Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6210006Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6210274Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6210495Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6210728Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6210977Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6211206Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6211431Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6211665Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6211890Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6212121Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6212472Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6212676Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.6212882Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.6213078Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.6213276Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.6213497Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.6213706Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.6213921Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.6214120Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.6214314Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.6214491Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.6214619Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.6214770Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.6214875Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.6215026Z E1204 10:51:26.165000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.6215184Z [W1204 10:51:26.628999071 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6215187Z 2025-12-04T12:10:19.6215334Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6215636Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6215931Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6216067Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6216546Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6216815Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6217049Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6217256Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6217461Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6217692Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6217917Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6218166Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6218393Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6218627Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6218849Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6219084Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6219323Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6219555Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6219776Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6219977Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6220227Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6220430Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6220663Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6220900Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6221093Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6221314Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6221545Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6221769Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6221961Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6222184Z E1204 
10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6222397Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6222594Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6222813Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6223018Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6223211Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6223432Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6223687Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6223908Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6224142Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6224361Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6224564Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6224777Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6224989Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6225220Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6225442Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6225678Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6225898Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6226130Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6226354Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6226591Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6226816Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6227044Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6227268Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6227494Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6227734Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6227972Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6228194Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6228425Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6228645Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6228877Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6229097Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6229340Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6229564Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6229792Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6230018Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6230259Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6230461Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6230688Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6230927Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6231156Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6231376Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6231608Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6231825Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6232084Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6232308Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6232536Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6232760Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6232987Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6233210Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6233409Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6233625Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6233847Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6234044Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6234257Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.6234460Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6234696Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6234915Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6235127Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6235320Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6235542Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6235743Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6235932Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6236155Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6236401Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6236624Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6236855Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6237079Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6237283Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6237491Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6237706Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6237940Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6238167Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6238375Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6238575Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6238780Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6239009Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6239247Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6239476Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6239701Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6239933Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6240228Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6240487Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6240709Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6240943Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6241165Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6241372Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6241574Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6241796Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6242016Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6242215Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6242423Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6242654Z 
E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6242878Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6243111Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6243330Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6243576Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6243799Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6244031Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6244252Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6244485Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6244733Z 
E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6244935Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6245138Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6245331Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6245548Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6245751Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6245986Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6246230Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6246434Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6246638Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6246839Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6247073Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6247295Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6247527Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6247765Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6248061Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6248314Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6248554Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6249678Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6250172Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6250427Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6250666Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6250908Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6251152Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6251403Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6251659Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6251905Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6252154Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6252397Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6252648Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6252895Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6253103Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6253329Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6253565Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6253823Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6254072Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6254316Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6254564Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6254821Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6255073Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6255328Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6255557Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6255811Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6256041Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6256259Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6256506Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6256763Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6257007Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6257244Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6257499Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6257732Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6257972Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6258181Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6258403Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6258665Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6258908Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6259144Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6259375Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6259595Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6259830Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6260073Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6260356Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6260575Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6260795Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.6261008Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6261198Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6261444Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6261646Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6261893Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6262128Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6262392Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6262614Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6262830Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6263079Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6263281Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6263509Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6263741Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6263985Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6264214Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6264424Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6264684Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6264885Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6265138Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6265377Z E1204 10:51:26.168000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6265623Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6265892Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6266129Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6266379Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6266609Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6266857Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6267111Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6267377Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6267619Z E1204 10:51:26.168000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6267829Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6268035Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6268279Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6268550Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6268764Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6268987Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6269204Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6269463Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6269714Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6269962Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6270242Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6270457Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6270705Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6270960Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6271193Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6271447Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6271698Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6271943Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6272174Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6272386Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6272610Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6272816Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6273094Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6273355Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6273592Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6273844Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6274076Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6274302Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6274538Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6274806Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6275053Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6275292Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6310222Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6310470Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6310705Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6311002Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6311232Z E1204 10:51:26.168000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6311461Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6311686Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6311901Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6312107Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6312352Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6312557Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6312791Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6313011Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6313206Z E1204 10:51:26.168000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6313429Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6313679Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6313898Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6314129Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6314351Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6314565Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6314768Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6314966Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6315178Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6315412Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6315636Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6315866Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6316086Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6316315Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6316558Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6316752Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6316973Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6317201Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6317428Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6317657Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6317894Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6318107Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6318312Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6318515Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6318719Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6318951Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6319171Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6319414Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6319635Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6319865Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6320086Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6320356Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6320581Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6320833Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6321056Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6321256Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.6321464Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.6321657Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.6321855Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.6322087Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.6322296Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.6322498Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.6322693Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.6322890Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.6323063Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.6323195Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.6323343Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.6323462Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.6323595Z E1204 10:51:26.168000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.6323755Z [W1204 10:51:26.675182522 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6323758Z 2025-12-04T12:10:19.6323907Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6324209Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6324512Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6324644Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6325162Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6325421Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6325649Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6325862Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6326062Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6326301Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6326527Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6326761Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6326984Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6327209Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6327433Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6327658Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6327891Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6328122Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6328341Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6328542Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6328750Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6328974Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6329200Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6329422Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6329616Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6329836Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6330064Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6330323Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6330535Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6330753Z E1204 
10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6330954Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6331147Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6331365Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6331566Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6331753Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6331988Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6332216Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6332437Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6332667Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6332886Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6333087Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6333323Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6333528Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6333757Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6333979Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6334211Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6334428Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6334669Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6334887Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6335117Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6335338Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6335569Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6335791Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6336017Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6336248Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6336475Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6336697Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6336927Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6337145Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6337395Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6337614Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6337847Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6338066Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6338295Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6338518Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6338719Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6338937Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6339164Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6339386Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6339616Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6339837Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6340068Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6340322Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6340566Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6340784Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6341013Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6341232Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6341463Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6341716Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6341913Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6342105Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6342326Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6342529Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6342737Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.6342940Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6343181Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6343400Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6343601Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6343790Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6344012Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6344210Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6344403Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6344637Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6344866Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6345086Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6345313Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6345536Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6345735Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6345964Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6346166Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6346397Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6346622Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6346827Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6347029Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6347243Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6347474Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6347700Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6347931Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6348154Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6348384Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6348609Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6348850Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6349074Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6349302Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6349521Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6349723Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6349916Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6350196Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6350399Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6350601Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6350806Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6351042Z 
E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6351270Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6351513Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6351736Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6351965Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6352187Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6352417Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6352638Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6352870Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6353123Z 
E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6353330Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6353532Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6353726Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6353937Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6354140Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6354393Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6354613Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6354819Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6355019Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6355220Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6355456Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6355686Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6355916Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6356137Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6356368Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6356591Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6356819Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6357043Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6357287Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6357512Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6357741Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6357965Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6358197Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6358438Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6358669Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6358890Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6359118Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6359338Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6359570Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6359791Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6360003Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6360237Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6360460Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6360694Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6360914Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6361146Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6361368Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6361614Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6361837Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6362066Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6362290Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6362522Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6362768Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6362962Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6363183Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6363413Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6363633Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6363865Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6364087Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6364313Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6364523Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6364726Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6364929Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6365157Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6365382Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6365608Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6365812Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6366013Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6366214Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6366443Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6366665Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6366890Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6367089Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.6367283Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6367433Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6367654Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6367851Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6368071Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6368311Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6368532Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6368731Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6368926Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6369148Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6369351Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6369542Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6369779Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6369971Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6370227Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6370421Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6370641Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6370834Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6371087Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6371323Z E1204 10:51:26.214000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6371548Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6371777Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6372000Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6372229Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6372472Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6372699Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6372922Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6373153Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6373375Z E1204 10:51:26.214000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6373579Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6373767Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6374002Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6374218Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6374423Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6374625Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6374827Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6375058Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6375298Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6375529Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6375753Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6375947Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6376171Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6376402Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6376625Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6376865Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6377087Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6377300Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6377505Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6377708Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6377901Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6378105Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6378326Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6378556Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6378776Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6379006Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6379231Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6379439Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6379662Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6379891Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6380154Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6380384Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6380607Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6380800Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6381035Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6381267Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6381494Z E1204 10:51:26.214000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6381722Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6381946Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6382159Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6382380Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6382584Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6382786Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6383020Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6383242Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6383437Z E1204 10:51:26.214000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6383682Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6383915Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6384141Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6384368Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6384596Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6384810Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6385016Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6385228Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6385432Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6385666Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6385887Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6386121Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6386341Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6386585Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6386809Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6387007Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6387233Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6387461Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6387686Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6387940Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6388165Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6388381Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6388587Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6388790Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6388993Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6389227Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6389458Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6389691Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6389913Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6390183Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6390409Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6390638Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6390879Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6391109Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6391334Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6391534Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.6391742Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.6391937Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.6392159Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.6392379Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.6392588Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.6392790Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.6392984Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.6393184Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.6393361Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.6393503Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.6393654Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.6393760Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.6393890Z E1204 10:51:26.214000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.6394050Z [W1204 10:51:26.677303312 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6394053Z 2025-12-04T12:10:19.6394202Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6394500Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6394799Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6394945Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6395426Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6395683Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6395911Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6396123Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6396347Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6396575Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6396799Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6397028Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6397252Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6397479Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6397713Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6397943Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6398165Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6398396Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6398615Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6398817Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6399025Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6399239Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6399473Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6399691Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6399888Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6400168Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6400403Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6400648Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6400841Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6401062Z E1204 
10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6401262Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6401454Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6401673Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6401887Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6402075Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6402298Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6402531Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6402751Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6402981Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6403199Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6403413Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6403624Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6403830Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6404062Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6404281Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6404512Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6404760Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6404994Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6405213Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6405446Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6405671Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6405898Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6406130Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6406357Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6406581Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6406810Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6407028Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6407263Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6407481Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6407723Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6407942Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6408172Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6408393Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6408619Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6408860Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6409061Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6409261Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6409492Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6409713Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6409943Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6410216Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6410458Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6410677Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6410910Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6411129Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6411360Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6411584Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6411813Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6412047Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6412245Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6412438Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6412657Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6412856Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6413094Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.6413293Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6413524Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6413744Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6413946Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6414136Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6414357Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6414568Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6414757Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6414979Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6415207Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6415429Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6415656Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6415877Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6416090Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6416302Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6416505Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6416736Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6416959Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6417163Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6417390Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6417595Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6417824Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6418049Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6418278Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6418505Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6418745Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6418970Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6419204Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6419427Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6419657Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6419878Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6420081Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6420340Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6420566Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6420773Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6420973Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6421178Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6421407Z 
E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6421660Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6421890Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6422117Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6422354Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6422580Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6422813Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6423053Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6423289Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6423515Z 
E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6423720Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6423923Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6424116Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6424330Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6424543Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6424777Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6425002Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6425208Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6425411Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6425613Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6425863Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6426084Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6426317Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6426540Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6426769Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6426993Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6427232Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6427457Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6427685Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6427909Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6428140Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6428362Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6428592Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6428824Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6429055Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6429275Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6429504Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6429728Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6429977Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6430233Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6430434Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6430626Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6430846Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6431077Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6431299Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6431540Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6431762Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6431992Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6432215Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6432442Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6432663Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6432892Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6433127Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6433321Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6433541Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6433770Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6433993Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6434249Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6434472Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6434687Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6434891Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6435089Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6435293Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6435525Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6435766Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6435980Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6436183Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6436386Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6436586Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6436818Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6437040Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6437254Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6437456Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.6437649Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6437799Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6438020Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6438214Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6438453Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6438684Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6438907Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6439105Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6439297Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6439518Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6439728Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6439918Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6440181Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6440373Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6440595Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6440790Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6441011Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6441219Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6441439Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6441670Z E1204 10:51:26.216000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6441893Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6442120Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6442343Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6442597Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6442819Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6443052Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6443272Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6443502Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6443721Z E1204 10:51:26.216000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6443934Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6444124Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6444346Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6444562Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6444764Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6444965Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6445165Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6445409Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6445630Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6445861Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6446084Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6446274Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6446496Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6446741Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6446962Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6447191Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6447415Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6447632Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6447835Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6448036Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6448238Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6448430Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6448650Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6448881Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6449103Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6449330Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6449563Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6449754Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6449976Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6450238Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6450460Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6450691Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6450942Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6451133Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6451354Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6451583Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6451804Z E1204 10:51:26.216000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6452039Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6452261Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6452489Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6452693Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6452892Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6453097Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6453324Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6453546Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6453750Z E1204 10:51:26.216000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6453970Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6454198Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6454420Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6454649Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6454874Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6455107Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6455311Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6455509Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6455714Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6455941Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6456163Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6456394Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6456625Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6456855Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6457076Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6457268Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6457489Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6457718Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6457948Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6458175Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6458397Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6458613Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6458821Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6459020Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6460908Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6461138Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6461357Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6461584Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6461805Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6462035Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6462253Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6462496Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6462719Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6462946Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6463165Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6463362Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.6463565Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.6463770Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.6463967Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.6464180Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.6464386Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.6464584Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.6464776Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.6464989Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.6465160Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.6465287Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.6465430Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.6465534Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.6465664Z E1204 10:51:26.216000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.6465821Z [W1204 10:51:26.679401383 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6465823Z 2025-12-04T12:10:19.6465970Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6466265Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6466570Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6466702Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6467187Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6467443Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6467667Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6467886Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6468087Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6468318Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6468542Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6468768Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6468990Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6469249Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6469470Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6469697Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6469917Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6470170Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6470387Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6470598Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6470805Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6471005Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6471232Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6471453Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6471643Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6471859Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6472097Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6472315Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6472504Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6472721Z E1204 
10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6472919Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6473107Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6473348Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6473545Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6473734Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6473952Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6474178Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6474397Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6474624Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6474851Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6475049Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6475257Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6475458Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6475684Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6475904Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6476140Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6476359Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6476585Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6476803Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6477029Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6477246Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6477495Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6477715Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6477941Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6478161Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6478387Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6478604Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6478840Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6479058Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6479285Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6479503Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6479729Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6479947Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6480356Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6480589Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6480790Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6480985Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6481211Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6481429Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6481654Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6481898Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6482127Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6482344Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6482573Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6482795Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6483024Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6483256Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6483485Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6483706Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6483905Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6484097Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6484317Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6484518Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6484752Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.6484960Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6485193Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6485413Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6485612Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6485802Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6486045Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6486242Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6486435Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6486656Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6486884Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6487107Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6487348Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6487569Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6487767Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6487979Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6488183Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6488414Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6488638Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6488853Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6489056Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6489258Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6489493Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6489716Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6489947Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6490234Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6490462Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6490686Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6490913Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6491139Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6491373Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6491607Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6491812Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6492005Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6492229Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6492431Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6492634Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6492839Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6493080Z 
E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6493306Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6493535Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6493761Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6493990Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6494214Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6494466Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6494687Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6494919Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6495141Z 
E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6495350Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6495549Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6495755Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6495969Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6496173Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6496410Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6496632Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6496838Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6497038Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6497252Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6497486Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6497707Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6497939Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6498160Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6498393Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6498634Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6498867Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6499094Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6499323Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6499550Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6499777Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6500020Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6500279Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6500503Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6500739Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6500961Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6501191Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6501411Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6501654Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6501878Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6502078Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6502275Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6502495Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6502729Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6502976Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6503209Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6503434Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6503664Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6503887Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6504115Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6504354Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6504581Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6504808Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6505005Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6505228Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6505459Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6505679Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6505923Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6506146Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6506364Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6506569Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6506771Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6506996Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6507225Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6507451Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6507666Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6507873Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6508078Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6508278Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6508523Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6508742Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6508949Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6509150Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.6509346Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6509499Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6509723Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6509931Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6510183Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6510416Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6510637Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6510838Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6511032Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6511277Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6511480Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6511671Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6511899Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6512091Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6512316Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6512510Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6512744Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6512939Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6513160Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6513394Z E1204 10:51:26.218000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6513614Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6513847Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6514085Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6514316Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6514541Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6514770Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6514994Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6515225Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6515472Z E1204 10:51:26.218000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6515675Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6515866Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6516088Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6516304Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6516512Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6516717Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6516929Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6517161Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6517382Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6517615Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6517835Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6518031Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6518253Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6518493Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6518718Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6518948Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6519171Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6519384Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6519613Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6519817Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6520011Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6520241Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6520462Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6520694Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6520917Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6521167Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6521392Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6521584Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6521810Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6522038Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6522261Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6522488Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6522727Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6522920Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6523143Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6523376Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6523599Z E1204 10:51:26.218000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6523854Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6524074Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6524290Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6524498Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6524698Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6524903Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6525130Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6525367Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6525558Z E1204 10:51:26.218000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6525784Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6526017Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6526236Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6526470Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6526700Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6526919Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6527121Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6527323Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6527530Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6527761Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6528005Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6528234Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6528461Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6528688Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6528913Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6529109Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6529330Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6529573Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6529794Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6530028Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6530279Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6530494Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6530700Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6530915Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6531124Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6531352Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6531577Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6531808Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6532031Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6532305Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6532526Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6532760Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6532981Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6533214Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6533440Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6533638Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.6533857Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.6534049Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.6534250Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.6534470Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.6534679Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.6534880Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.6535072Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.6535280Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.6535454Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.6535587Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.6535734Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.6535841Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.6535968Z E1204 10:51:26.218000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.6536027Z ('RERUN', {'yellow': True}) [1.6042s] [100%] 2025-12-04T12:10:19.6536385Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:51:27.044891619 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6536393Z 2025-12-04T12:10:19.6536538Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6536840Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6537133Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6537271Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6537752Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6538020Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6538248Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6538456Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6538660Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6538891Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6539115Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6539358Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6539579Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6539811Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6540029Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6540291Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6540510Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6540763Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6540987Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6541186Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6541402Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6541604Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6541837Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6542069Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6542263Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6542486Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6542713Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6542934Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6543124Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6543347Z E1204 
10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6543559Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6543751Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6543971Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6544170Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6544363Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6544584Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6544837Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6545055Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6545284Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6545503Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6545701Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6545914Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6546112Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6546351Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6546568Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6546797Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6547017Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6547242Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6547463Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6547712Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6547934Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6548160Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6548380Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6548607Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6548824Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6549079Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6549297Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6549526Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6549743Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6549972Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6550222Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6550468Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6550687Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6550914Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6551135Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6551336Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6551534Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6551762Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6551998Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6552228Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6552447Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6552676Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6552895Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6553123Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6553368Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6553593Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6553814Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6554039Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6554260Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6554458Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6554660Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6554880Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6555077Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6555288Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.6555488Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6555717Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6555937Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6556143Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6556334Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6556554Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6556755Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6556945Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6557168Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6557418Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6557638Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6557867Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6558084Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6558282Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6558490Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6558691Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6558931Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6559156Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6559360Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6559560Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6559762Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6559991Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6560270Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6560501Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6560721Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6560950Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6561170Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6561402Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6561647Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6561877Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6562099Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6562297Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6562491Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6562711Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6562915Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6563132Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6563335Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6563567Z 
E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6563788Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6564018Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6564237Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6564480Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6564702Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6564935Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6565161Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6565388Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6565615Z 
E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6565838Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6566044Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6566238Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6566451Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6566658Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6566888Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6567112Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6567326Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6567529Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6567731Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6567965Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6568194Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6568422Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6568658Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6568887Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6569112Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6569340Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6569566Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6569798Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6570038Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6570305Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6570529Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6570761Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6570982Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6571215Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6571454Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6571683Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6571910Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6572141Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6572366Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6572572Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6572765Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6573002Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6573232Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6573457Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6573686Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6573909Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6574142Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6574388Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6574621Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6574843Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6575075Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6575296Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6575491Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6575726Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6575955Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6576181Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6576410Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6576634Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6576848Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6577056Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6577271Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6577474Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6577706Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6577927Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6578145Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6578349Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6578577Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6578785Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6579013Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6579242Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6579447Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6579650Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.6579855Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6580007Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6580265Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6580458Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6580685Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6580914Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6581137Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6581349Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6581544Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6581771Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6581969Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6582162Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6582383Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6582616Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6582838Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6583034Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6583260Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6583451Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6583677Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6583909Z E1204 10:51:27.584000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6584146Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6584372Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6584594Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6584824Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6585043Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6585277Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6585498Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6585740Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6585960Z E1204 10:51:27.584000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6586166Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6586362Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6586584Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6586822Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6587025Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6587228Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6587430Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6587664Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6587890Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6588119Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6588355Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6588546Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6588770Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6588999Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6589222Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6589454Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6589675Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6589905Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6590141Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6590345Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6590538Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6590735Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6590986Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6591213Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6591437Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6591667Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6591890Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6592086Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6592307Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6592551Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6592773Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6593007Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6593228Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6593421Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6593644Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6593871Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6594107Z E1204 10:51:27.584000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6594337Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6594562Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6594776Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6594984Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6595213Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6595414Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6595646Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6595866Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6596060Z E1204 10:51:27.584000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6596282Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6596513Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6596747Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6596976Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6597202Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6597504Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6603307Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6603543Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6603754Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6604029Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6604259Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6604491Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6604716Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6604947Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6605200Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6605398Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6605622Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6605853Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6606076Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6606307Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6606529Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6606762Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6606968Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6607168Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6607373Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6607605Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6607828Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6608059Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6608291Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6608523Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6608747Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6608977Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6609200Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6609456Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6609683Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6609888Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.6610151Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.6610343Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.6610544Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.6610760Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.6610986Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.6611188Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.6611381Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.6611581Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.6611755Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.6611887Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.6612036Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.6612145Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.6612290Z E1204 10:51:27.584000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.6612450Z [W1204 10:51:27.047165077 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6612455Z 2025-12-04T12:10:19.6612605Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6612906Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6613207Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6613342Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6613870Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6614130Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6614357Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6614568Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6614772Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6615003Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6615244Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6615477Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6615701Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6615927Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6616151Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6616385Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6616620Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6616853Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6617080Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6617286Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6617495Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6617700Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6617947Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6618168Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6618360Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6618581Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6618814Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6619037Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6619231Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6619461Z E1204 
10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6619663Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6619853Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6620076Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6620316Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6620506Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6620729Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6620977Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6621202Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6621430Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6621652Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6621852Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6622091Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6622295Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6622525Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6643322Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6643560Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6643787Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6644021Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6644265Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6644495Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6644725Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6645039Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6647201Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6647434Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6647681Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6647912Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6648136Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6648368Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6648587Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6648816Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6649067Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6649293Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6649515Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6649739Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6649965Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6650194Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6650400Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6650652Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6650870Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6651103Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6651319Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6651550Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6651768Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6652021Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6652242Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6652469Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6652692Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6652920Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6653142Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6653402Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6653594Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6653819Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6654015Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6654229Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.6654429Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6654660Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6654890Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6655093Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6655286Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6655506Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6655705Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6655893Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6656126Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6656356Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6656577Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6656809Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6657025Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6657223Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6657451Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6657651Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6657881Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6658102Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6658304Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6658504Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6658706Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6658945Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6659165Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6659394Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6659618Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6659845Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6660065Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6660344Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6660565Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6660793Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6661012Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6661211Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6661402Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6661650Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6661855Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6662054Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6662254Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6662481Z 
E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6662703Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6662931Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6663163Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6663391Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6663610Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6663840Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6664060Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6664287Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6664519Z 
E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6664721Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6685468Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6685661Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6685871Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6686071Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6686327Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6686546Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6686750Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6686949Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6687148Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6687378Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6687599Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6687841Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6688061Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6688288Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6688508Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6688733Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6688956Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6689213Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6689433Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6689660Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6689878Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6690134Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6690353Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6690607Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6690826Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6691055Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6691276Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6691503Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6691725Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6691937Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6692128Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6692349Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6692575Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6692794Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6693021Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6693240Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6693480Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6693702Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6693928Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6694146Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6694372Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6694591Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6694804Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6695022Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6695250Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6695472Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6695701Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6695920Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6696144Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6696345Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6696542Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6696744Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6696971Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6697191Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6697403Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6697614Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6697815Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6698014Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6698247Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6698468Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6698674Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6698895Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.6699091Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6699241Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6699468Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6699665Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6699888Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6700169Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6700406Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6700610Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6700802Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6701029Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6701229Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6701420Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6701646Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6701853Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6702080Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6702272Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6702499Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6702694Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6702916Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6703176Z E1204 10:51:27.586000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6703399Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6703631Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6703852Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6704084Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6704307Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6704559Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6725338Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6725574Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6725799Z E1204 10:51:27.586000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6725997Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6726194Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6726418Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6726649Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6726857Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6727056Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6727264Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6727497Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6727748Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6727978Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6728199Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6728393Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6728615Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6728849Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6729072Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6729318Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6729544Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6729759Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6729965Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6730211Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6730409Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6730600Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6730842Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6731076Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6731297Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6731530Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6731751Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6731948Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6732195Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6732430Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6732653Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6732880Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6733106Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6733297Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6733533Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6733762Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6733988Z E1204 10:51:27.586000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6734224Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6734444Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6734665Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6734867Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6735079Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6735288Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6735518Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6735745Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6735936Z E1204 10:51:27.586000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6736163Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6736415Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6736641Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6736872Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6737093Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6737312Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6737513Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6737727Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6737929Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6738162Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6738387Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6738615Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6738839Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6739067Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6739301Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6739493Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6739717Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6739948Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6740211Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6740484Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6740707Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6740924Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6741129Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6741334Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6741539Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6741768Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6742009Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6742238Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6742462Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6742692Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6742915Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6743148Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6743369Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6743618Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6743838Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6744039Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.6744241Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.6744437Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.6744659Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.6744873Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.6745084Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.6745286Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.6745486Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.6745681Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.6745857Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.6745999Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.6746145Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.6746253Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.6746382Z E1204 10:51:27.586000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.6746545Z [W1204 10:51:27.049267048 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6746549Z 2025-12-04T12:10:19.6746697Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6746995Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6747292Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6747440Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6747935Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6748191Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6748420Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6748627Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6748854Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6749086Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6749309Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6749540Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6749761Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6749995Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6750244Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6750489Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6750710Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6750940Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6751161Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6751362Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6751574Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6751788Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6752023Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6752246Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6752440Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6752661Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6752888Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6753146Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6753334Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6753558Z E1204 
10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6753758Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6753947Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6754170Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6754368Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6754573Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6754791Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6755022Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6755249Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6755475Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6755701Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6755897Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6756120Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6756323Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6756556Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6756780Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6757008Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6757259Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6757486Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6757709Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6757936Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6758159Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6758391Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6758610Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6758858Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6759077Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6759316Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6759537Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6759764Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6759988Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6760271Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6760491Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6760720Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6760940Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6761174Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6761394Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6761632Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6761833Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6762062Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6762286Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6762515Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6762740Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6762967Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6763206Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6763440Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6763664Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6763896Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6764115Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6764346Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6764579Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6764781Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6764975Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6765194Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6765394Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6765604Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.6765831Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6766061Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6766285Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6766487Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6766677Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6766902Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6767099Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6767304Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6767522Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6767754Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6767978Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6768208Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6768430Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6768627Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6768854Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6769058Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6769293Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6769520Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6769724Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6769948Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6770195Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6770431Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6770652Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6770885Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6771112Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6771340Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6771579Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6771807Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6772031Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6772265Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6772487Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6772693Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6772887Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6773132Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6773338Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6773542Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6773744Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6773977Z 
E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6774224Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6774453Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6774679Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6774909Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6775135Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6775367Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6775587Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6775829Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6776050Z 
E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6776257Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6776456Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6776651Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6776866Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6777070Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6777314Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6777535Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6777744Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6777944Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6778149Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6778403Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6778624Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6778854Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6779074Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6779307Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6779528Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6779757Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6779989Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6780247Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6780469Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6780698Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6780921Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6781148Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6781382Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6781614Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6781835Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6782065Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6782284Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6782514Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6782759Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6782960Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6783154Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6783373Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6783603Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6783826Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6784056Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6784292Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6784520Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6784742Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6784969Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6785192Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6785419Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6785652Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6785846Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6786069Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6786301Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6786521Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6786753Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6786996Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6787212Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6787418Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6787617Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6787821Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6788050Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6788292Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6788505Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6788709Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6788911Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6789111Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6789342Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6789563Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6789780Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6789980Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.6790234Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6790385Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6790608Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6790805Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6791052Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6791283Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6791504Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6791704Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6791897Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6792117Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6792318Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6792521Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6792745Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6792937Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6793161Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6793352Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6793573Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6793764Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6793997Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6794229Z E1204 10:51:27.588000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6794447Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6794679Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6794904Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6795136Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6795380Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6795609Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6795829Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6796057Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6796280Z E1204 10:51:27.588000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6796481Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6796681Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6796903Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6797117Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6797324Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6797523Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6797726Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6797957Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6798190Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6798424Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6798645Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6798843Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6799064Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6799300Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6799550Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6799780Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6800006Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6800298Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6800508Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6800708Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6800921Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6801116Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6801338Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6801572Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6801794Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6802028Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6802248Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6802457Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6802683Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6802911Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6803137Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6803365Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6803592Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6803817Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6804044Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6804279Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6804500Z E1204 10:51:27.588000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6804734Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6804954Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6805182Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6805389Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6805592Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6805799Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6806028Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6806257Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6806449Z E1204 10:51:27.588000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6806684Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6806918Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6807138Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6807374Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6807593Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6807812Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6808036Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6808242Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6808450Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6808680Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6808906Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6809134Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6809370Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6809598Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6809825Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6810021Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6810283Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6810516Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6810738Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6810984Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6811206Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6811423Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6811630Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6811830Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6812037Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6812293Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6812520Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6812748Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6812974Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6813211Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6813434Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6813680Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6813901Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6814134Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6814356Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6814559Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.6814767Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.6814960Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.6815175Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.6815393Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.6815605Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.6815804Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.6816001Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.6816200Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.6816394Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.6816526Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.6816673Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.6816782Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.6816910Z E1204 10:51:27.588000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.6817072Z [W1204 10:51:27.094321315 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6817075Z 2025-12-04T12:10:19.6817222Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6817521Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6817826Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6817955Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6818439Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6818691Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6818919Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6819131Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6819343Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6819578Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6819804Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6820038Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6820292Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6820558Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6820781Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6821009Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6821233Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6821460Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6821682Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6821881Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6822111Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6822317Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6822544Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6822767Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6822957Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6823181Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6823408Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6823647Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6823840Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6824060Z E1204 
10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6824265Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6824456Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6824698Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6824896Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6825091Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6825312Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6825541Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6825766Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6825993Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6826227Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6826425Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6826639Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6826845Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6827072Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6827293Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6827518Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6827752Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6827980Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6828202Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6828432Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6828652Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6828904Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6829124Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6829354Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6829575Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6829804Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6830028Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6830292Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6830531Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6830757Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6830983Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6831215Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6831434Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6831665Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6831897Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6832105Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6832301Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6832536Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6832758Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6832985Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6833235Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6833466Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6833688Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6833915Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6834137Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6834367Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6834586Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6834826Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6835046Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6835249Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6835439Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6835666Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6835868Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6836095Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.6836302Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6836531Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6836753Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6836950Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6837143Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6837387Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6837584Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6837775Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6837993Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6838224Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6838443Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6838672Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6838902Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6839098Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6839309Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6839511Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6839743Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6839966Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6840222Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6840442Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6840644Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6840875Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6841096Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6841325Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6841570Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6841799Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6842022Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6842249Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6842473Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6842702Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6842926Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6843139Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6843335Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6843562Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6843768Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6843971Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6844173Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6844405Z 
E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6844638Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6844872Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6845095Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6845323Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6845547Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6845796Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6846021Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6846250Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6846475Z 
E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6846682Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6846883Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6847076Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6847296Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6847498Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6847726Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6847950Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6848153Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6848352Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6848555Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6848793Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6849017Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6849249Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6849468Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6849698Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6849936Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6850204Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6850425Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6850653Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6850877Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6851105Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6851329Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6851576Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6851800Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6852028Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6852251Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6852482Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6852700Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6852941Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6853161Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6853363Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6853554Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6853776Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6854007Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6854255Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6854485Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6854705Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6854934Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6855154Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6855385Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6855606Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6855845Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6856067Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6856258Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6856480Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6856707Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6856929Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6857169Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6857389Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6857605Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6857807Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6858010Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6858216Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6858465Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6858689Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6858904Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6859109Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6859309Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6859514Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6859746Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6859978Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6860226Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6860429Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.6860627Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6860778Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6861005Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6861197Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6861437Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6861672Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6861892Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6862096Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6862288Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6862541Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6862740Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6862936Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6863160Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6863352Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6863578Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6863769Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6864008Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6864199Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6864424Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6864660Z E1204 10:51:27.633000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6864883Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6865115Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6865335Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6865578Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6865803Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6866032Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6866259Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6866486Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6866715Z E1204 10:51:27.633000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6866941Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6867139Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6867362Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6867578Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6867787Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6867988Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6868205Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6868436Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6868661Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6868893Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6869114Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6869313Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6869535Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6869780Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6870003Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6870272Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6870498Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6870712Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6870920Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6871148Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6871345Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6871540Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6871767Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6872000Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6872224Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6872470Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6872693Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6872887Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6873109Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6873341Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6873567Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6873799Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6874037Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6874229Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6874454Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6874683Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6874908Z E1204 10:51:27.633000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6875141Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6875382Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6875601Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6875807Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6876015Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6876217Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6876450Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6876687Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6876879Z E1204 10:51:27.633000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6877105Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6877337Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6877562Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6877791Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6878014Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6878244Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6878450Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6878655Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6878859Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6879092Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6879314Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6879567Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6879793Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6880025Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6880292Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6880485Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6880709Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6880955Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6881176Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6881408Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6881632Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6881848Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6882051Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6882256Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6882475Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6882705Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6882931Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6883161Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6883385Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6883616Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6883877Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6884110Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6884331Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6884565Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6884788Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6884991Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.6885212Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.6885407Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.6885609Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.6885826Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.6886040Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.6886240Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.6886438Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.6886642Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.6886821Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.6886952Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.6887099Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.6887206Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.6887334Z E1204 10:51:27.633000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.6887496Z [W1204 10:51:27.096412756 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.6887499Z 2025-12-04T12:10:19.6887644Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.6887964Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.6888260Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.6888393Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.6888876Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.6889132Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.6889374Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.6889583Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.6889791Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6890027Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6890291Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6890523Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6890742Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6890986Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6891207Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6891440Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6891663Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6891893Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6892142Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6892342Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6892557Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6892757Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6892988Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6893214Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6893404Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6893646Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6893872Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6894098Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6894289Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6894513Z E1204 
10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6894715Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6894903Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6895138Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6895337Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6895529Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6895749Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6895978Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6896203Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6896453Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6896679Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6896876Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6897087Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6897289Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6897522Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6897756Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6897983Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6898208Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6898438Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6898661Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6898888Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6899113Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6899359Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6899579Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6899808Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6900029Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6900306Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6900553Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6900783Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6901006Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6901235Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6901460Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6901686Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6901908Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6902149Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6902373Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6902585Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6902783Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.6903017Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6903240Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6903471Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6903709Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6903936Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6904161Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6904387Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6904611Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6904857Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6905082Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6905315Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6905536Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6905739Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6905931Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6906155Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6906365Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6906577Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.6906783Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6907012Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6907235Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6907434Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6907629Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6907862Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6908064Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6908257Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6908478Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6908710Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6908951Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6909183Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6909402Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6909602Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6909815Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6910018Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6910288Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6910524Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6910732Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6910934Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6911140Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6911371Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6911593Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6911830Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6912065Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6912298Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6912520Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6912755Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6912979Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6913234Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6913460Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6913662Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6913856Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6914078Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6914287Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6914494Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6914715Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6914946Z 
E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6915168Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6915402Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6915623Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6915856Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6916082Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6916323Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6916549Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6916778Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6917003Z 
E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6917207Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6917431Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6917627Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6917838Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.6918042Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6918272Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6918498Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6918701Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6918920Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6919126Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6919358Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6919583Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6919812Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6920039Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6920319Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6920560Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6920794Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6921016Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6921251Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6921474Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6921748Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6921974Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6922202Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6922425Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6922653Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6922880Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6923108Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6923345Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6923579Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6923803Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6924009Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6924200Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6924425Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6924654Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6924893Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6925126Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6925348Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6925580Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6925805Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6926057Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6926277Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6926513Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6926739Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6926932Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6927159Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6927388Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6927623Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6927851Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6928081Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6928299Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6928502Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6928706Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6928920Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6929156Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6929377Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6929596Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6929802Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6930001Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6930278Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6930507Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6930732Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6930934Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6931136Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.6931335Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6931484Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.6931722Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6931915Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6932142Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6932372Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6932598Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6932803Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6932994Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6933230Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6933428Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6933623Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6933844Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6934038Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6934264Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6934474Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6934700Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6934893Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6935115Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6935346Z E1204 10:51:27.635000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6935570Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6935814Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6936035Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6936269Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6936491Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6936725Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6936946Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6937180Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6937416Z E1204 10:51:27.635000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6937617Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.6937811Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6938032Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6938250Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6938455Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6938679Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6938884Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6939115Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6939343Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6939574Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6939798Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6940005Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6940261Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6940496Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6940719Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6940951Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6941173Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6941390Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6941613Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6941816Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6942011Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.6942203Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6942427Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6942659Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6942911Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6943145Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6943366Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.6943562Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6943786Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6944023Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6944246Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6944492Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6944719Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6944912Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6945137Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6945366Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6945590Z E1204 10:51:27.635000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6945828Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6946056Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6946274Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6946479Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6946683Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6946886Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6947144Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6947365Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6947561Z E1204 10:51:27.635000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6947786Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6948014Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6948242Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6948485Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6948709Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6948924Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6949131Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6949336Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6949540Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6949771Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.6950003Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6950271Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6950493Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6950726Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6950948Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6951139Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.6951387Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6951616Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6951840Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6952068Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.6952291Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.6952506Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.6952721Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.6952924Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.6953126Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.6953357Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7021214Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7021491Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7021723Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7022015Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7022245Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7022480Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7022705Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7022940Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7023164Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7023404Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7023611Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7023810Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7024013Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7024231Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7024446Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7024664Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7024861Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7025062Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7025241Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7025375Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7025525Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7025636Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7025765Z E1204 10:51:27.635000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7025930Z [W1204 10:51:27.098553275 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7025944Z 2025-12-04T12:10:19.7026094Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7026400Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7026702Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7026839Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7027337Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7027634Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7027867Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7028078Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7028284Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7028518Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7028741Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7028986Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7029208Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7029441Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7029664Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7029894Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7030151Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7030382Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7030627Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7030827Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7031039Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7031243Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7031478Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7031729Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7031922Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7032148Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7032375Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7032598Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7032790Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7033015Z E1204 
10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7033231Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7033420Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7033644Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7033843Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7034038Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7034262Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7034496Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7034732Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7034963Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7035185Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7035384Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7035596Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7035799Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7036049Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7036273Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7036505Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7036729Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7036960Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7037184Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7037426Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7037650Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7037884Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7038105Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7038338Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7038560Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7038794Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7039028Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7039261Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7039484Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7039713Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7039937Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7040239Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7040465Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7040697Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7040918Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7041127Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7041326Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7041561Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7041795Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7042027Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7042252Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7042484Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7042707Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7042936Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7043157Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7043402Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7043627Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7043861Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7044082Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7044286Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7044504Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7044726Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7044927Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7045139Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.7045345Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7045577Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7045801Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7046012Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7046204Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7046425Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7046627Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7046819Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7047039Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7047272Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7047503Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7047739Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7047965Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7048166Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7048377Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7048579Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7048832Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7049053Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7049262Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7049464Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7049670Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7049904Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7050175Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7050409Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7050631Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7050867Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7051088Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7051322Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7051548Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7051793Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7052017Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7052219Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7052417Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7052637Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7052846Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7053075Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7053278Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7053513Z 
E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7053734Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7053968Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7054190Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7054436Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7054664Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7054893Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7055121Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7055350Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7055575Z 
E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7055780Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7055993Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7056191Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7056404Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7056610Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7056842Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7057068Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7057291Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7057493Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7057699Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7057931Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7058158Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7058386Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7058623Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7058854Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7059082Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7059316Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7059537Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7059769Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7059991Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7060276Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7060503Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7060732Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7060957Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7061187Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7061444Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7061677Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7061904Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7062137Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7062360Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7062566Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7062759Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7063000Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7063230Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7063456Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7063693Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7063915Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7064148Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7064369Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7064619Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7064842Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7065074Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7065299Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7065494Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7065744Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7065974Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7066201Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7066431Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7066657Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7066879Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7067083Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7067298Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7067501Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7067734Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7067958Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7068178Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7068388Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7068599Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7068804Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7069035Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7069261Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7069464Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7069666Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.7069884Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7070034Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7070292Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7070489Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7070714Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7070944Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7071167Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7071386Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7071577Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7071800Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7071999Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7072192Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7072414Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7072609Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7072847Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7073037Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7073260Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7073453Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7073678Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7073909Z E1204 10:51:27.637000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7074158Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7074391Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7074613Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7074846Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7075070Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7075302Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7075547Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7075778Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7076003Z E1204 10:51:27.637000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7076203Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7076396Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7076618Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7076835Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7077055Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7077258Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7077463Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7077692Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7077915Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7078145Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7078390Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7078584Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7078806Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7079037Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7079260Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7079493Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7079714Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7079942Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7080187Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7080388Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7080582Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7080775Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7080999Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7081241Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7081467Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7081699Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7081922Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7082115Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7082337Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7082597Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7082819Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7083050Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7083273Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7083465Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7083688Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7083923Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7084162Z E1204 10:51:27.637000 
578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7084392Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7084616Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7084833Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7085036Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7085239Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7085454Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7085686Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7085906Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7086101Z E1204 10:51:27.637000 578686 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7086324Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7086554Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7086804Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7087033Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7087257Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7087471Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7087677Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7087881Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7088082Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7088325Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7088548Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7088781Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7089001Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7089233Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7089457Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7089660Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7089885Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7090143Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7090368Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7090598Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7090825Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7091073Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7091276Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7091480Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7091680Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7091912Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7092137Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7092367Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7092603Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7092834Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7093060Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7093292Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7093519Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7093750Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7093985Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7094189Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7094393Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7094590Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7094788Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7095009Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7095244Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7095444Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7095642Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7095835Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7096013Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7096142Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7096292Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7096398Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7096543Z E1204 10:51:27.637000 578686 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7096587Z FAILED [1.4323s] [100%] 2025-12-04T12:10:19.7096593Z 2025-12-04T12:10:19.7096656Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.7096813Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.7096864Z Traceback (most recent call last): 2025-12-04T12:10:19.7097037Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.7097083Z method(*args, **kwargs) 2025-12-04T12:10:19.7097240Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.7097282Z method(*args, **kwargs) 2025-12-04T12:10:19.7097438Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.7097478Z with policy(): 2025-12-04T12:10:19.7097635Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.7097679Z raise RuntimeError(msg) 2025-12-04T12:10:19.7098089Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1973420032. 
2025-12-04T12:10:19.7098094Z 2025-12-04T12:10:19.7098172Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.7098437Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.7098441Z 2025-12-04T12:10:19.7098534Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.7098614Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.7098661Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.7098722Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.7099306Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.7099410Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.7099451Z graph_break [] 2025-12-04T12:10:19.7099519Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.7099600Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.7100120Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.7100174Z current_size = base.storage().size() 2025-12-04T12:10:19.7100218Z Autotune Choices Stats: 2025-12-04T12:10:19.7100600Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.01568000018596649, "best_triton_pos": 0} 2025-12-04T12:10:19.7100691Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.7100744Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.7100869Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.7101113Z triton_mm_35 0.0157 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7101349Z triton_mm_15 0.0162 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7101581Z triton_mm_34 0.0174 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7101811Z triton_mm_13 0.0176 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7102058Z triton_mm_14 0.0177 ms 88.7% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7102285Z triton_mm_31 0.0186 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7102519Z triton_mm_33 0.0188 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7102744Z triton_mm_32 0.0194 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7103001Z triton_mm_16 0.0195 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.7103229Z triton_mm_29 0.0216 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.7103363Z SingleProcess AUTOTUNE benchmarking takes 0.2001 seconds and 8.5377 seconds precompiling for 33 choices 2025-12-04T12:10:19.7103515Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.7103563Z Traceback (most recent call last): 2025-12-04T12:10:19.7103726Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.7103769Z 
method(*args, **kwargs) 2025-12-04T12:10:19.7103929Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.7103972Z method(*args, **kwargs) 2025-12-04T12:10:19.7104127Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.7104167Z with policy(): 2025-12-04T12:10:19.7104336Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.7104378Z raise RuntimeError(msg) 2025-12-04T12:10:19.7104776Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1973420032 and is now 3017801728. 2025-12-04T12:10:19.7104780Z 2025-12-04T12:10:19.7104860Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.7105123Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.7105126Z 2025-12-04T12:10:19.7105217Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.7105292Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.7105338Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.7105397Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.7105952Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), 
('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.7106065Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.7106105Z graph_break [] 2025-12-04T12:10:19.7106172Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.7106248Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.7106737Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.7106787Z current_size = base.storage().size() 2025-12-04T12:10:19.7106831Z Autotune Choices Stats: 2025-12-04T12:10:19.7107235Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.01568000018596649, "best_triton_pos": 0} 2025-12-04T12:10:19.7107311Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.7107364Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.7107487Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.7107724Z triton_mm_35 0.0157 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7107958Z triton_mm_15 0.0162 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7108188Z triton_mm_34 0.0174 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7108428Z triton_mm_13 0.0176 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7108655Z triton_mm_14 0.0177 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7108883Z triton_mm_31 0.0186 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7109111Z triton_mm_33 0.0188 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7109337Z triton_mm_32 0.0194 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7109561Z triton_mm_16 0.0195 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.7109797Z triton_mm_29 0.0216 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.7109928Z SingleProcess AUTOTUNE benchmarking takes 0.2001 seconds and 8.5377 seconds precompiling for 33 choices 2025-12-04T12:10:19.7110005Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.7110048Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.7110149Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.7110249Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.7110707Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.7110748Z graph_break [] 2025-12-04T12:10:19.7110845Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.7110921Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.7111287Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:19.7111384Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:19.7111426Z Autotune Choices Stats: 2025-12-04T12:10:19.7111910Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.013319999910891056, "best_triton_pos": 1, "best_triton_time": 0.015999000519514084, "best_triton_kernel": "triton_mm_73", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.7111981Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.7112035Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.7112169Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.7112214Z _scaled_mm 0.0133 ms 100.0% 2025-12-04T12:10:19.7112449Z triton_mm_73 0.0160 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7112682Z triton_mm_53 0.0164 ms 81.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7112911Z triton_mm_52 0.0175 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7113138Z triton_mm_72 0.0175 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:19.7113364Z triton_mm_51 0.0178 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7113604Z triton_mm_69 0.0187 ms 71.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7113834Z triton_mm_71 0.0190 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7114060Z triton_mm_54 0.0196 ms 68.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.7114285Z triton_mm_70 0.0196 ms 68.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7114420Z SingleProcess AUTOTUNE benchmarking takes 0.2901 seconds and 0.7768 seconds precompiling for 39 choices 2025-12-04T12:10:19.7114475Z =================================== FAILURES =================================== 2025-12-04T12:10:19.7114644Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.7114693Z Traceback (most recent call last): 2025-12-04T12:10:19.7114854Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.7114898Z method(*args, **kwargs) 2025-12-04T12:10:19.7115054Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.7115097Z method(*args, **kwargs) 2025-12-04T12:10:19.7115254Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.7115294Z with policy(): 2025-12-04T12:10:19.7115450Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.7115497Z raise RuntimeError(msg) 2025-12-04T12:10:19.7115891Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 3017801728 and is now 3982491648. 2025-12-04T12:10:19.7115904Z 2025-12-04T12:10:19.7115984Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.7116245Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.7116248Z 2025-12-04T12:10:19.7116341Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.7116415Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.7116461Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.7116520Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.7117081Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), 
('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.7117186Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.7117237Z graph_break [] 2025-12-04T12:10:19.7117306Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.7117381Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.7117872Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.7117922Z current_size = base.storage().size() 2025-12-04T12:10:19.7117969Z Autotune Choices Stats: 2025-12-04T12:10:19.7118341Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.01568000018596649, "best_triton_pos": 0} 2025-12-04T12:10:19.7118416Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.7118468Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.7118613Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.7118859Z triton_mm_35 0.0157 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=4 2025-12-04T12:10:19.7119092Z triton_mm_15 0.0162 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7119322Z triton_mm_34 0.0174 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7119552Z triton_mm_13 0.0176 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7119782Z triton_mm_14 0.0177 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7120023Z triton_mm_31 0.0186 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7120417Z triton_mm_33 0.0188 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7120649Z triton_mm_32 0.0194 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7120876Z triton_mm_16 0.0195 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.7121106Z triton_mm_29 0.0216 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.7121237Z SingleProcess AUTOTUNE benchmarking takes 0.2001 seconds and 8.5377 seconds precompiling for 33 choices 2025-12-04T12:10:19.7121336Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.7121383Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.7121442Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.7121547Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.7122002Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.7122046Z graph_break [] 2025-12-04T12:10:19.7122111Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.7122190Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.7122581Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:19.7122680Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:19.7122723Z Autotune Choices Stats: 2025-12-04T12:10:19.7123202Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.013319999910891056, "best_triton_pos": 1, "best_triton_time": 0.015999000519514084, "best_triton_kernel": "triton_mm_73", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.7123278Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.7123330Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.7123454Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.7123499Z _scaled_mm 0.0133 ms 100.0% 2025-12-04T12:10:19.7123739Z triton_mm_73 0.0160 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7123983Z triton_mm_53 0.0164 ms 81.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7124212Z triton_mm_52 0.0175 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7124441Z triton_mm_72 0.0175 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:19.7124673Z triton_mm_51 0.0178 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7124903Z triton_mm_69 0.0187 ms 71.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7125131Z triton_mm_71 0.0190 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7125370Z triton_mm_54 0.0196 ms 68.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.7125597Z triton_mm_70 0.0196 ms 68.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7125731Z SingleProcess AUTOTUNE benchmarking takes 0.2901 seconds and 0.7768 seconds precompiling for 39 choices 2025-12-04T12:10:19.7125807Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.7125856Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.7125917Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.7126019Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.7126497Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), 
('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.7126536Z graph_break [] 2025-12-04T12:10:19.7126606Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.7126683Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.7126730Z Autotune Choices Stats: 2025-12-04T12:10:19.7127200Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.01351999957114458, "best_triton_pos": 1, "best_triton_time": 0.01611899957060814, "best_triton_kernel": "triton_mm_111", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.7127272Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.7127326Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.7127452Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.7127509Z _scaled_mm 0.0135 ms 100.0% 2025-12-04T12:10:19.7127746Z triton_mm_111 0.0161 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7127976Z triton_mm_91 0.0164 ms 82.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7128205Z triton_mm_89 0.0175 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7128437Z triton_mm_110 0.0176 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7128662Z triton_mm_90 0.0180 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7128894Z triton_mm_107 0.0185 ms 73.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7129143Z triton_mm_109 0.0186 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7129369Z triton_mm_92 0.0193 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.7129601Z triton_mm_108 0.0197 ms 68.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.7129731Z SingleProcess AUTOTUNE benchmarking takes 0.2972 seconds and 0.6313 seconds precompiling for 39 choices 2025-12-04T12:10:19.7129932Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-5c5ceebf91026326.xml - 
2025-12-04T12:10:19.7129996Z =========================== short test summary info ============================ 2025-12-04T12:10:19.7130671Z FAILED [1.4323s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 3017801728 and is now 3982491648. 2025-12-04T12:10:19.7130674Z 2025-12-04T12:10:19.7130754Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.7131015Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.7131018Z 2025-12-04T12:10:19.7131110Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.7131178Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.7131254Z ================= 1 failed, 187 deselected, 2 rerun in 14.20s ================== 2025-12-04T12:10:19.7131292Z Got exit code 1 2025-12-04T12:10:19.7131337Z Retrying single test... 
2025-12-04T12:10:19.7131500Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e6dc9d8fc1e220b8.xml 2025-12-04T12:10:19.7131564Z ============================= test session starts ============================== 2025-12-04T12:10:19.7131679Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.7131725Z cachedir: .pytest_cache 2025-12-04T12:10:19.7131889Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.7131943Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.7131985Z configfile: pytest.ini 2025-12-04T12:10:19.7132158Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.7132241Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.7132497Z stepcurrent: skipping 78 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.7132546Z Running 1 items in this shard 2025-12-04T12:10:19.7132548Z 2025-12-04T12:10:19.7132881Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:51:36.415953151 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7132897Z 2025-12-04T12:10:19.7133060Z [W1204 10:51:44.826457479 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7133063Z 2025-12-04T12:10:19.7133219Z [W1204 10:51:44.831049774 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.7133225Z 2025-12-04T12:10:19.7133377Z [W1204 10:51:44.860531820 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7133380Z 2025-12-04T12:10:19.7133700Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.7134003Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7134164Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7134652Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7134913Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7135146Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7135358Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7135565Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7135814Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7136043Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7136280Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7136502Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7136741Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7136962Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7137207Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7137430Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7137659Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7137882Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7138109Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7138335Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7138587Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7138811Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7139004Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7139226Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7139460Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7139679Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7139884Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7140134Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7140370Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7140593Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7140827Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7141051Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7141255Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7141488Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7141651Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7141837Z E1204 
10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7142365Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpk3ww659b/ts/cts7xnawxifeshgr42gxswlkp4hvueix4d7ckiglfk3rqvhm7u2t.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.7142519Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.7142765Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.7142925Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.7143222Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.7143358Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.7143621Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.7143763Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.7144024Z E1204 10:51:44.361000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.7144198Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.7144467Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.7144608Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.7144885Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.7145085Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.7145404Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7145701Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7145846Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7146333Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7146591Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7146819Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7147050Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7147257Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7147488Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7147715Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7147945Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7148169Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7148399Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7148636Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7148867Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7149088Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7149320Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7149540Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7149773Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7149998Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7150268Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7150490Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7150681Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7150906Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7151134Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7151391Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7151583Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7151804Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7152036Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7152259Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7152491Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7152712Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7152933Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7153149Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7153311Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7153499Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7153605Z E1204 10:51:44.361000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.7153768Z [W1204 10:51:44.865673378 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.7153770Z 2025-12-04T12:10:19.7154080Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.7154395Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7154537Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7155023Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7155279Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7155507Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7155741Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7155948Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7156180Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7156406Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7156638Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7156865Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7157105Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7157329Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7157559Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7157781Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7158014Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7158236Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7158468Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7158699Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7158933Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7159157Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7159348Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7159571Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7159820Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7160045Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7160279Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7160502Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7160733Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7160956Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7161189Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7161426Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7161635Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7161849Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7162014Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7162198Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7162721Z E1204 10:51:44.403000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpk3ww659b/6y/c6ynvaswxkvzl4is2ajbl6iame4d5qle75ci427cnjmhyffijoo5.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.7162892Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.7163108Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.7163269Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.7163559Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.7163698Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.7163961Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.7164102Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.7164386Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.7164545Z E1204 10:51:44.403000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.7164819Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.7164955Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.7165239Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.7165438Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.7165765Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7166066Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7166199Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7166683Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7166940Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7167180Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7167393Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7167595Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7167830Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7168054Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7168291Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7168545Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7168774Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7168998Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7169226Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7169451Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7169683Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7169905Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7170182Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7170404Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7170638Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7170858Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7171053Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7171277Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7171520Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7171746Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7171938Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7172162Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7172389Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7172614Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7172876Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7173096Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7173304Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7173516Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7173680Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7173861Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7173966Z E1204 10:51:44.403000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.7174290Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.7174585Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7174718Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7175202Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7175455Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7175681Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7175900Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7176103Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7176332Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7176555Z E1204 10:51:44.405000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7176784Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7177029Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7177260Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7177481Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7177711Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7177931Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7178161Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7178382Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7178622Z E1204 10:51:44.405000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7178844Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7179073Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7179294Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7179489Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7179712Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7179951Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7180204Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7180396Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7180617Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7180847Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7181066Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7181319Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7181540Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7181745Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7181959Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7182119Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7182301Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7182826Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpk3ww659b/ue/cuepztjx5hwc6cn35lr4lupsh2ngqa5sbge7z2tciyg6wm4mh2nc.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.7182988Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.7183203Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.7183362Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.7183651Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.7183784Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.7184044Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.7184207Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.7184467Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.7184623Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.7184895Z E1204 10:51:44.405000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.7185031Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.7185309Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.7185531Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.7185846Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7186143Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7186275Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7186763Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7187019Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7187255Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7187464Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7187668Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7187899Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7188123Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7188352Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7188586Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7188816Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7189037Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7189264Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7189485Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7189715Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7189954Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7190218Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7190438Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7190668Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7190896Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7191090Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7191325Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7191552Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7191774Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7191966Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7192190Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7192423Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7192643Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7192886Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7193108Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7193313Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7193526Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7193689Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7193869Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7194004Z E1204 10:51:44.405000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.7194319Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.7194615Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7194750Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7195233Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7195492Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7195736Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7195945Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7196152Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7196381Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7196609Z E1204 10:51:44.406000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7196840Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7197075Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7197309Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7197529Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7197766Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7197987Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7198220Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7198465Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7198693Z E1204 10:51:44.406000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7198922Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7199152Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7199378Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7199569Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7199814Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7200049Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7200309Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7200505Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7200724Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7200960Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7201180Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7201429Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7201653Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7201855Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7202074Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7202236Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7202422Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7202969Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpk3ww659b/5y/c5ycqrb7j4uapqa7ucfmmec6ba2bhj74pglv26srlibasz6l67e7.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.7203121Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.7203341Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.7203499Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.7203793Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.7203927Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.7204203Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.7204344Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.7204607Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.7204770Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.7205040Z E1204 10:51:44.406000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.7205181Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.7205458Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.7205667Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.7205984Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7206280Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7206416Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7206921Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7207179Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7207407Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7207617Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7207824Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7208055Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7208281Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7208524Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7208749Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7208980Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7209207Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7209440Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7209660Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7209892Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7210159Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7210393Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7210614Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7210848Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7211074Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7211292Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7211518Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7211749Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7211974Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7212165Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7212391Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7212625Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7212859Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7213091Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7213312Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7213520Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7213735Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7213900Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7214086Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7214204Z E1204 10:51:44.406000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.7214518Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.7214811Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7214946Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7215424Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7215715Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7215948Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7216155Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7216360Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7216592Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7216818Z E1204 10:51:44.407000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7217069Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7217289Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7217521Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7217741Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7217974Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7218196Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7218430Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7218666Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7218893Z E1204 10:51:44.407000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7219116Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7219344Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7219571Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7219783Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7220008Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7220275Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7220497Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7220694Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7220917Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7221148Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7221382Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7221615Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7221839Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7222044Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7222258Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7222420Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7222603Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7223150Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpk3ww659b/uz/cuzyocrxsruvbc5g4v6sbp2do5rkwrogymtc32osajsldsdqu5fj.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.7223299Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.7223519Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.7223677Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.7223969Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.7224128Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.7224388Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.7224531Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.7224785Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.7224947Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.7225222Z E1204 10:51:44.407000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.7225362Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.7225648Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.7225846Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.7226163Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7226457Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7226592Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7227074Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7227343Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7227573Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7227783Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7227989Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7228220Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7228467Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7228696Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7228918Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7229153Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7229376Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7229607Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7229838Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7230068Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7230361Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7230593Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7230817Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7231046Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7231267Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7231478Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7231704Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7231933Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7232157Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7232350Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7232569Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7232829Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7233049Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7233278Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7233497Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7233704Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7233921Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7234098Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7234279Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7234383Z E1204 10:51:44.407000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.7234543Z [W1204 10:51:44.869961808 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.7234545Z 2025-12-04T12:10:19.7234857Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.7235152Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7235285Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7235764Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7236031Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7236259Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7236467Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7236673Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7236920Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7237145Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7237374Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7237595Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7237822Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7238046Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7238277Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7238509Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7238736Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7238957Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7239186Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7239407Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7239636Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7239870Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7240060Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7240321Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7240550Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7240773Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7240962Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7241209Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7241440Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7241660Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7241889Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7242109Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7242315Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7242526Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7242701Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7242883Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7243408Z E1204 10:51:44.409000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpk3ww659b/c5/cc5ggiuivr5snfqf5gl4meztqartblipylsohf34cpn77emtpxxo.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.7243557Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.7243772Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.7243929Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.7244233Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.7244368Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.7244627Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.7244768Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.7245027Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.7245184Z E1204 10:51:44.409000 584582 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.7245477Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.7245612Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.7245889Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.7246085Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.7246399Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7246695Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7246836Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7247320Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7247579Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7247805Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7248015Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7248215Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7248462Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7248687Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7248915Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7249138Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7249367Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7249587Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7249837Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7250060Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7250328Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7250549Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7250782Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7251002Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7251254Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7251473Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7251668Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7251893Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7252122Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7252344Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7252533Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7252770Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7252999Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7253221Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7253453Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7253674Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7253907Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.7254120Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.7254286Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.7254467Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.7254573Z E1204 10:51:44.409000 584582 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.7254630Z ('RERUN', {'yellow': True}) [10.4912s] [100%] 2025-12-04T12:10:19.7254968Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:51:46.552127279 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7254971Z 2025-12-04T12:10:19.7255119Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7255427Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.7255726Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7255858Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7256342Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7256599Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7256838Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7257050Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7257250Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7257484Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7257706Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7257937Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7261738Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7261967Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7262192Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7262426Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7262654Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7262886Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7263105Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7263326Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7263538Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7263743Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7263972Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7264196Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7264390Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7264626Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7264859Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7265079Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7265271Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7265490Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7265694Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7265912Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7266132Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7266336Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7266525Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7266750Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7268980Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7269211Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7269459Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7269684Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7269885Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7270158Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7270362Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7270595Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7270818Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7271066Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7271294Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7271525Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7271747Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7271975Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7272214Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7272448Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7272670Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7272900Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7273120Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7273467Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7273692Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7273934Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7274155Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7274384Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7274608Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7274839Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7275065Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7275305Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7275525Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7275734Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7275936Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7276168Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7276389Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7276633Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7276855Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7277083Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7277305Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7277534Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7277775Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7278005Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7278242Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7278473Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7278693Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7278895Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7279085Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7279310Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7279518Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7279731Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7279934Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7280199Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7280425Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7280625Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7280833Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7281056Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7281258Z E1204 10:51:46.106000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7281451Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7281671Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7281918Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7282137Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7282380Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7282601Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7282806Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7283021Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7283223Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7283461Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7283683Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7283902Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7284105Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7284313Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7284547Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7284769Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7285024Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7285247Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7285481Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7285702Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7285934Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7286173Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7286403Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7286640Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7286842Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7287037Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7287263Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7287472Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7287679Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7287880Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7288121Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7288345Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7288579Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7288800Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7289033Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7289270Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7289503Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7289729Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7289958Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7290251Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7290474Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7290677Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7290894Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7291106Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7291312Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:19.7291542Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7291769Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7291974Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7292178Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7292400Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7292632Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7292860Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7293091Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7293318Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7293562Z E1204 10:51:46.106000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7293786Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7294020Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7294243Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7294478Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7294711Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7294943Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7295178Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7295406Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7295633Z E1204 10:51:46.106000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7295863Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7296088Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7296318Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7296556Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7296791Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7297015Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7297221Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7297412Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7297637Z E1204 10:51:46.106000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7297880Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7298105Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7298338Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7298563Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7298796Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7299029Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7299272Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7299495Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7299727Z E1204 10:51:46.106000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7299956Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7300189Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7300416Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7300647Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7300902Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7301137Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7301357Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7301576Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7301781Z E1204 10:51:46.106000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7301986Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7302204Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7302438Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7302664Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7302879Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7303089Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7303308Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7303529Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7303758Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7303985Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7304192Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7304394Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7304593Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7304743Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7304970Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7305178Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7305407Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7305638Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7305863Z E1204 10:51:46.106000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7306072Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7306275Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7306504Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7306709Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7314488Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7314717Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7314910Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7315177Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7315386Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7315612Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7315804Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7316029Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7316261Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7316485Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7316716Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7316951Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7317186Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7317411Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7317641Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7317865Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7318107Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7318332Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7318533Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7318724Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7318948Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7319163Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7319381Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:19.7319594Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7319801Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7320032Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7320291Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7320522Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7320745Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7320939Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7321175Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7321406Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7321629Z E1204 10:51:46.106000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7321863Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7322086Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7322314Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7322520Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7322721Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7322917Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7323108Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7323333Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7323577Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7323820Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7324051Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7324274Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7324468Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7324690Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7324923Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7325146Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7325386Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7325611Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7325802Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7326025Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7326258Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7326484Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7326725Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7326948Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7327165Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7327370Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7327573Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7327786Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7328029Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7328252Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7328444Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7328671Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7328901Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7329128Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7329356Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7329589Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7329806Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7330010Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7330400Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7330603Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7330834Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7331077Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7331309Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7331532Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7331762Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7331989Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7332195Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7332432Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7332663Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7332887Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7333121Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7333345Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7333561Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7333765Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7333980Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7334185Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7334416Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7334640Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7334870Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7335103Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7335332Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7335558Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7335791Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7336013Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7336254Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7336476Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7336692Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7336896Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7337092Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7337296Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7337511Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7337725Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7337927Z E1204 
10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7338134Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7338331Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7338506Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7338637Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:19.7338786Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7338896Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7339026Z E1204 10:51:46.106000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7339190Z [W1204 10:51:46.570859126 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7339204Z 2025-12-04T12:10:19.7339353Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7339659Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7339960Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7340130Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7340643Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7340911Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7341143Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7341352Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7341559Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7341790Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7342013Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7342247Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7342482Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7342715Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7342937Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7343168Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7343392Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7343633Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7343856Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7344056Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7344270Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7344474Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7344720Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7344943Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7345259Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7345484Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7345713Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7345938Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7346128Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7346351Z E1204 
10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7346553Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7346754Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7346979Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7347177Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7347373Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7347592Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7347833Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7348056Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7348286Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7348509Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7348709Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7348921Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7349162Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7349407Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7349628Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7349856Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7350083Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7350340Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7350564Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7350793Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7351030Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7351263Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7351486Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7351717Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7351939Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7352181Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7352402Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7352634Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7352856Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7353084Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7353322Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7353551Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7353790Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7354022Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7354242Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7354450Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7354647Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7354880Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7355112Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7355342Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7355565Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7355794Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7356018Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7356247Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7356491Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7356721Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7356944Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7357174Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7357395Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7357608Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7357799Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7358035Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7358234Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7358451Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.7358658Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7358887Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7359112Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7359324Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7359516Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7359738Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7359938Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7360164Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7360386Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7360636Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7360858Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7361089Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7361310Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7361512Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7361724Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7361940Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7362189Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7362413Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7362622Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7362825Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7363031Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7363265Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7363490Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7363742Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7363964Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7364196Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7364420Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7364651Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7364885Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7365118Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7365343Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7365543Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7365737Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7365960Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7366179Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7366391Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7366594Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7366827Z 
E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7367050Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7367284Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7367509Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7367743Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7367978Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7368209Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7368436Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7368667Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7368891Z 
E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7369108Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7369314Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7369513Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7369725Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7369933Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7370200Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7370444Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7370662Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7370866Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7371072Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7371302Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7371527Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7371759Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7371986Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7372235Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7372461Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7372693Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7372917Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7373151Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7373386Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7373618Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7373844Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7374075Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7374306Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7374547Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7374772Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7375013Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7375236Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7375471Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7375694Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7375899Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7376090Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7376328Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7376562Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7376789Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7377021Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7377242Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7377474Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7377709Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7377941Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7378164Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7378398Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7378625Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7378829Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7379054Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7379294Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7379520Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7379749Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7379976Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7380231Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7380435Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7380652Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7380858Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7381097Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7381320Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7381537Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7381745Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7381958Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7382164Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7382396Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7382620Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7382826Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7383041Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.7383239Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7383403Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7383627Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7383820Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7384045Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7384274Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7384499Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7384702Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7384905Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7385132Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7385334Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7385530Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7385754Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7385950Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7386187Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7386378Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7386605Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7386797Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7387024Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7387265Z E1204 10:51:46.110000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7387509Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7387745Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7387968Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7388202Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7388425Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7388660Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7388881Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7389125Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7389351Z E1204 10:51:46.110000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7389553Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7389751Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7389975Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7390274Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7390494Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7390699Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7390909Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7391141Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7391369Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7391612Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7391852Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7392047Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7392271Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7392505Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7392729Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7392963Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7393185Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7393420Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7393626Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7393828Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7394027Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7394219Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7394447Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7394691Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7394918Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7395151Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7395374Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7395570Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7395805Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7396048Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7396271Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7396506Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7396734Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7396927Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7397152Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7397383Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7397619Z E1204 10:51:46.110000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7397850Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7398075Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7398297Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7398501Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7398706Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7398921Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7399156Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7399378Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7399573Z E1204 10:51:46.110000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7399800Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7400041Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7400314Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7400545Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7400770Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7400986Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7401196Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7401400Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7401605Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7401851Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7402074Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7402307Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7402531Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7402765Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7402990Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7403196Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7403421Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7403659Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7403887Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7404117Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7404363Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7404594Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7404797Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7405000Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7405204Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7405440Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7405663Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7405900Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7406134Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7406364Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7406586Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7406814Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7407040Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7407280Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7407503Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7407706Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7407909Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7408109Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7408309Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7408536Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7408757Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7408956Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7409152Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7409346Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7409521Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7409649Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7409891Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7409997Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7410177Z E1204 10:51:46.110000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7410336Z [W1204 10:51:46.573176034 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7410341Z 2025-12-04T12:10:19.7410489Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7410788Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7411085Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7411219Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7411716Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7411974Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7412202Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7412410Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7412615Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7412859Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7413098Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7413329Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7413549Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7413781Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7414001Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7414231Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7414451Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7414697Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7414920Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7415122Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7415332Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7415533Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7415773Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7415994Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7416188Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7416409Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7416637Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7416882Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7417072Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7417309Z E1204 
10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7417510Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7417702Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7417926Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7418124Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7418315Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7418535Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7418775Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7418994Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7419224Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7419447Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7419645Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7419857Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7420075Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7420345Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7420566Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7420796Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7421018Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7421258Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7421494Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7421722Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7421945Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7422174Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7422395Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7422625Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7422845Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7423088Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7423308Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7423538Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7423759Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7423991Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7424228Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7424456Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7424678Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7424906Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7425127Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7425340Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7425541Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7425785Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7426005Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7426238Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7426460Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7426692Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7426915Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7427161Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7427386Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7427616Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7427841Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7428068Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7428295Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7428508Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7428700Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7428928Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7429126Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7429342Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.7429555Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7429793Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7430025Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7430255Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7430449Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7430671Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7430876Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7431066Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7431290Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7431537Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7431762Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7431993Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7432212Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7432415Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7432625Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7432843Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7433079Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7433309Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7433517Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7433719Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7433936Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7434181Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7434406Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7434637Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7434865Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7435098Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7435324Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7435557Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7435796Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7436029Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7436254Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7436456Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7436652Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7436886Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7437096Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7437298Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7437507Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7437744Z 
E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7437966Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7438210Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7438443Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7438676Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7438901Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7439135Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7439362Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7439593Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7439821Z 
E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7440037Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7440282Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7440476Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7440691Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7440898Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7441130Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7441370Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7441577Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7441783Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7441987Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7442222Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7442461Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7442705Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7442929Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7443160Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7443387Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7443617Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7443846Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7444079Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7444315Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7444550Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7444774Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7445006Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7445230Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7445472Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7445697Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7445927Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7446150Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7446380Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7446618Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7446825Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7447028Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7447256Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7447487Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7447714Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7447944Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7448169Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7448411Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7448633Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7448870Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7449095Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7449329Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7449551Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7449757Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7449983Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7450247Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7450474Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7450704Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7450945Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7451162Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7451396Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7451603Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7451805Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7452039Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7452265Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7452484Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7452700Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7452903Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7453108Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7453339Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7453569Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7453775Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7453992Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.7454188Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7454341Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7454569Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7454761Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7454985Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7455224Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7455460Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7455662Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7455859Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7456085Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7456284Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7456480Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7456701Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7456904Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7457128Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7457321Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7457546Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7457737Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7457965Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7458207Z E1204 10:51:46.112000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7458432Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7458664Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7458888Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7459124Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7459359Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7459602Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7459823Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7460058Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7460325Z E1204 10:51:46.112000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7460531Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7460726Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7460949Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7461181Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7461387Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7461594Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7461798Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7462031Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7462255Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7462499Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7462724Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7462917Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7463141Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7463372Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7463610Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7463855Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7464077Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7464295Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7464499Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7464704Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7464900Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7465094Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7465327Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7465557Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7465780Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7466010Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7466232Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7466424Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7466660Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7466891Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7467114Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7467344Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7467566Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7467776Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7468009Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7468238Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7468462Z E1204 10:51:46.112000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7468693Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7468915Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7469130Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7469338Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7469549Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7469752Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7469984Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7470252Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7470445Z E1204 10:51:46.112000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7470668Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7470917Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7471143Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7471374Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7471601Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7471817Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7472038Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7472252Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7472458Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7472692Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7472914Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7473149Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7473373Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7473605Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7473846Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7474044Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7474268Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7474497Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7474720Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7474949Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7475183Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7475398Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7475604Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7475806Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7476011Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7476253Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7476485Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7476716Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7476938Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7477169Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7477395Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7477625Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7477847Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7478087Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7478316Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7478516Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7478723Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7478916Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7479116Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7479347Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7479555Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7479757Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7479950Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7480182Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7480371Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7480500Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7480669Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7480775Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7480906Z E1204 10:51:46.112000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7481064Z [W1204 10:51:46.617308944 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7481066Z 2025-12-04T12:10:19.7481214Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7481512Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7481809Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7481942Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7482436Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7482694Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7482923Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7483131Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7483334Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7483584Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7483808Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7484036Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7484259Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7484488Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7484733Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7484973Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7485196Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7485425Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7485646Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7485847Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7486057Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7486261Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7486503Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7486725Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7486920Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7487140Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7487374Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7487618Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7487810Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7488035Z E1204 
10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7488234Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7488430Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7488651Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7488862Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7489062Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7489286Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7489520Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7489743Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7489974Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7490237Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7490439Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7490664Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7490869Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7491102Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7491322Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7491554Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7491777Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7492026Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7492250Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7492480Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7492704Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7492933Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7493169Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7493410Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7493634Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7493867Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7494089Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7494321Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7494542Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7494773Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7495004Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7495236Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7495459Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7495689Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7495913Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7496127Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7496329Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7496560Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7496782Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7497014Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7497251Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7497484Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7497715Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7497947Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7498169Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7498401Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7498624Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7498858Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7499101Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7499301Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7499499Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7499720Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7499922Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7500269Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.7500489Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7500723Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7500944Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7501149Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7501339Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7501563Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7501782Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7501984Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7502207Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7502437Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7502663Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7502892Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7503117Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7503320Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7503547Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7503753Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7503987Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7504213Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7504419Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7504624Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7504840Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7505070Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7505298Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7505528Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7505756Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7505995Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7506231Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7506463Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7506686Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7506923Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7507147Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7507356Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7507548Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7507787Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7507997Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7508198Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7508406Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7508637Z 
E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7508864Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7509106Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7509334Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7509570Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7509792Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7510027Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7510299Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7510546Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7510770Z 
E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7510974Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7511180Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7511374Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7511592Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7511794Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7512040Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7512266Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7512473Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7512679Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7512883Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7513117Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7513352Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7513587Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7513814Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7514044Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7514272Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7514513Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7514760Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7514990Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7515218Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7515451Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7515673Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7515904Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7516126Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7516368Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7516596Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7516831Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7517057Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7517288Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7517523Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7517725Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7517923Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7518146Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7518380Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7518617Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7518845Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7519082Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7519313Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7519540Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7519773Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7519995Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7520260Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7520495Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7520693Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7520916Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7521148Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7521372Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7521604Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7521843Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7522059Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7522266Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7522466Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7522672Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7522917Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7523140Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7523370Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7523574Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7523782Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7523985Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7524218Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7524443Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7524660Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7524861Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.7525058Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7525211Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7525433Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7525629Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7525860Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7526107Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7526334Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7526535Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7526733Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7526955Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7527172Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7527379Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7527599Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7527794Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7528018Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7528217Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7528442Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7528635Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7528869Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7529099Z E1204 10:51:46.156000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7529327Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7529557Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7529781Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7530013Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7530279Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7530514Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7530741Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7530974Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7531197Z E1204 10:51:46.156000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7531422Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7531625Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7531849Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7532068Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7532273Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7532477Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7532681Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7532915Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7533151Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7533384Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7533614Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7533806Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7534030Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7534261Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7534496Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7534726Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7534952Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7535171Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7535376Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7535589Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7535785Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7535993Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7536215Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7536448Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7536674Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7536905Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7537132Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7537339Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7537566Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7537796Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7538023Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7538256Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7538478Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7538684Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7538906Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7539139Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7539359Z E1204 10:51:46.156000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7539593Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7539828Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7540054Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7540297Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7540499Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7540706Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7540937Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7541161Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7541356Z E1204 10:51:46.156000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7541594Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7541829Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7542051Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7542285Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7542511Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7542726Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7543020Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7543226Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7543437Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7543670Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7543896Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7544143Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7544366Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7544614Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7544836Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7545031Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7545253Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7545485Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7545708Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7545949Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7546174Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7546391Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7546600Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7546800Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7547008Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7547261Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7547483Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7547717Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7547940Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7548175Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7548406Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7548652Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7548878Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7549108Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7549335Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7549535Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7549744Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7549938Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7550189Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7550408Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7550616Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7550821Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7551017Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7551219Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7551405Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7551538Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7551686Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7551791Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7551922Z E1204 10:51:46.156000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7552081Z [W1204 10:51:46.619374185 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7552084Z 2025-12-04T12:10:19.7552233Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7552565Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7552884Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7553018Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7553510Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7553769Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7553998Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7554209Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7554425Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7554660Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7554886Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7555117Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7555341Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7555572Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7555808Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7556040Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7556262Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7556495Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7556717Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7556930Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7557150Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7557356Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7557586Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7557814Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7558008Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7558228Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7558458Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7558690Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7558886Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7559106Z E1204 
10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7559308Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7559502Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7559722Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7559937Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7560162Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7560385Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7560612Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7560835Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7561083Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7561316Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7561516Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7561728Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7561932Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7562165Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7562396Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7562628Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7562869Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7563104Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7563323Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7563556Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7563777Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7564010Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7564247Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7564478Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7564704Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7564934Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7565158Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7565398Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7565629Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7565860Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7566080Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7566314Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7566534Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7566766Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7566991Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7567206Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7567408Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7567637Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7567862Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7568092Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7568327Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7568558Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7568780Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7569014Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7569238Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7569483Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7569705Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7569954Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7570210Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7570410Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7570605Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7570826Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7571030Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7571240Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.7571463Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7571696Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7571922Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7572126Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7572317Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7572540Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7572752Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7572946Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7573172Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7573401Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7573628Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7573870Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7574106Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7574304Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7574518Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7574723Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7574956Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7575185Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7575388Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7575603Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7575809Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7576046Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7576271Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7576503Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7576728Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7576968Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7577194Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7577425Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7577651Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7577887Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7578121Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7578350Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7578541Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7578766Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7578972Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7579174Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7579379Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7579609Z 
E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7579843Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7580073Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7580323Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7580555Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7580778Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7581012Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7581246Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7581477Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7581701Z 
E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7581907Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7582108Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7582318Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7582545Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7582750Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7582984Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7583206Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7583413Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7583614Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7583818Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7584066Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7584290Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7584522Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7584745Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7584980Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7585207Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7585447Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7585672Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7585903Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7586129Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7586361Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7586595Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7586836Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7587058Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7587293Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7587517Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7587748Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7587970Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7588202Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7588436Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7588639Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7588834Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7589058Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7589289Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7589527Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7589761Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7589986Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7590257Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7590483Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7590727Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7590951Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7591193Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7591418Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7591614Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7591838Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7592070Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7592293Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7592526Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7592764Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7592982Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7593190Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7593389Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7593596Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7593839Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7594067Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7594286Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7594490Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7594693Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7594895Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7595143Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7595375Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7595583Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7595788Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.7595983Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7596135Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7596362Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7596557Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7596790Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7597025Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7597247Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7597451Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7597645Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7597869Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7598083Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7598277Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7598507Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7598700Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7598928Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7599133Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7599355Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7599559Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7599780Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7600014Z E1204 10:51:46.158000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7600264Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7600497Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7600722Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7600969Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7601194Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7601424Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7601650Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7601882Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7602104Z E1204 10:51:46.158000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7602324Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7602516Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7604267Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7604489Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7604701Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7604925Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7605131Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7605381Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7605603Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7605836Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7606060Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7606255Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7606482Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7606725Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7606954Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7607186Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7607412Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7607628Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7607838Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7608055Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7608250Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7608446Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7608668Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7608902Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7609220Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7609455Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7609691Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7609882Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7610143Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7610375Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7610600Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7610832Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7611072Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7611267Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7611492Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7611727Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7611949Z E1204 10:51:46.158000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7612185Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7612429Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7612648Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7612856Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7613057Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7613262Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7613506Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7613734Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7613940Z E1204 10:51:46.158000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7614167Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7614402Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7614624Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7614858Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7615082Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7615314Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7615519Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7615722Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7615929Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7616162Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7616388Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7616628Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7616854Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7617086Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7617313Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7617509Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7617741Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7617975Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7618210Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7618444Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7618668Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7618886Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7619093Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7619295Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7619512Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7619743Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7619968Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7620230Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7620457Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7620693Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7620933Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7621166Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7621391Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7621625Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7621853Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7622076Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7622283Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7622489Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7622691Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7622912Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7623126Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7623331Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7623525Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7623724Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7623910Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7624041Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7624189Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7624298Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7624426Z E1204 10:51:46.158000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7624586Z [W1204 10:51:46.621477136 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7624590Z 2025-12-04T12:10:19.7624736Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7625054Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7625357Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7625491Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7625982Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7626251Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7626492Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7626703Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7626908Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7627140Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7627368Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7627602Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7627825Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7628076Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7628303Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7628531Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7628755Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7628983Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7629207Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7629421Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7629635Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7629840Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7630069Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7630337Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7630544Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7630783Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7631016Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7631238Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7631434Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7631654Z E1204 
10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7631857Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7632047Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7632290Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7632492Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7632684Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7632910Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7633140Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7633363Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7633607Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7633831Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7634035Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7634247Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7634450Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7634691Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7634913Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7635152Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7635375Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7635606Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7635828Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7636059Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7636281Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7636527Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7636748Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7636978Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7637204Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7637431Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7637655Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7637896Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7638119Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7638349Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7638573Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7638807Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7639037Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7639278Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7639497Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7639704Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7639902Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7640171Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7640395Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7640623Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7640860Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7641092Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7641314Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7641545Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7641768Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7642012Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7642234Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7642466Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7642686Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7642888Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7643079Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7643316Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7643541Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7643751Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.7643956Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7644185Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7644408Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7644607Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7644799Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7645034Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7645233Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7645427Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7645651Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7645884Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7646105Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7646348Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7646573Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7646772Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7646985Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7647186Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7647432Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7647655Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7647875Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7648079Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7648282Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7648515Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7648739Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7648971Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7649203Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7649437Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7649663Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7649895Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7650155Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7650386Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7650627Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7650831Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7651025Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7651250Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7651455Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7651670Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7651873Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7652118Z 
E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7652344Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7652575Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7652800Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7653028Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7653250Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7653493Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7653717Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7653947Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7654168Z 
E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7654373Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7654575Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7654782Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7654994Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7655199Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7655430Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7655652Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7655867Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7656068Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7656285Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7656515Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7656742Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7656975Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7657197Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7657427Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7657662Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7657895Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7658117Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7658348Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7658572Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7658802Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7659038Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7659268Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7659493Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7659724Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7659950Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7660234Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7660473Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7660706Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7660928Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7661133Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7661328Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7661550Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7661780Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7662017Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7662249Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7662469Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7662701Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7662925Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7663171Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7663399Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7663632Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7663856Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7664050Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7664274Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7664516Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7664749Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7664980Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7665202Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7665422Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7665626Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7665830Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7666034Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7666275Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7666501Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7666717Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7666923Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7667123Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7667337Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7667570Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7667793Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7668000Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7668201Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.7668398Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7668558Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7668792Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7668985Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7669208Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7669439Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7669664Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7669867Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7670059Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7670335Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7670538Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7670729Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7670956Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7671147Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7671375Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7671578Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7671804Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7671999Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7672221Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7672459Z E1204 10:51:46.160000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7672699Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7672935Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7673171Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7673402Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7673628Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7673858Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7674082Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7674312Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7674549Z E1204 10:51:46.160000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7674753Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7674947Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7675171Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7675385Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7675591Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7675807Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7676013Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7676243Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7676472Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7676708Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7676945Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7677143Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7677379Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7677611Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7677839Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7678070Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7678296Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7678511Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7678730Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7678931Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7679129Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7679327Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7679550Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7679785Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7680015Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7680280Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7680503Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7680697Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7680923Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7681168Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7681395Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7681641Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7681867Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7682059Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7682285Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7682520Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7682744Z E1204 10:51:46.160000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7682990Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7683211Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7683428Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7683632Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7683835Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7684042Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7684286Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7684512Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7684703Z E1204 10:51:46.160000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7684926Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7685157Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7685391Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7685622Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7685855Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7686071Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7686277Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7686480Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7686682Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7686913Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7687138Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7687377Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7687601Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7687830Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7688053Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7688246Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7688484Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7688718Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7688943Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7689175Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7689399Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7689637Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7689840Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7690054Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7690293Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7690525Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7690752Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7690980Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7691205Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7691456Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7691680Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7691913Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7692137Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7692368Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7692589Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7692803Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7693011Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7693204Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7693403Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7693619Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7693842Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7694041Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7694250Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7694444Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7694618Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7694747Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7694894Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7695002Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7695133Z E1204 10:51:46.160000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7695189Z ('RERUN', {'yellow': True}) [1.5987s] [100%] 2025-12-04T12:10:19.7695527Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:51:47.960212940 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7695540Z 2025-12-04T12:10:19.7695688Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7695986Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7696283Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7696417Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7696912Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7697169Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7697399Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7697606Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7697810Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7698049Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7698273Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7698510Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7698732Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7698962Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7699182Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7699412Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7699632Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7699873Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7700214Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7700414Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7700624Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7700823Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7701053Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7701286Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7701478Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7701699Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7701929Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7702151Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7702352Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7702575Z E1204 
10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7702786Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7702976Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7703197Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7703397Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7703588Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7703810Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7704057Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7704279Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7704508Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7704728Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7704928Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7705150Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7705363Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7705596Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7705817Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7706049Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7706271Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7706518Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7706743Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7706989Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7707212Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7707439Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7707663Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7707892Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7708115Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7708356Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7708577Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7708814Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7709036Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7709267Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7709488Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7709730Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7709955Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7710223Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7710445Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7710650Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7710868Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7711112Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7711336Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7711570Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7711790Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7712020Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7712241Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7712471Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7712703Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7712935Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7713158Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7713390Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7713613Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7713814Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7714021Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7714242Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7714443Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7714655Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.7714859Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7715102Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7715336Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7715538Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7715730Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7715953Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7716154Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7716343Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7716567Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7716807Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7717030Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7717259Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7717483Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7717686Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7717898Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7718115Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7718347Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7718575Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7718779Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7718982Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7719199Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7719430Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7719664Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7719893Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7720156Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7720390Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7720615Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7720846Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7721081Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7721316Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7721539Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7721743Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7721937Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7722162Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7722388Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7722589Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7722797Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7723026Z 
E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7723250Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7723502Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7723724Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7723970Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7724192Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7724427Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7724655Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7724889Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7725114Z 
E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7725329Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7725534Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7725728Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7725945Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7726147Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7726380Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7726620Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7726825Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7727030Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7727232Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7727464Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7727696Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7727928Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7728163Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7728391Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7728619Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7728849Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7729076Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7729307Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7729544Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7729781Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7730003Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7730273Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7730495Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7730729Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7730965Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7731198Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7731427Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7731658Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7731884Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7732100Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7732311Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7732539Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7732770Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7732998Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7733227Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7733452Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7733683Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7733924Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7734158Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7734379Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7734613Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7734834Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7735032Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7735265Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7735498Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7735724Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7735958Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7736185Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7736409Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7736625Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7736827Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7737031Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7737265Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7737487Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7737707Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7737910Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7738126Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7738331Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7738562Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7738789Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7738992Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7739195Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.7739404Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7739556Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7739779Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7739974Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7740235Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7740480Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7740705Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7740918Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7741115Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7741337Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7741540Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7741734Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7741956Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7742149Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7742385Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7742581Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7742805Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7742998Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7743224Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7743466Z E1204 10:51:47.499000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7743691Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7743920Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7744144Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7744373Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7744610Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7744847Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7745085Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7745318Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7745540Z E1204 10:51:47.499000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7745744Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7745937Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7746163Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7746379Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7746592Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7746797Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7747000Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7747235Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7747456Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7747700Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7747926Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7748121Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7748348Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7748578Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7748813Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7749043Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7749283Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7749503Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7749707Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7749912Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7750133Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7750330Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7750553Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7750798Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7751023Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7751254Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7751481Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7751675Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7751915Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7752146Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7752372Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7752606Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7752828Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7753025Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7753259Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7753505Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7753729Z E1204 10:51:47.499000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7753961Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7754189Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7754403Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7754613Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7754812Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7755036Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7755270Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7755491Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7755686Z E1204 10:51:47.499000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7755908Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7756153Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7756377Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7756610Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7756834Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7757049Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7757277Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7757478Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7757696Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7757926Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7758153Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7758388Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7758610Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7758843Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7759064Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7759272Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7759496Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7759729Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7759956Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7760216Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7760456Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7760673Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7760882Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7761082Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7761289Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7761533Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7761757Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7762001Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7762222Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7762455Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7762678Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7762910Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7763136Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7763378Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7763604Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7763804Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7764012Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7764204Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7764406Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7764634Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7764845Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7765047Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7765243Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7765440Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7765614Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7765759Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7765908Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7766025Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7766154Z E1204 10:51:47.499000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7766313Z [W1204 10:51:47.962484358 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7766317Z 2025-12-04T12:10:19.7766464Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7766765Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7767064Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7767197Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7767685Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7767955Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7768181Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7768390Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7768591Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7768832Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7769055Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7769285Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7769507Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7769736Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7769969Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7770235Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7770479Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7770708Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7770927Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7771130Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7771338Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7771540Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7771780Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7772004Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7772197Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7772416Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7772645Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7772865Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7773071Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7773292Z E1204 
10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7773494Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7773685Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7773903Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7774115Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7774305Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7774538Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7774766Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7774987Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7775216Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7775435Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7775634Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7775843Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7776056Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7776284Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7776508Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7776739Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7776959Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7777199Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7777419Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7777648Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7777869Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7778096Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7778328Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7778554Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7778788Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7779014Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7779235Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7779465Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7779684Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7779913Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7780179Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7780408Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7780628Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7780859Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7781085Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7781290Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7781501Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7781728Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7781949Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7782175Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7782398Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7782642Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7782874Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7783103Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7783324Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7783555Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7783774Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7784003Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7784225Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7784435Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7784629Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7784848Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7785050Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7785257Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.7785461Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7785707Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7785926Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7786126Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7786314Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7786534Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7786748Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7786940Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7787174Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7787403Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7787628Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7787860Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7788084Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7788282Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7788506Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7788712Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7788944Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7789171Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7789374Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7789577Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7789790Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7790026Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7790303Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7790532Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7790759Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7791007Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7791232Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7791474Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7791698Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7791932Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7792157Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7792365Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7792557Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7792797Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7793003Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7793205Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7793411Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7793640Z 
E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7793866Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7794108Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7794333Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7794571Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7794792Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7795024Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7795256Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7795488Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7795722Z 
E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7795930Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7796134Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7796328Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7796543Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7796749Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7796982Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7797216Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7797422Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7797626Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7797828Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7798063Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7798295Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7798528Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7798750Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7798984Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7799209Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7799448Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7799673Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7799913Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7800165Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7800396Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7800621Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7800854Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7801078Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7801327Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7801552Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7801785Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7802008Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7802239Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7802465Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7802687Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7802882Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7803106Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7803341Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7803564Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7803811Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7804049Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7804280Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7804506Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7804736Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7804962Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7805194Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7805417Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7805624Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7805848Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7806080Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7806302Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7806534Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7806759Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7806985Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7807192Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7807390Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7807596Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7807826Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7808061Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7808291Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7808494Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7808699Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7808902Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7809135Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7809359Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7809564Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7809777Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.7809974Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7810163Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7810388Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7810584Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7810808Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7811054Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7811283Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7811483Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7811679Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7811902Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7812121Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7812314Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7812555Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7812752Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7812973Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7813169Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7813392Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7813588Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7813808Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7814053Z E1204 10:51:47.501000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7814281Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7814510Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7814738Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7814973Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7815228Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7815458Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7815683Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7815917Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7816140Z E1204 10:51:47.501000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7816352Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7816542Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7816778Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7816993Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7817205Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7817408Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7817610Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7817843Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7818064Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7818313Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7818540Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7818733Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7818957Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7819186Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7819424Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7819656Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7819882Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7820130Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7820336Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7820542Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7820750Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7820956Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7821178Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7821410Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7821633Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7821863Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7822089Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7822278Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7822514Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7822745Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7822968Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7823198Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7823420Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7823614Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7823849Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7824082Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7824303Z E1204 10:51:47.501000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7824532Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7824752Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7824986Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7825202Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7825402Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7825607Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7825836Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7826059Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7826251Z E1204 10:51:47.501000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7826476Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7826717Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7826939Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7827168Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7827390Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7827607Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7827822Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7828022Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7828227Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7828460Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7828684Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7828913Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7829146Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7829388Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7829607Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7829799Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7830020Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7830291Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7830514Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7830747Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7830983Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7831199Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7831403Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7831602Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7831805Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7832046Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7832270Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7832501Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7832722Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7832955Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7833188Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7833419Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7833661Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7833892Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7834115Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7834316Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7834521Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7834713Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7834911Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7835138Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7835349Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7835550Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7835742Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7835936Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7836110Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7836250Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7836398Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7836503Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7836630Z E1204 10:51:47.501000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7836790Z [W1204 10:51:47.964571649 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7836793Z 2025-12-04T12:10:19.7836939Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7837248Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7837545Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7837686Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7838170Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7838427Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7838651Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7838860Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7839072Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7839304Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7839526Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7839758Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7839979Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7840328Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7840644Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7840892Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7841113Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7841340Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7841565Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7841781Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7841992Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7842221Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7842450Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7842673Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7842863Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7843085Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7843317Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7843553Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7843747Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7843966Z E1204 
10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7844169Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7845211Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7845693Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7846478Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7846703Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7846958Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7847204Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7847448Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7847743Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7847980Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7848260Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7848487Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7848714Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7848958Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7849196Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7849439Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7849680Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7849972Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7850262Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7850508Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7850744Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7850993Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7851386Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7851632Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7851874Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7852117Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7852354Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7852619Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7852861Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7853136Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7853372Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7853620Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7853856Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7854105Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7854340Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7854583Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7854800Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7855044Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7855284Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7855527Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7855767Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7856023Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7856266Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7856515Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7856751Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7856999Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7857245Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7857506Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7857741Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7857963Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7858172Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7858412Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7858635Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7858862Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.7859186Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7859431Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7859672Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7859893Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7860136Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7860378Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7860605Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7860816Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7861052Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7861304Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7861543Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7861800Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7862041Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7862292Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7862521Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7862737Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7862990Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7863232Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7863454Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7863674Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7863911Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7864163Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7864401Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7864653Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7864894Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7865149Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7865392Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7865635Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7865878Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7866124Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7866375Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7866597Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7866816Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7867056Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7867277Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7867500Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7867717Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7867968Z 
E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7868213Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7868471Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7868714Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7868959Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7869202Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7869449Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7869705Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7869954Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7870226Z 
E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7870451Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7870669Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7870897Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7871125Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7871359Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7871609Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7871849Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7872075Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7872292Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7872516Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7872761Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7873017Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7873267Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7873505Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7873754Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7873990Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7874251Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7874496Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7874742Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7874983Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7875228Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7875478Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7875724Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7875977Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7876230Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7876469Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7876716Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7876955Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7877205Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7877454Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7877676Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7877888Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7878129Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7878376Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7878612Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7878875Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7879112Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7879361Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7879602Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7879849Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7880153Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7880412Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7880657Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7880865Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7881107Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7881356Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7881593Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7881842Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7882092Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7882328Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7882553Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7882773Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7882999Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7883245Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7883500Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7883730Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7883955Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7884174Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7884393Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7884653Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7884899Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7885124Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7885342Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.7885554Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7885732Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7885976Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7886188Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7886425Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7886685Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7886924Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7887143Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7887349Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7887591Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7887824Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7888032Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7888275Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7888482Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7888725Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7888932Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7889182Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7889404Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7889641Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7889890Z E1204 10:51:47.503000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7890181Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7890432Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7890667Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7890918Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7891173Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7891418Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7891658Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7891902Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7892143Z E1204 10:51:47.503000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7892379Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7892589Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7892830Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7893060Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7893285Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7893500Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7893736Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7893997Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7894232Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7894483Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7894721Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7894934Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7895174Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7895425Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7895680Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7895925Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7896168Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7896401Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7896625Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7896841Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7897074Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7897287Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7901861Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7902122Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7902365Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7902641Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7902895Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7903108Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7903351Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7903599Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7903838Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7904084Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7904327Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7904547Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7904792Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7905042Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7905279Z E1204 10:51:47.503000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7905527Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7905765Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7906032Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7906253Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7906472Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7906696Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7906940Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7907190Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7907407Z E1204 10:51:47.503000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7907648Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7907895Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7908138Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7908387Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7908624Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7908860Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7909092Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7909314Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7909530Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7909781Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7910021Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7910298Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7910554Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7910799Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7911042Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7911251Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7911494Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7911758Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7912012Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7912260Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7912497Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7912732Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7912952Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7913175Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7913397Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7913654Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7913901Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7914144Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7914386Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7914634Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7914886Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7915135Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7915372Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7915620Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7915857Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7916078Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7916314Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7916533Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7916752Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7916985Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7917214Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7917432Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7917648Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7917860Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7918071Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7918221Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7918386Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7918516Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7918664Z E1204 10:51:47.503000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7918847Z [W1204 10:51:47.009276611 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7918851Z 2025-12-04T12:10:19.7919016Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7919355Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7919677Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.7919829Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.7920401Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.7920689Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.7920940Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.7921187Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.7921408Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7921661Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7921903Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7922152Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7922390Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7922640Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7923152Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7923397Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7923641Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7923884Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7924126Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7924353Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7924586Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7924809Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7925054Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7925294Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7925514Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7925754Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7926012Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7926253Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7926465Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7926701Z E1204 
10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7926918Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7927124Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7927366Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7927595Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7927804Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7928043Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7928284Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7928520Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7928772Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7929008Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7929222Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7929446Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7929661Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7929902Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7930197Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7930450Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7930685Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7930928Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7931163Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7931405Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7931639Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7931883Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7932131Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7932375Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7932607Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7932850Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7933085Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7933345Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7933579Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7933821Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7934056Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7934299Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7934547Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7934790Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7935040Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7935257Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7935468Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.7935716Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7935948Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7936191Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7936439Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7936680Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7936916Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7937157Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7937394Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7937642Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7937904Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7938161Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7938405Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7938632Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7938846Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7939102Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7939323Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7939568Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.7939791Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7940040Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7940326Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7940546Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7940758Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7940996Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7941228Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7941438Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7941677Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7941929Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7942168Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7942429Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7942669Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7942889Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7943121Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7943342Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7943597Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7943850Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7944087Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7944308Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7944531Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7944784Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7945026Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7945278Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7945522Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7945784Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7946029Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7946280Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7946526Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7946777Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7947028Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7947251Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7947470Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7947722Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7947958Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7948188Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7948432Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7948708Z 
E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7948958Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7949219Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7949471Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7949729Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7949980Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7950298Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7950566Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7950827Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7951079Z 
E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7951311Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7951540Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7951774Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7952015Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.7952247Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7952506Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7952758Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7952992Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7953238Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7953625Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7953884Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7954137Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7954396Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7954650Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7954913Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7955165Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7955440Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7955693Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7955955Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7956206Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7956467Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7956728Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7956987Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7957239Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7957503Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7957761Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7958034Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7958295Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7958579Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7958835Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7959069Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7959293Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7959550Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7959817Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7960084Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7960414Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7960670Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7960936Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7961191Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7961462Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7961734Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7961999Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7962257Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7962482Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7962739Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7963017Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7963276Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7963557Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7963813Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7964065Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7964303Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7964620Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7964856Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7965143Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7965400Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7965650Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7965889Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7966123Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7966361Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7966638Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7966899Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7967136Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7967367Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.7967595Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7967775Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.7968043Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7968279Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7968537Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7968809Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7969067Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7969301Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7969527Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7969785Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7970030Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7970311Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7970572Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7970796Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7971053Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7971278Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7971559Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7971784Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7972044Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7972312Z E1204 10:51:47.548000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7972570Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7972853Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7973124Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7973391Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7973649Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7973919Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7974178Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7974445Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7974703Z E1204 10:51:47.548000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7974954Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.7975179Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7975436Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7975689Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7975927Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7976161Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7976411Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7976679Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7976938Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7977203Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7977463Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7977699Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7977983Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7978248Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7978507Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7978775Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7979031Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7979286Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7979526Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7979769Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7979996Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.7980255Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7980513Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7980776Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7981034Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7981315Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7981572Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.7981807Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7982068Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7982341Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7982619Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7982903Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7983167Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7983395Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7983658Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7983928Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7984195Z E1204 10:51:47.548000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7984471Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7984746Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7985003Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7985243Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7985486Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7985725Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7986000Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7986277Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7986503Z E1204 10:51:47.548000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7986768Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7987037Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7987304Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7987581Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7987858Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7988116Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7988357Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7988599Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7988838Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7989109Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7989368Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7989661Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7989926Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7990235Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7990503Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7990731Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.7990999Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7991283Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7991546Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7991820Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7992079Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7992338Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.7992592Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.7992848Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.7993088Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.7993363Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.7993630Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7993901Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7994163Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7994430Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7994708Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7994977Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7995243Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7995516Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.7995775Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.7996016Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.7996271Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.7996506Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.7996738Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.7996997Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.7997249Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.7997499Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.7997745Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.7997976Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.7998189Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.7998348Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.7998534Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.7998672Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.7998828Z E1204 10:51:47.548000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.7999028Z [W1204 10:51:47.011340332 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.7999032Z 2025-12-04T12:10:19.7999210Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.7999569Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.7999912Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8000082Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8000681Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8000991Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8001265Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8001509Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8001753Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8002020Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8002286Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8002574Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8002847Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8003120Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8003378Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8003653Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8003911Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8004182Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8004444Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8004689Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8004943Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8005181Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8005451Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8005710Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8005949Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8006209Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8006475Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8006735Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8006959Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8007220Z E1204 
10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8007462Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8007709Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8007969Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8008205Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8008434Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8008691Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8008961Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8009217Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8009500Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8009761Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8009994Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8010301Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8010537Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8010808Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8011084Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8011354Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8011614Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8011884Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8012145Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8012426Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8012701Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8012970Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8013229Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8013499Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8013758Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8014028Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8014285Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8014569Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8014831Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8015097Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8015359Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8015627Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8015897Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8016167Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8016430Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8016671Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8016904Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.8017185Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8017441Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8017722Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8017978Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8018249Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8018509Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8018774Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8019035Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8019310Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8019571Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8019835Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8020095Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8020375Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8020599Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8020872Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8021105Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8021353Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.8021592Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8021865Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8022139Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8022372Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8022612Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8022866Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8023103Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8023327Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8023585Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8023858Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8024115Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8024401Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8024658Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8024893Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8025138Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8025376Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8025651Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8025920Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8026163Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8026397Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8026639Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8026910Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8027192Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8027474Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8027733Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8028007Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8028268Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8028541Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8028803Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8029077Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8029354Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8029594Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8029828Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8030127Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8030373Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8030611Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8030869Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8031143Z 
E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8031403Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8031677Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8031939Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8032230Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8032508Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8032775Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8033042Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8033310Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8033574Z 
E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8033815Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8034058Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8034307Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8034556Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8034801Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8035070Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8035336Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8035576Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8035830Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8036073Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8036342Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8036608Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8036877Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8037154Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8037433Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8037698Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8037972Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8038233Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8038505Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8038765Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8039037Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8039307Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8039582Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8039848Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8040154Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8040420Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8040700Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8040966Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8041234Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8041498Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8041741Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8041983Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8042249Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8042530Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8042796Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8043064Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8043331Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8043605Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8043866Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8044155Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8044416Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8044704Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8044972Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8045200Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8045467Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8045756Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8046022Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8046291Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8046556Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8046816Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8047067Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8047310Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8047559Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8047832Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8048093Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8048351Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8048597Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8048834Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8049087Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8049355Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8049619Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8049860Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8050157Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.8050393Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8050574Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.8050851Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8051079Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8051340Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8051608Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8051874Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8052129Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8052370Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8052631Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8052868Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8053099Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8053361Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8053593Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8053857Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8054096Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8054362Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8054589Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8054854Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8055122Z E1204 10:51:47.550000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8055387Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8055669Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8055930Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8056206Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8056464Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8056738Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8057008Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8057292Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8057556Z E1204 10:51:47.550000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8057795Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8058026Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8058286Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8058546Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8058788Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8059042Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8059285Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8059553Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8059817Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8060087Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8060397Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8060635Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8060900Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8061174Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8061434Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8061711Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8062306Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8062578Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8062818Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8063062Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8063298Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8063524Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8063788Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8064056Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8064343Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8064613Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8064879Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.8065111Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8065370Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8065645Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8065917Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8066192Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8066454Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8066688Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8066954Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8067234Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8067510Z E1204 10:51:47.550000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8067778Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8068043Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8068301Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8068542Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8068785Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8069022Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8069308Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8069569Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8069802Z E1204 10:51:47.550000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8070067Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8070380Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8070645Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8070931Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8071193Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8071445Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8071691Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8071934Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8072185Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8072459Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8072733Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8073007Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8073267Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8073540Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8073806Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8074032Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8074306Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8074572Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8074835Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8075104Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8075366Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8075620Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8075869Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8076108Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8076346Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8076617Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8076875Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8077592Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8077867Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8078135Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8078398Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8078664Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8078926Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8079193Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8079455Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8079703Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.8079942Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.8080223Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.8080456Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.8080716Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.8080961Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.8081218Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.8081451Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.8081681Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.8081893Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.8082051Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.8082234Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.8082381Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.8082544Z E1204 10:51:47.550000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.8082755Z [W1204 10:51:47.013400683 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.8082758Z 2025-12-04T12:10:19.8082940Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8083287Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.8083630Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8083798Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8084337Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8084654Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8084925Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8085168Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8085407Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8085672Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8085935Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8086212Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8086474Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8086741Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8086999Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8087269Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8087538Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8087816Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8088072Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8088309Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8088558Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8088795Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8089064Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8089320Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8089558Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8089815Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8090084Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8090385Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8090608Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8090868Z E1204 
10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8091118Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8091348Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8091603Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8091839Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8092064Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8092333Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8092614Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8092868Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8093137Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8093393Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8093633Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8093884Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8094118Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8094401Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8094657Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8094925Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8095181Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8095448Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8095707Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8095982Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8096240Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8096508Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8096767Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8097034Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8097300Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8097577Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8097831Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8098099Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8098355Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8098623Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8098884Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8099152Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8099426Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8099693Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8099953Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8100227Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8100464Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.8100750Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8101009Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8101281Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8101539Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8101814Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8102092Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8102364Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8102641Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8102908Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8103171Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8103436Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8103697Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8103930Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8104160Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8104440Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8104675Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8104927Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.8105164Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8105436Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8105705Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8105943Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8106173Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8106430Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8106668Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8106893Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8107177Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8107453Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8107714Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8107989Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8108250Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8108488Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8108737Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8108979Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8109259Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8109526Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8109772Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8110009Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8110289Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8110561Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8110839Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8111107Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8111372Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8111644Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8111909Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8112196Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8112471Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8112744Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8113005Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8113248Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8113478Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8113737Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8113981Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8114232Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8114476Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8114745Z 
E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8115010Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8115282Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8115542Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8115826Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8116088Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8116361Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8116622Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8116895Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8117170Z 
E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8117419Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8117660Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8117890Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8118144Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8118382Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8118656Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8118918Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8119171Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8119413Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8119650Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8119925Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8120207Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8120479Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8120765Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8121033Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8121301Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8121570Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8121832Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8122114Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8122386Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8122657Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8122920Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8123194Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8123453Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8123727Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8123991Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8124274Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8124539Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8124807Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8125069Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8125305Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8125558Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8125825Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8126093Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8126357Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8126624Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8126900Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8127169Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8127444Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8127714Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8127974Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8128247Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8128506Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8128739Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8129009Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8129279Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8129546Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8129814Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8130078Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8130377Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8130636Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8130876Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8131115Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8131388Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8131649Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8131918Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8132157Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8132411Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8132663Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8132931Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8133196Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8133434Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8133674Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.8133901Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8134099Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.8134358Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8134591Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8134854Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8135118Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8135381Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8135628Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8135859Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8136121Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8136361Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8136592Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8136863Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8137091Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8137360Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8137592Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8137852Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8138084Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8138348Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8138615Z E1204 10:51:47.552000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8138901Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8139169Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8139432Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8139704Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8139964Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8140274Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8140553Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8140823Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8141083Z E1204 10:51:47.552000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8141324Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8141559Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8141830Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8142086Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8142339Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8142582Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8142820Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8143096Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8143360Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8143627Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8143909Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8144137Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8144401Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8144669Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8144932Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8145207Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8145476Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8145733Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8145973Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8146215Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8146444Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8146684Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8146950Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8147230Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8147492Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8147761Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8148027Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.8148252Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8148516Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8148798Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8149056Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8149334Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8149597Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8149830Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8150132Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8150419Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8150684Z E1204 10:51:47.552000 
584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8150952Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8151236Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8151489Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8151747Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8151984Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8152244Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8152517Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8152779Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8153011Z E1204 10:51:47.552000 584582 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8153270Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8153543Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8153822Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8154097Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8154363Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8154615Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8154868Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8155105Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8155362Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8155631Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8155900Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8156174Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8156433Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8156719Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8156978Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8157238Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8157503Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8157774Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8158039Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8158306Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8158569Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8158834Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8159080Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8159321Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8159559Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8159829Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8160134Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8160421Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8160680Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8160954Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8161220Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8161485Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8161763Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8162043Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8162311Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8162547Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.8162794Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.8163028Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.8163261Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.8163518Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.8163778Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.8164020Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.8164249Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.8164483Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.8164693Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.8164851Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0
2025-12-04T12:10:19.8165033Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0
2025-12-04T12:10:19.8165175Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] .
2025-12-04T12:10:19.8165337Z E1204 10:51:47.552000 584582 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice.
2025-12-04T12:10:19.8165406Z FAILED [1.7230s] [100%]
2025-12-04T12:10:19.8165410Z
2025-12-04T12:10:19.8165499Z ==================================== RERUNS ====================================
2025-12-04T12:10:19.8165681Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _
2025-12-04T12:10:19.8165759Z Traceback (most recent call last):
2025-12-04T12:10:19.8165964Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper
2025-12-04T12:10:19.8166041Z method(*args, **kwargs)
2025-12-04T12:10:19.8166227Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper
2025-12-04T12:10:19.8166296Z method(*args, **kwargs)
2025-12-04T12:10:19.8166489Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper
2025-12-04T12:10:19.8166558Z with policy():
2025-12-04T12:10:19.8166742Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__
2025-12-04T12:10:19.8166829Z raise RuntimeError(msg)
2025-12-04T12:10:19.8167286Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 2051014656.
2025-12-04T12:10:19.8167290Z
2025-12-04T12:10:19.8167396Z To execute this test, run the following from the base repo dir:
2025-12-04T12:10:19.8167710Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda
2025-12-04T12:10:19.8167714Z
2025-12-04T12:10:19.8167835Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0
2025-12-04T12:10:19.8167950Z ----------------------------- Captured stdout call -----------------------------
2025-12-04T12:10:19.8168020Z frames [('total', 1), ('ok', 1)]
2025-12-04T12:10:19.8168112Z stats [('calls_captured', 1), ('unique_graphs', 1)]
2025-12-04T12:10:19.8168675Z inductor [('triton_bundler_save_kernel', 304), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)]
2025-12-04T12:10:19.8168830Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)]
2025-12-04T12:10:19.8168900Z graph_break []
2025-12-04T12:10:19.8168996Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)]
2025-12-04T12:10:19.8169105Z ----------------------------- Captured stderr call -----------------------------
2025-12-04T12:10:19.8169656Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8169739Z current_size = base.storage().size() 2025-12-04T12:10:19.8169806Z Autotune Choices Stats: 2025-12-04T12:10:19.8170416Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "_scaled_mm", "best_time": 0.014800000004470348, "best_triton_pos": 1, "best_triton_time": 0.01587899960577488, "best_triton_kernel": "triton_mm_35", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.8170518Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8170602Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8170757Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8170829Z _scaled_mm 0.0148 ms 100.0% 2025-12-04T12:10:19.8171118Z triton_mm_35 0.0159 ms 93.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8171390Z triton_mm_15 0.0160 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8171675Z triton_mm_14 0.0175 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8171958Z triton_mm_34 0.0176 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8172228Z triton_mm_13 0.0176 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8172495Z triton_mm_31 0.0185 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8172767Z triton_mm_33 0.0188 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8173035Z triton_mm_16 0.0198 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8173300Z triton_mm_32 0.0198 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8173487Z SingleProcess AUTOTUNE benchmarking takes 0.1955 seconds and 8.2777 seconds precompiling for 33 choices 2025-12-04T12:10:19.8173669Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8173748Z Traceback (most recent call last): 2025-12-04T12:10:19.8173941Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8174016Z method(*args, **kwargs) 2025-12-04T12:10:19.8174202Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8174273Z method(*args, **kwargs) 2025-12-04T12:10:19.8174456Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8174523Z with policy(): 2025-12-04T12:10:19.8174708Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8174781Z raise RuntimeError(msg) 2025-12-04T12:10:19.8175247Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2051014656 and is now 3015704576. 2025-12-04T12:10:19.8175251Z 2025-12-04T12:10:19.8175355Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8175664Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8175667Z 2025-12-04T12:10:19.8175784Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8175891Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8175960Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8176049Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8176622Z inductor [('triton_bundler_save_kernel', 304), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), 
('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8176766Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8176835Z graph_break [] 2025-12-04T12:10:19.8176926Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.8177032Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8177581Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8177660Z current_size = base.storage().size() 2025-12-04T12:10:19.8177725Z Autotune Choices Stats: 2025-12-04T12:10:19.8178265Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "_scaled_mm", "best_time": 0.014800000004470348, "best_triton_pos": 1, "best_triton_time": 0.01587899960577488, "best_triton_kernel": "triton_mm_35", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.8178373Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8178453Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8178611Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8178680Z _scaled_mm 0.0148 ms 100.0% 2025-12-04T12:10:19.8178962Z triton_mm_35 0.0159 ms 93.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8179232Z triton_mm_15 0.0160 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8179503Z triton_mm_14 0.0175 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8179782Z triton_mm_34 0.0176 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8180054Z triton_mm_13 0.0176 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8180385Z triton_mm_31 0.0185 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8180655Z triton_mm_33 0.0188 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8180937Z triton_mm_16 0.0198 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8181203Z triton_mm_32 0.0198 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8181385Z SingleProcess AUTOTUNE benchmarking takes 0.1955 seconds and 8.2777 seconds precompiling for 33 choices 2025-12-04T12:10:19.8181485Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8181557Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8181639Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8181771Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8182298Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8182362Z graph_break [] 2025-12-04T12:10:19.8182461Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.8182560Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8182984Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:19.8183121Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:19.8183193Z Autotune Choices Stats: 2025-12-04T12:10:19.8183722Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.013799999840557575, "best_triton_pos": 1, "best_triton_time": 0.015519999898970127, "best_triton_kernel": "triton_mm_73", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.8183830Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8183907Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8184065Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8184139Z _scaled_mm 0.0138 ms 100.0% 2025-12-04T12:10:19.8184428Z triton_mm_73 0.0155 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8184701Z triton_mm_53 0.0161 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8184967Z triton_mm_51 0.0173 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8185239Z triton_mm_52 0.0175 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:19.8185516Z triton_mm_72 0.0175 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8185788Z triton_mm_71 0.0186 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8186068Z triton_mm_69 0.0186 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8186334Z triton_mm_54 0.0196 ms 70.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8186606Z triton_mm_70 0.0198 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8186768Z SingleProcess AUTOTUNE benchmarking takes 0.2912 seconds and 0.7955 seconds precompiling for 39 choices 2025-12-04T12:10:19.8186854Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8187032Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8187109Z Traceback (most recent call last): 2025-12-04T12:10:19.8187302Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8187389Z method(*args, **kwargs) 2025-12-04T12:10:19.8187582Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8187649Z method(*args, **kwargs) 2025-12-04T12:10:19.8187840Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8187904Z with policy(): 2025-12-04T12:10:19.8188095Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8188163Z raise RuntimeError(msg) 2025-12-04T12:10:19.8188618Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 3015704576 and is now 3902799872. 2025-12-04T12:10:19.8188623Z 2025-12-04T12:10:19.8188725Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8189047Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8189052Z 2025-12-04T12:10:19.8189169Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8189272Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8189341Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8189428Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8189987Z inductor [('triton_bundler_save_kernel', 304), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), 
('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8190156Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8190225Z graph_break [] 2025-12-04T12:10:19.8190331Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.8190438Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8191006Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8191084Z current_size = base.storage().size() 2025-12-04T12:10:19.8191148Z Autotune Choices Stats: 2025-12-04T12:10:19.8191690Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "_scaled_mm", "best_time": 0.014800000004470348, "best_triton_pos": 1, "best_triton_time": 0.01587899960577488, "best_triton_kernel": "triton_mm_35", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.8191794Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8191869Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8192026Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8192091Z _scaled_mm 0.0148 ms 100.0% 2025-12-04T12:10:19.8192373Z triton_mm_35 0.0159 ms 93.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8192657Z triton_mm_15 0.0160 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8192927Z triton_mm_14 0.0175 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8193194Z triton_mm_34 0.0176 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8193464Z triton_mm_13 0.0176 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8193756Z triton_mm_31 0.0185 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8194023Z triton_mm_33 0.0188 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8194291Z triton_mm_16 0.0198 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8194556Z triton_mm_32 0.0198 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8194724Z SingleProcess AUTOTUNE benchmarking takes 0.1955 seconds and 8.2777 seconds precompiling for 33 choices 2025-12-04T12:10:19.8194825Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8194908Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8194997Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8195136Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8195654Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8195717Z graph_break [] 2025-12-04T12:10:19.8195813Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.8195913Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8196336Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:19.8196458Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:19.8196527Z Autotune Choices Stats: 2025-12-04T12:10:19.8197055Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.013799999840557575, "best_triton_pos": 1, "best_triton_time": 0.015519999898970127, "best_triton_kernel": "triton_mm_73", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:19.8197164Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8197243Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8197394Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8197463Z _scaled_mm 0.0138 ms 100.0% 2025-12-04T12:10:19.8197736Z triton_mm_73 0.0155 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8198006Z triton_mm_53 0.0161 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8198271Z triton_mm_51 0.0173 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8198549Z triton_mm_52 0.0175 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:19.8198820Z triton_mm_72 0.0175 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8199085Z triton_mm_71 0.0186 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8199352Z triton_mm_69 0.0186 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8199626Z triton_mm_54 0.0196 ms 70.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8199904Z triton_mm_70 0.0198 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8200063Z SingleProcess AUTOTUNE benchmarking takes 0.2912 seconds and 0.7955 seconds precompiling for 39 choices 2025-12-04T12:10:19.8200220Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8200286Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8200370Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8200502Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8201053Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), 
('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8201117Z graph_break [] 2025-12-04T12:10:19.8201206Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:19.8201308Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8201390Z Autotune Choices Stats: 2025-12-04T12:10:19.8201822Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_111", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.015960000455379486, "best_triton_pos": 0} 2025-12-04T12:10:19.8201918Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8201996Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8202145Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8202428Z triton_mm_111 0.0160 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8202697Z triton_mm_91 0.0164 ms 97.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8202976Z triton_mm_110 0.0172 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8203246Z triton_mm_90 0.0175 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8203510Z triton_mm_89 0.0175 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8203782Z triton_mm_107 0.0185 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8204068Z triton_mm_109 0.0188 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8204332Z triton_mm_92 0.0194 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8204619Z triton_mm_108 0.0199 ms 80.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8204885Z triton_mm_106 0.0217 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8205049Z SingleProcess AUTOTUNE benchmarking takes 0.2935 seconds and 0.6451 seconds precompiling for 39 choices 2025-12-04T12:10:19.8205280Z - 
generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e6dc9d8fc1e220b8.xml - 2025-12-04T12:10:19.8205371Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8206040Z FAILED [1.7230s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 3015704576 and is now 3902799872. 2025-12-04T12:10:19.8206055Z 2025-12-04T12:10:19.8206156Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8206464Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8206467Z 2025-12-04T12:10:19.8206581Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8206672Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.8206771Z ================= 1 failed, 187 deselected, 2 rerun in 13.83s ================== 2025-12-04T12:10:19.8206835Z Got exit code 1 2025-12-04T12:10:19.8207079Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8207242Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:19.8207423Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-dbbfe187c25018be.xml 2025-12-04T12:10:19.8207517Z ============================= test session starts ============================== 2025-12-04T12:10:19.8207665Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8207730Z cachedir: .pytest_cache 2025-12-04T12:10:19.8207925Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8207997Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8208065Z configfile: pytest.ini 2025-12-04T12:10:19.8208265Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8208374Z collecting ... collected 188 items / 79 deselected / 109 selected 2025-12-04T12:10:19.8208451Z stepcurrent: skipping 79 already run items. 
2025-12-04T12:10:19.8208523Z Running 109 items in this shard 2025-12-04T12:10:19.8208526Z 2025-12-04T12:10:19.8208790Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5649s] [ 0%] 2025-12-04T12:10:19.8209042Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2471s] [ 0%] 2025-12-04T12:10:19.8209275Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2189s] [ 0%] 2025-12-04T12:10:19.8209277Z 2025-12-04T12:10:19.8209357Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.8209533Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8209603Z Traceback (most recent call last): 2025-12-04T12:10:19.8209799Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8209864Z method(*args, **kwargs) 2025-12-04T12:10:19.8210053Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8210167Z method(*args, **kwargs) 2025-12-04T12:10:19.8210353Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8210414Z with policy(): 2025-12-04T12:10:19.8210603Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8210691Z raise RuntimeError(msg) 2025-12-04T12:10:19.8211134Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:19.8211137Z 2025-12-04T12:10:19.8211238Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8211539Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8211542Z 2025-12-04T12:10:19.8211660Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8211759Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8211830Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8211911Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8212008Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8212136Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8212213Z graph_break [] 2025-12-04T12:10:19.8212302Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8212478Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8212548Z Traceback (most recent call last): 2025-12-04T12:10:19.8212738Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8212803Z method(*args, **kwargs) 2025-12-04T12:10:19.8212990Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8213054Z method(*args, **kwargs) 2025-12-04T12:10:19.8213238Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8213298Z with policy(): 2025-12-04T12:10:19.8213487Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8213565Z raise RuntimeError(msg) 2025-12-04T12:10:19.8214006Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:19.8214024Z 2025-12-04T12:10:19.8214124Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8214426Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8214430Z 2025-12-04T12:10:19.8214547Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8214647Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8214717Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8214798Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8214892Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8215020Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8215084Z graph_break [] 2025-12-04T12:10:19.8215170Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8215273Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8215349Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8215433Z 
stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8215557Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8217499Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8217560Z graph_break [] 2025-12-04T12:10:19.8217648Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8217725Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8217899Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8217968Z Traceback (most recent call last): 2025-12-04T12:10:19.8218155Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8218218Z method(*args, **kwargs) 2025-12-04T12:10:19.8218403Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8218465Z method(*args, **kwargs) 2025-12-04T12:10:19.8218646Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8218723Z with policy(): 2025-12-04T12:10:19.8218910Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8218972Z raise RuntimeError(msg) 2025-12-04T12:10:19.8219411Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:19.8219414Z 2025-12-04T12:10:19.8219512Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8219810Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8219813Z 2025-12-04T12:10:19.8219926Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8220035Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8220177Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8220276Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8220365Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8220489Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8220550Z graph_break [] 2025-12-04T12:10:19.8220634Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8220734Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8220797Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8220876Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8220999Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8221089Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8221148Z graph_break [] 2025-12-04T12:10:19.8221232Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8221331Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8221395Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8221471Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8221593Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8221679Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8221755Z graph_break [] 2025-12-04T12:10:19.8221837Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8222067Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-dbbfe187c25018be.xml - 2025-12-04T12:10:19.8222153Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8222799Z FAILED [0.2189s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:19.8222802Z 2025-12-04T12:10:19.8222899Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8223195Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8223197Z 2025-12-04T12:10:19.8223325Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8223411Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.8223504Z ================== 1 failed, 79 deselected, 2 rerun in 2.05s =================== 2025-12-04T12:10:19.8223564Z Got exit code 1 2025-12-04T12:10:19.8223627Z Retrying single test... 
2025-12-04T12:10:19.8223802Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-01bb6806e1e7e41b.xml 2025-12-04T12:10:19.8223884Z ============================= test session starts ============================== 2025-12-04T12:10:19.8224025Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8224089Z cachedir: .pytest_cache 2025-12-04T12:10:19.8224279Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8224348Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8224411Z configfile: pytest.ini 2025-12-04T12:10:19.8224623Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8224724Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.8225028Z stepcurrent: skipping 79 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8225095Z Running 1 items in this shard 2025-12-04T12:10:19.8225098Z 2025-12-04T12:10:19.8225345Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5646s] [100%] 2025-12-04T12:10:19.8225591Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2485s] [100%] 2025-12-04T12:10:19.8225811Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2178s] [100%] 2025-12-04T12:10:19.8225815Z 2025-12-04T12:10:19.8225891Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.8226061Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8226128Z Traceback (most recent call last): 2025-12-04T12:10:19.8226316Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8226389Z method(*args, **kwargs) 2025-12-04T12:10:19.8226572Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8226633Z method(*args, **kwargs) 2025-12-04T12:10:19.8226815Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8226874Z with policy(): 2025-12-04T12:10:19.8227056Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8227119Z raise RuntimeError(msg) 
2025-12-04T12:10:19.8227556Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:19.8227560Z 2025-12-04T12:10:19.8227657Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8227965Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8227967Z 2025-12-04T12:10:19.8228082Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8228179Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8228244Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8228322Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8228411Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8228535Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8228595Z graph_break [] 2025-12-04T12:10:19.8228677Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8228846Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8228912Z Traceback (most recent call last): 2025-12-04T12:10:19.8229096Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8229170Z method(*args, **kwargs) 2025-12-04T12:10:19.8229352Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:19.8229433Z method(*args, **kwargs) 2025-12-04T12:10:19.8229612Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8229670Z with policy(): 2025-12-04T12:10:19.8229852Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8229914Z raise RuntimeError(msg) 2025-12-04T12:10:19.8230390Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:19.8230393Z 2025-12-04T12:10:19.8230490Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8230784Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8230787Z 2025-12-04T12:10:19.8230897Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8230994Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8231076Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8231154Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8231242Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8231366Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8231425Z graph_break [] 2025-12-04T12:10:19.8231509Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8231606Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:19.8231671Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8231748Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8231870Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8231956Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8232015Z graph_break [] 2025-12-04T12:10:19.8232097Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8232171Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8232341Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8232421Z Traceback (most recent call last): 2025-12-04T12:10:19.8232608Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8232669Z method(*args, **kwargs) 2025-12-04T12:10:19.8232854Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8232914Z method(*args, **kwargs) 2025-12-04T12:10:19.8233095Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8233153Z with policy(): 2025-12-04T12:10:19.8233336Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8233398Z raise RuntimeError(msg) 2025-12-04T12:10:19.8233844Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:19.8233847Z 2025-12-04T12:10:19.8233961Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8234253Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8234255Z 2025-12-04T12:10:19.8234366Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8234463Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8234527Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8234604Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8234691Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8234815Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8234873Z graph_break [] 2025-12-04T12:10:19.8234955Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8235053Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8235115Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8235194Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8235315Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8235414Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8235470Z graph_break [] 2025-12-04T12:10:19.8235552Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8235651Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8235714Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8235792Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8235916Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8236002Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8236061Z graph_break [] 2025-12-04T12:10:19.8236141Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8236365Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-01bb6806e1e7e41b.xml - 2025-12-04T12:10:19.8236447Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8237097Z FAILED [0.2178s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:19.8237101Z 2025-12-04T12:10:19.8237200Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8237491Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8237493Z 2025-12-04T12:10:19.8237604Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8237690Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.8237783Z ================== 1 failed, 187 deselected, 2 rerun in 2.05s ================== 2025-12-04T12:10:19.8237842Z Got exit code 1 2025-12-04T12:10:19.8237904Z Retrying single test... 
2025-12-04T12:10:19.8238091Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6b1a9ae2a4706be8.xml 2025-12-04T12:10:19.8238171Z ============================= test session starts ============================== 2025-12-04T12:10:19.8238323Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8238384Z cachedir: .pytest_cache 2025-12-04T12:10:19.8238573Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8238641Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8238705Z configfile: pytest.ini 2025-12-04T12:10:19.8238899Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8238999Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.8239288Z stepcurrent: skipping 79 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8239354Z Running 1 items in this shard 2025-12-04T12:10:19.8239356Z 2025-12-04T12:10:19.8239604Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5673s] [100%] 2025-12-04T12:10:19.8239848Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2449s] [100%] 2025-12-04T12:10:19.8240078Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2097s] [100%] 2025-12-04T12:10:19.8240083Z 2025-12-04T12:10:19.8240206Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.8240378Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8240446Z Traceback (most recent call last): 2025-12-04T12:10:19.8240633Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8240695Z method(*args, **kwargs) 2025-12-04T12:10:19.8240878Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8240939Z method(*args, **kwargs) 2025-12-04T12:10:19.8241122Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8241181Z with policy(): 2025-12-04T12:10:19.8241363Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8241425Z raise RuntimeError(msg) 
2025-12-04T12:10:19.8241875Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:19.8241878Z 2025-12-04T12:10:19.8241975Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8242268Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8242272Z 2025-12-04T12:10:19.8242383Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8242478Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8242542Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8242621Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8242724Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8242847Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8242923Z graph_break [] 2025-12-04T12:10:19.8243005Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8243174Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8243241Z Traceback (most recent call last): 2025-12-04T12:10:19.8243425Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8243488Z method(*args, **kwargs) 2025-12-04T12:10:19.8243671Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:19.8243731Z method(*args, **kwargs) 2025-12-04T12:10:19.8243914Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8243972Z with policy(): 2025-12-04T12:10:19.8244154Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8244217Z raise RuntimeError(msg) 2025-12-04T12:10:19.8244650Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:19.8244667Z 2025-12-04T12:10:19.8244765Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8245056Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8245059Z 2025-12-04T12:10:19.8245172Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8245269Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8245334Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8245411Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8245498Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8245621Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8245680Z graph_break [] 2025-12-04T12:10:19.8245763Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8245862Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:19.8245924Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8246013Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8246136Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8246222Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8246280Z graph_break [] 2025-12-04T12:10:19.8246363Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8246436Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8246692Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8246759Z Traceback (most recent call last): 2025-12-04T12:10:19.8246945Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8247005Z method(*args, **kwargs) 2025-12-04T12:10:19.8247187Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8247247Z method(*args, **kwargs) 2025-12-04T12:10:19.8247446Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8247517Z with policy(): 2025-12-04T12:10:19.8247699Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8247762Z raise RuntimeError(msg) 2025-12-04T12:10:19.8248191Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:19.8248194Z 2025-12-04T12:10:19.8248292Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8248583Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8248585Z 2025-12-04T12:10:19.8248697Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8248794Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8248858Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8248936Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8249024Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8249158Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8249217Z graph_break [] 2025-12-04T12:10:19.8249298Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8249395Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8249458Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8249539Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8249659Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8249747Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8249804Z graph_break [] 2025-12-04T12:10:19.8249886Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8249982Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8250045Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8250183Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8250304Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8250391Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8250448Z graph_break [] 2025-12-04T12:10:19.8250545Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:19.8250771Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6b1a9ae2a4706be8.xml - 2025-12-04T12:10:19.8250855Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8251494Z FAILED [0.2097s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:19.8251498Z 2025-12-04T12:10:19.8251595Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8251902Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8251906Z 2025-12-04T12:10:19.8252016Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8252116Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.8252207Z ================== 1 failed, 187 deselected, 2 rerun in 2.04s ================== 2025-12-04T12:10:19.8252266Z Got exit code 1 2025-12-04T12:10:19.8252502Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8252660Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:19.8252832Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e6f87b45bc89a81f.xml 2025-12-04T12:10:19.8252913Z ============================= test session starts ============================== 2025-12-04T12:10:19.8253051Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8253115Z cachedir: .pytest_cache 2025-12-04T12:10:19.8253302Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8253369Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8253431Z configfile: pytest.ini 2025-12-04T12:10:19.8253624Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8253738Z collecting ... collected 188 items / 80 deselected / 108 selected 2025-12-04T12:10:19.8253815Z stepcurrent: skipping 80 already run items. 
2025-12-04T12:10:19.8253880Z Running 108 items in this shard 2025-12-04T12:10:19.8253883Z 2025-12-04T12:10:19.8254138Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5731s] [ 0%] 2025-12-04T12:10:19.8254385Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2669s] [ 0%] 2025-12-04T12:10:19.8254612Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2240s] [ 0%] 2025-12-04T12:10:19.8254615Z 2025-12-04T12:10:19.8254688Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.8254858Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8254926Z Traceback (most recent call last): 2025-12-04T12:10:19.8255124Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8255188Z method(*args, **kwargs) 2025-12-04T12:10:19.8255371Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8255435Z method(*args, **kwargs) 2025-12-04T12:10:19.8255615Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8255673Z with policy(): 2025-12-04T12:10:19.8255855Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8255919Z raise RuntimeError(msg) 2025-12-04T12:10:19.8256356Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1111490560. 2025-12-04T12:10:19.8256359Z 2025-12-04T12:10:19.8256466Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8256762Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8256776Z 2025-12-04T12:10:19.8256887Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8256984Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8257048Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8257127Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8257214Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8257338Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8257396Z graph_break [] 2025-12-04T12:10:19.8257484Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8257654Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8257723Z Traceback (most recent call last): 2025-12-04T12:10:19.8257905Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8257967Z method(*args, **kwargs) 2025-12-04T12:10:19.8258147Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8258220Z method(*args, **kwargs) 
2025-12-04T12:10:19.8258399Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8258458Z with policy(): 2025-12-04T12:10:19.8258640Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8258704Z raise RuntimeError(msg) 2025-12-04T12:10:19.8259140Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1111490560 and is now 1136656384. 2025-12-04T12:10:19.8259143Z 2025-12-04T12:10:19.8259240Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8259535Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8259538Z 2025-12-04T12:10:19.8259649Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8259757Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8259822Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8259902Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8259991Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8260185Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8260243Z graph_break [] 2025-12-04T12:10:19.8260329Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8260426Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8260490Z frames [('total', 1), 
('ok', 1)] 2025-12-04T12:10:19.8260568Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8260691Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8260776Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8260836Z graph_break [] 2025-12-04T12:10:19.8260936Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8261012Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8261198Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8261267Z Traceback (most recent call last): 2025-12-04T12:10:19.8261453Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8261516Z method(*args, **kwargs) 2025-12-04T12:10:19.8261701Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8261762Z method(*args, **kwargs) 2025-12-04T12:10:19.8261942Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8262001Z with policy(): 2025-12-04T12:10:19.8262183Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8262245Z raise RuntimeError(msg) 2025-12-04T12:10:19.8262687Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 
2025-12-04T12:10:19.8262690Z 2025-12-04T12:10:19.8262800Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8263094Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8263097Z 2025-12-04T12:10:19.8263208Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8263307Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8263370Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8263450Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8263536Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8263660Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8263720Z graph_break [] 2025-12-04T12:10:19.8263803Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8263903Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8263966Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8264043Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8264187Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8264276Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8264334Z graph_break [] 2025-12-04T12:10:19.8264416Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8264513Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8264576Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8264652Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8264774Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8264860Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8264919Z graph_break [] 2025-12-04T12:10:19.8265000Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8265227Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e6f87b45bc89a81f.xml - 2025-12-04T12:10:19.8265309Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8265970Z FAILED [0.2240s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 2025-12-04T12:10:19.8265984Z 2025-12-04T12:10:19.8266081Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8266375Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8266378Z 2025-12-04T12:10:19.8266489Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8266574Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.8266665Z ================== 1 failed, 80 deselected, 2 rerun in 2.08s =================== 2025-12-04T12:10:19.8266725Z Got exit code 1 2025-12-04T12:10:19.8266787Z Retrying single test... 
2025-12-04T12:10:19.8266963Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-28c2a042d57adc7f.xml 2025-12-04T12:10:19.8267043Z ============================= test session starts ============================== 2025-12-04T12:10:19.8267191Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8267254Z cachedir: .pytest_cache 2025-12-04T12:10:19.8267442Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8267510Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8267571Z configfile: pytest.ini 2025-12-04T12:10:19.8267767Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8267866Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.8268159Z stepcurrent: skipping 80 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8268224Z Running 1 items in this shard 2025-12-04T12:10:19.8268226Z 2025-12-04T12:10:19.8268480Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5943s] [100%] 2025-12-04T12:10:19.8268740Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2497s] [100%] 2025-12-04T12:10:19.8268963Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2252s] [100%] 2025-12-04T12:10:19.8268967Z 2025-12-04T12:10:19.8269041Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.8269211Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8269280Z Traceback (most recent call last): 2025-12-04T12:10:19.8269466Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8269531Z method(*args, **kwargs) 2025-12-04T12:10:19.8269713Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8269776Z method(*args, **kwargs) 2025-12-04T12:10:19.8269967Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8270027Z with policy(): 2025-12-04T12:10:19.8270249Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8270334Z raise 
RuntimeError(msg) 2025-12-04T12:10:19.8270772Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1111490560. 2025-12-04T12:10:19.8270777Z 2025-12-04T12:10:19.8270873Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8271168Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8271171Z 2025-12-04T12:10:19.8271282Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8271379Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8271444Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8271523Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8271610Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8271735Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8271807Z graph_break [] 2025-12-04T12:10:19.8271893Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8272062Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8272130Z Traceback (most recent call last): 2025-12-04T12:10:19.8272315Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8272378Z method(*args, **kwargs) 2025-12-04T12:10:19.8272558Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:19.8272622Z method(*args, **kwargs) 2025-12-04T12:10:19.8272801Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8272860Z with policy(): 2025-12-04T12:10:19.8273041Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8273106Z raise RuntimeError(msg) 2025-12-04T12:10:19.8273560Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1111490560 and is now 1136656384. 2025-12-04T12:10:19.8273563Z 2025-12-04T12:10:19.8273659Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8273954Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8273957Z 2025-12-04T12:10:19.8274066Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8274165Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8274228Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8274307Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8274394Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8274518Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8274590Z graph_break [] 2025-12-04T12:10:19.8274677Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8274786Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:19.8274850Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8274926Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8275052Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8275137Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8275197Z graph_break [] 2025-12-04T12:10:19.8275279Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8275354Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8275525Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8275594Z Traceback (most recent call last): 2025-12-04T12:10:19.8275778Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8275839Z method(*args, **kwargs) 2025-12-04T12:10:19.8276022Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8276082Z method(*args, **kwargs) 2025-12-04T12:10:19.8276262Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8276331Z with policy(): 2025-12-04T12:10:19.8276513Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8276574Z raise RuntimeError(msg) 2025-12-04T12:10:19.8277010Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 
2025-12-04T12:10:19.8277013Z 2025-12-04T12:10:19.8277109Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8277403Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8277407Z 2025-12-04T12:10:19.8277517Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8277613Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8277677Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8277754Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8277853Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8277978Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8278036Z graph_break [] 2025-12-04T12:10:19.8278120Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8278217Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8278279Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8278357Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8278479Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8278567Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8278624Z graph_break [] 2025-12-04T12:10:19.8278707Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8278803Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8278868Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8278955Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8279078Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8279176Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8279234Z graph_break [] 2025-12-04T12:10:19.8279316Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8279542Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-28c2a042d57adc7f.xml - 2025-12-04T12:10:19.8279625Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8280343Z FAILED [0.2252s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 2025-12-04T12:10:19.8280346Z 2025-12-04T12:10:19.8280445Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8280739Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8280741Z 2025-12-04T12:10:19.8280874Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8280958Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.8281050Z ================== 1 failed, 187 deselected, 2 rerun in 2.09s ================== 2025-12-04T12:10:19.8281109Z Got exit code 1 2025-12-04T12:10:19.8281175Z Retrying single test... 
2025-12-04T12:10:19.8281351Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ee4ea21f7c4a1c7e.xml 2025-12-04T12:10:19.8281433Z ============================= test session starts ============================== 2025-12-04T12:10:19.8281571Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8281634Z cachedir: .pytest_cache 2025-12-04T12:10:19.8281822Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8281891Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8281953Z configfile: pytest.ini 2025-12-04T12:10:19.8282148Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8282249Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.8282553Z stepcurrent: skipping 80 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8282620Z Running 1 items in this shard 2025-12-04T12:10:19.8282623Z 2025-12-04T12:10:19.8282873Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5752s] [100%] 2025-12-04T12:10:19.8283119Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2568s] [100%] 2025-12-04T12:10:19.8283341Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2222s] [100%] 2025-12-04T12:10:19.8283343Z 2025-12-04T12:10:19.8283417Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.8283600Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8283669Z Traceback (most recent call last): 2025-12-04T12:10:19.8283872Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8283934Z method(*args, **kwargs) 2025-12-04T12:10:19.8284117Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8284179Z method(*args, **kwargs) 2025-12-04T12:10:19.8284361Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8284420Z with policy(): 2025-12-04T12:10:19.8284601Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8284665Z raise 
RuntimeError(msg) 2025-12-04T12:10:19.8285101Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1111490560. 2025-12-04T12:10:19.8285104Z 2025-12-04T12:10:19.8285201Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8285496Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8285508Z 2025-12-04T12:10:19.8285620Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8285720Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8285784Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8285866Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8285954Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8286079Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8286138Z graph_break [] 2025-12-04T12:10:19.8286223Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8286393Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8286460Z Traceback (most recent call last): 2025-12-04T12:10:19.8286644Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8286707Z method(*args, **kwargs) 2025-12-04T12:10:19.8286887Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:19.8286959Z method(*args, **kwargs) 2025-12-04T12:10:19.8287141Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8287201Z with policy(): 2025-12-04T12:10:19.8287383Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8287445Z raise RuntimeError(msg) 2025-12-04T12:10:19.8287883Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1111490560 and is now 1136656384. 2025-12-04T12:10:19.8287887Z 2025-12-04T12:10:19.8287983Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8288288Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8288291Z 2025-12-04T12:10:19.8288404Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8288515Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8288578Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8288658Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8288744Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8288870Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8288927Z graph_break [] 2025-12-04T12:10:19.8289014Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8289110Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:19.8289175Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8289255Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8289376Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8289463Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8289519Z graph_break [] 2025-12-04T12:10:19.8289603Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8289677Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8289848Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8289928Z Traceback (most recent call last): 2025-12-04T12:10:19.8290145Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8290207Z method(*args, **kwargs) 2025-12-04T12:10:19.8290389Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8290450Z method(*args, **kwargs) 2025-12-04T12:10:19.8290630Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8290689Z with policy(): 2025-12-04T12:10:19.8290870Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8290932Z raise RuntimeError(msg) 2025-12-04T12:10:19.8291365Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 
2025-12-04T12:10:19.8291369Z 2025-12-04T12:10:19.8291480Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8291775Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8291778Z 2025-12-04T12:10:19.8291888Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8291984Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8292048Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8292129Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8292217Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8292340Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8292398Z graph_break [] 2025-12-04T12:10:19.8292480Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8292592Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8292655Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8292732Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8292867Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8292954Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8293010Z graph_break [] 2025-12-04T12:10:19.8293093Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8293188Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8293253Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8293330Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8293451Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8293536Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.8293594Z graph_break [] 2025-12-04T12:10:19.8293676Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:19.8293901Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ee4ea21f7c4a1c7e.xml - 2025-12-04T12:10:19.8293983Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8294628Z FAILED [0.2222s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 2025-12-04T12:10:19.8294643Z 2025-12-04T12:10:19.8294741Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8295032Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8295035Z 2025-12-04T12:10:19.8295145Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8295228Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.8295318Z ================== 1 failed, 187 deselected, 2 rerun in 2.07s ================== 2025-12-04T12:10:19.8295377Z Got exit code 1 2025-12-04T12:10:19.8295614Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8295769Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:19.8295952Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e23288e1fb356080.xml 2025-12-04T12:10:19.8296034Z ============================= test session starts ============================== 2025-12-04T12:10:19.8296173Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8296235Z cachedir: .pytest_cache 2025-12-04T12:10:19.8296424Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8296493Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8296556Z configfile: pytest.ini 2025-12-04T12:10:19.8296749Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8296850Z collecting ... collected 188 items / 81 deselected / 107 selected 2025-12-04T12:10:19.8296925Z stepcurrent: skipping 81 already run items. 
2025-12-04T12:10:19.8296991Z Running 107 items in this shard 2025-12-04T12:10:19.8296994Z 2025-12-04T12:10:19.8297256Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0008s] [ 0%] 2025-12-04T12:10:19.8297516Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7617s] [ 0%] 2025-12-04T12:10:19.8297739Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda FAILED [0.6525s] [ 0%] 2025-12-04T12:10:19.8297743Z 2025-12-04T12:10:19.8297817Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.8297986Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8298051Z Traceback (most recent call last): 2025-12-04T12:10:19.8298240Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8298300Z method(*args, **kwargs) 2025-12-04T12:10:19.8298483Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8298547Z method(*args, **kwargs) 2025-12-04T12:10:19.8298726Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8298785Z with policy(): 2025-12-04T12:10:19.8298977Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8299040Z raise RuntimeError(msg) 2025-12-04T12:10:19.8299472Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:19.8299474Z 2025-12-04T12:10:19.8299571Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8299866Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8299869Z 2025-12-04T12:10:19.8299981Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8300078Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8300168Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8300246Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8300805Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8300931Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8300988Z graph_break [] 2025-12-04T12:10:19.8301073Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8301168Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8301714Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8301801Z current_size = base.storage().size() 2025-12-04T12:10:19.8301866Z Autotune Choices Stats: 2025-12-04T12:10:19.8302283Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:19.8302382Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8302454Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8302605Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8302878Z triton_mm_6 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8303145Z triton_mm_0 0.0072 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8303406Z triton_mm_4 0.0073 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8303663Z triton_mm_1 0.0075 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8303938Z triton_mm_5 0.0077 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8304196Z triton_mm_3 0.0081 ms 80.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8304455Z triton_mm_2 0.0085 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8304716Z triton_mm_7 0.0094 ms 68.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8304781Z _scaled_mm 0.0284 ms 22.8% 2025-12-04T12:10:19.8304952Z SingleProcess AUTOTUNE benchmarking takes 0.0440 seconds and 0.1952 seconds precompiling for 9 choices 2025-12-04T12:10:19.8305122Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8305189Z Traceback (most recent call last): 2025-12-04T12:10:19.8305376Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8305438Z method(*args, **kwargs) 2025-12-04T12:10:19.8305619Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8305681Z method(*args, **kwargs) 2025-12-04T12:10:19.8305861Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8305921Z with policy(): 2025-12-04T12:10:19.8306101Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8306166Z raise RuntimeError(msg) 2025-12-04T12:10:19.8306614Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:19.8306628Z 2025-12-04T12:10:19.8306725Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8307021Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8307024Z 2025-12-04T12:10:19.8307135Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8307236Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8307302Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8307385Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8307928Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8308055Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 
1), ('ok', 1)] 2025-12-04T12:10:19.8308128Z graph_break [] 2025-12-04T12:10:19.8308214Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8308314Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8308857Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8308930Z current_size = base.storage().size() 2025-12-04T12:10:19.8308994Z Autotune Choices Stats: 2025-12-04T12:10:19.8309411Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:19.8309495Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8309569Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8309728Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8310005Z triton_mm_6 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8310306Z triton_mm_0 0.0072 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8310566Z triton_mm_4 
0.0073 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8310828Z triton_mm_1 0.0075 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8311100Z triton_mm_5 0.0077 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8311373Z triton_mm_3 0.0081 ms 80.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8311634Z triton_mm_2 0.0085 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8311892Z triton_mm_7 0.0094 ms 68.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8311959Z _scaled_mm 0.0284 ms 22.8% 2025-12-04T12:10:19.8312117Z SingleProcess AUTOTUNE benchmarking takes 0.0440 seconds and 0.1952 seconds precompiling for 9 choices 2025-12-04T12:10:19.8312219Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8312283Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8312365Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8312491Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8313044Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8313102Z graph_break [] 2025-12-04T12:10:19.8313190Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8313287Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8313353Z Autotune Choices Stats: 2025-12-04T12:10:19.8313762Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:19.8313845Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8313918Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8314066Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8314348Z triton_mm_13 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8314614Z triton_mm_10 0.0070 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:19.8314878Z triton_mm_9 0.0071 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8315137Z triton_mm_15 0.0072 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8315418Z triton_mm_11 0.0078 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8315692Z triton_mm_14 0.0080 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8315952Z triton_mm_8 0.0080 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8316216Z triton_mm_12 0.0100 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8316280Z _scaled_mm 0.0236 ms 27.6% 2025-12-04T12:10:19.8316437Z SingleProcess AUTOTUNE benchmarking takes 0.0428 seconds and 0.1006 seconds precompiling for 9 choices 2025-12-04T12:10:19.8316513Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8316687Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8316754Z 
Traceback (most recent call last): 2025-12-04T12:10:19.8316943Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8317020Z method(*args, **kwargs) 2025-12-04T12:10:19.8317204Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8317269Z method(*args, **kwargs) 2025-12-04T12:10:19.8317452Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8317514Z with policy(): 2025-12-04T12:10:19.8317697Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8317764Z raise RuntimeError(msg) 2025-12-04T12:10:19.8318201Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:19.8318205Z 2025-12-04T12:10:19.8318306Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8318602Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8318615Z 2025-12-04T12:10:19.8318731Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8318828Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8318893Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8318976Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8319513Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8319642Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8319700Z graph_break [] 2025-12-04T12:10:19.8319788Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8319898Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8320516Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8320602Z current_size = base.storage().size() 2025-12-04T12:10:19.8320667Z Autotune Choices Stats: 2025-12-04T12:10:19.8321079Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:19.8321164Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8321236Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8321386Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8321655Z triton_mm_6 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8321917Z triton_mm_0 0.0072 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8322196Z triton_mm_4 0.0073 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8322459Z triton_mm_1 0.0075 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8322719Z triton_mm_5 0.0077 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8322980Z triton_mm_3 0.0081 ms 80.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8323254Z triton_mm_2 0.0085 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8323516Z triton_mm_7 0.0094 ms 68.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8323580Z _scaled_mm 0.0284 ms 22.8% 2025-12-04T12:10:19.8323739Z SingleProcess AUTOTUNE benchmarking takes 0.0440 seconds and 0.1952 seconds precompiling for 9 choices 2025-12-04T12:10:19.8323835Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8323901Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8323979Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8324105Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8324659Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8324730Z graph_break [] 2025-12-04T12:10:19.8324818Z 
aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8324915Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8324979Z Autotune Choices Stats: 2025-12-04T12:10:19.8325387Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:19.8325470Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8325541Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8325691Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8325960Z triton_mm_13 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8326226Z triton_mm_10 0.0070 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8326499Z triton_mm_9 0.0071 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8326759Z triton_mm_15 0.0072 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8327022Z triton_mm_11 0.0078 ms 83.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8327281Z triton_mm_14 0.0080 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8327544Z triton_mm_8 0.0080 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8327816Z triton_mm_12 0.0100 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8327880Z _scaled_mm 0.0236 ms 27.6% 2025-12-04T12:10:19.8328038Z SingleProcess AUTOTUNE benchmarking takes 0.0428 seconds and 0.1006 seconds precompiling for 9 choices 2025-12-04T12:10:19.8328138Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8328203Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8328283Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8328412Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8328958Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8329028Z graph_break [] 
2025-12-04T12:10:19.8329113Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8329210Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8329272Z Autotune Choices Stats: 2025-12-04T12:10:19.8329683Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006320000160485506, "best_triton_pos": 0} 2025-12-04T12:10:19.8329763Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8329835Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8329985Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8330329Z triton_mm_16 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8330393Z _scaled_mm 0.0066 ms 96.3% 2025-12-04T12:10:19.8330651Z triton_mm_19 0.0066 ms 95.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8330928Z triton_mm_20 0.0066 ms 95.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8331191Z triton_mm_18 0.0069 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:19.8331451Z triton_mm_23 0.0073 ms 86.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8331712Z triton_mm_21 0.0075 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8331971Z triton_mm_22 0.0076 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8332254Z triton_mm_17 0.0096 ms 65.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8332411Z SingleProcess AUTOTUNE benchmarking takes 0.0584 seconds and 0.2146 seconds precompiling for 9 choices 2025-12-04T12:10:19.8332634Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e23288e1fb356080.xml - 2025-12-04T12:10:19.8332717Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8333380Z FAILED [0.6525s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:19.8333383Z 2025-12-04T12:10:19.8333483Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8333792Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8333795Z 2025-12-04T12:10:19.8333908Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8333992Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.8334083Z ================== 1 failed, 81 deselected, 2 rerun in 3.43s =================== 2025-12-04T12:10:19.8334141Z Got exit code 1 2025-12-04T12:10:19.8334203Z Retrying single test... 2025-12-04T12:10:19.8334375Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-48358a98e5851b78.xml 2025-12-04T12:10:19.8334458Z ============================= test session starts ============================== 2025-12-04T12:10:19.8334597Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8334661Z cachedir: .pytest_cache 2025-12-04T12:10:19.8334851Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8334920Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8334982Z configfile: pytest.ini 2025-12-04T12:10:19.8335178Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8335288Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.8335579Z stepcurrent: skipping 81 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8335646Z Running 1 items in this shard 2025-12-04T12:10:19.8335648Z 2025-12-04T12:10:19.8335893Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0012s] [100%] 2025-12-04T12:10:19.8336138Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7348s] [100%] 2025-12-04T12:10:19.8336356Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda FAILED [0.6793s] [100%] 2025-12-04T12:10:19.8336360Z 2025-12-04T12:10:19.8336434Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.8336615Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8336684Z Traceback (most recent call last): 2025-12-04T12:10:19.8336874Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8336939Z method(*args, **kwargs) 2025-12-04T12:10:19.8337121Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8337183Z method(*args, **kwargs) 2025-12-04T12:10:19.8337366Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8337425Z with policy(): 2025-12-04T12:10:19.8337609Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8337671Z raise RuntimeError(msg) 
2025-12-04T12:10:19.8338118Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:19.8338132Z 2025-12-04T12:10:19.8338230Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8338526Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8338528Z 2025-12-04T12:10:19.8338639Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8338736Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8338800Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8338880Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8339419Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8339544Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8339604Z graph_break [] 2025-12-04T12:10:19.8339689Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8339801Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8340414Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8340487Z current_size = base.storage().size() 2025-12-04T12:10:19.8340549Z Autotune Choices Stats: 2025-12-04T12:10:19.8340969Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:19.8341053Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8341124Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8341272Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8341561Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8341825Z triton_mm_6 0.0063 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8342089Z triton_mm_2 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8342351Z triton_mm_3 0.0070 ms 87.5% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8342624Z triton_mm_7 0.0070 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8342882Z triton_mm_5 0.0079 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8343158Z triton_mm_0 0.0088 ms 69.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8343418Z triton_mm_4 0.0094 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8343484Z _scaled_mm 0.0255 ms 24.2% 2025-12-04T12:10:19.8343643Z SingleProcess AUTOTUNE benchmarking takes 0.0435 seconds and 0.1905 seconds precompiling for 9 choices 2025-12-04T12:10:19.8343817Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8343888Z Traceback (most recent call last): 2025-12-04T12:10:19.8344077Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8344138Z method(*args, **kwargs) 2025-12-04T12:10:19.8344323Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8344398Z 
method(*args, **kwargs) 2025-12-04T12:10:19.8344582Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8344641Z with policy(): 2025-12-04T12:10:19.8344826Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8344891Z raise RuntimeError(msg) 2025-12-04T12:10:19.8345328Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:19.8345331Z 2025-12-04T12:10:19.8345433Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8345729Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8345733Z 2025-12-04T12:10:19.8345847Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8345954Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8346025Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8346105Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8346642Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8346771Z aot_autograd [('total', 
1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8346829Z graph_break [] 2025-12-04T12:10:19.8346916Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8347013Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8347565Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8347646Z current_size = base.storage().size() 2025-12-04T12:10:19.8347710Z Autotune Choices Stats: 2025-12-04T12:10:19.8348125Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:19.8348208Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8348279Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8348429Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8348700Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8348963Z triton_mm_6 0.0063 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=1 2025-12-04T12:10:19.8349245Z triton_mm_2 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8349504Z triton_mm_3 0.0070 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8349763Z triton_mm_7 0.0070 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8350022Z triton_mm_5 0.0079 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8350318Z triton_mm_0 0.0088 ms 69.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8350596Z triton_mm_4 0.0094 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8350658Z _scaled_mm 0.0255 ms 24.2% 2025-12-04T12:10:19.8350819Z SingleProcess AUTOTUNE benchmarking takes 0.0435 seconds and 0.1905 seconds precompiling for 9 choices 2025-12-04T12:10:19.8350917Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8350983Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8351065Z stats [('calls_captured', 1), ('unique_graphs', 1)] 
2025-12-04T12:10:19.8351191Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8351738Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8351801Z graph_break [] 2025-12-04T12:10:19.8351885Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8351997Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8352059Z Autotune Choices Stats: 2025-12-04T12:10:19.8352468Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:19.8352551Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8352621Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8352772Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8353040Z triton_mm_14 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8353304Z triton_mm_10 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8353565Z triton_mm_9 0.0068 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8353842Z triton_mm_8 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8354104Z triton_mm_15 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8354366Z triton_mm_12 0.0076 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8354626Z triton_mm_11 0.0077 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8354897Z triton_mm_13 0.0078 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8354963Z _scaled_mm 0.0243 ms 25.3% 2025-12-04T12:10:19.8355120Z SingleProcess AUTOTUNE benchmarking takes 0.0421 seconds and 0.1123 seconds precompiling for 9 choices 2025-12-04T12:10:19.8355200Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8355370Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8355440Z Traceback (most recent call last): 2025-12-04T12:10:19.8355630Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8355694Z method(*args, **kwargs) 2025-12-04T12:10:19.8355878Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8355939Z method(*args, **kwargs) 2025-12-04T12:10:19.8356132Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8356191Z with policy(): 2025-12-04T12:10:19.8356374Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8356447Z raise RuntimeError(msg) 2025-12-04T12:10:19.8356881Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:19.8356884Z 2025-12-04T12:10:19.8356983Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8357279Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8357282Z 2025-12-04T12:10:19.8357394Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8357495Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8357558Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8357641Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8358184Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8358319Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8358379Z graph_break [] 2025-12-04T12:10:19.8358464Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8358560Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8359104Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8359176Z current_size = base.storage().size() 2025-12-04T12:10:19.8359237Z Autotune Choices Stats: 2025-12-04T12:10:19.8359663Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:19.8359744Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8359815Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8359965Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8360284Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8360547Z triton_mm_6 0.0063 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8360821Z triton_mm_2 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8361081Z triton_mm_3 0.0070 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8361355Z triton_mm_7 0.0070 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8361615Z triton_mm_5 0.0079 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8361877Z triton_mm_0 0.0088 ms 69.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8362136Z triton_mm_4 0.0094 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8362202Z _scaled_mm 0.0255 ms 24.2% 2025-12-04T12:10:19.8362359Z SingleProcess AUTOTUNE benchmarking takes 0.0435 seconds and 0.1905 seconds precompiling for 9 choices 2025-12-04T12:10:19.8362458Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8362537Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8362618Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8362742Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8363278Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8363341Z graph_break [] 2025-12-04T12:10:19.8363425Z 
aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8363523Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8363587Z Autotune Choices Stats: 2025-12-04T12:10:19.8364012Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:19.8364093Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8364165Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8364312Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8364581Z triton_mm_14 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8364844Z triton_mm_10 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8365108Z triton_mm_9 0.0068 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8365377Z triton_mm_8 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8365652Z triton_mm_15 0.0068 ms 90.1% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8365914Z triton_mm_12 0.0076 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8366175Z triton_mm_11 0.0077 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8366434Z triton_mm_13 0.0078 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8366499Z _scaled_mm 0.0243 ms 25.3% 2025-12-04T12:10:19.8366655Z SingleProcess AUTOTUNE benchmarking takes 0.0421 seconds and 0.1123 seconds precompiling for 9 choices 2025-12-04T12:10:19.8366753Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8366828Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8366910Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8367035Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8367575Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8367637Z graph_break [] 
2025-12-04T12:10:19.8367722Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8367818Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8367886Z Autotune Choices Stats: 2025-12-04T12:10:19.8368293Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_18", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:19.8368385Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8368459Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8368608Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8368880Z triton_mm_18 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8369139Z triton_mm_19 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8369403Z triton_mm_16 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8369678Z triton_mm_17 0.0069 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8369953Z triton_mm_20 0.0070 ms 85.2% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8370364Z triton_mm_21 0.0074 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8370624Z triton_mm_22 0.0074 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8370885Z triton_mm_23 0.0076 ms 78.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8370949Z _scaled_mm 0.0274 ms 21.9% 2025-12-04T12:10:19.8371106Z SingleProcess AUTOTUNE benchmarking takes 0.0583 seconds and 0.2093 seconds precompiling for 9 choices 2025-12-04T12:10:19.8371327Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-48358a98e5851b78.xml - 2025-12-04T12:10:19.8371427Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8372072Z FAILED [0.6793s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:19.8372077Z 2025-12-04T12:10:19.8372174Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8372473Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8372475Z 2025-12-04T12:10:19.8372587Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8372675Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.8372766Z ================== 1 failed, 187 deselected, 2 rerun in 3.43s ================== 2025-12-04T12:10:19.8372827Z Got exit code 1 2025-12-04T12:10:19.8372888Z Retrying single test... 2025-12-04T12:10:19.8373077Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1cb1a40d121a452d.xml 2025-12-04T12:10:19.8373158Z ============================= test session starts ============================== 2025-12-04T12:10:19.8373299Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8373361Z cachedir: .pytest_cache 2025-12-04T12:10:19.8373554Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8373622Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8373689Z configfile: pytest.ini 2025-12-04T12:10:19.8373884Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8373988Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.8375152Z stepcurrent: skipping 81 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8375219Z Running 1 items in this shard 2025-12-04T12:10:19.8375237Z 2025-12-04T12:10:19.8375485Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.1483s] [100%] 2025-12-04T12:10:19.8375729Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7707s] [100%] 2025-12-04T12:10:19.8375951Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda FAILED [0.6814s] [100%] 2025-12-04T12:10:19.8375953Z 2025-12-04T12:10:19.8376027Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.8376200Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8376266Z Traceback (most recent call last): 2025-12-04T12:10:19.8376456Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8376520Z method(*args, **kwargs) 2025-12-04T12:10:19.8376705Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8376767Z method(*args, **kwargs) 2025-12-04T12:10:19.8376950Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8377023Z with policy(): 2025-12-04T12:10:19.8377204Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8377270Z raise RuntimeError(msg) 
2025-12-04T12:10:19.8377712Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:19.8377715Z 2025-12-04T12:10:19.8377817Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8378111Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8378114Z 2025-12-04T12:10:19.8378228Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8378325Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8378394Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8378484Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8379027Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8379154Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8379212Z graph_break [] 2025-12-04T12:10:19.8379301Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8379398Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8379954Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8380036Z current_size = base.storage().size() 2025-12-04T12:10:19.8380152Z Autotune Choices Stats: 2025-12-04T12:10:19.8380568Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:19.8380652Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8380722Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8380874Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8381147Z triton_mm_4 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8381414Z triton_mm_3 0.0068 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8381678Z triton_mm_1 0.0072 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8381952Z triton_mm_5 0.0074 ms 90.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8382215Z triton_mm_7 0.0074 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8382479Z triton_mm_2 0.0076 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8382736Z triton_mm_6 0.0082 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8382998Z triton_mm_0 0.0084 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8383074Z _scaled_mm 0.0258 ms 26.0% 2025-12-04T12:10:19.8383236Z SingleProcess AUTOTUNE benchmarking takes 0.0453 seconds and 0.2043 seconds precompiling for 9 choices 2025-12-04T12:10:19.8383407Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8383478Z Traceback (most recent call last): 2025-12-04T12:10:19.8383666Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8383731Z method(*args, **kwargs) 2025-12-04T12:10:19.8383914Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8383980Z 
method(*args, **kwargs) 2025-12-04T12:10:19.8384160Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8384222Z with policy(): 2025-12-04T12:10:19.8384425Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8384491Z raise RuntimeError(msg) 2025-12-04T12:10:19.8384948Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:19.8384951Z 2025-12-04T12:10:19.8385050Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8385352Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8385354Z 2025-12-04T12:10:19.8385467Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8385567Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8385632Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8385715Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8386259Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8386396Z aot_autograd [('total', 
1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8386455Z graph_break [] 2025-12-04T12:10:19.8386541Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8386641Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8387184Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8387256Z current_size = base.storage().size() 2025-12-04T12:10:19.8387319Z Autotune Choices Stats: 2025-12-04T12:10:19.8387733Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:19.8387827Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8387901Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8388050Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8388322Z triton_mm_4 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8388583Z triton_mm_3 0.0068 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:19.8388847Z triton_mm_1 0.0072 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8389119Z triton_mm_5 0.0074 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8389387Z triton_mm_7 0.0074 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8389648Z triton_mm_2 0.0076 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8389906Z triton_mm_6 0.0082 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8390215Z triton_mm_0 0.0084 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8390279Z _scaled_mm 0.0258 ms 26.0% 2025-12-04T12:10:19.8390435Z SingleProcess AUTOTUNE benchmarking takes 0.0453 seconds and 0.2043 seconds precompiling for 9 choices 2025-12-04T12:10:19.8390533Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8390596Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8390677Z stats [('calls_captured', 1), ('unique_graphs', 1)] 
2025-12-04T12:10:19.8390815Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8391352Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8391410Z graph_break [] 2025-12-04T12:10:19.8391495Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8391593Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8391655Z Autotune Choices Stats: 2025-12-04T12:10:19.8392062Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007000000216066837, "best_triton_pos": 0} 2025-12-04T12:10:19.8392142Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8392226Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8392374Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8392641Z triton_mm_11 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8392903Z triton_mm_9 0.0071 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8393166Z triton_mm_8 0.0075 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8393227Z _scaled_mm 0.0076 ms 92.6% 2025-12-04T12:10:19.8393503Z triton_mm_13 0.0077 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8393781Z triton_mm_15 0.0083 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8394039Z triton_mm_14 0.0088 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8394302Z triton_mm_10 0.0092 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8394564Z triton_mm_12 0.0092 ms 75.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8394723Z SingleProcess AUTOTUNE benchmarking takes 0.0444 seconds and 0.1067 seconds precompiling for 9 choices 2025-12-04T12:10:19.8394797Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8394966Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8395033Z Traceback (most recent call last): 2025-12-04T12:10:19.8395233Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8395295Z method(*args, **kwargs) 2025-12-04T12:10:19.8395479Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8395541Z method(*args, **kwargs) 2025-12-04T12:10:19.8395723Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8395782Z with policy(): 2025-12-04T12:10:19.8395965Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8396029Z raise RuntimeError(msg) 2025-12-04T12:10:19.8396466Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:19.8396470Z 2025-12-04T12:10:19.8396568Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8396875Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8396877Z 2025-12-04T12:10:19.8396989Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8397088Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8397152Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8397230Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8397771Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8397897Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8397965Z graph_break [] 2025-12-04T12:10:19.8398051Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8398158Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8398701Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8398770Z current_size = base.storage().size() 2025-12-04T12:10:19.8398832Z Autotune Choices Stats: 2025-12-04T12:10:19.8399248Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:19.8399330Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8399400Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8399547Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8399816Z triton_mm_4 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8400087Z triton_mm_3 0.0068 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8400368Z triton_mm_1 0.0072 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8400625Z triton_mm_5 0.0074 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8400884Z triton_mm_7 0.0074 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8401149Z triton_mm_2 0.0076 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8401429Z triton_mm_6 0.0082 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8401690Z triton_mm_0 0.0084 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8401753Z _scaled_mm 0.0258 ms 26.0% 2025-12-04T12:10:19.8401909Z SingleProcess AUTOTUNE benchmarking takes 0.0453 seconds and 0.2043 seconds precompiling for 9 choices 2025-12-04T12:10:19.8402005Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8402069Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8402147Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8402272Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8402816Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8402983Z graph_break [] 2025-12-04T12:10:19.8403068Z 
aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8403164Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8403229Z Autotune Choices Stats: 2025-12-04T12:10:19.8403642Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007000000216066837, "best_triton_pos": 0} 2025-12-04T12:10:19.8403724Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8403795Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8403944Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8404209Z triton_mm_11 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8404487Z triton_mm_9 0.0071 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8404749Z triton_mm_8 0.0075 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8404809Z _scaled_mm 0.0076 ms 92.6% 2025-12-04T12:10:19.8405069Z triton_mm_13 0.0077 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8405327Z 
triton_mm_15 0.0083 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8405588Z triton_mm_14 0.0088 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8405861Z triton_mm_10 0.0092 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8406128Z triton_mm_12 0.0092 ms 75.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8406285Z SingleProcess AUTOTUNE benchmarking takes 0.0444 seconds and 0.1067 seconds precompiling for 9 choices 2025-12-04T12:10:19.8406382Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8406446Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8406525Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8406651Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8407195Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8407267Z graph_break [] 
2025-12-04T12:10:19.8407351Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:19.8407447Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8407510Z Autotune Choices Stats: 2025-12-04T12:10:19.8407918Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_23", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:19.8407997Z AUTOTUNE scaled_mm(1024x32, 32x16, 1024x1, 1x16, 16) 2025-12-04T12:10:19.8408068Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8408217Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8408486Z triton_mm_23 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8408746Z triton_mm_22 0.0066 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.8409020Z triton_mm_16 0.0070 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8409280Z triton_mm_21 0.0075 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8409539Z triton_mm_19 0.0077 ms 80.8% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8409802Z triton_mm_18 0.0086 ms 72.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8410076Z triton_mm_17 0.0087 ms 71.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.8410397Z triton_mm_20 0.0096 ms 64.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8410461Z _scaled_mm 0.0243 ms 25.7% 2025-12-04T12:10:19.8410616Z SingleProcess AUTOTUNE benchmarking takes 0.0611 seconds and 0.2136 seconds precompiling for 9 choices 2025-12-04T12:10:19.8410841Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1cb1a40d121a452d.xml - 2025-12-04T12:10:19.8410925Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8411652Z FAILED [0.6814s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:19.8411670Z 2025-12-04T12:10:19.8411770Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8412064Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8412067Z 2025-12-04T12:10:19.8412179Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8412262Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.8412356Z ================== 1 failed, 187 deselected, 2 rerun in 3.62s ================== 2025-12-04T12:10:19.8412415Z Got exit code 1 2025-12-04T12:10:19.8412654Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.8412809Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:19.8412981Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f8c72c691e4c8188.xml 2025-12-04T12:10:19.8413060Z ============================= test session starts ============================== 2025-12-04T12:10:19.8413199Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8413277Z cachedir: .pytest_cache 2025-12-04T12:10:19.8413466Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8413535Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8413598Z configfile: pytest.ini 2025-12-04T12:10:19.8413794Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8413896Z collecting ... 
collected 188 items / 82 deselected / 106 selected 2025-12-04T12:10:19.8413974Z stepcurrent: skipping 82 already run items. 2025-12-04T12:10:19.8414039Z Running 106 items in this shard 2025-12-04T12:10:19.8414042Z 2025-12-04T12:10:19.8415063Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmp5xn4wba2/gq/cgq7i5rlvdrrvcdkbdnlm6vdtbqgbom5fqgk52pshhhcjvc26l7v.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.8415246Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.8415504Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.8415696Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.8416031Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.8416197Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.8416505Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.8416686Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.8416980Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.8417170Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.8417482Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.8417644Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.8417964Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.8418193Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.8418570Z E1204 10:53:17.367000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.8419387Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmp5xn4wba2/7m/c7mtveuav4puqt5dgzolc42kisdfqgojuryt7o7nr5gtd5i5zfds.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.8419564Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.8419815Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.8420011Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.8420396Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.8420557Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.8420852Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.8421018Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.8421323Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in 
precompile 2025-12-04T12:10:19.8421512Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.8421835Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.8421997Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.8422311Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.8422538Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.8422896Z E1204 10:53:17.396000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.8423701Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmp5xn4wba2/g7/cg7aoufqkf5xir3qngizn4udlsv6ous2hprbqmfjppr4gvrrqyhv.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.8423894Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.8424143Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.8424331Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.8424656Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.8424815Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.8425131Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.8425295Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.8425588Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.8425772Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.8426083Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.8426256Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.8426570Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.8426809Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.8427164Z E1204 10:53:17.398000 603884 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.8427240Z ('RERUN', {'yellow': True}) [2.8392s] [ 0%] 2025-12-04T12:10:19.8427602Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda E1204 10:53:18.791000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8427940Z E1204 10:53:18.791000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.8428097Z E1204 10:53:18.791000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:19.8428269Z E1204 10:53:18.793000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8428614Z E1204 10:53:18.793000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.8428768Z E1204 10:53:18.793000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.8428940Z E1204 10:53:18.795000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8429273Z E1204 10:53:18.795000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.8429428Z E1204 10:53:18.795000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.8429500Z ('RERUN', {'yellow': True}) [1.1025s] [ 0%] 2025-12-04T12:10:19.8429870Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda E1204 10:53:19.728000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8430256Z E1204 10:53:19.728000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.8430410Z E1204 10:53:19.728000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:19.8430581Z E1204 10:53:19.730000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8430915Z E1204 10:53:19.730000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.8431068Z E1204 10:53:19.730000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.8431253Z E1204 10:53:19.732000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8431587Z E1204 10:53:19.732000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:19.8431754Z E1204 10:53:19.732000 603884 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:19.8431814Z FAILED [0.9003s] [ 0%] 2025-12-04T12:10:19.8431818Z 2025-12-04T12:10:19.8431894Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.8432067Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8432136Z Traceback (most recent call last): 2025-12-04T12:10:19.8432323Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8432387Z method(*args, **kwargs) 2025-12-04T12:10:19.8432567Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8432631Z method(*args, **kwargs) 2025-12-04T12:10:19.8432809Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8432870Z with policy(): 2025-12-04T12:10:19.8433050Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8434509Z raise RuntimeError(msg) 2025-12-04T12:10:19.8434956Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1075838976. 
2025-12-04T12:10:19.8434961Z 2025-12-04T12:10:19.8435060Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8435365Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8435368Z 2025-12-04T12:10:19.8435481Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8435582Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8435646Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8435728Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8436377Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8436508Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8436567Z graph_break [] 2025-12-04T12:10:19.8436655Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.8436752Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8437315Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8437386Z current_size = base.storage().size() 2025-12-04T12:10:19.8437448Z Autotune Choices Stats: 2025-12-04T12:10:19.8437889Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00788000039756298, "best_triton_pos": 0} 2025-12-04T12:10:19.8437981Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8438053Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8438200Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8438476Z triton_mm_13 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8438740Z triton_mm_18 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8439002Z triton_mm_10 0.0080 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8439277Z triton_mm_7 0.0084 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8439539Z triton_mm_15 0.0084 ms 93.4% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8439797Z triton_mm_17 0.0086 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8440056Z triton_mm_8 0.0093 ms 84.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8440364Z triton_mm_9 0.0095 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8440642Z triton_mm_11 0.0095 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8440902Z triton_mm_16 0.0096 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8441064Z SingleProcess AUTOTUNE benchmarking takes 0.0868 seconds and 0.5809 seconds precompiling for 18 choices 2025-12-04T12:10:19.8441236Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8441306Z Traceback (most recent call last): 2025-12-04T12:10:19.8441494Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8441558Z method(*args, **kwargs) 
2025-12-04T12:10:19.8441742Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8441820Z method(*args, **kwargs) 2025-12-04T12:10:19.8442001Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8442075Z with policy(): 2025-12-04T12:10:19.8442257Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.8442320Z raise RuntimeError(msg) 2025-12-04T12:10:19.8442763Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1145044992. 2025-12-04T12:10:19.8442766Z 2025-12-04T12:10:19.8442865Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8443166Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8443169Z 2025-12-04T12:10:19.8443281Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8443378Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8443443Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8443523Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8444161Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('fxgraph_cache_miss', 1), 
('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8444289Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8444349Z graph_break [] 2025-12-04T12:10:19.8444435Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.8444532Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8445072Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8445142Z current_size = base.storage().size() 2025-12-04T12:10:19.8445207Z Autotune Choices Stats: 2025-12-04T12:10:19.8445639Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00788000039756298, "best_triton_pos": 0} 2025-12-04T12:10:19.8445731Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8445815Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8446137Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8446615Z triton_mm_13 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:19.8447249Z triton_mm_18 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8447816Z triton_mm_10 0.0080 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8448398Z triton_mm_7 0.0084 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8448975Z triton_mm_15 0.0084 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8449546Z triton_mm_17 0.0086 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8450154Z triton_mm_8 0.0093 ms 84.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8450730Z triton_mm_9 0.0095 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8451324Z triton_mm_11 0.0095 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8451902Z triton_mm_16 0.0096 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8452366Z SingleProcess AUTOTUNE benchmarking takes 0.0868 seconds and 0.5809 seconds precompiling for 18 choices 2025-12-04T12:10:19.8452764Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8452977Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8453154Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8453397Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8454144Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8454781Z graph_break [] 2025-12-04T12:10:19.8454945Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.8455166Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8455369Z Autotune Choices Stats: 2025-12-04T12:10:19.8455866Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_30", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007639999967068434, "best_triton_pos": 0} 2025-12-04T12:10:19.8456404Z AUTOTUNE 
scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8456597Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8456847Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8457336Z triton_mm_30 0.0076 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8457919Z triton_mm_28 0.0081 ms 94.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8458478Z triton_mm_34 0.0085 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8459039Z triton_mm_38 0.0086 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8459598Z triton_mm_37 0.0086 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8460207Z triton_mm_27 0.0087 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8460766Z triton_mm_33 0.0088 ms 87.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8461347Z triton_mm_32 0.0089 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8461907Z triton_mm_35 0.0089 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8462470Z triton_mm_29 0.0093 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8462929Z SingleProcess AUTOTUNE benchmarking takes 0.1244 seconds and 0.4590 seconds precompiling for 21 choices 2025-12-04T12:10:19.8463203Z =================================== FAILURES =================================== 2025-12-04T12:10:19.8463491Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.8463770Z Traceback (most recent call last): 2025-12-04T12:10:19.8464075Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8464363Z method(*args, **kwargs) 2025-12-04T12:10:19.8464635Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.8464918Z method(*args, **kwargs) 2025-12-04T12:10:19.8465186Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.8465462Z with policy(): 2025-12-04T12:10:19.8465722Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", 
line 2705, in __exit__ 2025-12-04T12:10:19.8466007Z raise RuntimeError(msg) 2025-12-04T12:10:19.8466570Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 2025-12-04T12:10:19.8467046Z 2025-12-04T12:10:19.8467146Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8467605Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8467943Z 2025-12-04T12:10:19.8468058Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8468307Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8468508Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8468682Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8469430Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8470279Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8470502Z graph_break [] 2025-12-04T12:10:19.8470664Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.8470883Z ----------------------------- Captured stderr call 
----------------------------- 2025-12-04T12:10:19.8471584Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.8472234Z current_size = base.storage().size() 2025-12-04T12:10:19.8472399Z Autotune Choices Stats: 2025-12-04T12:10:19.8472902Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00788000039756298, "best_triton_pos": 0} 2025-12-04T12:10:19.8473439Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8473633Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8473884Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8474360Z triton_mm_13 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8474931Z triton_mm_18 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8475495Z triton_mm_10 0.0080 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:19.8476063Z triton_mm_7 0.0084 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8476641Z triton_mm_15 0.0084 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8477205Z triton_mm_17 0.0086 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8477790Z triton_mm_8 0.0093 ms 84.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8478347Z triton_mm_9 0.0095 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8478909Z triton_mm_11 0.0095 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8479471Z triton_mm_16 0.0096 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8479930Z SingleProcess AUTOTUNE benchmarking takes 0.0868 seconds and 0.5809 seconds precompiling for 18 choices 2025-12-04T12:10:19.8480307Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:19.8480526Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8480699Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8480938Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8481651Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8482287Z graph_break [] 2025-12-04T12:10:19.8482447Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.8482667Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8482864Z Autotune Choices Stats: 2025-12-04T12:10:19.8483360Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_30", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007639999967068434, "best_triton_pos": 0} 2025-12-04T12:10:19.8483908Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.8484103Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8484353Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8484807Z triton_mm_30 0.0076 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8485369Z 
triton_mm_28 0.0081 ms 94.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8485928Z triton_mm_34 0.0085 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8486503Z triton_mm_38 0.0086 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8487078Z triton_mm_37 0.0086 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8487644Z triton_mm_27 0.0087 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8488207Z triton_mm_33 0.0088 ms 87.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8488770Z triton_mm_32 0.0089 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8489352Z triton_mm_35 0.0089 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:19.8489911Z triton_mm_29 0.0093 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8490439Z SingleProcess AUTOTUNE benchmarking takes 0.1244 seconds and 0.4590 seconds precompiling for 21 choices 2025-12-04T12:10:19.8490729Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.8490930Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.8491105Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.8491344Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.8492051Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.8492686Z graph_break [] 2025-12-04T12:10:19.8492845Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.8493064Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.8493261Z Autotune Choices Stats: 2025-12-04T12:10:19.8493774Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_53", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007799999788403511, "best_triton_pos": 0} 2025-12-04T12:10:19.8494306Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 
2025-12-04T12:10:19.8494499Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.8494750Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.8495204Z triton_mm_53 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8495783Z triton_mm_57 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8496345Z triton_mm_52 0.0080 ms 97.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8496933Z triton_mm_47 0.0081 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8497495Z triton_mm_50 0.0081 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8498053Z triton_mm_54 0.0083 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8498615Z triton_mm_55 0.0083 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:19.8499176Z triton_mm_49 0.0086 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.8499760Z triton_mm_51 0.0088 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8500372Z triton_mm_46 0.0092 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.8500831Z SingleProcess AUTOTUNE benchmarking takes 0.1452 seconds and 0.3152 seconds precompiling for 21 choices 2025-12-04T12:10:19.8501251Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f8c72c691e4c8188.xml - 2025-12-04T12:10:19.8501592Z =========================== short test summary info ============================ 2025-12-04T12:10:19.8502390Z FAILED [0.9003s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 
2025-12-04T12:10:19.8503075Z 2025-12-04T12:10:19.8503174Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.8503609Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8503948Z 2025-12-04T12:10:19.8504060Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.8504294Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.8504510Z ================== 1 failed, 82 deselected, 2 rerun in 4.86s =================== 2025-12-04T12:10:19.8504700Z Got exit code 1 2025-12-04T12:10:19.8504841Z Retrying single test... 2025-12-04T12:10:19.8505099Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e25151b5ca7f8e55.xml 2025-12-04T12:10:19.8505395Z ============================= test session starts ============================== 2025-12-04T12:10:19.8505671Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.8505922Z cachedir: .pytest_cache 2025-12-04T12:10:19.8506198Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.8506493Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.8506653Z configfile: pytest.ini 2025-12-04T12:10:19.8506934Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.8507267Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.8507693Z stepcurrent: skipping 82 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.8508084Z Running 1 items in this shard 2025-12-04T12:10:19.8508181Z 2025-12-04T12:10:19.8508562Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda [W1204 10:53:28.716190097 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.8508976Z 2025-12-04T12:10:19.8509337Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.8510087Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8510667Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8511418Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8512281Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8512876Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from 
static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8513435Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8513954Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8514494Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8515056Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8515621Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8518337Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8518918Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8519483Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8520041Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 
2025-12-04T12:10:19.8520655Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8521237Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8521800Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8522360Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8522917Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8523477Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8524038Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8524559Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8525077Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8525653Z E1204 
10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8526212Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8526734Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8527250Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8527807Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8528367Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8529028Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8529603Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8530192Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.8530714Z E1204 10:53:35.674000 609264 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.8531193Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.8531639Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.8532585Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp81x8f6iu/7m/c7mtveuav4puqt5dgzolc42kisdfqgojuryt7o7nr5gtd5i5zfds.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.8533383Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.8533850Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.8534329Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.8534888Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.8535417Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.8535915Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.8536439Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.8536939Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.8537462Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.8537998Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.8538509Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.8539029Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.8539648Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.8540388Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.8541127Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8541659Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8542403Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8543266Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8543856Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8544398Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8544916Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8545458Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8546019Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8546581Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8547162Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8547723Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8548283Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8548842Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8549402Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8549975Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8550587Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8551163Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8551725Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8552291Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8552852Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8553376Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8553898Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8554460Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8555022Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8555542Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8556058Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.8556615Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8557174Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8557746Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8558308Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8558840Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.8559361Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.8559837Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.8560335Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.8560730Z E1204 10:53:35.674000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.8561102Z [W1204 10:53:35.179578065 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.8561330Z 2025-12-04T12:10:19.8561513Z [W1204 10:53:35.181651895 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.8561734Z 2025-12-04T12:10:19.8562090Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.8562814Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8563343Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8564080Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8564937Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8565527Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8566070Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8566588Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8567125Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8567703Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8568264Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8568823Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8569386Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8569950Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8570572Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8571147Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8571718Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8572275Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8572835Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8573393Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8573952Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8574509Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8575026Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8575544Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8576103Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8576660Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8577175Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8577690Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8578263Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8578822Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8579382Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8579941Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8580545Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.8581082Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.8581578Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.8582034Z E1204 
10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.8582871Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp81x8f6iu/gq/cgq7i5rlvdrrvcdkbdnlm6vdtbqgbom5fqgk52pshhhcjvc26l7v.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.8583664Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.8584132Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.8584606Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.8585159Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.8585685Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.8586180Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.8586689Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.8587191Z E1204 10:53:35.719000 609264 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.8587716Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.8588254Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.8588779Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.8589300Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.8589885Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.8590571Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.8591305Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8591853Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8592607Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8593483Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8594072Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8594616Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8595134Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8595674Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8596235Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8596797Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8597359Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8597918Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8598476Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8599034Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8599607Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8600216Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8600777Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8601332Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8601893Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8602464Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8603052Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8603568Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8604085Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8604644Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8605203Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8605722Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8606240Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.8606799Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8607360Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8607921Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8608479Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8609023Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.8609541Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.8610030Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.8610513Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.8610894Z E1204 10:53:35.719000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.8611414Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.8612135Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8612676Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8613422Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8614295Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8614881Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8615419Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8615935Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8616471Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8617029Z E1204 10:53:35.721000 609264 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8617588Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8618151Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8618712Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8619270Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8619832Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8620448Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8621008Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8621565Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8622123Z E1204 10:53:35.721000 609264 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8622680Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8623259Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8623847Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8624365Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8624882Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8625446Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8626006Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8626522Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8627039Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8627599Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8628161Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8628721Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8629279Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8629810Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.8630369Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.8630863Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.8631307Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.8632153Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp81x8f6iu/g7/cg7aoufqkf5xir3qngizn4udlsv6ous2hprbqmfjppr4gvrrqyhv.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.8632949Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.8633430Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.8633918Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.8634492Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.8635019Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.8635513Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.8636014Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.8636514Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.8637036Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.8637574Z E1204 10:53:35.721000 609264 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.8638084Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.8638604Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.8639186Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.8639813Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.8640592Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8641122Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8641876Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8642737Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8643323Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8643863Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8644421Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8644980Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8645540Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8646102Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8646664Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8647223Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8647785Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8648344Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8648902Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8649460Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8650019Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8650658Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8651214Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8651794Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8652352Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8652869Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8653382Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8653938Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8654515Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8655054Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8655586Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.8656145Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8656702Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8657262Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8657824Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8658356Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.8658876Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.8659349Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.8659791Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.8660210Z E1204 10:53:35.721000 609264 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.8660455Z ('RERUN', {'yellow': True}) [10.0784s] [100%] 2025-12-04T12:10:19.8660944Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda [W1204 10:53:37.554174434 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.8661360Z 2025-12-04T12:10:19.8661538Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8662110Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.8662819Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8663347Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8664080Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8664953Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8665554Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8666107Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8666640Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8667182Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8667744Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8668307Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8668866Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8669424Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8669980Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8670579Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8671135Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8671690Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8672246Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8672792Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8673307Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8673826Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8674361Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8674919Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8675450Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8675978Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.8676550Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8677105Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8677621Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8678135Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8678660Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8679147Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8679660Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8680220Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8680710Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8681228Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8681784Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8682340Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8682917Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8683476Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8683999Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8684513Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8685029Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8685582Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8686154Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8686724Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8687282Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8687839Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8688399Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8688959Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8689517Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8690070Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8690670Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8691231Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8691794Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8692351Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8692907Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8693488Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8694046Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8694603Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8695162Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8695716Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8696290Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8696890Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8697445Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8697978Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8698484Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.8699021Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8699585Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8700307Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8700865Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8701423Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8701984Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8702548Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8703109Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8703667Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8704243Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8704800Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8705363Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8705892Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8706387Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8706918Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8707476Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8707989Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8708509Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8709050Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8709611Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8710185Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8710676Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8710931Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8711165Z E1204 10:53:37.107000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8711388Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8711646Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8711915Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8712169Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8712451Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8712706Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8712940Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8713254Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8713489Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8713777Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8714049Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8714303Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8714538Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8714774Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8715041Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8715299Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8715565Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8715820Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8716086Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8716346Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8716614Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8716870Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8717136Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8717406Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8717641Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8717868Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8718125Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8718362Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8718606Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8718853Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8719133Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8719390Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8719656Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8719913Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8720229Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8720488Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8720753Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8721011Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8721277Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8721538Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8721778Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8722012Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8722254Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8722503Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8722741Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:19.8723006Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8723265Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8723516Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8723770Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8724023Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8724291Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8724553Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8724819Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8725079Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8725345Z E1204 10:53:37.107000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8725604Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8725872Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8726131Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8726399Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8726657Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8726924Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8727193Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8727461Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8727719Z E1204 10:53:37.107000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8727984Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8728242Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8728518Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8728798Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8729062Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8729322Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8729558Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8729782Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8730039Z E1204 10:53:37.107000 609264 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8730400Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8730656Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8730921Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8731180Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8731448Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8731705Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8731970Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8732248Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8732514Z E1204 10:53:37.107000 609264 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8732771Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8732995Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8733252Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8733528Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8733810Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8734073Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8734332Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8734583Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8734822Z E1204 10:53:37.107000 609264 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8735057Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8735290Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8735555Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8735812Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8736062Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8736302Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8736534Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8736768Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8737045Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8737304Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8737540Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8737774Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8738000Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8738190Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.8738458Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8738692Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8738946Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8739207Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8739462Z E1204 10:53:37.107000 609264 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8739697Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8739918Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8740249Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8740476Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8740698Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8740953Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8741175Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8741428Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8741663Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8741918Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8742138Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8742394Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8742655Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8742909Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8743220Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8743484Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8743745Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8743999Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8744262Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8744515Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8744777Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8745032Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8745262Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8745483Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8745736Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8745983Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8746215Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:19.8746458Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8746691Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8746953Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8747208Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8747468Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8747720Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8747961Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8748225Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8748485Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8748843Z E1204 10:53:37.107000 609264 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8749105Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8749354Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8749585Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8749805Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8750022Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8750277Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8750485Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8750725Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8750970Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8751223Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8751470Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8751710Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8751917Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8752153Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8752400Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8752669Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8752927Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8753164Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8753370Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8753609Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8753854Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8754094Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8754338Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8754575Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8754806Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8755029Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8755246Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8755462Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8755718Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8755955Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8756164Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8756400Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8756645Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8756884Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8757150Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8757401Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8757631Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8757850Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8758070Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8758289Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8758535Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8758771Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8759014Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8759250Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8759494Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8759732Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8759936Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8760239Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8760483Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8760720Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8760962Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8761198Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8761440Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8761673Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8761902Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8762119Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8762363Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8762599Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8762842Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8763080Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8763321Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8763556Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8763799Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8764036Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8764277Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8764515Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8764740Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.8764959Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.8765166Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.8765376Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.8768646Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.8768882Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.8769136Z E1204 
10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.8769358Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.8769567Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.8769756Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.8769900Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:19.8770064Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.8770230Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.8770376Z E1204 10:53:37.107000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.8770549Z [W1204 10:53:37.572417928 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.8770552Z 2025-12-04T12:10:19.8770713Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8771029Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.8771339Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8771489Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8771985Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8772275Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8772518Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8772740Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8772955Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8773196Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8773446Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8773699Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8773948Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8774189Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8774422Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8774665Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8774899Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8775140Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8775373Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8775586Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8775809Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8776025Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8776267Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8776500Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8776716Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8776951Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8777195Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8777428Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8777631Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8777873Z E1204 
10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8778096Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8778319Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8778551Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8778762Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8778966Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8779198Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8779444Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8779677Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8779917Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8780183Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8780396Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8780619Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8780834Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8781091Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8781325Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8781567Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8781801Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8782042Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8782288Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8782543Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8782792Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8783033Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8783265Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8783506Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8783740Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8783982Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8784217Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8784461Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8784697Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8784940Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8785172Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8785413Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8785658Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8785901Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8786135Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8786350Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8786563Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.8786813Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8787060Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8787312Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8787544Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8787786Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8788019Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8788261Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8788493Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8788734Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8788970Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8789210Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8789444Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8789656Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8789860Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8790160Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8790372Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8790595Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.8790809Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8791050Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8791298Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8791528Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8791745Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8791978Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8792188Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8792391Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8792625Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8792866Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8793100Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8793342Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8793574Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8793790Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8794012Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8794226Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8794481Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8794718Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8794938Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8795151Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8795366Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8795621Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8795873Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8796127Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8796363Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8796607Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8796842Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8797086Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8797322Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8797564Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8797799Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8798015Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8798224Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8798465Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8798683Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8798908Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8799126Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8799370Z 
E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8799606Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8799848Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8800141Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8800400Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8800646Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8800890Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8801125Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8801371Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8801609Z 
E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8801826Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8802040Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8802246Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8802471Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8802688Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8802930Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8803166Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8803397Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8803613Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8803828Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8804071Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8804305Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8804561Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8804808Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8805061Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8805296Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8805539Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8805780Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8806027Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8806263Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8806504Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8806738Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8806982Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8807216Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8807459Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8807693Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8807946Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8808183Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8808425Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8808659Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8808872Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8809089Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8809333Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8809585Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8809820Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8810062Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8810332Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8810575Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8810810Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8811052Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8811286Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8811529Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8811765Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8811973Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8812206Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8812463Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8812701Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8812944Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8813178Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8813407Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8813646Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8813886Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8814102Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8814347Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8814582Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8814811Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8815030Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8815245Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8815460Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8815705Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8815939Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8816156Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8816369Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.8816576Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8816749Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.8816985Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8817191Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8817427Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8817668Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8817913Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8818136Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8818352Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8818585Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8818797Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8819002Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8819238Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8819443Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8819678Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8819882Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8820147Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8820352Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8820587Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8820828Z E1204 10:53:37.111000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8821062Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8821318Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8821554Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8821796Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8822032Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8822276Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8822538Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8822793Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8823027Z E1204 10:53:37.111000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8823240Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8823445Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8823680Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8823908Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8824124Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8824338Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8824555Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8824799Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8825033Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8825275Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8825523Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8825728Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8825964Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8826205Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8826439Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8826682Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8826940Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8827179Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8827395Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8827608Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8827815Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8828022Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8828257Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8828499Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8828734Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8828978Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8829218Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.8829426Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8829661Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8829905Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8830199Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8830444Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8830677Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8830882Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8831116Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8831371Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8831633Z E1204 10:53:37.111000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8831874Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8832109Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8832337Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8832556Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8832771Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8832986Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8833228Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8833463Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8833669Z E1204 10:53:37.111000 609264 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8833904Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8834147Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8834381Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8834634Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8834870Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8835097Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8835314Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8835528Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8835752Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8836012Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8836248Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8836492Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8836726Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8836970Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8837204Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8837409Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8837642Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8837885Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8838121Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8838363Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8838599Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8838836Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8839054Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8839269Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8839487Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8839730Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8839965Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8840268Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8840515Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8840757Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8840992Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8841238Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8841473Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8841718Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8841954Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8842167Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.8842386Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.8842593Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.8842804Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.8843033Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.8843254Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.8843482Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.8843689Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.8843897Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.8844084Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.8844226Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.8844387Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.8844529Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.8844673Z E1204 10:53:37.111000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.8844860Z [W1204 10:53:37.574913163 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.8844863Z 2025-12-04T12:10:19.8845022Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8845332Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.8845641Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8845788Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8846287Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8846556Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8846795Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8847018Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8847233Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8847475Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8847721Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8847965Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8848202Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8848441Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8848675Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8848926Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8849167Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8849420Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8849652Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8849864Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8850087Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8850316Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8850560Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8850792Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8850997Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8851230Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8851473Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8851706Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8851910Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8852157Z E1204 
10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8852371Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8852575Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8852809Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8853022Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8853224Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8853487Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8853740Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8853972Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8854213Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8854447Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8854661Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8854883Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8855098Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8855339Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8855573Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8855814Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8856047Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8856288Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8856528Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8856771Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8857005Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8857246Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8857479Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8857730Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8857973Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8858226Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8858460Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8858702Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8858936Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8859180Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8859413Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8859655Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8859889Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8860169Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8860404Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8860619Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8860830Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.8861085Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8861320Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8861560Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8861794Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8862036Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8862281Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8862542Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8862789Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8863030Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8863262Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8863506Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8863740Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8863952Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8864156Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8864389Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8864602Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8864826Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.8865040Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8865281Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8865523Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8865737Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8865939Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8866172Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8866381Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8866584Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8866839Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8867092Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8867326Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8867566Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8867801Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8868011Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8868234Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8868448Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8868692Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8868928Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8869145Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8869363Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8869578Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8869838Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8870074Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8870353Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8870588Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8870829Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8871078Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8871337Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8871584Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8871828Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8872062Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8872276Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8872482Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8872718Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8872935Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8873146Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8873363Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8873605Z 
E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8873840Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8874082Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8874331Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8874575Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8874810Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8875052Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8875285Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8875537Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8875784Z 
E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8876012Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8876225Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8876431Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8876659Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8876874Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8877118Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8877352Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8877570Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8877783Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8877998Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8878242Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8878476Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8878730Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8878967Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8879210Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8879445Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8879686Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8879940Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8880227Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8880475Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8880715Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8880952Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8881196Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8881430Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8881673Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8881906Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8882149Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8882385Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8882629Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8882864Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8883077Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8883296Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8883532Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8883776Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8884010Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8884253Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8884501Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8884755Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8885005Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8885246Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8885481Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8885723Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8885963Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8886169Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8886402Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8886646Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8886880Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8887123Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8887357Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8887583Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8887811Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8888025Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8888242Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8888485Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8888720Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8888957Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8889193Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8889406Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8889620Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8889864Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8890136Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8890354Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8890568Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.8890774Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8890937Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.8891173Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8891383Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8891618Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8891861Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8892110Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8892324Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8892530Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8892765Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8892979Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8893184Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8893432Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8893660Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8893894Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8894098Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8894334Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8894543Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8894781Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8895023Z E1204 10:53:37.114000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8895259Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8895502Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8895740Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8895982Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8896217Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8896457Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8896713Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8896957Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8897190Z E1204 10:53:37.114000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8897403Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8897607Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8897856Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8898109Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8898325Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8898539Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8898755Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8898999Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8899233Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8899475Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8899711Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8899917Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8900186Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8900428Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8900663Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8900903Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8901152Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8901380Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8901598Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8901811Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8902016Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8902233Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8902494Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8902736Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8902970Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8903213Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8903449Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.8903653Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8903890Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8904134Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8904369Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8904614Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8904854Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8905060Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8905294Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8905549Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8905783Z E1204 10:53:37.114000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8906027Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8906261Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8906489Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8906717Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8906948Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8907165Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8907410Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8907646Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8907854Z E1204 10:53:37.114000 609264 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8908088Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8908330Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8908563Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8908808Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8909044Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8909273Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8909493Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8909706Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8909933Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8910207Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8910442Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8910683Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8910919Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8911174Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8911437Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8911643Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8911877Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8912121Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8912357Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8912601Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8912835Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8913062Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8913281Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8913495Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8913711Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8913953Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8914190Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8914448Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8914683Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8914925Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8915159Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8915402Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8915658Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8915911Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8916147Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8916358Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.8916578Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.8916785Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.8916998Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.8917226Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.8917448Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.8917660Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.8917867Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.8918076Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.8918261Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.8918401Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.8918560Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.8918689Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.8918831Z E1204 10:53:37.114000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.8918903Z ('RERUN', {'yellow': True}) [1.2624s] [100%] 2025-12-04T12:10:19.8919256Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda [W1204 10:53:38.640263073 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.8919259Z 2025-12-04T12:10:19.8919417Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8919726Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.8920055Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8920248Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8920738Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler<std::shared_ptr<c10::LazyValue<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > const> (), c10::SetStackTraceFetcher(std::function<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8921010Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) from ??:0 2025-12-04T12:10:19.8921250Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8921472Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8921687Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8921928Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8922166Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8922409Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8922643Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8922884Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8923132Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8923376Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8923610Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8923853Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8924086Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8924310Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8924559Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8924788Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8925031Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8925263Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8925470Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8925705Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8925946Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8926180Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8926382Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8926616Z E1204 
10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8926827Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8927033Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8927268Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8927477Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8927691Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8927924Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8928166Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8928399Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8928648Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8928893Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8929129Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8929352Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8929568Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8929811Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8930045Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8930322Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8930555Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8930797Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8931033Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8931273Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8931510Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8931750Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8931998Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8932240Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8932473Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8932714Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8932948Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8933193Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8933455Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8933707Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8933940Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8934180Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8934413Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8934654Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8934887Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8935102Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8935312Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.8935555Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8935790Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8936032Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8936264Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8936515Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8936749Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8936990Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8937224Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8937463Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8937708Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8937960Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8938206Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8938417Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8938619Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8938854Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8939064Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8939288Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.8939501Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8939742Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8939978Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8940218Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8940422Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8940656Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8940882Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8941084Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8941319Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8941561Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8941793Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8942036Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8942292Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8942515Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8942738Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8942956Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8943202Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8943436Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8943653Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8943865Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8944080Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8944323Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8944558Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8944801Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8945035Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8945279Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8945524Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8945769Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8946004Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8946247Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8946483Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8946725Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8946941Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8947175Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8947393Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8947609Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8947825Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8948070Z 
E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8948304Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8948546Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8948782Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8949026Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8949260Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8949502Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8949748Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8949992Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8950262Z 
E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8950478Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8950691Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8950899Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8951153Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8951383Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8951626Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8951863Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8952080Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8952295Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8952510Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8952754Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8952988Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8953231Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8953467Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8953709Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8953945Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8954200Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8954435Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8954680Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8954915Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8955155Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8955399Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8955652Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8955897Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8956138Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8956373Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8956615Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8956850Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8957094Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8957329Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8957544Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8957749Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8957985Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8958226Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8958461Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8958713Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8958951Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8959193Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8959429Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8959672Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8959916Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8960209Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8960459Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8960664Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8960898Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8961141Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8961378Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8961619Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8961856Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8962086Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8962303Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8962517Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8962730Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8962973Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8963227Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8963457Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8963676Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8963887Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8964103Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8964359Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8964605Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8964835Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8965048Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.8965255Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8965420Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.8965657Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8965863Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8966099Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8966343Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8966582Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8966796Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8967001Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8967239Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8967450Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8967665Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8967901Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8968108Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8968342Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8968549Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8968796Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8969019Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8969253Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8969496Z E1204 10:53:38.179000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8969732Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8969976Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8970253Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8970498Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8970733Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8970977Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8971213Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8971457Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8971693Z E1204 10:53:38.179000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8971906Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8972125Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8972359Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8972589Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8972805Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8973019Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8973247Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8973504Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8973753Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8973993Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8974228Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8974433Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8974669Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8974910Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8975143Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8975389Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8975623Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8975851Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8976066Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8976281Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8976500Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.8976706Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8976943Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8977185Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8977421Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8977674Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8977920Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.8978143Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8978378Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8978621Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8978857Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8979100Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8979335Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8979539Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8979775Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8980020Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8980314Z E1204 10:53:38.179000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8980555Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8980789Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8981031Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8981249Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8981463Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8981678Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8981922Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8982169Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8982400Z E1204 10:53:38.179000 609264 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8982634Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8982876Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8983112Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8983355Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8983591Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8983818Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8984034Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8984246Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8984462Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8984706Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8984941Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8985183Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8985426Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8985669Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8985907Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8986113Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8986347Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8986599Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8986852Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8987093Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8987330Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8987558Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.8987777Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.8987991Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.8988205Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8988446Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8988681Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8988924Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8989161Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8989402Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8989639Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8989892Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8990163Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8990406Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8990641Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8990855Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.8991084Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.8991317Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.8991529Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.8991758Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.8991980Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.8992194Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.8992402Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.8992608Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.8992794Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.8992934Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.8993097Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.8993216Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.8993359Z E1204 10:53:38.179000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.8993530Z [W1204 10:53:38.642519681 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.8993532Z 2025-12-04T12:10:19.8993692Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.8994001Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.8994322Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.8994471Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.8994959Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.8995229Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.8995496Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.8995728Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.8995943Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8996183Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.8996420Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8996664Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8996900Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8997143Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8997376Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8997621Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8997853Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8998097Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8998328Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8998551Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.8998778Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.8998994Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.8999235Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.8999467Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.8999672Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.8999926Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9000212Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9000444Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9000646Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9000880Z E1204 
10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9001091Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9001296Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9001532Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9001744Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9001947Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9002179Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9002421Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9002653Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9002895Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9003143Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9003357Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9003580Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9003795Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9004037Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9004280Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9004549Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9004780Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9005022Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9005255Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9005497Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9005731Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9005972Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9006207Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9006449Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9006683Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9006925Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9007157Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9007409Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9007642Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9007884Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9008118Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9008364Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9008603Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9008865Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9009111Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9009329Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9009544Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.9009790Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9010027Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9010317Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9010550Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9010795Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9011030Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9011277Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9011516Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9011757Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9012008Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9012250Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9012491Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9012705Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9012908Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9013159Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9013385Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9013621Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.9013834Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9014077Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9014311Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9014522Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9014728Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9014962Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9015177Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9015379Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9015618Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9015864Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9016100Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9016346Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9016590Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9016805Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9017029Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9017248Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9017494Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9017742Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9017985Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9018202Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9018418Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9018663Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9018902Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9019147Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9019385Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9019634Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9019871Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9020157Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9020396Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9020645Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9020882Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9021114Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9021326Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9021564Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9021789Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9022003Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9022232Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9022499Z 
E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9022735Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9022984Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9023218Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9023464Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9023699Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9023942Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9024175Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9024421Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9024658Z 
E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9024874Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9025088Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9025295Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9025530Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9025744Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9025987Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9026222Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9026439Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9026667Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9026900Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9027144Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9027379Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9027625Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9027860Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9028103Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9028339Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9028581Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9028816Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9029059Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9029293Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9029534Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9029791Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9030037Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9030316Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9030559Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9030794Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9031049Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9031295Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9031550Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9031786Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9032002Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9032208Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9032447Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9032689Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9032923Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9033164Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9033400Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9033643Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9033879Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9034122Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9034372Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9034619Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9034855Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9035060Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9035294Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9035547Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9035790Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9036046Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9036280Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9036508Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9036726Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9036940Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9037157Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9037402Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9037636Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9037865Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9038083Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9038297Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9038512Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9038767Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9039004Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9039221Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9039437Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.9039644Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9039809Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.9040055Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9040334Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9040569Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9040810Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9041044Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9041259Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9041464Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9041698Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9041913Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9042117Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9042355Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9042561Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9042797Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9043004Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9043255Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9043461Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9043697Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9043939Z E1204 10:53:38.181000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9044177Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9044436Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9044685Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9044947Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9045182Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9045427Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9045663Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9045907Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9046142Z E1204 10:53:38.181000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9046358Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9046564Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9046802Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9047030Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9047247Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9047462Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9047687Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9047932Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9048168Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9048412Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9048646Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9048862Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9049109Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9049362Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9049596Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9049841Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9050076Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9050350Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9050566Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9050781Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9050988Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9051196Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9051435Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9051676Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9051910Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9052170Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9052407Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9052613Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9052847Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9053090Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9053338Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9053598Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9053848Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9054054Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9054289Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9054534Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9054769Z E1204 10:53:38.181000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9055013Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9055248Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9055477Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9055697Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9055911Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9056130Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9056377Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9056621Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9056827Z E1204 10:53:38.181000 609264 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9057061Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9057304Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9057537Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9057789Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9058032Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9058269Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9058489Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9058703Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9058919Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9059161Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9059395Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9059637Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9059871Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9060159Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9060394Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9060600Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9060837Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9061096Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9061334Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9061576Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9061814Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9062041Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9062273Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9062503Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9062731Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9062975Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9063212Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9063458Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9063694Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9063936Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9064170Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9064412Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9064648Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9064892Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9065129Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9065342Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.9065572Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.9065778Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.9065989Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.9066221Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.9066443Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.9066667Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.9066885Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.9067103Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.9067289Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.9067430Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.9067591Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.9067709Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.9067852Z E1204 10:53:38.181000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.9068025Z [W1204 10:53:38.644516352 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.9068027Z 2025-12-04T12:10:19.9068185Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.9068493Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.9068802Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9068949Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9069440Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9069707Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9069956Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9070219Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9070436Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9072276Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9072513Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9072777Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9073039Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9073280Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9073514Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9073756Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9073991Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9074235Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9074470Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9074683Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9074907Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9075124Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9075367Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9075600Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9075804Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9076055Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9076300Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9076534Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9076740Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9076974Z E1204 
10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9077197Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9077422Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9077656Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9077867Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9078072Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9078307Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9078551Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9078784Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9079028Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9079262Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9079476Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9079699Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9079915Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9080195Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9080442Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9080685Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9080917Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9081158Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9081391Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9081654Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9081912Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9082152Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9082386Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9082629Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9082863Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9083105Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9083337Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9083578Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9083813Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9084056Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9084290Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9084530Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9084764Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9085016Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9085251Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9085466Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9085676Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.9085916Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9086175Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9086427Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9086659Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9086899Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9087132Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9087375Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9087607Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9087850Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9088082Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9088324Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9088560Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9088772Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9088974Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9097390Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9097607Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9097833Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.9098047Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9098295Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9098532Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9098778Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9098997Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9099234Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9099449Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9099651Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9099886Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9100172Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9100408Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9100648Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9100889Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9101103Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9101325Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9101543Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9101788Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9102038Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9102256Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9102474Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9102690Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9102932Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9103182Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9103439Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9103697Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9103939Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9104175Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9104420Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9104656Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9104900Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9105135Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9105351Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9105557Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9105794Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9106012Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9106224Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9106449Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9106692Z 
E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9106928Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9107169Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9107405Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9107659Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9107914Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9108158Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9108392Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9108634Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9108869Z 
E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9109088Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9109302Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9109508Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9109733Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9109949Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9110228Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9110464Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9110682Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9110912Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9111128Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9111371Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9111605Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9111848Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9112095Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9112362Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9112597Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9112840Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9113075Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9113319Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9113556Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9113797Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9114031Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9114274Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9114508Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9114753Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9114989Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9115242Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9115480Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9115722Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9115957Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9116170Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9116376Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9116630Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9116884Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9117118Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9117360Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9117597Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9117838Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9118073Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9118315Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9118548Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9118791Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9119028Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9119233Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9119467Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9119720Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9119956Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9120218Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9120455Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9120683Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9120915Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9121148Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9121375Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9121621Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9121856Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9122086Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9122305Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9122519Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9122733Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9122976Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9123210Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9123427Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9123641Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.9123847Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9124012Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.9124259Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9124468Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9124703Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9124947Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9125181Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9125405Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9125630Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9125864Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9126077Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9126283Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9126520Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9126729Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9126964Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9127169Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9127403Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9127608Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9127845Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9128088Z E1204 10:53:38.183000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9128323Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9128578Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9128814Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9129056Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9129294Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9129536Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9129784Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9130036Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9130324Z E1204 10:53:38.183000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9130537Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9130741Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9130977Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9131205Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9131422Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9131637Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9131852Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9132096Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9132332Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9132575Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9132809Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9133026Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9133265Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9133507Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9133742Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9133985Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9134232Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9134472Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9134703Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9134916Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9135122Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9135329Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9135566Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9135810Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9136045Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9136289Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9136528Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9136732Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9136969Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9137210Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9137458Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9137702Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9137938Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9138143Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9138377Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9138637Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9138881Z E1204 10:53:38.183000 
609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9139138Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9139372Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9139598Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9139817Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9140032Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9140306Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9140550Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9140786Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9140992Z E1204 10:53:38.183000 609264 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9141231Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9141476Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9141709Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9141967Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9142202Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9142431Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9142648Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9142860Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9143089Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9143343Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9143595Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9143837Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9144073Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9144318Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9144554Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9144758Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9144991Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9145234Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9145469Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9145713Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9145949Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9146176Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9146405Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9146618Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9146835Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9147076Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9147311Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9147563Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9147809Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9148064Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9148299Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9148542Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9148776Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9149019Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9149253Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9149464Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.9149681Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.9149888Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.9150140Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.9150370Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.9150593Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.9150819Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.9151027Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.9151235Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.9151423Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.9151565Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.9151724Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.9151845Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.9152011Z E1204 10:53:38.183000 609264 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.9152085Z FAILED [1.0369s] [100%] 2025-12-04T12:10:19.9152087Z 2025-12-04T12:10:19.9152162Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9152322Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9152388Z Traceback (most recent call last): 2025-12-04T12:10:19.9152565Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9152628Z method(*args, **kwargs) 2025-12-04T12:10:19.9152794Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9152852Z method(*args, **kwargs) 2025-12-04T12:10:19.9153019Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9153075Z with policy(): 2025-12-04T12:10:19.9153243Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9153304Z raise RuntimeError(msg) 2025-12-04T12:10:19.9153719Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1075838976. 
2025-12-04T12:10:19.9153722Z 2025-12-04T12:10:19.9153819Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9154096Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9154100Z 2025-12-04T12:10:19.9154206Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9154308Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9154368Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9154445Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9155014Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9155152Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9155208Z graph_break [] 2025-12-04T12:10:19.9155291Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9155385Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9155890Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9155955Z current_size = base.storage().size() 2025-12-04T12:10:19.9156012Z Autotune Choices Stats: 2025-12-04T12:10:19.9156415Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00803999975323677, "best_triton_pos": 0} 2025-12-04T12:10:19.9156518Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9156586Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9156725Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9156977Z triton_mm_13 0.0080 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9157222Z triton_mm_7 0.0082 ms 98.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9157462Z triton_mm_14 0.0082 ms 98.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9157704Z triton_mm_12 0.0083 ms 97.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9157942Z triton_mm_17 0.0083 ms 96.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9158283Z triton_mm_8 0.0084 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9158522Z triton_mm_9 0.0084 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9158763Z triton_mm_10 0.0086 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9159002Z triton_mm_11 0.0086 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9159241Z triton_mm_15 0.0090 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9159404Z SingleProcess AUTOTUNE benchmarking takes 0.0875 seconds and 7.9710 seconds precompiling for 18 choices 2025-12-04T12:10:19.9159564Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9159628Z Traceback (most recent call last): 2025-12-04T12:10:19.9159802Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9159859Z method(*args, **kwargs) 
2025-12-04T12:10:19.9160029Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9160087Z method(*args, **kwargs) 2025-12-04T12:10:19.9160296Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9160351Z with policy(): 2025-12-04T12:10:19.9160535Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9160606Z raise RuntimeError(msg) 2025-12-04T12:10:19.9161012Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1145044992. 2025-12-04T12:10:19.9161031Z 2025-12-04T12:10:19.9161123Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9161399Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9161401Z 2025-12-04T12:10:19.9161504Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9161596Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9161659Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9161733Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9162300Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9162416Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9162469Z graph_break [] 2025-12-04T12:10:19.9162552Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9162643Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9163142Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9163208Z current_size = base.storage().size() 2025-12-04T12:10:19.9163264Z Autotune Choices Stats: 2025-12-04T12:10:19.9163646Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00803999975323677, "best_triton_pos": 0} 2025-12-04T12:10:19.9163747Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9163815Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9163953Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9164203Z triton_mm_13 0.0080 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:19.9164445Z triton_mm_7 0.0082 ms 98.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9164683Z triton_mm_14 0.0082 ms 98.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9164936Z triton_mm_12 0.0083 ms 97.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9165196Z triton_mm_17 0.0083 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9165433Z triton_mm_8 0.0084 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9165670Z triton_mm_9 0.0084 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9165909Z triton_mm_10 0.0086 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9166149Z triton_mm_11 0.0086 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9166387Z triton_mm_15 0.0090 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9166533Z SingleProcess AUTOTUNE benchmarking takes 0.0875 seconds and 7.9710 seconds precompiling for 18 choices 2025-12-04T12:10:19.9166622Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9166685Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9166759Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9166877Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9167378Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9167432Z graph_break [] 2025-12-04T12:10:19.9167511Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9167600Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9167995Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:19.9168107Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:19.9168165Z Autotune Choices Stats: 2025-12-04T12:10:19.9168540Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_28", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00788000039756298, "best_triton_pos": 0} 2025-12-04T12:10:19.9168620Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9168684Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9168822Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9169077Z triton_mm_28 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9169343Z triton_mm_34 0.0079 ms 99.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9169580Z triton_mm_37 0.0080 ms 98.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9169819Z triton_mm_31 0.0081 ms 97.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9170056Z triton_mm_33 0.0081 ms 97.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9170335Z triton_mm_27 0.0082 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9170574Z triton_mm_35 0.0083 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9170813Z triton_mm_29 0.0087 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9171053Z triton_mm_36 0.0094 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9171295Z triton_mm_25 0.0095 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9171439Z SingleProcess AUTOTUNE benchmarking takes 0.1648 seconds and 0.4680 seconds precompiling for 21 choices 2025-12-04T12:10:19.9171509Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9171666Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9171730Z Traceback (most recent call last): 2025-12-04T12:10:19.9171915Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in 
wrapper 2025-12-04T12:10:19.9171975Z method(*args, **kwargs) 2025-12-04T12:10:19.9172143Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9172201Z method(*args, **kwargs) 2025-12-04T12:10:19.9172367Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9172420Z with policy(): 2025-12-04T12:10:19.9172588Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9172645Z raise RuntimeError(msg) 2025-12-04T12:10:19.9173071Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 
2025-12-04T12:10:19.9173086Z 2025-12-04T12:10:19.9173177Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9173471Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9173473Z 2025-12-04T12:10:19.9173577Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9173668Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9173726Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9173800Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9174371Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9174488Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9174543Z graph_break [] 2025-12-04T12:10:19.9174622Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9174712Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9175212Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9175278Z current_size = base.storage().size() 2025-12-04T12:10:19.9175334Z Autotune Choices Stats: 2025-12-04T12:10:19.9175715Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00803999975323677, "best_triton_pos": 0} 2025-12-04T12:10:19.9175796Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9175860Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9175998Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9176259Z triton_mm_13 0.0080 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9176504Z triton_mm_7 0.0082 ms 98.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9176742Z triton_mm_14 0.0082 ms 98.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9176982Z triton_mm_12 0.0083 ms 97.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9177227Z triton_mm_17 0.0083 ms 96.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9177475Z triton_mm_8 0.0084 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9177725Z triton_mm_9 0.0084 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9177961Z triton_mm_10 0.0086 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9178203Z triton_mm_11 0.0086 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9178443Z triton_mm_15 0.0090 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9178588Z SingleProcess AUTOTUNE benchmarking takes 0.0875 seconds and 7.9710 seconds precompiling for 18 choices 2025-12-04T12:10:19.9178679Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9178737Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9178810Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9178924Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9179427Z inductor 
[('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9179481Z graph_break [] 2025-12-04T12:10:19.9179559Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9179649Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9180030Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:19.9180174Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:19.9180233Z Autotune Choices Stats: 2025-12-04T12:10:19.9180622Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_28", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00788000039756298, "best_triton_pos": 0} 2025-12-04T12:10:19.9180704Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9180769Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9180904Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9181147Z triton_mm_28 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9181398Z triton_mm_34 0.0079 ms 99.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9181648Z triton_mm_37 0.0080 ms 98.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9181904Z triton_mm_31 0.0081 ms 97.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9182140Z triton_mm_33 0.0081 ms 97.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9182380Z triton_mm_27 0.0082 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9182618Z triton_mm_35 0.0083 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9182857Z triton_mm_29 0.0087 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9183093Z triton_mm_36 0.0094 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9183334Z triton_mm_25 0.0095 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9183481Z SingleProcess AUTOTUNE benchmarking takes 0.1648 seconds and 0.4680 seconds precompiling for 21 choices 2025-12-04T12:10:19.9183571Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9183630Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9183702Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9183817Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9184322Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9184379Z graph_break [] 2025-12-04T12:10:19.9184456Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9184547Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9184604Z Autotune Choices Stats: 2025-12-04T12:10:19.9184980Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_48", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0077599999494850636, "best_triton_pos": 0} 
2025-12-04T12:10:19.9185060Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9185124Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9185260Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9185535Z triton_mm_48 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9185787Z triton_mm_54 0.0078 ms 99.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9186022Z triton_mm_53 0.0080 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9186261Z triton_mm_58 0.0080 ms 97.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9186500Z triton_mm_52 0.0080 ms 96.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9186741Z triton_mm_47 0.0082 ms 94.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9186980Z triton_mm_55 0.0085 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9187219Z triton_mm_50 0.0086 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9187458Z triton_mm_57 0.0087 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9187700Z triton_mm_45 0.0098 ms 78.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9187845Z SingleProcess AUTOTUNE benchmarking takes 0.1315 seconds and 0.3231 seconds precompiling for 21 choices 2025-12-04T12:10:19.9188052Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e25151b5ca7f8e55.xml - 2025-12-04T12:10:19.9188127Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9188740Z FAILED [1.0369s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 
2025-12-04T12:10:19.9188745Z 2025-12-04T12:10:19.9188835Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9189108Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9189110Z 2025-12-04T12:10:19.9189214Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9189295Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.9189394Z ================= 1 failed, 187 deselected, 2 rerun in 12.40s ================== 2025-12-04T12:10:19.9189459Z Got exit code 1 2025-12-04T12:10:19.9189532Z Retrying single test... 2025-12-04T12:10:19.9189692Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7c0a8b2a94185468.xml 2025-12-04T12:10:19.9189766Z ============================= test session starts ============================== 2025-12-04T12:10:19.9189893Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9189954Z cachedir: .pytest_cache 2025-12-04T12:10:19.9190169Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9190235Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9190293Z configfile: pytest.ini 2025-12-04T12:10:19.9190477Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9190569Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.9190839Z stepcurrent: skipping 82 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9190899Z Running 1 items in this shard 2025-12-04T12:10:19.9190901Z 2025-12-04T12:10:19.9191245Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda [W1204 10:53:47.149919053 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.9191247Z 2025-12-04T12:10:19.9191578Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.9191890Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9192040Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9192539Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9192824Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9193067Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from 
static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9193292Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9193509Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9193753Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9194002Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9194269Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9194504Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9194746Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9194981Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9195225Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 
2025-12-04T12:10:19.9195460Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9195700Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9195932Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9196174Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9196409Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9196653Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9196888Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9197101Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9197338Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9197580Z E1204 
10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9197812Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9198016Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9198249Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9198508Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9198750Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9198993Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9199224Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9199444Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.9199672Z E1204 10:53:55.387000 614520 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.9199846Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.9200041Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.9200615Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmprc4kmb68/7m/c7mtveuav4puqt5dgzolc42kisdfqgojuryt7o7nr5gtd5i5zfds.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.9200778Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.9201010Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.9201182Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.9201486Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.9201651Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.9201924Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.9202079Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.9202351Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.9202523Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.9202808Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.9202988Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.9203290Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.9203500Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.9203833Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.9204145Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9204293Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9204787Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9205055Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9205295Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9205517Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9205733Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9205975Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9206222Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9206466Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9206702Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9206943Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9207176Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9207426Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9207668Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9207920Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9208150Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9208392Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9208625Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9208867Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9209101Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9209304Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9209538Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9209781Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9210015Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9210253Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9210488Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9210744Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9210980Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9211225Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9211457Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9211674Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.9211912Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.9212100Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.9212306Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.9212425Z E1204 10:53:55.387000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.9212598Z [W1204 10:53:55.857289922 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:19.9212601Z 2025-12-04T12:10:19.9212769Z [W1204 10:53:55.868663720 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.9212771Z 2025-12-04T12:10:19.9213101Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.9213410Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9213557Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9214049Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9214317Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9214558Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9214777Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9215002Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9215247Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9215483Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9215726Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9215960Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9216210Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9216456Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9216708Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9216940Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9217182Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9217416Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9217657Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9217892Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9218134Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9218371Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9218575Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9218809Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9219052Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9219285Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9219506Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9219739Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9219981Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9220261Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9220502Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9220752Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9220980Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.9221219Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.9221392Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.9221587Z E1204 
10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.9222130Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmprc4kmb68/gq/cgq7i5rlvdrrvcdkbdnlm6vdtbqgbom5fqgk52pshhhcjvc26l7v.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.9222293Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.9222522Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.9222692Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.9222995Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.9223145Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.9223415Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.9223569Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.9223837Z E1204 10:53:55.396000 614520 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.9224021Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.9224304Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.9224456Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.9224745Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.9224954Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.9225294Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.9225612Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9225766Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9226256Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9226524Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9226766Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9226986Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9227203Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9227445Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9227683Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9227928Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9228161Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9228401Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9228646Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9228890Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9229122Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9229364Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9229598Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9229849Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9230147Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9230388Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9230621Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9230826Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9231061Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9231304Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9231537Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9231741Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9231975Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9232217Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9232449Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9232693Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9232925Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9233156Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.9233382Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.9233556Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.9233751Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.9233869Z E1204 10:53:55.396000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.9234207Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.9234527Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9234691Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9235182Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9235449Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9235690Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9235909Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9236123Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9236366Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9236601Z E1204 10:53:55.408000 614520 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9236843Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9237075Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9237318Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9237561Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9237802Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9238038Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9238277Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9238511Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9238762Z E1204 10:53:55.408000 614520 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9239014Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9239256Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9239487Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9239693Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9239927Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9240206Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9240438Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9240642Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9240876Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9241117Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9241350Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9241594Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9241827Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9242059Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.9242286Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.9242460Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.9242653Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.9243206Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmprc4kmb68/g7/cg7aoufqkf5xir3qngizn4udlsv6ous2hprbqmfjppr4gvrrqyhv.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:19.9243381Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:19.9243624Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:19.9243795Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:19.9244095Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:19.9244243Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:19.9244515Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:19.9244669Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:19.9244937Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:19.9245109Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:19.9245394Z E1204 10:53:55.408000 614520 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:19.9245545Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:19.9245836Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:19.9246042Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:19.9246368Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.9246683Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9246832Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9247327Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9247594Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9247856Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9248085Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9248300Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9248542Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9248782Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9249025Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9249260Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9249502Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9249734Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9249976Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9250251Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9250491Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9250727Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9250981Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9251217Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9251461Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9251696Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9251903Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9252148Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9252409Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9252655Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9252860Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9253091Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9253335Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9253570Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9253811Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9254047Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9254264Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:19.9254492Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:19.9254668Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:19.9254865Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:19.9254982Z E1204 10:53:55.408000 614520 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:19.9255050Z ('RERUN', {'yellow': True}) [10.4512s] [100%] 2025-12-04T12:10:19.9255406Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda [W1204 10:53:56.189096352 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.9255410Z 2025-12-04T12:10:19.9255572Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.9255883Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:19.9256189Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9256334Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9256836Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9257121Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9257360Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9257579Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9257795Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9258037Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9258270Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9258513Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9258747Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9258991Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9259223Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9259464Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9259698Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9259950Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9260230Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9260445Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9260670Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9260884Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9261139Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9261386Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9261609Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9261845Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9262087Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9262321Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9262525Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9262757Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9262969Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9263172Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9263406Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9263620Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9263825Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9264056Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9264310Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9264546Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9264786Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9265018Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9265229Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9265462Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9265685Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9265939Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9266175Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9266416Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9266652Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9266893Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9267135Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9267376Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9267611Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9267852Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9268086Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9268328Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9268560Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9268812Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9269046Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9269289Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9269522Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9269763Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9270017Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9270299Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9270548Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9270792Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9271032Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9271250Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9271461Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.9271705Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9271937Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9272180Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9272413Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9272656Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9272890Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9273131Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9273379Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9273621Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9273855Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9274096Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9274328Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9274551Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9274775Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9275008Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9275219Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9275444Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9275661Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9275905Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9276138Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9276348Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9276554Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9276788Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9277000Z E1204 10:53:56.738000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9277202Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9277436Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9277688Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9277923Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9278168Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9278399Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9278613Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9278845Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9279071Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9279329Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9279562Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9279780Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9279996Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9280249Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9280494Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9280731Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9280975Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9281209Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9281453Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9281689Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9281934Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9282186Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9282430Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9282667Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9282881Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9283088Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9283336Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9283565Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9283791Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9284005Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9284249Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9284484Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9284727Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9284964Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9285208Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9285444Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9285685Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9285924Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9286164Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9286399Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9286637Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9286851Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9287059Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9287284Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9287503Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:19.9287755Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9288001Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9288229Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9288443Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9288658Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9288901Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9289137Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9289380Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9289618Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9289863Z E1204 10:53:56.738000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9290148Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9290392Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9290625Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9290867Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9291114Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9291359Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9291595Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9291837Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9292074Z E1204 10:53:56.738000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9292331Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9292580Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9292835Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9293071Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9293314Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9293549Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9293764Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9293968Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9294204Z E1204 10:53:56.738000 614520 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9294446Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9294684Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9294928Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9295162Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9295405Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9295651Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9295895Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9296130Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9296376Z E1204 10:53:56.738000 614520 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9296611Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9296825Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9297083Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9297325Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9297558Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9297800Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9298035Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9298265Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9298482Z E1204 10:53:56.738000 614520 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9298698Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9298916Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9299163Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9299401Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9299630Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9299846Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9300070Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9300328Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9300571Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9300805Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9301021Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9301250Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9301489Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9301651Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.9301888Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9302092Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9302329Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9302572Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9302807Z E1204 10:53:56.738000 614520 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9303021Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9303226Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9303466Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9303679Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9303883Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9304120Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9304339Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9304576Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9304780Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9305015Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9305219Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9305457Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9305720Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9305966Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9306213Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9306451Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9306696Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9306933Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9307176Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9307409Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9307653Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9307891Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9308106Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9308311Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9308545Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9308786Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9309004Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:19.9309219Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9309435Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9309679Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9309917Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9310237Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9310485Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9310690Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9310927Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9311174Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9311411Z E1204 10:53:56.738000 614520 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9311656Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9311889Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9312117Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9312334Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9312552Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9312763Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9312967Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9313203Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9313459Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9313699Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9313941Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9314178Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9314385Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9314646Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9314899Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9315131Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9315372Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9315608Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9315814Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9316052Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9316294Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9316530Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9316772Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9317010Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9317239Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9317455Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9317681Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9317900Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9318145Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9318383Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9318589Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9318827Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9319101Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9319347Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9319588Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9319823Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9320050Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9320299Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9320514Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9320730Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9320977Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9321211Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9321531Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9321768Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9322015Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9322265Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9322470Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9322709Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9322953Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9323191Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9323448Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9323696Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9323939Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9324156Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9324371Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9324588Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9324831Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9325065Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9325309Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9325546Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9325790Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9326028Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9326269Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9326505Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9326757Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9326994Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9327207Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.9327425Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.9327629Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.9327843Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.9328094Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.9328326Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.9328539Z E1204 
10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.9328746Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.9328955Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.9329145Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.9329288Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:19.9329452Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.9329571Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.9329714Z E1204 10:53:56.738000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.9329885Z [W1204 10:53:56.203307613 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.9329889Z 2025-12-04T12:10:19.9330048Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.9330390Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.9330697Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9330842Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9331352Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9331621Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9331862Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9332083Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9332313Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9332569Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9332824Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9333067Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9333302Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9333546Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9333781Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9334038Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9334273Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9334517Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9334750Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9334968Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9335198Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9335411Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9335665Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9335899Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9336106Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9336338Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9336579Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9336829Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9337040Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9337285Z E1204 
10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9337498Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9337700Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9337935Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9338149Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9338357Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9338590Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9338835Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9339069Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9339310Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9339543Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9339756Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9339990Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9340243Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9340486Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9340717Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9340959Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9341193Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9341465Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9341709Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9341949Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9342182Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9342425Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9342659Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9342902Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9343133Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9343376Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9343609Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9343852Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9344084Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9344325Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9344575Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9344817Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9345052Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9345292Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9345527Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9345753Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9345978Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.9346231Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9346462Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9346705Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9346938Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9347181Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9347413Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9347659Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9347893Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9348134Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9348367Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9348607Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9348840Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9349062Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9349267Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9349502Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9349713Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9349936Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.9350198Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9350454Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9350700Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9350912Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9351115Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9351347Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9351560Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9351766Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9351999Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9352241Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9352476Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9352718Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9352950Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9353163Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9353405Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9353622Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9353870Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9354107Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9354327Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9354541Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9354777Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9355030Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9355265Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9355507Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9355743Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9355987Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9356222Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9356467Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9356702Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9356946Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9357181Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9357396Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9357600Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9357846Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9358065Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9358279Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9358495Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9358743Z 
E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9358990Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9359243Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9359491Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9359734Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9359968Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9360244Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9360480Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9360722Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9360957Z 
E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9361174Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9361389Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9361598Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9361823Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9362039Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9362297Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9362534Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9362751Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9362963Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9363177Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9363421Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9363685Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9363940Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9364175Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9364420Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9364656Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9364898Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9365134Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9365376Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9365610Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9365854Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9366092Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9366335Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9366570Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9366824Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9367061Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9367304Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9367539Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9367781Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9368030Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9368256Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9368482Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9368718Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9368961Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9369197Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9369439Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9369673Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9369915Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9370185Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9370428Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9370666Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9370910Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9371143Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9374845Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9375095Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9375342Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9375578Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9375820Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9376070Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9376324Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9376544Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9376760Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9376979Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9377224Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9377460Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9377689Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9377906Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9378124Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9378342Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9378585Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9378820Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9379037Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9379261Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.9379471Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9379639Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.9379875Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9380081Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9380368Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9380622Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9380871Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9381083Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9381289Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9381529Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9381745Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9381950Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9382184Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9382389Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9382626Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9382832Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9383067Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9383270Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9383504Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9383761Z E1204 10:53:56.742000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9383997Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9384239Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9384477Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9384722Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9384976Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9385230Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9385464Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9385707Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9385945Z E1204 10:53:56.742000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9386162Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9386367Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9386602Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9386829Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9387048Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9387264Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9387479Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9387722Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9387956Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9388208Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9388446Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9388651Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9388886Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9389130Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9389392Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9389646Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9389878Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9390145Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9390363Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9390580Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9390787Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9390993Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9391228Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9391472Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9391710Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9391952Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9392186Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9392390Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9392640Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9392886Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9393122Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9393366Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9393600Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9393831Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9394079Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9394320Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9394553Z E1204 10:53:56.742000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9394795Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9395029Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9395257Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9395474Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9395688Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9395904Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9396149Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9396383Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9396587Z E1204 10:53:56.742000 614520 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9396819Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9397077Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9397312Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9397556Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9397791Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9398020Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9398261Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9398486Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9398702Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9398942Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9399178Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9399421Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9399655Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9399896Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9400173Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9400382Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9400617Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9400858Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9401092Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9401346Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9401581Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9401808Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9402024Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9402237Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9402451Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9402708Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9402966Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9403209Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9403442Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9403688Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9403922Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9404163Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9404398Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9404638Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9404875Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9405093Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.9405310Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.9405514Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.9405724Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.9405964Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.9406187Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.9406400Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.9406606Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.9406814Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.9407019Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.9407171Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.9407345Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.9407464Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.9407607Z E1204 10:53:56.742000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.9407781Z [W1204 10:53:56.205708209 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.9407784Z 2025-12-04T12:10:19.9407948Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.9408263Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.9408575Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9408721Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9409219Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9409491Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9409731Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9409952Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9410208Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9410451Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9410688Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9410929Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9411162Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9411418Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9411667Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9411924Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9412156Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9412398Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9412632Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9412846Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9413068Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9413284Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9413528Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9413760Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9413968Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9414200Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9414440Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9414682Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9414888Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9415122Z E1204 
10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9415334Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9415537Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9415770Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9416005Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9416217Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9416449Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9416690Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9416923Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9417165Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9417398Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9417611Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9417834Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9418049Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9418292Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9418524Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9418764Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9419006Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9419248Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9419482Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9419725Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9419960Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9420239Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9420499Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9420753Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9420986Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9421226Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9421460Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9421702Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9421936Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9422175Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9422408Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9422650Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9422883Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9423125Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9423359Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9423593Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9423807Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.9424049Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9424282Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9424522Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9424765Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9425016Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9425260Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9425501Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9425733Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9425977Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9426212Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9426452Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9426686Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9426899Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9427104Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9427338Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9427550Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9427773Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.9427997Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9428240Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9428472Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9428682Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9428883Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9429127Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9429346Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9429558Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9429791Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9430033Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9430298Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9430541Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9430774Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9430985Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9431206Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9431422Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9431670Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9431907Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9432123Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9432351Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9432568Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9432812Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9433047Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9433288Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9433522Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9433790Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9434040Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9434283Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9434516Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9434759Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9434993Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9435208Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9435412Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9435649Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9435868Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9436083Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9436300Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9436542Z 
E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9436788Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9437033Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9437268Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9437509Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9437743Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9437997Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9438243Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9438505Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9438739Z 
E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9438955Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9439169Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9439375Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9439601Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9439815Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9440058Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9440326Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9440542Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9440757Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9440970Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9441225Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9441470Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9441716Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9441950Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9442195Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9442432Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9442697Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9442946Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9443187Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9443421Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9443662Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9443897Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9444141Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9444375Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9444618Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9444852Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9445096Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9445330Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9445571Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9445817Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9446031Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9446239Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9446473Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9446715Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9446960Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9447211Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9447455Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9447695Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9447930Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9448174Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9448411Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9448652Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9448885Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9449092Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9449327Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9449570Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9449803Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9450045Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9450343Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9450571Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9450790Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9451002Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9451220Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9451473Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9451722Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9451963Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9452179Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9452395Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9452612Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9452858Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9453091Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9453308Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9453523Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.9453730Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9453895Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.9454129Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9454336Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9454581Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9454825Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9455061Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9455273Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9455477Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9455713Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9455952Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9456170Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9456407Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9456612Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9456846Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9457052Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9457289Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9457493Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9457727Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9457972Z E1204 10:53:56.744000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9458209Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9458453Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9458687Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9458929Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9459176Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9459419Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9459654Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9459897Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9460188Z E1204 10:53:56.744000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9460432Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9460648Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9460882Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9461109Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9461327Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9461546Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9461763Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9462007Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9462243Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9462490Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9462726Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9462931Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9463166Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9463408Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9463656Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9463898Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9464133Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9464361Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9464579Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9464802Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9465030Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9465237Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9465472Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9465718Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9465956Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9466198Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9466433Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9466637Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9466873Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9467119Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9467354Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9467596Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9467829Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9468044Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9468278Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9468524Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9468758Z E1204 10:53:56.744000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9469000Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9469244Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9469492Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9469709Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9469922Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9470161Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9470405Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9470644Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9470850Z E1204 10:53:56.744000 614520 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9471083Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9471326Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9471561Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9471802Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9472035Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9472262Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9472495Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9472710Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9472926Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9473167Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9473403Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9473666Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9473923Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9474165Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9474398Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9474604Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9474842Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9475085Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9475318Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9475559Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9475795Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9476021Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9476239Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9476452Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9476669Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9476923Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9477158Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9477401Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9477634Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9477876Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9478129Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9478383Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9478618Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9478860Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9479096Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9479310Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.9479531Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.9479735Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.9479945Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.9480208Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.9480431Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.9480646Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.9480854Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.9481060Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.9481259Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.9481403Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.9481567Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.9481686Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.9481826Z E1204 10:53:56.744000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.9481896Z ('RERUN', {'yellow': True}) [1.2177s] [100%] 2025-12-04T12:10:19.9482243Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda [W1204 10:53:57.231578931 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.9482246Z 2025-12-04T12:10:19.9482416Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.9482750Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.9483059Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9483205Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9483698Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9483967Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9484206Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9484425Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9484640Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9484885Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9485123Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9485372Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9485615Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9485858Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9486092Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9486333Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9486566Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9486816Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9487060Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9487282Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9487504Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9487718Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9487960Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9488194Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9488397Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9488630Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9488869Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9489105Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9489308Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9489540Z E1204 
10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9489752Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9489952Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9490238Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9490450Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9490655Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9490887Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9491132Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9491377Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9491641Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9491876Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9492087Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9492312Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9492526Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9492769Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9493003Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9493243Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9493477Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9493718Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9493952Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9494191Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9494423Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9494675Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9494910Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9495152Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9495383Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9495624Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9495875Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9496127Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9496360Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9496601Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9496836Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9497078Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9497312Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9497551Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9497784Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9498002Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9498214Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.9498454Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9498686Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9498939Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9499174Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9499416Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9499648Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9499888Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9500168Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9500420Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9500666Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9500906Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9501140Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9501355Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9501558Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9501792Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9502003Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9502225Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.9502440Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9502683Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9502915Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9503127Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9503348Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9503583Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9503795Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9503995Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9504228Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9504469Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9504724Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9504982Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9505213Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9505427Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9505652Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9505868Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9506114Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9506348Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9506566Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9506781Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9506997Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9507241Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9507482Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9507724Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9507969Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9508216Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9508449Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9508693Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9508929Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9509191Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9509438Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9509650Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9509855Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9510114Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9510333Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9510550Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9510763Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9511005Z 
E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9511240Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9511485Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9511719Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9511962Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9512196Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9512453Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9512689Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9512930Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9513164Z 
E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9513383Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9513625Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9513844Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9514067Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9514282Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9514525Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9514761Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9514978Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9515193Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9515408Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9515651Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9515886Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9516128Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9516363Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9516604Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9516851Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9517094Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9517329Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9517572Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9517808Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9518071Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9518316Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9518557Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9518791Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9519034Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9519269Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9519512Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9519749Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9519992Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9520265Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9520482Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9520687Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9520921Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9521179Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9521416Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9521662Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9521895Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9522138Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9522385Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9522645Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9522892Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9523135Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9523370Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9523575Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9523814Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9524055Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9524290Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9524532Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9524768Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9524997Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9525215Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9525429Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9525658Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9525904Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9526138Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9526365Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9526581Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9526802Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9527026Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9527281Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9527516Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9527733Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9527948Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.9528155Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9528319Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.9528556Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9528761Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9528998Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9529241Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9529476Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9529690Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9529894Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9530174Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9530386Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9530591Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9530826Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9531032Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9531285Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9531517Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9531753Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9531957Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9532194Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9532437Z E1204 10:53:57.770000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9532673Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9532916Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9533150Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9533394Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9533629Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9533874Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9534110Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9534353Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9534599Z E1204 10:53:57.770000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9534812Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9535017Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9535251Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9535481Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9535707Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9535948Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9536163Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9536404Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9536641Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9536883Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9537121Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9537324Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9537559Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9537803Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9538040Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9538284Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9538517Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9538745Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9538972Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9539187Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9539394Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9539598Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9539834Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9540145Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9540399Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9540654Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9540889Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9541095Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9541329Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9541572Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9541806Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9542050Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9542286Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9542490Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9542726Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9542967Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9543202Z E1204 10:53:57.770000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9543457Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9543695Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9543924Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9544143Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9544357Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9544584Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9544838Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9545085Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9545291Z E1204 10:53:57.770000 614520 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9545529Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9545771Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9546007Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9546250Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9546484Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9546712Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9546930Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9547146Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9547361Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9547603Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9547848Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9548092Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9548326Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9548568Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9548802Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9549016Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9549275Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9549516Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9549750Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9549994Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9550276Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9550504Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9550720Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9550934Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9551150Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9551397Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9551631Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9551874Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9552109Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9552367Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9552602Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9552845Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9553079Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9553319Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9553567Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9553803Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.9554019Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.9554226Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.9554436Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.9554665Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.9554889Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.9555102Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.9555309Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.9555517Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.9555704Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.9555846Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.9556007Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.9556123Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.9556265Z E1204 10:53:57.770000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.9556436Z [W1204 10:53:57.233859909 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.9556439Z 2025-12-04T12:10:19.9556616Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.9556927Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.9557234Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9557379Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9557879Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9558167Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9558409Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9558627Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9558844Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9559086Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9559322Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9559564Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9559799Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9560042Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9560320Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9560562Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9560794Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9561047Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9561280Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9561498Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9561721Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9561935Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9562176Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9562433Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9562652Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9562884Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9563125Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9563358Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9563561Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9563795Z E1204 
10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9564006Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9564208Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9564443Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9564656Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9564860Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9565091Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9565336Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9565578Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9565822Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9566055Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9566267Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9566491Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9566715Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9566967Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9567209Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9567450Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9567683Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9567926Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9568160Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9568402Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9568636Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9568877Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9569111Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9569353Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9569584Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9569824Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9570066Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9570344Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9570576Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9570819Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9571052Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9571321Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9571577Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9571817Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9572052Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9572268Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9572483Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.9572729Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9572961Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9573202Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9573438Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9573679Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9573913Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9574154Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9574400Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9574644Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9574879Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9575118Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9575350Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9575573Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9575786Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9576030Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9576240Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9576463Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.9576679Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9576923Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9577155Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9577366Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9577568Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9577800Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9578013Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9578215Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9578448Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9578692Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9578935Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9579177Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9579410Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9579621Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9579844Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9580069Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9580372Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9580608Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9580826Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9581039Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9581255Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9581498Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9581732Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9581973Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9582209Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9582451Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9582687Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9582931Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9583164Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9583422Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9583658Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9583871Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9584075Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9584311Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9584540Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9584778Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9584993Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9585235Z 
E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9585474Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9585719Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9585956Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9586198Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9586430Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9586673Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9586912Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9587154Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9587388Z 
E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9587604Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9587828Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9588036Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9588261Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9588475Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9588718Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9588965Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9589205Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9589418Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9589633Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9589877Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9590144Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9590389Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9590624Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9590867Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9591106Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9591347Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9591584Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9591825Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9592059Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9592315Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9592550Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9592792Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9593026Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9593270Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9593530Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9593784Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9594020Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9594260Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9594497Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9594712Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9594922Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9595160Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9595401Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9595638Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9595879Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9596115Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9596358Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9596603Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9596849Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9597085Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9597329Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9597561Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9597786Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9598033Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9598286Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9598521Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9598762Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9598997Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9599227Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9599444Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9599658Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9599874Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9600149Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9600384Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9600612Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9600829Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9601056Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9601279Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9601521Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9601756Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9601971Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9602197Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.9602416Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9602593Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.9602829Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9603033Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9603272Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9603515Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9603752Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9603966Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9604172Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9604407Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9604620Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9604827Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9605064Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9605270Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9605516Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9605722Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9605957Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9606160Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9606395Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9606654Z E1204 10:53:57.773000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9606900Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9607157Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9607395Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9607638Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9607873Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9608116Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9608349Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9608591Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9608826Z E1204 10:53:57.773000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9609042Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9609251Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9609486Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9609714Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9609941Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9610198Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9610414Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9610657Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9610893Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9611148Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9611399Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9611618Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9611852Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9612096Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9612331Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9612575Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9612809Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9613037Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9613254Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9613472Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9613679Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9613884Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9614120Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9614374Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9614611Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9614854Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9615090Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9615293Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9615542Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9615795Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9616040Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9616285Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9616520Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9616729Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9616963Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9617205Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9617442Z E1204 10:53:57.773000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9617684Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9617920Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9618148Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9618365Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9618579Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9618805Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9619048Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9619283Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9619491Z E1204 10:53:57.773000 614520 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9619724Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9619979Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9620273Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9620527Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9620761Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9620988Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9621211Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9621428Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9621648Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9621889Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9622124Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9622369Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9622603Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9622845Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9623079Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9623308Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9623548Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9623790Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9624024Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9624265Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9624515Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9624765Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9624983Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9625198Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9625414Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9625660Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9625895Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9626136Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9626370Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9626615Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9626850Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9627092Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9627328Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9627571Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9627819Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9628032Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.9628251Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.9628456Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.9628666Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.9628909Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.9629141Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.9629364Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.9629570Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.9629779Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.9629966Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.9630146Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.9630310Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.9630429Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.9630570Z E1204 10:53:57.773000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.9630740Z [W1204 10:53:57.235927358 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:19.9630742Z 2025-12-04T12:10:19.9630903Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:19.9631212Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:19.9631520Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:19.9631668Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:19.9632172Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:19.9632440Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:19.9632680Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:19.9632901Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:19.9633120Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9633372Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9633633Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9633877Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9634110Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9634353Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9634587Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9634829Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9635061Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9635302Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9635535Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9635751Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9635973Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9636189Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9636431Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9636673Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9636880Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9637112Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9637353Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9637587Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9637803Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9638065Z E1204 
10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9638277Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9638479Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9638711Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9638925Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9639130Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9639369Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9639613Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9639850Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9640132Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9640367Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9640582Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9640803Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9641032Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9641274Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9641507Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9641747Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9641982Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9642238Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9642481Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9642735Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9642970Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9643210Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9643444Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9643685Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9643918Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9644158Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9644395Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9644638Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9644870Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9645111Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9645343Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9645596Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9645832Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9646075Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9646309Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9646526Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9646760Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:19.9647011Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9647244Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9647485Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9647717Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9647960Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9648194Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9648438Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9648670Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9648912Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9649149Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9649390Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9649625Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9649846Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9650051Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9650313Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9650525Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9650746Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:19.9650961Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9651216Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9651476Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9651688Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9651889Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9652126Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9652339Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9652543Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9652776Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9653016Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9653252Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9653493Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9653726Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9653939Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9654160Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9654388Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9654633Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9654869Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9655084Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9655297Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9655524Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9655783Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9656030Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9656274Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9656513Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9656755Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9656993Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9657235Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9657469Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9657711Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9657945Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9658163Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9658368Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9658603Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9658831Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9659044Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9659261Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9659501Z 
E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9659739Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9659991Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9660285Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9660527Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9660762Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9661005Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9661240Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9661486Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9661720Z 
E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9661937Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9662153Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9662362Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9662590Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:19.9662804Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9663051Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9663299Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9663516Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9663731Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9663945Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9664189Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9664440Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9664695Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9664942Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9665184Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9665419Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9665664Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9665900Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9666142Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9666378Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9666624Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9666859Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9667101Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9667334Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9667575Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9667820Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9668064Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9668300Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9668542Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9668778Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9669012Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9669229Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9669463Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9669705Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9669940Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9670215Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9670451Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9670695Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9670929Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9671174Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9671412Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9671654Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9671888Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9672108Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9672344Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9672588Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9672822Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9673063Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9673315Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9673553Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9673784Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9673997Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9674212Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9674456Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9674693Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9674922Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9675137Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9675351Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9675566Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9675810Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9676044Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9676261Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9676487Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:19.9677937Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9678105Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:19.9678341Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9678547Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9678781Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9679038Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9679297Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9679510Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9679714Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9679952Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9680197Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9680403Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9680640Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9680844Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9681079Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9681285Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9681521Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9681730Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9681970Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9682230Z E1204 10:53:57.775000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9682468Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9682712Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9682947Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9683188Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9683436Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9683689Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9683938Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9684184Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9684418Z E1204 10:53:57.775000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9684632Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:19.9684837Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9685071Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9685298Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9685517Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9685733Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9685950Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9686197Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9686431Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9686684Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9686919Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9687125Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9687359Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9687601Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9687846Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9688099Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9688346Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9688574Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9688792Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9689007Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9689214Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:19.9689418Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9689652Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9689895Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9690167Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9690410Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9690644Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:19.9690848Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9691101Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9691344Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9691580Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9691824Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9692059Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9692277Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9692524Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9692780Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9693012Z E1204 10:53:57.775000 
614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9693258Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9693494Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9693723Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9693941Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9694156Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9694372Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9694615Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9694851Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9695056Z E1204 10:53:57.775000 614520 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9695290Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9695543Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9695779Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9696021Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9696256Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9696484Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9696712Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9696934Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9697161Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9697402Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9697636Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9697880Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9698115Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9698360Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9698593Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9698798Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:19.9699032Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9699274Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9699507Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9699750Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9699996Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9700357Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:19.9700576Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:19.9700790Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:19.9701007Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:19.9701261Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:19.9701509Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9701766Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9702001Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9702243Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9702479Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9702723Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9702957Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9703199Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:19.9703433Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:19.9703646Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:19.9703864Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:19.9704068Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:19.9704279Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:19.9704523Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:19.9704745Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:19.9704959Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:19.9705166Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:19.9705374Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:19.9705561Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:19.9705724Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:19.9705902Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:19.9706021Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:19.9706161Z E1204 10:53:57.775000 614520 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:19.9706219Z FAILED [0.9068s] [100%] 2025-12-04T12:10:19.9706221Z 2025-12-04T12:10:19.9706298Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9706460Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9706525Z Traceback (most recent call last): 2025-12-04T12:10:19.9706704Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9706765Z method(*args, **kwargs) 2025-12-04T12:10:19.9706932Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9706990Z method(*args, **kwargs) 2025-12-04T12:10:19.9707153Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9707208Z with policy(): 2025-12-04T12:10:19.9707376Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9707434Z raise RuntimeError(msg) 2025-12-04T12:10:19.9707850Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1075838976. 
2025-12-04T12:10:19.9707853Z 2025-12-04T12:10:19.9707949Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9708228Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9708232Z 2025-12-04T12:10:19.9708337Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9708432Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9708493Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9708569Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9709152Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9709271Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9709325Z graph_break [] 2025-12-04T12:10:19.9709407Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9709497Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9710010Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9710086Z current_size = base.storage().size() 2025-12-04T12:10:19.9710198Z Autotune Choices Stats: 2025-12-04T12:10:19.9710589Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007280000019818544, "best_triton_pos": 0} 2025-12-04T12:10:19.9710672Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9710738Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9710875Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9711127Z triton_mm_14 0.0073 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9711369Z triton_mm_8 0.0080 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9711609Z triton_mm_10 0.0080 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9711847Z triton_mm_13 0.0081 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9712089Z triton_mm_11 0.0083 ms 87.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9712325Z triton_mm_9 0.0086 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9712566Z triton_mm_12 0.0090 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9712805Z triton_mm_16 0.0097 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9712864Z _scaled_mm 0.0099 ms 73.4% 2025-12-04T12:10:19.9713117Z triton_mm_17 0.0108 ms 67.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9713269Z SingleProcess AUTOTUNE benchmarking takes 0.0869 seconds and 8.1916 seconds precompiling for 18 choices 2025-12-04T12:10:19.9713427Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9713490Z Traceback (most recent call last): 2025-12-04T12:10:19.9713662Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9713720Z method(*args, **kwargs) 2025-12-04T12:10:19.9713887Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9713944Z method(*args, 
**kwargs) 2025-12-04T12:10:19.9714121Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9714188Z with policy(): 2025-12-04T12:10:19.9714355Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9714425Z raise RuntimeError(msg) 2025-12-04T12:10:19.9714830Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1145044992. 2025-12-04T12:10:19.9714832Z 2025-12-04T12:10:19.9714925Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9715201Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9715205Z 2025-12-04T12:10:19.9715309Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9715401Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9715461Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9715535Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9716100Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 
2025-12-04T12:10:19.9716217Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9716273Z graph_break [] 2025-12-04T12:10:19.9716354Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9716442Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9716944Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9717009Z current_size = base.storage().size() 2025-12-04T12:10:19.9717065Z Autotune Choices Stats: 2025-12-04T12:10:19.9717459Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007280000019818544, "best_triton_pos": 0} 2025-12-04T12:10:19.9717543Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9717610Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9717746Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9717993Z triton_mm_14 0.0073 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9718231Z triton_mm_8 0.0080 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9718479Z triton_mm_10 0.0080 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9718738Z triton_mm_13 0.0081 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9718976Z triton_mm_11 0.0083 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9719213Z triton_mm_9 0.0086 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9719452Z triton_mm_12 0.0090 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9719694Z triton_mm_16 0.0097 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9719753Z _scaled_mm 0.0099 ms 73.4% 2025-12-04T12:10:19.9719990Z triton_mm_17 0.0108 ms 67.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9720173Z SingleProcess AUTOTUNE 
benchmarking takes 0.0869 seconds and 8.1916 seconds precompiling for 18 choices 2025-12-04T12:10:19.9720264Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9720323Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9720398Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9720512Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9721016Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9721070Z graph_break [] 2025-12-04T12:10:19.9721148Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9721237Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9721637Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:19.9721749Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:19.9721806Z Autotune Choices Stats: 2025-12-04T12:10:19.9722182Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007720000110566616, "best_triton_pos": 0} 2025-12-04T12:10:19.9722264Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9722327Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9722465Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9722744Z triton_mm_33 0.0077 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9722998Z triton_mm_38 0.0078 ms 99.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9723233Z triton_mm_30 0.0079 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9723475Z triton_mm_28 0.0080 ms 96.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9723718Z triton_mm_32 0.0081 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9723961Z triton_mm_31 0.0081 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9724203Z triton_mm_27 0.0083 ms 93.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9724440Z triton_mm_37 0.0083 ms 93.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9724681Z triton_mm_29 0.0086 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9724920Z triton_mm_34 0.0086 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9725064Z SingleProcess AUTOTUNE benchmarking takes 0.1548 seconds and 0.4673 seconds precompiling for 21 choices 2025-12-04T12:10:19.9725134Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9725291Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9725353Z Traceback (most recent call last): 2025-12-04T12:10:19.9725537Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in 
wrapper 2025-12-04T12:10:19.9725597Z method(*args, **kwargs) 2025-12-04T12:10:19.9725764Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9725823Z method(*args, **kwargs) 2025-12-04T12:10:19.9725987Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9726041Z with policy(): 2025-12-04T12:10:19.9726208Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9726267Z raise RuntimeError(msg) 2025-12-04T12:10:19.9726681Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 
2025-12-04T12:10:19.9726694Z 2025-12-04T12:10:19.9726785Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9727074Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9727077Z 2025-12-04T12:10:19.9727180Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9727270Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9727327Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9727400Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9727964Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9728081Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9728134Z graph_break [] 2025-12-04T12:10:19.9728212Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9728302Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9728799Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9728863Z current_size = base.storage().size() 2025-12-04T12:10:19.9728920Z Autotune Choices Stats: 2025-12-04T12:10:19.9729300Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007280000019818544, "best_triton_pos": 0} 2025-12-04T12:10:19.9729380Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9729444Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9729580Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9729841Z triton_mm_14 0.0073 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9730081Z triton_mm_8 0.0080 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9730354Z triton_mm_10 0.0080 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9730591Z triton_mm_13 0.0081 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9730846Z triton_mm_11 0.0083 ms 87.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9731097Z triton_mm_9 0.0086 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9731352Z triton_mm_12 0.0090 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9731591Z triton_mm_16 0.0097 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9731650Z _scaled_mm 0.0099 ms 73.4% 2025-12-04T12:10:19.9731887Z triton_mm_17 0.0108 ms 67.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9732034Z SingleProcess AUTOTUNE benchmarking takes 0.0869 seconds and 8.1916 seconds precompiling for 18 choices 2025-12-04T12:10:19.9732123Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9732181Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9732253Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9732368Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9732869Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), 
('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9732924Z graph_break [] 2025-12-04T12:10:19.9733003Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9733091Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9733468Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:19.9733575Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:19.9733632Z Autotune Choices Stats: 2025-12-04T12:10:19.9734027Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007720000110566616, "best_triton_pos": 0} 2025-12-04T12:10:19.9734110Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9734174Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9734311Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9734556Z triton_mm_33 0.0077 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9734793Z triton_mm_38 0.0078 ms 99.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9735043Z triton_mm_30 0.0079 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9735306Z triton_mm_28 0.0080 ms 96.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9735544Z triton_mm_32 0.0081 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9735781Z triton_mm_31 0.0081 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9736022Z triton_mm_27 0.0083 ms 93.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9736263Z triton_mm_37 0.0083 ms 93.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9736499Z triton_mm_29 0.0086 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9736737Z triton_mm_34 0.0086 ms 89.4% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9736880Z SingleProcess AUTOTUNE benchmarking takes 0.1548 seconds and 0.4673 seconds precompiling for 21 choices 2025-12-04T12:10:19.9736971Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9737029Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9737102Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9737216Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9737711Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9737765Z graph_break [] 2025-12-04T12:10:19.9737853Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:19.9737945Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9738001Z Autotune Choices Stats: 2025-12-04T12:10:19.9738377Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_48", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00791999977082014, "best_triton_pos": 0} 2025-12-04T12:10:19.9738456Z AUTOTUNE scaled_mm(1024x32, 32x2048, 1024x1, 1x2048, 2048) 2025-12-04T12:10:19.9738520Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9738656Z 
dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9738926Z triton_mm_48 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9739175Z triton_mm_58 0.0084 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9739423Z triton_mm_50 0.0084 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9739663Z triton_mm_51 0.0085 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9739904Z triton_mm_52 0.0085 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9740180Z triton_mm_57 0.0086 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9740419Z triton_mm_55 0.0088 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9740659Z triton_mm_47 0.0095 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9740898Z triton_mm_54 0.0098 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9741137Z triton_mm_45 0.0102 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9741282Z SingleProcess AUTOTUNE benchmarking takes 0.1267 seconds and 0.3149 seconds precompiling for 21 choices 2025-12-04T12:10:19.9741487Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7c0a8b2a94185468.xml - 2025-12-04T12:10:19.9741564Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9742179Z FAILED [0.9068s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 
2025-12-04T12:10:19.9742186Z 2025-12-04T12:10:19.9742276Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9742549Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9742552Z 2025-12-04T12:10:19.9742655Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9742734Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.9742819Z ================= 1 failed, 187 deselected, 2 rerun in 12.60s ================== 2025-12-04T12:10:19.9742875Z Got exit code 1 2025-12-04T12:10:19.9743112Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9743280Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:19.9743440Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3be8a410e1db522b.xml 2025-12-04T12:10:19.9743514Z ============================= test session starts ============================== 2025-12-04T12:10:19.9743642Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9743700Z cachedir: .pytest_cache 2025-12-04T12:10:19.9743873Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9743938Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9743995Z configfile: pytest.ini 2025-12-04T12:10:19.9744178Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9744274Z collecting ... 
collected 188 items / 83 deselected / 105 selected 2025-12-04T12:10:19.9744342Z stepcurrent: skipping 83 already run items. 2025-12-04T12:10:19.9744403Z Running 105 items in this shard 2025-12-04T12:10:19.9744405Z 2025-12-04T12:10:19.9744636Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.1405s] [ 0%] 2025-12-04T12:10:19.9744858Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7781s] [ 0%] 2025-12-04T12:10:19.9745057Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.6479s] [ 0%] 2025-12-04T12:10:19.9745060Z 2025-12-04T12:10:19.9745130Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9745284Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9745349Z Traceback (most recent call last): 2025-12-04T12:10:19.9745524Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9745584Z method(*args, **kwargs) 2025-12-04T12:10:19.9745751Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9745808Z method(*args, **kwargs) 2025-12-04T12:10:19.9745972Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9746029Z with policy(): 2025-12-04T12:10:19.9746207Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9746265Z raise RuntimeError(msg) 2025-12-04T12:10:19.9746663Z 
RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1023410176. 2025-12-04T12:10:19.9746667Z 2025-12-04T12:10:19.9746759Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9747029Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9747031Z 2025-12-04T12:10:19.9747133Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9747234Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9747303Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9747377Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9747886Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9748001Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9748054Z graph_break [] 2025-12-04T12:10:19.9748131Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9748222Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9748719Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: 
TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9748786Z current_size = base.storage().size() 2025-12-04T12:10:19.9748843Z Autotune Choices Stats: 2025-12-04T12:10:19.9749223Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007720000110566616, "best_triton_pos": 0} 2025-12-04T12:10:19.9749296Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9749364Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9749501Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9751118Z triton_mm_0 0.0077 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9751367Z triton_mm_2 0.0081 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9751610Z triton_mm_1 0.0091 ms 85.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9751870Z triton_mm_3 0.0095 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9751932Z _scaled_mm 0.0303 ms 25.5% 2025-12-04T12:10:19.9752077Z SingleProcess AUTOTUNE benchmarking takes 0.0300 seconds and 0.1609 seconds precompiling for 5 choices 2025-12-04T12:10:19.9752243Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9752305Z Traceback (most recent call last): 2025-12-04T12:10:19.9752478Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9752535Z method(*args, **kwargs) 2025-12-04T12:10:19.9752704Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9752760Z method(*args, **kwargs) 2025-12-04T12:10:19.9752928Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9752997Z with policy(): 2025-12-04T12:10:19.9753181Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9753239Z raise RuntimeError(msg) 2025-12-04T12:10:19.9753641Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1023410176 and is now 1059061760. 
2025-12-04T12:10:19.9753644Z 2025-12-04T12:10:19.9753735Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9754004Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9754008Z 2025-12-04T12:10:19.9754111Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9754202Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9754263Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9754338Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9754831Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9754948Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9755001Z graph_break [] 2025-12-04T12:10:19.9755083Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9755172Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9755732Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9755796Z current_size = base.storage().size() 2025-12-04T12:10:19.9755853Z Autotune Choices Stats: 2025-12-04T12:10:19.9756241Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007720000110566616, "best_triton_pos": 0} 2025-12-04T12:10:19.9756319Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9756387Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9756524Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9756769Z triton_mm_0 0.0077 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9757009Z triton_mm_2 0.0081 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9757250Z triton_mm_1 0.0091 ms 85.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9757498Z triton_mm_3 0.0095 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9757571Z _scaled_mm 0.0303 ms 25.5% 2025-12-04T12:10:19.9757715Z SingleProcess 
AUTOTUNE benchmarking takes 0.0300 seconds and 0.1609 seconds precompiling for 5 choices 2025-12-04T12:10:19.9757804Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9757863Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9757935Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9758050Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9758542Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9758596Z graph_break [] 2025-12-04T12:10:19.9758673Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9758762Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9758818Z Autotune Choices Stats: 2025-12-04T12:10:19.9759194Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006320000160485506, "best_triton_pos": 0} 2025-12-04T12:10:19.9759265Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9759330Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9759465Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9759725Z triton_mm_5 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9759963Z triton_mm_4 0.0075 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9760258Z triton_mm_7 0.0096 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9760496Z triton_mm_6 0.0104 ms 60.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9760553Z _scaled_mm 0.0281 ms 22.5% 2025-12-04T12:10:19.9760698Z SingleProcess AUTOTUNE benchmarking takes 0.0273 seconds and 0.1414 seconds precompiling for 5 choices 2025-12-04T12:10:19.9760765Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9760919Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9760981Z Traceback (most recent call last): 2025-12-04T12:10:19.9761152Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9761209Z method(*args, **kwargs) 2025-12-04T12:10:19.9761375Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9761447Z method(*args, **kwargs) 2025-12-04T12:10:19.9761630Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9761683Z with policy(): 
2025-12-04T12:10:19.9761854Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9761911Z raise RuntimeError(msg) 2025-12-04T12:10:19.9762312Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:19.9762316Z 2025-12-04T12:10:19.9762407Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9762676Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9762680Z 2025-12-04T12:10:19.9762783Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9762872Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9762931Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9763003Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9763494Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9763610Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9763664Z graph_break [] 2025-12-04T12:10:19.9763742Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 
2025-12-04T12:10:19.9763849Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9764345Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9764408Z current_size = base.storage().size() 2025-12-04T12:10:19.9764476Z Autotune Choices Stats: 2025-12-04T12:10:19.9764849Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007720000110566616, "best_triton_pos": 0} 2025-12-04T12:10:19.9764923Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9764988Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9765128Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9765372Z triton_mm_0 0.0077 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9765611Z triton_mm_2 0.0081 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9765862Z triton_mm_1 0.0091 ms 85.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9766112Z triton_mm_3 0.0095 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9766169Z _scaled_mm 0.0303 ms 25.5% 2025-12-04T12:10:19.9766311Z SingleProcess AUTOTUNE benchmarking takes 0.0300 seconds and 0.1609 seconds precompiling for 5 choices 2025-12-04T12:10:19.9766401Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9766460Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9766533Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9766648Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9767137Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9767191Z graph_break [] 2025-12-04T12:10:19.9767268Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9767358Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9767414Z Autotune Choices Stats: 2025-12-04T12:10:19.9767788Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006320000160485506, "best_triton_pos": 
0} 2025-12-04T12:10:19.9767860Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9767937Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9768072Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9768318Z triton_mm_5 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9768564Z triton_mm_4 0.0075 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9768806Z triton_mm_7 0.0096 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9769044Z triton_mm_6 0.0104 ms 60.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9769101Z _scaled_mm 0.0281 ms 22.5% 2025-12-04T12:10:19.9769244Z SingleProcess AUTOTUNE benchmarking takes 0.0273 seconds and 0.1414 seconds precompiling for 5 choices 2025-12-04T12:10:19.9769332Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9769390Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9769463Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9769578Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9770078Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9770184Z graph_break [] 2025-12-04T12:10:19.9770260Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9770348Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9770405Z Autotune Choices Stats: 2025-12-04T12:10:19.9770778Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:19.9770852Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9770916Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9771052Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9771297Z triton_mm_9 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9771539Z triton_mm_11 0.0069 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9771776Z triton_mm_10 0.0072 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9772036Z triton_mm_8 0.0080 ms 80.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9772094Z _scaled_mm 0.0300 ms 21.4% 2025-12-04T12:10:19.9772238Z SingleProcess AUTOTUNE benchmarking takes 0.0346 seconds and 0.2194 seconds precompiling for 5 choices 2025-12-04T12:10:19.9772442Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3be8a410e1db522b.xml - 2025-12-04T12:10:19.9772518Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9773125Z FAILED [0.6479s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:19.9773130Z 2025-12-04T12:10:19.9773220Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9773488Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9773491Z 2025-12-04T12:10:19.9773593Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9773672Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.9773757Z ================== 1 failed, 83 deselected, 2 rerun in 3.59s =================== 2025-12-04T12:10:19.9773825Z Got exit code 1 2025-12-04T12:10:19.9773882Z Retrying single test... 2025-12-04T12:10:19.9774057Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-bbd73712c571e814.xml 2025-12-04T12:10:19.9774131Z ============================= test session starts ============================== 2025-12-04T12:10:19.9774259Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9774318Z cachedir: .pytest_cache 2025-12-04T12:10:19.9774492Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9774556Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9774614Z configfile: pytest.ini 2025-12-04T12:10:19.9774798Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9774890Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.9775153Z stepcurrent: skipping 83 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9775213Z Running 1 items in this shard 2025-12-04T12:10:19.9775216Z 2025-12-04T12:10:19.9775442Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9379s] [100%] 2025-12-04T12:10:19.9775664Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.5469s] [100%] 2025-12-04T12:10:19.9775865Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.6402s] [100%] 2025-12-04T12:10:19.9775869Z 2025-12-04T12:10:19.9775938Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9776094Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9776155Z Traceback (most recent call last): 2025-12-04T12:10:19.9776338Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9776399Z method(*args, **kwargs) 2025-12-04T12:10:19.9776567Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9776625Z method(*args, **kwargs) 2025-12-04T12:10:19.9776790Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9776854Z with policy(): 2025-12-04T12:10:19.9777021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9777080Z raise RuntimeError(msg) 
2025-12-04T12:10:19.9777479Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1023410176. 2025-12-04T12:10:19.9777481Z 2025-12-04T12:10:19.9777572Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9777842Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9777846Z 2025-12-04T12:10:19.9777949Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9778039Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9778111Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9778196Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9778690Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9778804Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9778858Z graph_break [] 2025-12-04T12:10:19.9778936Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9779027Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9779524Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9779589Z current_size = base.storage().size() 2025-12-04T12:10:19.9779646Z Autotune Choices Stats: 2025-12-04T12:10:19.9780027Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:19.9780133Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9780199Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9780337Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9780606Z triton_mm_3 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9780845Z triton_mm_2 0.0066 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9781084Z triton_mm_1 0.0070 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9781333Z triton_mm_0 0.0076 ms 84.8% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9781392Z _scaled_mm 0.0303 ms 21.4% 2025-12-04T12:10:19.9781535Z SingleProcess AUTOTUNE benchmarking takes 0.0245 seconds and 0.1529 seconds precompiling for 5 choices 2025-12-04T12:10:19.9781689Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9781754Z Traceback (most recent call last): 2025-12-04T12:10:19.9781925Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9781984Z method(*args, **kwargs) 2025-12-04T12:10:19.9782152Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9782211Z method(*args, **kwargs) 2025-12-04T12:10:19.9782376Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9782444Z with policy(): 2025-12-04T12:10:19.9782627Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9782684Z raise RuntimeError(msg) 2025-12-04T12:10:19.9783085Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1023410176 and is now 1059061760. 
2025-12-04T12:10:19.9783087Z 2025-12-04T12:10:19.9783179Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9783555Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9783558Z 2025-12-04T12:10:19.9783661Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9783751Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9783810Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9783885Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9784377Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9784494Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9784547Z graph_break [] 2025-12-04T12:10:19.9784624Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9784712Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9785225Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9785289Z current_size = base.storage().size() 2025-12-04T12:10:19.9785346Z Autotune Choices Stats: 2025-12-04T12:10:19.9785734Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:19.9785807Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9785873Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9786008Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9786257Z triton_mm_3 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9786496Z triton_mm_2 0.0066 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9786737Z triton_mm_1 0.0070 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9787009Z triton_mm_0 0.0076 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9787066Z _scaled_mm 0.0303 ms 21.4% 2025-12-04T12:10:19.9787210Z SingleProcess 
AUTOTUNE benchmarking takes 0.0245 seconds and 0.1529 seconds precompiling for 5 choices 2025-12-04T12:10:19.9787299Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9787356Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9787427Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9787543Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9788034Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9788093Z graph_break [] 2025-12-04T12:10:19.9788169Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9788260Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9788316Z Autotune Choices Stats: 2025-12-04T12:10:19.9788693Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:19.9788764Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9788829Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9788965Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9789219Z triton_mm_5 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9789460Z triton_mm_7 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9789708Z triton_mm_6 0.0075 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9789949Z triton_mm_4 0.0077 ms 80.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9790007Z _scaled_mm 0.0307 ms 20.1% 2025-12-04T12:10:19.9790184Z SingleProcess AUTOTUNE benchmarking takes 0.0234 seconds and 0.1218 seconds precompiling for 5 choices 2025-12-04T12:10:19.9790254Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9790408Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9790470Z Traceback (most recent call last): 2025-12-04T12:10:19.9790643Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9790700Z method(*args, **kwargs) 2025-12-04T12:10:19.9790886Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9790959Z method(*args, **kwargs) 2025-12-04T12:10:19.9791125Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9791182Z with policy(): 
2025-12-04T12:10:19.9791349Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9791408Z raise RuntimeError(msg) 2025-12-04T12:10:19.9791811Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:19.9791814Z 2025-12-04T12:10:19.9791906Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9792175Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9792179Z 2025-12-04T12:10:19.9792282Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9792371Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9792428Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9792502Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9792993Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9793108Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9793163Z graph_break [] 2025-12-04T12:10:19.9793240Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 
2025-12-04T12:10:19.9793342Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9793840Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9793903Z current_size = base.storage().size() 2025-12-04T12:10:19.9793976Z Autotune Choices Stats: 2025-12-04T12:10:19.9794358Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:19.9794431Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9794498Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9794633Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9794882Z triton_mm_3 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9795122Z triton_mm_2 0.0066 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9795382Z triton_mm_1 0.0070 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9795622Z triton_mm_0 0.0076 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9795679Z _scaled_mm 0.0303 ms 21.4% 2025-12-04T12:10:19.9795822Z SingleProcess AUTOTUNE benchmarking takes 0.0245 seconds and 0.1529 seconds precompiling for 5 choices 2025-12-04T12:10:19.9795910Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9795971Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9796044Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9796158Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9796655Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9796707Z graph_break [] 2025-12-04T12:10:19.9796783Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9796873Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9796931Z Autotune Choices Stats: 2025-12-04T12:10:19.9797309Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0061599998734891415, "best_triton_pos": 
0} 2025-12-04T12:10:19.9797382Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9797457Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9797594Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9797838Z triton_mm_5 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9798090Z triton_mm_7 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9798329Z triton_mm_6 0.0075 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9798565Z triton_mm_4 0.0077 ms 80.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9798622Z _scaled_mm 0.0307 ms 20.1% 2025-12-04T12:10:19.9798763Z SingleProcess AUTOTUNE benchmarking takes 0.0234 seconds and 0.1218 seconds precompiling for 5 choices 2025-12-04T12:10:19.9798852Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9798909Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9798981Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9799108Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9799599Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9799664Z graph_break [] 2025-12-04T12:10:19.9799740Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9799830Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9799887Z Autotune Choices Stats: 2025-12-04T12:10:19.9800301Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:19.9800373Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9800440Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9800574Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9800824Z triton_mm_11 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9801066Z triton_mm_9 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9801306Z triton_mm_10 0.0071 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9801559Z triton_mm_8 0.0103 ms 59.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9801616Z _scaled_mm 0.0304 ms 20.1% 2025-12-04T12:10:19.9801760Z SingleProcess AUTOTUNE benchmarking takes 0.0341 seconds and 0.2206 seconds precompiling for 5 choices 2025-12-04T12:10:19.9801963Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-bbd73712c571e814.xml - 2025-12-04T12:10:19.9802062Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9802655Z FAILED [0.6402s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:19.9802659Z 2025-12-04T12:10:19.9802749Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9803017Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9803020Z 2025-12-04T12:10:19.9803122Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9803201Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.9803299Z ================== 1 failed, 187 deselected, 2 rerun in 3.14s ================== 2025-12-04T12:10:19.9803353Z Got exit code 1 2025-12-04T12:10:19.9803423Z Retrying single test... 2025-12-04T12:10:19.9803583Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d2a7a0a7110b710b.xml 2025-12-04T12:10:19.9803657Z ============================= test session starts ============================== 2025-12-04T12:10:19.9803784Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9803841Z cachedir: .pytest_cache 2025-12-04T12:10:19.9804014Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9804076Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9804133Z configfile: pytest.ini 2025-12-04T12:10:19.9804312Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9804405Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.9804671Z stepcurrent: skipping 83 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9804732Z Running 1 items in this shard 2025-12-04T12:10:19.9804734Z 2025-12-04T12:10:19.9804959Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8957s] [100%] 2025-12-04T12:10:19.9805182Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.5803s] [100%] 2025-12-04T12:10:19.9805384Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.6163s] [100%] 2025-12-04T12:10:19.9805388Z 2025-12-04T12:10:19.9805456Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9805611Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9805682Z Traceback (most recent call last): 2025-12-04T12:10:19.9805855Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9805913Z method(*args, **kwargs) 2025-12-04T12:10:19.9806081Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9806137Z method(*args, **kwargs) 2025-12-04T12:10:19.9806313Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9806367Z with policy(): 2025-12-04T12:10:19.9806535Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9806593Z raise RuntimeError(msg) 
2025-12-04T12:10:19.9806990Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1023410176. 2025-12-04T12:10:19.9806992Z 2025-12-04T12:10:19.9807083Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9807351Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9807353Z 2025-12-04T12:10:19.9807457Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9807556Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9807627Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9807701Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9808194Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9808307Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9808361Z graph_break [] 2025-12-04T12:10:19.9808438Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9808529Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9809028Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9809092Z current_size = base.storage().size() 2025-12-04T12:10:19.9809150Z Autotune Choices Stats: 2025-12-04T12:10:19.9809529Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:19.9809602Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9809667Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9809804Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9810063Z triton_mm_1 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9810332Z triton_mm_2 0.0074 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9810584Z triton_mm_3 0.0094 ms 73.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9810822Z triton_mm_0 0.0111 ms 62.1% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9810881Z _scaled_mm 0.0298 ms 23.1% 2025-12-04T12:10:19.9811024Z SingleProcess AUTOTUNE benchmarking takes 0.0266 seconds and 0.1369 seconds precompiling for 5 choices 2025-12-04T12:10:19.9811178Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9811240Z Traceback (most recent call last): 2025-12-04T12:10:19.9811411Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9811467Z method(*args, **kwargs) 2025-12-04T12:10:19.9811635Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9811691Z method(*args, **kwargs) 2025-12-04T12:10:19.9811870Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9811938Z with policy(): 2025-12-04T12:10:19.9812107Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9812165Z raise RuntimeError(msg) 2025-12-04T12:10:19.9812562Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1023410176 and is now 1059061760. 
2025-12-04T12:10:19.9812564Z 2025-12-04T12:10:19.9812656Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9812924Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9812927Z 2025-12-04T12:10:19.9813032Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9813120Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9813179Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9813251Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9813742Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9813855Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9813910Z graph_break [] 2025-12-04T12:10:19.9813985Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9814076Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9814587Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9814650Z current_size = base.storage().size() 2025-12-04T12:10:19.9814707Z Autotune Choices Stats: 2025-12-04T12:10:19.9815094Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:19.9815168Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9815233Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9815369Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9815617Z triton_mm_1 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9815857Z triton_mm_2 0.0074 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9816100Z triton_mm_3 0.0094 ms 73.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9816359Z triton_mm_0 0.0111 ms 62.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9816418Z _scaled_mm 0.0298 ms 23.1% 2025-12-04T12:10:19.9816559Z SingleProcess 
AUTOTUNE benchmarking takes 0.0266 seconds and 0.1369 seconds precompiling for 5 choices 2025-12-04T12:10:19.9816649Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9816706Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9816779Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9816893Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9817387Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9817440Z graph_break [] 2025-12-04T12:10:19.9817516Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9817605Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9817662Z Autotune Choices Stats: 2025-12-04T12:10:19.9818042Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0069599999114871025, "best_triton_pos": 0} 2025-12-04T12:10:19.9818113Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9818179Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9818314Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9818578Z triton_mm_7 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9818817Z triton_mm_6 0.0072 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9819066Z triton_mm_4 0.0075 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9819307Z triton_mm_5 0.0086 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9819365Z _scaled_mm 0.0306 ms 22.7% 2025-12-04T12:10:19.9819508Z SingleProcess AUTOTUNE benchmarking takes 0.0282 seconds and 0.1135 seconds precompiling for 5 choices 2025-12-04T12:10:19.9819577Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9819731Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9819792Z Traceback (most recent call last): 2025-12-04T12:10:19.9819966Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9820038Z method(*args, **kwargs) 2025-12-04T12:10:19.9820246Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9820318Z method(*args, **kwargs) 2025-12-04T12:10:19.9820485Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9820538Z with policy(): 
2025-12-04T12:10:19.9820706Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9820763Z raise RuntimeError(msg) 2025-12-04T12:10:19.9821162Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:19.9821165Z 2025-12-04T12:10:19.9821256Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9821531Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9821534Z 2025-12-04T12:10:19.9821637Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9821726Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9821785Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9821858Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9822351Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9822468Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9822524Z graph_break [] 2025-12-04T12:10:19.9822621Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 
2025-12-04T12:10:19.9822714Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9823212Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9823290Z current_size = base.storage().size() 2025-12-04T12:10:19.9823351Z Autotune Choices Stats: 2025-12-04T12:10:19.9823731Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:19.9823805Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9823870Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9824006Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9824252Z triton_mm_1 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9824490Z triton_mm_2 0.0074 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9824753Z triton_mm_3 0.0094 ms 73.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9824990Z triton_mm_0 0.0111 ms 62.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9825049Z _scaled_mm 0.0298 ms 23.1% 2025-12-04T12:10:19.9825191Z SingleProcess AUTOTUNE benchmarking takes 0.0266 seconds and 0.1369 seconds precompiling for 5 choices 2025-12-04T12:10:19.9825282Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9825340Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9825413Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9825529Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9826022Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9826075Z graph_break [] 2025-12-04T12:10:19.9826151Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9826240Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9826298Z Autotune Choices Stats: 2025-12-04T12:10:19.9826672Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0069599999114871025, "best_triton_pos": 
0} 2025-12-04T12:10:19.9826761Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9826829Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9826964Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9827210Z triton_mm_7 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9827457Z triton_mm_6 0.0072 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9827695Z triton_mm_4 0.0075 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9827936Z triton_mm_5 0.0086 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9827993Z _scaled_mm 0.0306 ms 22.7% 2025-12-04T12:10:19.9828136Z SingleProcess AUTOTUNE benchmarking takes 0.0282 seconds and 0.1135 seconds precompiling for 5 choices 2025-12-04T12:10:19.9828227Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9828286Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9828357Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9828483Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9828984Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9829037Z graph_break [] 2025-12-04T12:10:19.9829112Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:19.9829203Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9829260Z Autotune Choices Stats: 2025-12-04T12:10:19.9829636Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:19.9829707Z AUTOTUNE scaled_mm(1x1024, 1024x16, 1x1, 1x16, 16) 2025-12-04T12:10:19.9829773Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9829908Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9830193Z triton_mm_9 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9830435Z triton_mm_10 0.0070 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9830679Z triton_mm_11 0.0072 ms 95.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9830932Z triton_mm_8 0.0073 ms 94.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9830989Z _scaled_mm 0.0294 ms 23.4% 2025-12-04T12:10:19.9831130Z SingleProcess AUTOTUNE benchmarking takes 0.0336 seconds and 0.2128 seconds precompiling for 5 choices 2025-12-04T12:10:19.9831332Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d2a7a0a7110b710b.xml - 2025-12-04T12:10:19.9831423Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9832014Z FAILED [0.6163s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:19.9832018Z 2025-12-04T12:10:19.9832106Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9832375Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9832377Z 2025-12-04T12:10:19.9832479Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9832557Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.9832653Z ================== 1 failed, 187 deselected, 2 rerun in 3.11s ================== 2025-12-04T12:10:19.9832727Z Got exit code 1 2025-12-04T12:10:19.9832945Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9833089Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:19.9833247Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c1faa9ce965f4b89.xml 2025-12-04T12:10:19.9833321Z ============================= test session starts ============================== 2025-12-04T12:10:19.9833447Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9833505Z cachedir: .pytest_cache 2025-12-04T12:10:19.9833686Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9833750Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9833808Z configfile: pytest.ini 2025-12-04T12:10:19.9833986Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9834081Z collecting ... collected 188 items / 84 deselected / 104 selected 2025-12-04T12:10:19.9834150Z stepcurrent: skipping 84 already run items. 
2025-12-04T12:10:19.9834210Z Running 104 items in this shard 2025-12-04T12:10:19.9834212Z 2025-12-04T12:10:19.9834441Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.6823s] [ 0%] 2025-12-04T12:10:19.9834667Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.9537s] [ 0%] 2025-12-04T12:10:19.9834869Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda FAILED [0.9616s] [ 0%] 2025-12-04T12:10:19.9834872Z 2025-12-04T12:10:19.9834942Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9835107Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9835171Z Traceback (most recent call last): 2025-12-04T12:10:19.9835345Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9835402Z method(*args, **kwargs) 2025-12-04T12:10:19.9835569Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9835634Z method(*args, **kwargs) 2025-12-04T12:10:19.9835803Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9835857Z with policy(): 2025-12-04T12:10:19.9836026Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9836083Z raise RuntimeError(msg) 2025-12-04T12:10:19.9836486Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:19.9836488Z 2025-12-04T12:10:19.9836577Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9836850Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9836864Z 2025-12-04T12:10:19.9836967Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9837068Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9837126Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9837200Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9837703Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9837818Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9837874Z graph_break [] 2025-12-04T12:10:19.9837954Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9838043Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9838540Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9838604Z current_size = base.storage().size() 2025-12-04T12:10:19.9838661Z Autotune Choices Stats: 2025-12-04T12:10:19.9839050Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:19.9839135Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9839202Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9839351Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9839602Z triton_mm_16 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9839847Z triton_mm_17 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9840141Z triton_mm_6 0.0076 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9840383Z triton_mm_14 0.0094 ms 64.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9840624Z triton_mm_7 0.0100 ms 60.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9840866Z triton_mm_18 0.0100 ms 60.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9841104Z triton_mm_12 0.0102 ms 59.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9841376Z triton_mm_11 0.0106 ms 57.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9841618Z triton_mm_9 0.0111 ms 54.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9841856Z triton_mm_13 0.0113 ms 53.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9842004Z SingleProcess AUTOTUNE benchmarking takes 0.0923 seconds and 0.4145 seconds precompiling for 20 choices 2025-12-04T12:10:19.9842161Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9842223Z Traceback (most recent call last): 2025-12-04T12:10:19.9842396Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9842453Z method(*args, **kwargs) 2025-12-04T12:10:19.9842621Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9842677Z method(*args, **kwargs) 2025-12-04T12:10:19.9842844Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9842897Z with policy(): 2025-12-04T12:10:19.9843066Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9843125Z raise RuntimeError(msg) 2025-12-04T12:10:19.9843539Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1121976320. 
2025-12-04T12:10:19.9843542Z 2025-12-04T12:10:19.9843632Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9843903Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9843906Z 2025-12-04T12:10:19.9844011Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9844110Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9844171Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9844244Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9844746Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9844859Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9844914Z graph_break [] 2025-12-04T12:10:19.9844994Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9845083Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9845578Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9845665Z current_size = base.storage().size() 2025-12-04T12:10:19.9845723Z Autotune Choices Stats: 2025-12-04T12:10:19.9846105Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:19.9846186Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9846253Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9846390Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9846638Z triton_mm_16 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9846882Z triton_mm_17 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9847125Z triton_mm_6 0.0076 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9847369Z triton_mm_14 0.0094 ms 64.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9847610Z triton_mm_7 0.0100 ms 60.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9847865Z triton_mm_18 0.0100 ms 60.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9848105Z triton_mm_12 0.0102 ms 59.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9848357Z triton_mm_11 0.0106 ms 57.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9848600Z triton_mm_9 0.0111 ms 54.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9848841Z triton_mm_13 0.0113 ms 53.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9848985Z SingleProcess AUTOTUNE benchmarking takes 0.0923 seconds and 0.4145 seconds precompiling for 20 choices 2025-12-04T12:10:19.9849075Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9849132Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9849209Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9849340Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9849839Z 
inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9849904Z graph_break [] 2025-12-04T12:10:19.9849985Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9850075Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9850171Z Autotune Choices Stats: 2025-12-04T12:10:19.9850550Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:19.9850629Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9850698Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9850834Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9851086Z triton_mm_36 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9851324Z triton_mm_31 0.0069 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9851565Z triton_mm_26 0.0070 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9851825Z triton_mm_35 0.0071 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9852067Z triton_mm_28 0.0074 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9852322Z triton_mm_25 0.0083 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9852560Z triton_mm_29 0.0085 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9852800Z triton_mm_33 0.0087 ms 68.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9853039Z triton_mm_24 0.0092 ms 64.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9853280Z triton_mm_30 0.0100 ms 59.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9853438Z SingleProcess 
AUTOTUNE benchmarking takes 0.1149 seconds and 0.2798 seconds precompiling for 20 choices 2025-12-04T12:10:19.9853521Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9853679Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9853741Z Traceback (most recent call last): 2025-12-04T12:10:19.9853913Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9853970Z method(*args, **kwargs) 2025-12-04T12:10:19.9854138Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9854195Z method(*args, **kwargs) 2025-12-04T12:10:19.9854363Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9854417Z with policy(): 2025-12-04T12:10:19.9854584Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9854643Z raise RuntimeError(msg) 2025-12-04T12:10:19.9855050Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 
2025-12-04T12:10:19.9855052Z 2025-12-04T12:10:19.9855143Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9855415Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9855417Z 2025-12-04T12:10:19.9855523Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9855612Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9855675Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9855749Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9856261Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9856375Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9856428Z graph_break [] 2025-12-04T12:10:19.9856515Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9856609Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9857106Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9857171Z current_size = base.storage().size() 2025-12-04T12:10:19.9857229Z Autotune Choices Stats: 2025-12-04T12:10:19.9857610Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:19.9857701Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9857768Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9857918Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9858168Z triton_mm_16 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9858410Z triton_mm_17 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9858654Z triton_mm_6 0.0076 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9858893Z triton_mm_14 0.0094 ms 64.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9859136Z triton_mm_7 0.0100 ms 60.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9859379Z triton_mm_18 0.0100 ms 60.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9859622Z triton_mm_12 0.0102 ms 59.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9859862Z triton_mm_11 0.0106 ms 57.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9860163Z triton_mm_9 0.0111 ms 54.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9860403Z triton_mm_13 0.0113 ms 53.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9860559Z SingleProcess AUTOTUNE benchmarking takes 0.0923 seconds and 0.4145 seconds precompiling for 20 choices 2025-12-04T12:10:19.9860652Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9860713Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9860786Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9860900Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9861400Z 
inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9861454Z graph_break [] 2025-12-04T12:10:19.9861531Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9861621Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9861691Z Autotune Choices Stats: 2025-12-04T12:10:19.9862067Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:19.9862157Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9862224Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9862358Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9862604Z triton_mm_36 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9862841Z triton_mm_31 0.0069 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9863083Z triton_mm_26 0.0070 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9863323Z triton_mm_35 0.0071 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9863563Z triton_mm_28 0.0074 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9863805Z triton_mm_25 0.0083 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9864055Z triton_mm_29 0.0085 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9864294Z triton_mm_33 0.0087 ms 68.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9864549Z triton_mm_24 0.0092 ms 64.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9864788Z triton_mm_30 0.0100 ms 59.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9864936Z SingleProcess 
AUTOTUNE benchmarking takes 0.1149 seconds and 0.2798 seconds precompiling for 20 choices 2025-12-04T12:10:19.9865025Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9865085Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9865157Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9865272Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9865770Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9865849Z graph_break [] 2025-12-04T12:10:19.9865924Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9866016Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9866074Z Autotune Choices Stats: 2025-12-04T12:10:19.9866449Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_45", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007000000216066837, "best_triton_pos": 0} 2025-12-04T12:10:19.9866527Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9866594Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9866731Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9867056Z triton_mm_45 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, 
BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9867299Z triton_mm_55 0.0070 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9867540Z triton_mm_44 0.0074 ms 94.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9867781Z triton_mm_54 0.0075 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9868020Z triton_mm_50 0.0077 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9868275Z triton_mm_47 0.0081 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9868513Z triton_mm_52 0.0081 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9868768Z triton_mm_43 0.0085 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9869009Z triton_mm_49 0.0097 ms 72.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9869254Z triton_mm_56 0.0101 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9869398Z SingleProcess AUTOTUNE benchmarking takes 0.1500 seconds and 0.2702 seconds precompiling for 20 choices 2025-12-04T12:10:19.9869605Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c1faa9ce965f4b89.xml - 2025-12-04T12:10:19.9869682Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9870344Z FAILED [0.9616s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 2025-12-04T12:10:19.9870363Z 2025-12-04T12:10:19.9870454Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9870728Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9870730Z 2025-12-04T12:10:19.9870834Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9870911Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.9870995Z ================== 1 failed, 84 deselected, 2 rerun in 4.62s =================== 2025-12-04T12:10:19.9871049Z Got exit code 1 2025-12-04T12:10:19.9871107Z Retrying single test... 2025-12-04T12:10:19.9871267Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d6be29c5a3c48388.xml 2025-12-04T12:10:19.9871341Z ============================= test session starts ============================== 2025-12-04T12:10:19.9871470Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9871527Z cachedir: .pytest_cache 2025-12-04T12:10:19.9871700Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9871762Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9871819Z configfile: pytest.ini 2025-12-04T12:10:19.9871998Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9872089Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.9872365Z stepcurrent: skipping 84 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9872425Z Running 1 items in this shard 2025-12-04T12:10:19.9872427Z 2025-12-04T12:10:19.9872654Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.5868s] [100%] 2025-12-04T12:10:19.9872893Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0069s] [100%] 2025-12-04T12:10:19.9873095Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda FAILED [0.8822s] [100%] 2025-12-04T12:10:19.9873099Z 2025-12-04T12:10:19.9873168Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9873326Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9873391Z Traceback (most recent call last): 2025-12-04T12:10:19.9873563Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9873621Z method(*args, **kwargs) 2025-12-04T12:10:19.9873788Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9873845Z method(*args, **kwargs) 2025-12-04T12:10:19.9874011Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9874080Z with policy(): 2025-12-04T12:10:19.9874247Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9874317Z raise RuntimeError(msg) 
2025-12-04T12:10:19.9874716Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:19.9874719Z 2025-12-04T12:10:19.9874810Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9875086Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9875088Z 2025-12-04T12:10:19.9875192Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9875281Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9875342Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9875418Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9875918Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9876033Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9876087Z graph_break [] 2025-12-04T12:10:19.9876166Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9876257Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9876771Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9876836Z current_size = base.storage().size() 2025-12-04T12:10:19.9876894Z Autotune Choices Stats: 2025-12-04T12:10:19.9877295Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:19.9877376Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9877445Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9877582Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9877834Z triton_mm_16 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9878077Z triton_mm_17 0.0066 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9878317Z triton_mm_7 0.0068 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9878566Z triton_mm_12 0.0068 ms 
88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9878820Z triton_mm_6 0.0075 ms 80.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9879059Z triton_mm_5 0.0082 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9879297Z triton_mm_14 0.0082 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9879536Z triton_mm_10 0.0083 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9879780Z triton_mm_18 0.0100 ms 60.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9880020Z triton_mm_11 0.0104 ms 58.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9880200Z SingleProcess AUTOTUNE benchmarking takes 0.0817 seconds and 0.4172 seconds precompiling for 20 choices 2025-12-04T12:10:19.9880356Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9880421Z Traceback (most recent call last): 2025-12-04T12:10:19.9880595Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9880655Z method(*args, **kwargs) 2025-12-04T12:10:19.9880847Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9880906Z method(*args, **kwargs) 2025-12-04T12:10:19.9881071Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9881126Z with policy(): 2025-12-04T12:10:19.9881293Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9881365Z raise RuntimeError(msg) 2025-12-04T12:10:19.9881771Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1121976320. 
2025-12-04T12:10:19.9881774Z 2025-12-04T12:10:19.9881867Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9882140Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9882142Z 2025-12-04T12:10:19.9882244Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9882335Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9882397Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9882472Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9882987Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9883115Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9883169Z graph_break [] 2025-12-04T12:10:19.9883247Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9883337Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9883834Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9883901Z current_size = base.storage().size() 2025-12-04T12:10:19.9883958Z Autotune Choices Stats: 2025-12-04T12:10:19.9884343Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:19.9884424Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9884492Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9884629Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9884880Z triton_mm_16 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9885137Z triton_mm_17 0.0066 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9885378Z triton_mm_7 0.0068 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9885628Z triton_mm_12 0.0068 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9885872Z triton_mm_6 0.0075 ms 80.9% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9886113Z triton_mm_5 0.0082 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9886351Z triton_mm_14 0.0082 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9886589Z triton_mm_10 0.0083 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9886845Z triton_mm_18 0.0100 ms 60.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9887098Z triton_mm_11 0.0104 ms 58.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9887245Z SingleProcess AUTOTUNE benchmarking takes 0.0817 seconds and 0.4172 seconds precompiling for 20 choices 2025-12-04T12:10:19.9887336Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9887396Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9887468Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9887585Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9888083Z 
inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9888136Z graph_break [] 2025-12-04T12:10:19.9888215Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9888304Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9888361Z Autotune Choices Stats: 2025-12-04T12:10:19.9888737Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:19.9888817Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9888884Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9889033Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9889280Z triton_mm_35 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9889525Z triton_mm_36 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9889776Z triton_mm_26 0.0068 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9890017Z triton_mm_31 0.0071 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9890306Z triton_mm_25 0.0074 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9890546Z triton_mm_28 0.0078 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9890785Z triton_mm_24 0.0086 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9891059Z triton_mm_29 0.0086 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9891295Z triton_mm_33 0.0090 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9891538Z triton_mm_37 0.0096 ms 61.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9891682Z SingleProcess 
AUTOTUNE benchmarking takes 0.1205 seconds and 0.2770 seconds precompiling for 20 choices 2025-12-04T12:10:19.9891754Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9891910Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9891974Z Traceback (most recent call last): 2025-12-04T12:10:19.9892146Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9892203Z method(*args, **kwargs) 2025-12-04T12:10:19.9892370Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9892427Z method(*args, **kwargs) 2025-12-04T12:10:19.9892594Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9892648Z with policy(): 2025-12-04T12:10:19.9892816Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9892874Z raise RuntimeError(msg) 2025-12-04T12:10:19.9893291Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 
2025-12-04T12:10:19.9893294Z 2025-12-04T12:10:19.9893384Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9893655Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9893657Z 2025-12-04T12:10:19.9893772Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9893864Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9893922Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9893996Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9894495Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9894608Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9894663Z graph_break [] 2025-12-04T12:10:19.9894740Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9894829Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9895337Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9895420Z current_size = base.storage().size() 2025-12-04T12:10:19.9895476Z Autotune Choices Stats: 2025-12-04T12:10:19.9895861Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:19.9895941Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9896010Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9896146Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9896395Z triton_mm_16 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9896637Z triton_mm_17 0.0066 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9896877Z triton_mm_7 0.0068 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9897114Z triton_mm_12 0.0068 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9897367Z triton_mm_6 0.0075 ms 80.9% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9897609Z triton_mm_5 0.0082 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9897858Z triton_mm_14 0.0082 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9898098Z triton_mm_10 0.0083 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9898344Z triton_mm_18 0.0100 ms 60.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9898582Z triton_mm_11 0.0104 ms 58.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9898726Z SingleProcess AUTOTUNE benchmarking takes 0.0817 seconds and 0.4172 seconds precompiling for 20 choices 2025-12-04T12:10:19.9898817Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9898875Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9898959Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9899074Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9899583Z 
inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9899636Z graph_break [] 2025-12-04T12:10:19.9899713Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9899801Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9899858Z Autotune Choices Stats: 2025-12-04T12:10:19.9900276Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:19.9900357Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9900423Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9900559Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9900806Z triton_mm_35 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9901049Z triton_mm_36 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9901295Z triton_mm_26 0.0068 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9901549Z triton_mm_31 0.0071 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9901794Z triton_mm_25 0.0074 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9902046Z triton_mm_28 0.0078 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9902286Z triton_mm_24 0.0086 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9902526Z triton_mm_29 0.0086 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9902762Z triton_mm_33 0.0090 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9903005Z triton_mm_37 0.0096 ms 61.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9903176Z SingleProcess 
AUTOTUNE benchmarking takes 0.1205 seconds and 0.2770 seconds precompiling for 20 choices 2025-12-04T12:10:19.9903266Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9903324Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9903397Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9903511Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9904006Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9904061Z graph_break [] 2025-12-04T12:10:19.9904136Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9904227Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9904283Z Autotune Choices Stats: 2025-12-04T12:10:19.9904659Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_54", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006639999803155661, "best_triton_pos": 0} 2025-12-04T12:10:19.9904737Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9904805Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9904941Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9906737Z triton_mm_54 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9907001Z triton_mm_55 0.0067 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9907243Z triton_mm_45 0.0071 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9907493Z triton_mm_50 0.0080 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9907736Z triton_mm_47 0.0080 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9907976Z triton_mm_48 0.0082 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9908220Z triton_mm_44 0.0084 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9908457Z triton_mm_43 0.0085 ms 77.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9908707Z triton_mm_52 0.0089 ms 74.8% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9908962Z triton_mm_56 0.0102 ms 65.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9909107Z SingleProcess AUTOTUNE benchmarking takes 0.1501 seconds and 0.2559 seconds precompiling for 20 choices 2025-12-04T12:10:19.9909315Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d6be29c5a3c48388.xml - 2025-12-04T12:10:19.9909393Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9909994Z FAILED [0.8822s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 2025-12-04T12:10:19.9910001Z 2025-12-04T12:10:19.9910124Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9910403Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9910405Z 2025-12-04T12:10:19.9910509Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9910590Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.9910675Z ================== 1 failed, 187 deselected, 2 rerun in 4.50s ================== 2025-12-04T12:10:19.9910729Z Got exit code 1 2025-12-04T12:10:19.9910787Z Retrying single test... 2025-12-04T12:10:19.9910950Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1a78d76ac71ab253.xml 2025-12-04T12:10:19.9911039Z ============================= test session starts ============================== 2025-12-04T12:10:19.9911169Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9911225Z cachedir: .pytest_cache 2025-12-04T12:10:19.9911400Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9911462Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9911519Z configfile: pytest.ini 2025-12-04T12:10:19.9911714Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9911808Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.9912079Z stepcurrent: skipping 84 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9912139Z Running 1 items in this shard 2025-12-04T12:10:19.9912142Z 2025-12-04T12:10:19.9912370Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.7114s] [100%] 2025-12-04T12:10:19.9912593Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0294s] [100%] 2025-12-04T12:10:19.9912798Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda FAILED [0.8663s] [100%] 2025-12-04T12:10:19.9912821Z 2025-12-04T12:10:19.9912889Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9913060Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9913122Z Traceback (most recent call last): 2025-12-04T12:10:19.9913299Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9913356Z method(*args, **kwargs) 2025-12-04T12:10:19.9913524Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9913580Z method(*args, **kwargs) 2025-12-04T12:10:19.9913748Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9913802Z with policy(): 2025-12-04T12:10:19.9913970Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9914029Z raise RuntimeError(msg) 
2025-12-04T12:10:19.9914431Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:19.9914433Z 2025-12-04T12:10:19.9914526Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9914797Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9914799Z 2025-12-04T12:10:19.9914904Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9914995Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9915055Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9915128Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9915640Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9915757Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9915809Z graph_break [] 2025-12-04T12:10:19.9915900Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9915989Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9916489Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9916553Z current_size = base.storage().size() 2025-12-04T12:10:19.9916611Z Autotune Choices Stats: 2025-12-04T12:10:19.9916997Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:19.9917078Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9917156Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9917292Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9917558Z triton_mm_16 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9917802Z triton_mm_7 0.0069 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9918047Z triton_mm_6 0.0075 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9918285Z triton_mm_14 0.0082 ms 
73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9918524Z triton_mm_10 0.0083 ms 72.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9918764Z triton_mm_17 0.0083 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9919001Z triton_mm_5 0.0086 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9919239Z triton_mm_12 0.0101 ms 59.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9919491Z triton_mm_11 0.0106 ms 57.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9919735Z triton_mm_9 0.0111 ms 54.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9919879Z SingleProcess AUTOTUNE benchmarking takes 0.0991 seconds and 0.4294 seconds precompiling for 20 choices 2025-12-04T12:10:19.9920046Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9920148Z Traceback (most recent call last): 2025-12-04T12:10:19.9920321Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9920379Z method(*args, **kwargs) 2025-12-04T12:10:19.9920547Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9920604Z method(*args, **kwargs) 2025-12-04T12:10:19.9920770Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9920824Z with policy(): 2025-12-04T12:10:19.9920991Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9921047Z raise RuntimeError(msg) 2025-12-04T12:10:19.9921453Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1121976320. 
2025-12-04T12:10:19.9921482Z 2025-12-04T12:10:19.9921573Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9921844Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9921847Z 2025-12-04T12:10:19.9921950Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9922039Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9922097Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9922172Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9922671Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9922787Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9922839Z graph_break [] 2025-12-04T12:10:19.9922918Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9923006Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9923504Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9923569Z current_size = base.storage().size() 2025-12-04T12:10:19.9923626Z Autotune Choices Stats: 2025-12-04T12:10:19.9924022Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:19.9924101Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9924168Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9924303Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9924564Z triton_mm_16 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9924808Z triton_mm_7 0.0069 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9925051Z triton_mm_6 0.0075 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9925292Z triton_mm_14 0.0082 ms 73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9925532Z triton_mm_10 0.0083 ms 72.9% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9925798Z triton_mm_17 0.0083 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9926037Z triton_mm_5 0.0086 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9926274Z triton_mm_12 0.0101 ms 59.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9926512Z triton_mm_11 0.0106 ms 57.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9926755Z triton_mm_9 0.0111 ms 54.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9926900Z SingleProcess AUTOTUNE benchmarking takes 0.0991 seconds and 0.4294 seconds precompiling for 20 choices 2025-12-04T12:10:19.9926988Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9927047Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9927118Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9927233Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9927731Z inductor 
[('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9927795Z graph_break [] 2025-12-04T12:10:19.9927872Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9927961Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9928017Z Autotune Choices Stats: 2025-12-04T12:10:19.9928411Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_26", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00675999978557229, "best_triton_pos": 0} 2025-12-04T12:10:19.9928491Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9928556Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9928693Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9928940Z triton_mm_26 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9929178Z triton_mm_31 0.0068 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9929417Z triton_mm_35 0.0072 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9929672Z triton_mm_25 0.0075 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9929926Z triton_mm_28 0.0078 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9930192Z triton_mm_29 0.0079 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9930430Z triton_mm_33 0.0082 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9930668Z triton_mm_24 0.0086 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9930909Z triton_mm_36 0.0097 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9931146Z triton_mm_30 0.0100 ms 67.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9931291Z SingleProcess AUTOTUNE 
benchmarking takes 0.1301 seconds and 0.2798 seconds precompiling for 20 choices 2025-12-04T12:10:19.9931360Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9931516Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9931579Z Traceback (most recent call last): 2025-12-04T12:10:19.9931764Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9931824Z method(*args, **kwargs) 2025-12-04T12:10:19.9931992Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9932050Z method(*args, **kwargs) 2025-12-04T12:10:19.9932214Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9932269Z with policy(): 2025-12-04T12:10:19.9932449Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9932509Z raise RuntimeError(msg) 2025-12-04T12:10:19.9932909Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 
2025-12-04T12:10:19.9932912Z 2025-12-04T12:10:19.9933003Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9933272Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9933275Z 2025-12-04T12:10:19.9933378Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9933468Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9933539Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9933612Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9934124Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9934237Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9934289Z graph_break [] 2025-12-04T12:10:19.9934367Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9934457Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9934953Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:19.9935020Z current_size = base.storage().size() 2025-12-04T12:10:19.9935078Z Autotune Choices Stats: 2025-12-04T12:10:19.9935460Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:19.9935539Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9935607Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9935742Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9935989Z triton_mm_16 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9936243Z triton_mm_7 0.0069 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9936487Z triton_mm_6 0.0075 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9936736Z triton_mm_14 0.0082 ms 73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9936975Z triton_mm_10 0.0083 ms 72.9% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9937217Z triton_mm_17 0.0083 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9937453Z triton_mm_5 0.0086 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9937691Z triton_mm_12 0.0101 ms 59.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9937955Z triton_mm_11 0.0106 ms 57.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9938197Z triton_mm_9 0.0111 ms 54.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9938342Z SingleProcess AUTOTUNE benchmarking takes 0.0991 seconds and 0.4294 seconds precompiling for 20 choices 2025-12-04T12:10:19.9938431Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9938490Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9938563Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9938679Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9939174Z inductor 
[('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9939228Z graph_break [] 2025-12-04T12:10:19.9939304Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9939393Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9939449Z Autotune Choices Stats: 2025-12-04T12:10:19.9939826Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_26", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00675999978557229, "best_triton_pos": 0} 2025-12-04T12:10:19.9939906Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9939983Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9940178Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9940423Z triton_mm_26 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9940673Z triton_mm_31 0.0068 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9940913Z triton_mm_35 0.0072 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9941157Z triton_mm_25 0.0075 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9941400Z triton_mm_28 0.0078 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9941639Z triton_mm_29 0.0079 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9941891Z triton_mm_33 0.0082 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9942140Z triton_mm_24 0.0086 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9942379Z triton_mm_36 0.0097 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9942617Z triton_mm_30 0.0100 ms 67.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9942760Z SingleProcess AUTOTUNE 
benchmarking takes 0.1301 seconds and 0.2798 seconds precompiling for 20 choices 2025-12-04T12:10:19.9942851Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9942908Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9942981Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9943093Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9943592Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:19.9943645Z graph_break [] 2025-12-04T12:10:19.9943722Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:19.9943811Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:19.9943869Z Autotune Choices Stats: 2025-12-04T12:10:19.9944267Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_55", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:19.9944347Z AUTOTUNE scaled_mm(1x1024, 1024x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:19.9944414Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:19.9944547Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:19.9944804Z triton_mm_55 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9945044Z triton_mm_50 0.0070 ms 97.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9945286Z triton_mm_45 0.0074 ms 93.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9945525Z triton_mm_54 0.0080 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:19.9945768Z triton_mm_44 0.0080 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:19.9946027Z triton_mm_52 0.0083 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9946267Z triton_mm_47 0.0085 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9946504Z triton_mm_48 0.0085 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9946741Z triton_mm_43 0.0090 ms 76.8% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:19.9946981Z triton_mm_49 0.0105 ms 65.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:19.9947126Z SingleProcess AUTOTUNE benchmarking takes 0.1482 seconds and 0.2676 seconds precompiling for 20 choices 2025-12-04T12:10:19.9947331Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1a78d76ac71ab253.xml - 2025-12-04T12:10:19.9947408Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9948004Z FAILED [0.8663s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 2025-12-04T12:10:19.9948008Z 2025-12-04T12:10:19.9948108Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9948380Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9948383Z 2025-12-04T12:10:19.9948486Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9948564Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.9948658Z ================== 1 failed, 187 deselected, 2 rerun in 4.63s ================== 2025-12-04T12:10:19.9948714Z Got exit code 1 2025-12-04T12:10:19.9948931Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9949076Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:19.9949234Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3db912b9c946ec5f.xml 2025-12-04T12:10:19.9949307Z ============================= test session starts ============================== 2025-12-04T12:10:19.9949433Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9949491Z cachedir: .pytest_cache 2025-12-04T12:10:19.9949663Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9949727Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9949794Z configfile: pytest.ini 2025-12-04T12:10:19.9949972Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9950077Z collecting ... collected 188 items / 85 deselected / 103 selected 2025-12-04T12:10:19.9950185Z stepcurrent: skipping 85 already run items. 
2025-12-04T12:10:19.9950245Z Running 103 items in this shard 2025-12-04T12:10:19.9950247Z 2025-12-04T12:10:19.9950472Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8563s] [ 0%] 2025-12-04T12:10:19.9950692Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3714s] [ 0%] 2025-12-04T12:10:19.9950890Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3341s] [ 0%] 2025-12-04T12:10:19.9950893Z 2025-12-04T12:10:19.9950961Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9951115Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9951179Z Traceback (most recent call last): 2025-12-04T12:10:19.9951352Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9951411Z method(*args, **kwargs) 2025-12-04T12:10:19.9951577Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9951634Z method(*args, **kwargs) 2025-12-04T12:10:19.9951800Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9951854Z with policy(): 2025-12-04T12:10:19.9952021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9952080Z raise RuntimeError(msg) 2025-12-04T12:10:19.9952497Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:19.9952501Z 2025-12-04T12:10:19.9952592Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9952862Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9952865Z 2025-12-04T12:10:19.9952978Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9953069Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9953127Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9953201Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9953283Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9953398Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9953450Z graph_break [] 2025-12-04T12:10:19.9953528Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9953678Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9953742Z Traceback (most recent call last): 2025-12-04T12:10:19.9953911Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9953968Z method(*args, **kwargs) 2025-12-04T12:10:19.9954147Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9954217Z method(*args, **kwargs) 2025-12-04T12:10:19.9954381Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9954436Z with policy(): 2025-12-04T12:10:19.9954603Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9954661Z raise RuntimeError(msg) 2025-12-04T12:10:19.9955053Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:19.9955056Z 2025-12-04T12:10:19.9955146Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9955414Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9955417Z 2025-12-04T12:10:19.9955519Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9955608Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9955666Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9955738Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9955819Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9955933Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9955986Z graph_break [] 2025-12-04T12:10:19.9956064Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9956154Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9956212Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9956283Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9956395Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9956489Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9956543Z graph_break [] 2025-12-04T12:10:19.9956617Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9956686Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9956835Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9956898Z Traceback (most recent call last): 2025-12-04T12:10:19.9957077Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9957135Z method(*args, **kwargs) 2025-12-04T12:10:19.9957299Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9957358Z method(*args, **kwargs) 2025-12-04T12:10:19.9957523Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9957576Z with policy(): 2025-12-04T12:10:19.9957743Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9957801Z raise RuntimeError(msg) 2025-12-04T12:10:19.9958194Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:19.9958208Z 2025-12-04T12:10:19.9958297Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9958583Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9958586Z 2025-12-04T12:10:19.9958688Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9958777Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9958835Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9958906Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9958986Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9959100Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9959153Z graph_break [] 2025-12-04T12:10:19.9959229Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9959318Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9959377Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9959447Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9959560Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9959640Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9959694Z graph_break [] 2025-12-04T12:10:19.9959768Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9959857Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9959915Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9959985Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9960126Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9960206Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9960259Z graph_break [] 2025-12-04T12:10:19.9960333Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9960559Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3db912b9c946ec5f.xml - 2025-12-04T12:10:19.9960635Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9961224Z FAILED [0.3341s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:19.9961228Z 2025-12-04T12:10:19.9961317Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9961587Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9961590Z 2025-12-04T12:10:19.9961693Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9961771Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.9961854Z ================== 1 failed, 85 deselected, 2 rerun in 2.58s =================== 2025-12-04T12:10:19.9961906Z Got exit code 1 2025-12-04T12:10:19.9961962Z Retrying single test... 
2025-12-04T12:10:19.9962122Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d2de5e77f4a4aa3c.xml 2025-12-04T12:10:19.9962197Z ============================= test session starts ============================== 2025-12-04T12:10:19.9962339Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9962412Z cachedir: .pytest_cache 2025-12-04T12:10:19.9962583Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9962647Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9962703Z configfile: pytest.ini 2025-12-04T12:10:19.9962881Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9962971Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.9963232Z stepcurrent: skipping 85 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9963292Z Running 1 items in this shard 2025-12-04T12:10:19.9963295Z 2025-12-04T12:10:19.9963519Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.6493s] [100%] 2025-12-04T12:10:19.9963741Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2764s] [100%] 2025-12-04T12:10:19.9963937Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2505s] [100%] 2025-12-04T12:10:19.9963940Z 2025-12-04T12:10:19.9964008Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9964159Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9964223Z Traceback (most recent call last): 2025-12-04T12:10:19.9964393Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9964452Z method(*args, **kwargs) 2025-12-04T12:10:19.9964618Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9964675Z method(*args, **kwargs) 2025-12-04T12:10:19.9964852Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9964906Z with policy(): 2025-12-04T12:10:19.9965073Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9965130Z raise RuntimeError(msg) 
2025-12-04T12:10:19.9965532Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:19.9965536Z 2025-12-04T12:10:19.9965626Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9965978Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9965980Z 2025-12-04T12:10:19.9966082Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9966171Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9966229Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9966303Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9966384Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9966498Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9966566Z graph_break [] 2025-12-04T12:10:19.9966641Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9966803Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9966865Z Traceback (most recent call last): 2025-12-04T12:10:19.9967033Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9967090Z method(*args, **kwargs) 2025-12-04T12:10:19.9967253Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:19.9967310Z method(*args, **kwargs) 2025-12-04T12:10:19.9967473Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9967528Z with policy(): 2025-12-04T12:10:19.9967693Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9967756Z raise RuntimeError(msg) 2025-12-04T12:10:19.9968150Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:19.9968152Z 2025-12-04T12:10:19.9968241Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9968509Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9968511Z 2025-12-04T12:10:19.9968612Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9968702Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9968760Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9968837Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9968918Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9969043Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9969097Z graph_break [] 2025-12-04T12:10:19.9969171Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9969259Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:19.9969317Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9969387Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9969499Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9969590Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9969646Z graph_break [] 2025-12-04T12:10:19.9969720Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9969790Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9969940Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9970004Z Traceback (most recent call last): 2025-12-04T12:10:19.9970203Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9970261Z method(*args, **kwargs) 2025-12-04T12:10:19.9970427Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9970483Z method(*args, **kwargs) 2025-12-04T12:10:19.9970650Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9970717Z with policy(): 2025-12-04T12:10:19.9970884Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9970954Z raise RuntimeError(msg) 2025-12-04T12:10:19.9971345Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:19.9971347Z 2025-12-04T12:10:19.9971435Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9971701Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9971703Z 2025-12-04T12:10:19.9971867Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9971957Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9972016Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9972088Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9972168Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9972281Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9972334Z graph_break [] 2025-12-04T12:10:19.9972408Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9972497Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9972554Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9972624Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9972736Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9972824Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9972878Z graph_break [] 2025-12-04T12:10:19.9972953Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9973041Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9973111Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9973182Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9973295Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9973373Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9973427Z graph_break [] 2025-12-04T12:10:19.9973500Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9973721Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d2de5e77f4a4aa3c.xml - 2025-12-04T12:10:19.9973796Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9974378Z FAILED [0.2505s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:19.9974381Z 2025-12-04T12:10:19.9974469Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9974734Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9974736Z 2025-12-04T12:10:19.9974837Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9974932Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:19.9975029Z ================== 1 failed, 187 deselected, 2 rerun in 2.20s ================== 2025-12-04T12:10:19.9975082Z Got exit code 1 2025-12-04T12:10:19.9975139Z Retrying single test... 
2025-12-04T12:10:19.9975300Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-cf0ff5cea71bc063.xml 2025-12-04T12:10:19.9975373Z ============================= test session starts ============================== 2025-12-04T12:10:19.9975497Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9975555Z cachedir: .pytest_cache 2025-12-04T12:10:19.9975728Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9975791Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9975848Z configfile: pytest.ini 2025-12-04T12:10:19.9976025Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9976117Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:19.9976377Z stepcurrent: skipping 85 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9976436Z Running 1 items in this shard 2025-12-04T12:10:19.9976438Z 2025-12-04T12:10:19.9976662Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.6202s] [100%] 2025-12-04T12:10:19.9976881Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2602s] [100%] 2025-12-04T12:10:19.9977077Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2342s] [100%] 2025-12-04T12:10:19.9977080Z 2025-12-04T12:10:19.9977147Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9977309Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9977371Z Traceback (most recent call last): 2025-12-04T12:10:19.9977541Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9977599Z method(*args, **kwargs) 2025-12-04T12:10:19.9977765Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9977823Z method(*args, **kwargs) 2025-12-04T12:10:19.9977996Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9978052Z with policy(): 2025-12-04T12:10:19.9978217Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9978278Z raise RuntimeError(msg) 
2025-12-04T12:10:19.9978670Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:19.9978673Z 2025-12-04T12:10:19.9978761Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9979026Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9979039Z 2025-12-04T12:10:19.9979140Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9979243Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9979301Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9979373Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9979454Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9979567Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9979619Z graph_break [] 2025-12-04T12:10:19.9979693Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9979844Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9979905Z Traceback (most recent call last): 2025-12-04T12:10:19.9980074Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9980170Z method(*args, **kwargs) 2025-12-04T12:10:19.9980334Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:19.9980396Z method(*args, **kwargs) 2025-12-04T12:10:19.9980561Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9980616Z with policy(): 2025-12-04T12:10:19.9980781Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9980838Z raise RuntimeError(msg) 2025-12-04T12:10:19.9981229Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:19.9981232Z 2025-12-04T12:10:19.9981320Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9981603Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9981605Z 2025-12-04T12:10:19.9981708Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9981799Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9981857Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9981929Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9982010Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9982140Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9982195Z graph_break [] 2025-12-04T12:10:19.9982271Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9982359Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:19.9982419Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9982489Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9982602Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9982682Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9982735Z graph_break [] 2025-12-04T12:10:19.9982809Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9982878Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9983030Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9983093Z Traceback (most recent call last): 2025-12-04T12:10:19.9983275Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9983346Z method(*args, **kwargs) 2025-12-04T12:10:19.9983510Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9983567Z method(*args, **kwargs) 2025-12-04T12:10:19.9983732Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9983785Z with policy(): 2025-12-04T12:10:19.9983951Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9984008Z raise RuntimeError(msg) 2025-12-04T12:10:19.9984402Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:19.9984406Z 2025-12-04T12:10:19.9984494Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9984760Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9984762Z 2025-12-04T12:10:19.9984863Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9984952Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9985010Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9985084Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9985165Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9985277Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9985332Z graph_break [] 2025-12-04T12:10:19.9985407Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9985496Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9985553Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9985634Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9985744Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9985824Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9985876Z graph_break [] 2025-12-04T12:10:19.9985951Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9986039Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9986109Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9986179Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9986290Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9986370Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9986425Z graph_break [] 2025-12-04T12:10:19.9986497Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:19.9986704Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-cf0ff5cea71bc063.xml - 2025-12-04T12:10:19.9986779Z =========================== short test summary info ============================ 2025-12-04T12:10:19.9987353Z FAILED [0.2342s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:19.9987367Z 2025-12-04T12:10:19.9987466Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9987731Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9987733Z 2025-12-04T12:10:19.9987837Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9987914Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:19.9987996Z ================== 1 failed, 187 deselected, 2 rerun in 2.13s ================== 2025-12-04T12:10:19.9988049Z Got exit code 1 2025-12-04T12:10:19.9988267Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:19.9988410Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:19.9988568Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2afa030b80bb72ba.xml 2025-12-04T12:10:19.9988641Z ============================= test session starts ============================== 2025-12-04T12:10:19.9988768Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:19.9988824Z cachedir: .pytest_cache 2025-12-04T12:10:19.9988997Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:19.9989058Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:19.9989114Z configfile: pytest.ini 2025-12-04T12:10:19.9989291Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:19.9989385Z collecting ... collected 188 items / 86 deselected / 102 selected 2025-12-04T12:10:19.9989453Z stepcurrent: skipping 86 already run items. 
2025-12-04T12:10:19.9989516Z Running 102 items in this shard 2025-12-04T12:10:19.9989518Z 2025-12-04T12:10:19.9989756Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5630s] [ 0%] 2025-12-04T12:10:19.9989979Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2508s] [ 0%] 2025-12-04T12:10:19.9990217Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2209s] [ 0%] 2025-12-04T12:10:19.9990220Z 2025-12-04T12:10:19.9990307Z ==================================== RERUNS ==================================== 2025-12-04T12:10:19.9990461Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9990523Z Traceback (most recent call last): 2025-12-04T12:10:19.9990695Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9990754Z method(*args, **kwargs) 2025-12-04T12:10:19.9990921Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9990977Z method(*args, **kwargs) 2025-12-04T12:10:19.9991143Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9991196Z with policy(): 2025-12-04T12:10:19.9991370Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9991430Z raise RuntimeError(msg) 2025-12-04T12:10:19.9991840Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:19.9991857Z 2025-12-04T12:10:19.9991948Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9992215Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9992218Z 2025-12-04T12:10:19.9992320Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9992408Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9992468Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9992540Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9992623Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9992736Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9992789Z graph_break [] 2025-12-04T12:10:19.9992865Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:19.9993018Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9993079Z Traceback (most recent call last): 2025-12-04T12:10:19.9993249Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9993306Z method(*args, **kwargs) 2025-12-04T12:10:19.9993472Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9993527Z method(*args, **kwargs) 2025-12-04T12:10:19.9993692Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9993746Z with policy(): 2025-12-04T12:10:19.9993912Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9993985Z raise RuntimeError(msg) 2025-12-04T12:10:19.9994381Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:19.9994383Z 2025-12-04T12:10:19.9994472Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9994752Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9994755Z 2025-12-04T12:10:19.9994859Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9994948Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9995007Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9995078Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9995157Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9995269Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9995322Z graph_break [] 2025-12-04T12:10:19.9995398Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:19.9995487Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9995545Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9995626Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9995736Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9995829Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9995881Z graph_break [] 2025-12-04T12:10:19.9995957Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:19.9996024Z =================================== FAILURES =================================== 2025-12-04T12:10:19.9996178Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:19.9996242Z Traceback (most recent call last): 2025-12-04T12:10:19.9996409Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9996467Z method(*args, **kwargs) 2025-12-04T12:10:19.9996632Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:19.9996689Z method(*args, **kwargs) 2025-12-04T12:10:19.9996853Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:19.9996908Z with policy(): 2025-12-04T12:10:19.9997074Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:19.9997133Z raise RuntimeError(msg) 2025-12-04T12:10:19.9997527Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:19.9997529Z 2025-12-04T12:10:19.9997621Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:19.9997889Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:19.9997894Z 2025-12-04T12:10:19.9997995Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:19.9998095Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9998154Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9998226Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9998306Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9998419Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9998471Z graph_break [] 2025-12-04T12:10:19.9998548Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:19.9998648Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9998709Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9998779Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9998892Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9998970Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9999025Z graph_break [] 2025-12-04T12:10:19.9999100Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:19.9999189Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:19.9999246Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:19.9999316Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:19.9999426Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:19.9999505Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:19.9999557Z graph_break [] 2025-12-04T12:10:19.9999643Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:19.9999847Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2afa030b80bb72ba.xml - 2025-12-04T12:10:20.0000009Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0000609Z FAILED [0.2209s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.0000613Z 2025-12-04T12:10:20.0000702Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0000971Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0000975Z 2025-12-04T12:10:20.0001076Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0001156Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.0001240Z ================== 1 failed, 86 deselected, 2 rerun in 2.05s =================== 2025-12-04T12:10:20.0001295Z Got exit code 1 2025-12-04T12:10:20.0001351Z Retrying single test... 
2025-12-04T12:10:20.0001511Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f29a278cfdd70ea3.xml 2025-12-04T12:10:20.0001583Z ============================= test session starts ============================== 2025-12-04T12:10:20.0001710Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0001768Z cachedir: .pytest_cache 2025-12-04T12:10:20.0001940Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0002004Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0002062Z configfile: pytest.ini 2025-12-04T12:10:20.0002260Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0002352Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.0002616Z stepcurrent: skipping 86 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0002677Z Running 1 items in this shard 2025-12-04T12:10:20.0002679Z 2025-12-04T12:10:20.0002924Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.6051s] [100%] 2025-12-04T12:10:20.0003145Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2496s] [100%] 2025-12-04T12:10:20.0003347Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2185s] [100%] 2025-12-04T12:10:20.0003349Z 2025-12-04T12:10:20.0003416Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0003576Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0003640Z Traceback (most recent call last): 2025-12-04T12:10:20.0003814Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0003871Z method(*args, **kwargs) 2025-12-04T12:10:20.0004051Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0004122Z method(*args, **kwargs) 2025-12-04T12:10:20.0004288Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0004342Z with policy(): 2025-12-04T12:10:20.0004509Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0004566Z raise RuntimeError(msg) 
2025-12-04T12:10:20.0004964Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.0004966Z 2025-12-04T12:10:20.0005057Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0005325Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0005328Z 2025-12-04T12:10:20.0005431Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0005519Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0005578Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0005649Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0005730Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0005843Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0005896Z graph_break [] 2025-12-04T12:10:20.0005971Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0006125Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0006187Z Traceback (most recent call last): 2025-12-04T12:10:20.0006355Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0006433Z method(*args, **kwargs) 2025-12-04T12:10:20.0006600Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.0006656Z method(*args, **kwargs) 2025-12-04T12:10:20.0006822Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0006877Z with policy(): 2025-12-04T12:10:20.0007053Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0007112Z raise RuntimeError(msg) 2025-12-04T12:10:20.0007506Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.0007510Z 2025-12-04T12:10:20.0007599Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0007864Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0007867Z 2025-12-04T12:10:20.0007970Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0008059Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0008118Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0008201Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0008282Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0008407Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0008461Z graph_break [] 2025-12-04T12:10:20.0008537Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0008627Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.0008686Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0008758Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0008868Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0008948Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0009002Z graph_break [] 2025-12-04T12:10:20.0009077Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0009146Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0009298Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0009360Z Traceback (most recent call last): 2025-12-04T12:10:20.0009528Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0009586Z method(*args, **kwargs) 2025-12-04T12:10:20.0009753Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0009811Z method(*args, **kwargs) 2025-12-04T12:10:20.0009974Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0010027Z with policy(): 2025-12-04T12:10:20.0010233Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0010292Z raise RuntimeError(msg) 2025-12-04T12:10:20.0010702Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.0010705Z 2025-12-04T12:10:20.0010795Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0011061Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0011064Z 2025-12-04T12:10:20.0011178Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0011268Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0011326Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0011396Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0011477Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0011591Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0011643Z graph_break [] 2025-12-04T12:10:20.0011718Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0011806Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0011864Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0011933Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0012047Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0012126Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0012196Z graph_break [] 2025-12-04T12:10:20.0012270Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0012358Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0012430Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0012501Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0012611Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0012691Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0012744Z graph_break [] 2025-12-04T12:10:20.0012817Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0013021Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f29a278cfdd70ea3.xml - 2025-12-04T12:10:20.0013097Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0013680Z FAILED [0.2185s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.0013685Z 2025-12-04T12:10:20.0013773Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0014038Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0014040Z 2025-12-04T12:10:20.0014140Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0014219Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.0014303Z ================== 1 failed, 187 deselected, 2 rerun in 2.09s ================== 2025-12-04T12:10:20.0014360Z Got exit code 1 2025-12-04T12:10:20.0014417Z Retrying single test... 
2025-12-04T12:10:20.0014577Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0cf2ccde8e56fa18.xml 2025-12-04T12:10:20.0014658Z ============================= test session starts ============================== 2025-12-04T12:10:20.0014785Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0014841Z cachedir: .pytest_cache 2025-12-04T12:10:20.0015013Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0015075Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0015132Z configfile: pytest.ini 2025-12-04T12:10:20.0015317Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0015411Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.0015674Z stepcurrent: skipping 86 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0015735Z Running 1 items in this shard 2025-12-04T12:10:20.0015737Z 2025-12-04T12:10:20.0015960Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5683s] [100%] 2025-12-04T12:10:20.0016180Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2506s] [100%] 2025-12-04T12:10:20.0016379Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2269s] [100%] 2025-12-04T12:10:20.0016391Z 2025-12-04T12:10:20.0016458Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0016624Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0016686Z Traceback (most recent call last): 2025-12-04T12:10:20.0016857Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0016914Z method(*args, **kwargs) 2025-12-04T12:10:20.0017082Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0017138Z method(*args, **kwargs) 2025-12-04T12:10:20.0017305Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0017358Z with policy(): 2025-12-04T12:10:20.0017527Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0017586Z raise RuntimeError(msg) 
2025-12-04T12:10:20.0017979Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.0017981Z 2025-12-04T12:10:20.0018071Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0018337Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0018339Z 2025-12-04T12:10:20.0018442Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0018532Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0018591Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0018662Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0018743Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0018866Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0018921Z graph_break [] 2025-12-04T12:10:20.0018996Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0019148Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0019209Z Traceback (most recent call last): 2025-12-04T12:10:20.0019390Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0019447Z method(*args, **kwargs) 2025-12-04T12:10:20.0019613Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.0019671Z method(*args, **kwargs) 2025-12-04T12:10:20.0019834Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0019889Z with policy(): 2025-12-04T12:10:20.0020055Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0020150Z raise RuntimeError(msg) 2025-12-04T12:10:20.0020547Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.0020550Z 2025-12-04T12:10:20.0020654Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0020919Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0020941Z 2025-12-04T12:10:20.0021045Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0021134Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0021192Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0021263Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0021345Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0021457Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0021511Z graph_break [] 2025-12-04T12:10:20.0021585Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0021674Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.0021734Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0021803Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0021916Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0021995Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0022048Z graph_break [] 2025-12-04T12:10:20.0022121Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0022190Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0022343Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0022404Z Traceback (most recent call last): 2025-12-04T12:10:20.0022572Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0022630Z method(*args, **kwargs) 2025-12-04T12:10:20.0022794Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0022850Z method(*args, **kwargs) 2025-12-04T12:10:20.0023033Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0023087Z with policy(): 2025-12-04T12:10:20.0023253Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0023310Z raise RuntimeError(msg) 2025-12-04T12:10:20.0023717Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.0023721Z 2025-12-04T12:10:20.0023812Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0024083Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0024086Z 2025-12-04T12:10:20.0024188Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0024280Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0024338Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0024409Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0024487Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0024605Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0024667Z graph_break [] 2025-12-04T12:10:20.0024744Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0024845Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0024902Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0024971Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0025085Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0025163Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0025218Z graph_break [] 2025-12-04T12:10:20.0025293Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0025381Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0025438Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0025509Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0025618Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0025698Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0025751Z graph_break [] 2025-12-04T12:10:20.0025825Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:20.0026031Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0cf2ccde8e56fa18.xml - 2025-12-04T12:10:20.0026107Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0026691Z FAILED [0.2269s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.0026695Z 2025-12-04T12:10:20.0026782Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0027063Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0027065Z 2025-12-04T12:10:20.0027166Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0027244Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.0027328Z ================== 1 failed, 187 deselected, 2 rerun in 2.06s ================== 2025-12-04T12:10:20.0027383Z Got exit code 1 2025-12-04T12:10:20.0027611Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0027755Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.0027913Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-967c6faa8a8d142c.xml 2025-12-04T12:10:20.0027988Z ============================= test session starts ============================== 2025-12-04T12:10:20.0028117Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0028175Z cachedir: .pytest_cache 2025-12-04T12:10:20.0028348Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0028409Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0028465Z configfile: pytest.ini 2025-12-04T12:10:20.0028641Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0028734Z collecting ... collected 188 items / 87 deselected / 101 selected 2025-12-04T12:10:20.0028814Z stepcurrent: skipping 87 already run items. 
2025-12-04T12:10:20.0028875Z Running 101 items in this shard 2025-12-04T12:10:20.0028892Z 2025-12-04T12:10:20.0029118Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7561s] [ 0%] 2025-12-04T12:10:20.0029339Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3990s] [ 0%] 2025-12-04T12:10:20.0029534Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda FAILED [0.3679s] [ 0%] 2025-12-04T12:10:20.0029536Z 2025-12-04T12:10:20.0029605Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0029758Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0029821Z Traceback (most recent call last): 2025-12-04T12:10:20.0029992Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0030051Z method(*args, **kwargs) 2025-12-04T12:10:20.0030254Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0030311Z method(*args, **kwargs) 2025-12-04T12:10:20.0030479Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0030532Z with policy(): 2025-12-04T12:10:20.0030698Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0030755Z raise RuntimeError(msg) 2025-12-04T12:10:20.0031148Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1017118720. 2025-12-04T12:10:20.0031152Z 2025-12-04T12:10:20.0031254Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0031519Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0031522Z 2025-12-04T12:10:20.0031624Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0031713Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0031770Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0031856Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0032360Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0032474Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0032527Z graph_break [] 2025-12-04T12:10:20.0032602Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0032691Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0033191Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0033282Z current_size = base.storage().size() 2025-12-04T12:10:20.0033339Z Autotune Choices Stats: 2025-12-04T12:10:20.0033723Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009720000438392162, "best_triton_pos": 0} 2025-12-04T12:10:20.0033793Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0033858Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0033995Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0034246Z triton_mm_0 0.0097 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0034307Z _scaled_mm 0.0295 ms 33.0% 2025-12-04T12:10:20.0034449Z SingleProcess AUTOTUNE benchmarking takes 0.0142 seconds and 0.0644 seconds precompiling for 2 choices 2025-12-04T12:10:20.0034603Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0034663Z Traceback (most recent call last): 2025-12-04T12:10:20.0034834Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0034890Z method(*args, **kwargs) 2025-12-04T12:10:20.0035055Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0035111Z method(*args, **kwargs) 2025-12-04T12:10:20.0035277Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0035330Z with policy(): 2025-12-04T12:10:20.0035497Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0035554Z raise RuntimeError(msg) 2025-12-04T12:10:20.0035960Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1017118720 and is now 1046478848. 2025-12-04T12:10:20.0035962Z 2025-12-04T12:10:20.0036052Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0036330Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0036333Z 2025-12-04T12:10:20.0036436Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0036526Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0036585Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0036658Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0037154Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0037268Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0037338Z graph_break [] 2025-12-04T12:10:20.0037413Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0037502Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0038010Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0038074Z current_size = base.storage().size() 2025-12-04T12:10:20.0038133Z Autotune Choices Stats: 2025-12-04T12:10:20.0038513Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009720000438392162, "best_triton_pos": 0} 2025-12-04T12:10:20.0038584Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0038648Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0038785Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0039032Z triton_mm_0 0.0097 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0039091Z _scaled_mm 0.0295 ms 33.0% 
2025-12-04T12:10:20.0039232Z SingleProcess AUTOTUNE benchmarking takes 0.0142 seconds and 0.0644 seconds precompiling for 2 choices 2025-12-04T12:10:20.0039321Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0039379Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0039451Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0039563Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0040064Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0040155Z graph_break [] 2025-12-04T12:10:20.0040230Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0040318Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0040375Z Autotune Choices Stats: 2025-12-04T12:10:20.0040761Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060789999552071095, "best_triton_pos": 0} 2025-12-04T12:10:20.0040832Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0040897Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0041031Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0041276Z triton_mm_1 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0041333Z _scaled_mm 0.0290 ms 21.0% 2025-12-04T12:10:20.0041476Z SingleProcess AUTOTUNE benchmarking takes 0.0126 seconds and 0.0577 seconds precompiling for 2 choices 2025-12-04T12:10:20.0041544Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0041710Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0041787Z Traceback (most recent call last): 2025-12-04T12:10:20.0041959Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0042016Z method(*args, **kwargs) 2025-12-04T12:10:20.0042184Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0042240Z method(*args, **kwargs) 2025-12-04T12:10:20.0042405Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0042459Z with policy(): 2025-12-04T12:10:20.0042628Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0042686Z raise RuntimeError(msg) 2025-12-04T12:10:20.0043080Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1046478848 and is now 1075838976. 
2025-12-04T12:10:20.0043084Z 2025-12-04T12:10:20.0043175Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0043439Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0043441Z 2025-12-04T12:10:20.0043544Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0043633Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0043693Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0043767Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0044276Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0044389Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0044441Z graph_break [] 2025-12-04T12:10:20.0044517Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0044606Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0045112Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0045178Z current_size = base.storage().size() 2025-12-04T12:10:20.0045236Z Autotune Choices Stats: 2025-12-04T12:10:20.0045611Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009720000438392162, "best_triton_pos": 0} 2025-12-04T12:10:20.0045680Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0045744Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0045879Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0046137Z triton_mm_0 0.0097 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0046209Z _scaled_mm 0.0295 ms 33.0% 2025-12-04T12:10:20.0046350Z SingleProcess AUTOTUNE benchmarking takes 0.0142 seconds and 0.0644 seconds precompiling for 2 choices 2025-12-04T12:10:20.0046438Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0046498Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0046573Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0046689Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0047177Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0047233Z graph_break [] 2025-12-04T12:10:20.0047309Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0047398Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0047454Z Autotune Choices Stats: 2025-12-04T12:10:20.0047829Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060789999552071095, "best_triton_pos": 0} 2025-12-04T12:10:20.0047898Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0047963Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0048100Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0048353Z triton_mm_1 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0048413Z _scaled_mm 0.0290 ms 21.0% 2025-12-04T12:10:20.0048554Z SingleProcess AUTOTUNE benchmarking takes 0.0126 seconds and 0.0577 seconds precompiling for 2 choices 2025-12-04T12:10:20.0048644Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0048701Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0048786Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0048899Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0049389Z inductor [('triton_bundler_save_kernel', 16), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0049446Z graph_break [] 2025-12-04T12:10:20.0049520Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0049609Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0049665Z Autotune Choices Stats: 2025-12-04T12:10:20.0050035Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006800000090152025, "best_triton_pos": 0} 2025-12-04T12:10:20.0050169Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0050234Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0050368Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0050613Z triton_mm_2 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0050670Z _scaled_mm 0.0287 ms 23.7% 2025-12-04T12:10:20.0050813Z SingleProcess AUTOTUNE benchmarking takes 0.0130 seconds and 0.0561 seconds precompiling for 2 choices 2025-12-04T12:10:20.0051016Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-967c6faa8a8d142c.xml - 2025-12-04T12:10:20.0051095Z =========================== short test summary info 
============================ 2025-12-04T12:10:20.0051679Z FAILED [0.3679s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1046478848 and is now 1075838976. 2025-12-04T12:10:20.0051683Z 2025-12-04T12:10:20.0051771Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0052038Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0052040Z 2025-12-04T12:10:20.0052141Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0052220Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.0052303Z ================== 1 failed, 87 deselected, 2 rerun in 2.54s =================== 2025-12-04T12:10:20.0052357Z Got exit code 1 2025-12-04T12:10:20.0052426Z Retrying single test... 
2025-12-04T12:10:20.0052585Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-192c07624aa8e53b.xml 2025-12-04T12:10:20.0052660Z ============================= test session starts ============================== 2025-12-04T12:10:20.0052787Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0052845Z cachedir: .pytest_cache 2025-12-04T12:10:20.0053037Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0053098Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0053156Z configfile: pytest.ini 2025-12-04T12:10:20.0053338Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0053428Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.0053691Z stepcurrent: skipping 87 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0053749Z Running 1 items in this shard 2025-12-04T12:10:20.0053751Z 2025-12-04T12:10:20.0053972Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7702s] [100%] 2025-12-04T12:10:20.0054192Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.4023s] [100%] 2025-12-04T12:10:20.0054405Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda FAILED [0.3669s] [100%] 2025-12-04T12:10:20.0054422Z 2025-12-04T12:10:20.0054490Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0054644Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0054705Z Traceback (most recent call last): 2025-12-04T12:10:20.0054879Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0054936Z method(*args, **kwargs) 2025-12-04T12:10:20.0055103Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0055162Z method(*args, **kwargs) 2025-12-04T12:10:20.0055329Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0055386Z with policy(): 2025-12-04T12:10:20.0055554Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0055613Z raise RuntimeError(msg) 
2025-12-04T12:10:20.0056005Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1017118720. 2025-12-04T12:10:20.0056007Z 2025-12-04T12:10:20.0056097Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0056365Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0056471Z 2025-12-04T12:10:20.0056574Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0056665Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0056726Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0056812Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0057309Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0057435Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0057488Z graph_break [] 2025-12-04T12:10:20.0057567Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0057655Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0058151Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: 
UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0058215Z current_size = base.storage().size() 2025-12-04T12:10:20.0058272Z Autotune Choices Stats: 2025-12-04T12:10:20.0058650Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009600000455975533, "best_triton_pos": 0} 2025-12-04T12:10:20.0058734Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0058811Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0058948Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0059195Z triton_mm_0 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0059252Z _scaled_mm 0.0301 ms 31.9% 2025-12-04T12:10:20.0059395Z SingleProcess AUTOTUNE benchmarking takes 0.0144 seconds and 0.0637 seconds precompiling for 2 choices 2025-12-04T12:10:20.0059547Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0059611Z Traceback (most recent call last): 2025-12-04T12:10:20.0059783Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0059841Z method(*args, **kwargs) 2025-12-04T12:10:20.0060006Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0060064Z method(*args, **kwargs) 2025-12-04T12:10:20.0060266Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0060320Z with policy(): 2025-12-04T12:10:20.0060485Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0060543Z raise RuntimeError(msg) 2025-12-04T12:10:20.0060939Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1017118720 and is now 1046478848. 2025-12-04T12:10:20.0060945Z 2025-12-04T12:10:20.0061033Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0061310Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0061313Z 2025-12-04T12:10:20.0061414Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0061505Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0061563Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0061636Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0062139Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0062257Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0062310Z graph_break [] 2025-12-04T12:10:20.0062386Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0062474Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0062975Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0063050Z current_size = base.storage().size() 2025-12-04T12:10:20.0063106Z Autotune Choices Stats: 2025-12-04T12:10:20.0063497Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009600000455975533, "best_triton_pos": 0} 2025-12-04T12:10:20.0063565Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0063628Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0063763Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0064009Z triton_mm_0 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0064067Z _scaled_mm 0.0301 ms 31.9% 
2025-12-04T12:10:20.0064210Z SingleProcess AUTOTUNE benchmarking takes 0.0144 seconds and 0.0637 seconds precompiling for 2 choices 2025-12-04T12:10:20.0064301Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0064362Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0064435Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0064550Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0065043Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0065097Z graph_break [] 2025-12-04T12:10:20.0065174Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0065263Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0065320Z Autotune Choices Stats: 2025-12-04T12:10:20.0065699Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007199999876320362, "best_triton_pos": 0} 2025-12-04T12:10:20.0065767Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0065830Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0065975Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0066218Z triton_mm_1 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0066277Z _scaled_mm 0.0261 ms 27.6% 2025-12-04T12:10:20.0066418Z SingleProcess AUTOTUNE benchmarking takes 0.0128 seconds and 0.0554 seconds precompiling for 2 choices 2025-12-04T12:10:20.0066489Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0066642Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0066704Z Traceback (most recent call last): 2025-12-04T12:10:20.0066876Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0066932Z method(*args, **kwargs) 2025-12-04T12:10:20.0067100Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0067167Z method(*args, **kwargs) 2025-12-04T12:10:20.0067332Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0067396Z with policy(): 2025-12-04T12:10:20.0067564Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0067622Z raise RuntimeError(msg) 2025-12-04T12:10:20.0068019Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1046478848 and is now 1075838976. 
2025-12-04T12:10:20.0068021Z 2025-12-04T12:10:20.0068110Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0068379Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0068382Z 2025-12-04T12:10:20.0068486Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0068574Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0068633Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0068705Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0069198Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0069311Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0069367Z graph_break [] 2025-12-04T12:10:20.0069442Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0069532Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0070045Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0070149Z current_size = base.storage().size() 2025-12-04T12:10:20.0070206Z Autotune Choices Stats: 2025-12-04T12:10:20.0070599Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009600000455975533, "best_triton_pos": 0} 2025-12-04T12:10:20.0070669Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0070733Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0070870Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0071112Z triton_mm_0 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0071170Z _scaled_mm 0.0301 ms 31.9% 2025-12-04T12:10:20.0071311Z SingleProcess AUTOTUNE benchmarking takes 0.0144 seconds and 0.0637 seconds precompiling for 2 choices 2025-12-04T12:10:20.0071400Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0071474Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0071561Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0071674Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0072174Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0072227Z graph_break [] 2025-12-04T12:10:20.0072301Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0072393Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0072451Z Autotune Choices Stats: 2025-12-04T12:10:20.0072824Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007199999876320362, "best_triton_pos": 0} 2025-12-04T12:10:20.0072893Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0072958Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0073094Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0073337Z triton_mm_1 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0073393Z _scaled_mm 0.0261 ms 27.6% 2025-12-04T12:10:20.0073535Z SingleProcess AUTOTUNE benchmarking takes 0.0128 seconds and 0.0554 seconds precompiling for 2 choices 2025-12-04T12:10:20.0073624Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0073685Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0073757Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0073883Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0074374Z inductor [('triton_bundler_save_kernel', 16), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0074437Z graph_break [] 2025-12-04T12:10:20.0074514Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0074603Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0074662Z Autotune Choices Stats: 2025-12-04T12:10:20.0075035Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00803999975323677, "best_triton_pos": 0} 2025-12-04T12:10:20.0075103Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0075165Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0075301Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0075544Z triton_mm_2 0.0080 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0075614Z _scaled_mm 0.0284 ms 28.3% 2025-12-04T12:10:20.0075770Z SingleProcess AUTOTUNE benchmarking takes 0.0139 seconds and 0.0521 seconds precompiling for 2 choices 2025-12-04T12:10:20.0075974Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-192c07624aa8e53b.xml - 2025-12-04T12:10:20.0076050Z =========================== short test summary info 
============================ 2025-12-04T12:10:20.0076636Z FAILED [0.3669s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1046478848 and is now 1075838976. 2025-12-04T12:10:20.0076640Z 2025-12-04T12:10:20.0076728Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0076997Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0077000Z 2025-12-04T12:10:20.0077102Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0077179Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.0077264Z ================== 1 failed, 187 deselected, 2 rerun in 2.56s ================== 2025-12-04T12:10:20.0077318Z Got exit code 1 2025-12-04T12:10:20.0077374Z Retrying single test... 
2025-12-04T12:10:20.0077534Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d02c8ff867b1826b.xml 2025-12-04T12:10:20.0077607Z ============================= test session starts ============================== 2025-12-04T12:10:20.0077734Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0077792Z cachedir: .pytest_cache 2025-12-04T12:10:20.0077975Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0078039Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0078097Z configfile: pytest.ini 2025-12-04T12:10:20.0078274Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0078365Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.0078638Z stepcurrent: skipping 87 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0078701Z Running 1 items in this shard 2025-12-04T12:10:20.0078703Z 2025-12-04T12:10:20.0078922Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0494s] [100%] 2025-12-04T12:10:20.0079147Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.5314s] [100%] 2025-12-04T12:10:20.0079342Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda FAILED [0.5747s] [100%] 2025-12-04T12:10:20.0079345Z 2025-12-04T12:10:20.0079413Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0079565Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0079629Z Traceback (most recent call last): 2025-12-04T12:10:20.0079813Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0079882Z method(*args, **kwargs) 2025-12-04T12:10:20.0080050Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0080143Z method(*args, **kwargs) 2025-12-04T12:10:20.0080311Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0080364Z with policy(): 2025-12-04T12:10:20.0080531Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0080588Z raise RuntimeError(msg) 
2025-12-04T12:10:20.0080985Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1017118720. 2025-12-04T12:10:20.0080989Z 2025-12-04T12:10:20.0081078Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0081349Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0081352Z 2025-12-04T12:10:20.0081454Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0081543Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0081602Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0081675Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0082168Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0082297Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0082354Z graph_break [] 2025-12-04T12:10:20.0082429Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0082518Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0083023Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: 
UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0083088Z current_size = base.storage().size() 2025-12-04T12:10:20.0083146Z Autotune Choices Stats: 2025-12-04T12:10:20.0083528Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.0083595Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0083658Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0083795Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0084040Z triton_mm_0 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0084111Z _scaled_mm 0.0295 ms 23.2% 2025-12-04T12:10:20.0084253Z SingleProcess AUTOTUNE benchmarking takes 0.0158 seconds and 0.0715 seconds precompiling for 2 choices 2025-12-04T12:10:20.0084428Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0084490Z Traceback (most recent call last): 2025-12-04T12:10:20.0084660Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0084716Z method(*args, **kwargs) 2025-12-04T12:10:20.0084883Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0084939Z method(*args, **kwargs) 2025-12-04T12:10:20.0085107Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0085161Z with policy(): 2025-12-04T12:10:20.0085328Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0085385Z raise RuntimeError(msg) 2025-12-04T12:10:20.0085785Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1017118720 and is now 1046478848. 2025-12-04T12:10:20.0085788Z 2025-12-04T12:10:20.0085876Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0086143Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0086146Z 2025-12-04T12:10:20.0086249Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0086338Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0086397Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0086470Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0086975Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0087091Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0087147Z graph_break [] 2025-12-04T12:10:20.0087233Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0087323Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0087819Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0087883Z current_size = base.storage().size() 2025-12-04T12:10:20.0087939Z Autotune Choices Stats: 2025-12-04T12:10:20.0088314Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.0088383Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0088458Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0088595Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0088852Z triton_mm_0 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0088910Z _scaled_mm 0.0295 ms 23.2% 
2025-12-04T12:10:20.0089051Z SingleProcess AUTOTUNE benchmarking takes 0.0158 seconds and 0.0715 seconds precompiling for 2 choices 2025-12-04T12:10:20.0089141Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0089198Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0089272Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0089386Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0089878Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0089934Z graph_break [] 2025-12-04T12:10:20.0090008Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0091409Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0091468Z Autotune Choices Stats: 2025-12-04T12:10:20.0091844Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009518999606370926, "best_triton_pos": 0} 2025-12-04T12:10:20.0091914Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0091978Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0092140Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0092386Z triton_mm_1 0.0095 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0092444Z _scaled_mm 0.0289 ms 32.9% 2025-12-04T12:10:20.0092586Z SingleProcess AUTOTUNE benchmarking takes 0.0137 seconds and 0.0609 seconds precompiling for 2 choices 2025-12-04T12:10:20.0092674Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0092827Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0092891Z Traceback (most recent call last): 2025-12-04T12:10:20.0093063Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0093120Z method(*args, **kwargs) 2025-12-04T12:10:20.0093288Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0093346Z method(*args, **kwargs) 2025-12-04T12:10:20.0093511Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0093567Z with policy(): 2025-12-04T12:10:20.0093734Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0093794Z raise RuntimeError(msg) 2025-12-04T12:10:20.0094188Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1046478848 and is now 1153433600. 
2025-12-04T12:10:20.0094218Z 2025-12-04T12:10:20.0094310Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0094577Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0094580Z 2025-12-04T12:10:20.0094683Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0094772Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0094831Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0094905Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0095399Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0095516Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0095568Z graph_break [] 2025-12-04T12:10:20.0095643Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0095731Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0096229Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0096295Z current_size = base.storage().size() 2025-12-04T12:10:20.0096352Z Autotune Choices Stats: 2025-12-04T12:10:20.0096740Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.0096809Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0096874Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0097020Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0097265Z triton_mm_0 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0097324Z _scaled_mm 0.0295 ms 23.2% 2025-12-04T12:10:20.0097467Z SingleProcess AUTOTUNE benchmarking takes 0.0158 seconds and 0.0715 seconds precompiling for 2 choices 2025-12-04T12:10:20.0097554Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0097612Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0097683Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0097796Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0098286Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0098362Z graph_break [] 2025-12-04T12:10:20.0098437Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0098526Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0098584Z Autotune Choices Stats: 2025-12-04T12:10:20.0098952Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009518999606370926, "best_triton_pos": 0} 2025-12-04T12:10:20.0099020Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0099083Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0099221Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0099467Z triton_mm_1 0.0095 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0099527Z _scaled_mm 0.0289 ms 32.9% 2025-12-04T12:10:20.0099667Z SingleProcess AUTOTUNE benchmarking takes 0.0137 seconds and 0.0609 seconds precompiling for 2 choices 2025-12-04T12:10:20.0099756Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0099814Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0099886Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0099999Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0100492Z inductor [('triton_bundler_save_kernel', 8), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('async_compile_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0100548Z graph_break [] 2025-12-04T12:10:20.0100636Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:20.0100726Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0100784Z Autotune Choices Stats: 2025-12-04T12:10:20.0101279Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "_scaled_mm", "best_time": 0.007000000216066837, "best_triton_pos": 1, "best_triton_time": 0.0077599999494850636, "best_triton_kernel": "triton_mm_2", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1"} 2025-12-04T12:10:20.0101347Z AUTOTUNE scaled_mm(1x32, 32x16, 1x1, 1x16, 16) 2025-12-04T12:10:20.0101412Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0101546Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0101606Z _scaled_mm 0.0070 ms 100.0% 2025-12-04T12:10:20.0101849Z triton_mm_2 0.0078 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0101990Z SingleProcess AUTOTUNE benchmarking takes 0.0173 seconds and 0.1671 seconds precompiling for 2 choices 2025-12-04T12:10:20.0102192Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d02c8ff867b1826b.xml - 
2025-12-04T12:10:20.0102270Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0102873Z FAILED [0.5747s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1046478848 and is now 1153433600. 2025-12-04T12:10:20.0102890Z 2025-12-04T12:10:20.0102979Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0103248Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0103250Z 2025-12-04T12:10:20.0103352Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0103431Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.0103514Z ================== 1 failed, 187 deselected, 2 rerun in 3.18s ================== 2025-12-04T12:10:20.0103571Z Got exit code 1 2025-12-04T12:10:20.0103785Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0103928Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.0104085Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7392d295b70dbd12.xml 2025-12-04T12:10:20.0104158Z ============================= test session starts ============================== 2025-12-04T12:10:20.0104286Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0104343Z cachedir: .pytest_cache 2025-12-04T12:10:20.0104518Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0104581Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0104638Z configfile: pytest.ini 2025-12-04T12:10:20.0104827Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0104921Z collecting ... collected 188 items / 88 deselected / 100 selected 2025-12-04T12:10:20.0104990Z stepcurrent: skipping 88 already run items. 
2025-12-04T12:10:20.0105050Z Running 100 items in this shard 2025-12-04T12:10:20.0105052Z 2025-12-04T12:10:20.0105279Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9855s] [ 1%] 2025-12-04T12:10:20.0105513Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7485s] [ 1%] 2025-12-04T12:10:20.0105711Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.6472s] [ 1%] 2025-12-04T12:10:20.0105716Z 2025-12-04T12:10:20.0105784Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0105937Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0105998Z Traceback (most recent call last): 2025-12-04T12:10:20.0106173Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0106229Z method(*args, **kwargs) 2025-12-04T12:10:20.0106397Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0106453Z method(*args, **kwargs) 2025-12-04T12:10:20.0106632Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0106696Z with policy(): 2025-12-04T12:10:20.0106863Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0106921Z raise RuntimeError(msg) 2025-12-04T12:10:20.0107319Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:20.0107321Z 2025-12-04T12:10:20.0107412Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0107681Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0107684Z 2025-12-04T12:10:20.0107788Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0107876Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0107936Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0108009Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0108503Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0108617Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0108671Z graph_break [] 2025-12-04T12:10:20.0108749Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0108838Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0109344Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0109409Z current_size = base.storage().size() 2025-12-04T12:10:20.0109466Z Autotune Choices Stats: 2025-12-04T12:10:20.0109854Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.0109933Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0109997Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0110172Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0110420Z triton_mm_5 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0110659Z triton_mm_6 0.0069 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0110898Z triton_mm_4 0.0075 ms 91.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0111167Z triton_mm_3 0.0076 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0111404Z triton_mm_0 0.0081 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0111638Z triton_mm_2 0.0081 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0111874Z triton_mm_1 0.0087 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0112111Z triton_mm_7 0.0100 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0112170Z _scaled_mm 0.0297 ms 23.0% 2025-12-04T12:10:20.0112313Z SingleProcess AUTOTUNE benchmarking takes 0.0483 seconds and 0.1850 seconds precompiling for 9 choices 2025-12-04T12:10:20.0112467Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0112528Z Traceback (most recent call last): 2025-12-04T12:10:20.0112700Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0112757Z method(*args, **kwargs) 2025-12-04T12:10:20.0112924Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0112981Z method(*args, **kwargs) 2025-12-04T12:10:20.0113145Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0113213Z with policy(): 2025-12-04T12:10:20.0113380Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0113438Z raise RuntimeError(msg) 2025-12-04T12:10:20.0113850Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:20.0113852Z 2025-12-04T12:10:20.0113944Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0114212Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0114216Z 2025-12-04T12:10:20.0114319Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0114407Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0114466Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0114539Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0115031Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0115158Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), 
('ok', 1)] 2025-12-04T12:10:20.0115223Z graph_break [] 2025-12-04T12:10:20.0115301Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0115390Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0115884Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0115947Z current_size = base.storage().size() 2025-12-04T12:10:20.0116007Z Autotune Choices Stats: 2025-12-04T12:10:20.0116384Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.0116462Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0116527Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0116663Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0116907Z triton_mm_5 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0117145Z triton_mm_6 0.0069 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0117395Z triton_mm_4 0.0075 
ms 91.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0117633Z triton_mm_3 0.0076 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0117873Z triton_mm_0 0.0081 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0118128Z triton_mm_2 0.0081 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0118367Z triton_mm_1 0.0087 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0118604Z triton_mm_7 0.0100 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0118661Z _scaled_mm 0.0297 ms 23.0% 2025-12-04T12:10:20.0118803Z SingleProcess AUTOTUNE benchmarking takes 0.0483 seconds and 0.1850 seconds precompiling for 9 choices 2025-12-04T12:10:20.0118893Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0118951Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0119035Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0119151Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0119651Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0119704Z graph_break [] 2025-12-04T12:10:20.0119780Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0119867Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0119924Z Autotune Choices Stats: 2025-12-04T12:10:20.0120331Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007199999876320362, "best_triton_pos": 0} 2025-12-04T12:10:20.0120407Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0120471Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0120609Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0120855Z triton_mm_12 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0121094Z triton_mm_8 0.0075 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 
2025-12-04T12:10:20.0121333Z triton_mm_15 0.0077 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0121591Z triton_mm_11 0.0078 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0121829Z triton_mm_10 0.0080 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0122077Z triton_mm_13 0.0090 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0122317Z triton_mm_9 0.0094 ms 76.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0122555Z triton_mm_14 0.0101 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0122612Z _scaled_mm 0.0294 ms 24.5% 2025-12-04T12:10:20.0122755Z SingleProcess AUTOTUNE benchmarking takes 0.0454 seconds and 0.0786 seconds precompiling for 9 choices 2025-12-04T12:10:20.0122824Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0122978Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0123053Z 
Traceback (most recent call last): 2025-12-04T12:10:20.0123224Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0123295Z method(*args, **kwargs) 2025-12-04T12:10:20.0123464Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0123521Z method(*args, **kwargs) 2025-12-04T12:10:20.0123687Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0123740Z with policy(): 2025-12-04T12:10:20.0123907Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0123964Z raise RuntimeError(msg) 2025-12-04T12:10:20.0124363Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.0124367Z 2025-12-04T12:10:20.0124457Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0124727Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0124729Z 2025-12-04T12:10:20.0124833Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0124920Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0124980Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0125053Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0125545Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0125672Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0125727Z graph_break [] 2025-12-04T12:10:20.0125803Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0125892Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0126398Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0126463Z current_size = base.storage().size() 2025-12-04T12:10:20.0126520Z Autotune Choices Stats: 2025-12-04T12:10:20.0126895Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.0126970Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0127033Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0127169Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0127412Z triton_mm_5 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0127673Z triton_mm_6 0.0069 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0127911Z triton_mm_4 0.0075 ms 91.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0128149Z triton_mm_3 0.0076 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0128389Z triton_mm_0 0.0081 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0128626Z triton_mm_2 0.0081 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0128863Z triton_mm_1 0.0087 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0129099Z triton_mm_7 0.0100 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0129157Z _scaled_mm 0.0297 ms 23.0% 2025-12-04T12:10:20.0129300Z SingleProcess AUTOTUNE benchmarking takes 0.0483 seconds and 0.1850 seconds precompiling for 9 choices 2025-12-04T12:10:20.0129390Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0129450Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0129522Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0129647Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0130170Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0130224Z graph_break [] 2025-12-04T12:10:20.0130314Z 
aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0130405Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0130460Z Autotune Choices Stats: 2025-12-04T12:10:20.0130839Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007199999876320362, "best_triton_pos": 0} 2025-12-04T12:10:20.0130913Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0130977Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0131112Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0131363Z triton_mm_12 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0131616Z triton_mm_8 0.0075 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0131867Z triton_mm_15 0.0077 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0132109Z triton_mm_11 0.0078 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0132346Z triton_mm_10 0.0080 ms 90.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0132586Z triton_mm_13 0.0090 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0132825Z triton_mm_9 0.0094 ms 76.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0133062Z triton_mm_14 0.0101 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0133119Z _scaled_mm 0.0294 ms 24.5% 2025-12-04T12:10:20.0133261Z SingleProcess AUTOTUNE benchmarking takes 0.0454 seconds and 0.0786 seconds precompiling for 9 choices 2025-12-04T12:10:20.0133350Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0133408Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0133481Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0133595Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0134106Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0134159Z graph_break [] 
2025-12-04T12:10:20.0134236Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0134341Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0134400Z Autotune Choices Stats: 2025-12-04T12:10:20.0134773Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.005840000230818987, "best_triton_pos": 0} 2025-12-04T12:10:20.0134847Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0134911Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0135046Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0135296Z triton_mm_17 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0135546Z triton_mm_20 0.0065 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0135796Z triton_mm_22 0.0081 ms 72.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0136034Z triton_mm_16 0.0084 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0136273Z triton_mm_21 0.0085 ms 68.5% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0136511Z triton_mm_18 0.0088 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0136750Z triton_mm_19 0.0094 ms 62.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0136988Z triton_mm_23 0.0098 ms 59.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0137044Z _scaled_mm 0.0268 ms 21.8% 2025-12-04T12:10:20.0137188Z SingleProcess AUTOTUNE benchmarking takes 0.0615 seconds and 0.2020 seconds precompiling for 9 choices 2025-12-04T12:10:20.0137390Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7392d295b70dbd12.xml - 2025-12-04T12:10:20.0137468Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0138068Z FAILED [0.6472s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.0138071Z 2025-12-04T12:10:20.0138161Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0138444Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0138448Z 2025-12-04T12:10:20.0138550Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0138629Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.0138712Z ================== 1 failed, 88 deselected, 2 rerun in 3.40s =================== 2025-12-04T12:10:20.0138766Z Got exit code 1 2025-12-04T12:10:20.0138821Z Retrying single test... 2025-12-04T12:10:20.0138980Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-eac345264c04a5dd.xml 2025-12-04T12:10:20.0139054Z ============================= test session starts ============================== 2025-12-04T12:10:20.0139181Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0139237Z cachedir: .pytest_cache 2025-12-04T12:10:20.0139412Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0139484Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0139550Z configfile: pytest.ini 2025-12-04T12:10:20.0139730Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0139821Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.0140083Z stepcurrent: skipping 88 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0140177Z Running 1 items in this shard 2025-12-04T12:10:20.0140179Z 2025-12-04T12:10:20.0140403Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.3407s] [100%] 2025-12-04T12:10:20.0140623Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8283s] [100%] 2025-12-04T12:10:20.0140822Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.8172s] [100%] 2025-12-04T12:10:20.0140824Z 2025-12-04T12:10:20.0140893Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0141045Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0141107Z Traceback (most recent call last): 2025-12-04T12:10:20.0141279Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0141336Z method(*args, **kwargs) 2025-12-04T12:10:20.0141504Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0141561Z method(*args, **kwargs) 2025-12-04T12:10:20.0141728Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0141782Z with policy(): 2025-12-04T12:10:20.0141964Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0142023Z raise RuntimeError(msg) 
2025-12-04T12:10:20.0142420Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:20.0142422Z 2025-12-04T12:10:20.0142512Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0142795Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0142800Z 2025-12-04T12:10:20.0142902Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0142993Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0143051Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0143127Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0143623Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0143737Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0143804Z graph_break [] 2025-12-04T12:10:20.0143881Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0143984Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0144481Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0144545Z current_size = base.storage().size() 2025-12-04T12:10:20.0144603Z Autotune Choices Stats: 2025-12-04T12:10:20.0144984Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:20.0145059Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0145124Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0145260Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0145508Z triton_mm_6 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0145747Z triton_mm_2 0.0072 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0145986Z triton_mm_7 0.0072 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0146234Z triton_mm_1 0.0075 ms 79.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0146473Z triton_mm_3 0.0088 ms 68.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0146720Z triton_mm_0 0.0094 ms 63.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0146957Z triton_mm_4 0.0094 ms 63.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0147197Z triton_mm_5 0.0096 ms 62.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0147254Z _scaled_mm 0.0313 ms 19.1% 2025-12-04T12:10:20.0147398Z SingleProcess AUTOTUNE benchmarking takes 0.0481 seconds and 0.2034 seconds precompiling for 9 choices 2025-12-04T12:10:20.0147552Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0147615Z Traceback (most recent call last): 2025-12-04T12:10:20.0147785Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0147855Z method(*args, **kwargs) 2025-12-04T12:10:20.0148022Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0148100Z 
method(*args, **kwargs) 2025-12-04T12:10:20.0148267Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0148321Z with policy(): 2025-12-04T12:10:20.0148488Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0148545Z raise RuntimeError(msg) 2025-12-04T12:10:20.0148945Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:20.0148948Z 2025-12-04T12:10:20.0149038Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0149312Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0149314Z 2025-12-04T12:10:20.0149416Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0149506Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0149564Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0149638Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0150166Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0150281Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0150335Z graph_break [] 2025-12-04T12:10:20.0150429Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0150520Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0151015Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0151093Z current_size = base.storage().size() 2025-12-04T12:10:20.0151151Z Autotune Choices Stats: 2025-12-04T12:10:20.0151529Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:20.0151605Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0151668Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0151804Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0152048Z triton_mm_6 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0152286Z triton_mm_2 0.0072 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=4 2025-12-04T12:10:20.0152548Z triton_mm_7 0.0072 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0152786Z triton_mm_1 0.0075 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0153025Z triton_mm_3 0.0088 ms 68.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0153262Z triton_mm_0 0.0094 ms 63.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0153502Z triton_mm_4 0.0094 ms 63.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0153738Z triton_mm_5 0.0096 ms 62.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0153796Z _scaled_mm 0.0313 ms 19.1% 2025-12-04T12:10:20.0153937Z SingleProcess AUTOTUNE benchmarking takes 0.0481 seconds and 0.2034 seconds precompiling for 9 choices 2025-12-04T12:10:20.0154027Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0154085Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0154158Z stats [('calls_captured', 1), ('unique_graphs', 1)] 
2025-12-04T12:10:20.0154273Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0154778Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0154832Z graph_break [] 2025-12-04T12:10:20.0154908Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0154997Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0155063Z Autotune Choices Stats: 2025-12-04T12:10:20.0155436Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:20.0155512Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0155576Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0155713Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0155960Z triton_mm_15 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0156198Z triton_mm_13 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0156459Z triton_mm_9 0.0068 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0156698Z triton_mm_10 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0156933Z triton_mm_14 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0157175Z triton_mm_12 0.0083 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0157413Z triton_mm_8 0.0088 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0157654Z triton_mm_11 0.0088 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0157711Z _scaled_mm 0.0273 ms 22.3% 2025-12-04T12:10:20.0157852Z SingleProcess AUTOTUNE benchmarking takes 0.0640 seconds and 0.1955 seconds precompiling for 9 choices 2025-12-04T12:10:20.0157921Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0158076Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0158141Z Traceback (most recent call last): 2025-12-04T12:10:20.0158314Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0158371Z method(*args, **kwargs) 2025-12-04T12:10:20.0158550Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0158608Z method(*args, **kwargs) 2025-12-04T12:10:20.0158773Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0158828Z with policy(): 2025-12-04T12:10:20.0158995Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0159064Z raise RuntimeError(msg) 2025-12-04T12:10:20.0159464Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.0159468Z 2025-12-04T12:10:20.0159559Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0159829Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0159833Z 2025-12-04T12:10:20.0159935Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0160024Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0160082Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0160197Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0160707Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0160834Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0160886Z graph_break [] 2025-12-04T12:10:20.0160963Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0161051Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0161548Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0161613Z current_size = base.storage().size() 2025-12-04T12:10:20.0161670Z Autotune Choices Stats: 2025-12-04T12:10:20.0162051Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:20.0162125Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0162189Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0162324Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0162569Z triton_mm_6 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0162820Z triton_mm_2 0.0072 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0163058Z triton_mm_7 0.0072 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0163310Z triton_mm_1 0.0075 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0163547Z triton_mm_3 0.0088 ms 68.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0163787Z triton_mm_0 0.0094 ms 63.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0164025Z triton_mm_4 0.0094 ms 63.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0164262Z triton_mm_5 0.0096 ms 62.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0164319Z _scaled_mm 0.0313 ms 19.1% 2025-12-04T12:10:20.0164478Z SingleProcess AUTOTUNE benchmarking takes 0.0481 seconds and 0.2034 seconds precompiling for 9 choices 2025-12-04T12:10:20.0164580Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0164639Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0164712Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0164826Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0165317Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0165369Z graph_break [] 2025-12-04T12:10:20.0165446Z 
aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0165535Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0165593Z Autotune Choices Stats: 2025-12-04T12:10:20.0165967Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:20.0166041Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0166104Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0166240Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0166484Z triton_mm_15 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0166722Z triton_mm_13 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0166969Z triton_mm_9 0.0068 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0167206Z triton_mm_10 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0167453Z triton_mm_14 0.0069 ms 88.4% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0167696Z triton_mm_12 0.0083 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0167934Z triton_mm_8 0.0088 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0168257Z triton_mm_11 0.0088 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0168315Z _scaled_mm 0.0273 ms 22.3% 2025-12-04T12:10:20.0168457Z SingleProcess AUTOTUNE benchmarking takes 0.0640 seconds and 0.1955 seconds precompiling for 9 choices 2025-12-04T12:10:20.0171107Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0171183Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0171255Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0171372Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0171860Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0171915Z graph_break [] 
2025-12-04T12:10:20.0171990Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0172080Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0172136Z Autotune Choices Stats: 2025-12-04T12:10:20.0172510Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_21", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:20.0172583Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0172645Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0172781Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0173024Z triton_mm_21 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0173265Z triton_mm_18 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0173519Z triton_mm_23 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0173760Z triton_mm_19 0.0064 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0174014Z triton_mm_17 0.0064 ms 93.1% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0174252Z triton_mm_22 0.0064 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0174492Z triton_mm_20 0.0067 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0174730Z triton_mm_16 0.0069 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0174787Z _scaled_mm 0.0299 ms 19.9% 2025-12-04T12:10:20.0174928Z SingleProcess AUTOTUNE benchmarking takes 0.0619 seconds and 0.2047 seconds precompiling for 9 choices 2025-12-04T12:10:20.0175144Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-eac345264c04a5dd.xml - 2025-12-04T12:10:20.0175231Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0175821Z FAILED [0.8172s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.0175824Z 2025-12-04T12:10:20.0175914Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0176182Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0176185Z 2025-12-04T12:10:20.0176289Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0176366Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.0176451Z ================== 1 failed, 187 deselected, 2 rerun in 4.01s ================== 2025-12-04T12:10:20.0176504Z Got exit code 1 2025-12-04T12:10:20.0176560Z Retrying single test... 2025-12-04T12:10:20.0176720Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e03cffcae3396344.xml 2025-12-04T12:10:20.0176793Z ============================= test session starts ============================== 2025-12-04T12:10:20.0176920Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0176977Z cachedir: .pytest_cache 2025-12-04T12:10:20.0177152Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0177215Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0177272Z configfile: pytest.ini 2025-12-04T12:10:20.0177462Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0177553Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.0177815Z stepcurrent: skipping 88 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0177877Z Running 1 items in this shard 2025-12-04T12:10:20.0177879Z 2025-12-04T12:10:20.0178114Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0794s] [100%] 2025-12-04T12:10:20.0178335Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8699s] [100%] 2025-12-04T12:10:20.0178534Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.8062s] [100%] 2025-12-04T12:10:20.0178538Z 2025-12-04T12:10:20.0178605Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0178759Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0178819Z Traceback (most recent call last): 2025-12-04T12:10:20.0178994Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0179050Z method(*args, **kwargs) 2025-12-04T12:10:20.0179220Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0179286Z method(*args, **kwargs) 2025-12-04T12:10:20.0179465Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0179518Z with policy(): 2025-12-04T12:10:20.0179687Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0179745Z raise RuntimeError(msg) 
2025-12-04T12:10:20.0180183Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:20.0180185Z 2025-12-04T12:10:20.0180277Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0180550Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0180554Z 2025-12-04T12:10:20.0180657Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0180748Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0180807Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0180879Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0181379Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0181493Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0181549Z graph_break [] 2025-12-04T12:10:20.0181630Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0181719Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0182233Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0182297Z current_size = base.storage().size() 2025-12-04T12:10:20.0182354Z Autotune Choices Stats: 2025-12-04T12:10:20.0182751Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:20.0182829Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0182893Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0183030Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0183277Z triton_mm_6 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0183520Z triton_mm_0 0.0065 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0183769Z triton_mm_2 0.0071 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0184023Z triton_mm_3 0.0076 ms 85.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0184262Z triton_mm_4 0.0082 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0184501Z triton_mm_1 0.0086 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0184738Z triton_mm_5 0.0092 ms 70.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0184977Z triton_mm_7 0.0098 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0185036Z _scaled_mm 0.0288 ms 22.5% 2025-12-04T12:10:20.0185178Z SingleProcess AUTOTUNE benchmarking takes 0.0466 seconds and 0.1971 seconds precompiling for 9 choices 2025-12-04T12:10:20.0185333Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0185394Z Traceback (most recent call last): 2025-12-04T12:10:20.0185566Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0185623Z method(*args, **kwargs) 2025-12-04T12:10:20.0185792Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0185849Z 
method(*args, **kwargs) 2025-12-04T12:10:20.0186023Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0186078Z with policy(): 2025-12-04T12:10:20.0186244Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0186301Z raise RuntimeError(msg) 2025-12-04T12:10:20.0186707Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:20.0186710Z 2025-12-04T12:10:20.0186801Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0187071Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0187073Z 2025-12-04T12:10:20.0187176Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0187265Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0187324Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0187397Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0187893Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0188027Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0188081Z graph_break [] 2025-12-04T12:10:20.0188158Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0188246Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0188742Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0188806Z current_size = base.storage().size() 2025-12-04T12:10:20.0188865Z Autotune Choices Stats: 2025-12-04T12:10:20.0189241Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:20.0189316Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0189379Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0189515Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0189759Z triton_mm_6 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0189998Z triton_mm_0 0.0065 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=8 2025-12-04T12:10:20.0190288Z triton_mm_2 0.0071 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0190527Z triton_mm_3 0.0076 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0190776Z triton_mm_4 0.0082 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0191014Z triton_mm_1 0.0086 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0191251Z triton_mm_5 0.0092 ms 70.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0191486Z triton_mm_7 0.0098 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0191542Z _scaled_mm 0.0288 ms 22.5% 2025-12-04T12:10:20.0191686Z SingleProcess AUTOTUNE benchmarking takes 0.0466 seconds and 0.1971 seconds precompiling for 9 choices 2025-12-04T12:10:20.0191775Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0191846Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0191918Z stats [('calls_captured', 1), ('unique_graphs', 1)] 
2025-12-04T12:10:20.0192046Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0192536Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0192589Z graph_break [] 2025-12-04T12:10:20.0192665Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0192755Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0192813Z Autotune Choices Stats: 2025-12-04T12:10:20.0193184Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:20.0193259Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0193321Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0193457Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0193702Z triton_mm_12 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0193941Z triton_mm_14 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0194190Z triton_mm_15 0.0067 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0194429Z triton_mm_13 0.0068 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0194677Z triton_mm_9 0.0069 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0194913Z triton_mm_10 0.0072 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0195155Z triton_mm_11 0.0105 ms 57.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0195392Z triton_mm_8 0.0107 ms 56.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0195449Z _scaled_mm 0.0319 ms 18.8% 2025-12-04T12:10:20.0195591Z SingleProcess AUTOTUNE benchmarking takes 0.0455 seconds and 0.1021 seconds precompiling for 9 choices 2025-12-04T12:10:20.0195660Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0195827Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0195900Z Traceback (most recent call last): 2025-12-04T12:10:20.0196071Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0196128Z method(*args, **kwargs) 2025-12-04T12:10:20.0196298Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0196354Z method(*args, **kwargs) 2025-12-04T12:10:20.0196521Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0196573Z with policy(): 2025-12-04T12:10:20.0196740Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0196796Z raise RuntimeError(msg) 2025-12-04T12:10:20.0197195Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.0197198Z 2025-12-04T12:10:20.0197289Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0197558Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0197561Z 2025-12-04T12:10:20.0197663Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0197751Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0197812Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0197885Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0198392Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0198507Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0198559Z graph_break [] 2025-12-04T12:10:20.0198635Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0198723Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0199227Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0199293Z current_size = base.storage().size() 2025-12-04T12:10:20.0199350Z Autotune Choices Stats: 2025-12-04T12:10:20.0199727Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:20.0199801Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0199864Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0200000Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0200289Z triton_mm_6 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0200545Z triton_mm_0 0.0065 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0200781Z triton_mm_2 0.0071 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0201020Z triton_mm_3 0.0076 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0201262Z triton_mm_4 0.0082 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0201502Z triton_mm_1 0.0086 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0201739Z triton_mm_5 0.0092 ms 70.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0201974Z triton_mm_7 0.0098 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0202034Z _scaled_mm 0.0288 ms 22.5% 2025-12-04T12:10:20.0202175Z SingleProcess AUTOTUNE benchmarking takes 0.0466 seconds and 0.1971 seconds precompiling for 9 choices 2025-12-04T12:10:20.0202265Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0202323Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0202408Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0202523Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0203023Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0203078Z graph_break [] 2025-12-04T12:10:20.0203154Z 
aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0203242Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0203301Z Autotune Choices Stats: 2025-12-04T12:10:20.0203675Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:20.0203747Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0203811Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0203946Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0204192Z triton_mm_12 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0204454Z triton_mm_14 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0204691Z triton_mm_15 0.0067 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0204929Z triton_mm_13 0.0068 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0205166Z triton_mm_9 0.0069 ms 86.7% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0205405Z triton_mm_10 0.0072 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0205643Z triton_mm_11 0.0105 ms 57.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0205881Z triton_mm_8 0.0107 ms 56.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0205937Z _scaled_mm 0.0319 ms 18.8% 2025-12-04T12:10:20.0206080Z SingleProcess AUTOTUNE benchmarking takes 0.0455 seconds and 0.1021 seconds precompiling for 9 choices 2025-12-04T12:10:20.0206170Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0206228Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0206300Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0206423Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0206919Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0206982Z graph_break [] 
2025-12-04T12:10:20.0207058Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:20.0207147Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0207205Z Autotune Choices Stats: 2025-12-04T12:10:20.0207579Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_20", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039000116288662, "best_triton_pos": 0} 2025-12-04T12:10:20.0207651Z AUTOTUNE scaled_mm(1x32, 32x2048, 1x1, 1x2048, 2048) 2025-12-04T12:10:20.0207715Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0207849Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0208096Z triton_mm_20 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0208344Z triton_mm_21 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0208597Z triton_mm_23 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0208835Z triton_mm_19 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0209072Z triton_mm_18 0.0067 ms 90.4% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0209309Z triton_mm_22 0.0092 ms 65.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0209547Z triton_mm_17 0.0096 ms 62.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0209787Z triton_mm_16 0.0097 ms 62.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0209843Z _scaled_mm 0.0110 ms 55.1% 2025-12-04T12:10:20.0209988Z SingleProcess AUTOTUNE benchmarking takes 0.0635 seconds and 0.2101 seconds precompiling for 9 choices 2025-12-04T12:10:20.0210236Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e03cffcae3396344.xml - 2025-12-04T12:10:20.0210313Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0210916Z FAILED [0.8062s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.0210919Z 2025-12-04T12:10:20.0211020Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0211289Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0211293Z 2025-12-04T12:10:20.0211395Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0211475Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.0211558Z ================== 1 failed, 187 deselected, 2 rerun in 3.78s ================== 2025-12-04T12:10:20.0211613Z Got exit code 1 2025-12-04T12:10:20.0211829Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0211971Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.0212130Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d260f9002e7b4b9d.xml 2025-12-04T12:10:20.0212217Z ============================= test session starts ============================== 2025-12-04T12:10:20.0212345Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0212424Z cachedir: .pytest_cache 2025-12-04T12:10:20.0212599Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0212662Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0212720Z configfile: pytest.ini 2025-12-04T12:10:20.0212897Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0212989Z collecting ... 
collected 188 items / 89 deselected / 99 selected 2025-12-04T12:10:20.0213058Z stepcurrent: skipping 89 already run items. 2025-12-04T12:10:20.0213120Z Running 99 items in this shard 2025-12-04T12:10:20.0213122Z 2025-12-04T12:10:20.0213352Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.4075s] [ 1%] 2025-12-04T12:10:20.0213578Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.9433s] [ 1%] 2025-12-04T12:10:20.0213778Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.8827s] [ 1%] 2025-12-04T12:10:20.0213781Z 2025-12-04T12:10:20.0213848Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0214004Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0214066Z Traceback (most recent call last): 2025-12-04T12:10:20.0214241Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0214299Z method(*args, **kwargs) 2025-12-04T12:10:20.0214466Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0214523Z method(*args, **kwargs) 2025-12-04T12:10:20.0214700Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0214754Z with policy(): 2025-12-04T12:10:20.0214921Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0214977Z raise RuntimeError(msg) 2025-12-04T12:10:20.0215385Z 
RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:20.0215389Z 2025-12-04T12:10:20.0215479Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0215753Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0215755Z 2025-12-04T12:10:20.0215857Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0215945Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0216004Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0216076Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0216577Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0216710Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0216764Z graph_break [] 2025-12-04T12:10:20.0216843Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0216933Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0217427Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: 
TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0217492Z current_size = base.storage().size() 2025-12-04T12:10:20.0217549Z Autotune Choices Stats: 2025-12-04T12:10:20.0217935Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006920000072568655, "best_triton_pos": 0} 2025-12-04T12:10:20.0218015Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0218081Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0218219Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0218469Z triton_mm_8 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0218709Z triton_mm_14 0.0070 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0218962Z triton_mm_17 0.0072 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0219208Z triton_mm_18 0.0076 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0219455Z triton_mm_16 0.0076 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0219696Z triton_mm_13 0.0080 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0219935Z triton_mm_12 0.0085 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0220210Z triton_mm_11 0.0089 ms 77.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0220449Z triton_mm_15 0.0089 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0220706Z triton_mm_9 0.0089 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0220863Z SingleProcess AUTOTUNE benchmarking takes 0.0908 seconds and 0.4162 seconds precompiling for 20 choices 2025-12-04T12:10:20.0221018Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0221079Z Traceback (most recent call last): 2025-12-04T12:10:20.0221251Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0221308Z method(*args, **kwargs) 2025-12-04T12:10:20.0221476Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0221531Z method(*args, **kwargs) 2025-12-04T12:10:20.0221697Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0221751Z with policy(): 2025-12-04T12:10:20.0221918Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0221976Z raise RuntimeError(msg) 2025-12-04T12:10:20.0222376Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1121976320. 
2025-12-04T12:10:20.0222378Z 2025-12-04T12:10:20.0222467Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0222739Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0222741Z 2025-12-04T12:10:20.0222845Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0222933Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0223003Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0223075Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0223577Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0223701Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0223757Z graph_break [] 2025-12-04T12:10:20.0223836Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0223926Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0224419Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0224483Z current_size = base.storage().size() 2025-12-04T12:10:20.0224542Z Autotune Choices Stats: 2025-12-04T12:10:20.0224921Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006920000072568655, "best_triton_pos": 0} 2025-12-04T12:10:20.0225018Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0225084Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0225221Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0225468Z triton_mm_8 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0225708Z triton_mm_14 0.0070 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0225948Z triton_mm_17 0.0072 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0226194Z triton_mm_18 0.0076 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0226431Z triton_mm_16 0.0076 ms 90.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0226669Z triton_mm_13 0.0080 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0226907Z triton_mm_12 0.0085 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0227157Z triton_mm_11 0.0089 ms 77.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0227395Z triton_mm_15 0.0089 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0227647Z triton_mm_9 0.0089 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0227791Z SingleProcess AUTOTUNE benchmarking takes 0.0908 seconds and 0.4162 seconds precompiling for 20 choices 2025-12-04T12:10:20.0227882Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0227941Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0228013Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0228128Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0228626Z inductor 
[('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0228678Z graph_break [] 2025-12-04T12:10:20.0228757Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0228863Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0228920Z Autotune Choices Stats: 2025-12-04T12:10:20.0229309Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006920000072568655, "best_triton_pos": 0} 2025-12-04T12:10:20.0229385Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0229452Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0229587Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0229835Z triton_mm_36 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0230074Z triton_mm_35 0.0070 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0230359Z triton_mm_33 0.0073 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0230601Z triton_mm_27 0.0075 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0230845Z triton_mm_37 0.0078 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0231085Z triton_mm_34 0.0084 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0231334Z triton_mm_30 0.0084 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0231574Z triton_mm_31 0.0084 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0231823Z triton_mm_32 0.0090 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0232065Z triton_mm_29 0.0104 ms 66.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0232213Z SingleProcess AUTOTUNE benchmarking 
takes 0.1245 seconds and 0.2856 seconds precompiling for 20 choices 2025-12-04T12:10:20.0232283Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0232439Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0232501Z Traceback (most recent call last): 2025-12-04T12:10:20.0232674Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0232730Z method(*args, **kwargs) 2025-12-04T12:10:20.0232910Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0232980Z method(*args, **kwargs) 2025-12-04T12:10:20.0233145Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0233200Z with policy(): 2025-12-04T12:10:20.0233368Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0233425Z raise RuntimeError(msg) 2025-12-04T12:10:20.0233827Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 
2025-12-04T12:10:20.0233830Z 2025-12-04T12:10:20.0233924Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0234194Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0234197Z 2025-12-04T12:10:20.0234300Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0234389Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0234447Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0234519Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0235018Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0235132Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0235186Z graph_break [] 2025-12-04T12:10:20.0235262Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0235363Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0235863Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0235925Z current_size = base.storage().size() 2025-12-04T12:10:20.0235992Z Autotune Choices Stats: 2025-12-04T12:10:20.0236372Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006920000072568655, "best_triton_pos": 0} 2025-12-04T12:10:20.0236450Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0236516Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0236654Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0236899Z triton_mm_8 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0237138Z triton_mm_14 0.0070 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0237402Z triton_mm_17 0.0072 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0237653Z triton_mm_18 0.0076 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0237890Z triton_mm_16 0.0076 ms 90.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0238126Z triton_mm_13 0.0080 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0238364Z triton_mm_12 0.0085 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0238601Z triton_mm_11 0.0089 ms 77.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0238837Z triton_mm_15 0.0089 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0239080Z triton_mm_9 0.0089 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0239225Z SingleProcess AUTOTUNE benchmarking takes 0.0908 seconds and 0.4162 seconds precompiling for 20 choices 2025-12-04T12:10:20.0239314Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0239383Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0239457Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0239571Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0240077Z inductor 
[('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0240166Z graph_break [] 2025-12-04T12:10:20.0240243Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0240334Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0240391Z Autotune Choices Stats: 2025-12-04T12:10:20.0240767Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006920000072568655, "best_triton_pos": 0} 2025-12-04T12:10:20.0240842Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0240908Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0241043Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0241311Z triton_mm_36 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0241563Z triton_mm_35 0.0070 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0241800Z triton_mm_33 0.0073 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0242040Z triton_mm_27 0.0075 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0242280Z triton_mm_37 0.0078 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0242520Z triton_mm_34 0.0084 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0242756Z triton_mm_30 0.0084 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0242994Z triton_mm_31 0.0084 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0243231Z triton_mm_32 0.0090 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0243483Z triton_mm_29 0.0104 ms 66.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0243628Z SingleProcess AUTOTUNE benchmarking 
takes 0.1245 seconds and 0.2856 seconds precompiling for 20 choices 2025-12-04T12:10:20.0243717Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0243776Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0243847Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0243981Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0244478Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0244532Z graph_break [] 2025-12-04T12:10:20.0244608Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0244698Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0244754Z Autotune Choices Stats: 2025-12-04T12:10:20.0245132Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_55", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0068789999932050705, "best_triton_pos": 0} 2025-12-04T12:10:20.0245220Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0245296Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0245431Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0245676Z triton_mm_55 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0245913Z triton_mm_52 0.0071 ms 97.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0246151Z triton_mm_46 0.0073 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0246394Z triton_mm_54 0.0077 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0246632Z triton_mm_49 0.0083 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0246866Z triton_mm_50 0.0084 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0247104Z triton_mm_51 0.0085 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0247356Z triton_mm_56 0.0086 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0247593Z triton_mm_53 0.0087 ms 79.3% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0247836Z triton_mm_47 0.0093 ms 74.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0247988Z SingleProcess AUTOTUNE benchmarking takes 0.1460 seconds and 0.2734 seconds precompiling for 20 choices 2025-12-04T12:10:20.0248193Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d260f9002e7b4b9d.xml - 2025-12-04T12:10:20.0248269Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0248865Z FAILED [0.8827s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 2025-12-04T12:10:20.0248867Z 2025-12-04T12:10:20.0248956Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0249229Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0249249Z 2025-12-04T12:10:20.0249353Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0249431Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.0249514Z ================== 1 failed, 89 deselected, 2 rerun in 4.25s =================== 2025-12-04T12:10:20.0249567Z Got exit code 1 2025-12-04T12:10:20.0249625Z Retrying single test... 2025-12-04T12:10:20.0249783Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e448f6c8db5f9c98.xml 2025-12-04T12:10:20.0249858Z ============================= test session starts ============================== 2025-12-04T12:10:20.0249986Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0250044Z cachedir: .pytest_cache 2025-12-04T12:10:20.0250284Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0250348Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0250405Z configfile: pytest.ini 2025-12-04T12:10:20.0250584Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0250674Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.0250939Z stepcurrent: skipping 89 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0250998Z Running 1 items in this shard 2025-12-04T12:10:20.0251000Z 2025-12-04T12:10:20.0251227Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.3976s] [100%] 2025-12-04T12:10:20.0251452Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.9343s] [100%] 2025-12-04T12:10:20.0251670Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.8428s] [100%] 2025-12-04T12:10:20.0251672Z 2025-12-04T12:10:20.0251740Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0251895Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0251956Z Traceback (most recent call last): 2025-12-04T12:10:20.0252131Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0252202Z method(*args, **kwargs) 2025-12-04T12:10:20.0252370Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0252428Z method(*args, **kwargs) 2025-12-04T12:10:20.0252592Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0252649Z with policy(): 2025-12-04T12:10:20.0252816Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0252875Z raise RuntimeError(msg) 
2025-12-04T12:10:20.0253275Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:20.0253278Z 2025-12-04T12:10:20.0253367Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0253651Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0253667Z 2025-12-04T12:10:20.0253768Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0253860Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0253918Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0253991Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0254495Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0254611Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0254664Z graph_break [] 2025-12-04T12:10:20.0254744Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0254833Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0255332Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0255396Z current_size = base.storage().size() 2025-12-04T12:10:20.0255453Z Autotune Choices Stats: 2025-12-04T12:10:20.0255841Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.0255931Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0255998Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0256135Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0256385Z triton_mm_17 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0256634Z triton_mm_14 0.0069 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0256876Z triton_mm_16 0.0072 ms 95.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0257119Z triton_mm_8 0.0074 ms 92.4% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0257361Z triton_mm_18 0.0079 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0257601Z triton_mm_12 0.0084 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0257863Z triton_mm_9 0.0087 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0258101Z triton_mm_11 0.0094 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0258340Z triton_mm_15 0.0104 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0258578Z triton_mm_5 0.0104 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0258726Z SingleProcess AUTOTUNE benchmarking takes 0.0907 seconds and 0.4010 seconds precompiling for 20 choices 2025-12-04T12:10:20.0258882Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0258944Z Traceback (most recent call last): 2025-12-04T12:10:20.0259115Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0259173Z method(*args, **kwargs) 2025-12-04T12:10:20.0259340Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0259397Z method(*args, **kwargs) 2025-12-04T12:10:20.0259569Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0259626Z with policy(): 2025-12-04T12:10:20.0259794Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0259854Z raise RuntimeError(msg) 2025-12-04T12:10:20.0260314Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1121976320. 
2025-12-04T12:10:20.0260316Z 2025-12-04T12:10:20.0260406Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0260689Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0260693Z 2025-12-04T12:10:20.0260796Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0260888Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0260947Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0261019Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0261520Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0261634Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0261688Z graph_break [] 2025-12-04T12:10:20.0261767Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0261870Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0262381Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0262445Z current_size = base.storage().size() 2025-12-04T12:10:20.0262501Z Autotune Choices Stats: 2025-12-04T12:10:20.0262882Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.0262959Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0263029Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0263166Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0263418Z triton_mm_17 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0263656Z triton_mm_14 0.0069 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0263893Z triton_mm_16 0.0072 ms 95.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0264133Z triton_mm_8 0.0074 ms 92.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0264384Z triton_mm_18 0.0079 ms 86.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0264623Z triton_mm_12 0.0084 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0264876Z triton_mm_9 0.0087 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0265116Z triton_mm_11 0.0094 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0265354Z triton_mm_15 0.0104 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0265592Z triton_mm_5 0.0104 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0265739Z SingleProcess AUTOTUNE benchmarking takes 0.0907 seconds and 0.4010 seconds precompiling for 20 choices 2025-12-04T12:10:20.0265829Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0265902Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0265995Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0266109Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0266609Z inductor 
[('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0266661Z graph_break [] 2025-12-04T12:10:20.0266740Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0266831Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0266889Z Autotune Choices Stats: 2025-12-04T12:10:20.0267262Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007000000216066837, "best_triton_pos": 0} 2025-12-04T12:10:20.0267340Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0267407Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0267544Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0267789Z triton_mm_35 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0268027Z triton_mm_33 0.0071 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0268279Z triton_mm_27 0.0077 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, 
BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0268516Z triton_mm_31 0.0082 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0268765Z triton_mm_34 0.0084 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0269006Z triton_mm_37 0.0087 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0269247Z triton_mm_32 0.0088 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0269484Z triton_mm_30 0.0091 ms 76.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0269726Z triton_mm_28 0.0092 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0269965Z triton_mm_36 0.0099 ms 70.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0270155Z SingleProcess AUTOTUNE benchmarking 
takes 0.1247 seconds and 0.2921 seconds precompiling for 20 choices 2025-12-04T12:10:20.0270226Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0270381Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0270445Z Traceback (most recent call last): 2025-12-04T12:10:20.0270618Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0270678Z method(*args, **kwargs) 2025-12-04T12:10:20.0270845Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0270904Z method(*args, **kwargs) 2025-12-04T12:10:20.0271072Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0271126Z with policy(): 2025-12-04T12:10:20.0271295Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0271351Z raise RuntimeError(msg) 2025-12-04T12:10:20.0271761Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 
2025-12-04T12:10:20.0271763Z 2025-12-04T12:10:20.0271853Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0272123Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0272127Z 2025-12-04T12:10:20.0272228Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0272339Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0272397Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0272471Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0272988Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0273101Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0273156Z graph_break [] 2025-12-04T12:10:20.0273235Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0273327Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0273822Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0273886Z current_size = base.storage().size() 2025-12-04T12:10:20.0273943Z Autotune Choices Stats: 2025-12-04T12:10:20.0274327Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.0274442Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0274511Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0274648Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0274897Z triton_mm_17 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0275139Z triton_mm_14 0.0069 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0275376Z triton_mm_16 0.0072 ms 95.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0275618Z triton_mm_8 0.0074 ms 92.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0275860Z triton_mm_18 0.0079 ms 86.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0276099Z triton_mm_12 0.0084 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0276342Z triton_mm_9 0.0087 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0276589Z triton_mm_11 0.0094 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0276827Z triton_mm_15 0.0104 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0277074Z triton_mm_5 0.0104 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0277219Z SingleProcess AUTOTUNE benchmarking takes 0.0907 seconds and 0.4010 seconds precompiling for 20 choices 2025-12-04T12:10:20.0277311Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0277368Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0277441Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0277555Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0278052Z inductor 
[('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0278104Z graph_break [] 2025-12-04T12:10:20.0278191Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0278280Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0278349Z Autotune Choices Stats: 2025-12-04T12:10:20.0278725Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007000000216066837, "best_triton_pos": 0} 2025-12-04T12:10:20.0278802Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0278867Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0279004Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0279249Z triton_mm_35 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0279488Z triton_mm_33 0.0071 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0279728Z triton_mm_27 0.0077 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, 
BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0279965Z triton_mm_31 0.0082 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0280235Z triton_mm_34 0.0084 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0280491Z triton_mm_37 0.0087 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0280731Z triton_mm_32 0.0088 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0280968Z triton_mm_30 0.0091 ms 76.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0281223Z triton_mm_28 0.0092 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0281467Z triton_mm_36 0.0099 ms 70.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0281611Z SingleProcess AUTOTUNE benchmarking 
takes 0.1247 seconds and 0.2921 seconds precompiling for 20 choices 2025-12-04T12:10:20.0281700Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0281758Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0281830Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0281943Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0282441Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0282521Z graph_break [] 2025-12-04T12:10:20.0282598Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0282688Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0282746Z Autotune Choices Stats: 2025-12-04T12:10:20.0283125Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_54", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:20.0283202Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0283272Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0283407Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0283654Z triton_mm_54 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0283893Z triton_mm_52 0.0079 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0284128Z triton_mm_53 0.0082 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0284369Z triton_mm_55 0.0082 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0284620Z triton_mm_56 0.0083 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0284857Z triton_mm_49 0.0084 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0285102Z triton_mm_50 0.0085 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0285344Z triton_mm_46 0.0086 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0285584Z triton_mm_51 0.0087 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0285824Z triton_mm_47 0.0092 ms 74.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0285967Z SingleProcess AUTOTUNE benchmarking takes 0.1488 seconds and 0.2649 seconds precompiling for 20 choices 2025-12-04T12:10:20.0286172Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e448f6c8db5f9c98.xml - 2025-12-04T12:10:20.0286259Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0286865Z FAILED [0.8428s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 2025-12-04T12:10:20.0286868Z 2025-12-04T12:10:20.0286957Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0287231Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0287235Z 2025-12-04T12:10:20.0287337Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0287416Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.0287501Z ================== 1 failed, 187 deselected, 2 rerun in 4.19s ================== 2025-12-04T12:10:20.0287555Z Got exit code 1 2025-12-04T12:10:20.0287611Z Retrying single test... 2025-12-04T12:10:20.0287770Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b67904059602dee0.xml 2025-12-04T12:10:20.0287843Z ============================= test session starts ============================== 2025-12-04T12:10:20.0287970Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0288025Z cachedir: .pytest_cache 2025-12-04T12:10:20.0288201Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0288264Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0288323Z configfile: pytest.ini 2025-12-04T12:10:20.0288501Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0288600Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.0288867Z stepcurrent: skipping 89 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0288927Z Running 1 items in this shard 2025-12-04T12:10:20.0288929Z 2025-12-04T12:10:20.0289165Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.4030s] [100%] 2025-12-04T12:10:20.0289388Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.9465s] [100%] 2025-12-04T12:10:20.0289591Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda FAILED [1.0286s] [100%] 2025-12-04T12:10:20.0289593Z 2025-12-04T12:10:20.0289661Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0289818Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0289880Z Traceback (most recent call last): 2025-12-04T12:10:20.0290055Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0290156Z method(*args, **kwargs) 2025-12-04T12:10:20.0290325Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0290402Z method(*args, **kwargs) 2025-12-04T12:10:20.0290567Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0290637Z with policy(): 2025-12-04T12:10:20.0290804Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0290861Z raise RuntimeError(msg) 
2025-12-04T12:10:20.0291261Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:20.0291263Z 2025-12-04T12:10:20.0291354Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0291626Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0291630Z 2025-12-04T12:10:20.0291736Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0291826Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0291885Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0291958Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0292459Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0292573Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0292627Z graph_break [] 2025-12-04T12:10:20.0292707Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0292795Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0293320Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0293385Z current_size = base.storage().size() 2025-12-04T12:10:20.0293443Z Autotune Choices Stats: 2025-12-04T12:10:20.0293838Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:20.0293919Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0293987Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0294122Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0294379Z triton_mm_17 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0294621Z triton_mm_14 0.0071 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0294873Z triton_mm_8 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0295128Z triton_mm_18 0.0077 ms 87.5% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0295366Z triton_mm_12 0.0082 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0295604Z triton_mm_15 0.0082 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0295839Z triton_mm_11 0.0082 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0296083Z triton_mm_9 0.0090 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0296319Z triton_mm_13 0.0090 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0296560Z triton_mm_10 0.0100 ms 67.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0296706Z SingleProcess AUTOTUNE benchmarking takes 0.0942 seconds and 0.3868 seconds precompiling for 20 choices 2025-12-04T12:10:20.0296862Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0296924Z Traceback (most recent call last): 2025-12-04T12:10:20.0297104Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0297162Z method(*args, **kwargs) 2025-12-04T12:10:20.0297329Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0297385Z method(*args, **kwargs) 2025-12-04T12:10:20.0297550Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0297613Z with policy(): 2025-12-04T12:10:20.0297781Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0297839Z raise RuntimeError(msg) 2025-12-04T12:10:20.0298242Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1121976320. 
2025-12-04T12:10:20.0298244Z 2025-12-04T12:10:20.0298335Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0298605Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0298608Z 2025-12-04T12:10:20.0298712Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0298812Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0298871Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0298959Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0299459Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0299572Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0299625Z graph_break [] 2025-12-04T12:10:20.0299703Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0299792Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0300325Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0300389Z current_size = base.storage().size() 2025-12-04T12:10:20.0300447Z Autotune Choices Stats: 2025-12-04T12:10:20.0300831Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:20.0300909Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0300977Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0301112Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0301377Z triton_mm_17 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0301616Z triton_mm_14 0.0071 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0301871Z triton_mm_8 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0302118Z triton_mm_18 0.0077 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0302357Z triton_mm_12 0.0082 ms 82.4% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0302594Z triton_mm_15 0.0082 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0302831Z triton_mm_11 0.0082 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0303072Z triton_mm_9 0.0090 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0303337Z triton_mm_13 0.0090 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0303577Z triton_mm_10 0.0100 ms 67.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0303721Z SingleProcess AUTOTUNE benchmarking takes 0.0942 seconds and 0.3868 seconds precompiling for 20 choices 2025-12-04T12:10:20.0303812Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0303872Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0303944Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0304059Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0304555Z inductor 
[('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0304612Z graph_break [] 2025-12-04T12:10:20.0304689Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0304778Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0304834Z Autotune Choices Stats: 2025-12-04T12:10:20.0305211Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006800000090152025, "best_triton_pos": 0} 2025-12-04T12:10:20.0305298Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0305364Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0305499Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0305746Z triton_mm_33 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0306006Z triton_mm_27 0.0072 ms 95.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0306251Z triton_mm_37 0.0076 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, 
BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0306496Z triton_mm_36 0.0077 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0306734Z triton_mm_31 0.0082 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0306969Z triton_mm_32 0.0085 ms 80.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0307218Z triton_mm_30 0.0087 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0307466Z triton_mm_34 0.0090 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0307709Z triton_mm_28 0.0095 ms 71.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0307945Z triton_mm_35 0.0101 ms 67.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0308089Z SingleProcess AUTOTUNE benchmarking 
takes 0.1288 seconds and 0.2953 seconds precompiling for 20 choices 2025-12-04T12:10:20.0308159Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0308315Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0308377Z Traceback (most recent call last): 2025-12-04T12:10:20.0308549Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0308607Z method(*args, **kwargs) 2025-12-04T12:10:20.0308773Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0308832Z method(*args, **kwargs) 2025-12-04T12:10:20.0308997Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0309051Z with policy(): 2025-12-04T12:10:20.0309219Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0309277Z raise RuntimeError(msg) 2025-12-04T12:10:20.0309690Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 
2025-12-04T12:10:20.0309692Z 2025-12-04T12:10:20.0309783Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0310067Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0310070Z 2025-12-04T12:10:20.0310203Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0310293Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0310350Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0310424Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0310923Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0311039Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0311108Z graph_break [] 2025-12-04T12:10:20.0311187Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0311275Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0311791Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0311856Z current_size = base.storage().size() 2025-12-04T12:10:20.0311912Z Autotune Choices Stats: 2025-12-04T12:10:20.0312294Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:20.0312371Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0312440Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0312576Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0312825Z triton_mm_17 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0313065Z triton_mm_14 0.0071 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0313307Z triton_mm_8 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0313563Z triton_mm_18 0.0077 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0313801Z triton_mm_12 0.0082 ms 82.4% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0314039Z triton_mm_15 0.0082 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0314287Z triton_mm_11 0.0082 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0314533Z triton_mm_9 0.0090 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0314776Z triton_mm_13 0.0090 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0316243Z triton_mm_10 0.0100 ms 67.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0316395Z SingleProcess AUTOTUNE benchmarking takes 0.0942 seconds and 0.3868 seconds precompiling for 20 choices 2025-12-04T12:10:20.0316502Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0316577Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0316650Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0316770Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0317267Z inductor 
[('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0317321Z graph_break [] 2025-12-04T12:10:20.0317399Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0317490Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0317549Z Autotune Choices Stats: 2025-12-04T12:10:20.0317928Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006800000090152025, "best_triton_pos": 0} 2025-12-04T12:10:20.0318007Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0318074Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0318211Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0318454Z triton_mm_33 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0318698Z triton_mm_27 0.0072 ms 95.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0318951Z triton_mm_37 0.0076 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, 
BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0319192Z triton_mm_36 0.0077 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0319440Z triton_mm_31 0.0082 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0319680Z triton_mm_32 0.0085 ms 80.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0319921Z triton_mm_30 0.0087 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0320188Z triton_mm_34 0.0090 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0320432Z triton_mm_28 0.0095 ms 71.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0320687Z triton_mm_35 0.0101 ms 67.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0320847Z SingleProcess AUTOTUNE benchmarking 
takes 0.1288 seconds and 0.2953 seconds precompiling for 20 choices 2025-12-04T12:10:20.0320937Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0320996Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0321068Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0321182Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0321679Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0321734Z graph_break [] 2025-12-04T12:10:20.0321812Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:20.0321902Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0321963Z Autotune Choices Stats: 2025-12-04T12:10:20.0322336Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_46", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006920000072568655, "best_triton_pos": 0} 2025-12-04T12:10:20.0322413Z AUTOTUNE scaled_mm(257x1024, 1024x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.0322480Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0322616Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0322883Z triton_mm_46 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0323124Z triton_mm_55 0.0070 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0323365Z triton_mm_52 0.0075 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0323615Z triton_mm_54 0.0077 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0323856Z triton_mm_53 0.0083 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0324101Z triton_mm_56 0.0084 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.0324337Z triton_mm_51 0.0085 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.0324575Z triton_mm_49 0.0087 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0324849Z triton_mm_47 0.0090 ms 76.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0325089Z triton_mm_43 0.0106 ms 65.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0325232Z SingleProcess AUTOTUNE benchmarking takes 0.1509 seconds and 0.2756 seconds precompiling for 20 choices 2025-12-04T12:10:20.0325436Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b67904059602dee0.xml - 2025-12-04T12:10:20.0325514Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0326104Z FAILED [1.0286s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 2025-12-04T12:10:20.0326107Z 2025-12-04T12:10:20.0326198Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0326470Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0326474Z 2025-12-04T12:10:20.0326578Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0326657Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.0326741Z ================== 1 failed, 187 deselected, 2 rerun in 4.40s ================== 2025-12-04T12:10:20.0326797Z Got exit code 1 2025-12-04T12:10:20.0327025Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.0327168Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.0327326Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2ffc76f1125ecc13.xml 2025-12-04T12:10:20.0327400Z ============================= test session starts ============================== 2025-12-04T12:10:20.0327536Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0327595Z cachedir: .pytest_cache 2025-12-04T12:10:20.0327767Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0327832Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0327889Z configfile: pytest.ini 2025-12-04T12:10:20.0328069Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0328159Z collecting ... collected 188 items / 90 deselected / 98 selected 2025-12-04T12:10:20.0328229Z stepcurrent: skipping 90 already run items. 2025-12-04T12:10:20.0328289Z Running 98 items in this shard 2025-12-04T12:10:20.0328291Z 2025-12-04T12:10:20.0329222Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmpmwguhvwt/7i/c7ignet3sdjwpdnvdorijykmzjy73n6i4vn6qphywltpwti6vsgr.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0329407Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0329640Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0329814Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0330154Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0330307Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0330580Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0330734Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0331004Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in 
precompile 2025-12-04T12:10:20.0331176Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0331474Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0331623Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0331911Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0332132Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0332465Z E1204 10:57:32.377000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0333212Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpmwguhvwt/54/c54sbtaegy3l6t6h5uini7glder6nktyfzkn44eaouf5wsp3jzbm.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0333374Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0333615Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0333803Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0334100Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0334249Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0334517Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0334670Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0334938Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0335110Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0335395Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0335543Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0335831Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0336048Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0336377Z E1204 10:57:32.403000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0337128Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpmwguhvwt/ur/curozslustiehcdynpneeg4lh67lyjd63uxzx3kcgevewo5tqnd5.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0337291Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0337521Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0337689Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0337993Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0338148Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0338435Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0338587Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0338853Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0339025Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0339307Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0339457Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0339744Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0339950Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0340384Z E1204 10:57:32.406000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0341138Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpmwguhvwt/7a/c7a4bqqfq46giblif7grrpmrzw6ufaznee2vsz2mivxhs2gj7ijb.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0341300Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0341541Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0341712Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0342014Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0342158Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0342427Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0342577Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0342857Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0343039Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0343320Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0343468Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0343762Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0343973Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0344308Z E1204 10:57:32.408000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0345142Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpmwguhvwt/oe/coeqhkqsrdi6lh2mzkuztx2fll5fyimy4fn2nbblhkt7k3jpoehl.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0345306Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0345545Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0345716Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0346015Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0346172Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0346440Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0346593Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0346860Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0347028Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0347311Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0347469Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0347770Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0347975Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0348302Z E1204 10:57:32.411000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0349038Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpmwguhvwt/2z/c2z6iwqtseqjyipde5jatpr6uwitqmrj73v25gztvms2h4mwdnfs.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0349199Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0349426Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0349595Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0349894Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0350051Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0350354Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0350506Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0350787Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0350959Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0351243Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0351391Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0351679Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0351886Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0352226Z E1204 10:57:32.415000 674819 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0352309Z ('RERUN', {'yellow': True}) [3.5104s] [ 1%] 2025-12-04T12:10:20.0352645Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda E1204 10:57:34.497000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0352955Z E1204 10:57:34.497000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0353101Z E1204 10:57:34.497000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.0353261Z E1204 10:57:34.499000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0353568Z E1204 10:57:34.499000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0353710Z E1204 10:57:34.499000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0353868Z E1204 10:57:34.501000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0354176Z E1204 10:57:34.501000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0354317Z E1204 10:57:34.501000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0354490Z E1204 10:57:34.560000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0354795Z E1204 10:57:34.560000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0354935Z E1204 10:57:34.560000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.0355105Z E1204 10:57:34.562000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0355410Z E1204 10:57:34.562000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0355553Z E1204 10:57:34.562000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0355710Z E1204 10:57:34.564000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0356014Z E1204 10:57:34.564000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0356156Z E1204 10:57:34.564000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0356223Z ('RERUN', {'yellow': True}) [1.6182s] [ 1%] 2025-12-04T12:10:20.0356570Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda E1204 10:57:35.951000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0356889Z E1204 10:57:35.951000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0357029Z E1204 10:57:35.951000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.0357186Z E1204 10:57:35.953000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0357491Z E1204 10:57:35.953000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0357631Z E1204 10:57:35.953000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0357790Z E1204 10:57:35.955000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0358094Z E1204 10:57:35.955000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0358234Z E1204 10:57:35.955000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0358391Z E1204 10:57:35.997000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0358695Z E1204 10:57:35.997000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0358837Z E1204 10:57:35.997000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.0359002Z E1204 10:57:35.999000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0359308Z E1204 10:57:35.999000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0359448Z E1204 10:57:35.999000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0359615Z E1204 10:57:36.001000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0359920Z E1204 10:57:36.001000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.0360062Z E1204 10:57:36.001000 674819 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.0360157Z FAILED [1.4464s] [ 1%] 2025-12-04T12:10:20.0360159Z 2025-12-04T12:10:20.0360230Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.0360391Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0360453Z Traceback (most recent call last): 2025-12-04T12:10:20.0360627Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0360701Z method(*args, **kwargs) 2025-12-04T12:10:20.0360869Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0360940Z method(*args, **kwargs) 2025-12-04T12:10:20.0361106Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0361160Z with policy(): 2025-12-04T12:10:20.0361328Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0361385Z raise RuntimeError(msg) 2025-12-04T12:10:20.0361795Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1973420032. 
2025-12-04T12:10:20.0361798Z 2025-12-04T12:10:20.0361891Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0362168Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0362171Z 2025-12-04T12:10:20.0362275Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0362365Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0362424Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0362497Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0363065Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0363182Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0363248Z graph_break [] 2025-12-04T12:10:20.0363330Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.0363421Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0363933Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0364003Z current_size = base.storage().size() 2025-12-04T12:10:20.0364060Z Autotune Choices Stats: 2025-12-04T12:10:20.0364447Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.01116000022739172, "best_triton_pos": 0} 2025-12-04T12:10:20.0364533Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.0364600Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0364739Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0364991Z triton_mm_33 0.0112 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0365242Z triton_mm_22 0.0112 ms 99.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0365493Z triton_mm_30 0.0116 ms 95.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0365731Z triton_mm_21 0.0117 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0365973Z triton_mm_34 0.0118 ms 94.9% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0366209Z triton_mm_29 0.0118 ms 94.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0366449Z triton_mm_16 0.0121 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0366688Z triton_mm_23 0.0123 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0366923Z triton_mm_25 0.0132 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0367163Z triton_mm_15 0.0136 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0367317Z SingleProcess AUTOTUNE benchmarking takes 0.1763 seconds and 1.1307 seconds precompiling for 33 choices 2025-12-04T12:10:20.0367477Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0367539Z Traceback (most recent call last): 2025-12-04T12:10:20.0367713Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0367769Z method(*args, **kwargs) 
2025-12-04T12:10:20.0367947Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0368004Z method(*args, **kwargs) 2025-12-04T12:10:20.0368172Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0368227Z with policy(): 2025-12-04T12:10:20.0368394Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0368453Z raise RuntimeError(msg) 2025-12-04T12:10:20.0368856Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1973420032 and is now 3017801728. 2025-12-04T12:10:20.0368859Z 2025-12-04T12:10:20.0368950Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0369224Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0369246Z 2025-12-04T12:10:20.0369351Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0369441Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0369501Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0369574Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0370191Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0370309Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0370361Z graph_break [] 2025-12-04T12:10:20.0370444Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.0370533Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0371032Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0371096Z current_size = base.storage().size() 2025-12-04T12:10:20.0371154Z Autotune Choices Stats: 2025-12-04T12:10:20.0371539Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.01116000022739172, "best_triton_pos": 0} 2025-12-04T12:10:20.0371625Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.0371711Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0371850Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0372099Z triton_mm_33 0.0112 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:20.0372356Z triton_mm_22 0.0112 ms 99.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0372595Z triton_mm_30 0.0116 ms 95.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0372834Z triton_mm_21 0.0117 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0373074Z triton_mm_34 0.0118 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0373314Z triton_mm_29 0.0118 ms 94.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0373561Z triton_mm_16 0.0121 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0373814Z triton_mm_23 0.0123 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0374050Z triton_mm_25 0.0132 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0374290Z triton_mm_15 0.0136 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0374435Z SingleProcess AUTOTUNE benchmarking takes 0.1763 seconds and 1.1307 seconds precompiling for 33 choices 2025-12-04T12:10:20.0374526Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0374585Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0374658Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0374774Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0375242Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0375296Z graph_break [] 2025-12-04T12:10:20.0375376Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.0375466Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0375523Z Autotune Choices Stats: 2025-12-04T12:10:20.0376013Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.0106800002977252, "best_triton_pos": 1, "best_triton_time": 0.010999999940395355, "best_triton_kernel": "triton_mm_72", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4"} 2025-12-04T12:10:20.0376095Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.0376163Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0376307Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0376369Z _scaled_mm 0.0107 ms 100.0% 2025-12-04T12:10:20.0376613Z triton_mm_72 0.0110 ms 97.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0376855Z triton_mm_67 0.0110 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0377095Z triton_mm_71 0.0111 ms 96.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0377332Z triton_mm_59 0.0112 ms 95.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0377582Z triton_mm_60 0.0116 ms 92.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0377829Z triton_mm_54 0.0121 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0378066Z triton_mm_68 0.0123 ms 87.0% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0378304Z triton_mm_61 0.0123 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0378541Z triton_mm_63 0.0128 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0378688Z SingleProcess AUTOTUNE benchmarking takes 0.2593 seconds and 0.8529 seconds precompiling for 39 choices 2025-12-04T12:10:20.0378756Z =================================== FAILURES =================================== 2025-12-04T12:10:20.0378916Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.0378977Z Traceback (most recent call last): 2025-12-04T12:10:20.0379149Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0379206Z method(*args, **kwargs) 2025-12-04T12:10:20.0379378Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.0379435Z method(*args, **kwargs) 2025-12-04T12:10:20.0379603Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.0379656Z with policy(): 2025-12-04T12:10:20.0379834Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.0379893Z raise RuntimeError(msg) 2025-12-04T12:10:20.0380331Z RuntimeError: CUDA 
driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 3017801728 and is now 3982491648. 2025-12-04T12:10:20.0380333Z 2025-12-04T12:10:20.0380438Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0380715Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0380718Z 2025-12-04T12:10:20.0380823Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0380911Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0380970Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0381042Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0381608Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.0381737Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0381806Z graph_break [] 2025-12-04T12:10:20.0381886Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.0381978Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0382475Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.0382540Z current_size = base.storage().size() 2025-12-04T12:10:20.0382598Z Autotune Choices Stats: 2025-12-04T12:10:20.0382980Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.01116000022739172, "best_triton_pos": 0} 2025-12-04T12:10:20.0383065Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.0383132Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0383268Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0383518Z triton_mm_33 0.0112 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0383758Z triton_mm_22 0.0112 ms 99.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0384009Z triton_mm_30 0.0116 ms 95.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0384246Z triton_mm_21 0.0117 ms 
95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0384489Z triton_mm_34 0.0118 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0384734Z triton_mm_29 0.0118 ms 94.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0384973Z triton_mm_16 0.0121 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0385212Z triton_mm_23 0.0123 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0385447Z triton_mm_25 0.0132 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0385686Z triton_mm_15 0.0136 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0385849Z SingleProcess AUTOTUNE benchmarking takes 0.1763 seconds and 1.1307 seconds precompiling for 33 choices 2025-12-04T12:10:20.0385940Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.0385999Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0386071Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0386186Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0386653Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0386708Z graph_break [] 2025-12-04T12:10:20.0386790Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.0386881Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0386937Z Autotune Choices Stats: 2025-12-04T12:10:20.0387412Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.0106800002977252, "best_triton_pos": 1, "best_triton_time": 0.010999999940395355, "best_triton_kernel": "triton_mm_72", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:20.0387494Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.0387562Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.0387697Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0387756Z _scaled_mm 0.0107 ms 100.0% 2025-12-04T12:10:20.0388022Z triton_mm_72 0.0110 ms 97.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0388260Z triton_mm_67 0.0110 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0388501Z triton_mm_71 0.0111 ms 96.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0388747Z triton_mm_59 0.0112 ms 95.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0388987Z triton_mm_60 0.0116 ms 92.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0389222Z triton_mm_54 0.0121 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0389458Z triton_mm_68 0.0123 ms 87.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0389698Z triton_mm_61 0.0123 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0389956Z triton_mm_63 0.0128 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0390138Z SingleProcess AUTOTUNE benchmarking takes 0.2593 seconds and 0.8529 seconds precompiling for 39 choices 2025-12-04T12:10:20.0390226Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.0390285Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.0390356Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.0390471Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.0390934Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.0390989Z graph_break [] 2025-12-04T12:10:20.0391070Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.0391158Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.0391215Z Autotune Choices Stats: 2025-12-04T12:10:20.0391691Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.010319000110030174, "best_triton_pos": 1, "best_triton_time": 0.0106800002977252, "best_triton_kernel": "triton_mm_105", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2"} 2025-12-04T12:10:20.0391774Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.0391840Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 
2025-12-04T12:10:20.0391976Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.0392048Z _scaled_mm 0.0103 ms 100.0% 2025-12-04T12:10:20.0392293Z triton_mm_105 0.0107 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0392535Z triton_mm_109 0.0112 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0392787Z triton_mm_97 0.0112 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0393033Z triton_mm_110 0.0112 ms 91.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0393270Z triton_mm_92 0.0116 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0393514Z triton_mm_106 0.0118 ms 87.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0393752Z triton_mm_98 0.0120 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:20.0394022Z triton_mm_99 0.0125 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.0394262Z triton_mm_101 0.0128 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.0394405Z SingleProcess AUTOTUNE benchmarking takes 0.2575 seconds and 0.7200 seconds precompiling for 39 choices 2025-12-04T12:10:20.0394612Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2ffc76f1125ecc13.xml - 2025-12-04T12:10:20.0394689Z =========================== short test summary info ============================ 2025-12-04T12:10:20.0395291Z FAILED [1.4464s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 3017801728 and is now 3982491648. 2025-12-04T12:10:20.0395294Z 2025-12-04T12:10:20.0395385Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.0395657Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0395660Z 2025-12-04T12:10:20.0395763Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.0395842Z !!!!!!!!!!!!!!!!!!!!!!!!!! 
stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.0395927Z ================== 1 failed, 90 deselected, 2 rerun in 6.60s =================== 2025-12-04T12:10:20.0395980Z Got exit code 1 2025-12-04T12:10:20.0396046Z Retrying single test... 2025-12-04T12:10:20.0396207Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a5aa61b86ddf0f00.xml 2025-12-04T12:10:20.0396281Z ============================= test session starts ============================== 2025-12-04T12:10:20.0396408Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.0396466Z cachedir: .pytest_cache 2025-12-04T12:10:20.0396648Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.0396712Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.0396770Z configfile: pytest.ini 2025-12-04T12:10:20.0396951Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.0397043Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.0397312Z stepcurrent: skipping 90 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.0397372Z Running 1 items in this shard 2025-12-04T12:10:20.0397375Z 2025-12-04T12:10:20.0397720Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:57:45.951135300 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.0397722Z 2025-12-04T12:10:20.0397893Z [W1204 10:57:52.285472888 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.0397915Z 2025-12-04T12:10:20.0398243Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0398555Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0398703Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0399204Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0399476Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0399717Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0399942Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0400193Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0400439Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0400690Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0400933Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0401187Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0401430Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0401666Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0401906Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0402140Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0402382Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0402628Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0402891Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0403123Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0403368Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0403601Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0403807Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0404046Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0404286Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0404520Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0404725Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0404970Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0405213Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0405446Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0405697Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0405930Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0406149Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0406375Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0406550Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0406746Z E1204 
10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0407299Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfark1vkd/2z/c2z6iwqtseqjyipde5jatpr6uwitqmrj73v25gztvms2h4mwdnfs.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0407474Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0407704Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0407875Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0408177Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0408327Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0408599Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0408752Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0409022Z E1204 10:57:52.825000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0409193Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0409488Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0409638Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0409927Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0410191Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0410521Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0410827Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0410971Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0411465Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0411746Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0412000Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0412222Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0412438Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0412681Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0412918Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0413163Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0413396Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0413638Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0413871Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0414126Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0414359Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0414599Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0414842Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0415085Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0415317Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0415557Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0415791Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0416009Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0416253Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0416497Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0416729Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0416932Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0417165Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0417406Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0417639Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0417880Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0418112Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0418329Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0418565Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0418739Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0418933Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0419061Z E1204 10:57:52.825000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.0419233Z [W1204 10:57:52.325416744 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.0419236Z 2025-12-04T12:10:20.0419405Z [W1204 10:57:52.326805849 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.0419407Z 2025-12-04T12:10:20.0419735Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0420040Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0420212Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0420721Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0421041Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0421282Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0421502Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0421719Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0421962Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0422197Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0422437Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0422670Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0422924Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0423157Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0423398Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0423643Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0423886Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0424121Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0424363Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0424596Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0424836Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0425091Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0425295Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0425527Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0425769Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0426006Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0426213Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0426444Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0426684Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0426923Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0427165Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0427409Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0427625Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0427850Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0428032Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0428227Z E1204 
10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0428765Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfark1vkd/7i/c7ignet3sdjwpdnvdorijykmzjy73n6i4vn6qphywltpwti6vsgr.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0428927Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0429158Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0429338Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0429651Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0429798Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0430072Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0430259Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0430528Z E1204 10:57:52.863000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0430700Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0430980Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0431130Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0431418Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0431625Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0431967Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0432274Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0432419Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0432919Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0433188Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0433427Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0433651Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0433879Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0434134Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0434370Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0434613Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0434848Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0435089Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0435325Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0435566Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0435798Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0436040Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0436275Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0436526Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0436758Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0437014Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0437247Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0437453Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0437687Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0437928Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0438161Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0438378Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0438622Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0438862Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0439095Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0439337Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0439569Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0439787Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0440011Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0440221Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0440419Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0440536Z E1204 10:57:52.863000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.0440873Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0441177Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0441323Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0441822Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0442089Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0442331Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0442551Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0442779Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0443034Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0443268Z E1204 10:57:52.867000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0443509Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0443743Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0443985Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0444218Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0444461Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0444694Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0444935Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0445172Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0445423Z E1204 10:57:52.867000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0445656Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0445905Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0446139Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0446345Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0446578Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0446817Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0447049Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0447263Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0447507Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0447747Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0447979Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0448218Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0448451Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0448670Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0448895Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0449067Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0449260Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0449805Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfark1vkd/54/c54sbtaegy3l6t6h5uini7glder6nktyfzkn44eaouf5wsp3jzbm.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0449966Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0450237Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0450422Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0450726Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0450875Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0451146Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0451300Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0451569Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0451757Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0452052Z E1204 10:57:52.867000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0452200Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0452487Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0452696Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0453024Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0453330Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0453474Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0453963Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0454251Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0454492Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0454714Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0454941Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0455183Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0455421Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0455663Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0455896Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0456138Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0456379Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0456634Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0456866Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0457107Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0457339Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0457584Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0457818Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0458058Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0458291Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0458495Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0458739Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0458982Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0459215Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0459428Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0459661Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0459905Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0460171Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0460413Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0460647Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0460889Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0461116Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0461289Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0461482Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0461598Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.0461923Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0462228Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0462373Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0462860Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0463139Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0463379Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0463599Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0463825Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0464067Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0464302Z E1204 10:57:52.867000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0464542Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0464774Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0465016Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0465258Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0465511Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0465743Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0465982Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0466214Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0466455Z E1204 10:57:52.867000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0466687Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0466926Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0467161Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0467366Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0467607Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0467848Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0468080Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0468293Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0468526Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0468769Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0469001Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0469243Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0469475Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0469700Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0469941Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0470161Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0470355Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0470896Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfark1vkd/7a/c7a4bqqfq46giblif7grrpmrzw6ufaznee2vsz2mivxhs2gj7ijb.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0471059Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0471288Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0471456Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0471756Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0471906Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0472191Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0472344Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0472610Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0472793Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0473076Z E1204 10:57:52.867000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0473229Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0473520Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0473727Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0474055Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0474386Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0474530Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0475020Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0475290Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0475535Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0475756Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0475971Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0476213Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0476450Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0476703Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0476936Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0477177Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0477420Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0477663Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0477895Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0478136Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0478369Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0478621Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0478867Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0479107Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0479339Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0479544Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0479779Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0480021Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0480285Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0480488Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0480725Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0480966Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0481213Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0481455Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0481701Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0481917Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0482143Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0482316Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0482513Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0482629Z E1204 10:57:52.867000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.0482800Z [W1204 10:57:52.333116789 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.0482816Z 2025-12-04T12:10:20.0482984Z [W1204 10:57:52.334723571 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.0482998Z 2025-12-04T12:10:20.0483322Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0483628Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0483773Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0484260Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0484528Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0484766Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0484987Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0485202Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0485456Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0485689Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0485931Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0486172Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0486414Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0486649Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0486888Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0487121Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0487362Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0487623Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0487864Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0488095Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0488338Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0488570Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0488776Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0489008Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0489248Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0489482Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0489687Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0489928Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0490207Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0490440Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0490697Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0490930Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0491148Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0491371Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0491544Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0491737Z E1204 
10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0492286Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfark1vkd/ur/curozslustiehcdynpneeg4lh67lyjd63uxzx3kcgevewo5tqnd5.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0492460Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0492691Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0492862Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0493160Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0493309Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0493578Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0493730Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0494000Z E1204 10:57:52.874000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0494170Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0494466Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0494614Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0494903Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0495454Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0495785Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0496093Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0496237Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0496732Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0497017Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0497256Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0497478Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0497693Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0497936Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0498172Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0498415Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0498647Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0498890Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0499125Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0499374Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0499607Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0499856Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0500125Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0500368Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0500600Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0500840Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0501071Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0501289Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0501534Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0501776Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0502007Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0502211Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0502447Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0502689Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0502921Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0503163Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0503399Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0503618Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0503856Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0504031Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0504224Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0504361Z E1204 10:57:52.874000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.0504682Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0504991Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0505135Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0505622Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0505909Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0506147Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0506368Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0506583Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0506825Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0507060Z E1204 10:57:52.875000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0507301Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0507534Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0507775Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0508010Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0508260Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0508495Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0508745Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0508977Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0509221Z E1204 10:57:52.875000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0509453Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0509693Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0509926Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0510172Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0510419Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0510660Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0510892Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0511095Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0511329Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0511571Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0511803Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0512045Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0512276Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0512496Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0512735Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0512908Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0513099Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0513646Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfark1vkd/oe/coeqhkqsrdi6lh2mzkuztx2fll5fyimy4fn2nbblhkt7k3jpoehl.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.0513811Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.0514038Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.0514208Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.0514509Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.0514669Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.0514951Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.0515102Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.0515370Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.0515540Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.0515824Z E1204 10:57:52.875000 680710 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.0515974Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.0516262Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.0516469Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.0516799Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0517114Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0517258Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0517758Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0518031Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0518271Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0518493Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0518709Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0518951Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0519194Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0519447Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0519680Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0519921Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0520195Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0520437Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0520671Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0520911Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0521145Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0521388Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0521643Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0521885Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0522117Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0522335Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0522568Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0522812Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0523045Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0523247Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0523481Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0523744Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0523977Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0524217Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0524449Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0524668Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.0524894Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.0525068Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.0525261Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.0525379Z E1204 10:57:52.875000 680710 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.0525448Z ('RERUN', {'yellow': True}) [10.7831s] [100%] 2025-12-04T12:10:20.0525795Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:57:54.247502001 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.0525799Z 2025-12-04T12:10:20.0525968Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0526276Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.0526591Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0526736Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0527226Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0527493Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0527735Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0527967Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0528195Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0528438Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0528672Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0528918Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0529152Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0529395Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0529628Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0529868Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0530146Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0530392Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0530638Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0530853Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0531088Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0531303Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0531546Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0531780Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0531983Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0532218Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0532474Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0532723Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0532929Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0533161Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0533375Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0533578Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0533813Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0534024Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0534227Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0534460Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0534701Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0534947Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0535189Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0535422Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0535645Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0535869Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0536084Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0536324Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0536557Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0536798Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0537059Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0537301Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0537533Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0537773Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0538005Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0538249Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0538481Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0538721Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0538953Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0539195Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0539439Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0539680Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0539925Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0540199Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0540435Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0540677Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0540910Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0541156Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0541404Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0541717Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0541927Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.0542168Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0542404Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0542644Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0542878Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0543118Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0543352Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0543593Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0543827Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0544083Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0544315Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0544568Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0544803Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0545018Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0545221Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0545455Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0545669Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0545905Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0546133Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0546373Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0546606Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0546818Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0547021Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0547256Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0547468Z E1204 10:57:54.797000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0547672Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0547904Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0548147Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0548395Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0548637Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0548870Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0549090Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0549315Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0549530Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0549775Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0550011Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0550256Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0550500Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0550715Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0550958Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0551192Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0551439Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0551676Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0551917Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0552151Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0552394Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0552630Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0552884Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0553120Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0553336Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0553556Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0553791Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0554009Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0554223Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0554437Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0554686Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0554953Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0555201Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0555437Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0555681Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0555917Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0556160Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0556395Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0556637Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0556872Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0557090Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0557316Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0557526Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0557751Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0557979Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:20.0558224Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0558461Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0558678Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0558890Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0559105Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0559366Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0559605Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0559848Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0560083Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0560360Z E1204 10:57:54.797000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0560596Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0560839Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0561074Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0561317Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0561558Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0561817Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0562053Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0562305Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0562541Z E1204 10:57:54.797000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0562787Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0563021Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0563263Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0563497Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0563754Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0564004Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0564219Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0564426Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0564662Z E1204 10:57:54.797000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0564906Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0565141Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0565383Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0565617Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0565862Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0566099Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0566353Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0566587Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0566837Z E1204 10:57:54.797000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0567073Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0567282Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0567518Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0567762Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0567996Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0568248Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0568496Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0568726Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0568942Z E1204 10:57:54.797000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0569156Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0569372Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0569616Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0569854Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0570082Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0570336Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0570550Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0570778Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0571020Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0571271Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0571491Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0571705Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0571913Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0572075Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.0572313Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0572518Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0572780Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0573023Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0573256Z E1204 10:57:54.797000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0573470Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0573677Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0573916Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0574129Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0574334Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0574569Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0574773Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0575019Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0575223Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0575458Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0575673Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0575910Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0576154Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0576388Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0576632Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0576866Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0577119Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0577366Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0577607Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0577845Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0578087Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0578325Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0578539Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0578747Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0578983Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0579211Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0579438Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:20.0579651Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0579869Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0580161Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0580401Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0580647Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0580881Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0581086Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0581322Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0581579Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0581829Z E1204 10:57:54.797000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0582075Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0582312Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0582541Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0582760Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0582973Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0583182Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0583387Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0583625Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0583884Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0584120Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0584363Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0584609Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0584817Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0585054Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0585296Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0585531Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0585773Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0586019Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0586235Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0586470Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0586713Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0586946Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0587190Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0587425Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0587652Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0587870Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0588085Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0588317Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0588560Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0588794Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0589006Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0589242Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0589485Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0589721Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0589963Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0590235Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0590475Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0590705Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0590919Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0591133Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0591376Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0591612Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0591854Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0592093Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0592335Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0592571Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0592788Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0593025Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0593267Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0593512Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0593755Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0593990Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0594219Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0594435Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0594650Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0594877Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0595129Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0595363Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0595604Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0595838Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0596080Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0596316Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0596558Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0596792Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0597036Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0597280Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0597493Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.0597709Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.0597923Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.0598139Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.0598371Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.0598595Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.0598808Z E1204 
10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.0599015Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.0599231Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.0599429Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.0599572Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.0599732Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.0599858Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.0600000Z E1204 10:57:54.797000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0600212Z [W1204 10:57:54.262183248 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.0600215Z 2025-12-04T12:10:20.0600373Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0600684Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0600995Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0601140Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0601648Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0601915Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0602157Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0602388Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0602604Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0602847Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0603081Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0603324Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0603556Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0603829Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0604066Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0604307Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0604543Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0604783Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0605017Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0605228Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0605451Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0605666Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0605907Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0606150Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0606355Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0606587Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0606836Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0607071Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0607274Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0607506Z E1204 
10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0607718Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0607926Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0608183Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0608398Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0609831Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0610069Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0610353Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0610592Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0610832Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0611065Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0611276Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0611501Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0611740Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0611988Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0612220Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0612477Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0612712Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0612955Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0613190Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0613431Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0613668Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0613939Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0614172Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0614414Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0614648Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0614889Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0615124Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0615364Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0615598Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0615838Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0616071Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0616323Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0616557Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0616808Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0617040Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0617258Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0617468Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.0617712Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0617945Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0618197Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0618444Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0618684Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0618917Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0619158Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0619392Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0619634Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0619869Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0620160Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0620393Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0620608Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0620822Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0621055Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0621276Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0621500Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.0621718Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0621960Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0622193Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0622405Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0622607Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0622869Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0623080Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0623282Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0623514Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0623758Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0623993Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0624233Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0624464Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0624676Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0624898Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0625120Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0625368Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0625601Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0625829Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0626043Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0626259Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0626503Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0626738Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0626982Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0627233Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0627476Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0627709Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0627954Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0628190Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0628433Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0628668Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0628881Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0629087Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0629321Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0629550Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0629764Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0629979Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0630272Z 
E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0630507Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0630751Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0630984Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0631227Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0631463Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0631730Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0631968Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0632209Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0632445Z 
E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0632663Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0632876Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0633083Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0633306Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0633523Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0633764Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0634014Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0634231Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0634443Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0634668Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0634911Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0635147Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0635388Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0635622Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0635866Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0636121Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0636363Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0636597Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0636839Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0637072Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0637315Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0637548Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0637788Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0638024Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0638266Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0638512Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0638752Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0639002Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0639244Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0639481Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0639694Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0639897Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0640167Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0640420Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0640669Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0640911Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0641145Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0641387Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0641622Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0641865Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0642101Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0642343Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0642578Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0642785Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0643031Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0643272Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0643517Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0643760Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0643996Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0644225Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0644441Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0644655Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0644879Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0645135Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0645371Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0645597Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0645814Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0646031Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0646248Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0646490Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0646724Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0646941Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0647154Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.0647371Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0647535Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.0647770Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0647986Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0648224Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0648468Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0648702Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0648917Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0649121Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0649367Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0649591Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0649801Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0650037Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0650271Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0650507Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0650710Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0650945Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0651148Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0651383Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0651638Z E1204 10:57:54.801000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0651874Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0652115Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0652360Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0652603Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0652837Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0653079Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0653315Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0653556Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0653803Z E1204 10:57:54.801000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0654032Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0654236Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0654470Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0654699Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0654918Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0655132Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0655349Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0655592Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0655828Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0656088Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0656323Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0656528Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0656770Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0657015Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0657253Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0657494Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0657728Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0657957Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0658184Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0658411Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0658617Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0658821Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0659056Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0659298Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0659534Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0659776Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0660011Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0660247Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0660482Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0660737Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0660971Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0661225Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0661460Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0661665Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0661899Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0662142Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0662377Z E1204 10:57:54.801000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0662630Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0662882Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0663109Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0663325Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0663540Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0663756Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0663999Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0664233Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0664438Z E1204 10:57:54.801000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0664672Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0664923Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0665156Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0665397Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0665642Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0665872Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0666091Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0666305Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0666519Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0666762Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0667005Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0667258Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0667493Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0667734Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0667969Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0668175Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0668410Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0668650Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0668885Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0669128Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0669370Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0669598Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0669814Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0670043Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0670294Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0670539Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0670774Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0671014Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0671249Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0671505Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0671761Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0672004Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0672240Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0672481Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0672717Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0672929Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.0673148Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.0673355Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.0673565Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.0673807Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.0674030Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.0674241Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.0674461Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.0674669Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.0674857Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.0675000Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.0675161Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.0675280Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.0675423Z E1204 10:57:54.801000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0675596Z [W1204 10:57:54.264767630 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.0675608Z 2025-12-04T12:10:20.0675767Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0676090Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0676399Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0676545Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0677037Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0677307Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0677546Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0677765Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0677982Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0678233Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0678468Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0678710Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0678952Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0679195Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0679428Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0679669Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0679902Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0680181Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0680441Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0680651Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0680874Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0681087Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0681328Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0681562Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0681766Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0681998Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0682242Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0682476Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0682691Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0682924Z E1204 
10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0683133Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0683351Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0683586Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0683797Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0683999Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0684231Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0684472Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0684714Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0684968Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0685202Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0685413Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0685635Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0685850Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0686093Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0686325Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0686566Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0686800Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0687050Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0687283Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0687522Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0687768Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0688010Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0688245Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0688487Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0688718Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0688959Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0689220Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0689461Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0689693Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0689934Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0690206Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0690449Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0690682Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0690925Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0691159Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0691375Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0691598Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.0691839Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0692072Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0692325Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0692559Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0692802Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0693036Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0693276Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0693521Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0693774Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0694007Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0694248Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0694482Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0694694Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0694898Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0695132Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0695342Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0695566Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.0695781Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0696034Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0696267Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0696479Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0696691Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0696925Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0697137Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0697338Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0697571Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0697813Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0698055Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0698308Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0698541Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0698753Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0698974Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0699189Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0699434Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0699668Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0699886Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0700225Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0700456Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0700701Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0700934Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0701187Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0701422Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0701666Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0701900Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0702142Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0702376Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0702643Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0702878Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0703091Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0703297Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0703531Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0703751Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0703964Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0704179Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0704421Z 
E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0704660Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0704913Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0705147Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0705390Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0705641Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0705884Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0706119Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0706360Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0706596Z 
E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0706817Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0707051Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0707258Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0707482Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0707698Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0707939Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0708174Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0708391Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0708605Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0708821Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0709064Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0709308Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0709549Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0709784Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0710033Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0710312Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0710556Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0710796Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0711037Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0711272Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0711541Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0711774Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0712016Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0712250Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0712493Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0712729Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0712969Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0713204Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0713447Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0713681Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0713908Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0714113Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0714348Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0714602Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0714838Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0715080Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0715314Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0715555Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0715799Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0716054Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0716288Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0716530Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0716766Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0716972Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0717207Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0717448Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0717683Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0717925Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0718162Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0718400Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0718619Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0718842Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0719057Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0719300Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0719535Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0719763Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0719984Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0720243Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0720473Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0720716Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0720949Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0721165Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0721379Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.0721586Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0721749Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.0721983Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0722190Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0722427Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0722690Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0722926Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0723138Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0723356Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0723592Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0723804Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0724008Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0724242Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0724448Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0724696Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0724914Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0725149Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0725352Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0725588Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0725830Z E1204 10:57:54.803000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0726067Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0726307Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0726541Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0726784Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0727029Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0727273Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0727506Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0727758Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0727994Z E1204 10:57:54.803000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0728208Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0728413Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0728648Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0728881Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0729106Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0729331Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0729546Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0729788Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0730022Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0730305Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0730540Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0730744Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0730982Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0731225Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0731458Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0731714Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0731948Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0732190Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0732408Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0732623Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0732832Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0733037Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0733273Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0733527Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0733774Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0734016Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0734251Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0734457Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0734691Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0734937Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0735170Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0735412Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0735645Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0735851Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0736094Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0736336Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0736578Z E1204 10:57:54.803000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0736822Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0737060Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0737286Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0737504Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0737721Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0737947Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0738210Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0738447Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0738650Z E1204 10:57:54.803000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0738885Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0739128Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0739368Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0739608Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0739844Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0740070Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0740327Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0740553Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0740769Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0741024Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0741259Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0741505Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0741739Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0741981Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0742216Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0742433Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0742685Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0742925Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0743162Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0743404Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0743640Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0743868Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0744085Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0744300Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0744514Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0744758Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0745002Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0745245Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0745490Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0745733Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0745970Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0746211Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0746445Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0746687Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0746931Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0747156Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.0747373Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.0747580Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.0747790Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.0748020Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.0748242Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.0748457Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.0748663Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.0748871Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.0749058Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.0749208Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.0749370Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.0749488Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.0749628Z E1204 10:57:54.803000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0749810Z [W1204 10:57:54.304895994 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.0749814Z 2025-12-04T12:10:20.0749973Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0750324Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0750630Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0750774Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0751267Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0751562Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0751802Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0752021Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0752235Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0752477Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0752713Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0752955Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0753191Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0753435Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0753681Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0753922Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0754154Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0754409Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0754643Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0754856Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0755079Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0755292Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0755535Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0755798Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0756002Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0756235Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0756478Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0756712Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0756916Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0757152Z E1204 
10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0757362Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0757566Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0757799Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0758020Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0758223Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0758454Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0758704Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0758937Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0759182Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0759414Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0759626Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0759849Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0760072Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0760360Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0760593Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0760835Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0761068Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0761312Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0761551Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0761793Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0762026Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0762267Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0762512Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0762753Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0762988Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0763243Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0763477Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0763720Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0763951Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0764192Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0764424Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0764692Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0764925Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0765164Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0765398Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0765613Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0765828Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.0766067Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0766301Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0766542Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0766774Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0767025Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0767258Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0767502Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0767744Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0767985Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0768219Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0768458Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0768692Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0768912Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0769129Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0769363Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0769574Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0769797Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.0770011Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0770293Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0770525Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0770737Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0770941Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0771174Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0771399Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0771603Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0771840Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0772099Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0772333Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0772577Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0772809Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0773023Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0773246Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0773476Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0773737Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0773972Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0774189Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0774403Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0774619Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0774862Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0775097Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0775341Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0775579Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0775833Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0776068Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0776311Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0776557Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0776800Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0777037Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0777250Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0777458Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0777693Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0777919Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0778148Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0778363Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0778604Z 
E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0778840Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0779085Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0779320Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0779564Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0779798Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0780041Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0780335Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0780578Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0780813Z 
E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0781041Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0781256Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0781469Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0781694Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0781908Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0782152Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0782400Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0782632Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0782846Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0783061Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0783304Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0783541Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0783784Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0784019Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0784261Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0784496Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0784749Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0784985Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0785226Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0785471Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0785716Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0785951Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0786192Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0786426Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0786669Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0786925Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0787167Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0787403Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0787647Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0787883Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0788098Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0788304Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0788541Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0788782Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0789019Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0789277Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0789511Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0789754Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0789999Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0790272Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0790511Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0790754Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0790988Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0791211Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0791460Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0791702Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0791937Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0792181Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0792428Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0792658Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0792875Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0793089Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0793305Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0793548Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0793798Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0794026Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0794243Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0794469Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0794685Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0794929Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0795165Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0795383Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0795598Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.0795825Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0795988Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.0796223Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0796427Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0796663Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0796905Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0797139Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0797351Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0797556Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0797792Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0798006Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0798218Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0798453Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0798667Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0798902Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0799108Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0799342Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0799546Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0799784Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0800037Z E1204 10:57:54.844000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0800314Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0800562Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0800796Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0801038Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0801274Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0801518Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0801752Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0801995Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0802228Z E1204 10:57:54.844000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0802445Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0802663Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0802898Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0803139Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0803357Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0803573Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0803788Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0804032Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0804265Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0804521Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0804777Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0804981Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0805215Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0805456Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0805691Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0805940Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0806175Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0806402Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0806618Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0806833Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0807047Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0807254Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0807497Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0807740Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0807978Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0808222Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0808456Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0808661Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0808909Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0809162Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0809395Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0809636Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0809870Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0810076Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0810364Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0810606Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0810842Z E1204 10:57:54.844000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0811084Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0811320Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0811560Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0811777Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0812005Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0812221Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0812467Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0812700Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0812904Z E1204 10:57:54.844000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0813138Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0813393Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0813638Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0813880Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0814115Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0814342Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0814560Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0814774Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0814989Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0815231Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0815467Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0815709Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0815952Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0816196Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0816439Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0816648Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0816883Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0817126Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0817362Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0817605Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0817850Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0818089Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0818308Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0818520Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0818737Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0818981Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0819216Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0819459Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0819694Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0819937Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0820205Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0820460Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0820695Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0820951Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0821187Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0821402Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.0821620Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.0821825Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.0822037Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.0822289Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.0822533Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.0822746Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.0822951Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.0823159Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.0823348Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.0823492Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.0823653Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.0823772Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.0823914Z E1204 10:57:54.844000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0824086Z [W1204 10:57:54.306950131 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.0824089Z 2025-12-04T12:10:20.0824249Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0824556Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0824876Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0825021Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0825526Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0825795Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0826033Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0826255Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0826469Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0826722Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0826971Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0827212Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0827446Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0827686Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0827920Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0828163Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0828396Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0828637Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0828870Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0829092Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0829314Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0829527Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0829780Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0830013Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0830264Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0830497Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0830737Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0830971Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0831190Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0831440Z E1204 
10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0831651Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0831855Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0832087Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0832299Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0832503Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0832736Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0832976Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0833215Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0833457Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0833701Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0833913Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0834150Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0834368Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0834611Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0834844Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0835085Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0835316Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0835567Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0835810Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0836055Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0836288Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0836531Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0836764Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0837005Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0837240Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0837482Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0837716Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0837967Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0838199Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0838442Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0838695Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0838936Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0839171Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0839414Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0839652Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0839873Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0840129Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.0840395Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0840629Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0840870Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0841107Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0841356Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0841588Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0841831Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0842066Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0842308Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0842555Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0842798Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0843031Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0843256Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0843463Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0843696Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0843908Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0844130Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.0844346Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0844605Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0844852Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0845063Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0845265Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0845499Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0845709Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0845912Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0846146Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0846388Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0846624Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0846875Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0847112Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0847323Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0847556Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0847771Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0848017Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0848252Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0848469Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0848687Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0848912Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0849167Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0849402Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0849644Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0849881Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0850167Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0850404Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0850646Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0850884Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0851127Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0851382Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0851598Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0851803Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0852054Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0852273Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0852488Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0852705Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0852947Z 
E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0853188Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0853447Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0853701Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0853945Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0854180Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0854425Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0854662Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0854909Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0855142Z 
E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0855360Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0855576Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0855785Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0856033Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0856248Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0856504Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0856740Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0856964Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0857180Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0857395Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0857642Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0857889Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0858151Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0858385Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0858632Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0858873Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0859117Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0859354Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0859597Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0859835Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0860078Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0860376Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0860623Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0860860Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0861120Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0861356Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0861608Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0861845Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0862096Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0862337Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0862588Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0862796Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0863035Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0863281Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0863520Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0863767Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0864010Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0864254Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0864497Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0864743Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0865026Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0865275Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0865513Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0865737Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0865975Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0866227Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0866463Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0866714Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0866960Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0867223Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0867452Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0867672Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0867895Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0868141Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0868384Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0868621Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0868841Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0869059Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0869281Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0869544Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0869783Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0870008Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0870292Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.0870504Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0870680Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.0870918Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0871132Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0871370Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0871640Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0871910Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0872128Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0872339Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0872578Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0872799Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0873007Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0873252Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0873464Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0873708Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0873919Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0874172Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0874392Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0874637Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0874900Z E1204 10:57:54.846000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0875146Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0875399Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0875646Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0875893Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0876167Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0876430Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0876675Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0876929Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0877168Z E1204 10:57:54.846000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0877392Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0877605Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0877849Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0878082Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0878312Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0878540Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0878783Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0879039Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0879300Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0879554Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0879797Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0880015Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0880297Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0880544Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0880807Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0881071Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0881317Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0881549Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0881778Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0882004Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0882225Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0882437Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0882677Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0882931Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0883171Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0883442Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0883685Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0883893Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0884160Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0884409Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0884653Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0884898Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0885144Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0885371Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0885625Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0885878Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0886116Z E1204 10:57:54.846000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0886371Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0886617Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0886851Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0887076Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0887295Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0887518Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0887764Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0888020Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0888232Z E1204 10:57:54.846000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0888470Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0888733Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0888973Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0889223Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0889460Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0889695Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0889941Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0890218Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0890442Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0890688Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0890933Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0891180Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0891427Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0891688Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0891927Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0892139Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0892378Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0892647Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0892885Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0893148Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0893390Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0893624Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0893849Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0894066Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0894290Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0894570Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0894828Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0895077Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0895315Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0895565Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0895806Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0896059Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0896297Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0896549Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0896794Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0897011Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.0897249Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.0897458Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.0897691Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.0897925Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.0898155Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.0898376Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.0898584Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.0898798Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.0898988Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.0899170Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.0899338Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.0899464Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.0899607Z E1204 10:57:54.846000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0899786Z [W1204 10:57:54.308993979 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.0899789Z 2025-12-04T12:10:20.0899954Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0900316Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0900632Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0900780Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0901293Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0901586Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0901829Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0902059Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0902298Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0905121Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0905363Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0905613Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0905855Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0906104Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0906374Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0906653Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0906896Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0907141Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0907380Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0907596Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0907828Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0908046Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0908296Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0908536Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0908746Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0909000Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0909244Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0909510Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0909829Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0910072Z E1204 
10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0910330Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0910535Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0910775Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0910991Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0911222Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0911459Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0911707Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0911948Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0912195Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0912437Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0912652Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0912882Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0913100Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0913350Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0913610Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0913855Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0914112Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0914357Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0914633Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0914879Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0915120Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0915369Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0919847Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0920191Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0920435Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0920696Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0920940Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0921186Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0921436Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0921682Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0921929Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0922175Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0922450Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0922704Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0922944Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0923196Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0923440Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.0923695Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0923931Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0924179Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0924422Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0924666Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0924924Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0925168Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0925409Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0925651Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0925892Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0926141Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0926376Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0926596Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0926801Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0927058Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0927274Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0927506Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.0927736Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0927994Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0928239Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0928455Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0928666Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0928902Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0929121Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0929343Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0929577Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0929825Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0930061Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0930350Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0930586Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0930807Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0931037Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0931255Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0931510Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0931776Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0932002Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0932230Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0932453Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0932721Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0932957Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0933207Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0933444Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0933694Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0933949Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0934193Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0934431Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0934674Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0934917Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0935135Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0935345Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0935585Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0935805Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0936024Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0936251Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0936496Z 
E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0936741Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0936990Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0937245Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0937492Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0937731Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0937976Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0938217Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0938472Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0938712Z 
E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0938932Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0939147Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0939359Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0939589Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0939810Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0940052Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0940381Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0940604Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0940838Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0941056Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0941314Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0941554Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0941813Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0942053Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0942298Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0942534Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0942780Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0943030Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0943277Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0943512Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0943763Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0944004Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0944248Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0944489Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0944733Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0944971Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0945228Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0945464Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0945712Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0945958Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0946191Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0946399Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0946639Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0946888Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0947126Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0947375Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0947627Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0947874Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0948111Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0948359Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0948601Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0948846Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0949084Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0949291Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0949528Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0949782Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0950023Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0950340Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0950591Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0950836Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0951057Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0951274Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0951490Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0951737Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0951994Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0952227Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0952447Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0952661Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0952879Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0953124Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0953363Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0953583Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0953798Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.0954011Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0954180Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.0954434Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0954641Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0954891Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0955137Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0955388Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0955604Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0955810Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0956049Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0956264Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0956486Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0956726Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0956932Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0957171Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0957376Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0957616Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0957821Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0958062Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0958311Z E1204 10:57:54.848000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0958548Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0958805Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0959040Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0959287Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0959534Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0959794Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0960034Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0960320Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0960562Z E1204 10:57:54.848000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0960779Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0961004Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0961240Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0961473Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0961697Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0961913Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0962137Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0962384Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0962623Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0962868Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0963109Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0963333Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0963570Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0963817Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0964079Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0964347Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0964585Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0964968Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0965192Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0965408Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0965637Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.0965844Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0966084Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0966330Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0966571Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0966819Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0967056Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.0967265Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0967504Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0967750Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0968002Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0968249Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0968492Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0968710Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0968961Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0969206Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0969447Z E1204 10:57:54.848000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0969693Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0969929Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0970218Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0970437Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0970658Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0970877Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0971127Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0971367Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0971572Z E1204 10:57:54.848000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0971808Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0972054Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0972291Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0972551Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0972790Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0973023Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0973259Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0973495Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0973713Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0973957Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0974192Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0974438Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0974690Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0974933Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0975171Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0975376Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0975614Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0975858Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0976096Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0976340Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0976575Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0976806Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.0977041Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0977258Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.0977473Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0977731Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0977980Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0978223Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0978466Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0978709Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0978947Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0979203Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0979440Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0979686Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0979922Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0980167Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.0980386Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.0980595Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.0980806Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.0981041Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.0981268Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.0981495Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.0981704Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.0981912Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.0982123Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.0982280Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.0982449Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.0982572Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.0982715Z E1204 10:57:54.848000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.0982787Z ('RERUN', {'yellow': True}) [1.7903s] [100%] 2025-12-04T12:10:20.0983153Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:57:56.899259958 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.0983157Z 2025-12-04T12:10:20.0983320Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.0983648Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.0983958Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.0984106Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.0984606Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.0984879Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.0985122Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.0985345Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.0985562Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0985808Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0986056Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0986300Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0986548Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0986790Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0987072Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0987318Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0987555Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0987797Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0988030Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0988256Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0988479Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0988696Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0988937Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0989174Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0989386Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0989619Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0989861Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0990130Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0990336Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0990583Z E1204 
10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0990800Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0991017Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0991251Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0991482Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0991684Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.0991919Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0992160Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0992395Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0992652Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0992884Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0993098Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.0993322Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.0993539Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.0993783Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0994018Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0994261Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0994497Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0994742Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0994987Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0995230Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0995477Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0995719Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0995966Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0996208Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0996442Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0996683Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0996926Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0997185Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0997420Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0997666Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0997901Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0998147Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0998382Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0998631Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.0998872Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.0999089Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.0999306Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.0999566Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.0999805Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1000058Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1000330Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1000577Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1000811Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1001055Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1001289Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1001537Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1001791Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1002035Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1002273Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1002487Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1002697Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1002932Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1003149Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1003375Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1003597Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1003845Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1004090Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1004304Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1004519Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1004755Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1004982Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1005186Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1005422Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1005664Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1005900Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1006153Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1006388Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1006599Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1006825Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1007042Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1007290Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1007528Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1007746Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1007963Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1008181Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1008437Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1008675Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1008928Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1009164Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1009419Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1009660Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1009906Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1010179Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1010427Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1010682Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1010902Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1011108Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1011348Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1011567Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1011790Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1012013Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1012257Z 
E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1012499Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1012743Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1012997Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1013244Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1013493Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1013739Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1013990Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1014238Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1014473Z 
E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1014693Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1014910Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1015136Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1015363Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1015578Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1015823Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1016061Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1016284Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1016500Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1016714Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1016962Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1017197Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1017453Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1017689Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1017947Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1018185Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1018439Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1018677Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1018919Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1019156Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1019399Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1019650Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1019893Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1020156Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1020402Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1020638Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1020884Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1021119Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1021365Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1021602Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1021833Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1022041Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1022277Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1022541Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1022792Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1023042Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1023279Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1023522Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1023759Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1024002Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1024256Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1024500Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1024737Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1024945Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1025181Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1025426Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1025661Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1025908Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1026148Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1026390Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1026620Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1026833Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1027065Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1027318Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1027557Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1027790Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1028009Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1028229Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1028521Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1028787Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1029025Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1029248Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1029468Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.1029679Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1029848Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1030085Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1030377Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1030615Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1030864Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1031122Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1031337Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1031547Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1031796Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1032035Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1032242Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1032487Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1032699Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1032936Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1033162Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1033399Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1033608Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1033847Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1034093Z E1204 10:57:56.438000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1034335Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1034580Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1034820Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1035064Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1035305Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1035560Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1035800Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1036047Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1036293Z E1204 10:57:56.438000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1036524Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1036732Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1036973Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1037204Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1037426Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1037664Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1037882Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1038131Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1038370Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1038619Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1038861Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1039071Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1039311Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1039555Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1039795Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1040050Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1040328Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1040557Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1040796Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1041027Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1041237Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1041445Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1041683Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1041932Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1042182Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1042430Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1042671Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1042880Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1043118Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1043365Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1043604Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1043848Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1044089Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1044299Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1044547Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1044795Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1045034Z E1204 10:57:56.438000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1045293Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1045540Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1045773Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1045995Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1046210Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1046433Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1046692Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1046934Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1047139Z E1204 10:57:56.438000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1047377Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1047623Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1047860Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1048107Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1048343Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1048577Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1048799Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1049028Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1049249Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1049496Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1049753Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1050006Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1050284Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1050532Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1050770Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1050980Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1051232Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1051478Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1051715Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1051963Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1052203Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1052433Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1052654Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1052869Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1053090Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1053339Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1053591Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1053838Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1054075Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1054336Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1054585Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1054832Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1055068Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1055315Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1055556Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1055784Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1056008Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1056215Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1056432Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1056663Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1056892Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1057110Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1057321Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1057531Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1057718Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1057867Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.1058039Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.1058164Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.1058306Z E1204 10:57:56.438000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.1058484Z [W1204 10:57:56.901509463 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1058487Z 2025-12-04T12:10:20.1058659Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.1058984Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1059299Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1059446Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1059946Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1060271Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1060513Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1060738Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1060956Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1061204Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1061445Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1061693Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1061931Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1062175Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1062415Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1062670Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1062909Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1063162Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1063399Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1063635Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1063859Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1064077Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1064319Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1064558Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1064779Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1065018Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1065264Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1065499Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1065710Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1065946Z E1204 
10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1066161Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1066366Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1066603Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1066820Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1067039Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1067276Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1067520Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1067768Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1068024Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1068264Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1068483Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1068707Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1068927Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1069182Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1069421Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1069664Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1069904Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1070234Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1070469Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1070714Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1070952Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1071199Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1071434Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1071691Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1071930Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1072186Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1072424Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1072682Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1072919Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1073167Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1073403Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1073654Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1073903Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1074150Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1074383Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1074606Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1074824Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1075067Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1075305Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1075546Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1075784Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1076029Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1076279Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1076523Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1076767Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1077026Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1077264Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1077510Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1077744Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1077962Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1078171Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1078415Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1078632Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1078855Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1079075Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1079319Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1079557Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1079773Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1079978Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1080248Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1080461Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1080685Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1080920Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1081165Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1081416Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1081674Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1081911Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1082125Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1082354Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1082569Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1082843Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1083082Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1083302Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1083519Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1083736Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1083986Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1084223Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1084473Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1084712Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1084956Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1085208Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1085451Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1085702Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1085946Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1086198Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1086418Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1086625Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1086867Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1087085Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1087320Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1087537Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1087786Z 
E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1088026Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1088271Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1088514Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1088759Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1088998Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1089241Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1089481Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1089744Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1089981Z 
E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1090252Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1090469Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1090697Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1090924Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1091144Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1091392Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1091627Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1091864Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1092078Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1092299Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1092543Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1092784Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1093032Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1093272Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1093521Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1093757Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1094005Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1094257Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1094499Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1094750Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1094992Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1095243Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1095485Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1095723Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1095968Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1096203Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1096458Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1096695Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1096940Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1097175Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1097397Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1097607Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1097842Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1098086Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1098319Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1098565Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1098811Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1099056Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1099303Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1099563Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1099802Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1100044Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1100432Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1100638Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1100879Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1101140Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1101375Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1101621Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1101855Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1102088Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1102308Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1102522Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1102740Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1102984Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1103235Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1103463Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1103684Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1103913Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1104141Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1104388Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1104623Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1104842Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1105057Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.1105268Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1105442Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1105680Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1105887Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1106123Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1106368Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1106604Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1106821Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1107026Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1107265Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1107482Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1107698Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1107935Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1108139Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1108387Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1108602Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1108840Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1109050Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1109285Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1109530Z E1204 10:57:56.440000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1109766Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1110030Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1110304Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1110550Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1110788Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1111032Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1111271Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1111515Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1111751Z E1204 10:57:56.440000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1111967Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1112186Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1112424Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1112652Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1112884Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1113114Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1113335Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1113581Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1113816Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1114061Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1114310Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1114517Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1114751Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1114997Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1115237Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1115483Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1115723Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1115953Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1116176Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1116393Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1116616Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1116827Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1117064Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1117332Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1117579Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1117828Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1118064Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1118275Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1118515Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1118759Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1119011Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1119254Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1119496Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1119702Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1119942Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1120231Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1120467Z E1204 10:57:56.440000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1120715Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1120951Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1121199Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1121419Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1121641Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1121878Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1122136Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1122379Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1122589Z E1204 10:57:56.440000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1122831Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1123077Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1123317Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1123582Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1123818Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1124053Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1124272Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1124494Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1124713Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1124961Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1125203Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1125448Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1125702Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1125946Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1126185Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1126401Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1126652Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1126901Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1127137Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1127384Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1127621Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1127868Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1128091Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1128307Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1128527Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1128773Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1129016Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1129260Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1129500Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1129751Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1129988Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1130293Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1130529Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1130776Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1131024Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1131259Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1131483Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1131690Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1131909Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1132141Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1132369Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1132601Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1132813Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1133025Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1133215Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1133363Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.1133526Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.1133650Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.1133794Z E1204 10:57:56.440000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.1133971Z [W1204 10:57:56.903579270 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1133973Z 2025-12-04T12:10:20.1134135Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.1134455Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1134787Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1134936Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1135452Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1135735Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1135984Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1136207Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1136422Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1136666Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1136914Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1137160Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1137395Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1137642Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1137880Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1138125Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1138364Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1138607Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1138846Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1139060Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1139297Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1139517Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1139771Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1140012Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1140263Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1140500Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1140742Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1140983Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1141190Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1141440Z E1204 
10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1141657Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1141862Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1142101Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1142313Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1142524Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1142763Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1143006Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1143246Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1143488Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1143740Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1143954Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1144180Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1144415Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1144672Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1144912Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1145152Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1145389Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1145634Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1145884Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1146131Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1146364Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1146610Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1146844Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1147092Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1147325Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1147569Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1147807Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1148050Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1148297Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1148539Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1148787Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1149029Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1149278Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1149524Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1149757Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1149981Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1150239Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1150511Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1150745Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1150991Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1151229Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1151474Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1151711Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1151954Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1152190Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1152437Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1152673Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1152931Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1153165Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1153395Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1153600Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1153851Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1154066Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1154291Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1154513Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1154755Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1155002Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1155218Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1155422Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1155660Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1155872Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1156082Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1156319Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1156563Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1156798Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1157043Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1157297Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1157510Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1157735Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1157961Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1158222Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1158462Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1158685Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1158901Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1159119Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1159385Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1159621Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1159867Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1160136Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1160385Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1160625Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1160867Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1161106Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1161352Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1161592Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1161820Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1162031Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1162269Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1162502Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1162731Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1162949Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1163196Z 
E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1163433Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1163680Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1163939Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1164183Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1164423Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1164669Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1164909Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1165155Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1165393Z 
E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1165614Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1165832Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1166045Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1166281Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1166503Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1166747Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1166999Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1167236Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1167453Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1167671Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1167916Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1168159Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1168414Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1168656Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1168902Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1169138Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1169386Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1169626Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1169875Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1170152Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1170400Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1170639Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1170902Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1171141Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1171398Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1171638Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1171898Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1172137Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1172383Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1172619Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1172838Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1173058Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1173299Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1173544Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1173780Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1174028Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1174265Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1174511Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1174749Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1174996Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1175235Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1175488Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1175729Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1175945Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1176187Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1176442Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1176682Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1176929Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1177167Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1177402Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1177632Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1177850Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1178066Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1178312Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1178550Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1178780Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1179004Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1179221Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1179439Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1179683Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1179932Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1180198Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1180427Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.1180637Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1180816Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1181057Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1181264Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1181508Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1181755Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1182010Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1182228Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1182438Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1182677Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1182891Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1183101Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1183341Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1183547Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1183788Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1183993Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1184248Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1184454Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1184693Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1184954Z E1204 10:57:56.442000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1185201Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1185451Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1185684Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1185929Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1186167Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1186413Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1186664Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1186907Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1187145Z E1204 10:57:56.442000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1187361Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1187575Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1187811Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1188041Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1188264Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1188478Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1188712Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1188956Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1189196Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1189451Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1189702Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1189914Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1190190Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1190437Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1190673Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1190921Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1191172Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1191401Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1191625Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1191840Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1192052Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1192260Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1192498Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1192746Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1192983Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1193243Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1193480Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1193687Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1193935Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1194196Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1194437Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1194680Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1194920Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1195125Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1195363Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1195620Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1195859Z E1204 10:57:56.442000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1196106Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1196341Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1196575Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1196793Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1197013Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1197229Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1197476Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1197729Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1197935Z E1204 10:57:56.442000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1198172Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1198426Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1198674Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1198919Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1199176Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1199406Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1199625Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1199844Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1200073Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1200359Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1200594Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1200932Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1201172Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1201417Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1201657Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1201866Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1202105Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1202374Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1202614Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1202859Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1203107Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1203362Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1203583Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1203799Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1204017Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1204266Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1204505Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1204765Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1205001Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1205245Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1205485Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1205732Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1205969Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1206215Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1206450Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1206669Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1206900Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1207113Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1207326Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1207568Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1207806Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1208022Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1208232Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1208441Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1208634Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1208778Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.1208951Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.1209076Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.1209217Z E1204 10:57:56.442000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.1209394Z [W1204 10:57:56.947228975 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1209396Z 2025-12-04T12:10:20.1209556Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.1209871Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1210218Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1210367Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1210868Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1211138Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1211396Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1211617Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1211840Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1212100Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1212355Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1212602Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1212836Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1213083Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1213318Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1213577Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1213818Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1214060Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1214299Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1214513Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1214741Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1214955Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1215201Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1215438Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1215642Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1215891Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1216133Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1216373Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1216589Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1216839Z E1204 
10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1217056Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1217261Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1217497Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1217710Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1217919Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1218171Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1218418Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1218655Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1218899Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1219137Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1219348Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1219576Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1219791Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1220037Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1220321Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1220566Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1220802Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1221056Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1221305Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1221554Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1221791Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1222037Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1222271Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1222534Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1222769Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1223014Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1223249Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1223492Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1223731Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1223974Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1224211Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1224455Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1224692Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1224946Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1225184Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1225403Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1225629Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1225884Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1226120Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1226366Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1226603Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1226848Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1229822Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1230065Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1230338Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1230581Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1230818Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1231061Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1231297Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1231510Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1231715Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1231949Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1232182Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1232407Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1232620Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1232882Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1233134Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1233348Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1233551Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1233786Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1233996Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1234215Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1234449Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1234694Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1234928Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1235170Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1235407Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1235620Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1235842Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1236059Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1236306Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1236552Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1236772Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1236985Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1237212Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1237475Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1237715Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1237960Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1238194Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1238438Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1238685Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1238929Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1239163Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1239409Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1239645Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1239861Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1240069Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1240344Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1240566Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1240780Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1241011Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1241256Z 
E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1241492Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1241750Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1241998Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1242241Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1242476Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1242718Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1242954Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1243213Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1243449Z 
E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1243667Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1243884Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1244093Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1244319Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1244534Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1244775Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1245012Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1245228Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1245458Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1245675Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1245917Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1246163Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1246413Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1246650Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1246891Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1247125Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1247368Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1247616Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1247861Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1248095Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1248338Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1248572Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1248816Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1249055Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1249296Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1249531Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1249775Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1250023Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1250310Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1250559Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1250774Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1250994Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1251229Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1251474Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1251712Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1251955Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1252207Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1252449Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1252784Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1253027Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1253262Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1253506Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1253742Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1253949Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1254183Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1254425Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1254677Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1254919Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1255174Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1255404Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1255631Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1255846Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1256064Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1256308Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1256542Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1256785Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1257004Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1257217Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1257434Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1257676Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1257913Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1258131Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1258346Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.1258552Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1258718Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1258965Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1259170Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1259406Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1259658Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1259906Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1260164Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1260368Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1260605Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1260817Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1261026Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1261277Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1261487Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1261722Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1261926Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1262164Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1262369Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1262605Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1262850Z E1204 10:57:56.486000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1263085Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1263344Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1263578Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1263820Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1264069Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1264327Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1264562Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1264804Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1265038Z E1204 10:57:56.486000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1265252Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1265459Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1265704Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1265931Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1266149Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1266366Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1266582Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1266826Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1267061Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1267304Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1267542Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1267747Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1267993Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1268244Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1268487Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1268745Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1268980Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1269208Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1269424Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1269642Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1269850Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1270066Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1270342Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1270583Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1270819Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1271061Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1271298Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1271503Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1271737Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1271983Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1272219Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1272481Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1272715Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1272936Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1273186Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1273428Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1273661Z E1204 10:57:56.486000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1273902Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1274140Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1274368Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1274601Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1274816Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1275031Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1275277Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1275511Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1275718Z E1204 10:57:56.486000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1275952Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1276197Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1276433Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1276686Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1276921Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1277147Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1277376Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1277602Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1277819Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1278062Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1278296Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1278543Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1278777Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1279035Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1279272Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1279476Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1279713Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1279956Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1280237Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1280482Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1280718Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1280946Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1281177Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1281391Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1281608Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1281869Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1282116Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1282363Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1282597Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1282838Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1283073Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1283314Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1283563Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1283805Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1284040Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1284254Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1284472Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1284679Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1284890Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1285119Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1285340Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1285554Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1285771Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1285979Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1286175Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1286319Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.1286491Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.1286610Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.1286755Z E1204 10:57:56.486000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.1286928Z [W1204 10:57:56.949309901 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1286930Z 2025-12-04T12:10:20.1287092Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.1287404Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1287713Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1287878Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1288374Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1288644Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1288888Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1289108Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1289323Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1289566Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1289803Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1290054Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1290323Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1290568Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1290817Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1291075Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1291312Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1291555Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1291788Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1292002Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1292238Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1292453Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1292698Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1292930Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1293135Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1293370Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1293613Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1293847Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1294052Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1294287Z E1204 
10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1294512Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1294717Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1294949Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1295171Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1295385Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1295618Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1295860Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1296092Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1296337Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1296570Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1296795Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1297018Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1297232Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1297476Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1297709Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1297950Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1298182Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1298425Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1298660Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1298912Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1299146Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1299386Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1299628Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1299879Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1300152Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1300393Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1300628Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1300873Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1301122Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1301366Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1301598Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1301839Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1302071Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1302313Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1302547Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1302767Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1302978Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1303221Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1303467Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1303708Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1303939Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1304191Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1304441Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1304684Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1304918Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1305159Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1305392Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1305646Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1305881Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1306094Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1306299Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1306534Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1306746Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1306974Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1307189Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1307433Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1307667Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1307892Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1308097Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1308329Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1308551Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1308763Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1309001Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1309241Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1309475Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1309718Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1309950Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1310224Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1310447Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1310662Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1310909Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1311148Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1311369Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1311585Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1311804Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1312046Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1312296Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1312539Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1312777Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1313034Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1313282Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1313527Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1313761Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1314005Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1314240Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1314467Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1314674Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1314908Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1315130Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1315343Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1315560Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1315804Z 
E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1316042Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1316288Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1316523Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1316779Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1317014Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1317261Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1317504Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1317760Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1317999Z 
E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1318217Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1318434Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1318642Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1318870Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1319098Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1319343Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1319579Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1319795Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1320011Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1320259Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1320503Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1320737Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1320981Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1321232Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1321475Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1321711Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1321972Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1322222Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1322466Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1322701Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1322944Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1323182Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1323446Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1323684Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1323926Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1324165Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1324406Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1324645Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1324886Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1325123Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1325338Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1325545Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1325799Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1326041Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1326276Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1326529Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1326774Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1327020Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1327256Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1327499Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1327732Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1327987Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1328220Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1328425Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1328660Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1328902Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1329138Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1329383Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1329617Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1329845Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1330063Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1330332Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1330549Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1330805Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1331044Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1331287Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1331505Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1331717Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1331931Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1332175Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1332426Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1332642Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1332856Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.1333063Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1333230Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1333468Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1333676Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1333912Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1334154Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1334390Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1334615Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1334821Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1335055Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1335279Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1335495Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1335731Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1335938Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1336172Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1336377Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1336611Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1336830Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1337065Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1337308Z E1204 10:57:56.488000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1337545Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1337789Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1338025Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1338266Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1338502Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1338744Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1338994Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1339236Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1339471Z E1204 10:57:56.488000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1339696Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1339910Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1340184Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1340412Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1340628Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1340842Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1341057Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1341313Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1341549Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1341789Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1342024Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1342228Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1342464Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1342705Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1342940Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1343183Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1343418Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1343660Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1343877Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1344103Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1344309Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1344529Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1344765Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1345006Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1345243Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1345488Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1345737Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1345940Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1346175Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1346418Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1346651Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1346897Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1347130Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1347336Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1347572Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1347816Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1348063Z E1204 10:57:56.488000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1348305Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1348560Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1348799Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1349019Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1349232Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1349449Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1349697Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1349934Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1350183Z E1204 10:57:56.488000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1350417Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1350663Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1350899Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1351146Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1351383Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1351613Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1351835Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1352049Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1352267Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1352524Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1352760Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1353017Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1353252Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1353512Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1353748Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1353954Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1354188Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1354431Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1354687Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1354929Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1355165Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1355393Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1355614Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1355837Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1356051Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1356301Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1356536Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1356782Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1357025Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1357269Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1357517Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1357772Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1358011Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1358251Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1358487Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1358700Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1358919Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1359138Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1359349Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1359580Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1359803Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1360018Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1360263Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1360472Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1360659Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1360801Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.1360963Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.1361083Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.1361242Z E1204 10:57:56.488000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.1361418Z [W1204 10:57:56.951351049 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1361420Z 2025-12-04T12:10:20.1361583Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.1361906Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1362227Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1362375Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1362866Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1363135Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1363375Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1363611Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1363830Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1364074Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1364311Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1364554Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1364792Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1365034Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1365271Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1365514Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1365759Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1366005Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1366237Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1366463Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1366698Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1366915Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1367157Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1367388Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1367593Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1367825Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1368079Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1368311Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1368516Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1368750Z E1204 
10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1368965Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1369171Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1369404Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1369619Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1369820Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1370060Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1370375Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1370609Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1370865Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1371118Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1371333Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1371555Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1371771Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1372020Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1372255Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1372513Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1372745Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1372988Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1373220Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1373465Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1373701Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1373940Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1374176Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1374416Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1374662Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1374902Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1375138Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1375389Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1375631Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1375874Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1376106Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1376350Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1376581Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1376837Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1377072Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1377288Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1377500Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1377740Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1377977Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1378218Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1378453Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1378693Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1378926Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1379177Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1379410Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1379651Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1379894Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1380184Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1380423Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1380635Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1380840Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1381072Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1381302Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1381527Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1381741Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1381985Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1382218Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1382433Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1382635Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1382869Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1383082Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1383285Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1383537Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1383779Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1384011Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1384267Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1384521Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1384736Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1384959Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1385173Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1385419Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1385655Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1385884Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1386101Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1386318Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1386565Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1386804Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1387048Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1387284Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1387528Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1387764Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1388021Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1388255Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1388499Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1388752Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1388977Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1389183Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1389419Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1389637Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1389850Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1390069Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1390360Z 
E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1390597Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1390839Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1391074Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1391318Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1391551Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1391793Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1392029Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1392271Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1392522Z 
E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1392743Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1392957Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1393177Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1393417Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1393633Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1393878Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1394113Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1394333Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1394551Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1394783Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1395025Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1395260Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1395502Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1395737Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1395982Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1396218Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1396460Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1396697Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1396952Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1397191Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1397432Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1397677Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1397929Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1398164Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1398405Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1398638Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1398883Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1399131Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1399373Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1399609Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1399822Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1400028Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1400302Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1400545Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1400781Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1401026Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1401262Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1401516Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1401751Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1401991Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1402240Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1402495Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1402730Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1402937Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1403171Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1403414Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1403664Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1403907Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1404141Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1404370Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1404590Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1404804Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1405022Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1405264Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1405500Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1405730Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1405964Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1406178Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1406392Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1406646Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1406890Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1407111Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1407324Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.1407528Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1407692Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1407929Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1408147Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1408381Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1408623Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1408860Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1409078Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1409285Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1409519Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1409732Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1409936Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1410202Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1410422Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1410657Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1410876Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1411114Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1411330Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1411568Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1411812Z E1204 10:57:56.490000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1412050Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1412292Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1412545Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1412786Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1413022Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1413266Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1413503Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1413748Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1413982Z E1204 10:57:56.490000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1414198Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1414403Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1414640Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1414877Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1415098Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1415322Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1415537Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1415792Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1416027Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1416271Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1416506Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1416712Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1416959Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1417201Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1417436Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1417677Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1417912Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1418141Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1418358Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1418575Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1418781Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1418986Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1419236Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1419480Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1419725Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1419970Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1420265Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1420470Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1420704Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1420946Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1421184Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1421454Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1421689Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1421896Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1422131Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1422376Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1422610Z E1204 10:57:56.490000 
680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1422855Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1423091Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1423322Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1423541Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1423767Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1423983Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1424241Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1424479Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1424700Z E1204 10:57:56.490000 680710 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1424936Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1425180Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1425421Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1425663Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1425911Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1426140Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1426360Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1426574Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1426790Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1427035Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1427271Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1427515Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1427750Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1427996Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1428240Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1428446Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1428692Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1428937Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1429182Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1429428Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1429665Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1429893Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1430141Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1430371Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1430586Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1430828Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1431064Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1431309Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1431547Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1431792Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1432027Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1432270Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1432505Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1432760Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1432996Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1433221Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1433455Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1433662Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1433873Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1434102Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1434326Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1434540Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1434758Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1434968Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1435154Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1435298Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.1435459Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.1435579Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.1435721Z E1204 10:57:56.490000 680710 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.1435783Z FAILED [1.6303s] [100%] 2025-12-04T12:10:20.1435786Z 2025-12-04T12:10:20.1435861Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.1436024Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.1436090Z Traceback (most recent call last): 2025-12-04T12:10:20.1436276Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.1436338Z method(*args, **kwargs) 2025-12-04T12:10:20.1436507Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.1436569Z method(*args, **kwargs) 2025-12-04T12:10:20.1436736Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.1436794Z with policy(): 2025-12-04T12:10:20.1436970Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.1437030Z raise RuntimeError(msg) 2025-12-04T12:10:20.1437448Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1973420032. 
2025-12-04T12:10:20.1437450Z 2025-12-04T12:10:20.1437559Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.1437834Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.1437856Z 2025-12-04T12:10:20.1437962Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.1438060Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.1438122Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.1438199Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.1438769Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.1438888Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.1438956Z graph_break [] 2025-12-04T12:10:20.1439042Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.1439133Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.1439637Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.1439705Z current_size = base.storage().size() 2025-12-04T12:10:20.1439764Z Autotune Choices Stats: 2025-12-04T12:10:20.1440193Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009398999623954296, "best_triton_pos": 0} 2025-12-04T12:10:20.1440281Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.1440354Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.1440492Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.1440746Z triton_mm_29 0.0094 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1440992Z triton_mm_33 0.0107 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1441254Z triton_mm_34 0.0108 ms 87.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1441495Z triton_mm_21 0.0110 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1441732Z triton_mm_22 0.0115 ms 81.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1441984Z triton_mm_30 0.0118 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1442241Z triton_mm_16 0.0122 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1442482Z triton_mm_23 0.0123 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1442722Z triton_mm_25 0.0127 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1442965Z triton_mm_31 0.0135 ms 69.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1443130Z SingleProcess AUTOTUNE benchmarking takes 0.1600 seconds and 8.3092 seconds precompiling for 33 choices 2025-12-04T12:10:20.1443290Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.1443356Z Traceback (most recent call last): 2025-12-04T12:10:20.1443531Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.1443590Z method(*args, **kwargs) 
2025-12-04T12:10:20.1443759Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.1443818Z method(*args, **kwargs) 2025-12-04T12:10:20.1443984Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.1444043Z with policy(): 2025-12-04T12:10:20.1444209Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.1444270Z raise RuntimeError(msg) 2025-12-04T12:10:20.1444676Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1973420032 and is now 2940207104. 2025-12-04T12:10:20.1444681Z 2025-12-04T12:10:20.1444774Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.1445050Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.1445052Z 2025-12-04T12:10:20.1445157Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.1445251Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.1445313Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.1445389Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.1445979Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.1446107Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.1446163Z graph_break [] 2025-12-04T12:10:20.1446257Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.1446348Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.1446849Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.1446915Z current_size = base.storage().size() 2025-12-04T12:10:20.1446975Z Autotune Choices Stats: 2025-12-04T12:10:20.1447360Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009398999623954296, "best_triton_pos": 0} 2025-12-04T12:10:20.1447444Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.1447525Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.1447663Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.1447910Z triton_mm_29 0.0094 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:20.1448155Z triton_mm_33 0.0107 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1448394Z triton_mm_34 0.0108 ms 87.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1448635Z triton_mm_21 0.0110 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1448872Z triton_mm_22 0.0115 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1449110Z triton_mm_30 0.0118 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1449345Z triton_mm_16 0.0122 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1449598Z triton_mm_23 0.0123 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1449836Z triton_mm_25 0.0127 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1450076Z triton_mm_31 0.0135 ms 69.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1450280Z SingleProcess AUTOTUNE benchmarking takes 0.1600 seconds and 8.3092 seconds precompiling for 33 choices 2025-12-04T12:10:20.1450385Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.1450448Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.1450524Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.1450642Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.1451141Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.1451197Z graph_break [] 2025-12-04T12:10:20.1451280Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.1451372Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.1451759Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.1451888Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.1451949Z Autotune Choices Stats: 2025-12-04T12:10:20.1452328Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_67", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009720000438392162, "best_triton_pos": 0} 2025-12-04T12:10:20.1452415Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.1452483Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.1452626Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.1452872Z triton_mm_67 0.0097 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1452935Z _scaled_mm 0.0105 ms 92.7% 2025-12-04T12:10:20.1453177Z triton_mm_72 0.0107 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1453415Z triton_mm_59 0.0110 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1453656Z triton_mm_71 0.0110 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1453907Z triton_mm_68 0.0114 ms 85.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1454149Z triton_mm_60 0.0116 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1454408Z triton_mm_54 0.0118 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1454658Z triton_mm_61 0.0122 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1454899Z triton_mm_63 0.0126 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1455044Z SingleProcess AUTOTUNE benchmarking takes 0.2387 seconds and 0.8652 seconds precompiling for 39 choices 2025-12-04T12:10:20.1455117Z =================================== FAILURES =================================== 2025-12-04T12:10:20.1455276Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.1455343Z Traceback (most recent call last): 2025-12-04T12:10:20.1455518Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.1455578Z method(*args, **kwargs) 2025-12-04T12:10:20.1455758Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.1455816Z method(*args, **kwargs) 2025-12-04T12:10:20.1455984Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.1456045Z with policy(): 2025-12-04T12:10:20.1456212Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.1456272Z raise RuntimeError(msg) 2025-12-04T12:10:20.1456679Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2940207104 and is now 3906994176. 2025-12-04T12:10:20.1456682Z 2025-12-04T12:10:20.1456774Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.1457048Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.1457050Z 2025-12-04T12:10:20.1457153Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.1457244Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.1457305Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.1457380Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.1457945Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), 
('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.1458074Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.1458130Z graph_break [] 2025-12-04T12:10:20.1458210Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.1458299Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.1458806Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.1458883Z current_size = base.storage().size() 2025-12-04T12:10:20.1458942Z Autotune Choices Stats: 2025-12-04T12:10:20.1459322Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009398999623954296, "best_triton_pos": 0} 2025-12-04T12:10:20.1459406Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.1459475Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.1459610Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.1459860Z triton_mm_29 0.0094 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:20.1460143Z triton_mm_33 0.0107 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1460400Z triton_mm_34 0.0108 ms 87.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1460641Z triton_mm_21 0.0110 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1460881Z triton_mm_22 0.0115 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1461120Z triton_mm_30 0.0118 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1461363Z triton_mm_16 0.0122 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1461600Z triton_mm_23 0.0123 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1461839Z triton_mm_25 0.0127 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1462075Z triton_mm_31 0.0135 ms 69.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1462234Z SingleProcess AUTOTUNE benchmarking takes 0.1600 seconds and 8.3092 seconds precompiling for 33 choices 2025-12-04T12:10:20.1462325Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.1462386Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.1462459Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.1462578Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.1463089Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.1463156Z graph_break [] 2025-12-04T12:10:20.1463238Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.1463329Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.1463708Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.1463819Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.1463880Z Autotune Choices Stats: 2025-12-04T12:10:20.1464254Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_67", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009720000438392162, "best_triton_pos": 0} 2025-12-04T12:10:20.1464352Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.1464419Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.1464555Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.1464801Z triton_mm_67 0.0097 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1464864Z _scaled_mm 0.0105 ms 92.7% 2025-12-04T12:10:20.1465108Z triton_mm_72 0.0107 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1465348Z triton_mm_59 0.0110 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1465591Z triton_mm_71 0.0110 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1465828Z triton_mm_68 0.0114 ms 85.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1466065Z triton_mm_60 0.0116 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1466318Z triton_mm_54 0.0118 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1466559Z triton_mm_61 0.0122 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1466809Z triton_mm_63 0.0126 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1466954Z SingleProcess AUTOTUNE benchmarking takes 0.2387 seconds and 0.8652 seconds precompiling for 39 choices 2025-12-04T12:10:20.1467055Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.1467114Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.1467189Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.1467305Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.1467801Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 
1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.1467857Z graph_break [] 2025-12-04T12:10:20.1467937Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.1468030Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.1468099Z Autotune Choices Stats: 2025-12-04T12:10:20.1468474Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_105", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.01023900043219328, "best_triton_pos": 0} 2025-12-04T12:10:20.1468555Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.1468622Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.1468759Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.1469009Z triton_mm_105 0.0102 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1469068Z _scaled_mm 0.0104 ms 98.8% 2025-12-04T12:10:20.1469315Z triton_mm_110 0.0108 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1469558Z triton_mm_109 0.0110 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:20.1469795Z triton_mm_97 0.0114 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1470036Z triton_mm_106 0.0116 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1470333Z triton_mm_92 0.0120 ms 85.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1470572Z triton_mm_98 0.0123 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1470811Z triton_mm_101 0.0131 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.1471068Z triton_mm_107 0.0132 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.1471227Z SingleProcess AUTOTUNE benchmarking takes 0.2587 seconds and 0.7366 seconds precompiling for 39 choices 2025-12-04T12:10:20.1471434Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a5aa61b86ddf0f00.xml - 2025-12-04T12:10:20.1471515Z =========================== short test summary info ============================ 2025-12-04T12:10:20.1472118Z FAILED [1.6303s] 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2940207104 and is now 3906994176. 2025-12-04T12:10:20.1472122Z 2025-12-04T12:10:20.1472212Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.1472500Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.1472502Z 2025-12-04T12:10:20.1472606Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.1472687Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.1472773Z ================= 1 failed, 187 deselected, 2 rerun in 14.22s ================== 2025-12-04T12:10:20.1472828Z Got exit code 1 2025-12-04T12:10:20.1472885Z Retrying single test... 
2025-12-04T12:10:20.1473046Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7d56d798f2d3fed1.xml 2025-12-04T12:10:20.1473123Z ============================= test session starts ============================== 2025-12-04T12:10:20.1473255Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.1473315Z cachedir: .pytest_cache 2025-12-04T12:10:20.1473492Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.1473556Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.1473616Z configfile: pytest.ini 2025-12-04T12:10:20.1473797Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.1473890Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.1474157Z stepcurrent: skipping 90 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.1474217Z Running 1 items in this shard 2025-12-04T12:10:20.1474219Z 2025-12-04T12:10:20.1474574Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:58:06.203351572 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1474578Z 2025-12-04T12:10:20.1474910Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1475230Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1475380Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1475893Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1476164Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1476405Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1476631Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1476859Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1477107Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1477344Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1477585Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1477820Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1478063Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1478299Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1478540Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1478775Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1479022Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1479264Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1479507Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1479750Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1480002Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1480274Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1480484Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1480720Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1480962Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1481196Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1481417Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1481653Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1481893Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1482130Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1482375Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1482610Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1482829Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1483058Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1483234Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1483434Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1483994Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpzb8ar5md/ur/curozslustiehcdynpneeg4lh67lyjd63uxzx3kcgevewo5tqnd5.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.1484159Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.1484403Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.1484589Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.1484894Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.1485046Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.1485323Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.1485479Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.1485752Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.1485943Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.1486226Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.1486376Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.1486670Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.1486880Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.1487212Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1487520Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1487667Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1488173Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1488445Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1488688Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1488926Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1489162Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1489410Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1489644Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1489890Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1490154Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1490412Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1490649Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1490891Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1491128Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1491370Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1491608Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1491853Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1492086Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1492328Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1492560Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1492781Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1493014Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1493257Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1493504Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1493723Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1493957Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1494198Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1494435Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1494675Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1494922Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1495142Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1495368Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1495546Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1495740Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1495861Z E1204 10:58:14.237000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.1496034Z [W1204 10:58:14.754606321 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.1496036Z 2025-12-04T12:10:20.1496360Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.1496665Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1496810Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1497311Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1497579Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1497830Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1498062Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1498277Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1498518Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1498750Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1498993Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1499225Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1499478Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1499712Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1499953Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1500227Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1500469Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1500702Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1500942Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1501176Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1501419Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1501669Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1501877Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1502109Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1502363Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1502618Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1502825Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1503057Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1503297Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1503531Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1503787Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1504021Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1504239Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1504463Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1504637Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1504831Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1505370Z E1204 10:58:14.293000 686634 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpzb8ar5md/7a/c7a4bqqfq46giblif7grrpmrzw6ufaznee2vsz2mivxhs2gj7ijb.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.1505530Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.1505759Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.1505931Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.1506242Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.1506391Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.1506660Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.1506825Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.1507103Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.1507282Z E1204 10:58:14.293000 686634 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.1507563Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.1507711Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.1508003Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.1508211Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.1508554Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1508859Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1509006Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1509497Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1509764Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1510005Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1510255Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1510474Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1510732Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1510967Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1511223Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1511456Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1511710Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1511943Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1512187Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1512421Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1512661Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1512911Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1513151Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1513383Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1513625Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1513859Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1514065Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1514298Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1514539Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1514771Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1514978Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1515220Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1515463Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1515707Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1515947Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1516192Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1516408Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1516637Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1516812Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1517008Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1517137Z E1204 10:58:14.293000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.1517310Z [W1204 10:58:14.761180198 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.1517313Z 2025-12-04T12:10:20.1517482Z [W1204 10:58:14.762579402 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1517484Z 2025-12-04T12:10:20.1517807Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.1518116Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1518262Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1518752Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1519019Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1519258Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1519488Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1519703Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1519945Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1520238Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1520491Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1520725Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1520965Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1521200Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1521439Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1521688Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1521929Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1524450Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1524697Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1524932Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1525175Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1525406Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1525612Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1525848Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1526089Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1526351Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1526556Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1526788Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1527040Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1527288Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1527534Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1527766Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1527984Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1528208Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1528395Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1528590Z E1204 
10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1529125Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpzb8ar5md/54/c54sbtaegy3l6t6h5uini7glder6nktyfzkn44eaouf5wsp3jzbm.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.1529288Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.1529520Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.1529691Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.1529993Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.1530177Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.1530451Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.1530606Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.1530890Z E1204 10:58:14.302000 686634 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.1531062Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.1531346Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.1531508Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.1531809Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.1532017Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.1532345Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1532653Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1532799Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1533309Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1533576Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1533815Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1534037Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1534253Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1534496Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1534729Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1534975Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1535208Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1535458Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1535691Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1535943Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1536186Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1536429Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1536662Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1536903Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1537137Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1537380Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1537625Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1537828Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1538059Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1538300Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1538534Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1538739Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1538971Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1539213Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1539446Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1539707Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1539940Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1540188Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1540428Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1540615Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1540808Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1540927Z E1204 10:58:14.302000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.1541097Z [W1204 10:58:14.766555487 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.1541099Z 2025-12-04T12:10:20.1541424Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.1541729Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1541890Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1542381Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1542647Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1542888Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1543109Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1543322Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1543562Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1543795Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1544038Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1544288Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1544529Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1544772Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1545014Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1545257Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1545498Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1545730Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1545969Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1546204Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1546459Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1546693Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1546896Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1547131Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1547372Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1547605Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1547809Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1548041Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1548281Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1548515Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1548765Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1549001Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1549226Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1549452Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1549637Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1549828Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1550420Z E1204 10:58:14.305000 686634 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpzb8ar5md/oe/coeqhkqsrdi6lh2mzkuztx2fll5fyimy4fn2nbblhkt7k3jpoehl.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.1550580Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.1550826Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.1550997Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.1551298Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.1551444Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.1551714Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.1551869Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.1552136Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.1552307Z E1204 10:58:14.305000 686634 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.1552589Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.1552738Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.1553028Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.1553248Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.1553576Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1553892Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1554051Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1554543Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1554809Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1555048Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1555269Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1555504Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1555748Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1555982Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1556224Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1556458Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1556702Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1556936Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1557178Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1557412Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1557667Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1557900Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1558139Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1558382Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1558632Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1558867Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1559072Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1559305Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1559548Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1559791Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1559995Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1560259Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1560500Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1560732Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1560975Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1561208Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1561426Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1561652Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1561825Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1562033Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1562152Z E1204 10:58:14.305000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.1562474Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.1562793Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1562953Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1563441Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1563708Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1563947Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1564166Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1564394Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1564634Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1564866Z E1204 10:58:14.306000 686634 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1565109Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1565346Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1565587Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1565819Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1566059Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1566292Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1566542Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1566776Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1567016Z E1204 10:58:14.306000 686634 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1567258Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1567507Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1567745Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1567949Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1568179Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1568420Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1568664Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1568868Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1569099Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1569339Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1569572Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1569816Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1570049Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1570296Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1570520Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1570694Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1570887Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1571435Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpzb8ar5md/7i/c7ignet3sdjwpdnvdorijykmzjy73n6i4vn6qphywltpwti6vsgr.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.1571611Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.1571843Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.1572034Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.1572333Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.1572480Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.1572751Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.1572903Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.1573170Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.1573355Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.1573636Z E1204 10:58:14.306000 686634 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.1573784Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.1574074Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.1574281Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.1574610Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1574915Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1575059Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1575559Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1575827Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1576078Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1576297Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1576524Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1576766Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1576999Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1577241Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1577472Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1577724Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1577959Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1578199Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1578432Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1578673Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1578907Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1579147Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1579379Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1579621Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1579853Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1580068Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1580333Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1580587Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1580824Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1581042Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1581275Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1581515Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1581747Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1581988Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1582237Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1582451Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1582676Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1582851Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1583045Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1583163Z E1204 10:58:14.306000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.1583334Z [W1204 10:58:14.768639464 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.1583337Z 2025-12-04T12:10:20.1583659Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.1583964Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1584113Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1584612Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1584877Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1585126Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1585355Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1585571Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1585812Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1586046Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1586287Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1586529Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1586770Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1587002Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1587243Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1587476Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1587717Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1587949Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1588190Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1588422Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1588662Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1588905Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1589109Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1589356Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1589597Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1589839Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1590043Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1590315Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1590556Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1590788Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1591042Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1591274Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1591489Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1591713Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1591885Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1592080Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1592620Z E1204 10:58:14.307000 686634 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpzb8ar5md/2z/c2z6iwqtseqjyipde5jatpr6uwitqmrj73v25gztvms2h4mwdnfs.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.1592780Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.1593009Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.1593178Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.1593490Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.1593637Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.1593921Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.1594075Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.1594357Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.1594528Z E1204 10:58:14.307000 
686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.1594808Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.1594956Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.1595243Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.1595461Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.1595789Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1596092Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1596238Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1596727Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1596994Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1597233Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1597452Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1597669Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1597919Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1598155Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1598409Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1598653Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1598895Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1599126Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1599367Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1599599Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1599840Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1600083Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1600357Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1600591Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1600831Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1601065Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1601268Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1601500Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1601740Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1601971Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1602188Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1602419Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1602663Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1602908Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1603163Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1603400Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1603616Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.1603840Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.1604014Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.1604210Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.1604340Z E1204 10:58:14.307000 686634 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.1604412Z ('RERUN', {'yellow': True}) [10.9473s] [100%] 2025-12-04T12:10:20.1604759Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:58:16.710364998 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1604764Z 2025-12-04T12:10:20.1604925Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.1605234Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.1605541Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1605687Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1606174Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1606442Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1606702Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1606921Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1607140Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1607390Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1607638Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1607880Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1608113Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1608356Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1608587Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1608844Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1609075Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1609317Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1609550Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1609762Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1609987Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1610236Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1610477Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1610712Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1610919Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1611168Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1611410Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1611643Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1611859Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1612105Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1612317Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1612521Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1612756Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1612967Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1613170Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1613416Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1613658Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1613891Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1614132Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1614367Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1614579Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1614804Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1615020Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1615263Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1615507Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1615750Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1615985Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1616236Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1616481Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1616722Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1616958Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1617198Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1617433Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1617688Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1617920Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1618162Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1618395Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1618635Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1618870Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1619113Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1619346Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1619586Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1619819Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1620070Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1620350Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1620565Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1620789Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1621046Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1621279Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1621519Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1621752Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1621993Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1622249Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1622491Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1622725Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1622965Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1623204Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1623444Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1623677Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1623887Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1624091Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1624324Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1624549Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1624771Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1624985Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1625240Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1625482Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1625696Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1625900Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1626131Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1626342Z E1204 10:58:16.264000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1626554Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1626787Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1627027Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1627262Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1627504Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1627736Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1627948Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1628169Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1628384Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1628629Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1628874Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1629093Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1629306Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1629532Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1629785Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1630027Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1630306Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1630541Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1630785Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1631022Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1631280Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1631513Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1631760Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1631996Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1632211Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1632416Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1632650Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1632867Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1633081Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1633313Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1633555Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1633789Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1634050Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1634297Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1634543Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1634776Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1635020Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1635257Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1635510Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1635747Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1635963Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1636178Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1636383Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1636611Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1636828Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:20.1637071Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1637308Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1637526Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1637758Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1637972Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1638216Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1638460Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1638717Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1638955Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1639197Z E1204 10:58:16.264000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1639433Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1639674Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1639921Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1640214Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1640447Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1640692Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1640925Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1641173Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1641407Z E1204 10:58:16.264000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1641649Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1641884Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1642126Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1642379Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1642620Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1642869Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1643083Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1643305Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1643545Z E1204 10:58:16.264000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1643786Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1644022Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1644263Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1644513Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1644758Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1644992Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1645239Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1645475Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1645720Z E1204 10:58:16.264000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1645954Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1646160Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1646395Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1646637Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1646885Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1647128Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1647377Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1647606Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1647838Z E1204 10:58:16.264000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1648053Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1648268Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1648512Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1648746Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1648986Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1649201Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1649415Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1649632Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1649875Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1650151Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1650366Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1650581Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1650787Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1650952Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1651204Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1651410Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1651647Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1651902Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1652151Z E1204 10:58:16.264000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1652365Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1652570Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1652809Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1653022Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1653231Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1653481Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1653687Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1653921Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1654126Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1654363Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1654568Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1654803Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1655046Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1655280Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1655524Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1655768Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1656010Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1656260Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1656512Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1656748Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1656990Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1657224Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1657441Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1657649Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1657894Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1658123Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1658340Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:20.1658554Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1658769Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1659014Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1659249Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1659492Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1659731Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1659935Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1660212Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1660454Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1660701Z E1204 10:58:16.264000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1660959Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1661193Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1661421Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1661639Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1661854Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1662059Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1662279Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1662514Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1662755Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1662990Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1663234Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1663471Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1663677Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1663913Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1664156Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1664390Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1664643Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1664879Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1665095Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1665329Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1665589Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1665825Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1666067Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1666304Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1666531Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1666763Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1666976Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1667190Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1667434Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1667671Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1667878Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1668111Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1668356Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1668590Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1668833Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1669076Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1669302Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1669530Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1669758Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1669976Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1670250Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1670484Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1670727Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1670961Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1671218Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1671451Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1671655Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1671892Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1672133Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1672369Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1672609Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1672844Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1673069Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1673310Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1673523Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1673738Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1673995Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1674241Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1674485Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1674719Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1674962Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1675196Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1675438Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1675684Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1675927Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1676161Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1676373Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1676592Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1676798Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1677009Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1677239Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1677460Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1677673Z E1204 
10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1677891Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1678099Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1678288Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1678439Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.1678609Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.1678729Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.1678876Z E1204 10:58:16.264000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.1679049Z [W1204 10:58:16.731253375 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1679051Z 2025-12-04T12:10:20.1679210Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.1679521Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1679832Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1679991Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1680511Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1680780Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1681021Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1681244Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1681460Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1681701Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1681939Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1682198Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1682433Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1682674Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1682921Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1683176Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1683411Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1683654Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1683888Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1684103Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1684338Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1684557Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1684799Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1685032Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1685237Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1685472Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1685716Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1685950Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1686157Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1686393Z E1204 
10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1686616Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1686820Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1687051Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1687273Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1687484Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1687722Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1687965Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1688196Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1688440Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1688672Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1688905Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1689125Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1689342Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1689584Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1689816Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1690062Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1690333Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1690576Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1690810Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1691070Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1691303Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1691544Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1691792Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1692049Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1692285Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1692526Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1692757Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1693000Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1693247Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1693487Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1693718Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1693962Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1694197Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1694439Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1694672Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1694887Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1695099Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1695341Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1695589Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1695830Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1696064Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1696319Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1696561Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1696805Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1697036Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1697279Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1697511Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1697765Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1697999Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1698213Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1698419Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1698651Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1698864Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1699086Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1699299Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1699542Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1699774Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1699996Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1700322Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1700555Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1700780Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1700994Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1701229Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1701469Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1701702Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1701944Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1702177Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1702406Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1702628Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1702841Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1703085Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1703322Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1703543Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1703757Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1703974Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1704217Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1704469Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1704710Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1704945Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1705195Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1705447Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1705690Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1705923Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1706165Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1706401Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1706620Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1706837Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1707071Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1707290Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1707503Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1707720Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1707962Z 
E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1708197Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1708440Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1708679Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1708933Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1709166Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1709408Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1709655Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1709910Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1710189Z 
E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1710406Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1710621Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1710829Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1711056Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1711285Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1711528Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1711764Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1711981Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1712196Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1712411Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1712655Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1712890Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1713133Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1713381Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1713623Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1713859Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1714114Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1714360Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1714603Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1714839Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1715081Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1715316Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1715570Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1715805Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1716046Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1716280Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1716522Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1716761Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1717002Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1717236Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1717450Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1717659Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1717903Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1718146Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1718381Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1718632Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1718879Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1719120Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1719354Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1719597Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1719834Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1720134Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1720371Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1720579Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1720814Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1721061Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1721298Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1721542Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1721776Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1722005Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1722223Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1722453Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1722672Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1722938Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1723172Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1723417Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1723636Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1723849Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1724064Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1724309Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1724561Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1724777Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1724994Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.1725201Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1725367Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1725605Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1725814Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1726048Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1726294Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1726528Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1726754Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1726963Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1727196Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1727420Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1727635Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1727874Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1728079Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1728315Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1728520Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1728754Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1728977Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1729211Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1729456Z E1204 10:58:16.270000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1729692Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1729934Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1730212Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1730453Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1730690Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1730936Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1731172Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1731436Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1731670Z E1204 10:58:16.270000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1731895Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1732113Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1732356Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1732585Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1732800Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1733014Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1733231Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1733488Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1733721Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1733964Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1734199Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1734404Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1734642Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1734884Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1735122Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1735363Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1735601Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1735840Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1736057Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1736282Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1736488Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1736704Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1736939Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1737185Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1737422Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1737665Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1737925Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1738128Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1738363Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1738606Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1738843Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1739089Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1739324Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1739531Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1739765Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1740008Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1740291Z E1204 10:58:16.270000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1740534Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1740780Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1741007Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1741242Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1741455Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1741670Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1741913Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1742151Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1742375Z E1204 10:58:16.270000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1742608Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1742852Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1743086Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1743330Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1743566Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1743799Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1744017Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1744231Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1744449Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1744702Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1744937Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1745190Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1745426Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1745684Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1745918Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1746124Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1746359Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1746603Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1746851Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1747094Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1747329Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1747557Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1747777Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1747992Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1748208Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1748452Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1748685Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1748929Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1749171Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1749417Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1749660Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1749914Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1750191Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1750431Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1750665Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1750877Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1751096Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1751320Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1751533Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1751762Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1751983Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1752197Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1752406Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1752615Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1752805Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1752948Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.1753110Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.1753235Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.1753389Z E1204 10:58:16.270000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.1753562Z [W1204 10:58:16.734918123 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1753564Z 2025-12-04T12:10:20.1753726Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.1754044Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1754364Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1754511Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1755004Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1755275Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1755515Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1755752Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1755968Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1756210Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1756447Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1756690Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1756926Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1757167Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1757403Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1757644Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1757889Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1758132Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1758364Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1758586Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1758818Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1759035Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1759275Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1759512Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1759719Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1759952Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1760240Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1760472Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1760677Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1760908Z E1204 
10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1761123Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1761327Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1761561Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1761775Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1761976Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1762214Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1762468Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1762703Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1762956Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1763202Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1763418Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1763641Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1763856Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1764097Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1764330Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1764589Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1764820Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1765061Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1765293Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1765538Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1765773Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1766014Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1766249Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1766490Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1766734Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1766974Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1767208Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1767458Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1767704Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1767947Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1768178Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1768419Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1768651Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1768904Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1769139Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1769355Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1769568Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1769810Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1770047Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1770323Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1770557Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1770801Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1771033Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1771289Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1771523Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1771765Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1772010Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1772271Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1772505Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1772717Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1772922Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1773153Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1773380Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1773601Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1773818Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1774063Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1774295Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1774508Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1774711Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1774945Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1775155Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1775360Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1775597Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1775850Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1776085Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1776334Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1776584Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1776797Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1777021Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1777236Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1777482Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1777718Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1777946Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1778161Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1778374Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1778619Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1778857Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1779101Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1779336Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1779578Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1779814Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1780068Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1780331Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1780573Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1780818Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1781047Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1781253Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1781491Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1781707Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1781922Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1782139Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1782396Z 
E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1782632Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1782873Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1783110Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1783352Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1783588Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1783829Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1784065Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1784305Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1784555Z 
E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1784776Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1784988Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1785209Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1785443Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1785659Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1785901Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1786136Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1786360Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1786574Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1786802Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1787045Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1787279Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1787521Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1787755Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1787999Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1788233Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1788478Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1788713Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1788964Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1789201Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1789442Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1789697Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1789948Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1790232Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1790475Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1790709Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1790952Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1791200Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1791450Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1791686Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1791900Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1792107Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1792345Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1792586Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1792820Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1793063Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1793300Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1793554Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1793791Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1794032Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1794283Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1794536Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1794886Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1795093Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1795327Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1795570Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1795818Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1796060Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1796294Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1796526Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1796746Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1796960Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1797177Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1797417Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1797657Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1797885Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1798116Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1798332Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1798547Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1798801Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1799046Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1799266Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1799477Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.1799685Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1799849Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1800084Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1800347Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1800583Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1800827Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1801061Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1801277Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1801484Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1801718Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1801933Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1802138Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1802378Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1802598Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1802837Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1803058Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1803292Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1803513Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1803747Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1803993Z E1204 10:58:16.274000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1804228Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1804474Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1804723Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1804969Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1805202Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1805444Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1805682Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1805925Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1806162Z E1204 10:58:16.274000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1806377Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1806582Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1806818Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1807074Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1807292Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1807516Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1807734Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1807987Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1808223Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1808465Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1808704Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1808910Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1809157Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1809397Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1809632Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1809873Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1810146Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1810376Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1810595Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1810812Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1811019Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1811227Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1811476Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1811721Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1811970Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1812212Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1812463Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1812670Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1812908Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1813150Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1813387Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1813641Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1813875Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1814080Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1814313Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1814556Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1814794Z E1204 10:58:16.274000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1815037Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1815271Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1815501Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1815721Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1815945Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1816161Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1816413Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1816650Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1816866Z E1204 10:58:16.274000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1817103Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1817347Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1817582Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1817825Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1818070Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1818300Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1818516Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1818731Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1818949Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1819193Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1819428Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1819670Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1819906Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1820185Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1820439Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1820645Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1820894Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1821139Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1821386Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1821630Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1821864Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1822093Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1822311Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1822546Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1822764Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1823008Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1823244Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1823487Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1823722Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1823964Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1824197Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1824440Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1824674Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1824929Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1825166Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1825387Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1825606Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1825822Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1826034Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1826261Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1826483Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1826698Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1826919Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1827125Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1827310Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1827451Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.1827611Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.1827730Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.1827872Z E1204 10:58:16.274000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.1828044Z [W1204 10:58:16.780134927 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1828046Z 2025-12-04T12:10:20.1828205Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.1828513Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1828822Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1828970Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1831390Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1831680Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1831937Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1832160Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1832375Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1832618Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1832854Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1833097Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1833348Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1833590Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1833823Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1834063Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1834300Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1834542Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1834774Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1834988Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1835209Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1835435Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1835675Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1835909Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1836122Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1836366Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1836608Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1836841Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1837045Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1837276Z E1204 
10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1837488Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1837702Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1837935Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1838146Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1838348Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1838580Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1838823Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1839057Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1839298Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1839530Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1839743Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1839974Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1840222Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1840475Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1840708Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1840970Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1841204Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1841444Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1841676Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1841916Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1842164Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1842404Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1842635Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1842876Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1843111Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1843353Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1843585Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1843825Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1844056Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1844312Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1844545Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1844786Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1845028Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1845282Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1845515Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1845731Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1845940Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1846182Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1846415Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1846667Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1846900Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1847141Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1847374Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1847615Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1847847Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1848087Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1848319Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1848559Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1848802Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1849014Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1849216Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1849459Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1849682Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1849904Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1850153Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1850392Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1850625Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1850835Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1851052Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1851287Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1851497Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1851699Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1851930Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1852173Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1852404Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1852646Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1852878Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1853089Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1853325Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1853539Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1853797Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1854031Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1854263Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1854476Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1854689Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1854934Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1855169Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1855424Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1855660Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1855902Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1856137Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1856378Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1856615Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1856857Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1857092Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1857308Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1857515Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1857760Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1857975Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1858205Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1858419Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1858674Z 
E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1858908Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1859148Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1859385Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1859626Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1859872Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1860147Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1860381Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1860623Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1860858Z 
E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1861075Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1861290Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1861498Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1861721Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1861938Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1862191Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1862425Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1862656Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1862868Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1863098Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1863342Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1863579Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1863821Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1864055Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1864311Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1864545Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1864786Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1865019Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1865267Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1865503Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1865743Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1865978Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1866217Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1866463Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1866705Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1866939Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1867190Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1867436Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1867679Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1867913Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1868127Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1868333Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1868567Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1868825Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1869058Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1869301Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1869534Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1869779Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1870014Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1870289Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1870524Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1870767Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1871017Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1871222Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1871457Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1871712Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1871958Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1872200Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1872433Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1872662Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1872876Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1873104Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1873321Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1873564Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1873798Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1874024Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1874242Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1874454Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1874668Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1874911Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1875145Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1875378Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1875592Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.1875799Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1875973Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1876208Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1876426Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1876659Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1876900Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1877134Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1877348Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1877567Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1877801Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1878013Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1878217Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1878453Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1878659Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1878895Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1879098Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1879334Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1879542Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1879786Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1880028Z E1204 10:58:16.319000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1880293Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1880547Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1880794Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1881036Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1881269Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1881514Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1881748Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1882002Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1882236Z E1204 10:58:16.319000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1882448Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1882653Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1882887Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1883116Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1883332Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1883547Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1883762Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1884005Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1884253Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1884495Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1884738Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1884943Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1885188Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1885431Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1885668Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1885910Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1886144Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1886384Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1886601Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1886814Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1887020Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1887224Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1887461Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1887705Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1887940Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1888183Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1888417Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1888632Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1888866Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1889112Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1889357Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1889609Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1889845Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1890048Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1890344Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1890584Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1890842Z E1204 10:58:16.319000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1891084Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1891316Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1891544Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1891762Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1891977Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1892191Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1892431Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1892666Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1892870Z E1204 10:58:16.319000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1893118Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1893359Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1893593Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1893852Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1894098Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1894326Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1894542Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1894756Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1894970Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1895226Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1895459Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1895701Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1895937Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1896179Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1896415Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1896618Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1896852Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1897094Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1897328Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1897580Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1897815Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1898052Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1898267Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1898495Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1898710Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1898950Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1899186Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1899426Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1899672Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1899915Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1900185Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1900427Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1900661Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1900904Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1901137Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1901349Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1901567Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1901772Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1902002Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1902231Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1902455Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1902681Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1902902Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1903109Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1903296Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1903437Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.1903597Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.1903716Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.1903858Z E1204 10:58:16.319000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.1904044Z [W1204 10:58:16.782243943 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.1904046Z 2025-12-04T12:10:20.1904205Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.1904514Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.1904822Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1904969Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1905461Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1905727Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1905968Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1906188Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1906412Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1906656Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1906901Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1907143Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1907395Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1907637Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1907870Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1908112Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1908345Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1908597Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1908830Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1909041Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1909264Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1909477Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1909719Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1909953Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1910194Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1910426Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1910667Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1910913Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1911115Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1911361Z E1204 
10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1911572Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1911790Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1912024Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1912236Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1912439Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1912671Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1912928Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1913159Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1913399Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1913632Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1913843Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1914077Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1914289Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1914530Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1914763Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1915002Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1915247Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1915488Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1915733Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1915975Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1916220Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1916460Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1916691Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1916932Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1917168Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1917423Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1917655Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1917896Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1918129Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1918368Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1918602Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1918840Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1919072Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1919313Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1919555Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1919771Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1919982Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1920266Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1920511Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1920753Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1920984Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1921223Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1921456Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1921696Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1921944Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1922185Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1922417Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1922660Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1922892Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1923104Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1923305Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1923539Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1923748Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1923983Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1924199Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1924438Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1924687Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1924908Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1925112Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1925343Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1925553Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1925757Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1925990Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1926246Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1926477Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1926718Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1926949Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1927160Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1927383Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1927596Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1927840Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1928076Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1928294Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1928519Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1928734Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1928987Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1929220Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1929474Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1929708Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1929949Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1930226Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1930468Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1930720Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1930960Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1931201Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1931414Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1931620Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1931855Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1932072Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1932285Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1932499Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1932742Z 
E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1932990Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1933233Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1933479Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1933721Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1933970Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1934210Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1934450Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1934691Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1934926Z 
E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1935155Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1935367Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1935574Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1935798Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1936013Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1936257Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1936493Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1936710Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1936922Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1937137Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1937389Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1937623Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1937873Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1938108Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1938362Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1938595Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1938836Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1939070Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1939311Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1939559Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1939801Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1940034Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1940310Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1940545Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1940787Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1941021Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1941262Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1941496Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1941757Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1941991Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1942207Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1942426Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1942673Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1942917Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1943151Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1943393Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1943628Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1943870Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1944118Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1944360Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1944594Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1944837Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1945072Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1945276Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1945510Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1945751Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1945986Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1946240Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1946475Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1946703Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1946929Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1947157Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1947372Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1947614Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1947848Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1948074Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1948303Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1948519Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1948733Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1948975Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1949210Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1949429Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1949642Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.1949848Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1950010Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.1950277Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1950484Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1950732Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1950974Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1951220Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1951434Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1951656Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1951890Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1952101Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1952305Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1952540Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1952760Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1952995Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1953197Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1953435Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1953637Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1953873Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1954114Z E1204 10:58:16.321000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1954348Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1954594Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1954827Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1955081Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1955313Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1955554Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1955799Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1956050Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1956284Z E1204 10:58:16.321000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1956497Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1956701Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1956934Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1957183Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1957400Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1957611Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1957827Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1958070Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1958308Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1958551Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1958785Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1958990Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1959224Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1959476Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1959709Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1959951Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1960223Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1960463Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1960683Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1960895Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1961102Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.1961305Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1961556Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1961798Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1962032Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1962274Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1962508Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.1962716Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1962948Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1963190Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1963426Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1963669Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1963917Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1964122Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1964355Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1964610Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1964854Z E1204 10:58:16.321000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1965097Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1965330Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1965559Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1965775Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1966000Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1966215Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1966457Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1966693Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1966900Z E1204 10:58:16.321000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1967135Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1967375Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1967609Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1967850Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1968085Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1968321Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1968538Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1968753Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1968977Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1969228Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1969463Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1969704Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1969940Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1970233Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1970485Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1970689Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1970924Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1971165Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1971400Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1971643Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1971876Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1972104Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.1972321Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1972534Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.1972764Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1973007Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1973241Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1973496Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1973749Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1973990Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1974224Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1974465Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1974698Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1974956Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1975189Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1975402Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.1975619Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.1975825Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.1976037Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.1976266Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.1976487Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.1976699Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.1976909Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.1977125Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.1977311Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.1977451Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0
2025-12-04T12:10:20.1977614Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0
2025-12-04T12:10:20.1977742Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] .
2025-12-04T12:10:20.1977893Z E1204 10:58:16.321000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice.
2025-12-04T12:10:20.1978066Z [W1204 10:58:16.784337370 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...
2025-12-04T12:10:20.1978069Z
2025-12-04T12:10:20.1978227Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning:
2025-12-04T12:10:20.1978538Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.
2025-12-04T12:10:20.1978850Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.1978998Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.1979502Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.1979770Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.1980008Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.1980260Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.1980476Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1980717Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1980955Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1981199Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1981435Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1981689Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1981921Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1982176Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1982407Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1982664Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1982901Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1983112Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1983336Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1983550Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1983805Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1984036Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1984241Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1984476Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1984716Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1984951Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1985154Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1985386Z E1204 
10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1985597Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1985801Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1986043Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1986253Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1986454Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1986696Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1986952Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1987185Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1987425Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1987657Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1987867Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1988103Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.1988316Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1988557Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1988790Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1989032Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1989265Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1989504Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1989735Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1989976Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1990257Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1990514Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1990749Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1991009Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1991242Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1991498Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1991731Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1991971Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1992202Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1992443Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1992689Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1992929Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1993163Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1993403Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1993636Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1993854Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.1994065Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.1994306Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.1994624Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1994865Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1995108Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1995350Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1995597Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1995846Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1996079Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1996318Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1996549Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1996789Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1997023Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1997250Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1997454Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1997687Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1997898Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1998120Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.1998334Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.1998575Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.1998807Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1999018Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1999222Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.1999463Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.1999673Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.1999876Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2000146Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2000404Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2000636Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2000879Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2001111Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2001324Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2001560Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2001774Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2002018Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2002252Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2002469Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2002683Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2002897Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2003138Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2003377Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2003621Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2003866Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2004109Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2004352Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2004593Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2004837Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2005080Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2005315Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2005529Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2005734Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2005980Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2006195Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2006407Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2006622Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2006865Z 
E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2007100Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2007342Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2007575Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2007817Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2008052Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2008310Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2008544Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2008795Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2009029Z 
E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2009256Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2009472Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2009677Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2009902Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2010157Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2010414Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2010648Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2010863Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2011078Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2011292Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2011537Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2011770Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2012010Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2012245Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2012486Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2012734Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2012974Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2013220Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2013462Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2013711Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2013958Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2014192Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2014433Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2014668Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2014920Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2015155Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2015395Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2015631Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2015872Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2016107Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2016321Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2016525Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2016759Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2017001Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2017250Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2017492Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2017737Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2017988Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2018222Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2018465Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2018698Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2018940Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2019172Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2019390Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2019625Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2019867Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2020140Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2020381Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2020616Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2020843Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2021060Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2021273Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2021502Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2021746Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2021978Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2022217Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2022446Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2022661Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2022877Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2023117Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2023352Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2023567Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2023803Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.2024009Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2024177Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.2024414Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2024619Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2026722Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2026969Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2027203Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2027419Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2027629Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2027879Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2028091Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2028296Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2028546Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2028765Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2029001Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2029205Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2029440Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2029649Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2029887Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2030174Z E1204 10:58:16.323000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2030408Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2030651Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2030885Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2031128Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2031443Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2031687Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2031921Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2032164Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2032401Z E1204 10:58:16.323000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2032614Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2032819Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2033066Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2033308Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2033525Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2033744Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2033960Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2034204Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2034453Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2034695Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2034929Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2035132Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2035368Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2035611Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2035861Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2036102Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2036337Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2036565Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2036783Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2036998Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2037203Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2037418Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2037663Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2037908Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2038142Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2038387Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2038621Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2038837Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2039071Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2039311Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2039546Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2039789Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2040023Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2040295Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2040528Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2040770Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2041005Z E1204 10:58:16.323000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2041247Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2041482Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2041711Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2041943Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2042177Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2042395Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2042638Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2042872Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2043076Z E1204 10:58:16.323000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2043311Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2043566Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2043800Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2044044Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2044279Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2044506Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2044733Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2044946Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2045162Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2045403Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2045640Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2045887Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2046119Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2046372Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2046615Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2046822Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2047056Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2047299Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2047532Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2047785Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2048021Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2048249Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2048468Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2048682Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2048901Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2049155Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2049389Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2049632Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2049869Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2050159Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2050392Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2050634Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2050880Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2051133Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2051367Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2051578Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.2051797Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.2052003Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.2052230Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.2052460Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.2052680Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.2052894Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.2053100Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.2053309Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.2053507Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.2053653Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0
2025-12-04T12:10:20.2053815Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0
2025-12-04T12:10:20.2053934Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] .
2025-12-04T12:10:20.2054075Z E1204 10:58:16.323000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice.
2025-12-04T12:10:20.2054144Z ('RERUN', {'yellow': True}) [1.8172s] [100%]
2025-12-04T12:10:20.2054493Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda [W1204 10:58:17.352289537 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...
2025-12-04T12:10:20.2054497Z
2025-12-04T12:10:20.2054654Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning:
2025-12-04T12:10:20.2054962Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.
2025-12-04T12:10:20.2055279Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2055438Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2055935Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2056202Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2056442Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2056672Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2056888Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2057132Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2057367Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2057611Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2057855Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2058098Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2058330Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2058571Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2058804Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2059046Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2059278Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2059505Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2059728Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2059953Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2060244Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2060480Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2060685Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2060919Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2061174Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2061408Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2061610Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2061845Z E1204 
10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2062061Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2062266Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2062514Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2062726Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2062929Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2063161Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2063403Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2063636Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2063878Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2064124Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2064351Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2064575Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2064787Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2065027Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2065259Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2065512Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2065746Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2065985Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2066219Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2066459Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2066692Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2066946Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2067177Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2067420Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2067651Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2067895Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2068127Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2068377Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2068610Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2068860Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2069094Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2069334Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2069568Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2069811Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2070056Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2070313Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2070525Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.2070767Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2070999Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2071253Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2071486Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2071727Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2071959Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2072200Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2072433Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2072672Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2072917Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2073157Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2073402Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2073615Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2073816Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2074048Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2074259Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2074497Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.2074713Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2074953Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2075186Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2075396Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2075598Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2075846Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2076056Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2076263Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2076499Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2076742Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2076973Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2077214Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2077456Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2077679Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2077902Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2078114Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2078359Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2078594Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2078823Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2079037Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2079251Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2079495Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2079728Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2079972Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2080255Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2080502Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2080737Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2080980Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2081217Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2081459Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2081693Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2081919Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2082137Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2082372Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2082592Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2082809Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2083023Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2083279Z 
E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2083513Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2083755Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2083989Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2084232Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2084472Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2084726Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2084960Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2085201Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2085438Z 
E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2085656Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2085868Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2086074Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2086308Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2086536Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2086779Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2087014Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2087231Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2087444Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2087671Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2087912Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2088146Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2088387Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2088622Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2088865Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2089111Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2089355Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2089590Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2089831Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2090066Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2090352Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2090599Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2090840Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2091088Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2091329Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2091563Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2091804Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2092039Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2092304Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2092540Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2092759Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2092963Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2093202Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2093459Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2093694Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2093934Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2094167Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2094409Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2094643Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2094887Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2095131Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2095382Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2095616Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2095822Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2096056Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2096298Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2096534Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2096789Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2097023Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2097252Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2097469Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2097684Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2097909Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2098152Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2098388Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2098616Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2098835Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2099055Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2099269Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2099525Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2099760Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2099988Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2100246Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.2100453Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2100615Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.2100854Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2101075Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2101310Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2101552Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2101786Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2102000Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2102205Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2102458Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2102670Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2102877Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2103112Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2103316Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2103551Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2103754Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2104001Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2104216Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2104452Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2104696Z E1204 10:58:17.891000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2104931Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2105173Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2105406Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2105659Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2105892Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2106134Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2106369Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2106610Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2106854Z E1204 10:58:17.891000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2107068Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2107272Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2107506Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2107734Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2107951Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2108163Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2108389Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2108648Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2108887Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2109129Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2109363Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2109569Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2109803Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2110056Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2110323Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2110566Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2110799Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2111029Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2111260Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2111472Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2111679Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2111882Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2112117Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2112358Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2112592Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2112846Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2113093Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2113298Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2113531Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2113773Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2114007Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2114249Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2114498Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2114700Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2114937Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2115179Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2115414Z E1204 10:58:17.891000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2115666Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2115900Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2116127Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2116344Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2116558Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2116773Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2117016Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2117260Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2117475Z E1204 10:58:17.891000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2117710Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2117951Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2118184Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2118426Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2118661Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2118899Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2119116Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2119330Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2119544Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2119787Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2120030Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2120313Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2120548Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2120789Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2121024Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2121230Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2121464Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2121725Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2121971Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2122215Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2122448Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2122675Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2122892Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2123106Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2123336Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2123578Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2123813Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2124052Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2124287Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2124540Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2124775Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2125016Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2125253Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2125496Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2125729Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2125941Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.2126174Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.2126388Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.2126599Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.2126827Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.2127048Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.2127261Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.2127467Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.2127686Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.2127872Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.2128013Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.2128174Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.2128293Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.2128434Z E1204 10:58:17.891000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.2128607Z [W1204 10:58:17.354515422 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.2128610Z 2025-12-04T12:10:20.2128778Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2129088Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.2129395Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2129542Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2130032Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2130333Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2130584Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2130816Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2131033Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2131276Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2131513Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2131754Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2132001Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2132243Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2132474Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2132714Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2132946Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2133202Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2133437Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2133648Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2133873Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2134087Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2136339Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2136578Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2136783Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2137035Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2137293Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2137531Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2137734Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2137967Z E1204 
10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2138178Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2138396Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2138629Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2138838Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2139041Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2139274Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2139521Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2139765Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2140010Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2140269Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2140482Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2140707Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2140922Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2141167Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2141418Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2141672Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2141906Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2142146Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2142377Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2142617Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2142864Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2143105Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2143338Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2143582Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2143813Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2144056Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2144308Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2144551Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2144783Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2145025Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2145325Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2145566Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2145800Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2146053Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2146301Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2146517Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2146728Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.2146970Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2147203Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2147459Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2147695Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2147936Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2148169Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2148409Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2148652Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2148893Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2149128Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2149368Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2149602Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2149817Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2150019Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2150304Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2150514Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2150751Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.2150965Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2151207Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2151440Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2151654Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2151875Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2152108Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2152320Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2152521Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2152754Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2152997Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2153242Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2153484Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2153717Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2153930Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2154154Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2154369Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2154614Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2154858Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2155085Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2155301Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2155517Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2155760Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2155996Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2156239Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2156485Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2156726Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2156960Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2157201Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2157436Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2157689Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2157926Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2158139Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2158345Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2158579Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2158796Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2159008Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2159233Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2159485Z 
E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2159723Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2159966Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2160234Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2160476Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2160733Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2160974Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2161207Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2161450Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2161685Z 
E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2161904Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2162131Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2162338Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2162562Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2162778Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2163020Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2163255Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2163470Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2163696Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2163924Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2164169Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2164403Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2164644Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2164878Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2165131Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2165367Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2165608Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2165844Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2166085Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2166319Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2166570Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2166805Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2167046Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2167281Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2167522Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2167756Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2168020Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2168254Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2168506Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2168741Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2168955Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2169160Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2169394Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2169646Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2169882Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2170185Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2170421Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2170663Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2170914Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2171156Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2171390Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2171632Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2171867Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2172075Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2172309Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2172562Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2172796Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2173051Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2173286Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2173513Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2173730Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2173945Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2174173Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2174417Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2174653Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2174881Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2175097Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2175324Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2175539Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2175780Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2176016Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2176233Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2176446Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.2176652Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2176816Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.2177062Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2177282Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2177517Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2177757Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2177993Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2178206Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2178422Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2178657Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2178868Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2179074Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2179307Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2179513Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2179756Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2179962Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2180233Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2180437Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2180672Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2180914Z E1204 10:58:17.893000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2181148Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2181402Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2181649Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2181892Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2182130Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2182371Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2182604Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2182846Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2183094Z E1204 10:58:17.893000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2183308Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2183512Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2183745Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2183975Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2184204Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2184418Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2184633Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2184876Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2185112Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2185354Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2185588Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2185802Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2186046Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2186289Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2186523Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2186764Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2186997Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2187225Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2187453Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2187666Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2187872Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2188077Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2188313Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2188564Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2188799Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2189040Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2189275Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2189481Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2189716Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2189958Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2190239Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2190494Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2190728Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2190933Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2191167Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2191409Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2191643Z E1204 10:58:17.893000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2191897Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2192130Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2192359Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2192576Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2192790Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2193017Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2193259Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2193492Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2193696Z E1204 10:58:17.893000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2193930Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2194172Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2194407Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2194664Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2194907Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2195133Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2195350Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2195564Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2195779Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2196021Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2196266Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2196509Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2196742Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2196984Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2197218Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2197434Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2197669Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2197909Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2198143Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2198386Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2198622Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2198848Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2199076Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2199299Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2199514Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2199755Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2199987Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2200263Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2200498Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2200754Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2200987Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2201227Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2201460Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2201702Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2201950Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2202162Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.2202379Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.2202586Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.2202797Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.2203026Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.2203246Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.2203473Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.2203690Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.2203898Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.2204084Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.2204224Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0
2025-12-04T12:10:20.2204384Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0
2025-12-04T12:10:20.2204503Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] .
2025-12-04T12:10:20.2204645Z E1204 10:58:17.893000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice.
2025-12-04T12:10:20.2204828Z [W1204 10:58:17.356538179 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...
2025-12-04T12:10:20.2204830Z
2025-12-04T12:10:20.2204991Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning:
2025-12-04T12:10:20.2205300Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.
2025-12-04T12:10:20.2205607Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first):
2025-12-04T12:10:20.2205753Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback:
2025-12-04T12:10:20.2206258Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0
2025-12-04T12:10:20.2206527Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0
2025-12-04T12:10:20.2206766Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0
2025-12-04T12:10:20.2206986Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552
2025-12-04T12:10:20.2207201Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215
2025-12-04T12:10:20.2207442Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112
2025-12-04T12:10:20.2207677Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2207928Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2208171Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2208415Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2208647Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2208887Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2209119Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2209370Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2209602Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2209814Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2210037Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2210278Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2210522Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2210773Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2210977Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2211210Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2211451Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2211688Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2211890Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2212124Z E1204 
10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2212347Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2212565Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2212797Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2213007Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2213209Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2213441Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2213682Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2213926Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2214168Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2214399Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2214614Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2214839Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2215063Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2215303Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2215535Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2215775Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2216007Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2216249Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2216482Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2216735Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2216976Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2217218Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2217449Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2217689Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2217921Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2218172Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2218404Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2218645Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2218878Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2219118Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2219351Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2219602Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2219838Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2220078Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2220344Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2220560Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2220772Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.2221011Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2221259Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2221512Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2221745Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2221985Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2222217Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2222456Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2222705Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2222946Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2223178Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2223418Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2223651Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2223876Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2224080Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2224312Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2224522Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2224745Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.2224963Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2225204Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2225436Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2225656Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2225868Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2226102Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2226312Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2226515Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2226749Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2226991Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2227243Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2227484Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2227715Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2227926Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2228149Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2228372Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2228616Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2228854Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2229072Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2229285Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2229501Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2229742Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2229989Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2230272Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2230508Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2230749Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2230983Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2231225Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2231476Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2231719Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2231951Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2232166Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2232370Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2232605Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2232837Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2233050Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2233265Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2233507Z 
E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2233743Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2233986Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2234219Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2234474Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2234720Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2234964Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2235197Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2235438Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2235673Z 
E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2235900Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2236115Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2236320Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2236545Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2236759Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2237004Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2237249Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2237465Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2237678Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2237892Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2238136Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2238370Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2238612Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2238857Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2239110Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2239346Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2239586Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2239820Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2240062Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2240342Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2240585Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2240818Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2241062Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2241300Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2241542Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2241791Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2242031Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2242265Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2242506Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2242741Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2242955Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2243163Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2243409Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2243671Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2243908Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2244149Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2244384Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2244624Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2244871Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2245115Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2245348Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2245590Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2245824Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2246042Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2246275Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2246516Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2246753Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2246995Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2247231Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2247457Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2247683Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2247896Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2248121Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2248364Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2248597Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2248824Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2249039Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2249265Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2249479Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2249722Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2249957Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2250208Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2250422Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.2250642Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2250805Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.2251040Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2251247Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2251482Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2251723Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2251957Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2252182Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2252402Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2252637Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2252848Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2253051Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2253287Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2253494Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2253745Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2253950Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2254183Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2254388Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2254623Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2254876Z E1204 10:58:17.895000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2255110Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2255355Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2255588Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2255830Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2256065Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2256306Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2256550Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2256802Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2257036Z E1204 10:58:17.895000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2257251Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2257454Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2257688Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2257915Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2258147Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2258360Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2258575Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2258817Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2259051Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2259306Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2259542Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2259747Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2259981Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2260257Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2260492Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2260732Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2260985Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2261225Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2261444Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2261658Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2261862Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2262068Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2262303Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2262557Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2262792Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2263033Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2263270Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2263476Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2263723Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2263965Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2264198Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2264439Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2264674Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2264879Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2265112Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2265366Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2265600Z E1204 10:58:17.895000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2265855Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2266089Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2266314Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2266531Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2266744Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2266969Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2267213Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2267447Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2267652Z E1204 10:58:17.895000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2267886Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2268139Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2268372Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2268614Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2268847Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2269074Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2269291Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2269505Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2269732Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2269972Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2270248Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2270493Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2270726Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2270970Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2271205Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2271427Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2271662Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2271904Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2272141Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2272382Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2272629Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2272856Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2273074Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2273286Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2273504Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2273747Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2273980Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2274235Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2274480Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2274723Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2274957Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2275201Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2275435Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2275680Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2275926Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2276138Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.2276355Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.2276560Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.2276775Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.2277023Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.2277246Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.2277458Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.2277665Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.2277872Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.2278058Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.2278198Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.2278358Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.2278475Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.2278629Z E1204 10:58:17.895000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.2278810Z [W1204 10:58:17.399849554 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.2278814Z 2025-12-04T12:10:20.2278972Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2279281Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.2279590Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2279735Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2280262Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2280545Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2280784Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2281004Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2281219Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2281474Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2281711Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2281952Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2282185Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2282425Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2282659Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2282899Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2283148Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2283403Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2283637Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2283848Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2284069Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2284285Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2284542Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2284774Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2284978Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2285211Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2285453Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2285687Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2285900Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2286132Z E1204 
10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2286343Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2286546Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2286780Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2286992Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2287193Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2287440Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2287694Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2287928Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2288169Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2288400Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2288613Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2288835Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2289060Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2289302Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2289535Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2289776Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2290008Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2290292Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2290524Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2290765Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2290998Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2291239Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2291472Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2291714Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2291959Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2292212Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2292446Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2292689Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2292920Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2293162Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2293413Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2293656Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2293888Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2294128Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2294361Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2294578Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2294799Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.2295040Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2295274Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2295514Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2295749Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2295990Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2296222Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2296472Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2296714Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2296956Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2297187Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2297429Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2297662Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2297885Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2298088Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2298319Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2298530Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2298750Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.2298966Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2299218Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2299451Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2299664Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2299867Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2300138Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2300349Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2300552Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2300797Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2301051Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2301286Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2301525Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2301761Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2301972Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2302195Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2302428Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2302673Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2302907Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2303123Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2303337Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2303565Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2303811Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2304047Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2304287Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2304522Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2304764Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2304998Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2305249Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2305495Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2305739Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2305974Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2306187Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2306393Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2306632Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2306862Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2307074Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2307288Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2307531Z 
E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2307768Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2308022Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2308257Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2308499Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2308731Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2308977Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2309211Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2309451Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2309695Z 
E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2309930Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2310182Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2310387Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2310612Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2310827Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2311069Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2311321Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2311537Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2311750Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2311966Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2312209Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2312455Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2312696Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2312932Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2313174Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2313410Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2313652Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2313886Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2314139Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2314388Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2314630Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2314865Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2315105Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2315339Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2315594Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2315830Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2316071Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2316305Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2316546Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2316781Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2317011Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2317216Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2317450Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2317694Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2317932Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2318172Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2318405Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2318655Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2318899Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2319140Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2319374Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2319616Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2319850Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2320068Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2320322Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2320563Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2320796Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2321037Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2321285Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2321512Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2321729Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2321944Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2322158Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2322400Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2322634Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2322860Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2323087Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2323312Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2323527Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2323767Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2324003Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2324219Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2324448Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.2324655Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2324817Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.2325053Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2325259Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2325493Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2325747Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2325987Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2326200Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2326403Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2326637Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2326849Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2327052Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2327304Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2327508Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2327754Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2327959Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2328191Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2328395Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2328628Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2328885Z E1204 10:58:17.939000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2329123Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2329363Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2329598Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2329840Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2330086Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2330367Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2330602Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2330844Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2331078Z E1204 10:58:17.939000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2331292Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2331495Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2331746Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2331975Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2332208Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2332425Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2332639Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2332883Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2333116Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2333375Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2333608Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2333816Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2334052Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2334293Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2334543Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2334786Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2335020Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2335247Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2335467Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2335681Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2335888Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2336103Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2336337Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2336591Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2336824Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2337065Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2337299Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2337502Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2337746Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2337989Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2338225Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2338466Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2338699Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2338914Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2339147Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2339389Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2339624Z E1204 10:58:17.939000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2339870Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2340143Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2340368Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2340603Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2340816Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2341046Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2341290Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2341524Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2341729Z E1204 10:58:17.939000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2341966Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2342222Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2342455Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2342696Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2342928Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2343156Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2343391Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2343606Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2343821Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2344065Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2344303Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2344546Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2344781Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2345032Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2345267Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2345482Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2345718Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2345961Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2346194Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2346437Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2346683Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2346910Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2347126Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2347338Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2347553Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2347805Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2348042Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2348284Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2348517Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2348760Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2348993Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2349236Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2349480Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2349736Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2349972Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2350219Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.2350436Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.2350641Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.2350854Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.2351098Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.2351322Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.2351534Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.2351742Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.2351949Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.2352134Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.2352289Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.2352448Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.2352566Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.2352706Z E1204 10:58:17.939000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.2352876Z [W1204 10:58:17.401899771 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.2352880Z 2025-12-04T12:10:20.2353040Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2353347Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.2353656Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2353817Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2354308Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2354589Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2354826Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2355047Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2355260Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2355514Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2355750Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2355992Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2356225Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2356466Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2356710Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2356949Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2357184Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2357425Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2357659Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2357875Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2358095Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2358320Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2358571Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2358805Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2359007Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2359240Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2359480Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2359713Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2359935Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2360210Z E1204 
10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2360421Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2360623Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2360857Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2361083Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2361289Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2361523Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2361767Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2362000Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2362241Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2362473Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2362697Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2362917Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2363147Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2363389Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2363621Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2363862Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2364095Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2364352Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2364583Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2364823Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2365055Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2365299Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2365542Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2365784Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2366021Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2366262Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2366495Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2366735Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2366966Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2367217Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2367458Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2367700Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2367931Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2368171Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2368404Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2368632Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2368843Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.2369082Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2369317Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2369555Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2369791Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2370046Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2370322Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2370565Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2370797Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2371039Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2371270Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2371509Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2371755Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2371980Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2372184Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2372416Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2372626Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2372848Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.2373062Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2373317Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2373549Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2373762Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2373963Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2374198Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2374425Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2374628Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2374862Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2375102Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2375335Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2375576Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2375809Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2376030Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2376271Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2376484Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2376730Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2376963Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2377179Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2377393Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2377620Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2377866Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2378098Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2378340Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2378574Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2378825Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2379059Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2379302Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2379535Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2379780Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2380015Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2380262Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2380478Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2380724Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2380940Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2381154Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2381367Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2381611Z 
E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2381846Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2382101Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2382337Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2382577Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2382811Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2383053Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2383300Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2383542Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2383776Z 
E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2383996Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2384210Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2384419Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2384644Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2384868Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2385109Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2385354Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2385574Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2385787Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2386003Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2386246Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2386495Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2386737Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2386969Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2387211Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2387446Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2387700Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2387937Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2388178Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2388413Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2388653Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2388890Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2389130Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2389374Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2389627Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2389863Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2390139Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2390372Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2390615Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2390862Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2391076Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2391281Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2391515Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2391759Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2391995Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2392249Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2392483Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2392725Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2392960Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2393201Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2393435Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2393680Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2393933Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2394151Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2394384Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2394626Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2394859Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2395100Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2395344Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2395571Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2395788Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2396002Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2396216Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2396458Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2396703Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2396929Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2397146Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2397360Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2397580Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2397825Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2398058Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2398282Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2398504Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.2398712Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2398876Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.2399109Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2399315Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2399550Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2399808Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2400041Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2400296Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2400504Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2400738Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2400967Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2401171Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2401406Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2401612Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2401848Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2402052Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2402286Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2402489Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2402736Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2402993Z E1204 10:58:17.941000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2403225Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2403469Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2403704Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2403946Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2404193Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2404434Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2404667Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2404908Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2405142Z E1204 10:58:17.941000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2405367Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2405572Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2405807Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2406035Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2406252Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2406465Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2406681Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2406933Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2407166Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2407419Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2407652Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2407857Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2408091Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2408332Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2408578Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2408820Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2409053Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2409279Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2409498Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2409733Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2409942Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2410164Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2410398Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2410641Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2410875Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2411117Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2411365Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2411571Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2411819Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2412061Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2412297Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2412539Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2412775Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2412992Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2413227Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2413469Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2413706Z E1204 10:58:17.941000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2413949Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2414194Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2414423Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2414638Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2414852Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2415068Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2415310Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2415543Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2415748Z E1204 10:58:17.941000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2415992Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2416245Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2416479Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2416721Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2416954Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2417182Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2417410Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2417631Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2417844Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2418087Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2418322Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2418575Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2418808Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2419049Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2419284Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2419491Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2419727Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2419969Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2420249Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2420491Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2423796Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2424028Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2424244Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2424458Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2424674Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2424939Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2425175Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2425416Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2425653Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2425895Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2426142Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2426383Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2426617Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2426859Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2427093Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2427305Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.2427521Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.2427737Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.2427946Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.2428189Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.2428411Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.2428622Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.2428834Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.2429041Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.2429245Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.2429387Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.2429547Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.2429666Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.2429808Z E1204 10:58:17.941000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.2429979Z [W1204 10:58:17.403945358 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.2429982Z 2025-12-04T12:10:20.2430174Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2430499Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.2430809Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2430952Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2431446Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2431716Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2431954Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2432186Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2432410Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2432654Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2432891Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2433133Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2433366Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2433606Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2433851Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2434091Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2434324Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2434564Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2434800Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2435023Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2435246Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2435461Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2435700Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2435936Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2436139Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2436371Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2436620Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2436864Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2437069Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2437300Z E1204 
10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2437512Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2437715Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2437947Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2438166Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2438369Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2438602Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2438844Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2439076Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2439325Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2439558Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2439768Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2439990Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2440238Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2441967Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2442203Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2442465Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2442697Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2442952Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2443185Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2443425Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2443658Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2443898Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2444146Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2444388Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2444618Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2444859Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2445093Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2445345Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2445576Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2445816Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2446048Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2446288Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2446524Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2446764Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2447004Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2447238Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2447448Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.2447689Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2447921Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2448161Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2448395Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2448645Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2448877Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2449116Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2449348Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2449589Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2449832Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2450072Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2450338Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2450554Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2450756Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2450988Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2451197Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2451431Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.2451657Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2451898Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2452131Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2452341Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2452544Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2452776Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2453003Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2453205Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2453435Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2453676Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2453907Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2454161Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2454396Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2454610Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2454832Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2455047Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2455292Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2455526Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2455751Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2455964Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2456189Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2456434Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2456667Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2456910Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2457145Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2457402Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2457638Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2457878Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2458112Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2458355Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2458601Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2458814Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2459019Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2459253Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2459470Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2459683Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2459897Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2460179Z 
E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2460414Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2460668Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2460903Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2461143Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2461378Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2461619Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2461870Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2462110Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2462346Z 
E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2462565Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2462779Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2463001Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2463225Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2463440Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2463681Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2463916Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2464135Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2464347Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2464578Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2464819Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2465064Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2465305Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2465541Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2465782Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2466016Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2466270Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2466506Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2466746Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2466979Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2467221Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2467466Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2467706Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2467941Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2468182Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2468418Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2468659Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2468893Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2469144Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2469387Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2469601Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2469805Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2470038Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2470319Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2470553Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2470810Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2471042Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2471283Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2471517Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2471759Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2472003Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2472245Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2472480Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2472683Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2472918Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2473159Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2473392Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2473645Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2473889Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2474119Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2474336Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2474551Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2474764Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2475018Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2475252Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2475478Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2475695Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2475907Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2476124Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2476374Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2476609Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2476825Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2477037Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.2477244Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2477408Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.2477642Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2477854Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2478091Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2478345Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2478579Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2478792Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2478996Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2479232Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2479455Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2479659Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2479893Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2480133Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2480368Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2480572Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2480825Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2481028Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2481262Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2481504Z E1204 10:58:17.943000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2481738Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2481979Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2482213Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2482466Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2482715Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2482956Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2483190Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2483431Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2483664Z E1204 10:58:17.943000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2483890Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2484096Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2484332Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2484560Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2484776Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2484989Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2485214Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2485456Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2485690Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2485932Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2486169Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2486374Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2486610Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2486861Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2487104Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2487345Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2487578Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2487806Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2488022Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2488246Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2488452Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2488654Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2488888Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2489129Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2489363Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2489614Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2489848Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2490053Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2490323Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2490567Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2490800Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2491041Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2491287Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2491503Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2491738Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2491979Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2492214Z E1204 10:58:17.943000 
686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2492455Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2492702Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2492929Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2493144Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2493358Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2493572Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2493815Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2494061Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2494267Z E1204 10:58:17.943000 686634 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2494502Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2494747Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2494983Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2495222Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2495455Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2495692Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2495920Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2496135Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2496349Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2496591Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2496823Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2497084Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2497318Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2497558Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2497792Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2497996Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2498233Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2498484Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2498718Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2498961Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2499195Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2499424Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2499640Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2499854Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2500079Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2500386Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2500621Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2500861Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2501096Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2501336Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2501588Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2501830Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2502064Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2502307Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2502540Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2502767Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.2502983Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.2503187Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.2503397Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.2503627Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.2503850Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.2504062Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.2504270Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.2504489Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.2504690Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.2504832Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.2504992Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.2505111Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.2505251Z E1204 10:58:17.943000 686634 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.2505310Z FAILED [1.6397s] [100%] 2025-12-04T12:10:20.2505312Z 2025-12-04T12:10:20.2505387Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2505549Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2505623Z Traceback (most recent call last): 2025-12-04T12:10:20.2505804Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2505863Z method(*args, **kwargs) 2025-12-04T12:10:20.2506032Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2506089Z method(*args, **kwargs) 2025-12-04T12:10:20.2506254Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2506309Z with policy(): 2025-12-04T12:10:20.2506476Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2506533Z raise RuntimeError(msg) 2025-12-04T12:10:20.2506944Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1973420032. 
2025-12-04T12:10:20.2506948Z 2025-12-04T12:10:20.2507051Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2507328Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2507330Z 2025-12-04T12:10:20.2507436Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2507531Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2507593Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2507669Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2508244Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2508361Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2508417Z graph_break [] 2025-12-04T12:10:20.2508498Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.2508589Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2509100Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2509176Z current_size = base.storage().size() 2025-12-04T12:10:20.2509235Z Autotune Choices Stats: 2025-12-04T12:10:20.2509623Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009759999811649323, "best_triton_pos": 0} 2025-12-04T12:10:20.2509709Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2509778Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2509916Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2510201Z triton_mm_29 0.0098 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2510462Z triton_mm_30 0.0116 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2510705Z triton_mm_21 0.0116 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2510946Z triton_mm_33 0.0116 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2511183Z triton_mm_16 0.0119 ms 82.2% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2511430Z triton_mm_22 0.0122 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2511672Z triton_mm_34 0.0122 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2511914Z triton_mm_23 0.0124 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2512152Z triton_mm_25 0.0131 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2512391Z triton_mm_31 0.0134 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2512537Z SingleProcess AUTOTUNE benchmarking takes 0.1793 seconds and 8.4601 seconds precompiling for 33 choices 2025-12-04T12:10:20.2512697Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2512773Z Traceback (most recent call last): 2025-12-04T12:10:20.2512947Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2513029Z method(*args, **kwargs) 
2025-12-04T12:10:20.2513198Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2513254Z method(*args, **kwargs) 2025-12-04T12:10:20.2513421Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2513474Z with policy(): 2025-12-04T12:10:20.2513643Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2513700Z raise RuntimeError(msg) 2025-12-04T12:10:20.2514111Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1973420032 and is now 2940207104. 2025-12-04T12:10:20.2514114Z 2025-12-04T12:10:20.2514219Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2514492Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2514494Z 2025-12-04T12:10:20.2514598Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2514688Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2514749Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2514823Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2515385Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2515502Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2515565Z graph_break [] 2025-12-04T12:10:20.2515649Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.2515738Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2516235Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2516301Z current_size = base.storage().size() 2025-12-04T12:10:20.2516359Z Autotune Choices Stats: 2025-12-04T12:10:20.2516742Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009759999811649323, "best_triton_pos": 0} 2025-12-04T12:10:20.2516827Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2516894Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2517031Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2517287Z triton_mm_29 0.0098 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:20.2517540Z triton_mm_30 0.0116 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2517778Z triton_mm_21 0.0116 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2518020Z triton_mm_33 0.0116 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2518257Z triton_mm_16 0.0119 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2518503Z triton_mm_22 0.0122 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2518743Z triton_mm_34 0.0122 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2518983Z triton_mm_23 0.0124 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2519219Z triton_mm_25 0.0131 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2519458Z triton_mm_31 0.0134 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2519612Z SingleProcess AUTOTUNE benchmarking takes 0.1793 seconds and 8.4601 seconds precompiling for 33 choices 2025-12-04T12:10:20.2519704Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2519762Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2519836Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2519950Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2520480Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2520537Z graph_break [] 2025-12-04T12:10:20.2520616Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.2520707Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2521089Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.2521217Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.2521275Z Autotune Choices Stats: 2025-12-04T12:10:20.2521666Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_67", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.010160000063478947, "best_triton_pos": 0} 2025-12-04T12:10:20.2521750Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2521819Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2521953Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2522198Z triton_mm_67 0.0102 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2522258Z _scaled_mm 0.0105 ms 96.9% 2025-12-04T12:10:20.2522497Z triton_mm_71 0.0113 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2522752Z triton_mm_60 0.0115 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2522988Z triton_mm_59 0.0116 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2523228Z triton_mm_72 0.0117 ms 87.0% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2523463Z triton_mm_68 0.0119 ms 85.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2523713Z triton_mm_54 0.0121 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2523952Z triton_mm_61 0.0126 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2524188Z triton_mm_63 0.0130 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2524333Z SingleProcess AUTOTUNE benchmarking takes 0.2723 seconds and 0.8741 seconds precompiling for 39 choices 2025-12-04T12:10:20.2524403Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2524563Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2524624Z Traceback (most recent call last): 2025-12-04T12:10:20.2524797Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2524854Z method(*args, **kwargs) 2025-12-04T12:10:20.2525022Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2525080Z method(*args, **kwargs) 2025-12-04T12:10:20.2525257Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2525312Z with policy(): 2025-12-04T12:10:20.2525489Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2525550Z raise RuntimeError(msg) 2025-12-04T12:10:20.2525956Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2940207104 and is now 3906994176. 2025-12-04T12:10:20.2525959Z 2025-12-04T12:10:20.2526051Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2526323Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2526326Z 2025-12-04T12:10:20.2526429Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2526519Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2526592Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2526665Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2527225Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), 
('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2527340Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2527393Z graph_break [] 2025-12-04T12:10:20.2527475Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.2527563Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2528070Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2528134Z current_size = base.storage().size() 2025-12-04T12:10:20.2528191Z Autotune Choices Stats: 2025-12-04T12:10:20.2528571Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009759999811649323, "best_triton_pos": 0} 2025-12-04T12:10:20.2528657Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2528727Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2528861Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2529107Z triton_mm_29 0.0098 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:20.2529344Z triton_mm_30 0.0116 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2529596Z triton_mm_21 0.0116 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2529846Z triton_mm_33 0.0116 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2530085Z triton_mm_16 0.0119 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2530361Z triton_mm_22 0.0122 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2530601Z triton_mm_34 0.0122 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2530841Z triton_mm_23 0.0124 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2531094Z triton_mm_25 0.0131 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2531333Z triton_mm_31 0.0134 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2531479Z SingleProcess AUTOTUNE benchmarking takes 0.1793 seconds and 8.4601 seconds precompiling for 33 choices 2025-12-04T12:10:20.2531569Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2531630Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2531704Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2531819Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2532332Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2532387Z graph_break [] 2025-12-04T12:10:20.2532465Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.2532556Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2532933Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.2533042Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.2533099Z Autotune Choices Stats: 2025-12-04T12:10:20.2533476Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_67", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.010160000063478947, "best_triton_pos": 0} 2025-12-04T12:10:20.2533559Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2533638Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2533788Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2534035Z triton_mm_67 0.0102 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2534094Z _scaled_mm 0.0105 ms 96.9% 2025-12-04T12:10:20.2534337Z triton_mm_71 0.0113 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2534575Z triton_mm_60 0.0115 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2534811Z triton_mm_59 0.0116 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2535063Z triton_mm_72 0.0117 ms 87.0% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2535299Z triton_mm_68 0.0119 ms 85.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2535534Z triton_mm_54 0.0121 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2535774Z triton_mm_61 0.0126 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2536010Z triton_mm_63 0.0130 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2536163Z SingleProcess AUTOTUNE benchmarking takes 0.2723 seconds and 0.8741 seconds precompiling for 39 choices 2025-12-04T12:10:20.2536253Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2536312Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2536387Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2536502Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2536996Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 
1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2537050Z graph_break [] 2025-12-04T12:10:20.2537130Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:20.2537219Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2537277Z Autotune Choices Stats: 2025-12-04T12:10:20.2537663Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_105", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.010440000332891941, "best_triton_pos": 0} 2025-12-04T12:10:20.2537757Z AUTOTUNE scaled_mm(257x1024, 1024x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2537825Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2537960Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2538208Z triton_mm_105 0.0104 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2538446Z triton_mm_97 0.0111 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2538689Z triton_mm_110 0.0113 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2538929Z triton_mm_109 0.0114 ms 91.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2539181Z triton_mm_106 0.0117 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2539419Z triton_mm_92 0.0120 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2539656Z triton_mm_98 0.0121 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2539894Z triton_mm_99 0.0124 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2540178Z triton_mm_101 0.0131 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2540418Z triton_mm_107 0.0133 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2540561Z SingleProcess AUTOTUNE benchmarking takes 0.2695 seconds and 0.7163 seconds precompiling for 39 choices 2025-12-04T12:10:20.2540768Z - generated xml file: 
/var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7d56d798f2d3fed1.xml - 2025-12-04T12:10:20.2540845Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2541446Z FAILED [1.6397s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2940207104 and is now 3906994176. 2025-12-04T12:10:20.2541450Z 2025-12-04T12:10:20.2541540Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2541822Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2541842Z 2025-12-04T12:10:20.2541948Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2542026Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.2542111Z ================= 1 failed, 187 deselected, 2 rerun in 14.42s ================== 2025-12-04T12:10:20.2542166Z Got exit code 1 2025-12-04T12:10:20.2542384Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2542527Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.2542687Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-5d39ccdf53d7d309.xml 2025-12-04T12:10:20.2542762Z ============================= test session starts ============================== 2025-12-04T12:10:20.2542891Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2542962Z cachedir: .pytest_cache 2025-12-04T12:10:20.2543136Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2543199Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2543258Z configfile: pytest.ini 2025-12-04T12:10:20.2543439Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2543530Z collecting ... collected 188 items / 91 deselected / 97 selected 2025-12-04T12:10:20.2543599Z stepcurrent: skipping 91 already run items. 
2025-12-04T12:10:20.2543660Z Running 97 items in this shard 2025-12-04T12:10:20.2543662Z 2025-12-04T12:10:20.2543892Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.6146s] [ 1%] 2025-12-04T12:10:20.2544114Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2712s] [ 1%] 2025-12-04T12:10:20.2544330Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2305s] [ 1%] 2025-12-04T12:10:20.2544333Z 2025-12-04T12:10:20.2544403Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2544556Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2544618Z Traceback (most recent call last): 2025-12-04T12:10:20.2544794Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2544853Z method(*args, **kwargs) 2025-12-04T12:10:20.2545021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2545078Z method(*args, **kwargs) 2025-12-04T12:10:20.2545244Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2545298Z with policy(): 2025-12-04T12:10:20.2545465Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2545524Z raise RuntimeError(msg) 2025-12-04T12:10:20.2545930Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.2545933Z 2025-12-04T12:10:20.2546034Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2546303Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2546307Z 2025-12-04T12:10:20.2546411Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2546502Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2546561Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2546635Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2546718Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2546833Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2546887Z graph_break [] 2025-12-04T12:10:20.2546965Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2547120Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2547193Z Traceback (most recent call last): 2025-12-04T12:10:20.2547361Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2547418Z method(*args, **kwargs) 2025-12-04T12:10:20.2547583Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2547640Z method(*args, **kwargs) 2025-12-04T12:10:20.2547805Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2547858Z with policy(): 2025-12-04T12:10:20.2548025Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2548085Z raise RuntimeError(msg) 2025-12-04T12:10:20.2548479Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.2548483Z 2025-12-04T12:10:20.2548584Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2548854Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2548856Z 2025-12-04T12:10:20.2548958Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2549051Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2549112Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2549188Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2549271Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2549385Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2549438Z graph_break [] 2025-12-04T12:10:20.2549515Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2549604Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2549663Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2549735Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2549847Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2549927Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2549989Z graph_break [] 2025-12-04T12:10:20.2550063Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2550188Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2550343Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2550405Z Traceback (most recent call last): 2025-12-04T12:10:20.2550575Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2550632Z method(*args, **kwargs) 2025-12-04T12:10:20.2550797Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2550853Z method(*args, **kwargs) 2025-12-04T12:10:20.2551019Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2551075Z with policy(): 2025-12-04T12:10:20.2551240Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2551299Z raise RuntimeError(msg) 2025-12-04T12:10:20.2551708Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.2551711Z 2025-12-04T12:10:20.2551800Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2552067Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2552069Z 2025-12-04T12:10:20.2552170Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2552260Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2552319Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2552393Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2552473Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2552586Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2552652Z graph_break [] 2025-12-04T12:10:20.2552727Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2552815Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2552874Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2552945Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2553057Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2553139Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2553196Z graph_break [] 2025-12-04T12:10:20.2553269Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2553360Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2553418Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2553489Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2553601Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2553683Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2553735Z graph_break [] 2025-12-04T12:10:20.2553809Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2554016Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-5d39ccdf53d7d309.xml - 2025-12-04T12:10:20.2554106Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2554692Z FAILED [0.2305s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.2554713Z 2025-12-04T12:10:20.2554801Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2555067Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2555069Z 2025-12-04T12:10:20.2555171Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2555249Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.2555335Z ================== 1 failed, 91 deselected, 2 rerun in 2.14s =================== 2025-12-04T12:10:20.2555399Z Got exit code 1 2025-12-04T12:10:20.2555456Z Retrying single test... 
2025-12-04T12:10:20.2555615Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e37773af631fed8c.xml 2025-12-04T12:10:20.2555691Z ============================= test session starts ============================== 2025-12-04T12:10:20.2555818Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2555875Z cachedir: .pytest_cache 2025-12-04T12:10:20.2556047Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2556109Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2556167Z configfile: pytest.ini 2025-12-04T12:10:20.2556346Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2556438Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.2556699Z stepcurrent: skipping 91 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2556770Z Running 1 items in this shard 2025-12-04T12:10:20.2556772Z 2025-12-04T12:10:20.2556998Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8441s] [100%] 2025-12-04T12:10:20.2557218Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3620s] [100%] 2025-12-04T12:10:20.2557416Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3370s] [100%] 2025-12-04T12:10:20.2557419Z 2025-12-04T12:10:20.2557488Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2557641Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2557703Z Traceback (most recent call last): 2025-12-04T12:10:20.2557874Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2557931Z method(*args, **kwargs) 2025-12-04T12:10:20.2558096Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2558153Z method(*args, **kwargs) 2025-12-04T12:10:20.2558326Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2558380Z with policy(): 2025-12-04T12:10:20.2558557Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2558616Z raise RuntimeError(msg) 
2025-12-04T12:10:20.2559015Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.2559017Z 2025-12-04T12:10:20.2559107Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2559374Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2559378Z 2025-12-04T12:10:20.2559479Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2559570Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2559646Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2559720Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2559800Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2559915Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2559969Z graph_break [] 2025-12-04T12:10:20.2560044Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2560234Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2560296Z Traceback (most recent call last): 2025-12-04T12:10:20.2560464Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2560523Z method(*args, **kwargs) 2025-12-04T12:10:20.2560688Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.2560747Z method(*args, **kwargs) 2025-12-04T12:10:20.2560914Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2560968Z with policy(): 2025-12-04T12:10:20.2561153Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2561212Z raise RuntimeError(msg) 2025-12-04T12:10:20.2561607Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.2561611Z 2025-12-04T12:10:20.2561700Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2561964Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2561967Z 2025-12-04T12:10:20.2562070Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2562160Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2562218Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2562291Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2562371Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2562497Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2562550Z graph_break [] 2025-12-04T12:10:20.2562625Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2562729Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.2562791Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2562861Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2562973Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2563054Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2563107Z graph_break [] 2025-12-04T12:10:20.2563182Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2563250Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2563403Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2563467Z Traceback (most recent call last): 2025-12-04T12:10:20.2563634Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2563692Z method(*args, **kwargs) 2025-12-04T12:10:20.2563875Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2563931Z method(*args, **kwargs) 2025-12-04T12:10:20.2564095Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2564150Z with policy(): 2025-12-04T12:10:20.2564316Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2564374Z raise RuntimeError(msg) 2025-12-04T12:10:20.2564769Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.2564772Z 2025-12-04T12:10:20.2564861Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2565126Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2565137Z 2025-12-04T12:10:20.2565239Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2565328Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2565386Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2565457Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2565537Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2565651Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2565704Z graph_break [] 2025-12-04T12:10:20.2565780Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2565868Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2565927Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2565996Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2566109Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2566188Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2566242Z graph_break [] 2025-12-04T12:10:20.2566315Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2566404Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2566463Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2566546Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2566657Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2566750Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2566805Z graph_break [] 2025-12-04T12:10:20.2566878Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2567085Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e37773af631fed8c.xml - 2025-12-04T12:10:20.2567161Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2567743Z FAILED [0.3370s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.2567747Z 2025-12-04T12:10:20.2567835Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2568112Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2568114Z 2025-12-04T12:10:20.2568217Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2568294Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.2568378Z ================== 1 failed, 187 deselected, 2 rerun in 2.56s ================== 2025-12-04T12:10:20.2568431Z Got exit code 1 2025-12-04T12:10:20.2568488Z Retrying single test... 
2025-12-04T12:10:20.2568647Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-9878cc2cdda80b3b.xml 2025-12-04T12:10:20.2568721Z ============================= test session starts ============================== 2025-12-04T12:10:20.2568847Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2568906Z cachedir: .pytest_cache 2025-12-04T12:10:20.2569078Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2569153Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2569211Z configfile: pytest.ini 2025-12-04T12:10:20.2569389Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2569478Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.2569739Z stepcurrent: skipping 91 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2569800Z Running 1 items in this shard 2025-12-04T12:10:20.2569802Z 2025-12-04T12:10:20.2570024Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7839s] [100%] 2025-12-04T12:10:20.2570272Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3461s] [100%] 2025-12-04T12:10:20.2570472Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3230s] [100%] 2025-12-04T12:10:20.2570474Z 2025-12-04T12:10:20.2570542Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2570711Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2570776Z Traceback (most recent call last): 2025-12-04T12:10:20.2570963Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2571021Z method(*args, **kwargs) 2025-12-04T12:10:20.2571186Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2571245Z method(*args, **kwargs) 2025-12-04T12:10:20.2571411Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2571465Z with policy(): 2025-12-04T12:10:20.2571630Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2571688Z raise RuntimeError(msg) 
2025-12-04T12:10:20.2572082Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.2572100Z 2025-12-04T12:10:20.2572191Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2572459Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2572461Z 2025-12-04T12:10:20.2572561Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2572651Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2572710Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2572782Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2572863Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2572977Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2573030Z graph_break [] 2025-12-04T12:10:20.2573106Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2573257Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2573319Z Traceback (most recent call last): 2025-12-04T12:10:20.2573499Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2573557Z method(*args, **kwargs) 2025-12-04T12:10:20.2573721Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.2573777Z method(*args, **kwargs) 2025-12-04T12:10:20.2573941Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2573995Z with policy(): 2025-12-04T12:10:20.2574160Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2574218Z raise RuntimeError(msg) 2025-12-04T12:10:20.2574613Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.2574616Z 2025-12-04T12:10:20.2574704Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2574981Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2574983Z 2025-12-04T12:10:20.2575085Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2575190Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2575249Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2575322Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2575402Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2575515Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2575568Z graph_break [] 2025-12-04T12:10:20.2575643Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2575731Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.2575789Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2575860Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2575971Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2576051Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2576104Z graph_break [] 2025-12-04T12:10:20.2576189Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2576258Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2576411Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2576473Z Traceback (most recent call last): 2025-12-04T12:10:20.2576640Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2576698Z method(*args, **kwargs) 2025-12-04T12:10:20.2576862Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2576920Z method(*args, **kwargs) 2025-12-04T12:10:20.2577083Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2577138Z with policy(): 2025-12-04T12:10:20.2577305Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2577362Z raise RuntimeError(msg) 2025-12-04T12:10:20.2577763Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.2577766Z 2025-12-04T12:10:20.2577855Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2578121Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2578124Z 2025-12-04T12:10:20.2578225Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2578315Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2578373Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2578444Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2578524Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2578637Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2578689Z graph_break [] 2025-12-04T12:10:20.2578769Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2578857Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2578917Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2578997Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2579118Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2579198Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2579251Z graph_break [] 2025-12-04T12:10:20.2579324Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2579414Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2579473Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2579542Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2579652Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2579730Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2579783Z graph_break [] 2025-12-04T12:10:20.2579855Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:20.2580062Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-9878cc2cdda80b3b.xml - 2025-12-04T12:10:20.2580164Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2580761Z FAILED [0.3230s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.2580763Z 2025-12-04T12:10:20.2580851Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2581117Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2581120Z 2025-12-04T12:10:20.2581223Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2581304Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.2581387Z ================== 1 failed, 187 deselected, 2 rerun in 2.47s ================== 2025-12-04T12:10:20.2581440Z Got exit code 1 2025-12-04T12:10:20.2581668Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2581810Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.2581968Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-4689e54d3fd483e5.xml 2025-12-04T12:10:20.2582041Z ============================= test session starts ============================== 2025-12-04T12:10:20.2582167Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2582225Z cachedir: .pytest_cache 2025-12-04T12:10:20.2582398Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2582458Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2582515Z configfile: pytest.ini 2025-12-04T12:10:20.2582691Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2582781Z collecting ... collected 188 items / 92 deselected / 96 selected 2025-12-04T12:10:20.2582850Z stepcurrent: skipping 92 already run items. 
2025-12-04T12:10:20.2582910Z Running 96 items in this shard 2025-12-04T12:10:20.2582912Z 2025-12-04T12:10:20.2583153Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8919s] [ 1%] 2025-12-04T12:10:20.2583391Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3699s] [ 1%] 2025-12-04T12:10:20.2583593Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3536s] [ 1%] 2025-12-04T12:10:20.2583596Z 2025-12-04T12:10:20.2583663Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2583818Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2583879Z Traceback (most recent call last): 2025-12-04T12:10:20.2584049Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2584106Z method(*args, **kwargs) 2025-12-04T12:10:20.2584273Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2584330Z method(*args, **kwargs) 2025-12-04T12:10:20.2584507Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2584559Z with policy(): 2025-12-04T12:10:20.2584728Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2584784Z raise RuntimeError(msg) 2025-12-04T12:10:20.2585186Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1111490560. 2025-12-04T12:10:20.2585189Z 2025-12-04T12:10:20.2585278Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2585549Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2585553Z 2025-12-04T12:10:20.2585654Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2585752Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2585812Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2585886Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2585968Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2586080Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2586134Z graph_break [] 2025-12-04T12:10:20.2586212Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2586367Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2586429Z Traceback (most recent call last): 2025-12-04T12:10:20.2586599Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2586656Z method(*args, **kwargs) 2025-12-04T12:10:20.2586823Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2586878Z method(*args, **kwargs) 2025-12-04T12:10:20.2587042Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2587094Z with policy(): 2025-12-04T12:10:20.2587269Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2587326Z raise RuntimeError(msg) 2025-12-04T12:10:20.2587736Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1111490560 and is now 1136656384. 2025-12-04T12:10:20.2587739Z 2025-12-04T12:10:20.2587830Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2588097Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2588099Z 2025-12-04T12:10:20.2588201Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2588290Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2588351Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2588423Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2588504Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2588630Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2588684Z graph_break [] 2025-12-04T12:10:20.2588759Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2588849Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2588907Z frames [('total', 1), ('ok', 1)] 
2025-12-04T12:10:20.2588979Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2589089Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2589169Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2589221Z graph_break [] 2025-12-04T12:10:20.2589299Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2589369Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2589526Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2589588Z Traceback (most recent call last): 2025-12-04T12:10:20.2589756Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2589824Z method(*args, **kwargs) 2025-12-04T12:10:20.2589991Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2590048Z method(*args, **kwargs) 2025-12-04T12:10:20.2590254Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2590308Z with policy(): 2025-12-04T12:10:20.2590476Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2590535Z raise RuntimeError(msg) 2025-12-04T12:10:20.2590928Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 
2025-12-04T12:10:20.2590932Z 2025-12-04T12:10:20.2591022Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2591291Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2591293Z 2025-12-04T12:10:20.2591394Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2591502Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2591574Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2591645Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2591727Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2591839Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2591891Z graph_break [] 2025-12-04T12:10:20.2591968Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2592057Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2592115Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2592185Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2592296Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2592376Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2592429Z graph_break [] 2025-12-04T12:10:20.2592504Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2592592Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2592665Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2592736Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2592847Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2592926Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2592978Z graph_break [] 2025-12-04T12:10:20.2593052Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2593255Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-4689e54d3fd483e5.xml - 2025-12-04T12:10:20.2593333Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2593918Z FAILED [0.3536s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 2025-12-04T12:10:20.2593934Z 2025-12-04T12:10:20.2594024Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2594291Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2594293Z 2025-12-04T12:10:20.2594393Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2594471Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.2594556Z ================== 1 failed, 92 deselected, 2 rerun in 2.64s =================== 2025-12-04T12:10:20.2594612Z Got exit code 1 2025-12-04T12:10:20.2594668Z Retrying single test... 
2025-12-04T12:10:20.2594828Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6f4ca8c340c869d0.xml 2025-12-04T12:10:20.2594901Z ============================= test session starts ============================== 2025-12-04T12:10:20.2595027Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2595084Z cachedir: .pytest_cache 2025-12-04T12:10:20.2595257Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2595317Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2595374Z configfile: pytest.ini 2025-12-04T12:10:20.2595558Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2595659Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.2595922Z stepcurrent: skipping 92 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2595984Z Running 1 items in this shard 2025-12-04T12:10:20.2595987Z 2025-12-04T12:10:20.2596212Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.6095s] [100%] 2025-12-04T12:10:20.2596433Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2897s] [100%] 2025-12-04T12:10:20.2596636Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2275s] [100%] 2025-12-04T12:10:20.2596639Z 2025-12-04T12:10:20.2596705Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2596871Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2596932Z Traceback (most recent call last): 2025-12-04T12:10:20.2597107Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2597164Z method(*args, **kwargs) 2025-12-04T12:10:20.2597332Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2597388Z method(*args, **kwargs) 2025-12-04T12:10:20.2597554Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2597607Z with policy(): 2025-12-04T12:10:20.2597775Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2597833Z raise RuntimeError(msg) 
2025-12-04T12:10:20.2598247Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1111490560. 2025-12-04T12:10:20.2598250Z 2025-12-04T12:10:20.2598339Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2598607Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2598610Z 2025-12-04T12:10:20.2598711Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2598800Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2598860Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2598932Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2599012Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2599126Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2599180Z graph_break [] 2025-12-04T12:10:20.2599255Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2599409Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2599469Z Traceback (most recent call last): 2025-12-04T12:10:20.2599649Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2599706Z method(*args, **kwargs) 2025-12-04T12:10:20.2599886Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.2599942Z method(*args, **kwargs) 2025-12-04T12:10:20.2600136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2600189Z with policy(): 2025-12-04T12:10:20.2600355Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2600414Z raise RuntimeError(msg) 2025-12-04T12:10:20.2600813Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1111490560 and is now 1136656384. 2025-12-04T12:10:20.2600815Z 2025-12-04T12:10:20.2600905Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2601173Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2601194Z 2025-12-04T12:10:20.2601297Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2601385Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2601445Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2601516Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2601597Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2601708Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2601762Z graph_break [] 2025-12-04T12:10:20.2601838Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2601932Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.2601990Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2602061Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2602173Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2602267Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2602320Z graph_break [] 2025-12-04T12:10:20.2602396Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2602466Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2602621Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2602685Z Traceback (most recent call last): 2025-12-04T12:10:20.2602853Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2602912Z method(*args, **kwargs) 2025-12-04T12:10:20.2603079Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2603137Z method(*args, **kwargs) 2025-12-04T12:10:20.2603302Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2603355Z with policy(): 2025-12-04T12:10:20.2603521Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2603578Z raise RuntimeError(msg) 2025-12-04T12:10:20.2603985Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 
2025-12-04T12:10:20.2604002Z 2025-12-04T12:10:20.2604093Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2604365Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2604370Z 2025-12-04T12:10:20.2604472Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2604562Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2604621Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2604693Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2604772Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2604884Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2604937Z graph_break [] 2025-12-04T12:10:20.2605016Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2605116Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2605174Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2605243Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2605356Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2605434Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2605487Z graph_break [] 2025-12-04T12:10:20.2605560Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2605650Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2605707Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2605778Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2605888Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2605968Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2606022Z graph_break [] 2025-12-04T12:10:20.2606099Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2606319Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6f4ca8c340c869d0.xml - 2025-12-04T12:10:20.2606396Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2606978Z FAILED [0.2275s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 2025-12-04T12:10:20.2606982Z 2025-12-04T12:10:20.2607070Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2607338Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2607341Z 2025-12-04T12:10:20.2607441Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2607522Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.2607605Z ================== 1 failed, 187 deselected, 2 rerun in 2.15s ================== 2025-12-04T12:10:20.2607659Z Got exit code 1 2025-12-04T12:10:20.2607716Z Retrying single test... 
2025-12-04T12:10:20.2607886Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-934191f497df0de7.xml 2025-12-04T12:10:20.2607969Z ============================= test session starts ============================== 2025-12-04T12:10:20.2608101Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2608159Z cachedir: .pytest_cache 2025-12-04T12:10:20.2608332Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2608394Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2608451Z configfile: pytest.ini 2025-12-04T12:10:20.2608628Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2608718Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.2608982Z stepcurrent: skipping 92 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2609046Z Running 1 items in this shard 2025-12-04T12:10:20.2609048Z 2025-12-04T12:10:20.2609285Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.6309s] [100%] 2025-12-04T12:10:20.2609508Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3970s] [100%] 2025-12-04T12:10:20.2609708Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3289s] [100%] 2025-12-04T12:10:20.2609711Z 2025-12-04T12:10:20.2609778Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2609933Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2609995Z Traceback (most recent call last): 2025-12-04T12:10:20.2610201Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2610259Z method(*args, **kwargs) 2025-12-04T12:10:20.2610426Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2610502Z method(*args, **kwargs) 2025-12-04T12:10:20.2610670Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2610723Z with policy(): 2025-12-04T12:10:20.2610891Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2610949Z raise RuntimeError(msg) 
2025-12-04T12:10:20.2611346Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1111490560. 2025-12-04T12:10:20.2611354Z 2025-12-04T12:10:20.2611443Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2611711Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2611714Z 2025-12-04T12:10:20.2611815Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2611903Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2611963Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2612047Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2612130Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2612255Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2612310Z graph_break [] 2025-12-04T12:10:20.2612386Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2612542Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2612603Z Traceback (most recent call last): 2025-12-04T12:10:20.2612773Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2612828Z method(*args, **kwargs) 2025-12-04T12:10:20.2612993Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.2613050Z method(*args, **kwargs) 2025-12-04T12:10:20.2613214Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2613270Z with policy(): 2025-12-04T12:10:20.2613435Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2613507Z raise RuntimeError(msg) 2025-12-04T12:10:20.2613902Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1111490560 and is now 1136656384. 2025-12-04T12:10:20.2613904Z 2025-12-04T12:10:20.2613993Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2614260Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2614262Z 2025-12-04T12:10:20.2614367Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2614456Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2614515Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2614586Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2614669Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2614791Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2614845Z graph_break [] 2025-12-04T12:10:20.2614920Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2615012Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.2615073Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2615144Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2615255Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2615334Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2615388Z graph_break [] 2025-12-04T12:10:20.2615462Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2615530Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2615684Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2615750Z Traceback (most recent call last): 2025-12-04T12:10:20.2615917Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2615974Z method(*args, **kwargs) 2025-12-04T12:10:20.2616149Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2616206Z method(*args, **kwargs) 2025-12-04T12:10:20.2616380Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2616435Z with policy(): 2025-12-04T12:10:20.2616601Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2616659Z raise RuntimeError(msg) 2025-12-04T12:10:20.2617057Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 
2025-12-04T12:10:20.2617059Z 2025-12-04T12:10:20.2617150Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2617418Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2617423Z 2025-12-04T12:10:20.2617536Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2617625Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2617683Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2617760Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2617840Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2617953Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2618005Z graph_break [] 2025-12-04T12:10:20.2618081Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2618169Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2618229Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2618299Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2618412Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2618496Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2618549Z graph_break [] 2025-12-04T12:10:20.2618623Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2618723Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2618780Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2618851Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2618961Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2619039Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.2619092Z graph_break [] 2025-12-04T12:10:20.2619167Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:20.2619371Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-934191f497df0de7.xml - 2025-12-04T12:10:20.2619449Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2620033Z FAILED [0.3289s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1161822208. 2025-12-04T12:10:20.2620037Z 2025-12-04T12:10:20.2620156Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2620439Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2620453Z 2025-12-04T12:10:20.2620554Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2620634Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.2620716Z ================== 1 failed, 187 deselected, 2 rerun in 2.38s ================== 2025-12-04T12:10:20.2620771Z Got exit code 1 2025-12-04T12:10:20.2620987Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2621130Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.2621286Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7bf4d3371d463a94.xml 2025-12-04T12:10:20.2621360Z ============================= test session starts ============================== 2025-12-04T12:10:20.2621486Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2621563Z cachedir: .pytest_cache 2025-12-04T12:10:20.2621737Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2621798Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2621856Z configfile: pytest.ini 2025-12-04T12:10:20.2622030Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2622123Z collecting ... collected 188 items / 93 deselected / 95 selected 2025-12-04T12:10:20.2622191Z stepcurrent: skipping 93 already run items. 
2025-12-04T12:10:20.2622251Z Running 95 items in this shard 2025-12-04T12:10:20.2622253Z 2025-12-04T12:10:20.2622478Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0457s] [ 1%] 2025-12-04T12:10:20.2622699Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7692s] [ 1%] 2025-12-04T12:10:20.2622909Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda FAILED [0.6655s] [ 1%] 2025-12-04T12:10:20.2622912Z 2025-12-04T12:10:20.2622980Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2623132Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2623195Z Traceback (most recent call last): 2025-12-04T12:10:20.2623368Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2623427Z method(*args, **kwargs) 2025-12-04T12:10:20.2623593Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2623651Z method(*args, **kwargs) 2025-12-04T12:10:20.2623817Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2623870Z with policy(): 2025-12-04T12:10:20.2624037Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2624094Z raise RuntimeError(msg) 2025-12-04T12:10:20.2624498Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:20.2624501Z 2025-12-04T12:10:20.2624590Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2624868Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2624871Z 2025-12-04T12:10:20.2624972Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2625062Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2625120Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2625194Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2625692Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2625806Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2625873Z graph_break [] 2025-12-04T12:10:20.2625950Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2626040Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2626538Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2626604Z current_size = base.storage().size() 2025-12-04T12:10:20.2626660Z Autotune Choices Stats: 2025-12-04T12:10:20.2627045Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006399999838322401, "best_triton_pos": 0} 2025-12-04T12:10:20.2627119Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2627195Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2627336Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2627591Z triton_mm_4 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2627834Z triton_mm_5 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2628071Z triton_mm_3 0.0065 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2628312Z triton_mm_0 0.0068 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2628550Z triton_mm_1 0.0069 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2628796Z triton_mm_6 0.0069 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2629053Z triton_mm_2 0.0077 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2629289Z triton_mm_7 0.0083 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2629349Z _scaled_mm 0.0260 ms 24.6% 2025-12-04T12:10:20.2629493Z SingleProcess AUTOTUNE benchmarking takes 0.0439 seconds and 0.1931 seconds precompiling for 9 choices 2025-12-04T12:10:20.2629648Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2629708Z Traceback (most recent call last): 2025-12-04T12:10:20.2629882Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2629949Z method(*args, **kwargs) 2025-12-04T12:10:20.2630149Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2630205Z method(*args, **kwargs) 2025-12-04T12:10:20.2630371Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2630425Z with policy(): 2025-12-04T12:10:20.2630594Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2630651Z raise RuntimeError(msg) 2025-12-04T12:10:20.2631049Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:20.2631053Z 2025-12-04T12:10:20.2631142Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2631425Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2631427Z 2025-12-04T12:10:20.2631530Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2631619Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2631678Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2631750Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2632242Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2632358Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), 
('ok', 1)] 2025-12-04T12:10:20.2632412Z graph_break [] 2025-12-04T12:10:20.2632489Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2632577Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2633086Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2633162Z current_size = base.storage().size() 2025-12-04T12:10:20.2633221Z Autotune Choices Stats: 2025-12-04T12:10:20.2633599Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006399999838322401, "best_triton_pos": 0} 2025-12-04T12:10:20.2633673Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2633737Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2633873Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2634120Z triton_mm_4 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2634359Z triton_mm_5 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2634611Z triton_mm_3 0.0065 
ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2634849Z triton_mm_0 0.0068 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2635088Z triton_mm_1 0.0069 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2635321Z triton_mm_6 0.0069 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2635572Z triton_mm_2 0.0077 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2635808Z triton_mm_7 0.0083 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2635865Z _scaled_mm 0.0260 ms 24.6% 2025-12-04T12:10:20.2636010Z SingleProcess AUTOTUNE benchmarking takes 0.0439 seconds and 0.1931 seconds precompiling for 9 choices 2025-12-04T12:10:20.2636099Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2636159Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2636231Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2636347Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2636835Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2636890Z graph_break [] 2025-12-04T12:10:20.2636975Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2637065Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2637139Z Autotune Choices Stats: 2025-12-04T12:10:20.2637510Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:20.2637582Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2637646Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2637781Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2638024Z triton_mm_11 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2638266Z triton_mm_9 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2638514Z 
triton_mm_15 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2638754Z triton_mm_12 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2638992Z triton_mm_8 0.0071 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2639228Z triton_mm_13 0.0076 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2639479Z triton_mm_10 0.0078 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2639715Z triton_mm_14 0.0099 ms 61.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2639773Z _scaled_mm 0.0254 ms 24.1% 2025-12-04T12:10:20.2639920Z SingleProcess AUTOTUNE benchmarking takes 0.0426 seconds and 0.1130 seconds precompiling for 9 choices 2025-12-04T12:10:20.2639990Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2640184Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2640247Z Traceback (most recent call 
last): 2025-12-04T12:10:20.2640418Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2640476Z method(*args, **kwargs) 2025-12-04T12:10:20.2640644Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2640700Z method(*args, **kwargs) 2025-12-04T12:10:20.2640865Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2640919Z with policy(): 2025-12-04T12:10:20.2641101Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2641171Z raise RuntimeError(msg) 2025-12-04T12:10:20.2641570Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.2641574Z 2025-12-04T12:10:20.2641663Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2641931Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2641933Z 2025-12-04T12:10:20.2642034Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2642124Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2642182Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2642256Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2642761Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2642874Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2642927Z graph_break [] 2025-12-04T12:10:20.2643002Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2643091Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2643587Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2643652Z current_size = base.storage().size() 2025-12-04T12:10:20.2643709Z Autotune Choices Stats: 2025-12-04T12:10:20.2644102Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006399999838322401, "best_triton_pos": 0} 2025-12-04T12:10:20.2644175Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2644240Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2644375Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2644622Z triton_mm_4 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2644861Z triton_mm_5 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2645096Z triton_mm_3 0.0065 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2645345Z triton_mm_0 0.0068 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2645594Z triton_mm_1 0.0069 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2645828Z triton_mm_6 0.0069 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2646066Z triton_mm_2 0.0077 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2646300Z triton_mm_7 0.0083 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2646359Z _scaled_mm 0.0260 ms 24.6% 2025-12-04T12:10:20.2646513Z SingleProcess AUTOTUNE benchmarking takes 0.0439 seconds and 0.1931 seconds precompiling for 9 choices 2025-12-04T12:10:20.2646602Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2646661Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2646736Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2646850Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2647343Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2647398Z graph_break [] 2025-12-04T12:10:20.2647474Z 
aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2647562Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2647619Z Autotune Choices Stats: 2025-12-04T12:10:20.2648000Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:20.2648071Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2648135Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2648270Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2648515Z triton_mm_11 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2648753Z triton_mm_9 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2648991Z triton_mm_15 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2649240Z triton_mm_12 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2649488Z triton_mm_8 0.0071 ms 86.4% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2651131Z triton_mm_13 0.0076 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2651379Z triton_mm_10 0.0078 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2651617Z triton_mm_14 0.0099 ms 61.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2651677Z _scaled_mm 0.0254 ms 24.1% 2025-12-04T12:10:20.2651820Z SingleProcess AUTOTUNE benchmarking takes 0.0426 seconds and 0.1130 seconds precompiling for 9 choices 2025-12-04T12:10:20.2651931Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2651989Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2652063Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2652178Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2652673Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2652727Z graph_break [] 
2025-12-04T12:10:20.2652803Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2652893Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2652950Z Autotune Choices Stats: 2025-12-04T12:10:20.2653336Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_22", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:20.2653409Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2653473Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2653611Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2653856Z triton_mm_22 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2654096Z triton_mm_21 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2654335Z triton_mm_20 0.0063 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2654601Z triton_mm_16 0.0071 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2654661Z _scaled_mm 0.0072 ms 84.9% 
2025-12-04T12:10:20.2654913Z triton_mm_18 0.0072 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2655152Z triton_mm_23 0.0073 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2655388Z triton_mm_19 0.0074 ms 81.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2655626Z triton_mm_17 0.0093 ms 65.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2655771Z SingleProcess AUTOTUNE benchmarking takes 0.0582 seconds and 0.2209 seconds precompiling for 9 choices 2025-12-04T12:10:20.2655988Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7bf4d3371d463a94.xml - 2025-12-04T12:10:20.2656066Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2656651Z FAILED [0.6655s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.2656655Z 2025-12-04T12:10:20.2656745Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2657015Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2657018Z 2025-12-04T12:10:20.2657121Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2657211Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.2657295Z ================== 1 failed, 93 deselected, 2 rerun in 3.50s =================== 2025-12-04T12:10:20.2657350Z Got exit code 1 2025-12-04T12:10:20.2657405Z Retrying single test... 2025-12-04T12:10:20.2657565Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-afc2321ffc832d6d.xml 2025-12-04T12:10:20.2657639Z ============================= test session starts ============================== 2025-12-04T12:10:20.2657765Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2657823Z cachedir: .pytest_cache 2025-12-04T12:10:20.2657997Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2658061Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2658119Z configfile: pytest.ini 2025-12-04T12:10:20.2658299Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2658390Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.2658653Z stepcurrent: skipping 93 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2658713Z Running 1 items in this shard 2025-12-04T12:10:20.2658725Z 2025-12-04T12:10:20.2658952Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.2854s] [100%] 2025-12-04T12:10:20.2659182Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8195s] [100%] 2025-12-04T12:10:20.2659383Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda FAILED [0.8314s] [100%] 2025-12-04T12:10:20.2659385Z 2025-12-04T12:10:20.2659453Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2659609Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2659670Z Traceback (most recent call last): 2025-12-04T12:10:20.2659847Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2659905Z method(*args, **kwargs) 2025-12-04T12:10:20.2660072Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2660167Z method(*args, **kwargs) 2025-12-04T12:10:20.2660332Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2660387Z with policy(): 2025-12-04T12:10:20.2660554Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2660612Z raise RuntimeError(msg) 
2025-12-04T12:10:20.2661008Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:20.2661012Z 2025-12-04T12:10:20.2661102Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2661370Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2661372Z 2025-12-04T12:10:20.2661490Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2661579Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2661639Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2661711Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2662204Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2662320Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2662373Z graph_break [] 2025-12-04T12:10:20.2662448Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2662537Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2663035Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2663111Z current_size = base.storage().size() 2025-12-04T12:10:20.2663169Z Autotune Choices Stats: 2025-12-04T12:10:20.2663562Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039000116288662, "best_triton_pos": 0} 2025-12-04T12:10:20.2663637Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2663701Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2663837Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2664083Z triton_mm_6 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2664321Z triton_mm_7 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2664573Z triton_mm_3 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2664808Z triton_mm_5 0.0068 ms 88.8% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2665048Z triton_mm_0 0.0070 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2665284Z triton_mm_4 0.0090 ms 67.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2665524Z triton_mm_2 0.0104 ms 58.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2665770Z triton_mm_1 0.0109 ms 55.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2665827Z _scaled_mm 0.0265 ms 22.8% 2025-12-04T12:10:20.2665971Z SingleProcess AUTOTUNE benchmarking takes 0.0488 seconds and 0.2055 seconds precompiling for 9 choices 2025-12-04T12:10:20.2666125Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2666187Z Traceback (most recent call last): 2025-12-04T12:10:20.2666356Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2666414Z method(*args, **kwargs) 2025-12-04T12:10:20.2666580Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2666637Z 
method(*args, **kwargs) 2025-12-04T12:10:20.2666802Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2666857Z with policy(): 2025-12-04T12:10:20.2667022Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2667080Z raise RuntimeError(msg) 2025-12-04T12:10:20.2667484Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:20.2667498Z 2025-12-04T12:10:20.2667588Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2667857Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2667859Z 2025-12-04T12:10:20.2667962Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2668051Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2668110Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2668184Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2668676Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2668805Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2668858Z graph_break [] 2025-12-04T12:10:20.2668937Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2669025Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2669520Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2669585Z current_size = base.storage().size() 2025-12-04T12:10:20.2669642Z Autotune Choices Stats: 2025-12-04T12:10:20.2670034Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039000116288662, "best_triton_pos": 0} 2025-12-04T12:10:20.2670135Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2670200Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2670333Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2670578Z triton_mm_6 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2670819Z triton_mm_7 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=2 2025-12-04T12:10:20.2671055Z triton_mm_3 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2671291Z triton_mm_5 0.0068 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2671541Z triton_mm_0 0.0070 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2671792Z triton_mm_4 0.0090 ms 67.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2672029Z triton_mm_2 0.0104 ms 58.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2672266Z triton_mm_1 0.0109 ms 55.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2672324Z _scaled_mm 0.0265 ms 22.8% 2025-12-04T12:10:20.2672467Z SingleProcess AUTOTUNE benchmarking takes 0.0488 seconds and 0.2055 seconds precompiling for 9 choices 2025-12-04T12:10:20.2672557Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2672629Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2672702Z stats [('calls_captured', 1), ('unique_graphs', 1)] 
2025-12-04T12:10:20.2672816Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2673307Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2673360Z graph_break [] 2025-12-04T12:10:20.2673437Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2673525Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2673582Z Autotune Choices Stats: 2025-12-04T12:10:20.2673968Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:20.2674040Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2674106Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2674241Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2674488Z triton_mm_12 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2674726Z triton_mm_8 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2674965Z triton_mm_9 0.0063 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2675200Z triton_mm_13 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2675445Z triton_mm_14 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2675696Z triton_mm_10 0.0067 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2675934Z triton_mm_11 0.0067 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2676172Z triton_mm_15 0.0069 ms 87.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2676228Z _scaled_mm 0.0078 ms 76.5% 2025-12-04T12:10:20.2676370Z SingleProcess AUTOTUNE benchmarking takes 0.0608 seconds and 0.2226 seconds precompiling for 9 choices 2025-12-04T12:10:20.2676439Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2676605Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2676667Z Traceback (most recent call last): 2025-12-04T12:10:20.2676840Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2676896Z method(*args, **kwargs) 2025-12-04T12:10:20.2677063Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2677120Z method(*args, **kwargs) 2025-12-04T12:10:20.2677284Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2677340Z with policy(): 2025-12-04T12:10:20.2677507Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2677566Z raise RuntimeError(msg) 2025-12-04T12:10:20.2677977Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.2677980Z 2025-12-04T12:10:20.2678071Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2678338Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2678340Z 2025-12-04T12:10:20.2678444Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2678532Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2678592Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2678666Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2679160Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2679274Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2679327Z graph_break [] 2025-12-04T12:10:20.2679404Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2679502Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2680000Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2680074Z current_size = base.storage().size() 2025-12-04T12:10:20.2680172Z Autotune Choices Stats: 2025-12-04T12:10:20.2680548Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039000116288662, "best_triton_pos": 0} 2025-12-04T12:10:20.2680621Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2680684Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2680821Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2681082Z triton_mm_6 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2681321Z triton_mm_7 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2681560Z triton_mm_3 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2681796Z triton_mm_5 0.0068 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2682037Z triton_mm_0 0.0070 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2682289Z triton_mm_4 0.0090 ms 67.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2682526Z triton_mm_2 0.0104 ms 58.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2682765Z triton_mm_1 0.0109 ms 55.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2682823Z _scaled_mm 0.0265 ms 22.8% 2025-12-04T12:10:20.2682966Z SingleProcess AUTOTUNE benchmarking takes 0.0488 seconds and 0.2055 seconds precompiling for 9 choices 2025-12-04T12:10:20.2683054Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2683113Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2683186Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2683301Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2683802Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2683871Z graph_break [] 2025-12-04T12:10:20.2683948Z 
aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2684035Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2684093Z Autotune Choices Stats: 2025-12-04T12:10:20.2684466Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:20.2684537Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2684601Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2684736Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2684983Z triton_mm_12 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2685243Z triton_mm_8 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2685481Z triton_mm_9 0.0063 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2685719Z triton_mm_13 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2685956Z triton_mm_14 0.0066 ms 90.9% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2686207Z triton_mm_10 0.0067 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2686444Z triton_mm_11 0.0067 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2686681Z triton_mm_15 0.0069 ms 87.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2686739Z _scaled_mm 0.0078 ms 76.5% 2025-12-04T12:10:20.2686883Z SingleProcess AUTOTUNE benchmarking takes 0.0608 seconds and 0.2226 seconds precompiling for 9 choices 2025-12-04T12:10:20.2686971Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2687030Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2687102Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2687217Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2687712Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2687777Z graph_break [] 
2025-12-04T12:10:20.2687852Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2687942Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2687999Z Autotune Choices Stats: 2025-12-04T12:10:20.2688376Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_20", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:20.2688447Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2688511Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2688648Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2688896Z triton_mm_20 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2689146Z triton_mm_21 0.0071 ms 96.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2689384Z triton_mm_17 0.0075 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2689622Z triton_mm_16 0.0088 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2689858Z triton_mm_19 0.0088 ms 77.8% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2690142Z triton_mm_23 0.0091 ms 75.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2690380Z triton_mm_22 0.0096 ms 71.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2690621Z triton_mm_18 0.0104 ms 66.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2690679Z _scaled_mm 0.0287 ms 24.0% 2025-12-04T12:10:20.2690820Z SingleProcess AUTOTUNE benchmarking takes 0.0683 seconds and 0.2192 seconds precompiling for 9 choices 2025-12-04T12:10:20.2691025Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-afc2321ffc832d6d.xml - 2025-12-04T12:10:20.2691102Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2691697Z FAILED [0.8314s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.2691699Z 2025-12-04T12:10:20.2691789Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2692069Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2692073Z 2025-12-04T12:10:20.2692175Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2692254Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.2692340Z ================== 1 failed, 187 deselected, 2 rerun in 3.96s ================== 2025-12-04T12:10:20.2692394Z Got exit code 1 2025-12-04T12:10:20.2692450Z Retrying single test... 2025-12-04T12:10:20.2692606Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-47300065430d7107.xml 2025-12-04T12:10:20.2692680Z ============================= test session starts ============================== 2025-12-04T12:10:20.2692806Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2692864Z cachedir: .pytest_cache 2025-12-04T12:10:20.2693052Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2693116Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2693172Z configfile: pytest.ini 2025-12-04T12:10:20.2693352Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2693442Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.2693704Z stepcurrent: skipping 93 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2693764Z Running 1 items in this shard 2025-12-04T12:10:20.2693766Z 2025-12-04T12:10:20.2693989Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.2313s] [100%] 2025-12-04T12:10:20.2694213Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8601s] [100%] 2025-12-04T12:10:20.2694420Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda FAILED [0.7655s] [100%] 2025-12-04T12:10:20.2694423Z 2025-12-04T12:10:20.2694492Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2694644Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2694707Z Traceback (most recent call last): 2025-12-04T12:10:20.2694879Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2694939Z method(*args, **kwargs) 2025-12-04T12:10:20.2695105Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2695163Z method(*args, **kwargs) 2025-12-04T12:10:20.2695327Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2695382Z with policy(): 2025-12-04T12:10:20.2695548Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2695606Z raise RuntimeError(msg) 
2025-12-04T12:10:20.2696013Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:20.2696027Z 2025-12-04T12:10:20.2696117Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2696386Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2696388Z 2025-12-04T12:10:20.2696490Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2696580Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2696638Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2696711Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2697204Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2697328Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2697381Z graph_break [] 2025-12-04T12:10:20.2697457Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2697546Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2698043Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2698107Z current_size = base.storage().size() 2025-12-04T12:10:20.2698165Z Autotune Choices Stats: 2025-12-04T12:10:20.2698554Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:20.2698627Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2698691Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2698825Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2699074Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2699313Z triton_mm_6 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2699554Z triton_mm_2 0.0067 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2699793Z triton_mm_3 0.0067 ms 89.9% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2700038Z triton_mm_1 0.0068 ms 88.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2700377Z triton_mm_7 0.0074 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2700639Z triton_mm_5 0.0090 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2700876Z triton_mm_0 0.0090 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2700934Z _scaled_mm 0.0245 ms 24.6% 2025-12-04T12:10:20.2701075Z SingleProcess AUTOTUNE benchmarking takes 0.0436 seconds and 0.2008 seconds precompiling for 9 choices 2025-12-04T12:10:20.2701231Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2701293Z Traceback (most recent call last): 2025-12-04T12:10:20.2701478Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2701535Z method(*args, **kwargs) 2025-12-04T12:10:20.2701704Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2701761Z 
method(*args, **kwargs) 2025-12-04T12:10:20.2701926Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2701979Z with policy(): 2025-12-04T12:10:20.2702145Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2702203Z raise RuntimeError(msg) 2025-12-04T12:10:20.2702599Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:20.2702604Z 2025-12-04T12:10:20.2702694Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2702975Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2702977Z 2025-12-04T12:10:20.2703081Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2703170Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2703229Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2703303Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2703796Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2703911Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2703965Z graph_break [] 2025-12-04T12:10:20.2704041Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2704130Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2704637Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2704712Z current_size = base.storage().size() 2025-12-04T12:10:20.2704769Z Autotune Choices Stats: 2025-12-04T12:10:20.2705146Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:20.2705219Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2705282Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2705418Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2705663Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2705914Z triton_mm_6 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:20.2706153Z triton_mm_2 0.0067 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2706389Z triton_mm_3 0.0067 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2706626Z triton_mm_1 0.0068 ms 88.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2706872Z triton_mm_7 0.0074 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2707114Z triton_mm_5 0.0090 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2707356Z triton_mm_0 0.0090 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2707413Z _scaled_mm 0.0245 ms 24.6% 2025-12-04T12:10:20.2707556Z SingleProcess AUTOTUNE benchmarking takes 0.0436 seconds and 0.2008 seconds precompiling for 9 choices 2025-12-04T12:10:20.2707647Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2707706Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2707778Z stats [('calls_captured', 1), ('unique_graphs', 1)] 
2025-12-04T12:10:20.2707894Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2708392Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2708447Z graph_break [] 2025-12-04T12:10:20.2708521Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2708620Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2708678Z Autotune Choices Stats: 2025-12-04T12:10:20.2709057Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:20.2709129Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2709192Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2709327Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2709571Z triton_mm_9 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2709914Z triton_mm_13 0.0064 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2710190Z triton_mm_15 0.0065 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2710427Z triton_mm_14 0.0074 ms 82.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2710664Z triton_mm_11 0.0088 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2710906Z triton_mm_8 0.0089 ms 68.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2711163Z triton_mm_12 0.0092 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2711401Z triton_mm_10 0.0096 ms 63.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2711461Z _scaled_mm 0.0246 ms 24.7% 2025-12-04T12:10:20.2711602Z SingleProcess AUTOTUNE benchmarking takes 0.0416 seconds and 0.0971 seconds precompiling for 9 choices 2025-12-04T12:10:20.2711673Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2711829Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2711891Z Traceback (most recent call last): 2025-12-04T12:10:20.2712063Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2712121Z method(*args, **kwargs) 2025-12-04T12:10:20.2712287Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2712345Z method(*args, **kwargs) 2025-12-04T12:10:20.2712525Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2712579Z with policy(): 2025-12-04T12:10:20.2712745Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2712815Z raise RuntimeError(msg) 2025-12-04T12:10:20.2713211Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.2713214Z 2025-12-04T12:10:20.2713303Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2713571Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2713573Z 2025-12-04T12:10:20.2713676Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2713767Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2713826Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2713916Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2714407Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2714520Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2714575Z graph_break [] 2025-12-04T12:10:20.2714650Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2714739Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2715234Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2715309Z current_size = base.storage().size() 2025-12-04T12:10:20.2715366Z Autotune Choices Stats: 2025-12-04T12:10:20.2715742Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:20.2715815Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2715881Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2716018Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2716269Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2716511Z triton_mm_6 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2716765Z triton_mm_2 0.0067 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2717001Z triton_mm_3 0.0067 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2717251Z triton_mm_1 0.0068 ms 88.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2717485Z triton_mm_7 0.0074 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2717721Z triton_mm_5 0.0090 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2717957Z triton_mm_0 0.0090 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2718027Z _scaled_mm 0.0245 ms 24.6% 2025-12-04T12:10:20.2718169Z SingleProcess AUTOTUNE benchmarking takes 0.0436 seconds and 0.2008 seconds precompiling for 9 choices 2025-12-04T12:10:20.2718262Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2718321Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2718394Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2718508Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2718997Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2719053Z graph_break [] 2025-12-04T12:10:20.2719127Z 
aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2719216Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2719282Z Autotune Choices Stats: 2025-12-04T12:10:20.2719656Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:20.2719727Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2719792Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2719927Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2720219Z triton_mm_9 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2720458Z triton_mm_13 0.0064 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2720695Z triton_mm_15 0.0065 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2720943Z triton_mm_14 0.0074 ms 82.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2721193Z triton_mm_11 0.0088 ms 69.4% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2721434Z triton_mm_8 0.0089 ms 68.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2721673Z triton_mm_12 0.0092 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2721917Z triton_mm_10 0.0096 ms 63.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2721988Z _scaled_mm 0.0246 ms 24.7% 2025-12-04T12:10:20.2722130Z SingleProcess AUTOTUNE benchmarking takes 0.0416 seconds and 0.0971 seconds precompiling for 9 choices 2025-12-04T12:10:20.2722219Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2722278Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2722351Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2722464Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2722954Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2723007Z graph_break [] 
2025-12-04T12:10:20.2723084Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:20.2723172Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2723230Z Autotune Choices Stats: 2025-12-04T12:10:20.2723613Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_19", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:20.2723685Z AUTOTUNE scaled_mm(257x32, 32x16, 257x1, 1x16, 16) 2025-12-04T12:10:20.2723748Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2723885Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2724127Z triton_mm_19 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2724368Z triton_mm_16 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2724604Z triton_mm_22 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.2724849Z triton_mm_21 0.0061 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2725099Z triton_mm_17 0.0066 ms 89.1% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.2725340Z triton_mm_23 0.0072 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2725578Z triton_mm_20 0.0072 ms 81.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2725816Z triton_mm_18 0.0072 ms 81.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2725873Z _scaled_mm 0.0249 ms 23.6% 2025-12-04T12:10:20.2726014Z SingleProcess AUTOTUNE benchmarking takes 0.0590 seconds and 0.2302 seconds precompiling for 9 choices 2025-12-04T12:10:20.2726226Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-47300065430d7107.xml - 2025-12-04T12:10:20.2726303Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2726889Z FAILED [0.7655s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.2726893Z 2025-12-04T12:10:20.2726982Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2727251Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2727254Z 2025-12-04T12:10:20.2727364Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2727444Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.2727527Z ================== 1 failed, 187 deselected, 2 rerun in 3.88s ================== 2025-12-04T12:10:20.2727581Z Got exit code 1 2025-12-04T12:10:20.2727797Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.2727941Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.2728100Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-26a8aab9f492a524.xml 2025-12-04T12:10:20.2728176Z ============================= test session starts ============================== 2025-12-04T12:10:20.2728301Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2728360Z cachedir: .pytest_cache 2025-12-04T12:10:20.2728532Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2728595Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2728652Z configfile: pytest.ini 2025-12-04T12:10:20.2728829Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2728929Z collecting ... 
collected 188 items / 94 deselected / 94 selected 2025-12-04T12:10:20.2729008Z stepcurrent: skipping 94 already run items. 2025-12-04T12:10:20.2729069Z Running 94 items in this shard 2025-12-04T12:10:20.2729073Z 2025-12-04T12:10:20.2729985Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmphn_t6jsv/tu/ctuiakwd5fs2iewewo4itmqf6jmngoxcdpr25nxf2ehguavpxtfw.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.2730194Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.2730427Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.2730614Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.2730919Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.2731067Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.2731340Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.2731494Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.2731765Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.2731952Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.2732235Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.2732384Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.2732673Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.2732884Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.2733215Z E1204 10:59:53.775000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.2733964Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmphn_t6jsv/nt/cnthrkjyzzqfuyg3rk6niqcu4372t5zn3jgledf3fowjbtupkffa.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.2734139Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.2734368Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.2734537Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.2734837Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.2734984Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.2735263Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.2735416Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.2735682Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in 
precompile 2025-12-04T12:10:20.2735852Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.2736133Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.2736281Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.2736580Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.2736786Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.2737120Z E1204 10:59:53.780000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.2737855Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmphn_t6jsv/qz/cqzbnpukywursu25rzqmtxnthlc3y2f27hrhlk2lndnhyjye5plw.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.2738015Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.2738254Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.2738434Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.2738734Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.2738880Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.2739147Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.2739300Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.2739566Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.2739748Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.2740029Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.2740199Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.2740488Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.2740693Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.2741037Z E1204 10:59:53.782000 706508 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.2741105Z ('RERUN', {'yellow': True}) [2.7493s] [ 1%] 2025-12-04T12:10:20.2741435Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda E1204 10:59:55.115000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2741743Z E1204 10:59:55.115000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.2741887Z E1204 10:59:55.115000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.2742045Z E1204 10:59:55.117000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2742351Z E1204 10:59:55.117000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.2742493Z E1204 10:59:55.117000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.2742662Z E1204 10:59:55.119000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2742982Z E1204 10:59:55.119000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.2743123Z E1204 10:59:55.119000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.2743191Z ('RERUN', {'yellow': True}) [1.1923s] [ 1%] 2025-12-04T12:10:20.2743516Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda E1204 10:59:56.127000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2743823Z E1204 10:59:56.127000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.2743965Z E1204 10:59:56.127000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.2744135Z E1204 10:59:56.129000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2744442Z E1204 10:59:56.129000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.2744580Z E1204 10:59:56.129000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.2744737Z E1204 10:59:56.131000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2745044Z E1204 10:59:56.131000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.2745185Z E1204 10:59:56.131000 706508 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.2745240Z FAILED [1.0086s] [ 1%] 2025-12-04T12:10:20.2745242Z 2025-12-04T12:10:20.2745324Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.2745481Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2745543Z Traceback (most recent call last): 2025-12-04T12:10:20.2745713Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2745771Z method(*args, **kwargs) 2025-12-04T12:10:20.2745937Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2745995Z method(*args, **kwargs) 2025-12-04T12:10:20.2746160Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2746216Z with policy(): 2025-12-04T12:10:20.2746382Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2746439Z raise RuntimeError(msg) 2025-12-04T12:10:20.2746842Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1075838976. 
2025-12-04T12:10:20.2746844Z 2025-12-04T12:10:20.2746945Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2747218Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2747231Z 2025-12-04T12:10:20.2747338Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2747428Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2747488Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2747563Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2748134Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2748249Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2748322Z graph_break [] 2025-12-04T12:10:20.2748402Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.2748492Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2748989Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2749054Z current_size = base.storage().size() 2025-12-04T12:10:20.2749112Z Autotune Choices Stats: 2025-12-04T12:10:20.2749494Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.2749579Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2749643Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2749790Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2750038Z triton_mm_13 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2750311Z triton_mm_9 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2750551Z triton_mm_17 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2750792Z triton_mm_10 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2751032Z triton_mm_11 0.0065 ms 95.7% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2751282Z triton_mm_14 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2751534Z triton_mm_18 0.0068 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2751774Z triton_mm_12 0.0069 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2752011Z triton_mm_15 0.0072 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2752070Z _scaled_mm 0.0086 ms 72.1% 2025-12-04T12:10:20.2752215Z SingleProcess AUTOTUNE benchmarking takes 0.0769 seconds and 0.6728 seconds precompiling for 18 choices 2025-12-04T12:10:20.2752372Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2752446Z Traceback (most recent call last): 2025-12-04T12:10:20.2752618Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2752675Z method(*args, **kwargs) 2025-12-04T12:10:20.2752843Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2752898Z method(*args, 
**kwargs) 2025-12-04T12:10:20.2753067Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2753120Z with policy(): 2025-12-04T12:10:20.2753288Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2753345Z raise RuntimeError(msg) 2025-12-04T12:10:20.2753747Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1145044992. 2025-12-04T12:10:20.2753750Z 2025-12-04T12:10:20.2753857Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2754129Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2754131Z 2025-12-04T12:10:20.2754234Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2754325Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2754385Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2754459Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2755026Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 
2025-12-04T12:10:20.2755139Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2755193Z graph_break [] 2025-12-04T12:10:20.2755271Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.2755372Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2755869Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2755944Z current_size = base.storage().size() 2025-12-04T12:10:20.2756003Z Autotune Choices Stats: 2025-12-04T12:10:20.2756381Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.2756462Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2756527Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2756663Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2756927Z triton_mm_13 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2757168Z triton_mm_9 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2757407Z triton_mm_17 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2757645Z triton_mm_10 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2757887Z triton_mm_11 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2758135Z triton_mm_14 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2758371Z triton_mm_18 0.0068 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2758609Z triton_mm_12 0.0069 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2758847Z triton_mm_15 0.0072 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2758905Z _scaled_mm 0.0086 ms 72.1% 2025-12-04T12:10:20.2759050Z SingleProcess AUTOTUNE benchmarking takes 0.0769 
seconds and 0.6728 seconds precompiling for 18 choices 2025-12-04T12:10:20.2759140Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2759199Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2759272Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2759396Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2759896Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2759960Z graph_break [] 2025-12-04T12:10:20.2760039Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.2760154Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2760212Z Autotune Choices Stats: 2025-12-04T12:10:20.2760583Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.2760663Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2760727Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2760878Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2761123Z triton_mm_33 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2761360Z triton_mm_29 0.0065 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2761600Z triton_mm_31 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2761838Z triton_mm_34 0.0066 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2762088Z triton_mm_37 0.0067 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2762324Z triton_mm_38 0.0067 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2762560Z triton_mm_28 0.0069 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2762798Z triton_mm_35 0.0071 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2763038Z triton_mm_32 0.0072 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2763277Z triton_mm_27 0.0074 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2763433Z SingleProcess AUTOTUNE benchmarking takes 0.1153 seconds and 0.5087 seconds precompiling for 21 choices 2025-12-04T12:10:20.2763502Z =================================== FAILURES =================================== 2025-12-04T12:10:20.2763676Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.2763739Z Traceback (most recent call last): 2025-12-04T12:10:20.2763911Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2763969Z method(*args, **kwargs) 2025-12-04T12:10:20.2764136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.2764194Z method(*args, **kwargs) 2025-12-04T12:10:20.2764358Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.2764412Z with policy(): 2025-12-04T12:10:20.2764579Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.2764636Z raise RuntimeError(msg) 2025-12-04T12:10:20.2765035Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. 
CUDA driver allocated memory was 1145044992 and is now 1214251008. 2025-12-04T12:10:20.2765049Z 2025-12-04T12:10:20.2765141Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2765416Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2765418Z 2025-12-04T12:10:20.2765520Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2765609Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2765669Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2765742Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2766315Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2766429Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2766484Z graph_break [] 2025-12-04T12:10:20.2766561Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.2766652Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2767147Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.2767213Z current_size = base.storage().size() 2025-12-04T12:10:20.2767271Z Autotune Choices Stats: 2025-12-04T12:10:20.2767648Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.2767739Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2767804Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2767951Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2768196Z triton_mm_13 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2768436Z triton_mm_9 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2768673Z triton_mm_17 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2768911Z triton_mm_10 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2769164Z triton_mm_11 0.0065 ms 95.7% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2769400Z triton_mm_14 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2769637Z triton_mm_18 0.0068 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2769875Z triton_mm_12 0.0069 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2770144Z triton_mm_15 0.0072 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2770218Z _scaled_mm 0.0086 ms 72.1% 2025-12-04T12:10:20.2770362Z SingleProcess AUTOTUNE benchmarking takes 0.0769 seconds and 0.6728 seconds precompiling for 18 choices 2025-12-04T12:10:20.2770452Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.2770510Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2770583Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2770697Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2771198Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), 
('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2771253Z graph_break [] 2025-12-04T12:10:20.2771332Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.2771421Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2771479Z Autotune Choices Stats: 2025-12-04T12:10:20.2771862Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.2771953Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2772018Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2772153Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2772396Z triton_mm_33 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2772633Z triton_mm_29 0.0065 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2772875Z triton_mm_31 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2773113Z 
triton_mm_34 0.0066 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2773364Z triton_mm_37 0.0067 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2773600Z triton_mm_38 0.0067 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2773835Z triton_mm_28 0.0069 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2774074Z triton_mm_35 0.0071 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2774320Z triton_mm_32 0.0072 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2774563Z triton_mm_27 0.0074 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2774707Z SingleProcess AUTOTUNE benchmarking takes 0.1153 seconds and 0.5087 seconds precompiling for 21 choices 2025-12-04T12:10:20.2774796Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:20.2774855Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.2774928Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.2775044Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.2775537Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.2775592Z graph_break [] 2025-12-04T12:10:20.2775669Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.2775767Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.2775838Z Autotune Choices Stats: 2025-12-04T12:10:20.2776213Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_51", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006320000160485506, "best_triton_pos": 0} 2025-12-04T12:10:20.2776291Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.2776356Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.2776491Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.2776736Z triton_mm_51 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2776974Z 
triton_mm_53 0.0064 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2777224Z triton_mm_50 0.0067 ms 94.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2777459Z triton_mm_48 0.0070 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2777696Z triton_mm_52 0.0070 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2777936Z triton_mm_57 0.0074 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2778191Z triton_mm_49 0.0077 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.2778427Z triton_mm_58 0.0079 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.2778664Z triton_mm_54 0.0083 ms 76.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=2 2025-12-04T12:10:20.2778722Z _scaled_mm 0.0088 ms 71.8% 2025-12-04T12:10:20.2778864Z SingleProcess AUTOTUNE benchmarking takes 0.1393 seconds and 0.3606 seconds precompiling for 21 choices 2025-12-04T12:10:20.2779070Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-26a8aab9f492a524.xml - 2025-12-04T12:10:20.2779147Z =========================== short test summary info ============================ 2025-12-04T12:10:20.2779991Z FAILED [1.0086s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 2025-12-04T12:10:20.2779995Z 2025-12-04T12:10:20.2780085Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.2780412Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2780416Z 2025-12-04T12:10:20.2780518Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.2780598Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.2780682Z ================== 1 failed, 94 deselected, 2 rerun in 4.97s =================== 2025-12-04T12:10:20.2780738Z Got exit code 1 2025-12-04T12:10:20.2780796Z Retrying single test... 
2025-12-04T12:10:20.2780955Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-347d670f489dcd3b.xml 2025-12-04T12:10:20.2781029Z ============================= test session starts ============================== 2025-12-04T12:10:20.2781157Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.2781215Z cachedir: .pytest_cache 2025-12-04T12:10:20.2781404Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.2781468Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.2781524Z configfile: pytest.ini 2025-12-04T12:10:20.2781704Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.2781795Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.2782061Z stepcurrent: skipping 94 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.2782120Z Running 1 items in this shard 2025-12-04T12:10:20.2782122Z 2025-12-04T12:10:20.2782462Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:00:05.070355792 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.2782467Z 2025-12-04T12:10:20.2782650Z [W1204 11:00:13.463759814 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.2782652Z 2025-12-04T12:10:20.2782980Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.2783288Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2783435Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2783935Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2784203Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2784454Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2784689Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2784906Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2785149Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2785384Z E1204 11:00:12.984000 712024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2785627Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2785871Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2786115Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2786347Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2786587Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2786823Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2787063Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2787305Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2787545Z E1204 11:00:12.984000 712024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2787778Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2788019Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2788252Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2788456Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2788689Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2788939Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2789181Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2789385Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2789617Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2789858Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2790134Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2790387Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2790619Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2790836Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.2791061Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.2791237Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.2791431Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.2791983Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpel7d9wiq/tu/ctuiakwd5fs2iewewo4itmqf6jmngoxcdpr25nxf2ehguavpxtfw.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.2792146Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.2792375Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.2792546Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.2792848Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.2792997Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.2793278Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.2793433Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.2793712Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.2793884Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.2794167Z E1204 11:00:12.984000 712024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.2794316Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.2794605Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.2794824Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.2795158Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.2795462Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2795608Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2796116Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2796383Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2796624Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2796844Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2797061Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2797303Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2797536Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2797786Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2798018Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2798271Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2798503Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2798744Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2798976Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2799218Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2799464Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2799703Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2799935Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2800216Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2800451Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2800669Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2800900Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2801142Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2801372Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2801577Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2801809Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2802050Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2802294Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2802549Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2802783Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2802998Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.2803225Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.2803399Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.2803593Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.2803724Z E1204 11:00:12.984000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.2804047Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.2804356Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2804500Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2804989Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2805264Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2805503Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2805723Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2806044Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2806287Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2806522Z E1204 11:00:13.004000 712024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2806773Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2807006Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2807259Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2807493Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2807732Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2807966Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2808207Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2808454Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2808693Z E1204 11:00:13.004000 712024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2808925Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2809170Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2809402Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2809619Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2809851Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2810127Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2810361Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2810567Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2810800Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2811039Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2811286Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2811539Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2811772Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2811988Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.2812210Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.2812385Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.2812580Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.2813131Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpel7d9wiq/qz/cqzbnpukywursu25rzqmtxnthlc3y2f27hrhlk2lndnhyjye5plw.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.2813292Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.2813519Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.2813690Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.2813990Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.2814150Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.2814420Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.2814576Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.2814844Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.2815015Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.2815298Z E1204 11:00:13.004000 712024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.2815445Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.2815748Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.2815954Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.2816291Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.2816599Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2816743Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2817232Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2817510Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2817750Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2817970Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2818184Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2818428Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2818674Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2818917Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2819151Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2819392Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2819628Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2819868Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2820131Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2820385Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2820633Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2820873Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2821107Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2821348Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2821579Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2821796Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2822028Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2822269Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2822501Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2822706Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2822940Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2823190Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2823422Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2823667Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2823899Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2824117Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.2824341Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.2824514Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.2824716Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.2824843Z E1204 11:00:13.004000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.2825015Z [W1204 11:00:13.468914594 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.2825018Z 2025-12-04T12:10:20.2825341Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.2825643Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2825790Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2826280Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2826556Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2826795Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2827013Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2827229Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2827480Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2827715Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2827957Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2828189Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2828432Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2828664Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2828904Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2829146Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2829401Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2829635Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2829874Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2830139Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2830382Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2830633Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2830837Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2831070Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2831312Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2831543Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2831747Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2831990Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2832232Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2832465Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2832707Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2832941Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2833155Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.2833380Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.2833563Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.2833766Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.2834302Z E1204 11:00:13.008000 712024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpel7d9wiq/nt/cnthrkjyzzqfuyg3rk6niqcu4372t5zn3jgledf3fowjbtupkffa.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.2834462Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.2834691Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.2834859Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.2835170Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.2835317Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.2835586Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.2835738Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.2836005Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.2836176Z E1204 11:00:13.008000 712024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.2836466Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.2836616Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.2836906Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.2837113Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.2837445Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.2837750Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2837894Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2838397Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2838675Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2838917Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2839137Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2839353Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2839603Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2839842Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2840085Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2840350Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2840591Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2840838Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2841080Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2841311Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2841552Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2841786Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2842028Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2842260Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2842512Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2842744Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2842961Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2843194Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2843434Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2843665Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2843868Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2844114Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2844357Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2844588Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2844831Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2845064Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2845289Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.2845512Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.2845684Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.2845879Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.2846062Z E1204 11:00:13.008000 712024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.2846133Z ('RERUN', {'yellow': True}) [10.1088s] [100%] 2025-12-04T12:10:20.2846475Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:00:14.810057388 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.2846479Z 2025-12-04T12:10:20.2846638Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2846964Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.2847270Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2847425Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2847911Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2848178Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2848417Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2848654Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] 
#7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2848869Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2849110Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2849345Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2849588Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2849829Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2850071Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2850347Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2850589Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2850822Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2851065Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2851297Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2851521Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2851760Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2851974Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2852218Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2852450Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2852656Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2852891Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2853147Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2853382Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2853584Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2853817Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2854029Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2854243Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2854476Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2854685Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2854890Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2855123Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2855365Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2855596Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2855846Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2856082Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2856303Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2856526Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2856740Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2856983Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2857215Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2857467Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2857699Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2857938Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2858172Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2858411Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2858654Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2858895Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2859129Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2859368Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2859600Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2859842Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2860073Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2860364Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2860609Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2860853Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2861085Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2861324Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2861561Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2861802Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2862051Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2862267Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2862479Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.2862721Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2862955Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2863215Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2863448Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2863689Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2863922Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2864166Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2864398Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2864640Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2864883Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2865133Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2865368Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2865579Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2865780Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2866014Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2866224Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2866461Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2866676Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2866920Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2867152Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2867363Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2867576Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2867808Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2868021Z E1204 11:00:14.360000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2868222Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2868457Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2868701Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2868932Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2869183Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2869423Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2869635Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2869856Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2870070Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2870350Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2870586Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2870820Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2871033Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2871249Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2871491Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2871725Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2871985Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2872218Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2872461Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2872697Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2872940Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2873176Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2873419Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2873664Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2873889Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2874095Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2874330Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2874548Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2874764Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2874979Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2875235Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2875468Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2875710Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2875944Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2876188Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2876433Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2876676Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2876913Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2877155Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2877391Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2877608Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2877820Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2878036Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2878261Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2878486Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:20.2878729Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2878964Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2879181Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2879395Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2879624Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2879867Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2880136Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2880377Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2880613Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2880870Z E1204 11:00:14.360000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2881104Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2881344Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2881580Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2881824Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2882057Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2882297Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2882545Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2882802Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2883038Z E1204 11:00:14.360000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2883278Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2883513Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2883754Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2883989Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2884248Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2884482Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2884698Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2884901Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2885137Z E1204 11:00:14.360000 712024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2885388Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2885624Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2885864Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2886100Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2886344Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2886580Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2886825Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2887071Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2887322Z E1204 11:00:14.360000 712024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2887557Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2887762Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2887998Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2888241Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2888487Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2888730Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2888966Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2889194Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2889412Z E1204 11:00:14.360000 712024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2889627Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2889852Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2890170Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2890404Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2890633Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2890851Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2891066Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2891280Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2891537Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2891784Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2892001Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2892215Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2892420Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2892583Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.2892821Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2893041Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2893276Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2893517Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2893752Z E1204 11:00:14.360000 712024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2893965Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2894184Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2894419Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2894631Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2894837Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2895072Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2895278Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2895513Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2895717Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2895962Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2896181Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2896416Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2896658Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2896896Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2897137Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2897383Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2897626Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2897860Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2898102Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2898336Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2898590Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2898824Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2899039Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2899243Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2899478Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2899708Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2899923Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:20.2900161Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2900391Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2900646Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2900881Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2901122Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2901357Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2901560Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2901808Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2902050Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2902283Z E1204 11:00:14.360000 712024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2902524Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2902757Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2902999Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2903215Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2903429Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2903635Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2903840Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2904076Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2904316Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2904553Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2904803Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2905050Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2905254Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2905487Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2905730Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2905963Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2906216Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2906450Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2906655Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2906891Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2907132Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2907377Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2907618Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2907851Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2908078Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2908295Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2908511Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2908726Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2908972Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2909215Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2909430Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2909664Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2909905Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2910157Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2910398Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2910649Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2910878Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2911094Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2911307Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2911524Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2911767Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2912012Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2912255Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2912490Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2912732Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2912968Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2913175Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2914782Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2915030Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2915281Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2915525Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2915760Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2915987Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2916206Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2916435Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2916651Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2916894Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2917130Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2917371Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2917627Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2917869Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2918103Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2918348Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2918584Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2918825Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2919061Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2919284Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.2919501Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.2919720Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.2919931Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.2920192Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.2920417Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.2920630Z E1204 
11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.2920851Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.2921059Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.2921248Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.2921392Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.2921554Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.2921673Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.2921816Z E1204 11:00:14.360000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.2922003Z [W1204 11:00:14.824477220 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.2922006Z 2025-12-04T12:10:20.2922167Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2922476Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.2922788Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2922935Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2923429Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2923711Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2923964Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2924186Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2924402Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2924642Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2924878Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2925119Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2925366Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2925606Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2925839Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2926080Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2926314Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2926566Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2926798Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2927010Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2927233Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2927449Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2927691Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2927923Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2928137Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2928370Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2928622Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2928854Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2929056Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2929288Z E1204 
11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2929499Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2929714Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2929947Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2930190Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2930393Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2930627Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2930892Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2931123Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2931363Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2931595Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2931807Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2932030Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2932245Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2932503Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2932735Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2932988Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2933220Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2933460Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2933692Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2933933Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2934179Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2934420Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2934652Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2934893Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2935126Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2935377Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2935608Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2935848Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2936079Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2936320Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2936553Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2936796Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2937038Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2937288Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2937521Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2937737Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2937947Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.2938188Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2938426Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2938681Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2938914Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2939156Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2939387Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2939630Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2939872Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2940142Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2940376Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2940618Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2940852Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2941063Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2941266Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2941515Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2941741Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2941963Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.2942178Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2942419Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2942654Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2942865Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2943082Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2943315Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2943525Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2943727Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2943960Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2944214Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2944447Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2944688Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2944921Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2945132Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2945353Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2945566Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2945819Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2946055Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2946282Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2946500Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2946715Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2946958Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2947192Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2947451Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2947685Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2947926Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2948161Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2948403Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2948649Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2948891Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2949125Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2949339Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2949545Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2949779Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2949996Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2950252Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2950472Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2950727Z 
E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2950962Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2951203Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2951440Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2951682Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2951930Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2952172Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2952405Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2952650Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2952885Z 
E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2953114Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2953327Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2953609Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2953833Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.2954049Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2954291Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2954526Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2954755Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2954968Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2955194Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2955437Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2955670Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2955912Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2956147Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2956398Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2956635Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2956875Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2957109Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2957350Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2957595Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2957837Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2958071Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2958313Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2958553Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2958795Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2959028Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2959279Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2959524Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2959766Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2960000Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2960245Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2960451Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2960687Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2960946Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2961179Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2961419Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2961652Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2961893Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2962141Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2962383Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2962621Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2962864Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2963098Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2963305Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2963538Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2963791Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2964042Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2964285Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2964590Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2964818Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2965034Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2965260Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2965476Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2965718Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2965952Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2966179Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2966395Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2966624Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2966838Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2967081Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2967316Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2967532Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2967748Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.2967953Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2968125Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.2968361Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2968577Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2968812Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2969052Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2969287Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2969500Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2969716Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2969949Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2970202Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2970408Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2970643Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2970848Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2971093Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2971297Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2971530Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2971735Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2971969Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2972212Z E1204 11:00:14.363000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2972449Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2972703Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2972949Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2973192Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2973425Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2973668Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2973902Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2974158Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2974392Z E1204 11:00:14.363000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2974608Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.2974814Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2975048Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2975276Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2975503Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2975717Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2975933Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2976176Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2976411Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2976656Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2976891Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2977106Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2977352Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2977594Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2977827Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2978069Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2978303Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2978542Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2978758Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2978971Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2979179Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.2979384Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2979621Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2979875Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2980143Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2980385Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2980620Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.2980825Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2981059Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2981300Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2981554Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2981809Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2982044Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2982247Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2982482Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2982726Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2982972Z E1204 11:00:14.363000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2983214Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2983447Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2983673Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2983890Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2984103Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2984329Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2984570Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2984806Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2985012Z E1204 11:00:14.363000 712024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2985246Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2985487Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2985720Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2985973Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2986219Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2986446Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2986663Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2986882Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2987096Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2987350Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2987584Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2987826Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2988059Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2988301Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2988536Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2988752Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.2988986Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2989229Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2989463Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2989705Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2989941Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2990204Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.2990437Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.2990665Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.2990881Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2991123Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2991357Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2991598Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2991845Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2992087Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2992319Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2992561Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2992799Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2993053Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2993287Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2993499Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.2993716Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.2993920Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.2994133Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.2994362Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.2994582Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.2994806Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.2995022Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.2995231Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.2995415Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.2995557Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.2995717Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.2995836Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.2995978Z E1204 11:00:14.363000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.2996159Z [W1204 11:00:14.826796263 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.2996161Z 2025-12-04T12:10:20.2996321Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.2996629Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.2996938Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.2997083Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.2997591Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.2997859Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.2998098Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.2998320Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.2998534Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.2998778Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.2999022Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2999263Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2999507Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.2999747Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.2999979Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3000255Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3000489Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3000747Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3000979Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3001190Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3001411Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3001627Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3001881Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3002113Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3002316Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3002549Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3002792Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3003026Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3003230Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3003472Z E1204 
11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3003683Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3003902Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3004134Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3004344Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3004547Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3004780Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3005032Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3005264Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3005504Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3005737Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3005948Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3006169Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3006394Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3006636Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3006870Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3007111Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3007344Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3007584Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3007815Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3008065Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3008308Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3008550Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3008783Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3009023Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3009255Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3009506Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3009740Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3009979Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3010248Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3010490Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3010735Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3010977Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3011210Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3011452Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3011685Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3011900Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3012111Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.3012362Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3012594Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3012849Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3013083Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3013322Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3013555Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3013794Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3014046Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3014286Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3014518Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3014759Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3014994Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3015216Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3015419Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3015650Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3015860Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3016082Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.3016296Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3016537Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3016779Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3016990Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3017201Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3017433Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3017642Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3017845Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3018077Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3018328Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3018561Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3018801Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3019033Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3019246Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3019480Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3019694Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3019938Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3020203Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3020421Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3020636Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3020850Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3021093Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3021345Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3021599Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3021835Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3022075Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3022310Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3022551Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3022799Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3023042Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3023278Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3023492Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3023696Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3023942Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3024157Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3024370Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3024587Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3024831Z 
E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3025068Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3025308Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3025552Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3025793Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3026039Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3026282Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3026515Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3026758Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3026992Z 
E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3027221Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3027434Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3027640Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3027864Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3028078Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3028330Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3028564Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3028782Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3028995Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3029211Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3029455Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3029689Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3029931Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3030214Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3030474Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3030708Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3030951Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3031185Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3031425Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3031672Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3031913Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3032148Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3032392Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3032626Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3032882Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3033115Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3033357Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3033592Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3033834Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3034068Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3034280Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3034495Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3034740Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3034984Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3035218Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3035459Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3035693Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3035935Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3036183Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3036425Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3036660Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3036902Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3037138Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3037353Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3037585Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3037828Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3038061Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3038303Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3038537Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3038765Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3038992Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3039215Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3039432Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3039672Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3039905Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3040158Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3040376Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3040605Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3040819Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3041062Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3041296Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3041513Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3041736Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.3041942Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3042105Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.3042338Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3042544Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3042780Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3043024Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3043271Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3043487Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3043707Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3043941Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3044153Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3044357Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3044591Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3044806Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3045048Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3045252Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3045486Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3045690Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3045925Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3046176Z E1204 11:00:14.365000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3046409Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3046653Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3046889Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3047131Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3047366Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3047607Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3047855Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3048106Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3048341Z E1204 11:00:14.365000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3048554Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3048759Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3048995Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3049233Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3049450Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3049663Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3049878Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3050176Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3050411Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3050666Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3050900Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3051106Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3051341Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3051583Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3051817Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3052057Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3052303Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3052541Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3052760Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3052975Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3053180Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3053385Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3053619Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3053875Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3054108Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3054350Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3054584Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3054789Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3055033Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3055274Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3055510Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3055751Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3055986Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3056190Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3056423Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3056769Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3057017Z E1204 11:00:14.365000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3057261Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3057493Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3057722Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3057938Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3058164Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3058379Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3058620Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3058854Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3059058Z E1204 11:00:14.365000 712024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3059294Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3059547Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3059784Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3060025Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3060285Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3060516Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3060732Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3060946Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3061185Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3061440Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3061676Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3061917Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3062152Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3062391Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3062638Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3062844Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3063080Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3063322Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3063555Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3063798Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3064053Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3064279Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3064496Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3064710Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3064927Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3065172Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3065406Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3065659Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3065900Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3066145Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3066378Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3066621Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3066853Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3067107Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3067341Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3067552Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.3067769Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.3067977Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.3068188Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.3068426Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.3068649Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.3068862Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.3069069Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.3069276Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.3069462Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.3069603Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.3069762Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.3069890Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.3070040Z E1204 11:00:14.365000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.3070142Z ('RERUN', {'yellow': True}) [1.2613s] [100%] 2025-12-04T12:10:20.3070485Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:00:15.886268269 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.3070489Z 2025-12-04T12:10:20.3070647Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.3070956Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3071264Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3071424Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3071917Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3072185Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3072426Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3072657Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3072874Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3073114Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3073350Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3073593Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3073826Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3074066Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3074309Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3074550Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3074795Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3075037Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3075270Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3075483Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3075705Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3075929Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3076171Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3076402Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3076607Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3076842Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3077092Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3077326Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3077528Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3077761Z E1204 
11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3077971Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3078174Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3078406Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3078615Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3078830Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3079071Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3079313Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3079543Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3079788Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3080021Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3080277Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3080501Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3080713Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3080958Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3081189Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3081432Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3081676Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3081917Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3082149Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3082388Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3082621Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3082861Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3083093Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3083343Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3083589Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3083833Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3084064Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3084304Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3084535Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3084790Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3085024Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3085262Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3085494Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3085733Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3085976Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3086191Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3086402Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.3086644Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3086877Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3087119Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3087350Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3087601Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3087835Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3088085Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3088317Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3088557Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3088790Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3089030Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3089272Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3089483Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3089685Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3089918Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3090162Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3090397Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.3090611Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3090854Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3091087Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3091298Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3091502Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3091733Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3091942Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3092155Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3092399Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3092641Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3092874Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3093115Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3093345Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3093569Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3093790Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3094004Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3094249Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3094483Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3094700Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3094924Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3095139Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3095381Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3095616Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3095860Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3096092Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3096333Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3096574Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3096834Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3097069Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3097311Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3097546Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3097759Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3097974Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3098208Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3098424Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3098636Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3098851Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3099095Z 
E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3099336Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3099578Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3099812Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3100054Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3100323Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3100565Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3100798Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3101055Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3101304Z 
E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3101520Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3101732Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3101939Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3102163Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3102391Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3102633Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3102867Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3103084Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3103297Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3103512Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3103766Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3104001Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3104244Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3104479Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3104720Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3104955Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3105196Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3105441Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3105693Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3105928Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3106170Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3106405Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3106647Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3106890Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3107134Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3107371Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3107612Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3107848Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3108104Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3108338Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3108550Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3108755Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3108992Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3109234Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3109469Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3109721Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3109955Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3110238Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3110472Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3110713Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3110947Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3111189Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3111437Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3111642Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3111876Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3112119Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3112355Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3112606Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3112841Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3113248Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3113466Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3113681Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3113897Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3114139Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3114392Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3114633Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3114849Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3115064Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3115279Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3115520Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3115755Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3115983Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3116196Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.3116401Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3116564Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.3116799Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3117003Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3117251Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3117492Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3117728Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3117940Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3118146Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3118380Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3118591Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3118804Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3119050Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3119258Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3119491Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3119694Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3119929Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3120165Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3120420Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3120663Z E1204 11:00:15.425000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3120897Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3121139Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3121373Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3121635Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3121868Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3122111Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3122345Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3122588Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3122820Z E1204 11:00:15.425000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3123034Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3123254Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3123499Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3123729Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3123945Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3124160Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3124377Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3124620Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3124867Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3125109Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3125344Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3125548Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3125785Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3126039Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3126273Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3126515Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3126749Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3126977Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3127196Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3127410Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3127626Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3127841Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3128076Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3128319Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3128553Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3128795Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3129033Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3129249Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3129483Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3129725Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3129958Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3130245Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3130497Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3130701Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3130937Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3131180Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3131415Z E1204 11:00:15.425000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3131657Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3131890Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3132130Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3132360Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3132575Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3132788Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3133032Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3133267Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3133471Z E1204 11:00:15.425000 712024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3133719Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3133960Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3134194Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3134434Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3134669Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3134904Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3135123Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3135336Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3135552Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3135796Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3136030Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3136271Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3136518Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3136772Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3137007Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3137213Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3137447Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3137688Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3137921Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3138173Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3138407Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3138633Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3138849Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3139065Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3139288Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3139532Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3139766Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3140008Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3140264Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3140507Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3140742Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3140999Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3141249Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3141492Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3141726Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3141939Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.3142155Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.3142375Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.3142585Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.3142814Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.3143035Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.3143249Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.3143456Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.3143675Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.3143865Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.3144006Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.3144168Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.3144288Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.3144430Z E1204 11:00:15.425000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.3144600Z [W1204 11:00:15.888557932 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.3144603Z 2025-12-04T12:10:20.3144761Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.3145072Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3145391Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3145548Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3146037Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3146305Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3146543Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3146780Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3146995Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3147237Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3147473Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3147714Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3147950Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3148200Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3148434Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3148681Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3148913Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3149156Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3149387Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3149609Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3149831Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3150055Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3150329Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3150560Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3150765Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3150997Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3151257Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3151493Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3151696Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3151929Z E1204 
11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3152140Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3152343Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3152590Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3152800Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3153005Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3153243Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3153484Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3153716Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3153956Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3154199Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3154423Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3154647Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3154860Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3155104Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3155336Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3155577Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3155819Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3156060Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3156294Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3156534Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3156768Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3157021Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3157253Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3157492Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3157725Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3157969Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3158200Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3158441Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3158682Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3158934Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3159167Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3159407Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3159641Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3159880Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3160153Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3160369Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3160579Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.3160819Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3161052Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3161308Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3161541Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3161783Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3162014Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3162258Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3162492Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3162732Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3162975Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3163217Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3163471Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3163682Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3163885Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3164119Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3164332Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3164568Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.3164784Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3165025Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3165257Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3165469Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3165672Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3165914Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3166127Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3166328Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3166560Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3166802Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3167034Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3167275Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3167517Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3167737Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3167960Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3168174Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3168418Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3168653Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3168880Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3169096Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3169311Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3169553Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3169787Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3170029Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3170306Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3170547Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3170782Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3171025Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3171262Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3171505Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3171814Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3172043Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3172261Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3172496Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3172712Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3172924Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3173140Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3173383Z 
E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3173633Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3173875Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3174109Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3174351Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3174585Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3174839Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3175073Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3175317Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3175551Z 
E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3175768Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3175981Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3176186Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3176423Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3176647Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3176891Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3177124Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3177341Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3177553Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3177778Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3178023Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3178257Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3178500Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3178736Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3178978Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3179223Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3179466Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3179701Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3179942Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3180204Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3180446Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3180679Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3180941Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3181194Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3181436Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3181670Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3181912Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3182148Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3182401Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3182636Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3182849Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3183054Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3183290Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3183547Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3183783Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3184024Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3184259Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3184500Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3184734Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3184975Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3185218Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3185461Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3185704Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3185910Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3186143Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3186386Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3186621Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3186871Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3187104Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3187332Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3187549Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3187761Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3187987Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3188230Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3188464Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3188690Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3188907Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3189121Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3189336Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3189587Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3189822Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3190050Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3190293Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.3190498Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3190660Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.3190894Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3191113Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3191349Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3191590Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3191824Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3192035Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3192241Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3192487Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3192700Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3192905Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3193139Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3193348Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3193582Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3193785Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3194030Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3194246Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3194480Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3194723Z E1204 11:00:15.427000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3194956Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3195197Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3195433Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3195686Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3195920Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3196161Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3196394Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3198161Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3199048Z E1204 11:00:15.427000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3199263Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3199470Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3199706Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3199943Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3200204Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3200418Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3200653Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3200895Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3201130Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3201378Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3201615Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3201820Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3202054Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3202299Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3202533Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3202777Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3203009Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3203238Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3203514Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3203747Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3203953Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3204157Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3204394Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3204637Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3204872Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3205122Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3205356Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3205562Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3205799Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3206039Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3206272Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3206514Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3206748Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3206952Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3207187Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3207428Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3207663Z E1204 11:00:15.427000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3207932Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3208183Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3208411Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3208629Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3208843Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3209060Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3209302Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3209545Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3209752Z E1204 11:00:15.427000 712024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3209986Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3210265Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3210499Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3210740Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3210973Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3211201Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3211419Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3211631Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3211851Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3212092Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3212351Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3212607Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3212841Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3213084Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3213319Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3213525Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3213761Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3214021Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3214255Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3214498Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3214732Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3214958Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3215176Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3215389Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3215605Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3215850Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3216083Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3216326Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3216560Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3216821Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3217066Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3217306Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3217541Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3219768Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3220163Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3220378Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.3220625Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.3220834Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.3221047Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.3221277Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.3221500Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.3221713Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.3221920Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.3222129Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.3222320Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.3222462Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.3222626Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.3222748Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.3222890Z E1204 11:00:15.427000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.3223082Z [W1204 11:00:15.890651828 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.3223085Z 2025-12-04T12:10:20.3223256Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.3223581Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3223892Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3224041Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3224544Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3224812Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3225061Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3225281Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3225497Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3225740Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3225977Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3226225Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3226457Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3226700Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3226933Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3227176Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3227409Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3227661Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3227923Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3228137Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3228361Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3228577Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3228822Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3229055Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3229258Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3229501Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3229742Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3229977Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3230219Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3230453Z E1204 
11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3230669Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3230870Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3231105Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3231315Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3231517Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3231749Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3231993Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3232256Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3232508Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3232741Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3232952Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3233176Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3233392Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3233633Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3233878Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3234118Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3234353Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3234594Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3234827Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3235069Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3235301Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3235543Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3235775Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3236018Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3236249Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3236501Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3236760Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3237002Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3237234Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3237474Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3237708Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3237950Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3238184Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3238434Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3238670Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3238888Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3239099Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.3239341Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3239573Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3239815Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3240049Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3240312Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3240545Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3240784Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3241044Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3241298Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3241530Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3241771Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3242003Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3242218Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3242420Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3242652Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3242874Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3243097Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.3243314Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3243554Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3243786Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3243996Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3244200Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3244433Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3244644Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3244845Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3245080Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3245333Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3245586Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3245826Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3246059Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3246271Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3246494Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3246709Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3246954Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3247197Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3247415Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3247629Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3247844Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3248087Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3248322Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3248562Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3248796Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3249038Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3249272Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3249513Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3249759Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3250020Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3250289Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3250503Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3250708Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3250943Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3251160Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3251373Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3251604Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3251847Z 
E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3252081Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3252325Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3252560Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3252802Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3253036Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3253277Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3253512Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3253755Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3253990Z 
E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3254227Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3254468Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3254675Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3254899Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3255118Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3255360Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3255597Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3255812Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3256035Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3256250Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3256495Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3256731Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3256976Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3257212Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3257452Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3257688Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3257931Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3258166Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3258409Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3258654Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3258915Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3259150Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3259392Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3259626Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3259868Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3260142Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3260384Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3260634Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3260877Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3261112Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3261326Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3261532Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3261766Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3262007Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3262243Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3262488Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3262722Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3262969Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3263229Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3263483Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3263716Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3263961Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3264194Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3264400Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3264637Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3264892Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3265127Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3265370Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3265608Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3265837Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3266120Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3266334Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3266550Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3266795Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3267028Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3267256Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3267472Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3267707Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3267931Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3268171Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3268406Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3268623Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3268838Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.3269046Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3269208Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.3269460Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3269664Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3269901Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3270184Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3270418Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3270633Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3270836Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3271074Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3271285Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3271490Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3271730Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3271934Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3272198Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3272415Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3272652Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3272854Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3273088Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3273331Z E1204 11:00:15.429000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3273565Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3273821Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3274053Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3274296Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3274532Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3274780Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3275015Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3275256Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3275492Z E1204 11:00:15.429000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3275704Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3275908Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3276143Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3276370Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3276623Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3276852Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3277068Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3277309Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3277545Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3277787Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3278022Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3278234Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3278468Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3278711Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3278945Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3279186Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3279419Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3279646Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3279864Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3280077Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3280327Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3280533Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3280772Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3281041Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3281289Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3281530Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3281763Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3281968Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3282203Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3282445Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3282692Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3282937Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3283171Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3283376Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3283611Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3283853Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3284087Z E1204 11:00:15.429000 
712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3284331Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3284566Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3284795Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3285013Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3285227Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3285462Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3285722Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3285956Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3286160Z E1204 11:00:15.429000 712024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3286394Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3286638Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3286873Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3287128Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3287363Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3287595Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3287814Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3288027Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3288242Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3288484Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3288720Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3288964Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3289198Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3289442Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3289675Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3289901Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3290195Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3290437Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3290671Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3290915Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3291152Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3291380Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3291611Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3291825Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3292039Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3292282Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3292515Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3292758Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3292996Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3293239Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3293474Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3293716Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3293950Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3294192Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3294452Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3294677Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.3294894Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.3295101Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.3295313Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.3295543Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.3295766Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.3295988Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.3296197Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.3296405Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.3296592Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.3296734Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.3296895Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.3297015Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.3297157Z E1204 11:00:15.429000 712024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.3297215Z FAILED [0.9405s] [100%] 2025-12-04T12:10:20.3297218Z 2025-12-04T12:10:20.3297294Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.3297454Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.3297520Z Traceback (most recent call last): 2025-12-04T12:10:20.3297699Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.3297759Z method(*args, **kwargs) 2025-12-04T12:10:20.3297926Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.3297983Z method(*args, **kwargs) 2025-12-04T12:10:20.3298149Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.3298203Z with policy(): 2025-12-04T12:10:20.3298371Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.3298439Z raise RuntimeError(msg) 2025-12-04T12:10:20.3298857Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1075838976. 
2025-12-04T12:10:20.3298870Z 2025-12-04T12:10:20.3298963Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.3299239Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.3299242Z 2025-12-04T12:10:20.3299349Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.3299442Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.3299505Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.3299580Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.3300204Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.3300336Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.3300393Z graph_break [] 2025-12-04T12:10:20.3300474Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.3300565Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.3301071Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.3301138Z current_size = base.storage().size() 2025-12-04T12:10:20.3301197Z Autotune Choices Stats: 2025-12-04T12:10:20.3301587Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.3301670Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.3301737Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.3301876Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.3302126Z triton_mm_9 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3302369Z triton_mm_13 0.0063 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3302608Z triton_mm_18 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3302849Z triton_mm_10 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3303133Z triton_mm_14 0.0067 ms 92.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3303380Z triton_mm_15 0.0070 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3303617Z triton_mm_8 0.0071 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3303856Z triton_mm_11 0.0071 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3304097Z triton_mm_7 0.0074 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3304338Z triton_mm_16 0.0092 ms 67.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3304500Z SingleProcess AUTOTUNE benchmarking takes 0.0777 seconds and 7.8771 seconds precompiling for 18 choices 2025-12-04T12:10:20.3304658Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.3304722Z Traceback (most recent call last): 2025-12-04T12:10:20.3304895Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.3304954Z method(*args, **kwargs) 
2025-12-04T12:10:20.3305123Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.3305179Z method(*args, **kwargs) 2025-12-04T12:10:20.3305345Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.3305398Z with policy(): 2025-12-04T12:10:20.3305568Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.3305625Z raise RuntimeError(msg) 2025-12-04T12:10:20.3306030Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1145044992. 2025-12-04T12:10:20.3306034Z 2025-12-04T12:10:20.3306128Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.3306405Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.3306408Z 2025-12-04T12:10:20.3306514Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.3306605Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.3306668Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.3306742Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.3307324Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.3307458Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.3307513Z graph_break [] 2025-12-04T12:10:20.3307592Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.3307683Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.3308182Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.3308247Z current_size = base.storage().size() 2025-12-04T12:10:20.3308306Z Autotune Choices Stats: 2025-12-04T12:10:20.3308691Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.3308773Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.3308849Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.3308988Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.3309234Z triton_mm_9 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:20.3309476Z triton_mm_13 0.0063 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3309715Z triton_mm_18 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3309953Z triton_mm_10 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3310233Z triton_mm_14 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3310476Z triton_mm_15 0.0070 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3310714Z triton_mm_8 0.0071 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3310953Z triton_mm_11 0.0071 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3311194Z triton_mm_7 0.0074 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3311472Z triton_mm_16 0.0092 ms 67.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3311617Z SingleProcess AUTOTUNE benchmarking takes 0.0777 seconds and 7.8771 seconds precompiling for 18 choices 2025-12-04T12:10:20.3311709Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.3311768Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.3311842Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.3311956Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.3312455Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.3312509Z graph_break [] 2025-12-04T12:10:20.3312589Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.3312679Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.3313070Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.3313180Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.3313238Z Autotune Choices Stats: 2025-12-04T12:10:20.3313619Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_30", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:20.3313700Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.3313767Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.3313902Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.3314147Z triton_mm_30 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3314386Z triton_mm_33 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3314625Z triton_mm_37 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3314863Z triton_mm_32 0.0065 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3315100Z triton_mm_29 0.0066 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3315346Z triton_mm_34 0.0067 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3315608Z triton_mm_27 0.0069 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3315846Z triton_mm_28 0.0069 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3316086Z triton_mm_35 0.0070 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3316322Z triton_mm_38 0.0071 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3316469Z SingleProcess AUTOTUNE benchmarking takes 0.1455 seconds and 0.5075 seconds precompiling for 21 choices 2025-12-04T12:10:20.3316538Z =================================== FAILURES =================================== 2025-12-04T12:10:20.3316694Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.3316757Z Traceback (most recent call last): 2025-12-04T12:10:20.3316939Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.3316997Z method(*args, **kwargs) 2025-12-04T12:10:20.3317166Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.3317224Z method(*args, **kwargs) 2025-12-04T12:10:20.3317391Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.3317446Z with policy(): 2025-12-04T12:10:20.3317615Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.3317673Z raise RuntimeError(msg) 2025-12-04T12:10:20.3318075Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 
2025-12-04T12:10:20.3318077Z 2025-12-04T12:10:20.3318169Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.3318440Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.3318443Z 2025-12-04T12:10:20.3318548Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.3318636Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.3318696Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.3318769Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.3319334Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.3319466Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.3319520Z graph_break [] 2025-12-04T12:10:20.3319620Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.3319710Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.3320244Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.3320308Z current_size = base.storage().size() 2025-12-04T12:10:20.3320366Z Autotune Choices Stats: 2025-12-04T12:10:20.3320747Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.3320829Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.3320894Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.3321031Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.3321297Z triton_mm_9 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3321536Z triton_mm_13 0.0063 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3321777Z triton_mm_18 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3322022Z triton_mm_10 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3322262Z triton_mm_14 0.0067 ms 92.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3322501Z triton_mm_15 0.0070 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3322740Z triton_mm_8 0.0071 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3322982Z triton_mm_11 0.0071 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3323220Z triton_mm_7 0.0074 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3323460Z triton_mm_16 0.0092 ms 67.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3323616Z SingleProcess AUTOTUNE benchmarking takes 0.0777 seconds and 7.8771 seconds precompiling for 18 choices 2025-12-04T12:10:20.3323733Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.3323792Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.3323865Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.3323981Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.3324477Z inductor 
[('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.3324533Z graph_break [] 2025-12-04T12:10:20.3324611Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.3324701Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.3325080Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:20.3325188Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.3325245Z Autotune Choices Stats: 2025-12-04T12:10:20.3325631Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_30", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:20.3325712Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.3325776Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.3325915Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.3326158Z triton_mm_30 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3326399Z triton_mm_33 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3326636Z triton_mm_37 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3326876Z triton_mm_32 0.0065 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3327112Z triton_mm_29 0.0066 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3327349Z triton_mm_34 0.0067 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3327590Z triton_mm_27 0.0069 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3327850Z triton_mm_28 0.0069 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3328103Z triton_mm_35 0.0070 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3328341Z triton_mm_38 0.0071 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3328486Z SingleProcess AUTOTUNE benchmarking takes 0.1455 seconds and 0.5075 seconds precompiling for 21 choices 2025-12-04T12:10:20.3328576Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.3328635Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.3328707Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.3328823Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.3329332Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.3329387Z graph_break [] 2025-12-04T12:10:20.3329465Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.3329553Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.3329611Z Autotune Choices Stats: 2025-12-04T12:10:20.3329991Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_57", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519999820739031, "best_triton_pos": 0} 
2025-12-04T12:10:20.3330070Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.3330229Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.3330364Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.3330609Z triton_mm_57 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3330851Z triton_mm_52 0.0071 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3331090Z triton_mm_53 0.0072 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3331328Z triton_mm_48 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3331564Z triton_mm_58 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3331820Z triton_mm_47 0.0076 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3332083Z triton_mm_50 0.0077 ms 84.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3332323Z triton_mm_54 0.0078 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3332558Z triton_mm_49 0.0079 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3332804Z triton_mm_51 0.0080 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3332948Z SingleProcess AUTOTUNE benchmarking takes 0.1285 seconds and 0.3559 seconds precompiling for 21 choices 2025-12-04T12:10:20.3333153Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-347d670f489dcd3b.xml - 2025-12-04T12:10:20.3333230Z =========================== short test summary info ============================ 2025-12-04T12:10:20.3333848Z FAILED [0.9405s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 
2025-12-04T12:10:20.3333851Z 2025-12-04T12:10:20.3333942Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.3334214Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.3334216Z 2025-12-04T12:10:20.3334319Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.3334397Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.3334485Z ================= 1 failed, 187 deselected, 2 rerun in 12.33s ================== 2025-12-04T12:10:20.3334538Z Got exit code 1 2025-12-04T12:10:20.3334597Z Retrying single test... 2025-12-04T12:10:20.3334757Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-debc414a89adc35f.xml 2025-12-04T12:10:20.3334832Z ============================= test session starts ============================== 2025-12-04T12:10:20.3334963Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.3335021Z cachedir: .pytest_cache 2025-12-04T12:10:20.3335195Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.3335259Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.3335317Z configfile: pytest.ini 2025-12-04T12:10:20.3335500Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.3335593Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.3335858Z stepcurrent: skipping 94 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.3335930Z Running 1 items in this shard 2025-12-04T12:10:20.3335932Z 2025-12-04T12:10:20.3336284Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:00:24.938341272 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.3336301Z 2025-12-04T12:10:20.3336471Z [W1204 11:00:32.465840616 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.3336474Z 2025-12-04T12:10:20.3336804Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3337112Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3337264Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3337773Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3338042Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3338284Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3338509Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3338725Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3338969Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3339205Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3339450Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3339686Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3339929Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3340198Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3340452Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3340710Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3340951Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3341184Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3341425Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3341663Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3341905Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3342138Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3342356Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3342593Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3342838Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3343071Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3343275Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3343508Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3343752Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3343986Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3344228Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3344461Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3344678Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.3344917Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.3345113Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.3345310Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.3345855Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmps75vvnso/nt/cnthrkjyzzqfuyg3rk6niqcu4372t5zn3jgledf3fowjbtupkffa.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.3346018Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.3346249Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.3346421Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.3346737Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.3346887Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.3347159Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.3347315Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.3347584Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.3347755Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.3348040Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.3348189Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.3348481Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.3348691Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.3349020Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3349327Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3349484Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3350005Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3350308Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3350547Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3350771Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3350987Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3351253Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3351488Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3351730Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3351967Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3352210Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3352444Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3352683Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3352916Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3353159Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3353391Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3353634Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3353867Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3354139Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3354385Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3354588Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3354822Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3355061Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3355295Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3355498Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3355741Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3355983Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3356217Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3356459Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3356691Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3356907Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.3357130Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.3357310Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.3357505Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.3357624Z E1204 11:00:31.988000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.3357949Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.3358255Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3358413Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3358920Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3359188Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3359427Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3359649Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3359866Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3360146Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3360395Z E1204 11:00:32.008000 717280 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3360637Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3360871Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3361112Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3361344Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3361588Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3361820Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3362065Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3362300Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3362541Z E1204 11:00:32.008000 717280 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3362778Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3363051Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3363298Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3363502Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3363735Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3363976Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3364209Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3364414Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3364656Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3364897Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3365129Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3365374Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3365614Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3365831Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.3366056Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.3366231Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.3366427Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.3366966Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmps75vvnso/tu/ctuiakwd5fs2iewewo4itmqf6jmngoxcdpr25nxf2ehguavpxtfw.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.3367127Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.3367356Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.3367556Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.3367868Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.3368016Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.3368287Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.3368441Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.3368710Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.3368883Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.3369175Z E1204 11:00:32.008000 717280 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.3369324Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.3369614Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.3369824Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.3370197Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3370507Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3370652Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3371143Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3371412Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3371654Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3371874Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3372125Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3372379Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3372615Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3372858Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3373093Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3373335Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3373567Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3373826Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3374058Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3374300Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3374532Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3374774Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3375008Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3375248Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3375481Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3375686Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3375919Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3376159Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3376405Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3376630Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3376864Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3377107Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3377338Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3377579Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3377811Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3378032Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.3378266Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.3378440Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.3378636Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.3378755Z E1204 11:00:32.008000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.3378927Z [W1204 11:00:32.472076193 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.3378929Z 2025-12-04T12:10:20.3379252Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.3379556Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3379703Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3380226Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3380494Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3380734Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3380979Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3381206Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3381447Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3381682Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3381922Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3382163Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3382404Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3382650Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3382891Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3383125Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3383368Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3383599Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3383841Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3384073Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3384314Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3384547Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3384751Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3384984Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3385223Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3385482Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3385696Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3385928Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3386169Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3386400Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3386642Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3386875Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3387102Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.3387326Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.3387500Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.3387694Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.3388227Z E1204 11:00:32.011000 717280 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmps75vvnso/qz/cqzbnpukywursu25rzqmtxnthlc3y2f27hrhlk2lndnhyjye5plw.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.3388387Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.3388615Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.3388789Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.3389092Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.3389239Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.3389509Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.3389672Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.3389950Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.3390160Z E1204 11:00:32.011000 717280 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.3390449Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.3390597Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.3390887Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.3391095Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.3391422Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3391742Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3391886Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3392377Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3392645Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3392885Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3393108Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3393325Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3393567Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3393804Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3394046Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3394296Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3394565Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3394798Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3395042Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3395275Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3395518Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3395750Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3395991Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3396232Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3396473Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3396705Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3396910Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3397146Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3397386Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3397621Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3397824Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3398056Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3398300Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3398532Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3398795Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3399037Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3399254Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.3399477Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.3399651Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.3399844Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.3399964Z E1204 11:00:32.011000 717280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.3400032Z ('RERUN', {'yellow': True}) [10.1243s] [100%] 2025-12-04T12:10:20.3400411Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:00:33.749120275 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.3400433Z 2025-12-04T12:10:20.3400594Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.3400900Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.3401211Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3401355Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3401847Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3402115Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3402356Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3402583Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] 
#7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3402798Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3403042Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3403301Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3403555Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3403790Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3404029Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3404262Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3404502Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3404738Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3404990Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3405223Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3405436Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3405659Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3405875Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3406116Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3406347Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3406552Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3406786Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3407028Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3407262Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3407467Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3407722Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3407945Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3408150Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3408383Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3408594Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3408798Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3409035Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3409289Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3409524Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3409764Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3409997Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3410259Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3410482Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3410701Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3410942Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3411179Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3411419Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3411651Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3411892Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3412153Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3412406Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3412638Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3412879Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3413113Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3413355Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3413591Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3413842Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3414076Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3414316Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3414553Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3414799Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3415032Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3415274Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3415507Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3415749Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3415980Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3416196Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3416407Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.3416673Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3416920Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3417162Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3417395Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3417635Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3417870Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3418112Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3418354Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3418598Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3418833Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3419075Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3419308Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3419520Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3419723Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3419956Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3420211Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3420431Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3420649Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3420889Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3421154Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3421381Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3421584Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3421819Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3422028Z E1204 11:00:33.302000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3422232Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3422464Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3422720Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3422953Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3423194Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3423430Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3423640Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3423863Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3424075Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3424321Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3424558Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3424775Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3424989Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3425204Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3425473Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3425718Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3425960Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3426196Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3426437Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3426674Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3426918Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3427162Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3427408Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3427643Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3427858Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3428063Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3428299Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3428515Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3428730Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3428946Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3429188Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3429425Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3429666Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3429922Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3430216Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3430450Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3430695Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3430928Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3431171Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3431407Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3431639Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3431852Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3432062Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3432288Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3432505Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:20.3432749Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3432983Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3433201Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3433417Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3433632Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3433876Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3434108Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3434381Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3434626Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3434867Z E1204 11:00:33.302000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3435108Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3435350Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3435585Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3435827Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3436072Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3436314Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3436551Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3436792Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3437029Z E1204 11:00:33.302000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3437272Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3437508Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3437753Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3437987Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3438230Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3438465Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3438678Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3438908Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3439159Z E1204 11:00:33.302000 717280 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3439404Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3439639Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3439885Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3440156Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3440397Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3440649Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3440891Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3441130Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3441374Z E1204 11:00:33.302000 717280 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3441611Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3441819Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3442053Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3442297Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3442531Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3442774Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3443008Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3443253Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3443503Z E1204 11:00:33.302000 717280 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3443715Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3443932Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3444174Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3444411Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3444640Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3444856Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3445079Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3445296Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3445542Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3445776Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3445993Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3446206Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3446413Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3446577Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.3446813Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3447019Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3447262Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3447506Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3447760Z E1204 11:00:33.302000 717280 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3447984Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3448189Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3448424Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3448635Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3448841Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3449077Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3449282Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3449529Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3449733Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3449968Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3450211Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3450446Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3450690Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3450923Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3451167Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3451403Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3451646Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3451881Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3452143Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3452404Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3452646Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3452880Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3453094Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3453300Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3453535Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3453762Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3453993Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:20.3454205Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3454423Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3454666Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3454900Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3455142Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3455375Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3455583Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3455817Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3456059Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3456292Z E1204 11:00:33.302000 717280 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3456544Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3456801Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3457029Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3457247Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3457461Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3457669Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3457876Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3458115Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3458373Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3458921Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3459451Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3459984Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3460500Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3460988Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3461504Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3462029Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3462547Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3463069Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3463550Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3464050Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3464645Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3465177Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3465697Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3466213Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3466714Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3467196Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3467661Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3468148Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3468641Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3469156Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3469636Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3470144Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3470660Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3471172Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3471687Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3472199Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3472695Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3473179Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3473667Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3474160Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3474654Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3475167Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3475683Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3476198Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3476710Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3477224Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3477718Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3478203Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3478718Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3479233Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3479747Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3480297Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3480798Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3481282Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3481751Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3482218Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3482716Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3483245Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3483781Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3484295Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3484809Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3485322Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3485836Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3486351Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3486880Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3487395Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3487878Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.3488347Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.3488805Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.3489256Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.3489732Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.3490256Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.3490730Z E1204 
11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.3491190Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.3491735Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.3492163Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.3492525Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.3492882Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.3493220Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.3493515Z E1204 11:00:33.302000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.3493864Z [W1204 11:00:33.766901277 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.3494070Z 2025-12-04T12:10:20.3494231Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.3494736Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3495397Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3495886Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3496576Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3497363Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3497907Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3498406Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3498880Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3499375Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3499889Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3500450Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3500965Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3501477Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3501986Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3502529Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3503056Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3503568Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3504077Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3504561Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3505033Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3505509Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3506012Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3506523Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3506997Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3507470Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3510200Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3510726Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3511198Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3511673Z E1204 
11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3512159Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3512610Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3513080Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3513559Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3514034Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3514637Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3515148Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3515664Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3516173Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3516683Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3517164Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3517631Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3518121Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3518612Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3519125Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3519641Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3520195Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3520703Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3521211Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3521721Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3522230Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3522748Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3523256Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3523800Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3524331Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3524839Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3525352Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3525861Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3526373Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3526882Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3527404Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3527914Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3528427Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3528937Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3529446Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3529932Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3530427Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.3530919Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3531429Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3531938Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3532449Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3532963Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3533503Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3534028Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3534543Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3535055Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3535571Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3536086Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3536603Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3537107Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3537560Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3538038Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3538524Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3538993Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.3539467Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3539959Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3540509Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3540996Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3541456Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3541932Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3542417Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3542889Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3543387Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3543911Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3544431Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3544944Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3545468Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3545961Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3546438Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3546933Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3547439Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3547964Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3548464Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3548935Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3549402Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3549903Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3550455Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3550968Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3551481Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3551992Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3552542Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3553074Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3553587Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3554102Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3554619Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3555106Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3555560Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3556038Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3556547Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3557013Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3557477Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3557973Z 
E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3558498Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3559017Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3559530Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3560042Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3560585Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3561099Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3561612Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3562151Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3562680Z 
E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3563172Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3563638Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3564098Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3564567Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3565044Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3565539Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3566066Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3566555Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3567025Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3567491Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3567985Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3568499Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3569017Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3569533Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3570046Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3570599Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3571112Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3571651Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3572187Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3572698Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3573211Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3573724Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3574238Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3574750Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3575281Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3575793Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3576310Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3576824Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3577339Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3577851Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3578334Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3578788Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3579263Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3579777Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3580334Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3580847Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3581385Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3581909Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3582423Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3582936Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3583448Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3583962Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3584473Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3584962Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3585441Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3585953Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3586465Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3586978Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3587491Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3587991Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3588474Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3588941Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3589404Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3589899Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3590448Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3590979Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3591472Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3591937Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3592399Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3592891Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3593414Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3593903Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3594379Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.3594834Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3595239Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.3595675Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3596154Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3596629Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3597141Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3597654Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3598137Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3598589Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3599063Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3599548Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3600007Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3600542Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3601015Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3601491Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3601966Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3602447Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3602922Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3603394Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3603919Z E1204 11:00:33.306000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3604434Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3604947Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3605459Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3605973Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3606484Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3606996Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3607509Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3608021Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3608532Z E1204 11:00:33.306000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3609013Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3609478Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3609975Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3610512Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3610999Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3611466Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3611930Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3612426Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3612938Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3613465Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3613976Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3614454Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3614932Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3615443Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3615955Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3616466Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3616979Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3617476Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3617965Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3618432Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3618899Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3619356Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3619840Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3620383Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3620894Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3621405Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3621917Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3622390Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3622885Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3623397Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3623911Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3624431Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3624943Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3625422Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3625897Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3626410Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3626923Z E1204 11:00:33.306000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3627439Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3627953Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3628472Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3628977Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3629443Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3629906Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3630437Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3630952Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3631429Z E1204 11:00:33.306000 717280 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3631906Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3632431Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3632943Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3633457Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3633972Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3634471Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3634951Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3635415Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3635883Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3636377Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3636895Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3637407Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3637932Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3638468Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3638979Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3639455Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3639931Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3640500Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3641015Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3641526Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3642054Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3642553Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3643034Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3643504Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3643970Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3644468Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3644983Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3645500Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3646015Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3646530Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3647044Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3647569Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3648107Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3648621Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3649133Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3649620Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.3650086Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.3650582Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.3651035Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.3651527Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.3652014Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.3652487Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.3652945Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.3653393Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.3653821Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.3654182Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.3654520Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.3654835Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.3655131Z E1204 11:00:33.306000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.3655480Z [W1204 11:00:33.769190080 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.3655687Z 2025-12-04T12:10:20.3655847Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.3656351Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3657034Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3657540Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3658216Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3659004Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3659543Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3660035Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3660546Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3661054Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3661568Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3662082Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3662590Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3663104Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3663615Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3664124Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3664633Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3665144Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3665654Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3666138Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3666638Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3667127Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3667619Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3668131Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3668605Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3669076Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3669586Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3670141Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3670613Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3671086Z E1204 
11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3671571Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3672020Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3672491Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3672971Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3673418Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3673888Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3674402Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3674915Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3675424Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3675948Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3676457Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3676925Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3677401Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3677892Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3678402Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3678915Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3679424Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3679947Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3680502Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3681013Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3681523Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3682033Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3682543Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3683055Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3683566Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3684076Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3684586Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3685094Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3685631Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3686152Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3686662Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3687174Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3687681Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3688193Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3688707Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3689205Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3689665Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.3690186Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3690696Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3691209Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3691718Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3692231Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3692747Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3693258Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3693769Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3694277Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3694785Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3695326Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3695848Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3696328Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3696776Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3697247Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3697726Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3698193Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.3698677Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3699169Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3699679Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3700270Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3700719Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3701192Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3701669Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3702118Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3702588Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3703095Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3703604Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3704112Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3704636Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3705141Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3705609Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3706089Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3706583Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3707098Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3707588Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3708055Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3708534Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3709027Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3709541Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3710053Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3710595Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3711108Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3711345Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3711587Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3711823Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3712067Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3712300Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3712531Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3712759Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3713067Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3713284Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3713498Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3713718Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3713962Z 
E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3714196Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3714454Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3714688Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3714934Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3715169Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3715411Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3715644Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3715887Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3716124Z 
E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3716341Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3716556Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3716763Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3717006Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3717243Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3717486Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3717720Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3717936Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3718152Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3718366Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3718608Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3718852Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3719093Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3719331Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3719574Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3719809Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3720050Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3720330Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3720573Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3720811Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3721055Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3721291Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3721571Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3721818Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3722062Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3722297Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3722538Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3722773Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3723015Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3723261Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3723474Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3723679Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3723913Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3724156Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3724390Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3724632Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3724868Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3725109Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3725344Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3725585Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3725821Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3726081Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3726327Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3726533Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3726767Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3727009Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3727244Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3727485Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3727730Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3727957Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3728175Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3728390Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3728604Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3728845Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3729085Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3729316Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3729532Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3729745Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3729960Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3730241Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3730497Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3730726Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3730939Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.3731145Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3731307Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.3731544Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3731749Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3731984Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3732240Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3732474Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3732687Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3732892Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3733133Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3733345Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3733547Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3733783Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3733988Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3734223Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3734427Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3734661Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3734884Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3735127Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3735372Z E1204 11:00:33.308000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3735606Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3735848Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3736084Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3736324Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3736572Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3736813Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3737050Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3737292Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3737525Z E1204 11:00:33.308000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3737739Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3737943Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3738180Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3738407Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3738625Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3738838Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3739052Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3739316Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3739559Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3739801Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3740034Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3740276Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3740512Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3740753Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3741002Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3741248Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3741483Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3741710Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3741929Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3742149Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3742356Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3742561Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3742796Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3743037Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3743272Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3743515Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3743775Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3743991Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3744228Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3744469Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3744705Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3744947Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3745181Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3745394Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3745627Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3745870Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3746105Z E1204 11:00:33.308000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3746348Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3746582Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3746809Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3747027Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3747241Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3747457Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3747700Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3747934Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3748158Z E1204 11:00:33.308000 717280 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3748403Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3748649Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3748882Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3749126Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3749365Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3749592Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3749818Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3750033Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3750294Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3750537Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3750771Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3751014Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3751249Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3751494Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3751728Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3751934Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3752168Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3752411Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3752670Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3752937Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3753172Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3753405Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3753622Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3753836Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3754051Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3754309Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3754544Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3754788Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3755023Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3755265Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3755500Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3755743Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3755980Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3756223Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3756459Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3756672Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.3756902Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.3757126Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.3757342Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.3757572Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.3757795Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.3758012Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.3758220Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.3758426Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.3758621Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.3758763Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.3758923Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.3759044Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.3759186Z E1204 11:00:33.308000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.3759256Z ('RERUN', {'yellow': True}) [1.1899s] [100%] 2025-12-04T12:10:20.3759604Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:00:34.757584313 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.3759607Z 2025-12-04T12:10:20.3759767Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.3760079Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3760417Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3760564Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3761054Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3761340Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3761608Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3761831Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3762053Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3762293Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3762532Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3762775Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3763008Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3763262Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3763493Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3763735Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3763967Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3764211Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3764445Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3764655Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3764880Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3765095Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3765338Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3765571Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3765787Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3766043Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3766287Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3766523Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3766726Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3766960Z E1204 
11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3767170Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3767374Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3767617Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3767828Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3768033Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3768267Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3768508Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3768745Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3768990Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3769226Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3769439Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3769663Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3769876Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3770159Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3770426Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3770680Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3770913Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3771154Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3771390Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3771633Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3771866Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3772118Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3772353Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3772595Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3772827Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3773069Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3773300Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3773542Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3773777Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3774019Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3774254Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3774495Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3774737Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3774998Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3775231Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3775449Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3775660Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.3775902Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3776135Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3776377Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3776618Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3776859Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3777093Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3777333Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3777567Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3777808Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3778042Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3778283Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3778516Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3778728Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3778929Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3779183Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3779403Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3779628Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.3779844Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3780084Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3780355Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3780568Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3780770Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3781013Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3781223Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3781425Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3781661Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3781904Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3782137Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3782379Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3782616Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3782830Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3783052Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3783266Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3783523Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3783780Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3783997Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3784210Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3784427Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3784670Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3784905Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3785147Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3785393Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3785635Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3785870Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3786112Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3786345Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3786589Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3786823Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3787037Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3787242Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3787477Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3787695Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3787924Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3788152Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3788404Z 
E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3788641Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3788884Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3789118Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3789361Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3789595Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3789848Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3790083Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3790355Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3790591Z 
E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3790809Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3791023Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3791227Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3791455Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3791670Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3791911Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3792145Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3792377Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3792616Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3792833Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3793076Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3793311Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3793554Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3793791Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3794031Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3794279Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3794522Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3794760Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3795002Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3795236Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3795477Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3795710Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3795954Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3796190Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3796431Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3796667Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3796930Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3797175Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3797417Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3797652Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3797865Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3798071Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3798306Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3798559Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3798795Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3799035Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3799271Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3799513Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3799748Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3799992Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3800258Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3800500Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3800733Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3800941Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3801175Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3801444Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3801692Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3801933Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3802167Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3802395Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3802613Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3802827Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3803059Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3803300Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3803534Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3803764Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3803979Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3804195Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3804411Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3804653Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3804890Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3805106Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3805319Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.3805524Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3805696Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.3805951Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3806155Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3806397Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3806639Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3806879Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3807093Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3807298Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3807542Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3807754Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3807958Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3808193Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3808396Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3808631Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3808837Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3809073Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3809276Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3809509Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3809751Z E1204 11:00:34.296000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3809986Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3810299Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3810546Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3810789Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3811025Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3811267Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3811503Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3811746Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3811992Z E1204 11:00:34.296000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3812206Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3812411Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3812645Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3812874Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3813092Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3813305Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3813521Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3813764Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3813999Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3814239Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3814479Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3814707Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3814951Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3815194Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3815427Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3815670Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3815905Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3816132Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3816359Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3816573Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3816781Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3816985Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3817220Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3817461Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3817694Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3817937Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3818171Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3818375Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3818608Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3818851Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3819104Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3819360Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3819593Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3819797Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3820030Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3820318Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3820552Z E1204 11:00:34.296000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3820807Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3821042Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3821268Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3821489Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3821702Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3821916Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3822158Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3822393Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3822597Z E1204 11:00:34.296000 717280 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3822831Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3823075Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3823308Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3823575Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3823823Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3824050Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3824266Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3824479Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3824695Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3824940Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3825183Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3825426Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3825660Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3825904Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3826137Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3826343Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3826577Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3826820Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3827056Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3827297Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3827531Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3827758Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3827995Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3828220Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3828434Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3828676Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3828911Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3829155Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3829387Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3831340Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3831583Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3831829Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3832067Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3832308Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3832545Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3832761Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.3832979Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.3833184Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.3833393Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.3833623Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.3833845Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.3834087Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.3834310Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.3834518Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.3834708Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.3834849Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0
2025-12-04T12:10:20.3835010Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0
2025-12-04T12:10:20.3835130Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] .
2025-12-04T12:10:20.3835271Z E1204 11:00:34.296000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice.
2025-12-04T12:10:20.3835442Z [W1204 11:00:34.759806037 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...
2025-12-04T12:10:20.3835445Z
2025-12-04T12:10:20.3835619Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning:
2025-12-04T12:10:20.3835928Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.
2025-12-04T12:10:20.3836237Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first):
2025-12-04T12:10:20.3836383Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback:
2025-12-04T12:10:20.3836881Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0
2025-12-04T12:10:20.3837153Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0
2025-12-04T12:10:20.3837393Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0
2025-12-04T12:10:20.3837613Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552
2025-12-04T12:10:20.3837829Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215
2025-12-04T12:10:20.3838070Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112
2025-12-04T12:10:20.3838317Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3838588Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3838824Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3839066Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3839299Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3839541Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3839774Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3840015Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3840302Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3840520Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3840746Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3840960Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3841202Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3841433Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3841637Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3841871Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3842112Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3842346Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3842547Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3842792Z E1204 
11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3843028Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3843230Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3843462Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3843674Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3843875Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3844108Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3844348Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3844588Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3844829Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3845063Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3845276Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3845497Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3845711Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3845951Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3846183Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3846425Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3846656Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3846898Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3847130Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3847391Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3847635Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3847874Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3848106Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3848346Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3848579Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3848821Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3849063Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3849304Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3849536Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3849777Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3850009Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3850281Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3850514Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3850756Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3850989Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3851206Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3851418Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.3851673Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3851929Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3852169Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3852401Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3852643Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3852877Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3853117Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3853348Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3853602Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3853835Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3854075Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3854308Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3854519Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3854722Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3854956Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3855169Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3855391Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.3855605Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3855846Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3856093Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3856323Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3856525Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3856760Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3856973Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3857175Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3857409Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3857649Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3857892Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3858132Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3858366Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3858577Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3858801Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3859016Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3859259Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3859495Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3859710Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3859924Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3860174Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3860415Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3860676Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3860931Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3861168Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3861410Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3861646Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3861888Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3862121Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3862377Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3862611Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3862827Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3863033Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3863268Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3863484Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3863697Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3863912Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3864155Z 
E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3864393Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3864636Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3864868Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3865138Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3865381Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3865625Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3865857Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3866099Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3866336Z 
E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3866551Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3866774Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3866980Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3867207Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3867422Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3867663Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3867898Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3868112Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3868326Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3868540Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3868781Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3869019Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3869260Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3869516Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3869766Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3870001Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3870278Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3870517Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3870759Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3870994Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3871254Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3871487Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3871731Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3871965Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3872207Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3872440Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3872682Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3872918Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3873162Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3873396Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3873608Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3873827Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3874086Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3874328Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3874563Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3874803Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3875040Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3875282Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3875526Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3875768Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3876001Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3876244Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3876477Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3876683Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3876918Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3877162Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3877398Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3877637Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3877870Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3878096Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3878333Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3878555Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3878770Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3879015Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3879248Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3879477Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3879780Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3880003Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3880258Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3880500Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3880736Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3880953Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3881169Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.3881379Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3881543Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.3881777Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3881983Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3882218Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3882460Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3882694Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3882933Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3883152Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3883387Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3883597Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3883802Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3884036Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3884242Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3884486Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3884690Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3884925Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3885131Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3885366Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3885608Z E1204 11:00:34.298000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3885842Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3886085Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3886318Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3886559Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3886793Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3887035Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3887291Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3887548Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3887783Z E1204 11:00:34.298000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3887996Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3888201Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3888437Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3888666Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3888890Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3889103Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3889320Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3889564Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3889797Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3890039Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3890314Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3890518Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3890754Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3890995Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3891230Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3891470Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3891732Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3891975Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3892190Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3892403Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3892608Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3892814Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3893050Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3893309Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3893543Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3893785Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3894020Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3894223Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3894458Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3894699Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3894933Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3895178Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3895411Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3895615Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3895849Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3896120Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3896363Z E1204 11:00:34.298000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3896605Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3896838Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3897064Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3897285Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3897500Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3897726Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3897968Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3898204Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3898413Z E1204 11:00:34.298000 717280 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3898646Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3898888Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3899122Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3899366Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3899600Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3899827Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3900043Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3900299Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3900539Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3900793Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3901028Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3901272Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3901507Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3901750Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3901983Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3902201Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3902434Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3902676Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3902912Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3903153Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3903389Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3903615Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3903833Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3904046Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3904260Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3904503Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3904736Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3905008Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3905255Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3905499Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3905737Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3905980Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3906217Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3906457Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3906707Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3906919Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.3907139Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.3907346Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.3907558Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.3907789Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.3908010Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.3908222Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.3908428Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.3908635Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.3908821Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.3908962Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.3909122Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.3909257Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.3909422Z E1204 11:00:34.298000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.3909596Z [W1204 11:00:34.761872753 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.3909599Z 2025-12-04T12:10:20.3909758Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.3910066Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.3910406Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.3910553Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.3911059Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.3911329Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.3911569Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.3911789Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.3912002Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3912245Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3912479Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3912721Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3912954Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3913195Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3913429Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3913681Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3913937Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3914178Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3914410Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3914622Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3914844Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3915062Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3915302Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3915547Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3915751Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3915983Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3916224Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3916456Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3916660Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3916891Z E1204 
11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3917104Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3917309Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3917541Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3917752Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3917952Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3918204Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3918454Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3918687Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3918929Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3919160Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3919375Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3919596Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3919821Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3920061Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3920330Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3920574Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3920806Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3921048Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3921281Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3921525Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3921758Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3922000Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3922235Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3922474Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3922737Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3922989Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3923222Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3923463Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3923696Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3923937Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3924169Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3924422Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3924653Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3924896Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3925130Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3925345Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3925557Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.3925797Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3926032Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3926272Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3926504Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3926745Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3926987Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3927246Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3927478Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3927721Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3927952Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3928193Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3928426Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3928636Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3928848Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3929079Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3929292Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3929514Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.3929730Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3929971Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3930236Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3930449Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3930651Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3930884Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3931094Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3931297Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3931557Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3931811Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3932044Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3932283Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3932516Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3932727Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3932949Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3933175Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3933419Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3933656Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3933874Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3934087Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3934301Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3934543Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3934779Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3935020Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3935254Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3935496Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3935732Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3935993Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3936236Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3936479Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3936713Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3936927Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3937133Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3937368Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3937599Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3937815Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3938031Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3938278Z 
E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3938512Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3938753Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3938986Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3939228Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3939462Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3939706Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3939939Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3940219Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3940481Z 
E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3940709Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3940924Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3941129Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3941354Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.3941570Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3941815Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3942060Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3942279Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3942492Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3942708Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3942950Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3943183Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3943425Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3943662Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3943904Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3944137Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3944378Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3944612Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3944872Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3945118Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3945359Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3945594Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3945838Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3946073Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3946318Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3946564Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3946806Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3947041Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3947284Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3947521Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3947733Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3947937Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3948171Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3948417Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3948650Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3948893Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3949137Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3949397Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3949634Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3949875Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3950126Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3950370Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3950605Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3950809Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3951059Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3951301Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3951538Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3951781Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3952015Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3952243Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3952460Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3952674Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3952888Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3953129Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3953364Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3953679Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3953928Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3954148Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3954362Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3954604Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3954839Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3955056Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3955269Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.3955485Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3955650Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.3955885Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3956091Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3956325Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3956568Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3956800Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3957016Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3957222Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3957456Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3957671Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3957943Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3958200Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3958418Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3958653Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3958859Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3959093Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3959299Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3959532Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3959786Z E1204 11:00:34.301000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3960021Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3960307Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3960542Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3960783Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3961017Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3961258Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3961494Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3961738Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3961971Z E1204 11:00:34.301000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3962185Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.3962389Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3962649Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3962887Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3963105Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3963318Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3963533Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3963779Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3964013Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3964326Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3964559Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3964825Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3965061Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3965302Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3965539Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3965779Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3966017Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3966245Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3966462Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3966676Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3966881Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.3967098Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3967354Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3967596Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3967829Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3968073Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3968308Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.3968512Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3968755Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3968997Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3969233Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3969475Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3969709Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3969914Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3970183Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3970425Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3970660Z E1204 11:00:34.301000 
717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3970901Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3971134Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3971363Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3971615Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3971841Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3972058Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3972299Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3972535Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3972740Z E1204 11:00:34.301000 717280 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3972975Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3973230Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3973463Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3973706Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3973941Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3974173Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3974389Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3974603Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3974819Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3975062Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3975296Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3975537Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3975771Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3976035Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3976283Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3976487Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.3976723Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3976965Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3977200Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3977442Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3977687Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3977914Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.3978133Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.3978348Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.3978563Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.3978806Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.3979044Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3979288Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3979523Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3979765Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3979999Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3980279Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3980542Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3980795Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.3981030Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.3981241Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.3981457Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.3981662Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.3981874Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.3982115Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.3982339Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.3982552Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.3982758Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.3982965Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.3983151Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.3983292Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.3983451Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.3983572Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.3983712Z E1204 11:00:34.301000 717280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.3983772Z FAILED [0.9225s] [100%] 2025-12-04T12:10:20.3983775Z 2025-12-04T12:10:20.3983848Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.3984009Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.3984073Z Traceback (most recent call last): 2025-12-04T12:10:20.3984252Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.3984312Z method(*args, **kwargs) 2025-12-04T12:10:20.3984479Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.3984546Z method(*args, **kwargs) 2025-12-04T12:10:20.3984709Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.3984786Z with policy(): 2025-12-04T12:10:20.3984954Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.3985012Z raise RuntimeError(msg) 2025-12-04T12:10:20.3985419Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1075838976. 
2025-12-04T12:10:20.3985422Z 2025-12-04T12:10:20.3985516Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.3985791Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.3985795Z 2025-12-04T12:10:20.3985905Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.3985999Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.3986060Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.3986135Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.3986715Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.3986836Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.3986890Z graph_break [] 2025-12-04T12:10:20.3986974Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.3987065Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.3987570Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.3987635Z current_size = base.storage().size() 2025-12-04T12:10:20.3987694Z Autotune Choices Stats: 2025-12-04T12:10:20.3988085Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007158999796956778, "best_triton_pos": 0} 2025-12-04T12:10:20.3988168Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.3988235Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.3988372Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.3988625Z triton_mm_15 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3988866Z triton_mm_9 0.0075 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3989135Z triton_mm_7 0.0075 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3989385Z triton_mm_17 0.0075 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3989625Z triton_mm_10 0.0076 ms 93.7% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3989867Z triton_mm_11 0.0078 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3990135Z triton_mm_8 0.0079 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3990373Z triton_mm_18 0.0080 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3990628Z triton_mm_13 0.0080 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3990689Z _scaled_mm 0.0086 ms 83.6% 2025-12-04T12:10:20.3990836Z SingleProcess AUTOTUNE benchmarking takes 0.0873 seconds and 8.0564 seconds precompiling for 18 choices 2025-12-04T12:10:20.3990993Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.3991057Z Traceback (most recent call last): 2025-12-04T12:10:20.3991230Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.3991289Z method(*args, **kwargs) 2025-12-04T12:10:20.3991456Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.3991513Z method(*args, **kwargs) 
2025-12-04T12:10:20.3991679Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.3991734Z with policy(): 2025-12-04T12:10:20.3991901Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.3991960Z raise RuntimeError(msg) 2025-12-04T12:10:20.3992361Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1145044992. 2025-12-04T12:10:20.3992364Z 2025-12-04T12:10:20.3992456Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.3992727Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.3992732Z 2025-12-04T12:10:20.3992835Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.3992925Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.3992998Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.3993073Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.3993650Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 
2025-12-04T12:10:20.3993778Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.3993832Z graph_break [] 2025-12-04T12:10:20.3993913Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.3994001Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.3994501Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.3994568Z current_size = base.storage().size() 2025-12-04T12:10:20.3994625Z Autotune Choices Stats: 2025-12-04T12:10:20.3995019Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007158999796956778, "best_triton_pos": 0} 2025-12-04T12:10:20.3995100Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.3995167Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.3995304Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.3995553Z triton_mm_15 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3995791Z triton_mm_9 0.0075 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3996032Z triton_mm_7 0.0075 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3996269Z triton_mm_17 0.0075 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3996507Z triton_mm_10 0.0076 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3996746Z triton_mm_11 0.0078 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3996984Z triton_mm_8 0.0079 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3997221Z triton_mm_18 0.0080 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.3997478Z triton_mm_13 0.0080 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.3997547Z _scaled_mm 0.0086 ms 83.6% 2025-12-04T12:10:20.3997692Z SingleProcess AUTOTUNE 
benchmarking takes 0.0873 seconds and 8.0564 seconds precompiling for 18 choices 2025-12-04T12:10:20.3997782Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.3997848Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.3997926Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.3998041Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.3998540Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.3998596Z graph_break [] 2025-12-04T12:10:20.3998674Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.3998764Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.3999154Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.3999265Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.3999323Z Autotune Choices Stats: 2025-12-04T12:10:20.3999702Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.3999783Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.3999847Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.3999983Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4000258Z triton_mm_31 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4000497Z triton_mm_32 0.0070 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4000736Z triton_mm_38 0.0072 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4000973Z triton_mm_33 0.0074 ms 92.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4001209Z triton_mm_30 0.0075 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4001458Z triton_mm_37 0.0076 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4001723Z triton_mm_34 0.0078 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4001964Z triton_mm_27 0.0080 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4002204Z triton_mm_35 0.0085 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4002263Z _scaled_mm 0.0086 ms 79.9% 2025-12-04T12:10:20.4002408Z SingleProcess AUTOTUNE benchmarking takes 0.1591 seconds and 0.4953 seconds precompiling for 21 choices 2025-12-04T12:10:20.4002479Z =================================== FAILURES =================================== 2025-12-04T12:10:20.4002636Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4002699Z Traceback (most recent call last): 2025-12-04T12:10:20.4002870Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4002928Z method(*args, **kwargs) 2025-12-04T12:10:20.4003114Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in 
wrapper 2025-12-04T12:10:20.4003172Z method(*args, **kwargs) 2025-12-04T12:10:20.4003337Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4003392Z with policy(): 2025-12-04T12:10:20.4003558Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4003618Z raise RuntimeError(msg) 2025-12-04T12:10:20.4004021Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 2025-12-04T12:10:20.4004024Z 2025-12-04T12:10:20.4004115Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4004390Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4004394Z 2025-12-04T12:10:20.4004497Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4004587Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4004648Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4004722Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4005285Z inductor [('triton_bundler_save_kernel', 168), ('generated_module_cache_miss', 20), ('benchmarking.InductorBenchmarker.benchmark_gpu', 18), ('select_algorithm_num_precompiles', 17), ('select_algorithm_num_precompilation_exceptions', 3), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), 
('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4005399Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4005453Z graph_break [] 2025-12-04T12:10:20.4005543Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.4005631Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4006152Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4006218Z current_size = base.storage().size() 2025-12-04T12:10:20.4006274Z Autotune Choices Stats: 2025-12-04T12:10:20.4006655Z {"num_choices": 18, "num_triton_choices": 17, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007158999796956778, "best_triton_pos": 0} 2025-12-04T12:10:20.4006736Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.4006800Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4006938Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4007185Z triton_mm_15 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4007433Z triton_mm_9 0.0075 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4007674Z triton_mm_7 0.0075 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4007915Z triton_mm_17 0.0075 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4008152Z triton_mm_10 0.0076 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4008390Z triton_mm_11 0.0078 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4008627Z triton_mm_8 0.0079 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4008866Z triton_mm_18 0.0080 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4009107Z triton_mm_13 0.0080 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4009165Z _scaled_mm 0.0086 ms 83.6% 
2025-12-04T12:10:20.4009309Z SingleProcess AUTOTUNE benchmarking takes 0.0873 seconds and 8.0564 seconds precompiling for 18 choices 2025-12-04T12:10:20.4009397Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4009457Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4009540Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4009657Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4010215Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4010271Z graph_break [] 2025-12-04T12:10:20.4010348Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.4010438Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4010814Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.4010923Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.4010981Z Autotune Choices Stats: 2025-12-04T12:10:20.4011358Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:20.4011450Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.4011515Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4011652Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4011897Z triton_mm_31 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4012139Z triton_mm_32 0.0070 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4012375Z triton_mm_38 0.0072 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4012614Z triton_mm_33 0.0074 ms 92.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4012853Z triton_mm_30 0.0075 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4013091Z triton_mm_37 0.0076 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4013328Z triton_mm_34 0.0078 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4013567Z triton_mm_27 0.0080 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4013819Z triton_mm_35 0.0085 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4013900Z _scaled_mm 0.0086 ms 79.9% 2025-12-04T12:10:20.4014046Z SingleProcess AUTOTUNE benchmarking takes 0.1591 seconds and 0.4953 seconds precompiling for 21 choices 2025-12-04T12:10:20.4014135Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4014194Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4014266Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4014381Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4014877Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), 
('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4014932Z graph_break [] 2025-12-04T12:10:20.4015011Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:20.4015100Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4015157Z Autotune Choices Stats: 2025-12-04T12:10:20.4015541Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_53", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006639999803155661, "best_triton_pos": 0} 2025-12-04T12:10:20.4015620Z AUTOTUNE scaled_mm(257x32, 32x2048, 257x1, 1x2048, 2048) 2025-12-04T12:10:20.4015684Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4015820Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4016063Z triton_mm_53 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4016299Z triton_mm_57 0.0070 ms 95.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4016538Z triton_mm_58 0.0070 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4016774Z triton_mm_54 0.0072 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4017019Z triton_mm_47 0.0073 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4017257Z triton_mm_50 0.0074 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4017497Z triton_mm_55 0.0077 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4017739Z triton_mm_51 0.0077 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4017998Z triton_mm_52 0.0078 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4018253Z triton_mm_48 0.0078 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4018398Z SingleProcess AUTOTUNE benchmarking takes 0.1234 seconds and 0.3595 seconds precompiling for 21 choices 2025-12-04T12:10:20.4018607Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-debc414a89adc35f.xml - 
2025-12-04T12:10:20.4018684Z =========================== short test summary info ============================ 2025-12-04T12:10:20.4019282Z FAILED [0.9225s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1145044992 and is now 1214251008. 2025-12-04T12:10:20.4019286Z 2025-12-04T12:10:20.4019377Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4019658Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4019661Z 2025-12-04T12:10:20.4019767Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4019846Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.4019932Z ================= 1 failed, 187 deselected, 2 rerun in 12.26s ================== 2025-12-04T12:10:20.4019990Z Got exit code 1 2025-12-04T12:10:20.4020237Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4020380Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.4020541Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-51ebf3b8587cd720.xml 2025-12-04T12:10:20.4020615Z ============================= test session starts ============================== 2025-12-04T12:10:20.4020744Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.4020804Z cachedir: .pytest_cache 2025-12-04T12:10:20.4020977Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.4021041Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.4021099Z configfile: pytest.ini 2025-12-04T12:10:20.4021279Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.4021369Z collecting ... collected 188 items / 95 deselected / 93 selected 2025-12-04T12:10:20.4021440Z stepcurrent: skipping 95 already run items. 
2025-12-04T12:10:20.4021502Z Running 93 items in this shard 2025-12-04T12:10:20.4021504Z 2025-12-04T12:10:20.4021735Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.1779s] [ 1%] 2025-12-04T12:10:20.4021960Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8473s] [ 1%] 2025-12-04T12:10:20.4022187Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.7701s] [ 1%] 2025-12-04T12:10:20.4022202Z 2025-12-04T12:10:20.4022272Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.4022425Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4022487Z Traceback (most recent call last): 2025-12-04T12:10:20.4022661Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4022719Z method(*args, **kwargs) 2025-12-04T12:10:20.4022886Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4022944Z method(*args, **kwargs) 2025-12-04T12:10:20.4023109Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4023165Z with policy(): 2025-12-04T12:10:20.4023331Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4023389Z raise RuntimeError(msg) 2025-12-04T12:10:20.4023802Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1035993088. 2025-12-04T12:10:20.4023805Z 2025-12-04T12:10:20.4023895Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4024164Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4024171Z 2025-12-04T12:10:20.4024274Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4024365Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4024425Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4024499Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4024996Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4025111Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4025165Z graph_break [] 2025-12-04T12:10:20.4025244Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4025334Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4025834Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4025900Z current_size = base.storage().size() 2025-12-04T12:10:20.4025957Z Autotune Choices Stats: 2025-12-04T12:10:20.4026341Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:20.4026449Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4026518Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4026653Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4026904Z triton_mm_2 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4027145Z triton_mm_6 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4027384Z triton_mm_8 0.0066 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4027624Z triton_mm_3 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4027862Z triton_mm_9 0.0076 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4028109Z triton_mm_4 0.0077 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4028344Z triton_mm_7 0.0078 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4028581Z triton_mm_5 0.0083 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4028816Z triton_mm_1 0.0089 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4029051Z triton_mm_0 0.0121 ms 53.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4029196Z SingleProcess AUTOTUNE benchmarking takes 0.0544 seconds and 0.2379 seconds precompiling for 11 choices 2025-12-04T12:10:20.4029349Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4029414Z Traceback (most recent call last): 2025-12-04T12:10:20.4029584Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4029642Z method(*args, **kwargs) 2025-12-04T12:10:20.4029807Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4029864Z method(*args, **kwargs) 2025-12-04T12:10:20.4030030Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4030086Z with policy(): 2025-12-04T12:10:20.4030290Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4030360Z raise RuntimeError(msg) 2025-12-04T12:10:20.4030771Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1035993088 and is now 1084227584. 
2025-12-04T12:10:20.4030793Z 2025-12-04T12:10:20.4030883Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4031153Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4031156Z 2025-12-04T12:10:20.4031260Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4031349Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4031409Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4031482Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4031982Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4032110Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4032164Z graph_break [] 2025-12-04T12:10:20.4032243Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4032331Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4032829Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4032894Z current_size = base.storage().size() 2025-12-04T12:10:20.4032950Z Autotune Choices Stats: 2025-12-04T12:10:20.4033331Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:20.4033407Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4033474Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4033610Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4033858Z triton_mm_2 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4034098Z triton_mm_6 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4034336Z triton_mm_8 0.0066 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4034571Z triton_mm_3 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4034845Z triton_mm_9 0.0076 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4035092Z triton_mm_4 0.0077 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4035327Z triton_mm_7 0.0078 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4035566Z triton_mm_5 0.0083 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4035804Z triton_mm_1 0.0089 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4036039Z triton_mm_0 0.0121 ms 53.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4036193Z SingleProcess AUTOTUNE benchmarking takes 0.0544 seconds and 0.2379 seconds precompiling for 11 choices 2025-12-04T12:10:20.4036283Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4036342Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4036414Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4036530Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4037024Z inductor 
[('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('async_compile_cache_miss', 6), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4037079Z graph_break [] 2025-12-04T12:10:20.4037158Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4037248Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4037305Z Autotune Choices Stats: 2025-12-04T12:10:20.4037681Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_18", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.4037759Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4037825Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4037961Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4038207Z triton_mm_18 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4038447Z triton_mm_16 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4038710Z triton_mm_13 0.0072 ms 86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4038959Z triton_mm_14 0.0076 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4039197Z triton_mm_17 0.0077 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4039433Z triton_mm_15 0.0079 ms 79.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4039674Z triton_mm_12 0.0086 ms 72.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4039910Z triton_mm_11 0.0087 ms 71.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4040192Z triton_mm_19 0.0097 ms 64.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4040429Z triton_mm_10 0.0119 ms 52.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4040574Z SingleProcess AUTOTUNE benchmarking takes 0.0602 
seconds and 0.2197 seconds precompiling for 11 choices 2025-12-04T12:10:20.4040645Z =================================== FAILURES =================================== 2025-12-04T12:10:20.4040799Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4040862Z Traceback (most recent call last): 2025-12-04T12:10:20.4041034Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4041092Z method(*args, **kwargs) 2025-12-04T12:10:20.4041258Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4041315Z method(*args, **kwargs) 2025-12-04T12:10:20.4041481Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4041539Z with policy(): 2025-12-04T12:10:20.4041705Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4041766Z raise RuntimeError(msg) 2025-12-04T12:10:20.4042165Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1084227584 and is now 1132462080. 
2025-12-04T12:10:20.4042168Z 2025-12-04T12:10:20.4042258Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4042530Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4042544Z 2025-12-04T12:10:20.4042646Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4042736Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4042820Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4042894Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4043390Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4043504Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4043557Z graph_break [] 2025-12-04T12:10:20.4043634Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4043723Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4044222Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4044287Z current_size = base.storage().size() 2025-12-04T12:10:20.4044344Z Autotune Choices Stats: 2025-12-04T12:10:20.4044739Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:20.4044817Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4044885Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4045022Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4045269Z triton_mm_2 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4045508Z triton_mm_6 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4045745Z triton_mm_8 0.0066 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4045984Z triton_mm_3 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4046222Z triton_mm_9 0.0076 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4046459Z triton_mm_4 0.0077 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4046694Z triton_mm_7 0.0078 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4046950Z triton_mm_5 0.0083 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4047196Z triton_mm_1 0.0089 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4047431Z triton_mm_0 0.0121 ms 53.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4047575Z SingleProcess AUTOTUNE benchmarking takes 0.0544 seconds and 0.2379 seconds precompiling for 11 choices 2025-12-04T12:10:20.4047663Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4047724Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4047796Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4047913Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4048417Z inductor 
[('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('async_compile_cache_miss', 6), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4048471Z graph_break [] 2025-12-04T12:10:20.4048549Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4048638Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4048697Z Autotune Choices Stats: 2025-12-04T12:10:20.4049070Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_18", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.4049145Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4049211Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4049347Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4049591Z triton_mm_18 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4049830Z triton_mm_16 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4050075Z triton_mm_13 0.0072 ms 86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4050365Z triton_mm_14 0.0076 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4050603Z triton_mm_17 0.0077 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4050839Z triton_mm_15 0.0079 ms 79.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4051126Z triton_mm_12 0.0086 ms 72.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4051363Z triton_mm_11 0.0087 ms 71.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4051603Z triton_mm_19 0.0097 ms 64.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4051841Z triton_mm_10 0.0119 ms 52.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4051988Z SingleProcess AUTOTUNE benchmarking takes 0.0602 
seconds and 0.2197 seconds precompiling for 11 choices 2025-12-04T12:10:20.4052077Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4052136Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4052208Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4052322Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4052825Z inductor [('triton_bundler_save_kernel', 88), ('async_compile_cache_miss', 12), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4052880Z graph_break [] 2025-12-04T12:10:20.4052957Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4053047Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4053105Z Autotune Choices Stats: 2025-12-04T12:10:20.4053480Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_28", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007360000163316727, "best_triton_pos": 0} 2025-12-04T12:10:20.4053553Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4053619Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4053753Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4054000Z triton_mm_28 0.0074 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4054244Z triton_mm_29 0.0074 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4054484Z triton_mm_24 0.0074 ms 99.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4054721Z triton_mm_23 0.0076 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4054978Z triton_mm_27 0.0080 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4055226Z triton_mm_26 0.0080 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4055467Z triton_mm_22 0.0080 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4055707Z triton_mm_25 0.0084 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4055947Z triton_mm_21 0.0092 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4056192Z triton_mm_20 0.0119 ms 62.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4056338Z SingleProcess AUTOTUNE benchmarking takes 0.0762 seconds and 0.2770 seconds precompiling for 11 choices 2025-12-04T12:10:20.4056551Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-51ebf3b8587cd720.xml - 2025-12-04T12:10:20.4056630Z =========================== short test summary info ============================ 2025-12-04T12:10:20.4057224Z FAILED [0.7701s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1084227584 and is now 1132462080. 2025-12-04T12:10:20.4057228Z 2025-12-04T12:10:20.4057321Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4057593Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4057595Z 2025-12-04T12:10:20.4057698Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4057777Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.4057862Z ================== 1 failed, 95 deselected, 2 rerun in 3.82s =================== 2025-12-04T12:10:20.4057919Z Got exit code 1 2025-12-04T12:10:20.4057977Z Retrying single test... 2025-12-04T12:10:20.4058138Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6e430e1ede184c2e.xml 2025-12-04T12:10:20.4058211Z ============================= test session starts ============================== 2025-12-04T12:10:20.4058339Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.4058396Z cachedir: .pytest_cache 2025-12-04T12:10:20.4058571Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.4058634Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.4058692Z configfile: pytest.ini 2025-12-04T12:10:20.4058870Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.4058972Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.4059259Z stepcurrent: skipping 95 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4059322Z Running 1 items in this shard 2025-12-04T12:10:20.4059324Z 2025-12-04T12:10:20.4059550Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.1185s] [100%] 2025-12-04T12:10:20.4059773Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8662s] [100%] 2025-12-04T12:10:20.4059977Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.7189s] [100%] 2025-12-04T12:10:20.4059980Z 2025-12-04T12:10:20.4060049Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.4060230Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4060294Z Traceback (most recent call last): 2025-12-04T12:10:20.4060468Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4060526Z method(*args, **kwargs) 2025-12-04T12:10:20.4060711Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4060770Z method(*args, **kwargs) 2025-12-04T12:10:20.4060936Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4060992Z with policy(): 2025-12-04T12:10:20.4061162Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4061221Z raise RuntimeError(msg) 
2025-12-04T12:10:20.4061622Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1035993088. 2025-12-04T12:10:20.4061624Z 2025-12-04T12:10:20.4061715Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4061984Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4061986Z 2025-12-04T12:10:20.4062090Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4062179Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4062242Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4062318Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4062817Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4062931Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4062988Z graph_break [] 2025-12-04T12:10:20.4063065Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4063167Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4063679Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4063755Z current_size = base.storage().size() 2025-12-04T12:10:20.4063815Z Autotune Choices Stats: 2025-12-04T12:10:20.4064196Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006198999937623739, "best_triton_pos": 0} 2025-12-04T12:10:20.4064274Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4064344Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4064481Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4064729Z triton_mm_6 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4064990Z triton_mm_8 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4065231Z triton_mm_2 0.0071 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4065469Z triton_mm_4 0.0076 ms 81.1% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4065704Z triton_mm_7 0.0078 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4065940Z triton_mm_5 0.0080 ms 77.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4066179Z triton_mm_9 0.0085 ms 73.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4066418Z triton_mm_1 0.0086 ms 72.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4066653Z triton_mm_3 0.0109 ms 57.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4066891Z triton_mm_0 0.0120 ms 51.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4067033Z SingleProcess AUTOTUNE benchmarking takes 0.0519 seconds and 0.2391 seconds precompiling for 11 choices 2025-12-04T12:10:20.4067188Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 
2025-12-04T12:10:20.4067260Z Traceback (most recent call last): 2025-12-04T12:10:20.4067430Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4067509Z method(*args, **kwargs) 2025-12-04T12:10:20.4067677Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4067733Z method(*args, **kwargs) 2025-12-04T12:10:20.4067900Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4067953Z with policy(): 2025-12-04T12:10:20.4068122Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4068180Z raise RuntimeError(msg) 2025-12-04T12:10:20.4068583Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1035993088 and is now 1084227584. 
2025-12-04T12:10:20.4068587Z 2025-12-04T12:10:20.4068678Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4068948Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4068950Z 2025-12-04T12:10:20.4069062Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4069151Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4069212Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4069285Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4069782Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4069897Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4069950Z graph_break [] 2025-12-04T12:10:20.4070028Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4070152Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4070650Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4070715Z current_size = base.storage().size() 2025-12-04T12:10:20.4070775Z Autotune Choices Stats: 2025-12-04T12:10:20.4071153Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006198999937623739, "best_triton_pos": 0} 2025-12-04T12:10:20.4071229Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4071299Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4071436Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4071682Z triton_mm_6 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4071962Z triton_mm_8 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4072204Z triton_mm_2 0.0071 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4072440Z triton_mm_4 0.0076 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4072677Z triton_mm_7 0.0078 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4072914Z triton_mm_5 0.0080 ms 77.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4073152Z triton_mm_9 0.0085 ms 73.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4073400Z triton_mm_1 0.0086 ms 72.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4073635Z triton_mm_3 0.0109 ms 57.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4073874Z triton_mm_0 0.0120 ms 51.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4074017Z SingleProcess AUTOTUNE benchmarking takes 0.0519 seconds and 0.2391 seconds precompiling for 11 choices 2025-12-04T12:10:20.4074107Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4074166Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4074243Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4074357Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4074852Z inductor 
[('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('async_compile_cache_miss', 3), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4074912Z graph_break [] 2025-12-04T12:10:20.4074991Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4075081Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4075137Z Autotune Choices Stats: 2025-12-04T12:10:20.4075513Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_18", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:20.4075597Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4075664Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4075820Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4076066Z triton_mm_18 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4076308Z triton_mm_19 0.0066 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4076545Z triton_mm_16 0.0069 ms 93.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4076785Z triton_mm_13 0.0072 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4077024Z triton_mm_12 0.0073 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4077270Z triton_mm_14 0.0076 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4077509Z triton_mm_17 0.0078 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4077747Z triton_mm_15 0.0078 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4077984Z triton_mm_11 0.0086 ms 75.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4078219Z triton_mm_10 0.0120 ms 54.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4078363Z SingleProcess AUTOTUNE benchmarking takes 0.0516 
seconds and 0.2134 seconds precompiling for 11 choices 2025-12-04T12:10:20.4078431Z =================================== FAILURES =================================== 2025-12-04T12:10:20.4078586Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4078650Z Traceback (most recent call last): 2025-12-04T12:10:20.4078821Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4078878Z method(*args, **kwargs) 2025-12-04T12:10:20.4079044Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4079101Z method(*args, **kwargs) 2025-12-04T12:10:20.4079268Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4079323Z with policy(): 2025-12-04T12:10:20.4079489Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4079559Z raise RuntimeError(msg) 2025-12-04T12:10:20.4079968Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1084227584 and is now 1132462080. 
2025-12-04T12:10:20.4079987Z 2025-12-04T12:10:20.4080078Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4080367Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4080369Z 2025-12-04T12:10:20.4080474Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4080561Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4080622Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4080695Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4081190Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4081304Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4081371Z graph_break [] 2025-12-04T12:10:20.4081451Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4081541Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4082036Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4082101Z current_size = base.storage().size() 2025-12-04T12:10:20.4082159Z Autotune Choices Stats: 2025-12-04T12:10:20.4082538Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006198999937623739, "best_triton_pos": 0} 2025-12-04T12:10:20.4082612Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4082680Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4082819Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4083065Z triton_mm_6 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4083302Z triton_mm_8 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4083544Z triton_mm_2 0.0071 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4083779Z triton_mm_4 0.0076 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4084042Z triton_mm_7 0.0078 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4084289Z triton_mm_5 0.0080 ms 77.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4084527Z triton_mm_9 0.0085 ms 73.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4084763Z triton_mm_1 0.0086 ms 72.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4084999Z triton_mm_3 0.0109 ms 57.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4085237Z triton_mm_0 0.0120 ms 51.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4085390Z SingleProcess AUTOTUNE benchmarking takes 0.0519 seconds and 0.2391 seconds precompiling for 11 choices 2025-12-04T12:10:20.4085481Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4085539Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4085612Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4085727Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4086218Z inductor 
[('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('async_compile_cache_miss', 3), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4086273Z graph_break [] 2025-12-04T12:10:20.4086349Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4086441Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4086497Z Autotune Choices Stats: 2025-12-04T12:10:20.4086873Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_18", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:20.4086948Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4087015Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4087149Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4087395Z triton_mm_18 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4087639Z triton_mm_19 0.0066 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4087886Z triton_mm_16 0.0069 ms 93.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4088143Z triton_mm_13 0.0072 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4088385Z triton_mm_12 0.0073 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4088622Z triton_mm_14 0.0076 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4088859Z triton_mm_17 0.0078 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4089095Z triton_mm_15 0.0078 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4089341Z triton_mm_11 0.0086 ms 75.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4089578Z triton_mm_10 0.0120 ms 54.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4089722Z SingleProcess AUTOTUNE benchmarking takes 0.0516 
seconds and 0.2134 seconds precompiling for 11 choices 2025-12-04T12:10:20.4089810Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4089871Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4089943Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4090058Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4090587Z inductor [('triton_bundler_save_kernel', 88), ('async_compile_cache_miss', 12), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4090640Z graph_break [] 2025-12-04T12:10:20.4090717Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4090807Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4090864Z Autotune Choices Stats: 2025-12-04T12:10:20.4091242Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:20.4091317Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4091383Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4091519Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4091765Z triton_mm_29 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4092036Z triton_mm_22 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4092290Z triton_mm_26 0.0065 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4092529Z triton_mm_28 0.0067 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4092766Z triton_mm_23 0.0072 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4093004Z triton_mm_27 0.0077 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4093241Z triton_mm_24 0.0080 ms 77.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4093490Z triton_mm_25 0.0084 ms 73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4093729Z triton_mm_21 0.0095 ms 65.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4093969Z triton_mm_20 0.0120 ms 51.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4094112Z SingleProcess AUTOTUNE benchmarking takes 0.0661 seconds and 0.2255 seconds precompiling for 11 choices 2025-12-04T12:10:20.4094316Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6e430e1ede184c2e.xml - 2025-12-04T12:10:20.4094393Z =========================== short test summary info ============================ 2025-12-04T12:10:20.4094982Z FAILED [0.7189s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1084227584 and is now 1132462080. 2025-12-04T12:10:20.4094987Z 2025-12-04T12:10:20.4095076Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4095346Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4095348Z 2025-12-04T12:10:20.4095451Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4095529Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.4095613Z ================== 1 failed, 187 deselected, 2 rerun in 3.72s ================== 2025-12-04T12:10:20.4095667Z Got exit code 1 2025-12-04T12:10:20.4095733Z Retrying single test... 2025-12-04T12:10:20.4095892Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d73a84840c537a44.xml 2025-12-04T12:10:20.4097334Z ============================= test session starts ============================== 2025-12-04T12:10:20.4097466Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.4097525Z cachedir: .pytest_cache 2025-12-04T12:10:20.4097699Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.4097764Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.4097825Z configfile: pytest.ini 2025-12-04T12:10:20.4098006Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.4098097Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.4098363Z stepcurrent: skipping 95 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4098428Z Running 1 items in this shard 2025-12-04T12:10:20.4098430Z 2025-12-04T12:10:20.4098658Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.4134s] [100%] 2025-12-04T12:10:20.4098885Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8533s] [100%] 2025-12-04T12:10:20.4099096Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.8351s] [100%] 2025-12-04T12:10:20.4099099Z 2025-12-04T12:10:20.4099168Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.4099323Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4099386Z Traceback (most recent call last): 2025-12-04T12:10:20.4099563Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4099621Z method(*args, **kwargs) 2025-12-04T12:10:20.4099789Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4099846Z method(*args, **kwargs) 2025-12-04T12:10:20.4100011Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4100066Z with policy(): 2025-12-04T12:10:20.4100271Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4100331Z raise RuntimeError(msg) 
2025-12-04T12:10:20.4100733Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1035993088. 2025-12-04T12:10:20.4100738Z 2025-12-04T12:10:20.4100828Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4101098Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4101102Z 2025-12-04T12:10:20.4101203Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4101293Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4101373Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4101448Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4101956Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4102083Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4102137Z graph_break [] 2025-12-04T12:10:20.4102216Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4102305Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4102812Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4102880Z current_size = base.storage().size() 2025-12-04T12:10:20.4102937Z Autotune Choices Stats: 2025-12-04T12:10:20.4103332Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.4103408Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4103476Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4103611Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4103859Z triton_mm_6 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4104101Z triton_mm_2 0.0066 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4104338Z triton_mm_8 0.0066 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4104577Z triton_mm_3 0.0073 ms 85.7% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4104812Z triton_mm_5 0.0078 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4105047Z triton_mm_7 0.0078 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4105282Z triton_mm_1 0.0086 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4105519Z triton_mm_9 0.0097 ms 64.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4105774Z triton_mm_4 0.0113 ms 55.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4106018Z triton_mm_0 0.0119 ms 52.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4106166Z SingleProcess AUTOTUNE benchmarking takes 0.0577 seconds and 0.2997 seconds precompiling for 11 choices 2025-12-04T12:10:20.4106321Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 
2025-12-04T12:10:20.4106385Z Traceback (most recent call last): 2025-12-04T12:10:20.4106557Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4106615Z method(*args, **kwargs) 2025-12-04T12:10:20.4106782Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4106840Z method(*args, **kwargs) 2025-12-04T12:10:20.4107006Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4107060Z with policy(): 2025-12-04T12:10:20.4107225Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4107282Z raise RuntimeError(msg) 2025-12-04T12:10:20.4107694Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1035993088 and is now 1084227584. 
2025-12-04T12:10:20.4107698Z 2025-12-04T12:10:20.4107788Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4108060Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4108063Z 2025-12-04T12:10:20.4108164Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4108254Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4108313Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4108387Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4108884Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4108999Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4109053Z graph_break [] 2025-12-04T12:10:20.4109130Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4109219Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4109713Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4109779Z current_size = base.storage().size() 2025-12-04T12:10:20.4109846Z Autotune Choices Stats: 2025-12-04T12:10:20.4110272Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.4110360Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4110427Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4110564Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4110808Z triton_mm_6 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4111050Z triton_mm_2 0.0066 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4111289Z triton_mm_8 0.0066 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4111538Z triton_mm_3 0.0073 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4111774Z triton_mm_5 0.0078 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4112011Z triton_mm_7 0.0078 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4112248Z triton_mm_1 0.0086 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4112486Z triton_mm_9 0.0097 ms 64.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4112721Z triton_mm_4 0.0113 ms 55.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4112957Z triton_mm_0 0.0119 ms 52.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4113103Z SingleProcess AUTOTUNE benchmarking takes 0.0577 seconds and 0.2997 seconds precompiling for 11 choices 2025-12-04T12:10:20.4113192Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4113250Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4113324Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4113438Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4113936Z inductor 
[('triton_bundler_save_kernel', 88), ('async_compile_cache_miss', 12), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4114009Z graph_break [] 2025-12-04T12:10:20.4114107Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4114196Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4114254Z Autotune Choices Stats: 2025-12-04T12:10:20.4114632Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_19", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:20.4114707Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4114772Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4114910Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4115158Z triton_mm_19 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4115398Z triton_mm_16 0.0063 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4115648Z triton_mm_12 0.0064 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, 
BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4115886Z triton_mm_18 0.0064 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4115947Z _scaled_mm 0.0070 ms 85.1% 2025-12-04T12:10:20.4116185Z triton_mm_13 0.0072 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4116425Z triton_mm_14 0.0076 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4116663Z triton_mm_17 0.0077 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4116898Z triton_mm_15 0.0086 ms 69.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4117140Z triton_mm_11 0.0091 ms 65.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4117283Z SingleProcess AUTOTUNE benchmarking takes 0.0630 seconds and 0.2304 seconds precompiling for 11 choices 2025-12-04T12:10:20.4117353Z =================================== FAILURES =================================== 
2025-12-04T12:10:20.4117507Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4117570Z Traceback (most recent call last): 2025-12-04T12:10:20.4117740Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4117811Z method(*args, **kwargs) 2025-12-04T12:10:20.4117978Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4118057Z method(*args, **kwargs) 2025-12-04T12:10:20.4118224Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4118277Z with policy(): 2025-12-04T12:10:20.4118445Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4118502Z raise RuntimeError(msg) 2025-12-04T12:10:20.4118907Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1084227584 and is now 1132462080. 
2025-12-04T12:10:20.4118910Z 2025-12-04T12:10:20.4118999Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4119269Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4119272Z 2025-12-04T12:10:20.4119373Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4119464Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4119522Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4119606Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4120132Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4120247Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4120303Z graph_break [] 2025-12-04T12:10:20.4120380Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4120470Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4120976Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4121040Z current_size = base.storage().size() 2025-12-04T12:10:20.4121098Z Autotune Choices Stats: 2025-12-04T12:10:20.4121477Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.4121552Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4121620Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4121756Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4122001Z triton_mm_6 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4122244Z triton_mm_2 0.0066 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4122522Z triton_mm_8 0.0066 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4122760Z triton_mm_3 0.0073 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4122997Z triton_mm_5 0.0078 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4123233Z triton_mm_7 0.0078 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4123474Z triton_mm_1 0.0086 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4123711Z triton_mm_9 0.0097 ms 64.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4123958Z triton_mm_4 0.0113 ms 55.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4124197Z triton_mm_0 0.0119 ms 52.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4124343Z SingleProcess AUTOTUNE benchmarking takes 0.0577 seconds and 0.2997 seconds precompiling for 11 choices 2025-12-04T12:10:20.4124433Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4124491Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4124563Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4124677Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4125172Z inductor 
[('triton_bundler_save_kernel', 88), ('async_compile_cache_miss', 12), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4125225Z graph_break [] 2025-12-04T12:10:20.4125304Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4125393Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4125450Z Autotune Choices Stats: 2025-12-04T12:10:20.4125832Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_19", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:20.4125906Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4125972Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4126107Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4126378Z triton_mm_19 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4126629Z triton_mm_16 0.0063 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4126870Z triton_mm_12 0.0064 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, 
BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4127107Z triton_mm_18 0.0064 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4127167Z _scaled_mm 0.0070 ms 85.1% 2025-12-04T12:10:20.4127404Z triton_mm_13 0.0072 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4127644Z triton_mm_14 0.0076 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4127897Z triton_mm_17 0.0077 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4128133Z triton_mm_15 0.0086 ms 69.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4128371Z triton_mm_11 0.0091 ms 65.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4128514Z SingleProcess AUTOTUNE benchmarking takes 0.0630 seconds and 0.2304 seconds precompiling for 11 choices 2025-12-04T12:10:20.4128603Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.4128661Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4128735Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4128848Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4129345Z inductor [('triton_bundler_save_kernel', 88), ('async_compile_cache_miss', 12), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4129400Z graph_break [] 2025-12-04T12:10:20.4129477Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:20.4129565Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4129622Z Autotune Choices Stats: 2025-12-04T12:10:20.4129998Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:20.4130080Z AUTOTUNE scaled_mm(33x1024, 1024x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.4130188Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4130338Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4130598Z triton_mm_29 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4130836Z triton_mm_26 0.0065 ms 91.4% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4131076Z triton_mm_22 0.0067 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4131316Z triton_mm_28 0.0072 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4131378Z _scaled_mm 0.0073 ms 81.9% 2025-12-04T12:10:20.4131614Z triton_mm_24 0.0074 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4131863Z triton_mm_23 0.0075 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4132100Z triton_mm_25 0.0078 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.4132337Z triton_mm_27 0.0092 ms 64.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4132575Z triton_mm_21 0.0099 ms 60.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4132719Z SingleProcess AUTOTUNE benchmarking takes 0.0627 seconds and 0.2479 seconds precompiling for 11 choices 2025-12-04T12:10:20.4132925Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d73a84840c537a44.xml - 2025-12-04T12:10:20.4133002Z =========================== short test summary info ============================ 2025-12-04T12:10:20.4133592Z FAILED [0.8351s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1084227584 and is now 1132462080. 2025-12-04T12:10:20.4133596Z 2025-12-04T12:10:20.4133687Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4133956Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4133958Z 2025-12-04T12:10:20.4134060Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4134138Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.4134235Z ================== 1 failed, 187 deselected, 2 rerun in 4.12s ================== 2025-12-04T12:10:20.4134290Z Got exit code 1 2025-12-04T12:10:20.4134530Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.4134673Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.4134831Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-61893e92c3b9e8a7.xml 2025-12-04T12:10:20.4134907Z ============================= test session starts ============================== 2025-12-04T12:10:20.4135038Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.4135096Z cachedir: .pytest_cache 2025-12-04T12:10:20.4135267Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.4135332Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.4135388Z configfile: pytest.ini 2025-12-04T12:10:20.4135570Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.4135660Z collecting ... collected 188 items / 96 deselected / 92 selected 2025-12-04T12:10:20.4135730Z stepcurrent: skipping 96 already run items. 2025-12-04T12:10:20.4135790Z Running 92 items in this shard 2025-12-04T12:10:20.4135792Z 2025-12-04T12:10:20.4136728Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmpm0arv19b/3o/c3ozoay57awgiq6xpuxa3spuyevy367x6igacilyrp77le5xavwi.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8) 2025-12-04T12:10:20.4136895Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4137128Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4137302Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4137606Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4137756Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4138030Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4138184Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4138454Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in 
precompile 2025-12-04T12:10:20.4138625Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4138931Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4139090Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4139378Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4139587Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4139917Z E1204 11:01:15.305000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4140726Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpm0arv19b/sx/csxg5s2kbrrirurdwogcwmnmcogrpoawcfp6lpewx5v6rophy2j2.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8) 2025-12-04T12:10:20.4140889Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4141120Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4141291Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4141591Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4141738Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4142007Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4142159Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4142427Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4142597Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4142879Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4143028Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4143317Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4143553Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4143899Z E1204 11:01:15.314000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4144643Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpm0arv19b/6k/c6k4655zjwzidfz2nuoodb6m7sz4idskle2xvbdktvqwfswe43lc.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.4144805Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4145034Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4145202Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4145516Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4145662Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4145930Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4146082Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4146348Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4146520Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4146799Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4146948Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4147236Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4147440Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4147767Z E1204 11:01:15.317000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4148524Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpm0arv19b/ve/cveffpifa2zggcbvn5qmbmxobqre7du74dpfxngne2otfhzflg3r.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.4148699Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4148926Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4149094Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4149394Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4149538Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4149816Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4149966Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4150272Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4150444Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4150726Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4150873Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4151160Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4151366Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4151691Z E1204 11:01:15.318000 732280 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4151762Z ('RERUN', {'yellow': True}) [3.0012s] [ 1%] 2025-12-04T12:10:20.4152094Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda E1204 11:01:16.631000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4152401Z E1204 11:01:16.631000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.4152557Z E1204 11:01:16.631000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.4152726Z E1204 11:01:16.645000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4153046Z E1204 11:01:16.645000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.4153187Z E1204 11:01:16.645000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4153345Z E1204 11:01:16.678000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4153648Z E1204 11:01:16.678000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.4153790Z E1204 11:01:16.678000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4153948Z E1204 11:01:16.825000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4154251Z E1204 11:01:16.825000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.4154403Z E1204 11:01:16.825000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.4154469Z ('RERUN', {'yellow': True}) [1.2134s] [ 1%] 2025-12-04T12:10:20.4154800Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda E1204 11:01:17.665000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4155108Z E1204 11:01:17.665000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.4155248Z E1204 11:01:17.665000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4155405Z E1204 11:01:17.680000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4155709Z E1204 11:01:17.680000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.4155853Z E1204 11:01:17.680000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4156009Z E1204 11:01:17.715000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4156315Z E1204 11:01:17.715000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.4156455Z E1204 11:01:17.715000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.4156612Z E1204 11:01:17.858000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4156917Z E1204 11:01:17.858000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.4157075Z E1204 11:01:17.858000 732280 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4157142Z FAILED [1.1517s] [ 1%] 2025-12-04T12:10:20.4157145Z 2025-12-04T12:10:20.4157215Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.4157372Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4157434Z Traceback (most recent call last): 2025-12-04T12:10:20.4157607Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4157664Z method(*args, **kwargs) 2025-12-04T12:10:20.4157831Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4157888Z method(*args, **kwargs) 2025-12-04T12:10:20.4158052Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4158108Z with policy(): 2025-12-04T12:10:20.4158274Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4158331Z raise RuntimeError(msg) 2025-12-04T12:10:20.4158749Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1086324736. 2025-12-04T12:10:20.4158752Z 2025-12-04T12:10:20.4158844Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4159118Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4159122Z 2025-12-04T12:10:20.4159225Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4159315Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4159375Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4159449Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4160017Z inductor [('triton_bundler_save_kernel', 280), ('generated_module_cache_miss', 34), ('benchmarking.InductorBenchmarker.benchmark_gpu', 31), ('select_algorithm_num_precompiles', 30), ('select_algorithm_num_precompilation_exceptions', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4160155Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4160210Z graph_break [] 2025-12-04T12:10:20.4160291Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4160381Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4160881Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. 
This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4160945Z current_size = base.storage().size() 2025-12-04T12:10:20.4161003Z Autotune Choices Stats: 2025-12-04T12:10:20.4161399Z {"num_choices": 31, "num_triton_choices": 30, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007400000002235174, "best_triton_pos": 0} 2025-12-04T12:10:20.4161514Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4161581Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4161717Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4161968Z triton_mm_31 0.0074 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4162212Z triton_mm_32 0.0079 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4162455Z triton_mm_19 0.0080 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4162693Z triton_mm_27 0.0087 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:20.4162942Z triton_mm_20 0.0092 ms 80.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4163179Z triton_mm_15 0.0099 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4163418Z triton_mm_23 0.0102 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4163654Z triton_mm_13 0.0104 ms 71.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4163895Z triton_mm_8 0.0107 ms 69.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4164133Z triton_mm_14 0.0107 ms 69.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4164278Z SingleProcess AUTOTUNE benchmarking takes 0.1230 seconds and 0.7581 seconds precompiling for 31 choices 2025-12-04T12:10:20.4164436Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4164497Z Traceback (most recent call last): 2025-12-04T12:10:20.4164669Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4164726Z method(*args, **kwargs) 2025-12-04T12:10:20.4164897Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4164954Z method(*args, **kwargs) 2025-12-04T12:10:20.4165119Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4165188Z with policy(): 2025-12-04T12:10:20.4165354Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4165422Z raise RuntimeError(msg) 2025-12-04T12:10:20.4165835Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1086324736 and is now 1184890880. 
2025-12-04T12:10:20.4165838Z 2025-12-04T12:10:20.4165928Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4166204Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4166206Z 2025-12-04T12:10:20.4166311Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4166401Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4166463Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4166537Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4167111Z inductor [('triton_bundler_save_kernel', 280), ('generated_module_cache_miss', 34), ('benchmarking.InductorBenchmarker.benchmark_gpu', 31), ('select_algorithm_num_precompiles', 30), ('select_algorithm_num_precompilation_exceptions', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4167226Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4167280Z graph_break [] 2025-12-04T12:10:20.4167361Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4167449Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4167947Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4168012Z current_size = base.storage().size() 2025-12-04T12:10:20.4168069Z Autotune Choices Stats: 2025-12-04T12:10:20.4168450Z {"num_choices": 31, "num_triton_choices": 30, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007400000002235174, "best_triton_pos": 0} 2025-12-04T12:10:20.4168533Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4168600Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4168738Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4168986Z triton_mm_31 0.0074 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4169230Z triton_mm_32 0.0079 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4169468Z triton_mm_19 0.0080 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4169727Z triton_mm_27 0.0087 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4169977Z triton_mm_20 0.0092 ms 80.1% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4170249Z triton_mm_15 0.0099 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4170485Z triton_mm_23 0.0102 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4170723Z triton_mm_13 0.0104 ms 71.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4170961Z triton_mm_8 0.0107 ms 69.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4171212Z triton_mm_14 0.0107 ms 69.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4171356Z SingleProcess AUTOTUNE benchmarking takes 0.1230 seconds and 0.7581 seconds precompiling for 31 choices 2025-12-04T12:10:20.4171448Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4171507Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4171580Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4171696Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4172198Z inductor 
[('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4172253Z graph_break [] 2025-12-04T12:10:20.4172332Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4172422Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4172480Z Autotune Choices Stats: 2025-12-04T12:10:20.4172857Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_65", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:20.4172937Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4173005Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4173142Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4173390Z triton_mm_65 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4173643Z triton_mm_53 0.0080 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4173728Z _scaled_mm 0.0081 ms 80.7% 2025-12-04T12:10:20.4173973Z triton_mm_66 0.0084 ms 
78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4174209Z triton_mm_61 0.0090 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4174445Z triton_mm_54 0.0095 ms 68.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4174683Z triton_mm_49 0.0100 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4174920Z triton_mm_57 0.0106 ms 61.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4175170Z triton_mm_42 0.0108 ms 60.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4175406Z triton_mm_47 0.0109 ms 59.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4175552Z SingleProcess AUTOTUNE benchmarking takes 0.2059 seconds and 0.4289 seconds precompiling for 35 choices 2025-12-04T12:10:20.4175623Z =================================== FAILURES =================================== 
2025-12-04T12:10:20.4175780Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4175842Z Traceback (most recent call last): 2025-12-04T12:10:20.4176013Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4176071Z method(*args, **kwargs) 2025-12-04T12:10:20.4176240Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4176297Z method(*args, **kwargs) 2025-12-04T12:10:20.4176462Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4176519Z with policy(): 2025-12-04T12:10:20.4176685Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4176744Z raise RuntimeError(msg) 2025-12-04T12:10:20.4177151Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1184890880 and is now 1283457024. 
2025-12-04T12:10:20.4177153Z 2025-12-04T12:10:20.4177244Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4177514Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4177537Z 2025-12-04T12:10:20.4177641Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4177730Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4177810Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4177883Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4178450Z inductor [('triton_bundler_save_kernel', 280), ('generated_module_cache_miss', 34), ('benchmarking.InductorBenchmarker.benchmark_gpu', 31), ('select_algorithm_num_precompiles', 30), ('select_algorithm_num_precompilation_exceptions', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4178564Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4178618Z graph_break [] 2025-12-04T12:10:20.4178698Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4178785Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4179282Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4179345Z current_size = base.storage().size() 2025-12-04T12:10:20.4179413Z Autotune Choices Stats: 2025-12-04T12:10:20.4179792Z {"num_choices": 31, "num_triton_choices": 30, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007400000002235174, "best_triton_pos": 0} 2025-12-04T12:10:20.4179874Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4179942Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4180080Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4180361Z triton_mm_31 0.0074 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4180607Z triton_mm_32 0.0079 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4180844Z triton_mm_19 0.0080 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4181082Z triton_mm_27 0.0087 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4181319Z triton_mm_20 0.0092 ms 80.1% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4181555Z triton_mm_15 0.0099 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4181795Z triton_mm_23 0.0102 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4182060Z triton_mm_13 0.0104 ms 71.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4182314Z triton_mm_8 0.0107 ms 69.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4182552Z triton_mm_14 0.0107 ms 69.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4182695Z SingleProcess AUTOTUNE benchmarking takes 0.1230 seconds and 0.7581 seconds precompiling for 31 choices 2025-12-04T12:10:20.4182785Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4182843Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4182918Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4183031Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4183543Z inductor 
[('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4183599Z graph_break [] 2025-12-04T12:10:20.4183676Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4183766Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4183823Z Autotune Choices Stats: 2025-12-04T12:10:20.4184201Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_65", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:20.4184281Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4184348Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4184486Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4184733Z triton_mm_65 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4184974Z triton_mm_53 0.0080 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4185034Z _scaled_mm 0.0081 ms 80.7% 2025-12-04T12:10:20.4185274Z triton_mm_66 0.0084 ms 
78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4185511Z triton_mm_61 0.0090 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4185750Z triton_mm_54 0.0095 ms 68.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4186006Z triton_mm_49 0.0100 ms 65.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4186255Z triton_mm_57 0.0106 ms 61.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4186497Z triton_mm_42 0.0108 ms 60.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4186733Z triton_mm_47 0.0109 ms 59.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4186877Z SingleProcess AUTOTUNE benchmarking takes 0.2059 seconds and 0.4289 seconds precompiling for 35 choices 2025-12-04T12:10:20.4186966Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.4187025Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4187096Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4187209Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4187715Z inductor [('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4187771Z graph_break [] 2025-12-04T12:10:20.4187850Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4187940Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4187998Z Autotune Choices Stats: 2025-12-04T12:10:20.4188371Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_87", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007720000110566616, "best_triton_pos": 0} 2025-12-04T12:10:20.4188453Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4188518Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4188654Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4188901Z triton_mm_87 0.0077 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4189145Z triton_mm_99 0.0078 ms 99.5% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4189388Z triton_mm_100 0.0081 ms 95.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4189447Z _scaled_mm 0.0082 ms 94.2% 2025-12-04T12:10:20.4189684Z triton_mm_88 0.0091 ms 84.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4189940Z triton_mm_83 0.0093 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4190220Z triton_mm_91 0.0099 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4190457Z triton_mm_81 0.0104 ms 74.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4190694Z triton_mm_95 0.0104 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4190935Z triton_mm_76 0.0107 ms 72.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4191080Z SingleProcess AUTOTUNE benchmarking takes 0.2059 seconds and 0.2853 seconds precompiling for 35 choices 2025-12-04T12:10:20.4191285Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-61893e92c3b9e8a7.xml - 2025-12-04T12:10:20.4191362Z =========================== short test summary info ============================ 2025-12-04T12:10:20.4191978Z FAILED [1.1517s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1184890880 and is now 1283457024. 2025-12-04T12:10:20.4191982Z 2025-12-04T12:10:20.4192072Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4192344Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4192346Z 2025-12-04T12:10:20.4192449Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4192528Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.4192612Z ================== 1 failed, 96 deselected, 2 rerun in 5.39s =================== 2025-12-04T12:10:20.4192665Z Got exit code 1 2025-12-04T12:10:20.4192722Z Retrying single test... 
2025-12-04T12:10:20.4192882Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7b2a1da6e08666d5.xml 2025-12-04T12:10:20.4192956Z ============================= test session starts ============================== 2025-12-04T12:10:20.4193084Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.4193143Z cachedir: .pytest_cache 2025-12-04T12:10:20.4193315Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.4193379Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.4193436Z configfile: pytest.ini 2025-12-04T12:10:20.4193616Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.4193708Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.4193972Z stepcurrent: skipping 96 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4194045Z Running 1 items in this shard 2025-12-04T12:10:20.4194061Z 2025-12-04T12:10:20.4194417Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda [W1204 11:01:27.401870434 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4194419Z 2025-12-04T12:10:20.4194750Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4195060Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4195210Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4195711Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4195989Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4196230Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4196456Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4196674Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4196919Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4197157Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4197400Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4197635Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4197876Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4198109Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4198350Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4198602Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4198854Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4199086Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4199328Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4199560Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4199800Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4200033Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4200289Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4200522Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4200763Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4200998Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4201203Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4201436Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4201677Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4201909Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4202151Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4202384Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4202600Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4202825Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4203013Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4203232Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4203764Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpvuk0_rj7/3o/c3ozoay57awgiq6xpuxa3spuyevy367x6igacilyrp77le5xavwi.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8) 2025-12-04T12:10:20.4203925Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4204155Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4204326Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4204632Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4204789Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4205064Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4205218Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4205490Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4205663Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4205945Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4206093Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4206381Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4206589Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4206919Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4207228Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4207373Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4207884Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4208172Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4208412Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4208637Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4208853Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4209094Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4209337Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4209577Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4209811Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4210051Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4210327Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4210569Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4210801Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4211045Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4211277Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4211519Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4211751Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4212005Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4212261Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4212464Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4212699Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4212939Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4213174Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4213378Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4213610Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4213862Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4214093Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4214335Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4214567Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4214784Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4215010Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4215185Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4215380Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4215498Z E1204 11:01:35.410000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.4215671Z [W1204 11:01:35.891816769 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.4215674Z 2025-12-04T12:10:20.4215998Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4216303Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4216468Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4216969Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4217235Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4217473Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4217694Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4217908Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4218159Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4218393Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4218635Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4218869Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4219112Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4219347Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4219586Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4219819Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4220060Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4220335Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4220575Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4220806Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4221072Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4221316Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4221522Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4221754Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4221993Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4222227Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4222431Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4222676Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4222916Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4223152Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4223395Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4223625Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4223844Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4224067Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4224243Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4224435Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4224976Z E1204 11:01:35.431000 738111 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpvuk0_rj7/sx/csxg5s2kbrrirurdwogcwmnmcogrpoawcfp6lpewx5v6rophy2j2.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8) 2025-12-04T12:10:20.4225137Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4225377Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4225575Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4225875Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4226022Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4226293Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4226446Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4226718Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4226888Z E1204 11:01:35.431000 738111 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4227182Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4227330Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4227619Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4227827Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4228153Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4228458Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4228603Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4229095Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4229362Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4229601Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4229833Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4230067Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4230347Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4230580Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4230821Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4231056Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4231298Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4231531Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4231785Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4232018Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4232260Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4232493Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4232734Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4232965Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4233210Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4233443Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4233649Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4233880Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4234120Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4234377Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4234592Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4234824Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4235066Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4235300Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4235541Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4235774Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4236001Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4236225Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4236398Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4236592Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4236711Z E1204 11:01:35.431000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.4236881Z [W1204 11:01:35.902303774 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.4236883Z 2025-12-04T12:10:20.4237208Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4237512Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4237658Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4238145Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4238410Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4238662Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4238901Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4239115Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4239358Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4239591Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4239835Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4240069Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4240332Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4240580Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4240819Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4241052Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4241293Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4241526Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4241771Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4242003Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4242244Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4242474Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4242678Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4242908Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4243167Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4243429Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4243632Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4243865Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4244104Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4244337Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4244578Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4244810Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4245036Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4245264Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4245439Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4245631Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4246158Z E1204 11:01:35.441000 738111 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpvuk0_rj7/6k/c6k4655zjwzidfz2nuoodb6m7sz4idskle2xvbdktvqwfswe43lc.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.4246317Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4246547Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4246717Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4247015Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4247163Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4247434Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4247597Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4247886Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4248056Z E1204 11:01:35.441000 738111 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4248337Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4248484Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4248772Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4248979Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4249313Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4249627Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4249773Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4250298Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4250565Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4250805Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4251025Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4251241Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4251485Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4251719Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4251960Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4252218Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4252473Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4252704Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4252946Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4253178Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4253422Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4253656Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4253907Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4254139Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4254379Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4254613Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4254816Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4255048Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4255288Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4255523Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4255726Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4255957Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4256197Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4256428Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4256689Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4256930Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4257147Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4257377Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4257549Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4257744Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4257862Z E1204 11:01:35.441000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.4258032Z [W1204 11:01:35.905912671 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.4258034Z 2025-12-04T12:10:20.4258374Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4258678Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4258825Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4259313Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4259581Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4259822Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4260042Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4260294Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4260537Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4260771Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4261039Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4261283Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4261527Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4261759Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4261998Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4262231Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4262472Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4262718Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4262957Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4263191Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4263436Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4263668Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4263871Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4264103Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4264344Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4264578Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4264780Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4265012Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4265251Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4265507Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4265757Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4265989Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4266206Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4266430Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4266603Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4266796Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4267332Z E1204 11:01:35.445000 738111 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpvuk0_rj7/ve/cveffpifa2zggcbvn5qmbmxobqre7du74dpfxngne2otfhzflg3r.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.4267494Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4267725Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4267894Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4268196Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4268342Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4268614Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4268767Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4269035Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4269205Z E1204 11:01:35.445000 738111 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4269491Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4269642Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4269950Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4270200Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4270529Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4270834Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4270979Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4271467Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4271749Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4271989Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4272208Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4272426Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4272668Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4272905Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4273145Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4273379Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4273621Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4273858Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4274098Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4274329Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4274595Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4274846Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4275086Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4275318Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4275557Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4275793Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4275996Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4276237Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4276478Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4276711Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4276915Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4277145Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4277387Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4277619Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4277866Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4278097Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4278314Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4278537Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4278719Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4278921Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4279047Z E1204 11:01:35.445000 738111 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.4279116Z ('RERUN', {'yellow': True}) [10.7649s] [100%] 2025-12-04T12:10:20.4279461Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda [W1204 11:01:36.443551959 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4279463Z 2025-12-04T12:10:20.4279624Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4279935Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4280278Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4280423Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4280924Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4281192Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4281435Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4281655Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4281874Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4282114Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4282350Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4282589Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4282823Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4283063Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4283319Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4283573Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4283806Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4284047Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4284278Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4284493Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4284716Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4284931Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4285184Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4285417Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4285622Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4285860Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4286102Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4286333Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4286538Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4286773Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4286983Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4287186Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4287417Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4287639Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4287861Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4288093Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4288334Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4288565Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4288806Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4289038Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4289250Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4289483Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4289698Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4289946Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4290225Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4290465Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4290698Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4290941Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4291177Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4291417Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4291654Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4291898Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4292143Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4292416Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4292648Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4292890Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4293121Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4293363Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4293594Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4293850Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4294081Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4294322Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4294556Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4294797Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4295030Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4295245Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4295456Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.4295702Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4295939Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4296180Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4296411Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4296672Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4296912Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4297151Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4297383Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4297623Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4297859Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4298104Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4298345Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4298557Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4298760Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4298994Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4299203Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4299426Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4299641Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4299885Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4300154Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4300367Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4300571Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4300803Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4301028Z E1204 11:01:36.993000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4301259Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4301492Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4301736Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4301969Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4302217Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4302449Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4302659Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4302893Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4303108Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4303353Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4303589Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4303810Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4304023Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4304237Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4304481Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4304716Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4304960Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4305196Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4305448Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4305702Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4305945Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4306184Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4306427Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4306664Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4306877Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4307081Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4307326Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4307543Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4307761Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4307976Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4308218Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4308453Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4308694Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4308928Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4309170Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4309404Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4309648Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4309906Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4310210Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4310445Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4310661Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4310874Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4311081Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4311306Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4311521Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:20.4311777Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4312012Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4312229Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4312444Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4312658Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4312902Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4313136Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4313379Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4313616Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4313859Z E1204 11:01:36.993000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4314094Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4314348Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4314604Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4314845Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4315080Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4315322Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4315557Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4315802Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4316044Z E1204 11:01:36.993000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4316286Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4316519Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4316761Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4317001Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4317241Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4317477Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4317691Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4317899Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4318133Z E1204 11:01:36.993000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4318382Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4318615Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4318876Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4319120Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4319361Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4319597Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4319843Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4320078Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4320348Z E1204 11:01:36.993000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4320592Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4320799Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4321032Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4321274Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4321509Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4321752Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4321991Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4322220Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4322437Z E1204 11:01:36.993000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4322651Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4322866Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4323108Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4323366Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4323607Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4323823Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4324036Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4324250Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4324494Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4324728Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4324959Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4325171Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4325378Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4325544Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.4325780Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4325986Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4326220Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4326466Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4326702Z E1204 11:01:36.993000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4326914Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4327122Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4327355Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4327578Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4327803Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4328037Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4328242Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4328475Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4328680Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4328916Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4329120Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4329366Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4329607Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4329843Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4330086Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4330361Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4330603Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4330836Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4331080Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4331313Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4331555Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4331790Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4332018Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4332246Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4332480Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4332711Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4332927Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:20.4333141Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4333356Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4333600Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4333849Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4334091Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4334326Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4334531Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4334766Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4335008Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4335242Z E1204 11:01:36.993000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4335484Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4335724Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4335953Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4336169Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4336382Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4336609Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4336824Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4337057Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4337300Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4337535Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4337779Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4338013Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4338228Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4338469Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4338711Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4338947Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4339188Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4339422Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4339627Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4339862Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4340123Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4340359Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4340600Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4340834Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4341092Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4341328Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4341542Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4341760Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4342006Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4342241Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4342450Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4342697Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4342943Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4343177Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4343419Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4343655Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4343882Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4344098Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4344311Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4344527Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4344769Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4345006Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4345247Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4345502Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4345754Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4345988Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4346193Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4346430Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4346672Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4346905Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4347158Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4347393Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4347623Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4347840Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4348052Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4348269Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4348511Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4348745Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4348987Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4349222Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4349463Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4349711Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4349972Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4350235Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4350481Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4350714Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4350927Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.4351146Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.4351352Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.4351583Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.4351812Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.4352035Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.4352249Z E1204 
11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.4352454Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.4352662Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.4352846Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.4352989Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.4353151Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.4353270Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.4353413Z E1204 11:01:36.993000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4353587Z [W1204 11:01:37.472067230 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4353590Z 2025-12-04T12:10:20.4353748Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4354055Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4354402Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4354546Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4355038Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4355307Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4355547Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4355766Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4355989Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4356231Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4356467Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4356708Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4356941Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4357180Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4357414Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4357658Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4357890Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4358132Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4358364Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4358584Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4358834Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4359049Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4359291Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4359525Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4359731Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4359963Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4360242Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4360489Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4360692Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4360926Z E1204 
11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4361137Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4361339Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4361576Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4361787Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4361989Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4362228Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4362470Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4362702Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4362944Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4363201Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4363424Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4363647Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4363860Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4364101Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4364334Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4364575Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4364817Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4365061Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4365294Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4365535Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4365767Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4366006Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4366238Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4366478Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4366710Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4366949Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4367182Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4367432Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4367681Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4367922Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4368154Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4368394Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4368626Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4368868Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4369101Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4369327Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4369538Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.4369781Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4370014Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4370295Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4370528Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4370768Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4371002Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4371244Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4371479Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4371720Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4371977Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4372230Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4372462Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4372673Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4372874Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4373107Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4373324Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4373544Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.4373771Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4374012Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4374245Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4374456Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4374657Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4374889Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4375097Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4375301Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4375534Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4375773Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4376006Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4376262Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4376513Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4376724Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4378901Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4379123Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4379375Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4379613Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4379831Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4380065Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4380315Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4380559Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4380794Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4381036Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4381272Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4381514Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4381750Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4381991Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4382232Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4382475Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4382730Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4382970Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4383175Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4383416Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4383633Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4383849Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4384065Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4384307Z 
E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4384559Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4384800Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4385037Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4385281Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4385515Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4385757Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4385990Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4386233Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4386467Z 
E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4386684Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4386899Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4387116Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4387364Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4387578Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4387820Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4388053Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4388270Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4388484Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4388698Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4388954Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4389187Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4389436Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4389669Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4389912Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4390183Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4390424Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4390659Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4390899Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4391134Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4391376Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4391626Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4391894Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4392127Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4392370Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4392603Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4392847Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4393081Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4393344Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4393581Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4393796Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4394002Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4394236Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4394479Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4394711Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4394952Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4395187Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4395428Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4395663Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4395902Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4396157Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4396408Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4396640Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4396845Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4397078Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4397321Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4397555Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4397809Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4398044Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4398272Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4398490Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4398702Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4398917Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4399157Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4399392Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4399623Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4399838Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4400053Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4400303Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4400570Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4400815Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4401031Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4401244Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.4401451Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4401616Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.4401852Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4402057Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4402303Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4402545Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4402780Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4402993Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4403196Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4403430Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4403643Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4403847Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4404082Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4404289Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4404522Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4404727Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4404981Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4405195Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4405429Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4405673Z E1204 11:01:37.011000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4405906Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4406149Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4406383Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4406635Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4406869Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4407110Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4407345Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4407589Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4407823Z E1204 11:01:37.011000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4408036Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4408241Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4408475Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4408701Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4408918Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4409131Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4409364Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4409626Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4409865Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4410140Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4410374Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4410580Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4410813Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4411067Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4411301Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4411542Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4411779Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4412005Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4412223Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4412436Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4412642Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4412848Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4413081Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4413323Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4413556Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4413826Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4414075Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4414280Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4414515Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4414755Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4414989Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4415230Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4415473Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4415678Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4415914Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4416158Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4416390Z E1204 11:01:37.011000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4416633Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4416865Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4417101Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4417319Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4417533Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4417753Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4417993Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4418249Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4418462Z E1204 11:01:37.011000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4418697Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4418939Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4419173Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4419415Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4419651Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4419889Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4420140Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4420354Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4420572Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4420817Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4421058Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4421299Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4421543Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4421788Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4422025Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4422232Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4422467Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4422742Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4422988Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4423230Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4423464Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4423695Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4423912Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4424124Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4424355Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4424598Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4424833Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4425080Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4425312Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4425557Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4425789Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4426032Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4426266Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4426507Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4426741Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4426963Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.4427197Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.4427410Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.4427624Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.4427853Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.4428076Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.4428289Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.4428495Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.4428712Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.4428899Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.4429041Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.4429202Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.4429324Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.4429467Z E1204 11:01:37.011000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4430695Z [W1204 11:01:37.506239994 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4430698Z 2025-12-04T12:10:20.4431621Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4431938Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4432248Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4432396Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4432907Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4433176Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4433448Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4433669Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4433884Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4434126Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4434362Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4434606Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4434845Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4435087Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4435322Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4435564Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4435797Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4436083Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4436330Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4436542Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4436764Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4436978Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4437220Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4437455Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4437672Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4437916Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4438155Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4438389Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4438591Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4438823Z E1204 
11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4439034Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4439237Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4439474Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4439684Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4439886Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4440158Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4440419Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4440668Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4440909Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4441143Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4441354Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4441578Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4441792Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4442032Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4442277Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4442531Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4442763Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4443008Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4443240Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4443480Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4443711Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4443951Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4444183Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4444424Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4444655Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4444922Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4445168Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4445408Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4445640Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4445881Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4446115Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4446355Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4446598Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4446848Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4447085Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4447301Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4447512Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.4447754Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4447986Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4448226Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4448458Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4448698Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4448932Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4449174Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4449417Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4449670Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4449903Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4450181Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4450413Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4450626Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4450828Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4451082Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4451304Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4451526Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.4451739Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4451981Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4452213Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4452424Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4452626Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4452859Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4453070Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4453272Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4453504Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4453761Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4454007Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4454247Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4454479Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4454690Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4454915Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4455131Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4455378Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4455622Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4455851Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4456065Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4456282Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4456525Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4456761Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4457005Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4457241Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4457483Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4457718Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4457960Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4458206Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4458460Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4458696Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4458908Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4459115Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4459349Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4459565Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4459777Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4460003Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4460304Z 
E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4460537Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4460779Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4461014Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4461257Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4461490Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4461732Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4461966Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4462208Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4462443Z 
E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4462675Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4462903Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4463115Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4463341Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4463556Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4463798Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4464034Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4464250Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4464476Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4464703Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4464946Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4465184Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4465425Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4465660Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4465900Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4466135Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4466376Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4466611Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4466853Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4467104Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4467356Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4467590Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4467832Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4468067Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4468309Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4468544Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4468798Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4469045Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4469286Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4469520Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4469733Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4469938Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4470202Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4470445Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4470680Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4470920Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4471158Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4471400Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4471646Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4471900Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4472134Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4472376Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4472609Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4472815Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4473049Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4473305Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4473552Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4473794Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4474028Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4474255Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4474473Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4474686Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4474901Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4475147Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4475381Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4475609Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4475825Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4476050Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4476276Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4476520Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4476755Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4476970Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4477186Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.4477392Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4477572Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.4477807Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4478022Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4478257Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4478499Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4478733Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4478946Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4479152Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4479387Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4479600Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4479805Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4480039Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4480276Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4480539Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4480744Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4480979Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4481184Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4481419Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4481661Z E1204 11:01:37.045000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4481896Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4482150Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4482396Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4482638Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4482872Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4483114Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4483355Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4483598Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4483832Z E1204 11:01:37.045000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4484046Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4484251Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4484485Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4484718Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4484959Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4485173Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4485389Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4485632Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4485866Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4486109Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4486343Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4486556Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4486800Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4487041Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4487284Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4487525Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4487758Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4487986Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4488202Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4488415Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4488621Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4488824Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4489059Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4489314Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4489560Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4489802Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4490036Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4490287Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4490522Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4490763Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4491010Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4491273Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4491506Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4491711Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4491946Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4492186Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4492421Z E1204 11:01:37.045000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4492662Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4492896Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4493126Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4493344Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4493557Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4493784Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4494044Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4494278Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4494483Z E1204 11:01:37.045000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4494717Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4494960Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4495196Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4495451Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4495695Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4495923Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4496141Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4496353Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4496568Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4496811Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4497049Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4497295Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4497528Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4497772Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4498004Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4498219Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4498463Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4498704Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4498939Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4499182Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4499418Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4499644Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4499873Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4500130Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4500345Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4500588Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4500821Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4501062Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4501297Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4501540Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4501775Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4502017Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4502252Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4502511Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4502758Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4502970Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.4503187Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.4503396Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.4503607Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.4503838Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.4504058Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.4504284Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.4504503Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.4504711Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.4504898Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.4505039Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.4505203Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.4505323Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.4505466Z E1204 11:01:37.045000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4505638Z [W1204 11:01:37.650115414 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4505641Z 2025-12-04T12:10:20.4505801Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4506110Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4506418Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4506566Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4507092Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4507366Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4507605Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4507826Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4508040Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4508281Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4508516Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4508768Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4509011Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4509254Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4509487Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4509729Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4509961Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4510239Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4510471Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4510682Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4510905Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4511119Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4511383Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4511628Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4511833Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4512066Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4512308Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4512541Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4512743Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4512975Z E1204 
11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4513198Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4513416Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4513648Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4513861Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4514064Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4514296Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4514538Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4514769Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4515010Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4515242Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4515457Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4515680Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4515902Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4516153Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4516385Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4516625Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4516857Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4517099Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4517334Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4517586Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4517827Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4518067Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4518300Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4518541Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4518775Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4519015Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4519248Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4519492Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4519724Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4519968Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4520246Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4520501Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4520734Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4520974Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4521207Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4521424Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4521637Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.4521888Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4522134Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4522374Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4522605Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4522846Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4523077Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4523319Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4523551Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4523792Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4524024Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4524265Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4524497Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4524716Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4524929Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4525161Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4525373Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4525598Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.4525811Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4526052Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4526296Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4526523Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4526724Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4526957Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4527169Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4527372Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4527610Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4527850Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4528083Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4528323Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4528556Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4528768Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4528999Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4529223Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4529468Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4529705Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4529921Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4530173Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4530390Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4530633Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4530881Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4531136Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4531371Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4531618Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4531853Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4532097Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4532333Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4532577Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4532810Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4533023Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4533230Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4533485Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4533715Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4533927Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4534143Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4534386Z 
E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4534621Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4534863Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4535096Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4535349Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4535593Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4535834Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4536068Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4536310Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4536546Z 
E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4536762Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4536976Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4537182Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4537407Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4537624Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4537866Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4538119Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4538334Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4538547Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4538761Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4539004Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4539239Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4539483Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4539731Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4539982Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4540253Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4540494Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4540729Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4540971Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4541207Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4541449Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4541686Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4541932Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4542166Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4542426Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4542673Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4542915Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4543150Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4543392Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4543628Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4543846Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4544071Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4544319Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4544559Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4544795Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4545035Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4545272Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4545514Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4545751Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4545995Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4546227Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4546469Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4546703Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4546918Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4547163Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4547406Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4547642Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4547889Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4548124Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4548350Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4548578Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4548802Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4549018Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4549262Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4549495Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4549725Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4549941Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4550227Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4550443Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4550688Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4550923Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4551140Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4551366Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.4551584Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4551747Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.4551986Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4552193Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4552428Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4552669Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4552903Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4553128Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4553346Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4553580Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4553795Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4554000Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4554234Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4554440Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4554675Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4554879Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4555113Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4555317Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4555551Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4555804Z E1204 11:01:37.189000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4556052Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4556294Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4556528Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4556769Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4557005Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4557246Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4557489Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4557740Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4557977Z E1204 11:01:37.189000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4558191Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4558393Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4558628Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4558856Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4559072Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4559286Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4559501Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4559743Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4559979Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4560282Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4560534Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4560739Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4560975Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4561216Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4561452Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4561692Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4561940Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4562182Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4562399Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4562615Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4562820Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4563027Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4563262Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4563505Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4563741Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4563990Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4564227Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4564431Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4564676Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4564926Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4565162Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4565405Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4565639Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4565846Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4566081Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4566334Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4566580Z E1204 11:01:37.189000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4566821Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4567056Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4567283Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4567501Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4567714Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4567933Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4568177Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4568412Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4568616Z E1204 11:01:37.189000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4568852Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4569104Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4569348Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4569592Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4569827Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4570058Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4570302Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4570514Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4570746Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4571004Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4571239Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4571482Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4571714Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4571958Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4572198Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4572404Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4572637Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4572879Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4573113Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4573353Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4573601Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4573841Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4574064Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4574280Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4574495Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4574737Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4574969Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4575221Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4575466Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4575708Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4575942Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4576192Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4576427Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4576668Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4576903Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4577115Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.4577335Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.4577541Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.4577754Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.4578010Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.4578244Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.4578460Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.4578667Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.4578873Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.4579059Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.4579202Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.4579363Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.4579492Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.4579645Z E1204 11:01:37.189000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4579720Z ('RERUN', {'yellow': True}) [1.4262s] [100%] 2025-12-04T12:10:20.4580073Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda [W1204 11:01:38.628216053 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4580075Z 2025-12-04T12:10:20.4580270Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4580578Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4580884Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4581030Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4581521Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4581790Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4582032Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4582269Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4582496Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4582740Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4582975Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4583217Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4583449Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4583693Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4583925Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4584186Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4584434Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4584674Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4584906Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4585118Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4585342Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4585554Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4585796Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4586029Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4586236Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4586471Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4586721Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4586964Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4587166Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4587400Z E1204 
11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4587613Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4587816Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4588051Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4588261Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4588473Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4588716Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4588958Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4589192Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4589432Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4589665Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4589877Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4590133Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4590349Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4590591Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4590825Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4591066Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4591311Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4591562Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4591796Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4592036Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4592269Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4592512Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4592743Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4592996Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4593248Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4593490Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4593723Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4593964Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4594198Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4594439Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4594673Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4594913Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4595146Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4595387Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4595629Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4595855Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4596065Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.4596308Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4596546Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4596787Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4597020Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4597259Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4597502Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4597752Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4597985Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4598226Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4598462Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4598702Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4598935Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4599146Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4599348Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4599581Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4599791Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4600027Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.4600288Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4600531Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4600764Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4600975Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4601179Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4601411Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4601621Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4601838Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4602083Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4602325Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4602560Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4602800Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4603031Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4603243Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4603465Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4603680Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4603924Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4604158Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4604374Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4604604Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4604830Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4605074Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4605309Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4605553Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4605786Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4606029Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4606272Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4606528Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4606765Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4607007Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4607242Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4607456Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4607661Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4607895Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4608113Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4608327Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4608542Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4608785Z 
E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4609029Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4609280Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4609514Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4609759Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4609997Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4610281Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4610516Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4610782Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4611030Z 
E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4611248Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4611462Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4611668Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4611894Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4612109Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4612350Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4612587Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4612806Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4613018Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4613233Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4613486Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4613733Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4613975Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4614212Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4614457Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4614695Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4614937Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4615180Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4615434Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4615668Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4615911Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4616146Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4616388Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4616622Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4616868Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4617103Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4617346Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4617582Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4617834Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4618079Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4618294Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4618499Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4618740Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4618984Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4619223Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4619475Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4619718Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4619959Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4620222Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4620464Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4620702Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4620949Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4621186Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4621392Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4621630Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4621872Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4622108Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4622362Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4622609Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4622838Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4623059Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4623275Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4623490Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4623735Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4623982Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4624222Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4624439Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4624652Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4624867Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4625113Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4625350Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4625566Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4625779Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.4625985Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4626148Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.4626385Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4626588Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4626833Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4627095Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4627330Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4627543Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4627747Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4627984Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4628198Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4628414Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4628660Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4628864Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4629104Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4629307Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4629544Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4629748Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4629983Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4630251Z E1204 11:01:38.167000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4630486Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4630730Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4630966Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4631224Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4631471Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4631715Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4631950Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4632191Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4632426Z E1204 11:01:38.167000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4632637Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4632856Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4633106Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4633334Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4633554Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4633766Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4633983Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4634225Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4634460Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4634702Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4634939Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4635146Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4635380Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4635632Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4635877Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4636119Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4636352Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4636580Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4636799Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4637017Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4637240Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4637455Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4637690Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4637932Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4638167Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4638409Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4638643Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4638849Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4639086Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4639328Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4639564Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4639806Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4640051Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4640307Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4640545Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4640786Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4641027Z E1204 11:01:38.167000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4641269Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4641505Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4641745Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4641973Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4642187Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4642401Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4642644Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4642879Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4643086Z E1204 11:01:38.167000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4643322Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4643562Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4643795Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4644036Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4644269Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4644516Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4644743Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4644964Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4645182Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4645423Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4645657Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4645899Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4646144Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4646395Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4646630Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4646834Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4647069Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4647312Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4647547Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4647790Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4648024Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4648253Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4648469Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4648684Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4648912Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4649169Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4649409Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4649650Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4649884Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4650167Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4650402Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4650657Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4650908Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4651152Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4651386Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4651598Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.4651815Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.4652021Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.4652231Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.4652460Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.4652683Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.4652894Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.4653102Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.4653322Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.4653522Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.4653663Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.4653826Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.4653946Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.4654087Z E1204 11:01:38.167000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4654259Z [W1204 11:01:38.644089245 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4654261Z 2025-12-04T12:10:20.4654422Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4654732Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4655050Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4655207Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4655698Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4655966Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4656208Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4656427Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4656644Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4656886Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4657126Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4657372Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4657605Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4657867Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4658099Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4658341Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4658574Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4658816Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4659050Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4659269Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4659503Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4659736Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4659979Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4660247Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4660451Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4660685Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4660925Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4661160Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4661363Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4661599Z E1204 
11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4661809Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4662013Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4662261Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4662482Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4662685Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4662918Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4663164Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4663396Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4663636Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4663880Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4664103Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4664326Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4664539Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4664782Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4665020Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4665261Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4665494Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4665734Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4665966Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4666207Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4666439Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4666689Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4666935Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4667178Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4667410Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4667652Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4667883Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4668124Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4668366Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4668616Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4668849Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4669092Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4669325Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4669566Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4669798Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4670014Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4670258Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.4670501Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4670732Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4670990Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4671236Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4671477Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4671711Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4671950Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4672183Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4672423Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4672656Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4672909Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4673159Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4673372Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4673576Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4673809Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4674020Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4674243Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.4674457Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4674699Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4674931Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4675142Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4675345Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4675593Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4675813Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4676015Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4676248Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4676489Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4676721Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4676963Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4677218Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4677440Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4677662Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4677876Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4678120Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4678352Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4678569Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4678781Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4678997Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4679240Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4679474Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4679717Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4679959Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4680243Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4680478Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4680720Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4680954Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4681201Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4682896Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4683147Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4683369Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4683605Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4683825Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4684037Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4684255Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4684499Z 
E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4684736Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4684979Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4685214Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4685456Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4685690Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4685945Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4686192Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4686436Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4686676Z 
E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4686894Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4687109Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4687315Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4687552Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4687777Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4688019Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4688254Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4688473Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4688688Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4688903Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4689145Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4689380Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4689620Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4689855Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4690128Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4690376Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4690641Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4690877Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4691121Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4691356Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4691601Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4691835Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4692090Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4692338Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4692582Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4692818Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4693061Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4693297Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4693537Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4693773Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4693988Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4694193Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4694429Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4694683Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4694936Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4695177Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4695413Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4695660Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4695895Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4696138Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4696384Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4696637Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4696874Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4697080Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4697315Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4697557Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4697791Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4698032Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4698267Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4698493Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4698719Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4698933Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4699158Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4699411Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4699645Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4699874Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4700189Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4700403Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4700623Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4700867Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4701118Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4701346Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4701561Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.4701767Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4701931Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.4702167Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4702373Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4702609Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4702854Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4703090Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4703304Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4703508Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4703756Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4703979Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4704185Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4704420Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4704624Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4704863Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4705068Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4705314Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4705528Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4705763Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4706008Z E1204 11:01:38.183000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4706242Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4706485Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4706722Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4706966Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4707201Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4707443Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4707677Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4707919Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4708162Z E1204 11:01:38.183000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4708387Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4708593Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4708832Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4709062Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4709280Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4709495Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4709721Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4709973Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4710243Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4710486Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4710721Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4710933Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4711168Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4711411Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4711646Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4711888Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4712122Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4712351Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4712592Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4712819Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4713125Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4713330Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4713567Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4713810Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4714045Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4714303Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4714557Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4714761Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4715001Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4715244Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4715478Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4715721Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4715955Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4716160Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4716394Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4716636Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4716872Z E1204 11:01:38.183000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4717126Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4717372Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4717601Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4717819Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4718033Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4718251Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4718495Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4718740Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4718958Z E1204 11:01:38.183000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4719192Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4719435Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4719671Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4719914Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4720177Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4720406Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4720624Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4720837Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4721058Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4721303Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4721564Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4721824Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4722058Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4722303Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4722537Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4722743Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4722980Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4723239Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4723485Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4723728Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4723963Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4724189Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4724406Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4724621Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4724836Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4725082Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4725317Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4725560Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4725794Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4726050Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4726296Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4726539Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4726776Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4727017Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4727255Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4727468Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.4727695Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.4727920Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.4728131Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.4728360Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.4728584Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.4728798Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.4729005Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.4729215Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.4729403Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.4729546Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.4729706Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.4729824Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.4729968Z E1204 11:01:38.183000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4730183Z [W1204 11:01:38.679293917 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4730186Z 2025-12-04T12:10:20.4730360Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4730683Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4730995Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4731145Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4731640Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4731911Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4732162Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4732396Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4732611Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4732854Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4733093Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4733336Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4733572Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4733814Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4734048Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4734290Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4734522Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4734763Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4735015Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4735229Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4735454Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4735669Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4735913Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4736147Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4736351Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4736593Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4736844Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4737078Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4737284Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4737519Z E1204 
11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4737731Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4737935Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4738167Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4738379Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4738580Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4738813Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4739057Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4739303Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4739554Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4739787Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4740000Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4740266Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4740480Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4740723Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4740968Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4741225Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4741458Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4741700Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4741934Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4742174Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4742407Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4742648Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4742886Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4743128Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4743364Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4743605Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4743848Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4744103Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4744336Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4744578Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4744810Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4745051Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4745284Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4745542Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4745786Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4746001Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4746212Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.4746451Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4746684Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4746925Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4747157Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4747398Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4747631Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4747872Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4748104Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4748365Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4748597Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4748838Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4749071Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4749283Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4749487Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4749718Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4749939Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4750194Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.4750409Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4750651Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4750882Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4751095Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4751297Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4751537Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4751748Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4751948Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4752181Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4752422Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4752670Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4752924Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4753158Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4753371Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4753592Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4753806Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4754051Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4754304Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4754534Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4754748Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4754964Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4755206Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4755447Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4755690Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4755927Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4756168Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4756403Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4756646Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4756880Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4757133Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4757381Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4757597Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4757806Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4758040Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4758261Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4758475Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4758705Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4758960Z 
E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4759195Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4759440Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4759674Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4759916Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4760188Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4760432Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4760666Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4760908Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4761146Z 
E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4761364Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4761596Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4761823Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4762049Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4762264Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4762507Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4762742Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4762959Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4763184Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4763411Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4763660Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4763894Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4764137Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4764373Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4764615Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4764851Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4765093Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4765328Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4765572Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4765808Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4766060Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4766305Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4766548Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4766782Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4767024Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4767258Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4767499Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4767749Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4767999Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4768234Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4768448Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4768653Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4768887Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4769133Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4769369Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4769611Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4769847Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4770121Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4770369Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4770623Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4770857Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4771101Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4771336Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4771542Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4771780Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4772022Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4772270Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4772524Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4772758Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4772986Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4773204Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4773417Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4773632Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4773877Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4774112Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4774339Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4774557Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4774780Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4775003Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4775246Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4775482Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4775700Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4775915Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.4776121Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4776284Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.4779350Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4779570Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4779805Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4780048Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4780311Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4780524Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4780732Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4780967Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4781183Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4781388Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4781622Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4781827Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4782083Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4782300Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4782542Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4782748Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4782984Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4783226Z E1204 11:01:38.218000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4783462Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4783715Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4783960Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4784202Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4784437Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4784681Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4784914Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4785156Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4785389Z E1204 11:01:38.218000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4785603Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4785805Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4786040Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4786268Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4786498Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4786722Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4786938Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4787182Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4787416Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4787660Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4787895Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4788107Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4788355Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4788598Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4788832Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4789074Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4789308Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4789537Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4789755Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4789969Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4790211Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4790418Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4790656Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4790912Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4791158Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4791400Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4791635Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4791840Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4792075Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4792316Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4792567Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4792821Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4793054Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4793258Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4793491Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4793734Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4793968Z E1204 11:01:38.218000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4794209Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4794444Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4794679Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4794897Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4795110Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4795337Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4795589Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4795823Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4796029Z E1204 11:01:38.218000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4796266Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4796512Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4796746Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4796998Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4797248Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4797475Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4797692Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4797905Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4798121Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4798363Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4798602Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4798847Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4799081Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4799326Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4799560Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4799774Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4800018Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4800299Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4800536Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4800780Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4801014Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4801241Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4801470Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4801694Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4801909Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4802153Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4802387Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4802631Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4802866Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4803108Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4803342Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4803584Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4803823Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4804064Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4804309Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4804533Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.4804755Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.4804960Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.4805173Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.4805403Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.4805624Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.4805846Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.4806063Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.4806270Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.4806458Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.4806607Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.4806767Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.4806886Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.4807028Z E1204 11:01:38.218000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4807199Z [W1204 11:01:38.841903394 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4807201Z 2025-12-04T12:10:20.4807360Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.4807672Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4807983Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4808131Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4808638Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4808917Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4809159Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4809381Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4809594Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4809839Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4810075Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4810368Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4810620Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4810860Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4811095Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4811335Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4811570Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4811811Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4812044Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4812257Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4812483Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4812699Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4812939Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4813193Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4813409Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4813643Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4813886Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4814117Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4814321Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4814560Z E1204 
11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4814790Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4815002Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4815235Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4815448Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4815650Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4815883Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4816125Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4816359Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4816602Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4816835Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4817047Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4817269Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4817492Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4817741Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4817975Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4818217Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4818450Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4818697Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4818928Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4819178Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4819419Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4819660Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4819893Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4820166Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4820401Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4820646Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4820880Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4821120Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4821352Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4821594Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4821827Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4822081Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4822326Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4822568Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4822803Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4823019Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4823230Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.4823470Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4823716Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4823969Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4824202Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4824442Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4824676Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4824919Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4825151Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4825393Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4825626Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4825868Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4826103Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4826313Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4826527Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4826772Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4826983Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4827205Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.4827420Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4827662Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4827893Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4828113Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4828324Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4828557Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4828768Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4828971Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4829205Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4829446Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4829680Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4829921Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4830185Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4830397Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4830620Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4830854Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4831116Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4831351Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4831568Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4831784Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4832002Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4832247Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4832497Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4832754Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4832990Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4833232Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4833466Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4833707Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4833942Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4834186Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4834420Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4834633Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4834844Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4835079Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4835304Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4835527Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4835743Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4835986Z 
E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4836223Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4836465Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4836702Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4836955Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4837199Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4837442Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4837676Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4837920Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4838154Z 
E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4838371Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4838586Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4838797Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4839021Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.4839237Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4839481Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4839723Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4839950Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4840200Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4840415Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4840658Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4840895Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4841139Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4841386Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4841639Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4841874Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4842116Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4842350Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4842593Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4842831Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4843080Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4843314Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4843555Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4843789Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4844030Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4844275Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4844530Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4844764Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4845009Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4845245Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4845458Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4845663Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4845908Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4846160Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4846395Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4846638Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4846873Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4847119Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4847355Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4847598Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4847833Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4848076Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4848310Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4848514Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4848773Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4849019Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4849254Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4849497Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4849732Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4849959Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4850212Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4850439Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4850666Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4850908Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4851144Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4851373Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4851589Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4851804Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4852020Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4852264Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4852499Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4852718Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4852930Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.4853148Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4853323Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.4853559Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4853765Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4853999Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4854243Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4854477Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4854699Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4854914Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4855157Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4855369Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4855573Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4855808Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4856012Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4856246Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4856451Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4856685Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4856890Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4857126Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4857380Z E1204 11:01:38.381000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4857630Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4857871Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4858108Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4858352Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4858586Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4858828Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4859075Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4859326Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4859559Z E1204 11:01:38.381000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4859773Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.4859976Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4860249Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4860477Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4860695Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4860909Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4861125Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4861371Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4861606Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4861861Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4862106Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4862311Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4862546Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4862788Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4863027Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4863271Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4863518Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4863763Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4863979Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4864193Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4864399Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.4864604Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4864838Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4865086Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4865321Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4865563Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4865799Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4866003Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4866249Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4866500Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4866734Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4866977Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4867213Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4867419Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4867653Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4867895Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4868143Z E1204 11:01:38.381000 
738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4868393Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4868626Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4868855Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4869075Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4869291Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4869506Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4869750Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4869985Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4870225Z E1204 11:01:38.381000 738111 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4870461Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4870720Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4870967Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4871216Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4871449Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4871677Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4871896Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4872109Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4872324Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4872579Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4872825Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4873068Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4873306Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4873549Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4873783Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4873988Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4874222Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4874465Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4874698Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4874941Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4875189Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4875426Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.4875642Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.4875855Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.4876071Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4876313Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4876548Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4876801Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4877046Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4877288Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4877523Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4877765Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4878000Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4878244Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4878478Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4878692Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.4878913Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.4879120Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.4879333Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.4879574Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.4879805Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.4880018Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.4880265Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.4880473Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.4880659Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.4880801Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.4880961Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.4881107Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.4881247Z E1204 11:01:38.381000 738111 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.4881320Z FAILED [1.1913s] [100%] 2025-12-04T12:10:20.4881323Z 2025-12-04T12:10:20.4881397Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.4881556Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4881621Z Traceback (most recent call last): 2025-12-04T12:10:20.4881799Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4881860Z method(*args, **kwargs) 2025-12-04T12:10:20.4882027Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4882086Z method(*args, **kwargs) 2025-12-04T12:10:20.4882250Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4882307Z with policy(): 2025-12-04T12:10:20.4882474Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4882532Z raise RuntimeError(msg) 2025-12-04T12:10:20.4882947Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1086324736. 
2025-12-04T12:10:20.4882950Z 2025-12-04T12:10:20.4883047Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4883323Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4883328Z 2025-12-04T12:10:20.4883433Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4883529Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4883589Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4883665Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4884261Z inductor [('triton_bundler_save_kernel', 280), ('generated_module_cache_miss', 34), ('benchmarking.InductorBenchmarker.benchmark_gpu', 31), ('select_algorithm_num_precompiles', 30), ('select_algorithm_num_precompilation_exceptions', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4884380Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4884434Z graph_break [] 2025-12-04T12:10:20.4884517Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4884610Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4885121Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4885186Z current_size = base.storage().size() 2025-12-04T12:10:20.4885246Z Autotune Choices Stats: 2025-12-04T12:10:20.4885638Z {"num_choices": 31, "num_triton_choices": 30, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006639999803155661, "best_triton_pos": 0} 2025-12-04T12:10:20.4885740Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4885808Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4885945Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4886199Z triton_mm_31 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4886442Z triton_mm_19 0.0077 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4886681Z triton_mm_27 0.0086 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4886923Z triton_mm_20 0.0090 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4887159Z triton_mm_15 0.0091 ms 72.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4887396Z triton_mm_13 0.0095 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4887635Z triton_mm_8 0.0100 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4887872Z triton_mm_14 0.0102 ms 65.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4888118Z triton_mm_23 0.0102 ms 65.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4888367Z triton_mm_32 0.0106 ms 62.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4888517Z SingleProcess AUTOTUNE benchmarking takes 0.1242 seconds and 8.2527 seconds precompiling for 31 choices 2025-12-04T12:10:20.4888676Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4888741Z Traceback (most recent call last): 2025-12-04T12:10:20.4888913Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4888972Z method(*args, **kwargs) 
2025-12-04T12:10:20.4889140Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4889198Z method(*args, **kwargs) 2025-12-04T12:10:20.4889365Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4889420Z with policy(): 2025-12-04T12:10:20.4889598Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4889657Z raise RuntimeError(msg) 2025-12-04T12:10:20.4890071Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1086324736 and is now 1184890880. 2025-12-04T12:10:20.4890075Z 2025-12-04T12:10:20.4890208Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4890484Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4890487Z 2025-12-04T12:10:20.4890590Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4890681Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4890741Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4890816Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4891383Z inductor [('triton_bundler_save_kernel', 280), ('generated_module_cache_miss', 34), ('benchmarking.InductorBenchmarker.benchmark_gpu', 31), ('select_algorithm_num_precompiles', 30), ('select_algorithm_num_precompilation_exceptions', 4), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4891501Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4891554Z graph_break [] 2025-12-04T12:10:20.4891635Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4891725Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4892223Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4892289Z current_size = base.storage().size() 2025-12-04T12:10:20.4892346Z Autotune Choices Stats: 2025-12-04T12:10:20.4892760Z {"num_choices": 31, "num_triton_choices": 30, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006639999803155661, "best_triton_pos": 0} 2025-12-04T12:10:20.4892845Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4892912Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4893049Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4893299Z triton_mm_31 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:20.4893538Z triton_mm_19 0.0077 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4893778Z triton_mm_27 0.0086 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4894028Z triton_mm_20 0.0090 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4894281Z triton_mm_15 0.0091 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4894519Z triton_mm_13 0.0095 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4894758Z triton_mm_8 0.0100 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4894996Z triton_mm_14 0.0102 ms 65.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4895234Z triton_mm_23 0.0102 ms 65.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4895475Z triton_mm_32 0.0106 ms 62.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4895623Z SingleProcess AUTOTUNE benchmarking takes 0.1242 seconds and 8.2527 seconds precompiling for 31 choices 2025-12-04T12:10:20.4895712Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4895772Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4895845Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4895962Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4896474Z inductor [('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4896532Z graph_break [] 2025-12-04T12:10:20.4896626Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4896715Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4897097Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.4897206Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.4897264Z Autotune Choices Stats: 2025-12-04T12:10:20.4897643Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_65", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:20.4897727Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4897794Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4897930Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4898186Z triton_mm_65 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4898435Z triton_mm_53 0.0078 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4898676Z triton_mm_66 0.0081 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4898913Z triton_mm_54 0.0090 ms 78.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4899150Z triton_mm_57 0.0100 ms 70.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4899387Z triton_mm_47 0.0108 ms 65.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4899632Z triton_mm_42 0.0109 ms 64.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4899871Z triton_mm_62 0.0110 ms 64.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4900146Z triton_mm_45 0.0112 ms 62.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4900383Z triton_mm_48 0.0113 ms 62.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4900542Z SingleProcess AUTOTUNE benchmarking takes 0.2200 seconds and 0.4950 seconds precompiling for 35 choices 2025-12-04T12:10:20.4900614Z =================================== FAILURES =================================== 2025-12-04T12:10:20.4900784Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.4900855Z Traceback (most recent call last): 2025-12-04T12:10:20.4901026Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.4901085Z method(*args, **kwargs) 2025-12-04T12:10:20.4901255Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.4901313Z method(*args, **kwargs) 2025-12-04T12:10:20.4901484Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.4901539Z with policy(): 2025-12-04T12:10:20.4901708Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.4901768Z raise RuntimeError(msg) 2025-12-04T12:10:20.4902173Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1184890880 and is now 1283457024. 
2025-12-04T12:10:20.4902192Z 2025-12-04T12:10:20.4902297Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.4902571Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4902574Z 2025-12-04T12:10:20.4902676Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.4902768Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4902826Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4902900Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4903472Z inductor [('triton_bundler_save_kernel', 280), ('generated_module_cache_miss', 34), ('benchmarking.InductorBenchmarker.benchmark_gpu', 31), ('select_algorithm_num_precompiles', 30), ('select_algorithm_num_precompilation_exceptions', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4903587Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4903641Z graph_break [] 2025-12-04T12:10:20.4903720Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4903810Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4904309Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.4904376Z current_size = base.storage().size() 2025-12-04T12:10:20.4904436Z Autotune Choices Stats: 2025-12-04T12:10:20.4904818Z {"num_choices": 31, "num_triton_choices": 30, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006639999803155661, "best_triton_pos": 0} 2025-12-04T12:10:20.4904907Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4904977Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4905127Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4905377Z triton_mm_31 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4905619Z triton_mm_19 0.0077 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4905858Z triton_mm_27 0.0086 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4906096Z triton_mm_20 0.0090 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4906331Z triton_mm_15 0.0091 ms 72.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4906579Z triton_mm_13 0.0095 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4906830Z triton_mm_8 0.0100 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4907069Z triton_mm_14 0.0102 ms 65.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4907308Z triton_mm_23 0.0102 ms 65.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4907549Z triton_mm_32 0.0106 ms 62.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4907695Z SingleProcess AUTOTUNE benchmarking takes 0.1242 seconds and 8.2527 seconds precompiling for 31 choices 2025-12-04T12:10:20.4907788Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4907847Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4907920Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4908035Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4908535Z inductor 
[('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4908593Z graph_break [] 2025-12-04T12:10:20.4908672Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4908761Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4909161Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:20.4909270Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.4909328Z Autotune Choices Stats: 2025-12-04T12:10:20.4909704Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_65", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:20.4909787Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4909855Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4909990Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4910273Z triton_mm_65 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4910523Z triton_mm_53 0.0078 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4910779Z triton_mm_66 0.0081 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4911020Z triton_mm_54 0.0090 ms 78.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4911260Z triton_mm_57 0.0100 ms 70.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4911496Z triton_mm_47 0.0108 ms 65.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4911736Z triton_mm_42 0.0109 ms 64.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4911974Z triton_mm_62 0.0110 ms 64.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4912210Z triton_mm_45 0.0112 ms 62.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4912447Z triton_mm_48 0.0113 ms 62.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4912593Z SingleProcess AUTOTUNE benchmarking takes 0.2200 seconds and 0.4950 seconds precompiling for 35 choices 2025-12-04T12:10:20.4912681Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.4912741Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.4912813Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.4912946Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.4913455Z inductor [('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.4913513Z graph_break [] 2025-12-04T12:10:20.4913593Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.4913683Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.4913740Z Autotune Choices Stats: 2025-12-04T12:10:20.4914115Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_99", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007240000180900097, "best_triton_pos": 0} 
2025-12-04T12:10:20.4914196Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.4914262Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.4914411Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.4914659Z triton_mm_99 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4914729Z _scaled_mm 0.0081 ms 89.2% 2025-12-04T12:10:20.4914971Z triton_mm_100 0.0083 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4915210Z triton_mm_83 0.0091 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4915447Z triton_mm_81 0.0096 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.4915685Z triton_mm_95 0.0102 ms 70.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4915924Z triton_mm_82 0.0104 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4916163Z triton_mm_91 0.0106 ms 68.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4916400Z triton_mm_87 0.0107 ms 67.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4916637Z triton_mm_88 0.0108 ms 67.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.4916780Z SingleProcess AUTOTUNE benchmarking takes 0.2271 seconds and 0.2857 seconds precompiling for 35 choices 2025-12-04T12:10:20.4916998Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7b2a1da6e08666d5.xml - 2025-12-04T12:10:20.4917076Z =========================== short test summary info ============================ 2025-12-04T12:10:20.4917686Z FAILED [1.1913s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1184890880 and is now 1283457024. 
2025-12-04T12:10:20.4917691Z
2025-12-04T12:10:20.4917781Z To execute this test, run the following from the base repo dir:
2025-12-04T12:10:20.4918055Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda
2025-12-04T12:10:20.4918057Z
2025-12-04T12:10:20.4918160Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0
2025-12-04T12:10:20.4918240Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!!
2025-12-04T12:10:20.4918326Z ================= 1 failed, 187 deselected, 2 rerun in 13.40s ==================
2025-12-04T12:10:20.4918393Z Got exit code 1
2025-12-04T12:10:20.4918450Z Retrying single test...
2025-12-04T12:10:20.4918608Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-698796e13e91426e.xml
2025-12-04T12:10:20.4918693Z ============================= test session starts ==============================
2025-12-04T12:10:20.4918824Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python
2025-12-04T12:10:20.4918881Z cachedir: .pytest_cache
2025-12-04T12:10:20.4919055Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow]
2025-12-04T12:10:20.4919119Z rootdir: /var/lib/jenkins/pytorch
2025-12-04T12:10:20.4919178Z configfile: pytest.ini
2025-12-04T12:10:20.4919360Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0
2025-12-04T12:10:20.4919452Z collecting ... collected 188 items / 187 deselected / 1 selected
2025-12-04T12:10:20.4919719Z stepcurrent: skipping 96 already run items.
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.4919780Z Running 1 items in this shard 2025-12-04T12:10:20.4919782Z 2025-12-04T12:10:20.4920174Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda [W1204 11:01:48.008784297 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4920176Z 2025-12-04T12:10:20.4920507Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4920817Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4920969Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4921479Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4921765Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4922009Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from 
static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4922234Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4922451Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4922694Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4922930Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4923185Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4923430Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4923672Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4923908Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4924156Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 
2025-12-04T12:10:20.4924389Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4924631Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4924864Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4925105Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4925337Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4925578Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4925825Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4926047Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4926281Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4926523Z E1204 
11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4926757Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4926961Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4927196Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4927438Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4927680Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4927936Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4928173Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4928392Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4928619Z E1204 11:01:56.305000 743947 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4928795Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4928992Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4929531Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpmwed7nhp/3o/c3ozoay57awgiq6xpuxa3spuyevy367x6igacilyrp77le5xavwi.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8) 2025-12-04T12:10:20.4929694Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4929924Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4930132Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4930448Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4930608Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4930881Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4931036Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4931306Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4931477Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4931761Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4931910Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4932216Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4932437Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4932768Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4933075Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4933221Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4933716Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4933985Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4934226Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4934449Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4934665Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4934922Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4935170Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4935411Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4935645Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4935887Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4936120Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4936363Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4936605Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4936856Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4937088Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4937329Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4937560Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4937802Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4938035Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4938243Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4938481Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4938721Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4938956Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4939160Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4939405Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4939656Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4939891Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4940166Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4940402Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4940623Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4940848Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4941038Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4941243Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4941365Z E1204 11:01:56.305000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.4941538Z [W1204 11:01:56.795463478 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.4941540Z 2025-12-04T12:10:20.4941711Z [W1204 11:01:56.796903450 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.4941713Z 2025-12-04T12:10:20.4942039Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4942348Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4942495Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4942988Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4943255Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4943496Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4943725Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4943953Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4944196Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4944436Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4944678Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4944912Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4945153Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4945401Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4945642Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4945884Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4946126Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4946362Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4946603Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4946839Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4947080Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4947313Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4947516Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4947749Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4947991Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4948231Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4948447Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4948679Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4948922Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4949155Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4949398Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4949633Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4949858Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4950127Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4950300Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4950498Z E1204 
11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4951037Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpmwed7nhp/ve/cveffpifa2zggcbvn5qmbmxobqre7du74dpfxngne2otfhzflg3r.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.4951201Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4951431Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4951601Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4951906Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4952053Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4952324Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4952481Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4952763Z E1204 11:01:56.338000 743947 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4952946Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4953227Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4953454Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4953745Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4953953Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4954283Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4954602Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4954762Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4955254Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4955520Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4955760Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4955981Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4956197Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4956440Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4956676Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4956921Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4957155Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4957410Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4957653Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4957895Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4958128Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4958370Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4958603Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4958847Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4959090Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4959340Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4959575Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4959779Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4960012Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4960289Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4960524Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4960734Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4960971Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4961214Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4961448Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4961690Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4961953Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4962171Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4962398Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4962571Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4962765Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4962883Z E1204 11:01:56.338000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.4963207Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4963522Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4963679Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4964171Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4964437Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4964676Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4964899Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4965115Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4965357Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4965589Z E1204 11:01:56.339000 743947 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4965832Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4966065Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4966316Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4966558Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4966808Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4967044Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4967283Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4967516Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4967755Z E1204 11:01:56.339000 743947 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4967998Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4968247Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4968481Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4968686Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4968919Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4969160Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4969393Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4969598Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4969830Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4970071Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4970354Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4970593Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4970842Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4971073Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4971299Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4971472Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4971666Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4972207Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpmwed7nhp/sx/csxg5s2kbrrirurdwogcwmnmcogrpoawcfp6lpewx5v6rophy2j2.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8) 2025-12-04T12:10:20.4972382Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4972611Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4972794Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4973101Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4973250Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4973519Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4973672Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4973940Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4974111Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4974393Z E1204 11:01:56.339000 743947 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4974543Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4974830Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4975040Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4975388Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4975693Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4975839Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4976331Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4976597Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4976837Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4977074Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4977299Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4977540Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4977776Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4978020Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4979520Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4979762Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4979996Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4980297Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4980532Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4980777Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4981035Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4981291Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4981525Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4981768Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4982003Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4982208Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4982447Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4982702Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4982947Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4983151Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4983384Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.4983625Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4983858Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4984099Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4984334Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4984553Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4984777Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4984951Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4985146Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4985264Z E1204 11:01:56.339000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.4985448Z [W1204 11:01:56.806435697 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.4985451Z 2025-12-04T12:10:20.4985785Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.4986094Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4986241Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4986736Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4987002Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4987252Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4987482Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4987699Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.4987941Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.4988175Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4988418Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4988654Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4988896Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4989129Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4989371Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4989604Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4989853Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4990128Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4990372Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4990612Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4990854Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4991087Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4991290Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4991523Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4991779Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4992026Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4992232Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.4992466Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4992712Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4992945Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4993186Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.4993419Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.4993635Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.4993859Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.4994033Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.4994226Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.4994789Z E1204 11:01:56.345000 743947 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpmwed7nhp/6k/c6k4655zjwzidfz2nuoodb6m7sz4idskle2xvbdktvqwfswe43lc.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.4994953Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.4995184Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.4995355Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.4995658Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.4995805Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.4996075Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.4996253Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.4996521Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.4996692Z E1204 11:01:56.345000 743947 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.4996977Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.4997127Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.4997416Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.4997625Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.4997956Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.4998262Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.4998407Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.4998915Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.4999192Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.4999431Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.4999653Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.4999871Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5000149Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5000384Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5000643Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5000889Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5001129Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5001364Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5001608Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5001842Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5002084Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5002318Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5002560Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5002797Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5003039Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5003273Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5003489Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5003733Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5003976Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5004209Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5004412Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5004649Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5004892Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5005133Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5005384Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5005618Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5005840Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.5006064Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.5006238Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.5006433Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.5006549Z E1204 11:01:56.345000 743947 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.5006620Z ('RERUN', {'yellow': True}) [10.9754s] [100%] 2025-12-04T12:10:20.5006969Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda [W1204 11:01:57.240418004 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5006971Z 2025-12-04T12:10:20.5007132Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5007441Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.5007750Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5007917Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5008419Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5008687Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5008930Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5009151Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5009364Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5009617Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5009860Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5010134Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5010368Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5010609Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5010843Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5011086Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5011318Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5011560Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5011792Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5012005Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5012227Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5012468Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5012710Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5012946Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5013151Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5013384Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5013627Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5013858Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5014085Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5014332Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5014545Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5014748Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5014984Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5015196Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5015399Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5015633Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5015874Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5016108Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5016351Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5016584Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5016814Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5017047Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5017263Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5017506Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5017739Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5017981Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5018213Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5018466Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5018708Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5018955Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5019188Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5019428Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5019663Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5019904Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5020179Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5020420Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5020653Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5020896Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5021129Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5021397Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5021629Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5021870Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5022102Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5022345Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5022578Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5022793Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5023017Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.5023270Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5023504Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5023745Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5023978Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5024219Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5024451Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5024693Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5024926Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5025170Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5025402Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5025652Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5025897Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5026110Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5026314Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5026548Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5026762Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5026985Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5027203Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5027457Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5027698Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5027909Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5028113Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5028346Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5028556Z E1204 11:01:57.789000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5028760Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5028998Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5029241Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5029475Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5029716Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5029950Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5030210Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5030452Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5030666Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5030912Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5031153Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5031371Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5031584Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5031811Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5032067Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5032302Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5032546Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5032780Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5033024Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5033259Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5033503Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5033737Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5033981Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5034218Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5034431Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5034644Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5034889Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5035109Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5035323Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5035537Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5035781Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5036016Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5036271Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5036517Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5036761Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5036997Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5037244Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5037479Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5037722Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5037957Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5038174Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5038388Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5038596Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5038821Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5039048Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:20.5039304Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5039540Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5039759Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5039972Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5040224Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5040467Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5040723Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5040979Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5041215Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5041459Z E1204 11:01:57.789000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5041694Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5041935Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5042170Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5042412Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5042647Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5042890Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5043132Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5043375Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5043635Z E1204 11:01:57.789000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5043878Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5044114Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5044357Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5044592Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5044834Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5045072Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5045295Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5045510Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5045746Z E1204 11:01:57.789000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5045989Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5046225Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5046467Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5046701Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5046945Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5047184Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5047429Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5047666Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5047926Z E1204 11:01:57.789000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5048173Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5048378Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5048613Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5048856Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5049091Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5049336Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5049580Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5049816Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5050035Z E1204 11:01:57.789000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5050298Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5050515Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5050758Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5050994Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5051225Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5051444Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5051656Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5051871Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5052115Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5052362Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5052591Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5052805Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5053013Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5053179Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.5053418Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5053626Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5053860Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5054117Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5054364Z E1204 11:01:57.789000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5054578Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5054784Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5055019Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5055236Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5055444Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5055683Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5055890Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5056126Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5056334Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5056568Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5056786Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5057029Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5057273Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5057512Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5057755Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5057990Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5058233Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5058478Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5058731Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5058966Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5059209Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5059447Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5059665Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5059870Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5060144Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5060374Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5060591Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:20.5060807Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5061022Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5061281Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5061529Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5061773Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5062008Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5062214Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5062451Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5062696Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5062945Z E1204 11:01:57.789000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5063204Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5063444Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5063672Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5063889Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5064105Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5064313Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5064517Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5064753Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5064996Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5065230Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5065475Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5065720Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5065934Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5066169Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5066412Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5066647Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5066890Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5067125Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5067340Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5067588Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5067831Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5068067Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5068310Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5068546Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5068774Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5068992Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5069206Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5069426Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5069669Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5069906Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5070152Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5070399Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5070641Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5070877Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5071121Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5071356Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5071589Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5071820Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5072046Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5072261Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5072503Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5072738Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5072980Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5073216Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5073458Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5073694Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5073899Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5074132Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5074376Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5074618Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5074871Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5075107Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5075335Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5075557Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5075771Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5075987Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5076240Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5076485Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5076728Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5076964Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5077205Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5077444Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5077688Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5077923Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5078166Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5078400Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5078613Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.5078831Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.5079045Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.5079268Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.5079498Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.5079722Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.5079935Z E1204 
11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.5080172Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.5080381Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.5080567Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.5080730Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.5080903Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.5081022Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.5081168Z E1204 11:01:57.789000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5081342Z [W1204 11:01:57.268699026 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5081344Z 2025-12-04T12:10:20.5081506Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5081815Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.5082123Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5082268Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5082760Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5083029Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5083268Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5083503Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5083732Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5083975Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5084210Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5084454Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5084688Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5084929Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5085173Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5085423Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5085659Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5085900Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5086132Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5086345Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5086568Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5086784Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5087027Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5087262Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5087467Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5087703Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5087955Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5088198Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5088403Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5088635Z E1204 
11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5088848Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5089050Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5089283Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5089505Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5089720Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5089953Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5090236Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5090470Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5090711Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5090947Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5091161Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5091383Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5091604Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5091846Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5092080Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5092332Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5092580Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5092823Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5093057Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5093299Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5093533Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5093776Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5094022Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5094274Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5094507Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5094748Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5094981Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5095222Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5095455Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5095704Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5095936Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5096178Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5096411Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5096654Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5096913Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5097131Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5097343Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.5097585Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5097823Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5098064Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5098300Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5098553Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5098796Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5099038Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5099270Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5099511Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5099748Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5099992Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5100263Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5100476Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5100681Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5100914Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5101125Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5101361Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.5101589Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5101836Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5102069Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5102281Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5102484Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5102718Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5102943Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5103157Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5103390Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5103633Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5103870Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5104111Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5104344Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5104556Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5104780Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5104994Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5105241Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5105478Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5105704Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5105930Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5106147Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5106392Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5106627Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5106871Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5107106Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5107357Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5107603Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5107849Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5108085Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5108326Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5108561Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5108775Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5108979Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5109215Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5109431Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5109646Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5109867Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5110161Z 
E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5110409Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5110651Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5110888Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5111131Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5111367Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5111609Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5111856Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5112115Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5112349Z 
E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5112568Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5112781Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5112988Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5113214Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5113430Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5113673Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5113908Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5114127Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5114340Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5114573Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5114832Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5115068Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5115311Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5115546Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5115791Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5116030Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5116283Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5116530Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5116772Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5117007Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5117248Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5117483Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5117725Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5117968Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5118213Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5118447Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5118690Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5118923Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5119185Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5119419Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5119637Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5119843Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5120080Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5120347Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5120582Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5120839Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5121089Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5121332Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5121568Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5121809Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5122046Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5122288Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5122524Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5122729Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5122963Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5123207Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5123452Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5123706Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5123939Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5124173Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5124391Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5124603Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5124819Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5125060Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5125308Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5125545Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5125764Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5125978Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5126198Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5126441Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5126677Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5126896Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5127109Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.5127317Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5127482Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.5127717Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5127934Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5128184Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5128428Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5128663Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5128876Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5129081Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5129317Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5129540Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5129763Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5129999Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5130243Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5130479Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5130685Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5130919Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5131123Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5131360Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5131604Z E1204 11:01:57.807000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5131839Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5132084Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5132338Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5132593Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5132831Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5133074Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5133310Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5133553Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5133789Z E1204 11:01:57.807000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5134016Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5134237Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5134471Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5134699Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5134917Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5135131Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5135347Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5135592Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5135827Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5136071Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5136312Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5136517Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5136762Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5137017Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5137253Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5137496Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5137732Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5137960Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5138179Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5138405Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5138621Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5138826Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5139062Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5139305Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5139539Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5139781Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5140015Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5140262Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5140498Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5140740Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5140976Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5141234Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5141481Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5141685Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5141922Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5142165Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5142401Z E1204 11:01:57.807000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5142645Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5142891Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5143132Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5143347Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5143569Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5143785Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5144027Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5144263Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5144468Z E1204 11:01:57.807000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5144705Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5144947Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5145183Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5145427Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5145670Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5145906Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5146123Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5146338Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5146558Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5146802Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5147037Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5147307Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5147551Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5147792Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5148028Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5148234Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5148472Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5148716Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5148950Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5149193Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5149426Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5149655Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5149872Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5150115Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5150346Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5150592Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5150828Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5151071Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5151313Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5151556Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5151803Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5152058Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5152292Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5152537Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5152774Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5152987Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.5153207Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.5153412Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.5153624Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.5153852Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.5154075Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.5154288Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.5154509Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.5154727Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.5154914Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.5155056Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.5155217Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.5155335Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.5155475Z E1204 11:01:57.807000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5155646Z [W1204 11:01:57.301522464 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5155648Z 2025-12-04T12:10:20.5155807Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5156116Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.5156443Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5156589Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5157082Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5157350Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5157591Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5157813Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5158028Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5158270Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5158509Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5158755Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5158998Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5159251Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5159487Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5159728Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5159961Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5160239Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5160473Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5160703Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5160939Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5161154Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5161396Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5161629Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5161833Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5162067Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5162308Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5162543Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5162749Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5162982Z E1204 
11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5163194Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5163409Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5163661Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5163872Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5164076Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5164311Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5164555Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5164794Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5165047Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5165290Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5165501Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5165725Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5165941Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5166183Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5166417Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5166660Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5166895Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5167135Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5167369Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5167610Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5167851Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5168105Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5168442Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5168684Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5168920Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5169162Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5169396Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5169646Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5169889Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5170166Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5170399Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5170641Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5170875Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5171120Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5171353Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5171570Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5171782Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.5172026Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5172260Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5172512Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5172758Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5173004Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5173237Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5173478Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5173711Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5173953Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5174198Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5174451Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5174684Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5174898Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5175110Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5175346Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5175558Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5175780Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.5175996Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5176236Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5176471Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5176682Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5176901Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5177144Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5177356Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5177561Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5177793Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5178037Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5178270Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5178523Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5178767Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5178979Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5179204Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5179419Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5179665Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5179900Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5180168Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5180382Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5180597Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5180841Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5181084Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5181345Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5181590Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5181833Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5182069Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5182311Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5182547Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5182788Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5183040Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5183266Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5183472Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5183708Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5183923Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5184139Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5184356Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5184599Z 
E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5184835Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5185082Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5185318Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5185561Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5185804Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5186263Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5186502Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5186745Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5186982Z 
E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5187201Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5187418Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5187635Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5187871Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5188086Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5188329Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5188566Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5188785Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5188999Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5189218Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5189464Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5189699Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5189942Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5190202Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5190460Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5190711Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5190955Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5191189Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5191434Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5191669Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5191912Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5192160Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5192414Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5192650Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5192891Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5193126Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5193369Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5193608Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5193851Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5194086Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5194300Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5194505Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5194741Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5194995Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5195238Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5195482Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5195719Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5195962Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5196196Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5196439Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5196683Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5196939Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5197175Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5197381Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5197619Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5197862Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5198097Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5198339Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5198573Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5198803Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5199021Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5199235Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5199468Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5199711Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5199946Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5200217Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5200436Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5200648Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5200863Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5201122Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5201372Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5201593Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5201807Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.5202013Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5202177Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.5202415Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5202622Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5202861Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5203107Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5203343Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5203562Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5203778Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5204026Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5204237Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5204443Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5204679Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5204884Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5205120Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5205323Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5205574Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5205787Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5206023Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5206268Z E1204 11:01:57.840000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5206502Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5206746Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5206981Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5207225Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5207463Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5207707Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5207941Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5208191Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5208435Z E1204 11:01:57.840000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5208649Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5208854Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5209090Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5209319Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5209542Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5209755Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5209983Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5210272Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5210507Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5210752Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5210986Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5211193Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5211431Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5211678Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5211912Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5212156Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5212391Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5212633Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5212862Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5213075Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5213282Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5213486Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5213726Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5213970Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5214203Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5214464Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5214711Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5214918Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5215152Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5215396Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5215637Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5215881Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5216117Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5216321Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5216557Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5216799Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5217043Z E1204 11:01:57.840000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5217294Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5217529Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5217759Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5217976Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5218191Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5218407Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5218650Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5218894Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5219109Z E1204 11:01:57.840000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5219344Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5219587Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5219826Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5220071Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5220349Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5220577Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5220794Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5221008Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5221224Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5221479Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5221725Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5221973Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5222209Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5222452Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5222690Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5222895Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5223130Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5223385Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5223633Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5223878Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5224113Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5224342Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5224559Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5224772Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5224992Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5225235Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5225471Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5225714Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5225962Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5226214Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5226448Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5226692Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5226926Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5227168Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5227403Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5227626Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.5227853Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.5228061Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.5228273Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.5228502Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.5228725Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.5228938Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.5229144Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.5229351Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.5229542Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.5229685Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.5229853Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.5229973Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.5230152Z E1204 11:01:57.840000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5230339Z [W1204 11:01:57.439543087 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5230341Z 2025-12-04T12:10:20.5230519Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5230830Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.5231138Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5231285Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5231779Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5232062Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5232315Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5232536Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5232753Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5232995Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5233231Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5233474Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5233708Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5233956Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5234188Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5234429Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5234664Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5234913Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5235157Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5235369Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5235593Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5235807Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5236051Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5236285Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5236498Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5236745Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5236985Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5237219Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5237421Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5237656Z E1204 
11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5237869Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5238073Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5238307Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5238519Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5238723Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5238958Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5239211Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5239455Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5239695Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5239931Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5240181Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5240404Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5240618Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5240859Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5241110Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5241362Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5241596Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5241837Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5242075Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5242315Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5242550Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5242792Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5243024Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5243266Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5243499Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5243751Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5244001Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5244246Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5244480Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5244723Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5244957Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5245198Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5245442Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5245692Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5245925Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5246146Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5246356Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.5246600Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5246834Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5247076Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5247309Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5247548Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5247782Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5248025Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5248276Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5248525Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5248760Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5249005Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5249238Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5249451Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5249653Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5249897Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5250162Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5250385Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.5250601Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5250845Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5251079Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5251290Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5251494Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5251727Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5251940Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5252147Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5252380Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5252637Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5252880Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5253122Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5253354Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5253566Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5253791Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5254006Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5254267Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5254515Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5254735Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5254949Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5255166Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5255412Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5255648Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5255892Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5256128Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5256372Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5256606Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5256850Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5257093Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5257344Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5257581Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5257796Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5258009Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5258244Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5258463Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5258687Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5258913Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5259155Z 
E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5259389Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5259633Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5259867Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5260144Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5260381Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5260622Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5260857Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5261098Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5261333Z 
E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5261566Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5261793Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5262002Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5262228Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5262444Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5262687Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5262924Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5263153Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5263385Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5263601Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5263842Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5264081Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5264324Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5264562Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5264804Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5265041Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5265283Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5265517Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5265761Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5266005Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5266265Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5266500Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5266744Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5266978Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5267221Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5267457Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5267709Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5267954Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5268199Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5268434Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5268647Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5268852Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5269088Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5269330Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5269566Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5269808Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5270043Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5270320Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5270566Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5270819Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5271056Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5271298Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5271534Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5271738Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5271972Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5272228Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5272479Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5272722Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5272958Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5273186Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5273403Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5273617Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5273832Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5274076Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5274314Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5274544Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5274764Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5274985Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5275214Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5275457Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5275695Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5275912Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5276126Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.5276334Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5276507Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.5276753Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5276957Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5277192Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5277434Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5277670Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5277884Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5278087Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5278327Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5278539Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5278744Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5278980Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5279193Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5279437Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5279641Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5279877Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5280081Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5280357Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5280600Z E1204 11:01:57.978000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5280835Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5281100Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5281345Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5281589Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5281822Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5282066Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5282304Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5282550Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5282788Z E1204 11:01:57.978000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5284176Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5284387Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5284624Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5284872Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5285105Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5285321Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5285539Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5285784Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5286020Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5286263Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5286500Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5286714Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5286963Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5287206Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5287447Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5287696Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5287934Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5288166Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5288385Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5288601Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5288808Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5289013Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5289248Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5289516Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5289754Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5289999Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5290276Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5290482Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5290716Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5290961Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5291211Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5291465Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5291707Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5291912Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5292148Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5292391Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5292628Z E1204 11:01:57.978000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5292871Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5293106Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5293336Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5293556Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5293773Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5294020Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5294264Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5294499Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5294705Z E1204 11:01:57.978000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5294944Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5295187Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5295422Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5295678Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5295923Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5296150Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5296368Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5296583Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5296800Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5297043Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5297277Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5297520Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5297757Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5298000Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5298244Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5298469Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5298704Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5298948Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5299183Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5299426Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5299665Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5299896Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5300176Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5300404Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5300619Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5300862Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5301097Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5301341Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5301576Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5301824Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5302058Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5302301Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5302536Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5302789Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5303035Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5303248Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.5303466Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.5303673Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.5303888Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.5304119Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.5304341Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.5304567Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.5304788Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.5304996Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.5305184Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.5305326Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.5305489Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.5305608Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.5305750Z E1204 11:01:57.978000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5305819Z ('RERUN', {'yellow': True}) [1.3206s] [100%] 2025-12-04T12:10:20.5306171Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda [W1204 11:01:58.358989051 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5306174Z 2025-12-04T12:10:20.5306336Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5306646Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.5306956Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5307101Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5307617Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5307893Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5308133Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5308355Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5308573Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5308826Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5309072Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5309316Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5309550Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5309794Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5310029Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5310308Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5310543Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5310785Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5311019Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5311234Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5311458Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5311687Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5311942Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5312178Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5312383Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5312617Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5312859Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5313093Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5313315Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5313562Z E1204 
11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5313776Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5313983Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5314219Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5314432Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5314636Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5314871Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5315114Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5315350Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5315591Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5315827Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5316058Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5316291Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5316507Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5316750Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5316984Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5317226Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5317460Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5317702Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5317945Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5318200Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5318433Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5318675Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5318909Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5319150Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5319384Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5319626Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5319859Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5320138Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5320372Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5320627Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5320872Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5321115Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5321348Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5321593Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5321826Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5322044Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5322271Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.5322525Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5322759Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5323001Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5323234Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5323477Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5323711Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5323954Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5324187Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5324429Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5324662Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5324904Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5325148Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5325373Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5325578Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5325810Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5326023Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5326247Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.5326463Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5326718Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5326965Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5327177Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5327381Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5327618Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5327830Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5328034Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5328271Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5328517Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5328751Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5328995Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5329229Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5329452Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5329686Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5329900Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5330180Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5330418Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5330636Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5330851Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5331066Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5331322Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5331575Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5331823Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5332057Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5332303Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5332539Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5332781Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5333018Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5333260Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5333495Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5333711Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5333928Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5334176Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5334396Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5334614Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5334830Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5335076Z 
E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5335312Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5335553Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5335801Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5336052Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5336287Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5336536Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5336771Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5337014Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5337250Z 
E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5337468Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5337680Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5337888Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5338115Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5338341Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5338594Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5338828Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5339047Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5339261Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5339477Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5339722Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5339957Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5340243Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5340495Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5340737Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5340972Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5341215Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5341450Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5341693Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5341929Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5342170Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5342408Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5342650Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5342900Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5343155Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5343390Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5343633Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5343867Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5344112Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5344348Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5344575Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5344872Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5345106Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5345350Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5345583Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5345826Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5346061Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5346303Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5346542Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5346784Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5347020Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5347263Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5347510Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5347728Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5347966Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5348209Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5348443Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5348689Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5348923Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5349166Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5349395Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5349610Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5349827Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5350068Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5350337Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5350568Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5350788Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5351002Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5351217Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5351460Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5351695Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5351923Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5352148Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.5352357Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5352521Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.5352761Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5352967Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5353202Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5353444Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5353690Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5353916Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5354122Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5354358Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5354573Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5354779Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5355015Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5355220Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5355457Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5355662Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5355897Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5356101Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5356345Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5356599Z E1204 11:01:58.898000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5356836Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5357080Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5357314Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5357559Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5357792Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5358045Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5358290Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5358532Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5358772Z E1204 11:01:58.898000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5358987Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5359191Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5359427Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5359658Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5359877Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5360129Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5360345Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5360588Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5360838Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5361092Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5361327Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5361534Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5361769Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5362012Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5362246Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5362499Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5362748Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5362977Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5363195Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5363408Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5363616Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5363822Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5364057Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5364300Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5364534Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5364783Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5365017Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5365233Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5365486Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5365729Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5365965Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5366208Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5366444Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5366650Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5366895Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5367147Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5367381Z E1204 11:01:58.898000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5367625Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5367858Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5368087Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5368304Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5368519Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5368742Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5368985Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5369221Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5369425Z E1204 11:01:58.898000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5369672Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5369922Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5370200Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5370444Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5370681Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5370913Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5371128Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5371356Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5371585Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5371827Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5372064Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5372305Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5372540Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5372785Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5373022Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5373230Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5373464Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5373707Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5373940Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5374205Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5374438Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5374668Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5374889Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5375103Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5375320Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5375562Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5375807Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5376059Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5376294Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5376537Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5376772Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5377021Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5377254Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5377497Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5377732Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5377946Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.5378166Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.5378371Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.5378604Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.5378837Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.5379061Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.5379274Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.5379481Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.5379691Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.5379878Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.5380030Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.5380232Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.5380366Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.5380505Z E1204 11:01:58.898000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5380678Z [W1204 11:01:58.374062851 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5380681Z 2025-12-04T12:10:20.5380848Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5381157Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.5381468Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5381612Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5382105Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5382373Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5382613Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5382858Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5383087Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5383330Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5383565Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5383809Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5384043Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5384284Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5384529Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5384780Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5385016Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5385258Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5385491Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5385703Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5385927Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5386142Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5386383Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5386617Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5386821Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5387056Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5387306Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5387550Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5387754Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5387988Z E1204 
11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5388200Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5388403Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5388637Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5388849Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5389068Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5389311Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5389553Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5389787Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5390027Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5390327Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5390540Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5390764Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5390984Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5391227Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5391462Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5391703Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5391967Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5392208Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5392442Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5392684Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5392916Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5393162Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5393393Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5393653Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5393896Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5394138Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5394371Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5394611Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5394845Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5395087Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5395321Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5395562Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5395794Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5396036Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5396281Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5396508Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5396718Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.5396961Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5397201Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5397441Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5397675Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5397925Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5398173Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5398416Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5398650Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5398891Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5399125Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5399369Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5399602Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5399815Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5400018Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5400284Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5400502Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5400739Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.5400966Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5401209Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5401444Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5401656Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5401859Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5402094Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5402304Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5402519Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5402763Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5403005Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5403238Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5403483Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5403715Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5403926Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5404149Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5404363Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5404609Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5404843Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5405060Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5405298Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5405512Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5405758Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5405994Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5406239Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5406473Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5406716Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5406965Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5407220Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5407455Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5407699Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5407936Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5408149Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5408356Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5408592Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5408808Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5409024Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5409244Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5409487Z 
E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5409739Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5409982Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5410256Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5410499Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5410735Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5410977Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5411215Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5411472Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5411720Z 
E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5411938Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5412150Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5412358Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5412583Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5412798Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5413041Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5413282Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5413499Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5413713Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5413928Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5414195Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5414430Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5414673Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5414908Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5415151Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5415389Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5415630Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5415881Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5416134Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5416370Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5416615Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5416852Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5417094Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5417336Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5417581Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5417817Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5418059Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5418294Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5418547Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5418791Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5419006Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5419212Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5419451Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5419694Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5419929Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5420217Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5420464Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5420706Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5420942Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5421185Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5421420Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5421669Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5421906Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5422113Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5422349Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5422592Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5422827Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5423081Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5423327Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5423558Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5423776Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5423990Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5424205Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5424449Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5424696Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5424933Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5425151Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5425365Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5425583Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5425827Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5426062Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5426280Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5426492Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.5426699Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5426861Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.5427098Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5427303Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5427560Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5427804Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5428038Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5428254Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5428457Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5428692Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5428903Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5429123Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5429369Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5429577Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5429814Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5430017Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5430284Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5430489Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5430726Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5430971Z E1204 11:01:58.913000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5431208Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5431453Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5431693Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5431949Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5432206Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5432451Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5432686Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5432930Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5433165Z E1204 11:01:58.913000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5433379Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5433598Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5433848Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5434076Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5434295Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5434508Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5434724Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5434968Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5435206Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5435451Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5435688Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5435895Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5436128Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5436382Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5436626Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5436870Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5437104Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5437332Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5437549Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5437767Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5437985Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5438200Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5438436Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5438680Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5438915Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5439157Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5439392Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5439597Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5439834Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5440077Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5440367Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5440609Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5440859Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5441076Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5441312Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5441555Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5441791Z E1204 11:01:58.913000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5442035Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5442269Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5442509Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5442738Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5442952Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5443170Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5443413Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5443650Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5443857Z E1204 11:01:58.913000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5444094Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5444336Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5444571Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5444813Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5445047Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5445285Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5445512Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5445727Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5445944Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5446186Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5446422Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5446663Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5446907Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5447157Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5447393Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5447597Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5447834Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5448078Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5448312Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5448554Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5448789Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5449017Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5449234Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5449447Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5449681Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5449938Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5450204Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5450448Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5450686Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5450930Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5451166Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5451424Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5451670Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5451916Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5452151Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5452363Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.5452579Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.5452784Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.5452995Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.5453223Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.5453445Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.5453656Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.5453867Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.5454086Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.5454285Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.5454427Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.5454587Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.5454707Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.5454848Z E1204 11:01:58.913000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5455020Z [W1204 11:01:58.406621572 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5455023Z 2025-12-04T12:10:20.5455181Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5455491Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.5455810Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5455966Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5456458Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5456724Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5456965Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5457183Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5457399Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5457641Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5457883Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5458125Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5458367Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5458618Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5458850Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5459091Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5459325Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5459565Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5459800Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5460022Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5460269Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5460497Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5460739Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5460973Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5461176Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5461410Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5461650Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5461888Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5462092Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5462326Z E1204 
11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5462538Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5462739Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5462994Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5463205Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5463408Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5463641Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5463884Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5464117Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5464357Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5464601Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5464831Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5465054Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5465267Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5465508Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5465741Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5465986Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5466220Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5466462Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5466696Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5466936Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5467169Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5467429Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5467661Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5467905Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5468139Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5468381Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5468613Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5468854Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5469096Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5469346Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5469579Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5469819Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5470055Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5470330Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5470561Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5470779Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5470989Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.5471231Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5471463Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5471717Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5471963Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5472205Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5472437Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5472677Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5472909Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5473148Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5473392Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5473646Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5473878Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5474091Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5474300Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5474534Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5474744Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5474967Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.5475181Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5475422Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5475655Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5475866Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5476072Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5476325Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5476535Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5476738Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5476970Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5477215Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5477447Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5477689Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5477930Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5478153Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5478380Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5478593Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5478837Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5479072Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5479292Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5479505Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5479722Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5479965Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5480234Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5480477Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5480737Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5480980Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5481214Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5481457Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5481692Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5481934Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5482172Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5482404Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5482625Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5482860Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5483079Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5483296Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5483514Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5483765Z 
E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5484001Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5484246Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5484489Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5484731Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5484971Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5485231Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5485473Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5485718Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5485957Z 
E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5486174Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5486393Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5486600Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5486837Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5487063Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5487307Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5487545Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5487765Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5487979Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5488197Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5488440Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5488680Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5488928Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5489165Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5489409Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5489666Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5489909Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5490178Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5490420Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5490656Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5490902Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5491138Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5491393Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5491639Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5491883Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5492120Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5492360Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5492599Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5492841Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5493077Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5493293Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5493499Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5493735Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5493994Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5494241Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5494487Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5494721Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5494967Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5495203Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5495446Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5495691Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5495946Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5496182Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5496391Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5496625Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5496867Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5497102Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5497347Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5497583Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5497810Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5498029Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5498243Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5498467Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5498728Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5498966Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5499197Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5499415Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5499630Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5499849Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5500124Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5500378Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5500596Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5500813Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.5501020Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5501183Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.5501422Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5501629Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5501864Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5502115Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5502351Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5502564Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5502769Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5503022Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5503246Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5503452Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5503686Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5503890Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5504130Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5504334Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5504583Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5504801Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5505040Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5505282Z E1204 11:01:58.945000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5505522Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5505765Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5506002Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5506244Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5506478Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5506723Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5506958Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5507204Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5507447Z E1204 11:01:58.945000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5507670Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5507881Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5508116Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5508348Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5508567Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5508785Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5509011Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5509263Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5509499Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5509744Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5509980Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5510210Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5510444Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5510692Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5510928Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5511172Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5511406Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5511639Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5511869Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5512098Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5512314Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5512518Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5512753Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5513001Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5513236Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5513491Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5513737Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5513946Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5514181Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5514426Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5514661Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5514909Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5515145Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5515350Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5515584Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5515834Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5516070Z E1204 11:01:58.945000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5516327Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5516574Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5516804Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5517021Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5517236Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5517454Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5517699Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5517941Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5518161Z E1204 11:01:58.945000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5518396Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5518637Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5518873Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5519116Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5519352Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5519579Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5519797Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5520016Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5520269Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5520512Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5520762Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5521019Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5521253Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5521502Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5521739Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5521949Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5522187Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5522443Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5522691Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5522938Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5523173Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5523400Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5523623Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5523838Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5524053Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5524297Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5524538Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5524783Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5525018Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5525275Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5525519Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5525761Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5526000Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5526243Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5526480Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5526693Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.5526918Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.5527138Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.5527348Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.5527579Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.5527800Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.5528013Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.5528222Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.5528428Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.5528616Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.5528757Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.5528923Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.5529040Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.5529182Z E1204 11:01:58.945000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5529353Z [W1204 11:01:59.562129936 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5529356Z 2025-12-04T12:10:20.5529525Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5529846Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.5530190Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5530335Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5530835Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5531103Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5531355Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5531593Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5531810Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5532052Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5532288Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5532533Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5532765Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5533006Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5533238Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5533481Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5533717Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5533970Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5534216Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5534428Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5534652Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5534867Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5535108Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5535342Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5535547Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5535790Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5536040Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5536276Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5536481Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5536714Z E1204 
11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5536926Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5537129Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5537361Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5537573Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5537775Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5538012Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5538254Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5538497Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5538751Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5538985Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5539196Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5539423Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5539642Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5539884Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5540161Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5540481Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5540715Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5540958Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5541198Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5541439Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5541672Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5541914Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5542148Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5542392Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5542624Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5542864Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5543134Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5543376Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5543609Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5543851Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5544087Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5544329Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5544564Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5544821Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5545063Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5545281Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5545493Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.5545735Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5545967Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5546207Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5546443Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5546684Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5546917Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5547158Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5547399Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5547649Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5547882Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5548125Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5548358Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5548576Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5548778Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5549011Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5549237Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5549470Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.5549684Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5549927Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5550319Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5550532Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5550737Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5550970Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5551181Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5551387Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5551622Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5551863Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5552111Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5552366Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5552600Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5552815Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5553036Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5553252Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5553498Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5553748Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5553976Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5554189Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5554406Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5554651Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5554886Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5555129Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5555363Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5555606Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5555841Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5556085Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5556318Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5556578Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5556822Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5557036Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5557242Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5557476Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5557693Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5557905Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5558133Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5558391Z 
E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5558626Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5558868Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5559101Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5559343Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5559577Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5559819Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5560056Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5560340Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5560580Z 
E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5560796Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5561023Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5561243Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5561470Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5561687Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5561931Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5562167Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5562383Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5562608Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5562835Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5563080Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5563321Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5563561Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5563796Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5564036Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5564271Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5564516Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5564756Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5564998Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5565233Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5565503Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5565738Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5565980Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5566216Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5566458Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5566694Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5566939Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5567189Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5567439Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5567674Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5567889Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5568095Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5568330Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5568574Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5568810Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5569051Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5569288Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5569532Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5569785Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5570037Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5570299Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5570542Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5570777Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5570984Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5571222Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5571476Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5571725Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5571969Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5572204Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5572431Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5572649Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5572863Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5573080Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5573322Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5573556Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5573785Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5574001Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5574226Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5574453Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5574696Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5574935Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5575151Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5575366Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.5575572Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5575735Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.5575983Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5576199Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5576435Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5576677Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5576912Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5577127Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5577332Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5577568Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5577780Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5577986Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5578222Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5578428Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5578676Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5578892Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5579128Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5579332Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5579568Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5579811Z E1204 11:01:59.101000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5580050Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5580322Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5580569Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5580812Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5581047Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5581289Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5581523Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5581767Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5582005Z E1204 11:01:59.101000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5582220Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5582424Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5582659Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5582889Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5583129Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5583354Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5583574Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5583817Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5584053Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5584296Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5584532Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5584753Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5584997Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5585238Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5585473Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5585715Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5585949Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5586187Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5586404Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5586618Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5587985Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5588195Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5588434Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5588694Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5588947Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5589194Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5589431Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5589636Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5589871Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5590153Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5590408Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5590667Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5590900Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5591113Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5591348Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5591592Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5591827Z E1204 11:01:59.101000 
743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5592070Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5592307Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5592534Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5592753Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5592969Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5593203Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5593459Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5593695Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5593900Z E1204 11:01:59.101000 743947 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5594135Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5594380Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5594617Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5594869Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5595119Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5595346Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5595565Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5595778Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5595998Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5596244Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5596482Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5596725Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5596959Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5597203Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5597439Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5597658Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5597905Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5598148Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5598385Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5598627Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5598863Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5599095Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5599326Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5599556Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5599771Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5600016Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5600280Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5600525Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5600764Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5601009Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5601245Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5601486Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5601722Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5601964Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5602211Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5602437Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.5602656Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.5602865Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.5603076Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.5603308Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.5603531Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.5603757Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.5603977Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.5604185Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.5604373Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.5604521Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.5604682Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.5604802Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.5604945Z E1204 11:01:59.101000 743947 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5605003Z FAILED [1.1266s] [100%] 2025-12-04T12:10:20.5605005Z 2025-12-04T12:10:20.5605081Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5605240Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5605304Z Traceback (most recent call last): 2025-12-04T12:10:20.5605483Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5605544Z method(*args, **kwargs) 2025-12-04T12:10:20.5605710Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5605770Z method(*args, **kwargs) 2025-12-04T12:10:20.5605935Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5605992Z with policy(): 2025-12-04T12:10:20.5606163Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5606221Z raise RuntimeError(msg) 2025-12-04T12:10:20.5606654Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1086324736. 
2025-12-04T12:10:20.5606658Z 2025-12-04T12:10:20.5606752Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5607031Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5607035Z 2025-12-04T12:10:20.5607139Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5607232Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5607295Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5607372Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5607942Z inductor [('triton_bundler_save_kernel', 280), ('generated_module_cache_miss', 34), ('benchmarking.InductorBenchmarker.benchmark_gpu', 31), ('select_algorithm_num_precompiles', 30), ('select_algorithm_num_precompilation_exceptions', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5608069Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5608136Z graph_break [] 2025-12-04T12:10:20.5608217Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.5608308Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5608819Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5608886Z current_size = base.storage().size() 2025-12-04T12:10:20.5608944Z Autotune Choices Stats: 2025-12-04T12:10:20.5609339Z {"num_choices": 31, "num_triton_choices": 30, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007799999788403511, "best_triton_pos": 0} 2025-12-04T12:10:20.5609422Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5609491Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5609629Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5609889Z triton_mm_31 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5610165Z triton_mm_32 0.0087 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5610405Z triton_mm_19 0.0088 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5610647Z triton_mm_27 0.0088 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5610914Z triton_mm_20 0.0091 ms 85.9% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5611151Z triton_mm_15 0.0096 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5611388Z triton_mm_23 0.0100 ms 77.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5611624Z triton_mm_14 0.0108 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5611866Z triton_mm_28 0.0112 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5612102Z triton_mm_13 0.0116 ms 67.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5612267Z SingleProcess AUTOTUNE benchmarking takes 0.1516 seconds and 8.5387 seconds precompiling for 31 choices 2025-12-04T12:10:20.5612443Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5612507Z Traceback (most recent call last): 2025-12-04T12:10:20.5612681Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5612739Z method(*args, **kwargs) 
2025-12-04T12:10:20.5612907Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5612964Z method(*args, **kwargs) 2025-12-04T12:10:20.5613133Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5613188Z with policy(): 2025-12-04T12:10:20.5613362Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5613420Z raise RuntimeError(msg) 2025-12-04T12:10:20.5613826Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1086324736 and is now 1184890880. 2025-12-04T12:10:20.5613829Z 2025-12-04T12:10:20.5613920Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5614196Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5614199Z 2025-12-04T12:10:20.5614303Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5614396Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5614457Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5614530Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5615115Z inductor [('triton_bundler_save_kernel', 280), ('generated_module_cache_miss', 34), ('benchmarking.InductorBenchmarker.benchmark_gpu', 31), ('select_algorithm_num_precompiles', 30), ('select_algorithm_num_precompilation_exceptions', 4), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5615232Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5615287Z graph_break [] 2025-12-04T12:10:20.5615366Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.5615457Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5615961Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5616026Z current_size = base.storage().size() 2025-12-04T12:10:20.5616083Z Autotune Choices Stats: 2025-12-04T12:10:20.5616469Z {"num_choices": 31, "num_triton_choices": 30, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007799999788403511, "best_triton_pos": 0} 2025-12-04T12:10:20.5616568Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5616647Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5616786Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5617037Z triton_mm_31 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:20.5617281Z triton_mm_32 0.0087 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5617519Z triton_mm_19 0.0088 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5617760Z triton_mm_27 0.0088 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5617997Z triton_mm_20 0.0091 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5618234Z triton_mm_15 0.0096 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5618472Z triton_mm_23 0.0100 ms 77.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5618709Z triton_mm_14 0.0108 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5619881Z triton_mm_28 0.0112 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5620183Z triton_mm_13 0.0116 ms 67.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5620330Z SingleProcess AUTOTUNE benchmarking takes 0.1516 seconds and 8.5387 seconds precompiling for 31 choices 2025-12-04T12:10:20.5620423Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5620482Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5620556Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5620672Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5621183Z inductor [('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5621238Z graph_break [] 2025-12-04T12:10:20.5621318Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.5621408Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5621806Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.5621914Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.5621973Z Autotune Choices Stats: 2025-12-04T12:10:20.5622356Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_53", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0074800001457333565, "best_triton_pos": 0} 2025-12-04T12:10:20.5622439Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5622509Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5622647Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5622891Z triton_mm_53 0.0075 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5623136Z triton_mm_65 0.0076 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5623196Z _scaled_mm 0.0080 ms 93.0% 2025-12-04T12:10:20.5623436Z triton_mm_61 0.0081 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5623677Z triton_mm_66 0.0085 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5623913Z triton_mm_49 0.0091 ms 82.0% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5624208Z triton_mm_54 0.0094 ms 79.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5624454Z triton_mm_57 0.0101 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5624698Z triton_mm_47 0.0104 ms 71.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5624938Z triton_mm_42 0.0108 ms 69.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5625083Z SingleProcess AUTOTUNE benchmarking takes 0.2119 seconds and 0.4468 seconds precompiling for 35 choices 2025-12-04T12:10:20.5625154Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5625310Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5625374Z Traceback (most recent call last): 2025-12-04T12:10:20.5625547Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5625618Z method(*args, **kwargs) 2025-12-04T12:10:20.5625784Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5625842Z method(*args, **kwargs) 2025-12-04T12:10:20.5626008Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5626064Z with policy(): 2025-12-04T12:10:20.5626232Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5626290Z raise RuntimeError(msg) 2025-12-04T12:10:20.5626697Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1184890880 and is now 1283457024. 2025-12-04T12:10:20.5626702Z 2025-12-04T12:10:20.5626794Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5627068Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5627072Z 2025-12-04T12:10:20.5627174Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5627267Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5627326Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5627399Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5627970Z inductor [('triton_bundler_save_kernel', 280), ('generated_module_cache_miss', 34), ('benchmarking.InductorBenchmarker.benchmark_gpu', 31), ('select_algorithm_num_precompiles', 30), ('select_algorithm_num_precompilation_exceptions', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), 
('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5628085Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5628159Z graph_break [] 2025-12-04T12:10:20.5628237Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.5628336Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5628845Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5628911Z current_size = base.storage().size() 2025-12-04T12:10:20.5628968Z Autotune Choices Stats: 2025-12-04T12:10:20.5629353Z {"num_choices": 31, "num_triton_choices": 30, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007799999788403511, "best_triton_pos": 0} 2025-12-04T12:10:20.5629434Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5629503Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5629640Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5629890Z triton_mm_31 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:20.5630180Z triton_mm_32 0.0087 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5630418Z triton_mm_19 0.0088 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5630658Z triton_mm_27 0.0088 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5630896Z triton_mm_20 0.0091 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5631137Z triton_mm_15 0.0096 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5631374Z triton_mm_23 0.0100 ms 77.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5631611Z triton_mm_14 0.0108 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5631849Z triton_mm_28 0.0112 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5632086Z triton_mm_13 0.0116 ms 67.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5632230Z SingleProcess AUTOTUNE benchmarking takes 0.1516 seconds and 8.5387 seconds precompiling for 31 choices 2025-12-04T12:10:20.5632335Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5632395Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5632485Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5632612Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5633115Z inductor [('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5633170Z graph_break [] 2025-12-04T12:10:20.5633248Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.5633338Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5633720Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.5633827Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.5633885Z Autotune Choices Stats: 2025-12-04T12:10:20.5634273Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_53", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0074800001457333565, "best_triton_pos": 0} 2025-12-04T12:10:20.5634354Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5634420Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5634556Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5634804Z triton_mm_53 0.0075 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5635045Z triton_mm_65 0.0076 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5635192Z _scaled_mm 0.0080 ms 93.0% 2025-12-04T12:10:20.5635430Z triton_mm_61 0.0081 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5635672Z triton_mm_66 0.0085 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5635909Z triton_mm_49 0.0091 ms 82.0% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5636147Z triton_mm_54 0.0094 ms 79.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5636387Z triton_mm_57 0.0101 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5636648Z triton_mm_47 0.0104 ms 71.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5636900Z triton_mm_42 0.0108 ms 69.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5637046Z SingleProcess AUTOTUNE benchmarking takes 0.2119 seconds and 0.4468 seconds precompiling for 35 choices 2025-12-04T12:10:20.5637136Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5637194Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5637268Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5637382Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5637884Z inductor [('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 
1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5637938Z graph_break [] 2025-12-04T12:10:20.5638016Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:20.5638120Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5638177Z Autotune Choices Stats: 2025-12-04T12:10:20.5638554Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_87", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007519999984651804, "best_triton_pos": 0} 2025-12-04T12:10:20.5638634Z AUTOTUNE scaled_mm(33x1024, 1024x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5638701Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5638836Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5639081Z triton_mm_87 0.0075 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5639326Z triton_mm_99 0.0078 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5639563Z triton_mm_95 0.0080 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5639808Z triton_mm_100 0.0081 ms 93.1% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5640046Z triton_mm_88 0.0090 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5640322Z triton_mm_83 0.0095 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5640557Z triton_mm_81 0.0101 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5640834Z triton_mm_91 0.0101 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5641076Z triton_mm_76 0.0108 ms 69.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5641313Z triton_mm_82 0.0109 ms 69.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5641457Z SingleProcess AUTOTUNE benchmarking takes 0.2158 seconds and 0.2852 seconds precompiling for 35 choices 2025-12-04T12:10:20.5641663Z - generated xml file: 
/var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-698796e13e91426e.xml - 2025-12-04T12:10:20.5641741Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5642341Z FAILED [1.1266s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1184890880 and is now 1283457024. 2025-12-04T12:10:20.5642360Z 2025-12-04T12:10:20.5642450Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5642723Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5642725Z 2025-12-04T12:10:20.5642828Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5642906Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.5642993Z ================= 1 failed, 187 deselected, 2 rerun in 13.44s ================== 2025-12-04T12:10:20.5643048Z Got exit code 1 2025-12-04T12:10:20.5643275Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5643419Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.5643578Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-fa20ef993e294552.xml 2025-12-04T12:10:20.5643655Z ============================= test session starts ============================== 2025-12-04T12:10:20.5643784Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5643842Z cachedir: .pytest_cache 2025-12-04T12:10:20.5644018Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5644081Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5644139Z configfile: pytest.ini 2025-12-04T12:10:20.5644319Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5644411Z collecting ... collected 188 items / 97 deselected / 91 selected 2025-12-04T12:10:20.5644479Z stepcurrent: skipping 97 already run items. 
2025-12-04T12:10:20.5644540Z Running 91 items in this shard 2025-12-04T12:10:20.5644542Z 2025-12-04T12:10:20.5644779Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7790s] [ 1%] 2025-12-04T12:10:20.5645010Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3390s] [ 1%] 2025-12-04T12:10:20.5645216Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3125s] [ 1%] 2025-12-04T12:10:20.5645219Z 2025-12-04T12:10:20.5645289Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5645441Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5645504Z Traceback (most recent call last): 2025-12-04T12:10:20.5645678Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5645737Z method(*args, **kwargs) 2025-12-04T12:10:20.5645906Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5645963Z method(*args, **kwargs) 2025-12-04T12:10:20.5646130Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5646184Z with policy(): 2025-12-04T12:10:20.5646355Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5646424Z raise RuntimeError(msg) 2025-12-04T12:10:20.5646818Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.5646822Z 2025-12-04T12:10:20.5646912Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5647182Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5647185Z 2025-12-04T12:10:20.5647286Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5647376Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5647437Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5647511Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5647594Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5647709Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5647763Z graph_break [] 2025-12-04T12:10:20.5647842Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5647996Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5648058Z Traceback (most recent call last): 2025-12-04T12:10:20.5648227Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5648283Z method(*args, **kwargs) 2025-12-04T12:10:20.5648449Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5648507Z method(*args, **kwargs) 2025-12-04T12:10:20.5648672Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5648726Z with policy(): 2025-12-04T12:10:20.5648893Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5648971Z raise RuntimeError(msg) 2025-12-04T12:10:20.5649387Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.5649390Z 2025-12-04T12:10:20.5649479Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5649751Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5649753Z 2025-12-04T12:10:20.5649855Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5649944Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5650007Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5650079Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5650192Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5650306Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5650360Z graph_break [] 2025-12-04T12:10:20.5650436Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5650525Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5650599Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5650670Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5650781Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5650861Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5650913Z graph_break [] 2025-12-04T12:10:20.5650989Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5651057Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5651212Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5651273Z Traceback (most recent call last): 2025-12-04T12:10:20.5651442Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5651500Z method(*args, **kwargs) 2025-12-04T12:10:20.5651666Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5651722Z method(*args, **kwargs) 2025-12-04T12:10:20.5651888Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5651941Z with policy(): 2025-12-04T12:10:20.5652109Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5652166Z raise RuntimeError(msg) 2025-12-04T12:10:20.5652563Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.5652567Z 2025-12-04T12:10:20.5652657Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5652922Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5652924Z 2025-12-04T12:10:20.5653027Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5653132Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5653192Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5653275Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5653357Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5653481Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5653536Z graph_break [] 2025-12-04T12:10:20.5653611Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5653702Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5653759Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5653831Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5653941Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5654022Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5654075Z graph_break [] 2025-12-04T12:10:20.5654150Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5654239Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5654298Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5654369Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5654480Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5654573Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5654627Z graph_break [] 2025-12-04T12:10:20.5654701Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5654911Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-fa20ef993e294552.xml - 2025-12-04T12:10:20.5654988Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5655568Z FAILED [0.3125s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.5655570Z 2025-12-04T12:10:20.5655663Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5655929Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5655932Z 2025-12-04T12:10:20.5656033Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5656111Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.5656198Z ================== 1 failed, 97 deselected, 2 rerun in 2.45s =================== 2025-12-04T12:10:20.5656252Z Got exit code 1 2025-12-04T12:10:20.5656310Z Retrying single test... 
2025-12-04T12:10:20.5656471Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e5948140b00e04d6.xml 2025-12-04T12:10:20.5656543Z ============================= test session starts ============================== 2025-12-04T12:10:20.5656671Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5656728Z cachedir: .pytest_cache 2025-12-04T12:10:20.5656902Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5656964Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5657022Z configfile: pytest.ini 2025-12-04T12:10:20.5657214Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5657315Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.5657585Z stepcurrent: skipping 97 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5657646Z Running 1 items in this shard 2025-12-04T12:10:20.5657648Z 2025-12-04T12:10:20.5657872Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7432s] [100%] 2025-12-04T12:10:20.5658092Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3322s] [100%] 2025-12-04T12:10:20.5658288Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2977s] [100%] 2025-12-04T12:10:20.5658293Z 2025-12-04T12:10:20.5658361Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5658514Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5658577Z Traceback (most recent call last): 2025-12-04T12:10:20.5658750Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5658820Z method(*args, **kwargs) 2025-12-04T12:10:20.5658987Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5659043Z method(*args, **kwargs) 2025-12-04T12:10:20.5659213Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5659267Z with policy(): 2025-12-04T12:10:20.5659434Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5659491Z raise RuntimeError(msg) 
2025-12-04T12:10:20.5659886Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.5659891Z 2025-12-04T12:10:20.5659980Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5660280Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5660283Z 2025-12-04T12:10:20.5660385Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5660474Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5660534Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5660606Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5660689Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5660802Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5660857Z graph_break [] 2025-12-04T12:10:20.5660932Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5661085Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5661146Z Traceback (most recent call last): 2025-12-04T12:10:20.5661315Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5661387Z method(*args, **kwargs) 2025-12-04T12:10:20.5661565Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.5661621Z method(*args, **kwargs) 2025-12-04T12:10:20.5661802Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5661855Z with policy(): 2025-12-04T12:10:20.5662022Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5662080Z raise RuntimeError(msg) 2025-12-04T12:10:20.5662473Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.5662476Z 2025-12-04T12:10:20.5662566Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5662832Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5662834Z 2025-12-04T12:10:20.5662937Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5663025Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5663105Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5663176Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5663257Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5663370Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5663423Z graph_break [] 2025-12-04T12:10:20.5663499Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5663588Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.5663646Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5663718Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5663830Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5663910Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5663965Z graph_break [] 2025-12-04T12:10:20.5664039Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5664107Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5664264Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5664325Z Traceback (most recent call last): 2025-12-04T12:10:20.5664493Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5664550Z method(*args, **kwargs) 2025-12-04T12:10:20.5664717Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5664773Z method(*args, **kwargs) 2025-12-04T12:10:20.5664938Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5664991Z with policy(): 2025-12-04T12:10:20.5665159Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5665216Z raise RuntimeError(msg) 2025-12-04T12:10:20.5665610Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.5665625Z 2025-12-04T12:10:20.5665726Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5666003Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5666005Z 2025-12-04T12:10:20.5666108Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5666198Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5666257Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5666329Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5666410Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5666521Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5666576Z graph_break [] 2025-12-04T12:10:20.5666650Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5666740Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5666797Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5666869Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5666980Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5667073Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5667126Z graph_break [] 2025-12-04T12:10:20.5667200Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5667288Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5667346Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5667416Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5667527Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5667607Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5667661Z graph_break [] 2025-12-04T12:10:20.5667735Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5667939Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e5948140b00e04d6.xml - 2025-12-04T12:10:20.5668015Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5668594Z FAILED [0.2977s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.5668597Z 2025-12-04T12:10:20.5668686Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5668953Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5668955Z 2025-12-04T12:10:20.5669057Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5669136Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.5669219Z ================== 1 failed, 187 deselected, 2 rerun in 2.39s ================== 2025-12-04T12:10:20.5669274Z Got exit code 1 2025-12-04T12:10:20.5669330Z Retrying single test... 
2025-12-04T12:10:20.5669490Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b4389690d64cb9eb.xml 2025-12-04T12:10:20.5669578Z ============================= test session starts ============================== 2025-12-04T12:10:20.5669714Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5669771Z cachedir: .pytest_cache 2025-12-04T12:10:20.5669955Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5670017Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5670078Z configfile: pytest.ini 2025-12-04T12:10:20.5670294Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5670386Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.5670647Z stepcurrent: skipping 97 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5670710Z Running 1 items in this shard 2025-12-04T12:10:20.5670712Z 2025-12-04T12:10:20.5670936Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [19.0698s] [100%] 2025-12-04T12:10:20.5671157Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.4133s] [100%] 2025-12-04T12:10:20.5671354Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3877s] [100%] 2025-12-04T12:10:20.5671373Z 2025-12-04T12:10:20.5671440Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5671596Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5671657Z Traceback (most recent call last): 2025-12-04T12:10:20.5671836Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5671894Z method(*args, **kwargs) 2025-12-04T12:10:20.5672061Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5672117Z method(*args, **kwargs) 2025-12-04T12:10:20.5672284Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5672338Z with policy(): 2025-12-04T12:10:20.5672505Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5672562Z raise RuntimeError(msg) 
2025-12-04T12:10:20.5672957Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.5672960Z 2025-12-04T12:10:20.5673050Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5673318Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5673321Z 2025-12-04T12:10:20.5673423Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5673511Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5673570Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5673641Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5673723Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5673852Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5673906Z graph_break [] 2025-12-04T12:10:20.5673993Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5674146Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5674219Z Traceback (most recent call last): 2025-12-04T12:10:20.5674390Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5674446Z method(*args, **kwargs) 2025-12-04T12:10:20.5674612Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.5674668Z method(*args, **kwargs) 2025-12-04T12:10:20.5674833Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5674888Z with policy(): 2025-12-04T12:10:20.5675055Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5675113Z raise RuntimeError(msg) 2025-12-04T12:10:20.5675507Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.5675523Z 2025-12-04T12:10:20.5675613Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5675878Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5675880Z 2025-12-04T12:10:20.5675983Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5676072Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5676132Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5676204Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5676285Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5676397Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5676451Z graph_break [] 2025-12-04T12:10:20.5676526Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5676616Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.5676673Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5676744Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5676854Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5676935Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5676987Z graph_break [] 2025-12-04T12:10:20.5677062Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5677130Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5677284Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5677345Z Traceback (most recent call last): 2025-12-04T12:10:20.5677515Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5677571Z method(*args, **kwargs) 2025-12-04T12:10:20.5677741Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5677796Z method(*args, **kwargs) 2025-12-04T12:10:20.5677962Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5678030Z with policy(): 2025-12-04T12:10:20.5678207Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5678266Z raise RuntimeError(msg) 2025-12-04T12:10:20.5678674Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.5678677Z 2025-12-04T12:10:20.5678767Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5679034Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5679037Z 2025-12-04T12:10:20.5679139Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5679228Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5679287Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5679360Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5679442Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5679555Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5679622Z graph_break [] 2025-12-04T12:10:20.5679696Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5679785Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5679843Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5679914Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5680026Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5680144Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5680197Z graph_break [] 2025-12-04T12:10:20.5680272Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5680360Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5680419Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5680491Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5680603Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5680683Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5680735Z graph_break [] 2025-12-04T12:10:20.5680808Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:20.5681016Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b4389690d64cb9eb.xml - 2025-12-04T12:10:20.5681095Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5681674Z FAILED [0.3877s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.5681677Z 2025-12-04T12:10:20.5681766Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5682031Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5682051Z 2025-12-04T12:10:20.5682151Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5682241Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.5682325Z ================= 1 failed, 187 deselected, 2 rerun in 19.89s ================== 2025-12-04T12:10:20.5682394Z Got exit code 1 2025-12-04T12:10:20.5682609Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5682754Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.5682911Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7961a457c9592e94.xml 2025-12-04T12:10:20.5682984Z ============================= test session starts ============================== 2025-12-04T12:10:20.5683111Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5683169Z cachedir: .pytest_cache 2025-12-04T12:10:20.5683343Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5683406Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5683463Z configfile: pytest.ini 2025-12-04T12:10:20.5683639Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5683741Z collecting ... collected 188 items / 98 deselected / 90 selected 2025-12-04T12:10:20.5683810Z stepcurrent: skipping 98 already run items. 
2025-12-04T12:10:20.5683870Z Running 90 items in this shard 2025-12-04T12:10:20.5683872Z 2025-12-04T12:10:20.5684101Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [13.9834s] [ 1%] 2025-12-04T12:10:20.5684329Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3766s] [ 1%] 2025-12-04T12:10:20.5684530Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3451s] [ 1%] 2025-12-04T12:10:20.5684532Z 2025-12-04T12:10:20.5684600Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5684754Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5684816Z Traceback (most recent call last): 2025-12-04T12:10:20.5684987Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5685046Z method(*args, **kwargs) 2025-12-04T12:10:20.5685212Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5685271Z method(*args, **kwargs) 2025-12-04T12:10:20.5685438Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5685492Z with policy(): 2025-12-04T12:10:20.5685658Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5685716Z raise RuntimeError(msg) 2025-12-04T12:10:20.5686112Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.5686116Z 2025-12-04T12:10:20.5686205Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5686499Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5686502Z 2025-12-04T12:10:20.5686604Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5686705Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5686765Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5686838Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5686919Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5687032Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5687084Z graph_break [] 2025-12-04T12:10:20.5687161Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5687316Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5687379Z Traceback (most recent call last): 2025-12-04T12:10:20.5687551Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5687608Z method(*args, **kwargs) 2025-12-04T12:10:20.5687773Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5687830Z method(*args, **kwargs) 2025-12-04T12:10:20.5688005Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5688058Z with policy(): 2025-12-04T12:10:20.5688225Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5688283Z raise RuntimeError(msg) 2025-12-04T12:10:20.5688681Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.5688684Z 2025-12-04T12:10:20.5688773Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5689043Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5689048Z 2025-12-04T12:10:20.5689149Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5689240Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5689299Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5689372Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5689453Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5689568Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5689620Z graph_break [] 2025-12-04T12:10:20.5689698Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5689787Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5689845Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5689917Z 
stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5690028Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5690138Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5690191Z graph_break [] 2025-12-04T12:10:20.5690267Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5690358Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5690511Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5690586Z Traceback (most recent call last): 2025-12-04T12:10:20.5690765Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5690824Z method(*args, **kwargs) 2025-12-04T12:10:20.5690990Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5691047Z method(*args, **kwargs) 2025-12-04T12:10:20.5691213Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5691265Z with policy(): 2025-12-04T12:10:20.5691432Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5691490Z raise RuntimeError(msg) 2025-12-04T12:10:20.5691893Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.5691895Z 2025-12-04T12:10:20.5691984Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5692268Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5692271Z 2025-12-04T12:10:20.5692371Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5692460Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5692520Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5692592Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5692672Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5692786Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5692839Z graph_break [] 2025-12-04T12:10:20.5692916Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5693004Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5693068Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5693139Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5693250Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5693330Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5693382Z graph_break [] 2025-12-04T12:10:20.5693460Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5693548Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5693606Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5693677Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5693788Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5693866Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5693920Z graph_break [] 2025-12-04T12:10:20.5693995Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5694199Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7961a457c9592e94.xml - 2025-12-04T12:10:20.5694274Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5694884Z FAILED [0.3451s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.5694899Z 2025-12-04T12:10:20.5694988Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5695256Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5695259Z 2025-12-04T12:10:20.5695360Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5695437Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.5695521Z ================== 1 failed, 98 deselected, 2 rerun in 14.73s ================== 2025-12-04T12:10:20.5695575Z Got exit code 1 2025-12-04T12:10:20.5695632Z Retrying single test... 
2025-12-04T12:10:20.5695791Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-934104de21e5b48a.xml 2025-12-04T12:10:20.5695866Z ============================= test session starts ============================== 2025-12-04T12:10:20.5695991Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5696061Z cachedir: .pytest_cache 2025-12-04T12:10:20.5696237Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5696299Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5696356Z configfile: pytest.ini 2025-12-04T12:10:20.5696533Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5696624Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.5696888Z stepcurrent: skipping 98 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5696949Z Running 1 items in this shard 2025-12-04T12:10:20.5696951Z 2025-12-04T12:10:20.5697176Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [14.1505s] [100%] 2025-12-04T12:10:20.5697399Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3151s] [100%] 2025-12-04T12:10:20.5697597Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2188s] [100%] 2025-12-04T12:10:20.5697601Z 2025-12-04T12:10:20.5697669Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5697822Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5697884Z Traceback (most recent call last): 2025-12-04T12:10:20.5698056Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5698114Z method(*args, **kwargs) 2025-12-04T12:10:20.5698280Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5698338Z method(*args, **kwargs) 2025-12-04T12:10:20.5698504Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5698557Z with policy(): 2025-12-04T12:10:20.5698734Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5698791Z raise RuntimeError(msg) 
2025-12-04T12:10:20.5699205Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.5699210Z 2025-12-04T12:10:20.5699300Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5699572Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5699574Z 2025-12-04T12:10:20.5699675Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5699765Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5699824Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5699897Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5699977Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5700205Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5700258Z graph_break [] 2025-12-04T12:10:20.5700336Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5700506Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5700567Z Traceback (most recent call last): 2025-12-04T12:10:20.5700734Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5700792Z method(*args, **kwargs) 2025-12-04T12:10:20.5700957Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.5701013Z method(*args, **kwargs) 2025-12-04T12:10:20.5701179Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5701233Z with policy(): 2025-12-04T12:10:20.5701400Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5701460Z raise RuntimeError(msg) 2025-12-04T12:10:20.5701855Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.5701857Z 2025-12-04T12:10:20.5701946Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5702216Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5702219Z 2025-12-04T12:10:20.5702320Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5702411Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5702470Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5702542Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5702622Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5702735Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5702787Z graph_break [] 2025-12-04T12:10:20.5702864Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5702968Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.5703028Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5703111Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5703224Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5703322Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5703376Z graph_break [] 2025-12-04T12:10:20.5703451Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5703521Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5703673Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5703736Z Traceback (most recent call last): 2025-12-04T12:10:20.5703904Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5703961Z method(*args, **kwargs) 2025-12-04T12:10:20.5704127Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5704183Z method(*args, **kwargs) 2025-12-04T12:10:20.5704348Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5704401Z with policy(): 2025-12-04T12:10:20.5704567Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5704635Z raise RuntimeError(msg) 2025-12-04T12:10:20.5705033Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.5705037Z 2025-12-04T12:10:20.5705125Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5705395Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5705398Z 2025-12-04T12:10:20.5705498Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5705590Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5705648Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5705720Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5705801Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5705913Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5705967Z graph_break [] 2025-12-04T12:10:20.5706042Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5706133Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5706191Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5706263Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5706374Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5706455Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5706508Z graph_break [] 2025-12-04T12:10:20.5706586Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5706674Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5706732Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5706808Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5706918Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5707011Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5707064Z graph_break [] 2025-12-04T12:10:20.5707149Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5707370Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-934104de21e5b48a.xml - 2025-12-04T12:10:20.5707446Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5708031Z FAILED [0.2188s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.5708036Z 2025-12-04T12:10:20.5708124Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5708394Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5708397Z 2025-12-04T12:10:20.5708498Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5708575Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.5708673Z ================= 1 failed, 187 deselected, 2 rerun in 14.70s ================== 2025-12-04T12:10:20.5708727Z Got exit code 1 2025-12-04T12:10:20.5708784Z Retrying single test... 
2025-12-04T12:10:20.5708943Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d071ce36b1a217f4.xml 2025-12-04T12:10:20.5709017Z ============================= test session starts ============================== 2025-12-04T12:10:20.5709142Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5709200Z cachedir: .pytest_cache 2025-12-04T12:10:20.5709373Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5709434Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5709491Z configfile: pytest.ini 2025-12-04T12:10:20.5709667Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5709761Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.5710024Z stepcurrent: skipping 98 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5710084Z Running 1 items in this shard 2025-12-04T12:10:20.5710086Z 2025-12-04T12:10:20.5710363Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [13.5183s] [100%] 2025-12-04T12:10:20.5710589Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3003s] [100%] 2025-12-04T12:10:20.5710789Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2385s] [100%] 2025-12-04T12:10:20.5710792Z 2025-12-04T12:10:20.5710859Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5711012Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5711075Z Traceback (most recent call last): 2025-12-04T12:10:20.5711275Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5711333Z method(*args, **kwargs) 2025-12-04T12:10:20.5711511Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5711580Z method(*args, **kwargs) 2025-12-04T12:10:20.5711746Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5711801Z with policy(): 2025-12-04T12:10:20.5711968Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5712026Z raise RuntimeError(msg) 
2025-12-04T12:10:20.5712421Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.5712425Z 2025-12-04T12:10:20.5712514Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5712785Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5712787Z 2025-12-04T12:10:20.5712901Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5712992Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5713052Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5713126Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5713207Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5713323Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5713375Z graph_break [] 2025-12-04T12:10:20.5713455Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5713608Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5713673Z Traceback (most recent call last): 2025-12-04T12:10:20.5713840Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5713901Z method(*args, **kwargs) 2025-12-04T12:10:20.5714066Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.5714125Z method(*args, **kwargs) 2025-12-04T12:10:20.5714289Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5714344Z with policy(): 2025-12-04T12:10:20.5714509Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5714568Z raise RuntimeError(msg) 2025-12-04T12:10:20.5714963Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.5714966Z 2025-12-04T12:10:20.5715054Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5715325Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5715327Z 2025-12-04T12:10:20.5715428Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5715529Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5715587Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5715668Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5715749Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5715871Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5715925Z graph_break [] 2025-12-04T12:10:20.5716004Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5716092Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.5716151Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5716222Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5716334Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5716415Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5716468Z graph_break [] 2025-12-04T12:10:20.5716544Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5716613Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5716768Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5716829Z Traceback (most recent call last): 2025-12-04T12:10:20.5716998Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5717065Z method(*args, **kwargs) 2025-12-04T12:10:20.5717231Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5717286Z method(*args, **kwargs) 2025-12-04T12:10:20.5717451Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5717506Z with policy(): 2025-12-04T12:10:20.5717673Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5717730Z raise RuntimeError(msg) 2025-12-04T12:10:20.5718125Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.5718128Z 2025-12-04T12:10:20.5718216Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5718491Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5718493Z 2025-12-04T12:10:20.5718596Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5718686Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5718745Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5718817Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5718897Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5719009Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5719064Z graph_break [] 2025-12-04T12:10:20.5719140Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5719229Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5719286Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5719357Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5719479Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5719559Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5719622Z graph_break [] 2025-12-04T12:10:20.5719699Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5719799Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5719858Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5719928Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5720039Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5720154Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.5720208Z graph_break [] 2025-12-04T12:10:20.5720282Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:20.5720488Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d071ce36b1a217f4.xml - 2025-12-04T12:10:20.5720565Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5721155Z FAILED [0.2385s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.5721175Z 2025-12-04T12:10:20.5721264Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5721530Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5721533Z 2025-12-04T12:10:20.5721635Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5721713Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.5721799Z ================= 1 failed, 187 deselected, 2 rerun in 14.08s ================== 2025-12-04T12:10:20.5721853Z Got exit code 1 2025-12-04T12:10:20.5722070Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5722212Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.5722370Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-083e5d241cdc215d.xml 2025-12-04T12:10:20.5722442Z ============================= test session starts ============================== 2025-12-04T12:10:20.5722569Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5722627Z cachedir: .pytest_cache 2025-12-04T12:10:20.5722800Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5722861Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5722918Z configfile: pytest.ini 2025-12-04T12:10:20.5723093Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5723184Z collecting ... collected 188 items / 99 deselected / 89 selected 2025-12-04T12:10:20.5723253Z stepcurrent: skipping 99 already run items. 
2025-12-04T12:10:20.5723313Z Running 89 items in this shard 2025-12-04T12:10:20.5723315Z 2025-12-04T12:10:20.5723541Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0201s] [ 1%] 2025-12-04T12:10:20.5723776Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.5756s] [ 1%] 2025-12-04T12:10:20.5723984Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda FAILED [0.7651s] [ 1%] 2025-12-04T12:10:20.5723998Z 2025-12-04T12:10:20.5724066Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5724221Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5724282Z Traceback (most recent call last): 2025-12-04T12:10:20.5724454Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5724511Z method(*args, **kwargs) 2025-12-04T12:10:20.5724679Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5724736Z method(*args, **kwargs) 2025-12-04T12:10:20.5724903Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5724955Z with policy(): 2025-12-04T12:10:20.5725123Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5725181Z raise RuntimeError(msg) 2025-12-04T12:10:20.5725595Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1023410176. 2025-12-04T12:10:20.5725597Z 2025-12-04T12:10:20.5725686Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5725956Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5725958Z 2025-12-04T12:10:20.5726064Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5726154Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5726214Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5726286Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5726784Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5726899Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5726953Z graph_break [] 2025-12-04T12:10:20.5727031Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5727125Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5727627Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5727693Z current_size = base.storage().size() 2025-12-04T12:10:20.5727752Z Autotune Choices Stats: 2025-12-04T12:10:20.5728148Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.5728233Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5728308Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5728447Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5728698Z triton_mm_2 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5728941Z triton_mm_0 0.0068 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5729181Z triton_mm_3 0.0068 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5729417Z triton_mm_1 0.0069 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5729486Z _scaled_mm 0.0252 ms 24.6% 2025-12-04T12:10:20.5729629Z SingleProcess AUTOTUNE benchmarking takes 0.0306 seconds and 0.1448 seconds precompiling for 5 choices 2025-12-04T12:10:20.5729784Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5729846Z Traceback (most recent call last): 2025-12-04T12:10:20.5730017Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5730075Z method(*args, **kwargs) 2025-12-04T12:10:20.5730283Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5730340Z method(*args, **kwargs) 2025-12-04T12:10:20.5730507Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5730561Z with policy(): 2025-12-04T12:10:20.5730729Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5730787Z raise RuntimeError(msg) 2025-12-04T12:10:20.5731184Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1023410176 and is now 1059061760. 
2025-12-04T12:10:20.5731188Z 2025-12-04T12:10:20.5731277Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5731545Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5731547Z 2025-12-04T12:10:20.5731651Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5731743Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5731802Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5731874Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5732385Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5732515Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5732579Z graph_break [] 2025-12-04T12:10:20.5732656Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5732745Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5733242Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5733307Z current_size = base.storage().size() 2025-12-04T12:10:20.5733367Z Autotune Choices Stats: 2025-12-04T12:10:20.5733745Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.5733818Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5733897Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5734035Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5734281Z triton_mm_2 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5734521Z triton_mm_0 0.0068 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5734760Z triton_mm_3 0.0068 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5734996Z triton_mm_1 0.0069 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5735059Z _scaled_mm 0.0252 ms 24.6% 2025-12-04T12:10:20.5735201Z SingleProcess AUTOTUNE 
benchmarking takes 0.0306 seconds and 0.1448 seconds precompiling for 5 choices 2025-12-04T12:10:20.5735290Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5735350Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5735422Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5735537Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5736030Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5736085Z graph_break [] 2025-12-04T12:10:20.5736161Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5736250Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5736318Z Autotune Choices Stats: 2025-12-04T12:10:20.5736699Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007840000092983246, "best_triton_pos": 0} 2025-12-04T12:10:20.5736780Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5736845Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5736981Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5737222Z triton_mm_4 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5737459Z triton_mm_7 0.0083 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5737697Z triton_mm_6 0.0084 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5737934Z triton_mm_5 0.0084 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5738002Z _scaled_mm 0.0263 ms 29.8% 2025-12-04T12:10:20.5738152Z SingleProcess AUTOTUNE benchmarking takes 0.0294 seconds and 0.0875 seconds precompiling for 5 choices 2025-12-04T12:10:20.5738220Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5738374Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5738435Z Traceback (most recent call last): 2025-12-04T12:10:20.5738607Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5738664Z method(*args, **kwargs) 2025-12-04T12:10:20.5738832Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5738888Z method(*args, **kwargs) 2025-12-04T12:10:20.5739055Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5739108Z with policy(): 
2025-12-04T12:10:20.5739280Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5739338Z raise RuntimeError(msg) 2025-12-04T12:10:20.5739734Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.5739737Z 2025-12-04T12:10:20.5739828Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5740137Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5740140Z 2025-12-04T12:10:20.5740245Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5740333Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5740393Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5740480Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5741002Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5741117Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5741172Z graph_break [] 2025-12-04T12:10:20.5741247Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 
2025-12-04T12:10:20.5741336Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5741830Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5741897Z current_size = base.storage().size() 2025-12-04T12:10:20.5741955Z Autotune Choices Stats: 2025-12-04T12:10:20.5742332Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.5742417Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5742481Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5742616Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5742861Z triton_mm_2 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5743099Z triton_mm_0 0.0068 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5743336Z triton_mm_3 0.0068 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5743570Z triton_mm_1 0.0069 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5743629Z _scaled_mm 0.0252 ms 24.6% 2025-12-04T12:10:20.5743771Z SingleProcess AUTOTUNE benchmarking takes 0.0306 seconds and 0.1448 seconds precompiling for 5 choices 2025-12-04T12:10:20.5743862Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5743920Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5743993Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5744111Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5744611Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5744680Z graph_break [] 2025-12-04T12:10:20.5744755Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5744845Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5744910Z Autotune Choices Stats: 2025-12-04T12:10:20.5745293Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007840000092983246, "best_triton_pos": 0} 
2025-12-04T12:10:20.5745365Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5745432Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5745567Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5745811Z triton_mm_4 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5746048Z triton_mm_7 0.0083 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5746284Z triton_mm_6 0.0084 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5746532Z triton_mm_5 0.0084 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5746589Z _scaled_mm 0.0263 ms 29.8% 2025-12-04T12:10:20.5746732Z SingleProcess AUTOTUNE benchmarking takes 0.0294 seconds and 0.0875 seconds precompiling for 5 choices 2025-12-04T12:10:20.5746822Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5746882Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5746953Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5747068Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5747555Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5747610Z graph_break [] 2025-12-04T12:10:20.5747685Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5747776Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5747832Z Autotune Choices Stats: 2025-12-04T12:10:20.5748204Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.5748275Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5748341Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5748476Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5748718Z triton_mm_9 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5748976Z triton_mm_8 0.0068 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5749232Z triton_mm_11 0.0080 ms 77.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5749474Z triton_mm_10 0.0102 ms 60.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5749532Z _scaled_mm 0.0249 ms 24.9% 2025-12-04T12:10:20.5749710Z SingleProcess AUTOTUNE benchmarking takes 0.0375 seconds and 0.1993 seconds precompiling for 5 choices 2025-12-04T12:10:20.5749918Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-083e5d241cdc215d.xml - 2025-12-04T12:10:20.5750049Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5750728Z FAILED [0.7651s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.5750754Z 2025-12-04T12:10:20.5750859Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5751127Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5751130Z 2025-12-04T12:10:20.5751235Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5751320Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.5751410Z ================== 1 failed, 99 deselected, 2 rerun in 3.38s =================== 2025-12-04T12:10:20.5751464Z Got exit code 1 2025-12-04T12:10:20.5751521Z Retrying single test... 2025-12-04T12:10:20.5751683Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2541e34c85607636.xml 2025-12-04T12:10:20.5751756Z ============================= test session starts ============================== 2025-12-04T12:10:20.5751882Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5751940Z cachedir: .pytest_cache 2025-12-04T12:10:20.5752113Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5752176Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5752234Z configfile: pytest.ini 2025-12-04T12:10:20.5752414Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5752505Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.5752769Z stepcurrent: skipping 99 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5752828Z Running 1 items in this shard 2025-12-04T12:10:20.5752832Z 2025-12-04T12:10:20.5753056Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [14.6925s] [100%] 2025-12-04T12:10:20.5753298Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7946s] [100%] 2025-12-04T12:10:20.5753505Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda FAILED [0.8492s] [100%] 2025-12-04T12:10:20.5753507Z 2025-12-04T12:10:20.5753588Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5753742Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5753807Z Traceback (most recent call last): 2025-12-04T12:10:20.5753982Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5754041Z method(*args, **kwargs) 2025-12-04T12:10:20.5754207Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5754268Z method(*args, **kwargs) 2025-12-04T12:10:20.5754433Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5754488Z with policy(): 2025-12-04T12:10:20.5754654Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5754713Z raise RuntimeError(msg) 
2025-12-04T12:10:20.5755112Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1023410176. 2025-12-04T12:10:20.5755128Z 2025-12-04T12:10:20.5755220Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5755491Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5755494Z 2025-12-04T12:10:20.5755597Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5755688Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5755746Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5755819Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5756312Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5756427Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5756481Z graph_break [] 2025-12-04T12:10:20.5756557Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5756647Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5757143Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5757210Z current_size = base.storage().size() 2025-12-04T12:10:20.5757266Z Autotune Choices Stats: 2025-12-04T12:10:20.5757659Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:20.5757815Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5757929Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5758082Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5758329Z triton_mm_1 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5758571Z triton_mm_0 0.0060 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5758808Z triton_mm_3 0.0066 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5759043Z triton_mm_2 0.0079 ms 74.7% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5759100Z _scaled_mm 0.0216 ms 27.4% 2025-12-04T12:10:20.5759256Z SingleProcess AUTOTUNE benchmarking takes 0.0289 seconds and 0.1540 seconds precompiling for 5 choices 2025-12-04T12:10:20.5759409Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5759472Z Traceback (most recent call last): 2025-12-04T12:10:20.5759643Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5759701Z method(*args, **kwargs) 2025-12-04T12:10:20.5759868Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5759925Z method(*args, **kwargs) 2025-12-04T12:10:20.5760128Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5760208Z with policy(): 2025-12-04T12:10:20.5760376Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5760435Z raise RuntimeError(msg) 2025-12-04T12:10:20.5760833Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1023410176 and is now 1059061760. 
2025-12-04T12:10:20.5760836Z 2025-12-04T12:10:20.5760926Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5761195Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5761198Z 2025-12-04T12:10:20.5761300Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5761391Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5761449Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5761525Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5762032Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5762173Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5762229Z graph_break [] 2025-12-04T12:10:20.5762318Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5762409Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5762908Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5762973Z current_size = base.storage().size() 2025-12-04T12:10:20.5763029Z Autotune Choices Stats: 2025-12-04T12:10:20.5763414Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:20.5763484Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5763568Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5763705Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5763950Z triton_mm_1 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5764190Z triton_mm_0 0.0060 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5764427Z triton_mm_3 0.0066 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5764662Z triton_mm_2 0.0079 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5764720Z _scaled_mm 0.0216 ms 27.4% 2025-12-04T12:10:20.5764864Z SingleProcess AUTOTUNE 
benchmarking takes 0.0289 seconds and 0.1540 seconds precompiling for 5 choices 2025-12-04T12:10:20.5764952Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5765012Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5765086Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5765202Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5765691Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5765745Z graph_break [] 2025-12-04T12:10:20.5765822Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5765911Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5765968Z Autotune Choices Stats: 2025-12-04T12:10:20.5766364Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005998999811708927, "best_triton_pos": 0} 2025-12-04T12:10:20.5766445Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5766511Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5766646Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5766889Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5767127Z triton_mm_5 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5767365Z triton_mm_7 0.0065 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5767600Z triton_mm_6 0.0097 ms 62.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5767668Z _scaled_mm 0.0236 ms 25.5% 2025-12-04T12:10:20.5767809Z SingleProcess AUTOTUNE benchmarking takes 0.0292 seconds and 0.1185 seconds precompiling for 5 choices 2025-12-04T12:10:20.5767878Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5768030Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5768094Z Traceback (most recent call last): 2025-12-04T12:10:20.5768267Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5768324Z method(*args, **kwargs) 2025-12-04T12:10:20.5768507Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5768564Z method(*args, **kwargs) 2025-12-04T12:10:20.5768729Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5768784Z with policy(): 
2025-12-04T12:10:20.5768955Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5769013Z raise RuntimeError(msg) 2025-12-04T12:10:20.5769409Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.5769413Z 2025-12-04T12:10:20.5769570Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5769899Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5769902Z 2025-12-04T12:10:20.5770004Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5770147Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5770206Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5770279Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5770820Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5770940Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5770995Z graph_break [] 2025-12-04T12:10:20.5771070Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 
2025-12-04T12:10:20.5771160Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5771652Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5771728Z current_size = base.storage().size() 2025-12-04T12:10:20.5771787Z Autotune Choices Stats: 2025-12-04T12:10:20.5772169Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:20.5772254Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5772317Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5772453Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5772696Z triton_mm_1 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5772937Z triton_mm_0 0.0060 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5773172Z triton_mm_3 0.0066 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5773425Z triton_mm_2 0.0079 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5773482Z _scaled_mm 0.0216 ms 27.4% 2025-12-04T12:10:20.5773626Z SingleProcess AUTOTUNE benchmarking takes 0.0289 seconds and 0.1540 seconds precompiling for 5 choices 2025-12-04T12:10:20.5773715Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5773774Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5773853Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5773969Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5774485Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5774539Z graph_break [] 2025-12-04T12:10:20.5774640Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5774728Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5774789Z Autotune Choices Stats: 2025-12-04T12:10:20.5775197Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005998999811708927, "best_triton_pos": 0} 
2025-12-04T12:10:20.5775270Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5775334Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5775470Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5775719Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5775959Z triton_mm_5 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5776201Z triton_mm_7 0.0065 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5776457Z triton_mm_6 0.0097 ms 62.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5776514Z _scaled_mm 0.0236 ms 25.5% 2025-12-04T12:10:20.5776656Z SingleProcess AUTOTUNE benchmarking takes 0.0292 seconds and 0.1185 seconds precompiling for 5 choices 2025-12-04T12:10:20.5776745Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5776803Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5776876Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5776991Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5777481Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5777536Z graph_break [] 2025-12-04T12:10:20.5777610Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5777715Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5777771Z Autotune Choices Stats: 2025-12-04T12:10:20.5778144Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_10", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:20.5778213Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5778279Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5778413Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5778674Z triton_mm_10 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5778942Z triton_mm_11 0.0064 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5779193Z triton_mm_8 0.0066 ms 94.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5779446Z triton_mm_9 0.0080 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5779503Z _scaled_mm 0.0248 ms 25.4% 2025-12-04T12:10:20.5779645Z SingleProcess AUTOTUNE benchmarking takes 0.0349 seconds and 0.3342 seconds precompiling for 5 choices 2025-12-04T12:10:20.5779847Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2541e34c85607636.xml - 2025-12-04T12:10:20.5779924Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5787866Z FAILED [0.8492s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.5787919Z 2025-12-04T12:10:20.5788026Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5788308Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5788314Z 2025-12-04T12:10:20.5788426Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5788509Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.5788598Z ================= 1 failed, 187 deselected, 2 rerun in 16.36s ================== 2025-12-04T12:10:20.5788653Z Got exit code 1 2025-12-04T12:10:20.5788710Z Retrying single test... 2025-12-04T12:10:20.5788897Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c8d12b21c847ec10.xml 2025-12-04T12:10:20.5788973Z ============================= test session starts ============================== 2025-12-04T12:10:20.5789104Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5789162Z cachedir: .pytest_cache 2025-12-04T12:10:20.5789336Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5789402Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5789460Z configfile: pytest.ini 2025-12-04T12:10:20.5789646Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5789741Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.5790006Z stepcurrent: skipping 99 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5790068Z Running 1 items in this shard 2025-12-04T12:10:20.5790071Z 2025-12-04T12:10:20.5790377Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0051s] [100%] 2025-12-04T12:10:20.5790597Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.5638s] [100%] 2025-12-04T12:10:20.5790837Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda FAILED [0.7932s] [100%] 2025-12-04T12:10:20.5790839Z 2025-12-04T12:10:20.5790922Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5791079Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5791155Z Traceback (most recent call last): 2025-12-04T12:10:20.5791333Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5791392Z method(*args, **kwargs) 2025-12-04T12:10:20.5791561Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5791618Z method(*args, **kwargs) 2025-12-04T12:10:20.5791785Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5791839Z with policy(): 2025-12-04T12:10:20.5792010Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5792067Z raise RuntimeError(msg) 
2025-12-04T12:10:20.5792467Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1023410176. 2025-12-04T12:10:20.5792483Z 2025-12-04T12:10:20.5792577Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5792847Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5792850Z 2025-12-04T12:10:20.5792955Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5793046Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5793106Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5793180Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5793683Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5793799Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5793854Z graph_break [] 2025-12-04T12:10:20.5793931Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5794023Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5794527Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5794593Z current_size = base.storage().size() 2025-12-04T12:10:20.5794652Z Autotune Choices Stats: 2025-12-04T12:10:20.5795034Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.5795133Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5795198Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5795361Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5795609Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5795852Z triton_mm_0 0.0068 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5796090Z triton_mm_2 0.0068 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5796326Z triton_mm_3 0.0094 ms 66.4% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5796385Z _scaled_mm 0.0243 ms 25.7% 2025-12-04T12:10:20.5796539Z SingleProcess AUTOTUNE benchmarking takes 0.0263 seconds and 0.1427 seconds precompiling for 5 choices 2025-12-04T12:10:20.5796696Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5796759Z Traceback (most recent call last): 2025-12-04T12:10:20.5796931Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5796988Z method(*args, **kwargs) 2025-12-04T12:10:20.5797156Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5797212Z method(*args, **kwargs) 2025-12-04T12:10:20.5797381Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5797436Z with policy(): 2025-12-04T12:10:20.5797606Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5797664Z raise RuntimeError(msg) 2025-12-04T12:10:20.5798063Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1023410176 and is now 1059061760. 
2025-12-04T12:10:20.5798067Z 2025-12-04T12:10:20.5798157Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5798428Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5798432Z 2025-12-04T12:10:20.5798534Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5798624Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5798685Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5798758Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5799253Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5799389Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5799444Z graph_break [] 2025-12-04T12:10:20.5799531Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5799621Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5800183Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5800247Z current_size = base.storage().size() 2025-12-04T12:10:20.5800306Z Autotune Choices Stats: 2025-12-04T12:10:20.5800686Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.5800757Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5800821Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5800978Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5801226Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5801468Z triton_mm_0 0.0068 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5801708Z triton_mm_2 0.0068 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5801943Z triton_mm_3 0.0094 ms 66.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5802002Z _scaled_mm 0.0243 ms 25.7% 2025-12-04T12:10:20.5802144Z SingleProcess AUTOTUNE 
benchmarking takes 0.0263 seconds and 0.1427 seconds precompiling for 5 choices 2025-12-04T12:10:20.5802232Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5802291Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5802366Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5802481Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5802976Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5803031Z graph_break [] 2025-12-04T12:10:20.5803106Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5803195Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5803254Z Autotune Choices Stats: 2025-12-04T12:10:20.5803643Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:20.5803739Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5803807Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5803943Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5804188Z triton_mm_5 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5804425Z triton_mm_6 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5804666Z triton_mm_4 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5804901Z triton_mm_7 0.0080 ms 74.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5804968Z _scaled_mm 0.0237 ms 25.1% 2025-12-04T12:10:20.5805111Z SingleProcess AUTOTUNE benchmarking takes 0.0277 seconds and 0.1270 seconds precompiling for 5 choices 2025-12-04T12:10:20.5805179Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5805334Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5805397Z Traceback (most recent call last): 2025-12-04T12:10:20.5805571Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5805628Z method(*args, **kwargs) 2025-12-04T12:10:20.5805797Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5805853Z method(*args, **kwargs) 2025-12-04T12:10:20.5806020Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5806075Z with policy(): 
2025-12-04T12:10:20.5806242Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5806300Z raise RuntimeError(msg) 2025-12-04T12:10:20.5806700Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.5806703Z 2025-12-04T12:10:20.5806793Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5807063Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5807066Z 2025-12-04T12:10:20.5807172Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5807261Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5807321Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5807395Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5807916Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5808042Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5808098Z graph_break [] 2025-12-04T12:10:20.5808178Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 
2025-12-04T12:10:20.5808267Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5808762Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5808828Z current_size = base.storage().size() 2025-12-04T12:10:20.5808887Z Autotune Choices Stats: 2025-12-04T12:10:20.5809262Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.5809346Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5809409Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5809544Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5809787Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5810028Z triton_mm_0 0.0068 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5810300Z triton_mm_2 0.0068 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5810537Z triton_mm_3 0.0094 ms 66.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5810595Z _scaled_mm 0.0243 ms 25.7% 2025-12-04T12:10:20.5810738Z SingleProcess AUTOTUNE benchmarking takes 0.0263 seconds and 0.1427 seconds precompiling for 5 choices 2025-12-04T12:10:20.5810827Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5810886Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5810959Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5811074Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5811566Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5811621Z graph_break [] 2025-12-04T12:10:20.5811698Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5811807Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5811865Z Autotune Choices Stats: 2025-12-04T12:10:20.5812271Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 
2025-12-04T12:10:20.5812343Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5812408Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5812542Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5812787Z triton_mm_5 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5813026Z triton_mm_6 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5813262Z triton_mm_4 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5813509Z triton_mm_7 0.0080 ms 74.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5813567Z _scaled_mm 0.0237 ms 25.1% 2025-12-04T12:10:20.5813709Z SingleProcess AUTOTUNE benchmarking takes 0.0277 seconds and 0.1270 seconds precompiling for 5 choices 2025-12-04T12:10:20.5813799Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5813858Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5813933Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5814047Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5814536Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5814591Z graph_break [] 2025-12-04T12:10:20.5814666Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:20.5814755Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5814812Z Autotune Choices Stats: 2025-12-04T12:10:20.5815183Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005840000230818987, "best_triton_pos": 0} 2025-12-04T12:10:20.5815253Z AUTOTUNE scaled_mm(33x32, 32x16, 33x1, 1x16, 16) 2025-12-04T12:10:20.5815318Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5815453Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5815706Z triton_mm_8 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5815970Z triton_mm_10 0.0060 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.5816227Z triton_mm_11 0.0063 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5816468Z triton_mm_9 0.0080 ms 73.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5816528Z _scaled_mm 0.0248 ms 23.5% 2025-12-04T12:10:20.5816670Z SingleProcess AUTOTUNE benchmarking takes 0.0324 seconds and 0.3721 seconds precompiling for 5 choices 2025-12-04T12:10:20.5816875Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c8d12b21c847ec10.xml - 2025-12-04T12:10:20.5816954Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5817544Z FAILED [0.7932s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.5817556Z 2025-12-04T12:10:20.5817646Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5817915Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5817918Z 2025-12-04T12:10:20.5818022Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5818101Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.5818185Z ================== 1 failed, 187 deselected, 2 rerun in 3.38s ================== 2025-12-04T12:10:20.5818241Z Got exit code 1 2025-12-04T12:10:20.5818458Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.5818602Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.5818762Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c34da80779c63702.xml 2025-12-04T12:10:20.5818838Z ============================= test session starts ============================== 2025-12-04T12:10:20.5818968Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5819025Z cachedir: .pytest_cache 2025-12-04T12:10:20.5819199Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5819263Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5819321Z configfile: pytest.ini 2025-12-04T12:10:20.5819499Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5819596Z collecting ... collected 188 items / 100 deselected / 88 selected 2025-12-04T12:10:20.5819667Z stepcurrent: skipping 100 already run items. 2025-12-04T12:10:20.5819728Z Running 88 items in this shard 2025-12-04T12:10:20.5819730Z 2025-12-04T12:10:20.5820715Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmpbcqwy47q/s4/cs4mld3v3vcyfd2cahnfnqkn3bbzysti74qv5d77ijrgevwcynat.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8) 2025-12-04T12:10:20.5820895Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.5821130Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.5821302Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.5821608Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.5821758Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.5822031Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.5822199Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.5822474Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in 
precompile 2025-12-04T12:10:20.5822649Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.5822933Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.5823084Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.5823372Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.5823581Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.5823911Z E1204 11:04:57.874000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.5824654Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpbcqwy47q/oa/coagq4qmcfjfq7oqp54hgwy3jxjlmmo55426spsbvks43ck2iiom.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.5824833Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.5825076Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.5825257Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.5825557Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.5825704Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.5825976Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.5826128Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.5826396Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.5826565Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.5826859Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.5827006Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.5827304Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.5827512Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.5827839Z E1204 11:04:57.919000 760428 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.5827909Z ('RERUN', {'yellow': True}) [14.2571s] [ 1%] 2025-12-04T12:10:20.5828237Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda E1204 11:04:59.002000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5828549Z E1204 11:04:59.002000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.5828692Z E1204 11:04:59.002000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.5828853Z E1204 11:04:59.017000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5829160Z E1204 11:04:59.017000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.5829320Z E1204 11:04:59.017000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5829388Z ('RERUN', {'yellow': True}) [1.0286s] [ 1%] 2025-12-04T12:10:20.5829735Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda E1204 11:05:00.040000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5830041Z E1204 11:05:00.040000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.5830238Z E1204 11:05:00.040000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5830397Z E1204 11:05:00.054000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5830704Z E1204 11:05:00.054000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.5830847Z E1204 11:05:00.054000 760428 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.5830903Z FAILED [1.0261s] [ 1%] 2025-12-04T12:10:20.5830905Z 2025-12-04T12:10:20.5830976Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.5831146Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5831208Z Traceback (most recent call last): 2025-12-04T12:10:20.5831380Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5831440Z method(*args, **kwargs) 2025-12-04T12:10:20.5831608Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5831668Z method(*args, **kwargs) 2025-12-04T12:10:20.5831838Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5831893Z with policy(): 2025-12-04T12:10:20.5832060Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5832118Z raise RuntimeError(msg) 2025-12-04T12:10:20.5832519Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1048576000. 
2025-12-04T12:10:20.5832523Z 2025-12-04T12:10:20.5832614Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5832892Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5832894Z 2025-12-04T12:10:20.5833002Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5833093Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5833154Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5833227Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5833800Z inductor [('triton_bundler_save_kernel', 136), ('generated_module_cache_miss', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 15), ('select_algorithm_num_precompiles', 14), ('select_algorithm_num_precompilation_exceptions', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5833951Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5834006Z graph_break [] 2025-12-04T12:10:20.5834097Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.5834188Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5834688Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5834753Z current_size = base.storage().size() 2025-12-04T12:10:20.5834812Z Autotune Choices Stats: 2025-12-04T12:10:20.5835199Z {"num_choices": 15, "num_triton_choices": 14, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:20.5835281Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5835346Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5835496Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5835747Z triton_mm_15 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5835990Z triton_mm_12 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5836236Z triton_mm_6 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5836477Z triton_mm_9 0.0065 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5836714Z triton_mm_5 0.0067 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5836950Z triton_mm_8 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5837187Z triton_mm_11 0.0072 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5837426Z triton_mm_13 0.0076 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5837670Z triton_mm_10 0.0078 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5837931Z triton_mm_1 0.0081 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.5838077Z SingleProcess AUTOTUNE benchmarking takes 0.0718 seconds and 0.4048 seconds precompiling for 15 choices 2025-12-04T12:10:20.5838241Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5838304Z Traceback (most recent call last): 2025-12-04T12:10:20.5838477Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5838536Z method(*args, **kwargs) 
2025-12-04T12:10:20.5838708Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5838764Z method(*args, **kwargs) 2025-12-04T12:10:20.5838931Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5838988Z with policy(): 2025-12-04T12:10:20.5839159Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.5839215Z raise RuntimeError(msg) 2025-12-04T12:10:20.5839620Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1048576000 and is now 1109393408. 2025-12-04T12:10:20.5839636Z 2025-12-04T12:10:20.5839727Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5839997Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5840001Z 2025-12-04T12:10:20.5840140Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5840231Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5840291Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5840366Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5840934Z inductor [('triton_bundler_save_kernel', 136), ('generated_module_cache_miss', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 15), ('select_algorithm_num_precompiles', 14), ('select_algorithm_num_precompilation_exceptions', 2), ('fxgraph_cache_miss', 1), 
('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5841050Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5841104Z graph_break [] 2025-12-04T12:10:20.5841185Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.5841275Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5841775Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5841840Z current_size = base.storage().size() 2025-12-04T12:10:20.5841901Z Autotune Choices Stats: 2025-12-04T12:10:20.5842289Z {"num_choices": 15, "num_triton_choices": 14, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:20.5842409Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5842475Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5842624Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5842874Z triton_mm_15 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:20.5843114Z triton_mm_12 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5843356Z triton_mm_6 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5843598Z triton_mm_9 0.0065 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5843844Z triton_mm_5 0.0067 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5844107Z triton_mm_8 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5844345Z triton_mm_11 0.0072 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5844585Z triton_mm_13 0.0076 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5844822Z triton_mm_10 0.0078 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5845063Z triton_mm_1 0.0081 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.5845208Z SingleProcess AUTOTUNE benchmarking takes 0.0718 seconds and 0.4048 seconds precompiling for 15 choices 2025-12-04T12:10:20.5845302Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5845361Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5845435Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5845550Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5846055Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5846110Z graph_break [] 2025-12-04T12:10:20.5846187Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.5846304Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5846361Z Autotune Choices Stats: 2025-12-04T12:10:20.5846755Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_23", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.5846833Z AUTOTUNE 
scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5846898Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5847033Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5847281Z triton_mm_23 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5847522Z triton_mm_24 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5847760Z triton_mm_31 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5848011Z triton_mm_22 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5848247Z triton_mm_27 0.0068 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5848488Z triton_mm_18 0.0068 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5848724Z triton_mm_21 0.0069 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5848961Z triton_mm_28 0.0075 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5849203Z triton_mm_25 0.0077 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5849445Z triton_mm_17 0.0082 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.5849589Z SingleProcess AUTOTUNE benchmarking takes 0.1011 seconds and 0.3365 seconds precompiling for 17 choices 2025-12-04T12:10:20.5849658Z =================================== FAILURES =================================== 2025-12-04T12:10:20.5849813Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.5849875Z Traceback (most recent call last): 2025-12-04T12:10:20.5850050Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5850140Z method(*args, **kwargs) 2025-12-04T12:10:20.5850329Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.5850386Z method(*args, **kwargs) 2025-12-04T12:10:20.5850572Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.5850626Z with policy(): 2025-12-04T12:10:20.5850807Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
2705, in __exit__ 2025-12-04T12:10:20.5850867Z raise RuntimeError(msg) 2025-12-04T12:10:20.5851264Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1170210816. 2025-12-04T12:10:20.5851266Z 2025-12-04T12:10:20.5851358Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5851629Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5851631Z 2025-12-04T12:10:20.5851736Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5851824Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5851884Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5851970Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5852533Z inductor [('triton_bundler_save_kernel', 136), ('generated_module_cache_miss', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 15), ('select_algorithm_num_precompiles', 14), ('select_algorithm_num_precompilation_exceptions', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5852649Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5852702Z graph_break [] 2025-12-04T12:10:20.5852780Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.5852871Z ----------------------------- Captured stderr call 
----------------------------- 2025-12-04T12:10:20.5853374Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.5853438Z current_size = base.storage().size() 2025-12-04T12:10:20.5853497Z Autotune Choices Stats: 2025-12-04T12:10:20.5853880Z {"num_choices": 15, "num_triton_choices": 14, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:20.5853959Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5854023Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5854160Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5854408Z triton_mm_15 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5854647Z triton_mm_12 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5854918Z triton_mm_6 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:20.5855157Z triton_mm_9 0.0065 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5855394Z triton_mm_5 0.0067 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5855630Z triton_mm_8 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5855870Z triton_mm_11 0.0072 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5856112Z triton_mm_13 0.0076 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5856363Z triton_mm_10 0.0078 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5856601Z triton_mm_1 0.0081 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.5856747Z SingleProcess AUTOTUNE benchmarking takes 0.0718 seconds and 0.4048 seconds precompiling for 15 choices 2025-12-04T12:10:20.5856839Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:20.5856898Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5856970Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5857086Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5857585Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5857640Z graph_break [] 2025-12-04T12:10:20.5857719Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.5857808Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5857865Z Autotune Choices Stats: 2025-12-04T12:10:20.5858240Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_23", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.5858316Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5858380Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5858518Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5858802Z triton_mm_23 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5859071Z triton_mm_24 
0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5859310Z triton_mm_31 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5859547Z triton_mm_22 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5859789Z triton_mm_27 0.0068 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5860031Z triton_mm_18 0.0068 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5860326Z triton_mm_21 0.0069 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5860562Z triton_mm_28 0.0075 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5860809Z triton_mm_25 0.0077 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:20.5861047Z triton_mm_17 0.0082 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.5861192Z SingleProcess AUTOTUNE benchmarking takes 0.1011 seconds and 0.3365 seconds precompiling for 17 choices 2025-12-04T12:10:20.5861280Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.5861339Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.5861411Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.5861525Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.5862023Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.5862076Z graph_break [] 2025-12-04T12:10:20.5862156Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.5862245Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.5862302Z Autotune Choices Stats: 2025-12-04T12:10:20.5862679Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_37", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:20.5862793Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.5862857Z 
strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.5863005Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.5863249Z triton_mm_37 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5863490Z triton_mm_39 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5863727Z triton_mm_40 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5863966Z triton_mm_44 0.0064 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5864202Z triton_mm_43 0.0065 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.5864455Z triton_mm_41 0.0067 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5864692Z triton_mm_34 0.0068 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5864933Z 
triton_mm_45 0.0068 ms 88.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5865173Z triton_mm_42 0.0069 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5865415Z triton_mm_47 0.0074 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.5865558Z SingleProcess AUTOTUNE benchmarking takes 0.1114 seconds and 0.3784 seconds precompiling for 17 choices 2025-12-04T12:10:20.5865765Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c34da80779c63702.xml - 2025-12-04T12:10:20.5865841Z =========================== short test summary info ============================ 2025-12-04T12:10:20.5866432Z FAILED [1.0261s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1170210816. 
2025-12-04T12:10:20.5866436Z 2025-12-04T12:10:20.5866526Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.5866794Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5866810Z 2025-12-04T12:10:20.5866923Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.5867011Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.5867097Z ================= 1 failed, 100 deselected, 2 rerun in 16.33s ================== 2025-12-04T12:10:20.5867152Z Got exit code 1 2025-12-04T12:10:20.5867209Z Retrying single test... 2025-12-04T12:10:20.5867366Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-41e927ba8746edb0.xml 2025-12-04T12:10:20.5867440Z ============================= test session starts ============================== 2025-12-04T12:10:20.5867567Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.5867627Z cachedir: .pytest_cache 2025-12-04T12:10:20.5867801Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.5867866Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.5867923Z configfile: pytest.ini 2025-12-04T12:10:20.5868105Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.5868195Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.5868470Z stepcurrent: skipping 100 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.5868532Z Running 1 items in this shard 2025-12-04T12:10:20.5868534Z 2025-12-04T12:10:20.5868878Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:05:23.639791449 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5868881Z 2025-12-04T12:10:20.5869214Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.5869523Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5869674Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5870226Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5870495Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5870736Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from 
static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5870960Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5871191Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5871461Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5871700Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5871944Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5872178Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5872421Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5872654Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5872894Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 
2025-12-04T12:10:20.5873141Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5873380Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5873618Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5873860Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5874094Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5874333Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5874567Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5874773Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5875009Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5875251Z E1204 
11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5875482Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5875723Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5875965Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5876212Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5876446Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5876685Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5876918Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5877134Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.5877360Z E1204 11:05:30.616000 765856 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.5877548Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.5877743Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.5878284Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpq68u7hh8/s4/cs4mld3v3vcyfd2cahnfnqkn3bbzysti74qv5d77ijrgevwcynat.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8) 2025-12-04T12:10:20.5878445Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.5878677Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.5878847Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.5879152Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.5879300Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.5879571Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.5879727Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.5880000Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.5880236Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.5880542Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.5880693Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.5880982Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.5881190Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.5881522Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.5881830Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5881993Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5882489Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5882760Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5883000Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5883222Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5883438Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5883685Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5883922Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5884164Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5884400Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5884640Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5884896Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5885155Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5885391Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5885633Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5885868Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5886114Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5886348Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5886604Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5886836Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5887042Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5887276Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5887517Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5887749Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5887953Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5888186Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5888432Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5888665Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5888907Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5889140Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5889380Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.5889615Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.5889791Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.5889984Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.5890140Z E1204 11:05:30.616000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.5890317Z [W1204 11:05:30.089313459 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.5890319Z 2025-12-04T12:10:20.5890648Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.5890953Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5891117Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5891611Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5891879Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5892119Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5892339Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5892556Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5892799Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5893034Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5893278Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5893511Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5893785Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5894030Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5894273Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5894506Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5894747Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5894982Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5895224Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5895467Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5895706Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5895941Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5896145Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5896379Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5896620Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5896852Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5897058Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5897290Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5897530Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5897762Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5898004Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5898261Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5898488Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.5898713Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.5898890Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.5899083Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.5899619Z E1204 11:05:30.628000 765856 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpq68u7hh8/oa/coagq4qmcfjfq7oqp54hgwy3jxjlmmo55426spsbvks43ck2iiom.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.5899781Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.5900022Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.5900240Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.5900542Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.5900689Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.5900959Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.5901113Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.5901382Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.5901553Z E1204 11:05:30.628000 765856 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.5901837Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.5901985Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.5902273Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.5902485Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.5902844Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.5903165Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5903311Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5903799Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5904068Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5904317Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5904549Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5904762Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5905006Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5905242Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5905482Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5905716Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5905956Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5906190Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5906433Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5906666Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5906906Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5907151Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5907411Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5907644Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5907885Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5908118Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5908325Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5908559Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5908799Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5909051Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5909255Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5909491Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5909733Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5909964Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5910252Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5910483Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5910703Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.5910926Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.5911101Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.5911294Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.5911411Z E1204 11:05:30.628000 765856 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.5911494Z ('RERUN', {'yellow': True}) [23.7606s] [100%] 2025-12-04T12:10:20.5911861Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:05:31.367917854 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5911864Z 2025-12-04T12:10:20.5912025Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5912333Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.5912639Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5912787Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5913276Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5913557Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.5913801Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5914023Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] 
#7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5914240Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5914482Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5914718Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5914960Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5915194Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5915435Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5915669Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5915909Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5916163Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5916413Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5916647Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5916861Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5917084Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5917302Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5917544Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5917785Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5917988Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5918221Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.5918464Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5918696Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5918901Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5919134Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5919346Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5919549Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5919781Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5919992Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5920237Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5920492Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5920757Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5920990Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5921234Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5921466Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5921678Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5921902Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5922116Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5922370Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5922602Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5922843Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5923075Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5923316Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5923549Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5923792Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5924025Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5924265Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5924499Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5924738Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5924992Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5925247Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5925480Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5925722Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5925959Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5926203Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5926435Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5926685Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5926917Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5927159Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5927391Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5927607Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5927817Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.5928058Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5928293Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5928534Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5928773Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5929016Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5929246Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5929509Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5929753Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5929994Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5930259Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5930501Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5930736Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5930948Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5931163Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5931395Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5931609Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5931832Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5932052Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5932294Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5932526Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5932737Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5932940Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5933172Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5933382Z E1204 11:05:31.918000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5933585Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5933848Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5934101Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5934334Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5934575Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5934807Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5935019Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5935242Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5935456Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5935713Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5935947Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5936165Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5936379Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5936594Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5936838Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5937072Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5937316Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5937551Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5937793Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5938027Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5938289Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5938533Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5938775Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5939009Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5939224Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5939430Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5939670Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5939887Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5940146Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5940364Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5940607Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5940844Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5941090Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5941325Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5941569Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5941803Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5942048Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5942283Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5942524Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5942789Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5943020Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5943234Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5943440Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5943664Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5943881Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:20.5944124Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5944358Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5944588Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5944800Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5945017Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5945261Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5945496Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5945739Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5945972Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5946215Z E1204 11:05:31.918000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5946450Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5946693Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5946928Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5947191Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5947439Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5947679Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5947915Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5948158Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5948393Z E1204 11:05:31.918000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5948635Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5948880Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5949123Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5949359Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5949604Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5949837Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5950052Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5950295Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5950531Z E1204 11:05:31.918000 765856 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5950774Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5951011Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5951253Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5951487Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5951762Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5952008Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5952250Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5952485Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5952729Z E1204 11:05:31.918000 765856 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5952963Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5953169Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5953416Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5953659Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5953897Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5954140Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5954376Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5954604Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5954822Z E1204 11:05:31.918000 765856 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5955035Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5955251Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5955493Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5955730Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5955961Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5956198Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5956423Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5956638Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5956880Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5957114Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5957333Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5957546Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5957766Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5957930Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.5958165Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5958372Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5958607Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5958852Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5959087Z E1204 11:05:31.918000 765856 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5959301Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5959507Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5959742Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5959955Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5960205Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5960464Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5960682Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5960929Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5961135Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5961368Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5961574Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5961810Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5962052Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5962299Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5962540Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5962775Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5963017Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5963251Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5963493Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5963726Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5963971Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5964207Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5964540Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5964743Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5964994Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5965232Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5965463Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:20.5965679Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5965891Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5966137Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5966371Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5966612Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5966855Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5967059Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5967294Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5967535Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5967769Z E1204 11:05:31.918000 765856 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5968011Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5968245Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5968475Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5968695Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5968909Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5969116Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.5969321Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5969575Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5969827Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5970063Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5970346Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5970583Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5970789Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5971024Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5971281Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5971519Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5971762Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5971999Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5972203Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5972438Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5972683Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5972929Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5973171Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5973406Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5973632Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5973867Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5974095Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5974326Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5974570Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5974804Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5975009Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5975243Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5975485Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5975735Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5975977Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5976212Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5976442Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5976659Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5976874Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5977092Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5977337Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5977574Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5977814Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5978049Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5978291Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5978544Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5978762Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5978998Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5979243Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5979479Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5979724Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5979958Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5980235Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.5980451Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.5980668Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.5980884Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5981125Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5981359Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5981601Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5981836Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5982078Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5982313Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5982554Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5982803Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5983079Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5983313Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5983526Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.5983747Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.5983954Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.5984166Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.5984394Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.5984628Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.5984841Z E1204 
11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.5985048Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.5985258Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.5985445Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.5985589Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.5985749Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.5985869Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.5986012Z E1204 11:05:31.918000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.5986185Z [W1204 11:05:31.397170522 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.5986188Z 2025-12-04T12:10:20.5986348Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.5986659Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.5986966Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.5987120Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.5987630Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler<std::shared_ptr<c10::LazyValue<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > const> (), c10::SetStackTraceFetcher(std::function<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.5987901Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) from ??:0 2025-12-04T12:10:20.5988141Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.5988363Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.5988577Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5988819Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.5989064Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5989310Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5989548Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5989789Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5990022Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5990304Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5990538Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5990780Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5991013Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5991225Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5991448Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5991687Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5991949Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5992183Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5992394Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5992631Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5992875Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5993111Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5993315Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5993560Z E1204 
11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5993772Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5993975Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5994210Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5994420Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5994621Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.5994854Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5995098Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5995334Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5995576Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5995808Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5996031Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.5996261Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.5996487Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.5996728Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5996961Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5997203Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5997438Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5997679Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5997923Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5998165Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5998399Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5998640Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5998874Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5999115Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5999348Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.5999589Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.5999823Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6000065Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6000337Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6000605Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6000850Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6001091Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6001326Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6001570Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6001803Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6002024Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6002235Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.6002489Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6002722Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6002963Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6003198Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6003437Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6003668Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6003912Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6004146Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6004386Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6004618Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6004861Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6005114Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6005340Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6005543Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6005775Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6005986Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6006209Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.6006425Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6006666Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6006911Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6007123Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6007326Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6007560Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6007770Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6007971Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6008202Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6008447Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6008680Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6008921Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6009153Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6009376Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6009623Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6009838Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6010084Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6010352Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6010570Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6010789Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6011005Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6011261Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6011495Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6011740Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6011976Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6012216Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6012450Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6012694Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6012930Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6013175Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6013411Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6013624Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6013841Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6014101Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6014317Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6014531Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6014746Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6014991Z 
E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6015228Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6015469Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6015715Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6015955Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6016191Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6016434Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6016668Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6016910Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6017145Z 
E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6017364Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6017580Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6017788Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6018015Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6018239Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6018500Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6018734Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6018952Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6019167Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6019384Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6019628Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6019861Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6020145Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6020378Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6020622Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6020856Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6021097Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6021332Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6021575Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6021811Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6022051Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6022288Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6022528Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6022789Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6023044Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6023278Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6023524Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6023757Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6024001Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6024235Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6024463Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6024667Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6024900Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6025145Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6025380Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6025623Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6025858Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6026101Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6026335Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6026575Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6026812Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6027053Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6027316Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6027533Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6027768Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6028011Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6028244Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6028487Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6028720Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6028959Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6029177Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6029390Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6029606Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6029849Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6030087Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6030344Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6030562Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6030777Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6030993Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6031236Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6031470Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6031718Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6031943Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.6032151Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6032315Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.6032552Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6032760Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6032994Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6033237Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6033484Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6033698Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6033904Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6034140Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6034351Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6034556Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6034792Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6035000Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6035237Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6035442Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6035677Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6035881Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6036138Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6036390Z E1204 11:05:31.936000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6036625Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6036868Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6037104Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6037348Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6037582Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6037835Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6038069Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6038312Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6038548Z E1204 11:05:31.936000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6038764Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6038968Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6039203Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6039432Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6039651Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6039866Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6040082Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6040370Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6040633Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6040890Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6041124Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6041329Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6041563Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6041807Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6042042Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6042306Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6042539Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6042767Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6042985Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6043197Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6043404Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6043609Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6043845Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6044087Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6044324Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6044566Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6044798Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.6045025Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6045271Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6045513Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6045749Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6045992Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6046228Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6046431Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6046676Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6046919Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6047153Z E1204 11:05:31.936000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6047395Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6047629Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6047859Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6048075Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6048295Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6048511Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6048753Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6048988Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6049192Z E1204 11:05:31.936000 765856 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6049448Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6049701Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6049938Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6050216Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6050450Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6050680Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6050896Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6051125Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6051339Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6051584Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6051819Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6052060Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6052299Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6052541Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6052779Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6052984Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6053217Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6053459Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6053692Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6053959Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6054204Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6054433Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6054649Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6054863Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6055082Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6055323Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6055575Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6055815Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6056052Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6056294Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6056529Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6056771Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6057004Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6057247Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6057481Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6057693Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.6057908Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.6058122Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.6058342Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.6058589Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.6058812Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.6059026Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.6059235Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.6059443Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.6059629Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.6059781Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.6059940Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.6060061Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.6060238Z E1204 11:05:31.936000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.6060308Z ('RERUN', {'yellow': True}) [1.8291s] [100%] 2025-12-04T12:10:20.6060651Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:05:33.098331752 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.6060653Z 2025-12-04T12:10:20.6060813Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.6061122Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.6061430Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.6061577Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.6062067Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.6062335Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.6062593Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.6062836Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.6063054Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6063297Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6063532Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6063776Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6064010Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6064251Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6064496Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6066799Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6067047Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6067293Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6067528Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6067741Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6067966Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6068184Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6068428Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6068661Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6068868Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6069119Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6069384Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6069620Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6069827Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6070062Z E1204 
11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6070325Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6070531Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6070763Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6070993Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6071195Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6071430Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6071673Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6071905Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6072148Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6072380Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6072594Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6072818Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6073032Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6073275Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6073507Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6073777Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6074020Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6074262Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6074495Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6074737Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6074973Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6075215Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6075460Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6075701Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6075936Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6076178Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6076409Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6076651Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6076883Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6077126Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6077359Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6077602Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6077837Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6078102Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6078347Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6078565Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6078778Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.6079017Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6079251Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6079495Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6079727Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6079976Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6080238Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6080481Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6080716Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6080960Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6081199Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6081439Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6081672Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6081884Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6082088Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6082327Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6082562Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6082800Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.6083013Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6083256Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6083488Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6083700Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6083905Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6084138Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6084361Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6084566Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6084804Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6085051Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6085283Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6085524Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6085755Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6085970Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6086195Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6086411Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6086655Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6086890Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6087130Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6087353Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6087570Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6087812Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6088049Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6088293Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6088532Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6088787Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6089020Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6089269Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6089503Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6089744Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6089979Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6090231Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6090439Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6090678Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6090897Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6091110Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6091325Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6091597Z 
E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6091845Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6092088Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6092323Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6092569Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6092804Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6093048Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6093300Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6093542Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6093780Z 
E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6093999Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6094213Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6094419Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6094646Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6094862Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6095104Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6095339Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6095555Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6095770Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6096009Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6096264Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6096499Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6096742Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6096977Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6097224Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6097459Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6097710Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6097944Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6098187Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6098424Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6098665Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6098900Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6099142Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6099377Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6099620Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6099860Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6100131Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6100394Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6100648Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6100884Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6101098Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6101309Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6101545Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6101787Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6102021Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6102274Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6102509Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6102751Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6102985Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6103228Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6103463Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6103707Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6103941Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6104146Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6104380Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6104623Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6104877Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6105130Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6105365Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6105592Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6105809Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6106024Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6106242Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6106500Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6106733Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6106961Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6107178Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6107394Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6107608Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6107851Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6108086Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6108304Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6108517Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.6108723Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6108887Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.6109133Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6109347Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6109595Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6109838Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6110074Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6110330Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6110536Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6110770Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6111004Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6111208Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6111445Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6111651Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6111887Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6112093Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6112328Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6112538Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6112774Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6113015Z E1204 11:05:33.637000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6113250Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6113491Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6113749Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6114002Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6114237Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6114481Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6114719Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6114966Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6115199Z E1204 11:05:33.637000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6115422Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6115626Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6115862Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6116091Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6116307Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6116523Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6116738Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6116983Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6117220Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6117463Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6117698Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6117902Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6118159Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6118413Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6118649Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6118889Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6119126Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6119355Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6119573Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6119800Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6120005Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6120256Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6120491Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6120735Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6120971Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6121212Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6121448Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.6121653Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6121888Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6122130Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6122364Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6122637Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6122883Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6123088Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6123321Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6123564Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6123800Z E1204 11:05:33.637000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6124041Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6124288Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6124515Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6124733Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6124952Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6125168Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6125412Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6125645Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6125851Z E1204 11:05:33.637000 765856 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6126085Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6126327Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6126562Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6126805Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6127066Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6127313Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6127533Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6127746Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6127961Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6128206Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6128445Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6128699Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6128932Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6129175Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6129410Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6129615Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6129850Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6130136Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6130372Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6130614Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6130848Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6131078Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6131295Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6131534Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6131762Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6132005Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6132241Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6132486Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6132723Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6132966Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6133212Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6133455Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6133692Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6133933Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6134168Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6134383Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.6134602Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.6134808Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.6135020Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.6135250Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.6135472Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.6135684Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.6135910Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.6136127Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.6136314Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.6136457Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.6136618Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.6136739Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.6136883Z E1204 11:05:33.637000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.6137057Z [W1204 11:05:33.113666792 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.6137059Z 2025-12-04T12:10:20.6137220Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.6137541Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.6137850Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.6137997Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.6138494Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.6138766Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.6139005Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.6139228Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.6139443Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6139685Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6139921Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6140195Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6140456Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6140710Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6140944Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6141184Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6141420Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6141661Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6141895Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6142120Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6142342Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6142560Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6142803Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6143035Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6143242Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6143474Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6143716Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6143949Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6144156Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6144389Z E1204 
11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6144603Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6144848Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6145094Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6145306Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6145507Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6145740Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6145983Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6146216Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6146466Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6146698Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6146910Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6147138Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6147354Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6147595Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6147828Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6148069Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6148301Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6148542Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6148775Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6149017Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6149269Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6149519Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6149754Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6149995Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6150279Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6150526Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6150761Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6151016Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6151248Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6151490Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6151724Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6151964Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6152198Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6152439Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6152674Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6152891Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6153107Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.6153347Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6153594Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6153856Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6154090Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6154333Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6154572Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6154816Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6155049Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6155291Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6155531Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6155772Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6156007Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6156220Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6156424Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6156655Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6156868Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6157093Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.6157310Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6157554Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6157787Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6158010Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6158224Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6158467Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6158678Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6158880Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6159114Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6159356Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6159589Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6159839Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6160072Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6160319Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6160543Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6160758Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6161005Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6161240Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6161459Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6161673Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6161888Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6162132Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6162366Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6162642Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6162889Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6163135Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6163370Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6163612Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6163848Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6164091Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6164338Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6164553Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6164759Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6164994Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6165211Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6165424Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6165639Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6165883Z 
E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6166119Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6166360Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6166597Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6166844Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6167100Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6167352Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6167587Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6167828Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6168063Z 
E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6168361Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6168575Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6168795Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6169021Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6169241Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6169484Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6169718Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6169937Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6170198Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6170415Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6170658Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6170894Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6171136Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6171370Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6171636Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6171884Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6172125Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6172363Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6172606Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6172842Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6173083Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6173330Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6173571Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6173807Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6174054Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6174288Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6174531Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6174768Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6175013Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6175249Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6175463Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6175667Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6175911Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6176172Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6176406Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6176648Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6176882Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6177125Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6177365Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6177607Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6177865Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6178108Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6178344Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6178551Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6178786Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6179027Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6179261Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6179504Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6179741Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6179973Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6180239Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6180481Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6180709Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6180951Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6181186Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6181414Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6181631Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6181845Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6182060Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6182315Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6182551Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6182770Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6182983Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.6183193Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6183358Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.6183592Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6183799Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6184033Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6184275Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6184513Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6184725Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6184950Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6185195Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6185409Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6185613Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6185849Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6186055Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6186289Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6186506Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6186745Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6186953Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6187190Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6187436Z E1204 11:05:33.652000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6187671Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6187913Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6188149Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6188392Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6188627Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6188869Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6189104Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6189378Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6189622Z E1204 11:05:33.652000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6189837Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6190041Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6190321Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6190550Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6190771Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6191003Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6191217Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6191461Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6191698Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6191942Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6192177Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6192382Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6192619Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6192862Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6193101Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6193345Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6193578Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6193830Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6194066Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6194280Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6194487Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6194692Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6194928Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6195172Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6195421Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6195663Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6195899Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.6196103Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6196338Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6196582Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6196817Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6197060Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6197296Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6197500Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6197739Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6197984Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6198238Z E1204 11:05:33.652000 
765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6198490Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6198725Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6198953Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6199171Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6199387Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6199604Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6199857Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6200129Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6200336Z E1204 11:05:33.652000 765856 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6200572Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6200817Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6201053Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6201295Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6201530Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6201759Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6201977Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6202191Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6202407Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6202677Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6202930Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6203173Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6203410Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6203652Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6203887Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6204094Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6204340Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6204583Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6204817Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6205061Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6205297Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6205526Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6205745Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6205960Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6206186Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6206429Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6206665Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6206907Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6207161Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6207413Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6207649Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6207893Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6208127Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6208370Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6208607Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6208830Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.6209047Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.6209252Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.6209463Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.6209692Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.6209917Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.6210173Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.6210380Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.6210588Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.6210774Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.6210918Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.6211077Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.6211196Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.6211349Z E1204 11:05:33.652000 765856 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.6211407Z FAILED [1.0340s] [100%] 2025-12-04T12:10:20.6211410Z 2025-12-04T12:10:20.6211506Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6211678Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6211744Z Traceback (most recent call last): 2025-12-04T12:10:20.6211922Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6211984Z method(*args, **kwargs) 2025-12-04T12:10:20.6212150Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6212207Z method(*args, **kwargs) 2025-12-04T12:10:20.6212375Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6212432Z with policy(): 2025-12-04T12:10:20.6212600Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6212658Z raise RuntimeError(msg) 2025-12-04T12:10:20.6213068Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1048576000. 
2025-12-04T12:10:20.6213083Z 2025-12-04T12:10:20.6213180Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6213454Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6213458Z 2025-12-04T12:10:20.6213563Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6213658Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6213718Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6213794Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6214367Z inductor [('triton_bundler_save_kernel', 136), ('generated_module_cache_miss', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 15), ('select_algorithm_num_precompiles', 14), ('select_algorithm_num_precompilation_exceptions', 2), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6214486Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6214542Z graph_break [] 2025-12-04T12:10:20.6214623Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6214715Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6215219Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6215285Z current_size = base.storage().size() 2025-12-04T12:10:20.6215343Z Autotune Choices Stats: 2025-12-04T12:10:20.6215735Z {"num_choices": 15, "num_triton_choices": 14, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:20.6215825Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6215903Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6216051Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6216304Z triton_mm_15 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6216550Z triton_mm_7 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6216788Z triton_mm_5 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6217030Z triton_mm_11 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6217268Z triton_mm_9 0.0071 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6217519Z triton_mm_13 0.0075 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6217753Z triton_mm_6 0.0076 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6217993Z triton_mm_1 0.0080 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6218053Z _scaled_mm 0.0084 ms 73.2% 2025-12-04T12:10:20.6218292Z triton_mm_14 0.0090 ms 67.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6218442Z SingleProcess AUTOTUNE benchmarking takes 0.0732 seconds and 7.8139 seconds precompiling for 15 choices 2025-12-04T12:10:20.6218603Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6218667Z Traceback (most recent call last): 2025-12-04T12:10:20.6218839Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6218898Z method(*args, **kwargs) 2025-12-04T12:10:20.6219066Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6219123Z method(*args, **kwargs) 
2025-12-04T12:10:20.6219288Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6219344Z with policy(): 2025-12-04T12:10:20.6219512Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6219571Z raise RuntimeError(msg) 2025-12-04T12:10:20.6219977Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1048576000 and is now 1109393408. 2025-12-04T12:10:20.6219999Z 2025-12-04T12:10:20.6220128Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6220416Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6220420Z 2025-12-04T12:10:20.6220525Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6220618Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6220677Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6220751Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6221321Z inductor [('triton_bundler_save_kernel', 136), ('generated_module_cache_miss', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 15), ('select_algorithm_num_precompiles', 14), ('select_algorithm_num_precompilation_exceptions', 2), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 
2025-12-04T12:10:20.6221441Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6221510Z graph_break [] 2025-12-04T12:10:20.6221589Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6221679Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6222184Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6222251Z current_size = base.storage().size() 2025-12-04T12:10:20.6222308Z Autotune Choices Stats: 2025-12-04T12:10:20.6222693Z {"num_choices": 15, "num_triton_choices": 14, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:20.6222772Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6222837Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6222976Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6223225Z triton_mm_15 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6223466Z triton_mm_7 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6223704Z triton_mm_5 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6223943Z triton_mm_11 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6224181Z triton_mm_9 0.0071 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6224458Z triton_mm_13 0.0075 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6224695Z triton_mm_6 0.0076 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6224933Z triton_mm_1 0.0080 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6224993Z _scaled_mm 0.0084 ms 73.2% 2025-12-04T12:10:20.6225233Z triton_mm_14 0.0090 ms 67.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6225378Z SingleProcess AUTOTUNE benchmarking takes 0.0732 
seconds and 7.8139 seconds precompiling for 15 choices 2025-12-04T12:10:20.6225468Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6225527Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6225611Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6225726Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6226225Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6226282Z graph_break [] 2025-12-04T12:10:20.6226360Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6226451Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6226829Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.6226939Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.6226997Z Autotune Choices Stats: 2025-12-04T12:10:20.6227378Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_22", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.6227456Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6227521Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6227658Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6227903Z triton_mm_22 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6228141Z triton_mm_23 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6228402Z triton_mm_27 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6228655Z triton_mm_25 0.0066 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6228894Z triton_mm_18 0.0066 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6229131Z triton_mm_24 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6229374Z triton_mm_31 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6229612Z triton_mm_21 0.0075 ms 82.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6229863Z triton_mm_26 0.0078 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6230147Z triton_mm_28 0.0081 ms 76.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6230292Z SingleProcess AUTOTUNE benchmarking takes 0.1352 seconds and 0.4838 seconds precompiling for 17 choices 2025-12-04T12:10:20.6230364Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6230520Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6230583Z Traceback (most recent call last): 2025-12-04T12:10:20.6230755Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.6230815Z method(*args, **kwargs) 2025-12-04T12:10:20.6230984Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6231042Z method(*args, **kwargs) 2025-12-04T12:10:20.6231207Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6231263Z with policy(): 2025-12-04T12:10:20.6231432Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6231490Z raise RuntimeError(msg) 2025-12-04T12:10:20.6231892Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1170210816. 
2025-12-04T12:10:20.6231896Z 2025-12-04T12:10:20.6231988Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6232260Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6232276Z 2025-12-04T12:10:20.6232379Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6232482Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6232541Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6232614Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6233187Z inductor [('triton_bundler_save_kernel', 136), ('generated_module_cache_miss', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 15), ('select_algorithm_num_precompiles', 14), ('select_algorithm_num_precompilation_exceptions', 2), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6233305Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6233360Z graph_break [] 2025-12-04T12:10:20.6233438Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6233527Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6234031Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6234109Z current_size = base.storage().size() 2025-12-04T12:10:20.6234166Z Autotune Choices Stats: 2025-12-04T12:10:20.6234547Z {"num_choices": 15, "num_triton_choices": 14, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:20.6234624Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6234691Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6234826Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6235075Z triton_mm_15 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6235315Z triton_mm_7 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6235551Z triton_mm_5 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6235790Z triton_mm_11 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6236032Z triton_mm_9 0.0071 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6236273Z triton_mm_13 0.0075 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6236514Z triton_mm_6 0.0076 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6236786Z triton_mm_1 0.0080 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6236859Z _scaled_mm 0.0084 ms 73.2% 2025-12-04T12:10:20.6237099Z triton_mm_14 0.0090 ms 67.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6237244Z SingleProcess AUTOTUNE benchmarking takes 0.0732 seconds and 7.8139 seconds precompiling for 15 choices 2025-12-04T12:10:20.6237332Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6237392Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6237465Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6237580Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6238079Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6238145Z graph_break [] 2025-12-04T12:10:20.6238221Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6238311Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6238687Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:20.6238795Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.6238854Z Autotune Choices Stats: 2025-12-04T12:10:20.6239235Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_22", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.6239313Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6239378Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6239514Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6239758Z triton_mm_22 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6240003Z triton_mm_23 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6240269Z triton_mm_27 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6240507Z triton_mm_25 0.0066 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6240761Z triton_mm_18 0.0066 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6241021Z triton_mm_24 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6241260Z triton_mm_31 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6241498Z triton_mm_21 0.0075 ms 82.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6241739Z triton_mm_26 0.0078 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6241979Z triton_mm_28 0.0081 ms 76.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6242125Z SingleProcess AUTOTUNE benchmarking takes 0.1352 seconds and 0.4838 seconds precompiling for 17 choices 2025-12-04T12:10:20.6242233Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6242292Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6242365Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6242479Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6242977Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6243032Z graph_break [] 2025-12-04T12:10:20.6243109Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6243198Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6243256Z Autotune Choices Stats: 2025-12-04T12:10:20.6243640Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_41", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.6243717Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6243782Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6243917Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, 
torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6244165Z triton_mm_41 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6244405Z triton_mm_38 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6244642Z triton_mm_39 0.0068 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6244901Z triton_mm_40 0.0068 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6245149Z triton_mm_43 0.0068 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6245389Z triton_mm_42 0.0069 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6245627Z triton_mm_45 0.0069 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6245866Z triton_mm_37 0.0070 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6246103Z triton_mm_34 0.0070 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6246350Z triton_mm_44 0.0075 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6246495Z SingleProcess AUTOTUNE benchmarking takes 0.1012 seconds and 0.4159 seconds precompiling for 17 choices 2025-12-04T12:10:20.6246702Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-41e927ba8746edb0.xml - 2025-12-04T12:10:20.6246780Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6247374Z FAILED [1.0340s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1170210816. 
2025-12-04T12:10:20.6247380Z 2025-12-04T12:10:20.6247469Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6247742Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6247745Z 2025-12-04T12:10:20.6247847Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6247926Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.6248013Z ================= 1 failed, 187 deselected, 2 rerun in 26.65s ================== 2025-12-04T12:10:20.6248068Z Got exit code 1 2025-12-04T12:10:20.6248125Z Retrying single test... 2025-12-04T12:10:20.6248286Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f34463a06d63a0de.xml 2025-12-04T12:10:20.6248359Z ============================= test session starts ============================== 2025-12-04T12:10:20.6248490Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6248547Z cachedir: .pytest_cache 2025-12-04T12:10:20.6248723Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6248797Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6248856Z configfile: pytest.ini 2025-12-04T12:10:20.6249049Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6249153Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.6249418Z stepcurrent: skipping 100 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6249480Z Running 1 items in this shard 2025-12-04T12:10:20.6249482Z 2025-12-04T12:10:20.6249824Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:05:45.085457173 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.6249827Z 2025-12-04T12:10:20.6250196Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.6250506Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.6250668Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.6251166Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.6251436Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.6251676Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from 
static_cuda_launcher.cpp:0 2025-12-04T12:10:20.6251904Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.6252121Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6252372Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6252611Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6252856Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6253094Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6253335Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6253594Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6253847Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 
2025-12-04T12:10:20.6254082Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6254325Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6254559Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6254803Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6255041Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6255293Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6255525Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6255732Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6255967Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6256208Z E1204 
11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6256442Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6256645Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6256879Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6257120Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6257354Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6257595Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6257837Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6258078Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.6258303Z E1204 11:05:53.015000 771024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.6258481Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.6258676Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.6259212Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp__50bfke/s4/cs4mld3v3vcyfd2cahnfnqkn3bbzysti74qv5d77ijrgevwcynat.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8) 2025-12-04T12:10:20.6259376Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.6259617Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.6259788Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.6260129Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.6260280Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.6260551Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.6260707Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.6260979Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.6261150Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.6261435Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.6261584Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.6261873Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.6262081Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.6262423Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.6262760Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.6262914Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.6263410Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.6263680Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.6263923Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.6264157Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.6264373Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6264615Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6264850Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6265094Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6265328Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6265571Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6265807Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6266048Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6266282Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6266525Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6266758Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6267019Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6267261Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6267502Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6267742Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6267947Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6268179Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6268422Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6268664Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6268867Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6269100Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.6269340Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6269572Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6269812Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6270046Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6270300Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.6270524Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.6270699Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.6270893Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.6271011Z E1204 11:05:53.015000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.6271183Z [W1204 11:05:53.488924338 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.6271198Z 2025-12-04T12:10:20.6271535Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.6271852Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.6271998Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.6272494Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.6272761Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.6273000Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.6273232Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.6273447Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6273691Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6273925Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6274167Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6274400Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6274643Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6274878Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6275117Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6275351Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6275590Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6275853Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6276102Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6276338Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6276578Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6276811Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6277018Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6277251Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6277509Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6277740Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6277946Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6278181Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6278423Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6278656Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6278895Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6279129Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6279345Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.6279570Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.6279745Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.6279938Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.6280532Z E1204 11:05:53.028000 771024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp__50bfke/oa/coagq4qmcfjfq7oqp54hgwy3jxjlmmo55426spsbvks43ck2iiom.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.6280693Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.6280923Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.6281093Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.6281395Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.6281545Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.6281815Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.6281984Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.6282257Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.6282432Z E1204 11:05:53.028000 771024 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.6282716Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.6282873Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.6283165Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.6283372Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.6283700Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.6284005Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.6284151Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.6284643Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.6284933Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.6285184Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.6285405Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.6285623Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6285863Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6286100Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6286343Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6286586Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6286831Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6287064Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6287310Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6287543Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6287784Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6288017Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6288258Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6288492Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6288733Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6288968Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6289183Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6289425Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6289676Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6289908Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6290151Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6290384Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.6290626Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6290859Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6291113Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6291349Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6291567Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.6291795Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.6291967Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.6292168Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.6292284Z E1204 11:05:53.028000 771024 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.6292354Z ('RERUN', {'yellow': True}) [12.5689s] [100%] 2025-12-04T12:10:20.6292697Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:05:54.738095090 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.6292700Z 2025-12-04T12:10:20.6292860Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.6293167Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.6293473Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.6293638Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.6294152Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.6294419Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.6294659Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.6294879Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] 
#7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.6295094Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6295336Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6295580Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6295821Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6296056Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6296299Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6296531Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6296772Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6297009Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6297250Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6297484Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6297697Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6297921Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6298150Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6298412Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6298645Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6298849Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6299082Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.6299323Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6299557Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6299759Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6300003Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6300250Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6300455Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6300692Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6300901Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6301105Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6301336Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6301581Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6301819Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6302061Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6302293Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6302516Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6302753Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6302980Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6303223Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6303454Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6303696Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6303930Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6304169Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6304414Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6304654Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6304892Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6305137Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6305372Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6305615Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6305846Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6306089Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6306323Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6306565Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6306803Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6307063Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6307308Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6307549Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6307786Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6308026Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6308262Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6308478Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6308688Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.6308946Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6309177Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6309420Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6309653Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6309895Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6310166Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6310408Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6310644Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6310886Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6311124Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6311366Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6311633Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6311857Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6312059Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6312296Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6312508Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6312732Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6312948Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6313189Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6313435Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6313646Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6313854Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6314086Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6314297Z E1204 11:05:54.287000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6314499Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6314735Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6314979Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6315213Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6315456Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6315689Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6315912Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6316146Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6316376Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6316623Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6316858Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6317079Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6317293Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6317509Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6317762Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6317997Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6318242Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6318475Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6318717Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6318952Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6319194Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6319432Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6319677Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6319912Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6320162Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6320381Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6320628Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6320858Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6321071Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6321290Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6321534Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6321772Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6322017Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6322263Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6322505Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6322741Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6322983Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6323217Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6323459Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6323693Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6323911Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6324129Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6324338Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6324561Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6324787Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 
2025-12-04T12:10:20.6325041Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6325297Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6325514Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6325728Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6325943Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6326193Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6326431Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6326683Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6326917Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6327162Z E1204 11:05:54.287000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6327397Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6327640Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6327875Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6328117Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6328352Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6328597Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6328833Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6329075Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6329328Z E1204 11:05:54.287000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6329578Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6329813Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6330055Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6330327Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6330570Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6330805Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6331040Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6331246Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6331483Z E1204 11:05:54.287000 771024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6331725Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6331963Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6332205Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6332441Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6332684Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6332918Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6333161Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6333397Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6333644Z E1204 11:05:54.287000 771024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6333908Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6334125Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6334360Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6334604Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6334839Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6335083Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6335320Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6335560Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6335775Z E1204 11:05:54.287000 771024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6335991Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6336207Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6336451Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6336691Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6336920Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6337139Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6337352Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6337567Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6337808Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6338043Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6338280Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6338503Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6338710Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6338874Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.6339110Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6339319Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6339555Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6339798Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6340044Z E1204 11:05:54.287000 771024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6340303Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6340511Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6340747Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6340958Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6341163Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6341397Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6341611Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6341850Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6342054Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6342289Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6342493Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6342763Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6343019Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6343255Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6343497Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6343731Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6343975Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6344210Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6344548Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6344786Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6345031Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6345267Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6345479Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6345684Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6345917Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6346146Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6346366Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 
2025-12-04T12:10:20.6346586Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6346802Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6347044Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6347302Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6347553Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6347788Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6347991Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6348227Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6348471Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6348707Z E1204 11:05:54.287000 771024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6348961Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6349193Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6349424Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6349641Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6349855Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6350062Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6350307Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6350545Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6350788Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6351024Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6351267Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6351507Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6351745Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6351991Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6352236Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6352470Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6352713Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6352949Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6353155Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6353405Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6353648Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6353888Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6354131Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6354366Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6354594Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6354811Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6355027Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from 
/usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6355242Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6355484Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6355719Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6355925Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6356181Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6356438Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6356677Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6356920Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6357155Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6357382Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6357600Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6357823Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6358039Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6358284Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6358520Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6360285Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6360526Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6360773Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6361015Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6361221Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6361457Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6361700Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6361936Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6362220Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6362472Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6362707Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6362924Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6363141Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6363356Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6363600Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6363846Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6364091Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6364333Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6364576Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6364811Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6365053Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6365288Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6365531Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6365767Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6365981Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.6366198Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.6366404Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.6366635Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.6366879Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.6367101Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.6367316Z E1204 
11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.6367527Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.6367737Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.6367926Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.6368068Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.6368242Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.6368362Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.6368504Z E1204 11:05:54.287000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.6368680Z [W1204 11:05:54.763455124 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.6368683Z 2025-12-04T12:10:20.6368844Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.6369155Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.6369464Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.6369609Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.6370144Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.6370413Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.6370655Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.6370888Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.6371117Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6371374Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6371611Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6371853Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6372088Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6372335Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6372569Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6372826Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6373058Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6373302Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6373537Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6373751Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6373973Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6374187Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6374431Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6374665Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6374870Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6375102Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6375355Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6375609Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6375815Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6376049Z E1204 
11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6376261Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6376466Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6376699Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6376910Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6377132Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6377364Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6377609Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6377843Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6378085Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6378318Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6378530Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6378756Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6378972Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6379214Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6379447Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6379688Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6379939Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6380233Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6380470Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6380710Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6380943Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6381185Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6381418Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6381670Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6381903Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6382151Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6382383Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6382625Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6382859Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6383101Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6383334Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6383577Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6383810Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6384050Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6384308Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6384537Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6384751Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.6384995Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6385228Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6385474Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6385707Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6385948Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6386191Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6386432Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6386666Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6386918Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6387151Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6387391Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6387625Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6387844Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6388049Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6388289Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6388500Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6388733Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.6388968Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6389212Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6389445Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6389656Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6389862Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6390130Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6390340Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6390561Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6390796Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6391038Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6391272Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6391517Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6391753Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6391965Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6392189Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6392406Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6392650Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6392892Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6393109Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6393349Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6393582Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6393828Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6394064Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6394306Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6394544Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6394787Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6395033Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6395278Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6395514Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6395757Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6395994Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6396210Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6396418Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6396661Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6396880Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6397094Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6397311Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6397554Z 
E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6397807Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6398060Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6398296Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6398540Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6398776Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6399021Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6399255Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6399508Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6399743Z 
E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6399961Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6400219Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6400426Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6400654Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6400872Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6401119Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6401356Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6401580Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6401797Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6402013Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6402282Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6402528Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6402773Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6403007Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6403252Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6403488Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6403729Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6403977Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6404219Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6404457Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6404700Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6404934Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6405177Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6405410Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6405656Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6405892Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6406134Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6406371Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6406642Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6406886Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6407100Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6407309Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6407544Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6407788Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6408024Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6408266Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6408512Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6408754Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6408991Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6409235Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6409470Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6409712Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6409948Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6410193Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6410430Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6410673Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6410908Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6411188Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6411441Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6411670Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6411888Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6412101Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6412319Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6412563Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6412813Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6413044Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6413260Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6413475Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6413691Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6413935Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6414170Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6414387Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6414601Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.6414808Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6414972Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.6415207Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6415414Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6415673Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6415925Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6416168Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6416381Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6416588Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6416824Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6417039Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6417255Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6417490Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6417696Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6417931Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6418136Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6418371Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6418578Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6418815Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6419058Z E1204 11:05:54.302000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6419294Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6419539Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6419774Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6420039Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6420341Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6420584Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6420817Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6421065Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6421298Z E1204 11:05:54.302000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6421514Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6421730Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6421966Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6422194Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6422413Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6422626Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6422843Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6423085Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6423324Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6423568Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6423803Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6424008Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6424244Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6424514Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6424758Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6425003Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6425238Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6425466Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6425690Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6425909Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6426126Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6426333Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6426571Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6426814Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6427051Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6427294Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6427528Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.6427735Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6427972Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6428215Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6428451Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6428693Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6428958Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6429176Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6429410Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6429653Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6429889Z E1204 11:05:54.302000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6430169Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6430405Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6430647Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6430870Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6431086Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6431303Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6431547Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6431782Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6431986Z E1204 11:05:54.302000 771024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6432221Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6432465Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6432699Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6432942Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6433177Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6433430Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6433663Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6433876Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6434093Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6434335Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6434572Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6434815Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6435058Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6435301Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6435541Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6435754Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6435988Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6436230Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6436464Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6436706Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6436941Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6437167Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6437385Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6437599Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6437834Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6438086Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6438322Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6438564Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6438798Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6439042Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6439277Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6439527Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6439762Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6440004Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6440281Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6440497Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.6440716Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.6440923Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.6441136Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.6441367Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.6441588Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.6441802Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.6442008Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.6442241Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.6442441Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.6442583Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.6442746Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.6442864Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.6443005Z E1204 11:05:54.302000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.6443074Z ('RERUN', {'yellow': True}) [1.5100s] [100%] 2025-12-04T12:10:20.6443423Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda [W1204 11:05:55.264828686 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.6443426Z 2025-12-04T12:10:20.6443585Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.6443912Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.6444219Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.6444366Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.6444863Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.6445133Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.6445375Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.6445598Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.6445813Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6446056Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6446291Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6446534Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6446785Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6447037Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6447272Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6447514Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6447748Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6447989Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6448221Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6448442Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6448666Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6448882Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6449123Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6449356Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6449561Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6449797Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6450039Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6450315Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6450520Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6450752Z E1204 
11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6450964Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6451196Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6451440Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6451653Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6451857Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6452092Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6452335Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6452570Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6452822Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6453054Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6453266Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6453492Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6453707Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6453948Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6454183Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6454427Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6454661Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6454900Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6455141Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6455383Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6455635Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6455887Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6456121Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6456362Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6456598Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6456842Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6457075Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6457325Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6457557Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6457799Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6458033Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6458274Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6458508Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6458749Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6458985Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6459203Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6459414Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.6459656Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6459898Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6460197Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6460448Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6460689Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6460926Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6461168Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6461405Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6461645Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6461892Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6462133Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6462366Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6462579Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6462781Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6463015Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6463226Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6463449Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.6463666Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6463909Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6464142Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6464365Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6464578Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6464821Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6465038Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6465240Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6465472Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6465716Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6465951Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6466202Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6466436Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6466648Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6466871Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6467085Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6467330Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6467564Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6467785Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6468000Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6468216Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6468463Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6468698Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6468962Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6469206Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6469450Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6469684Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6469929Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6470231Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6470477Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6470725Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6470939Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6471147Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6471383Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6471601Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6471820Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6472034Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6472278Z 
E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6472513Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6472757Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6472995Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6473237Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6473501Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6473754Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6473990Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6474232Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6474468Z 
E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6474693Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6474906Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6475128Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6475353Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6475569Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6475812Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6476048Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6476265Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6476477Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6476692Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6476934Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6477169Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6477412Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6477647Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6477916Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6478159Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6478402Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6478635Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6478879Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6479115Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6479357Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6479608Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6479850Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6480086Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6480359Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6480593Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6480836Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6481069Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6481317Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6481553Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6481769Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6481977Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6482227Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6482485Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6482730Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6482973Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6483209Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6483456Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6483689Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6483932Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6484179Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6484422Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6484666Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6484873Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6485108Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6485350Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6485585Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6485829Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6486062Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6486292Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6486509Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6486744Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6486971Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6487214Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6487449Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6487676Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6487895Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6488109Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6488324Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6488575Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6488812Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6489031Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6489244Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.6489454Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6489616Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.6489852Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6490058Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6490331Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6490576Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6490810Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6491024Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6491262Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6491510Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6491725Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6491930Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6492166Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6492372Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6492609Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6492825Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6493061Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6493264Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6493502Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6493747Z E1204 11:05:55.804000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6493981Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6494225Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6494465Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6494707Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6494941Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6495185Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6495420Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6495688Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6495931Z E1204 11:05:55.804000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6496145Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6496352Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6496586Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6496815Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6497033Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6497256Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6497471Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6497713Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6497949Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6498191Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6498426Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6498632Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6498868Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6499111Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6499347Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6499590Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6499823Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6500070Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6500330Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6500544Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6500753Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6500957Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6501195Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6501439Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6501674Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6501929Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6502163Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.6502370Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6502607Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6502849Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6503084Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6503327Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6503563Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6503770Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6504007Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6504248Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6504511Z E1204 11:05:55.804000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6504763Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6504998Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6505227Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6505445Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6505660Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6505876Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6506122Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6506368Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6506572Z E1204 11:05:55.804000 771024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6506810Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6507052Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6507289Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6507531Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6507767Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6507995Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6508211Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6508427Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6508643Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6508907Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6509152Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6509394Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6509630Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6509871Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6510146Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6510351Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6510606Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6510848Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6511084Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6511328Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6511564Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6511792Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6512009Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6512224Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6512438Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6512680Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6512918Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6513161Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6513422Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6513676Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6513911Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6514156Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6514391Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6514634Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6514868Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6515092Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.6515309Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.6515517Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.6515728Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.6515958Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.6516180Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.6516393Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.6516601Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.6516808Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.6516995Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.6517137Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.6517299Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.6517418Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.6517569Z E1204 11:05:55.804000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.6517751Z [W1204 11:05:55.279304095 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.6517754Z 2025-12-04T12:10:20.6517923Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.6518232Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.6518539Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.6518687Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.6519181Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.6519457Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.6519696Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.6519918Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.6520168Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6520410Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6520647Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6520890Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6521198Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6521441Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6521672Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6521915Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6522148Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6522422Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6522669Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6522883Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6523106Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6523321Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6523564Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6523796Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6524012Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6524250Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6524492Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6524725Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6524928Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6525163Z E1204 
11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6525374Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6525578Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6525815Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6526026Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6526230Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6526465Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6526729Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6526980Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6527222Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6527457Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6527669Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6527894Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6528109Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6528361Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6528593Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6528835Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6529071Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6529312Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6529545Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6529785Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6530020Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6530299Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6530531Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6530773Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6531004Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6531271Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6531516Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6531760Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6531993Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6532234Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6532538Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6532780Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6533028Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6533269Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6533503Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6533720Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6533930Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.6534176Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6534408Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6534650Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6534883Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6535127Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6535360Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6535600Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6535854Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6536106Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6536340Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6536579Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6536813Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6537025Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6537227Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6537473Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6537684Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6537909Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.6538124Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6538365Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6538599Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6538810Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6539016Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6539249Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6539461Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6539663Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6539898Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6540203Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6540448Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6540690Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6540923Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6541134Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6541357Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6541572Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6541831Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6542065Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6542285Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6542500Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6542716Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6542959Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6543195Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6543439Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6543673Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6543916Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6544152Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6544398Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6544663Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6544921Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6545159Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6545373Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6545579Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6545815Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6546035Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6546260Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6546474Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6546718Z 
E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6546952Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6547195Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6547430Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6547676Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6547914Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6548157Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6548392Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6548635Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6548871Z 
E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6549109Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6549337Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6549545Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6549770Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.6549987Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6550270Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6550505Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6550735Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6550949Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6551165Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6551407Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6551644Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6551886Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6552121Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6552364Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6552601Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6552845Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6553083Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6553325Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6553585Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6553839Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6554074Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6554317Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6554554Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6554798Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6555033Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6555285Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6555522Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6555764Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6555998Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6556212Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6556418Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6556652Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6556897Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6557135Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6557377Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6557613Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6557867Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6558111Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6558365Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6558601Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6558843Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6559080Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6559288Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6559524Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6559775Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6560011Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6560295Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6560530Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6560759Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6560976Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6561190Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6561406Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6561651Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6561889Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6562117Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6562360Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6562585Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6562813Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6563056Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6563291Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6563510Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6563727Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.6563935Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6564108Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.6564345Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6564552Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6564790Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6565032Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6565268Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6565482Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6565688Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6565924Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6566136Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6566343Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6566578Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6566808Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6567053Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6567257Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6567492Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6567696Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6567934Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6568177Z E1204 11:05:55.818000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6568411Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6568664Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6568899Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6569147Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6569380Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6569623Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6569856Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6570141Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6570377Z E1204 11:05:55.818000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6570590Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.6570796Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6571030Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6571284Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6571516Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6571732Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6571949Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6572191Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6572428Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6572669Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6572903Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6573123Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6573356Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6573605Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6573841Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6574085Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6574319Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6574548Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6574767Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6574981Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6575189Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.6575392Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6575640Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6575903Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6576141Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6576385Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6576624Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.6576831Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6577066Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6577308Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6577557Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6577799Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6578034Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6578241Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6578479Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6578721Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6578956Z E1204 11:05:55.818000 
771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6579199Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6579434Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6579663Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6579880Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6580154Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6580383Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6580628Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6580864Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6581070Z E1204 11:05:55.818000 771024 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6581308Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6581551Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6581786Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6582039Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6582273Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6582502Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6582721Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6582938Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6583154Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6583397Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6583631Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6583875Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6584109Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6584351Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6584608Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6584823Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.6585058Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6585303Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6585539Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6585782Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6586018Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6586256Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.6586472Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.6586690Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.6586908Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.6587151Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.6587386Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6587629Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6587868Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6588113Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6588349Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6588591Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6588825Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6589087Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.6589331Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.6589546Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.6589764Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.6589971Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.6590222Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.6590455Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.6590676Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.6590900Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.6591107Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.6591315Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.6591505Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.6591645Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.6591806Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.6591924Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.6592066Z E1204 11:05:55.818000 771024 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.6592124Z FAILED [1.7315s] [100%] 2025-12-04T12:10:20.6592127Z 2025-12-04T12:10:20.6592201Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6592359Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6592426Z Traceback (most recent call last): 2025-12-04T12:10:20.6592606Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6592666Z method(*args, **kwargs) 2025-12-04T12:10:20.6592834Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6592891Z method(*args, **kwargs) 2025-12-04T12:10:20.6593058Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6593128Z with policy(): 2025-12-04T12:10:20.6593298Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6593366Z raise RuntimeError(msg) 2025-12-04T12:10:20.6593792Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1048576000. 
2025-12-04T12:10:20.6593795Z 2025-12-04T12:10:20.6593890Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6594166Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6594169Z 2025-12-04T12:10:20.6594274Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6594370Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6594432Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6594508Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6595082Z inductor [('triton_bundler_save_kernel', 136), ('generated_module_cache_miss', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 15), ('select_algorithm_num_precompiles', 14), ('select_algorithm_num_precompilation_exceptions', 2), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6595209Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6595266Z graph_break [] 2025-12-04T12:10:20.6595347Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6595440Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6595949Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6596015Z current_size = base.storage().size() 2025-12-04T12:10:20.6596073Z Autotune Choices Stats: 2025-12-04T12:10:20.6596463Z {"num_choices": 15, "num_triton_choices": 14, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:20.6596546Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6596613Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6596752Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6597006Z triton_mm_15 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6597252Z triton_mm_10 0.0067 ms 94.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6597490Z triton_mm_2 0.0068 ms 91.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6597756Z triton_mm_6 0.0069 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6598010Z triton_mm_13 0.0070 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6598249Z triton_mm_11 0.0071 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6598486Z triton_mm_5 0.0073 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6598723Z triton_mm_7 0.0074 ms 84.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6598965Z triton_mm_9 0.0074 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6599212Z triton_mm_8 0.0077 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6599362Z SingleProcess AUTOTUNE benchmarking takes 0.0737 seconds and 7.7715 seconds precompiling for 15 choices 2025-12-04T12:10:20.6599521Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6599585Z Traceback (most recent call last): 2025-12-04T12:10:20.6599762Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6599819Z method(*args, **kwargs) 
2025-12-04T12:10:20.6599989Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6600045Z method(*args, **kwargs) 2025-12-04T12:10:20.6600237Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6600292Z with policy(): 2025-12-04T12:10:20.6600459Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6600516Z raise RuntimeError(msg) 2025-12-04T12:10:20.6600920Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1048576000 and is now 1109393408. 2025-12-04T12:10:20.6600924Z 2025-12-04T12:10:20.6601016Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6601288Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6601292Z 2025-12-04T12:10:20.6601394Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6601485Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6601546Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6601619Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6602227Z inductor [('triton_bundler_save_kernel', 136), ('generated_module_cache_miss', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 15), ('select_algorithm_num_precompiles', 14), ('select_algorithm_num_precompilation_exceptions', 2), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6602343Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6602400Z graph_break [] 2025-12-04T12:10:20.6602477Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6602570Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6603072Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6603138Z current_size = base.storage().size() 2025-12-04T12:10:20.6603196Z Autotune Choices Stats: 2025-12-04T12:10:20.6603580Z {"num_choices": 15, "num_triton_choices": 14, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:20.6603674Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6603738Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6603877Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6604125Z triton_mm_15 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=4 2025-12-04T12:10:20.6604370Z triton_mm_10 0.0067 ms 94.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6604608Z triton_mm_2 0.0068 ms 91.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6604844Z triton_mm_6 0.0069 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6605086Z triton_mm_13 0.0070 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6605324Z triton_mm_11 0.0071 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6605563Z triton_mm_5 0.0073 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6605798Z triton_mm_7 0.0074 ms 84.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6606057Z triton_mm_9 0.0074 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6606304Z triton_mm_8 0.0077 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6606451Z SingleProcess AUTOTUNE benchmarking takes 0.0737 seconds and 7.7715 seconds precompiling for 15 choices 2025-12-04T12:10:20.6606542Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6606600Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6606675Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6606791Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6607294Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6607348Z graph_break [] 2025-12-04T12:10:20.6607428Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6607529Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6607910Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.6608020Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.6608077Z Autotune Choices Stats: 2025-12-04T12:10:20.6608457Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_23", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:20.6608536Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6608603Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6608740Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6608986Z triton_mm_23 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6609230Z triton_mm_25 0.0062 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6609472Z triton_mm_28 0.0065 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6609713Z triton_mm_18 0.0067 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6609948Z triton_mm_21 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6610250Z triton_mm_26 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6610498Z triton_mm_27 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6610736Z triton_mm_24 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6610973Z triton_mm_31 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6611212Z triton_mm_29 0.0071 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6611359Z SingleProcess AUTOTUNE benchmarking takes 0.1144 seconds and 0.4676 seconds precompiling for 17 choices 2025-12-04T12:10:20.6611428Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6611599Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6611662Z Traceback (most recent call last): 2025-12-04T12:10:20.6611836Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.6611893Z method(*args, **kwargs) 2025-12-04T12:10:20.6612062Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6612119Z method(*args, **kwargs) 2025-12-04T12:10:20.6612287Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6612340Z with policy(): 2025-12-04T12:10:20.6612509Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6612567Z raise RuntimeError(msg) 2025-12-04T12:10:20.6612970Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1170210816. 
2025-12-04T12:10:20.6612973Z 2025-12-04T12:10:20.6613066Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6613338Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6613341Z 2025-12-04T12:10:20.6613449Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6613538Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6613598Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6613672Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6614239Z inductor [('triton_bundler_save_kernel', 136), ('generated_module_cache_miss', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 15), ('select_algorithm_num_precompiles', 14), ('select_algorithm_num_precompilation_exceptions', 2), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6614378Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6614432Z graph_break [] 2025-12-04T12:10:20.6614510Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6614608Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6615111Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6615176Z current_size = base.storage().size() 2025-12-04T12:10:20.6615234Z Autotune Choices Stats: 2025-12-04T12:10:20.6615617Z {"num_choices": 15, "num_triton_choices": 14, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:20.6615697Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6615762Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6615912Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6616160Z triton_mm_15 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6616401Z triton_mm_10 0.0067 ms 94.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6616642Z triton_mm_2 0.0068 ms 91.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6616879Z triton_mm_6 0.0069 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6617119Z triton_mm_13 0.0070 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6617356Z triton_mm_11 0.0071 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6617596Z triton_mm_5 0.0073 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6617834Z triton_mm_7 0.0074 ms 84.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6618072Z triton_mm_9 0.0074 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6618307Z triton_mm_8 0.0077 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6618473Z SingleProcess AUTOTUNE benchmarking takes 0.0737 seconds and 7.7715 seconds precompiling for 15 choices 2025-12-04T12:10:20.6618575Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6618634Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6618708Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6618823Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6619322Z inductor 
[('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6619379Z graph_break [] 2025-12-04T12:10:20.6619457Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6619547Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6619928Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:20.6620046Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.6620140Z Autotune Choices Stats: 2025-12-04T12:10:20.6620518Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_23", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:20.6620598Z AUTOTUNE scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6620665Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6620804Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6621048Z triton_mm_23 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:20.6621289Z triton_mm_25 0.0062 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6621527Z triton_mm_28 0.0065 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6621767Z triton_mm_18 0.0067 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6622003Z triton_mm_21 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6622243Z triton_mm_26 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6622480Z triton_mm_27 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6622756Z triton_mm_24 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6622996Z triton_mm_31 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6623236Z triton_mm_29 0.0071 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6623381Z SingleProcess AUTOTUNE benchmarking takes 0.1144 seconds and 0.4676 seconds precompiling for 17 choices 2025-12-04T12:10:20.6623471Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6623532Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6623605Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6623721Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6624219Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6624287Z graph_break [] 2025-12-04T12:10:20.6624365Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:20.6624454Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6624511Z Autotune Choices Stats: 2025-12-04T12:10:20.6624885Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_37", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:20.6624961Z AUTOTUNE 
scaled_mm(33x32, 32x2048, 33x1, 1x2048, 2048) 2025-12-04T12:10:20.6625025Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6625160Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6625405Z triton_mm_37 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6625648Z triton_mm_47 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6625889Z triton_mm_44 0.0067 ms 93.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6626126Z triton_mm_43 0.0068 ms 92.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6626364Z triton_mm_45 0.0070 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6626634Z triton_mm_34 0.0070 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6626882Z triton_mm_39 0.0073 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6627120Z triton_mm_38 0.0074 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6627355Z triton_mm_40 0.0074 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6627596Z triton_mm_41 0.0076 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6627739Z SingleProcess AUTOTUNE benchmarking takes 0.0985 seconds and 0.5091 seconds precompiling for 17 choices 2025-12-04T12:10:20.6627946Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f34463a06d63a0de.xml - 2025-12-04T12:10:20.6628033Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6628624Z FAILED [1.7315s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1170210816. 
2025-12-04T12:10:20.6628629Z 2025-12-04T12:10:20.6628719Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6628990Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6628993Z 2025-12-04T12:10:20.6629096Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6629174Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.6629264Z ================= 1 failed, 187 deselected, 2 rerun in 15.83s ================== 2025-12-04T12:10:20.6629318Z Got exit code 1 2025-12-04T12:10:20.6629536Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6629679Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.6629842Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2f735ed63f420198.xml 2025-12-04T12:10:20.6629917Z ============================= test session starts ============================== 2025-12-04T12:10:20.6630046Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6630150Z cachedir: .pytest_cache 2025-12-04T12:10:20.6630324Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6630388Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6630446Z configfile: pytest.ini 2025-12-04T12:10:20.6630644Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6630738Z collecting ... 
collected 188 items / 101 deselected / 87 selected 2025-12-04T12:10:20.6630824Z stepcurrent: skipping 101 already run items. 2025-12-04T12:10:20.6630886Z Running 87 items in this shard 2025-12-04T12:10:20.6630888Z 2025-12-04T12:10:20.6631130Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [34.7875s] [ 1%] 2025-12-04T12:10:20.6631354Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7506s] [ 1%] 2025-12-04T12:10:20.6631555Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.7170s] [ 1%] 2025-12-04T12:10:20.6631559Z 2025-12-04T12:10:20.6631627Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6631784Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6631847Z Traceback (most recent call last): 2025-12-04T12:10:20.6632021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6632080Z method(*args, **kwargs) 2025-12-04T12:10:20.6632247Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6632317Z method(*args, **kwargs) 2025-12-04T12:10:20.6632481Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6632536Z with policy(): 2025-12-04T12:10:20.6632703Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6632762Z raise RuntimeError(msg) 2025-12-04T12:10:20.6633161Z 
RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1023410176. 2025-12-04T12:10:20.6633164Z 2025-12-04T12:10:20.6633255Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6633525Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6633527Z 2025-12-04T12:10:20.6633630Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6633721Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6633781Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6633855Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6634352Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6634467Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6634521Z graph_break [] 2025-12-04T12:10:20.6634601Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6634689Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6635199Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: 
TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6635285Z current_size = base.storage().size() 2025-12-04T12:10:20.6635344Z Autotune Choices Stats: 2025-12-04T12:10:20.6635730Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.6635805Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6635872Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6636008Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6636260Z triton_mm_3 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6636500Z triton_mm_2 0.0064 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6636749Z triton_mm_0 0.0080 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6636988Z triton_mm_1 0.0096 ms 65.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6637049Z _scaled_mm 0.0256 ms 24.3% 2025-12-04T12:10:20.6637193Z SingleProcess AUTOTUNE benchmarking takes 0.0294 seconds and 0.1831 seconds precompiling for 5 choices 2025-12-04T12:10:20.6637348Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6637411Z Traceback (most recent call last): 2025-12-04T12:10:20.6637582Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6637641Z method(*args, **kwargs) 2025-12-04T12:10:20.6637807Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6637864Z method(*args, **kwargs) 2025-12-04T12:10:20.6638029Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6638084Z with policy(): 2025-12-04T12:10:20.6638252Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6638310Z raise RuntimeError(msg) 2025-12-04T12:10:20.6638711Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1023410176 and is now 1059061760. 
2025-12-04T12:10:20.6638714Z 2025-12-04T12:10:20.6638805Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6639148Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6639163Z 2025-12-04T12:10:20.6639267Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6639367Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6639427Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6639501Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6640005Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6640163Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6640216Z graph_break [] 2025-12-04T12:10:20.6640297Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6640386Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6640885Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6640974Z current_size = base.storage().size() 2025-12-04T12:10:20.6641032Z Autotune Choices Stats: 2025-12-04T12:10:20.6641411Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.6641486Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6641554Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6641690Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6641939Z triton_mm_3 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6642183Z triton_mm_2 0.0064 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6642419Z triton_mm_0 0.0080 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6642659Z triton_mm_1 0.0096 ms 65.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6642719Z _scaled_mm 0.0256 ms 24.3% 2025-12-04T12:10:20.6642863Z SingleProcess 
AUTOTUNE benchmarking takes 0.0294 seconds and 0.1831 seconds precompiling for 5 choices 2025-12-04T12:10:20.6642954Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6643013Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6643086Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6643202Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6643707Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6643775Z graph_break [] 2025-12-04T12:10:20.6643863Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6643953Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6645635Z Autotune Choices Stats: 2025-12-04T12:10:20.6646017Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:20.6646091Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6646157Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6646294Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6646544Z triton_mm_7 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6646783Z triton_mm_6 0.0082 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6647039Z triton_mm_5 0.0100 ms 67.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6647279Z triton_mm_4 0.0117 ms 57.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6647336Z _scaled_mm 0.0278 ms 24.1% 2025-12-04T12:10:20.6647481Z SingleProcess AUTOTUNE benchmarking takes 0.0307 seconds and 0.1474 seconds precompiling for 5 choices 2025-12-04T12:10:20.6647551Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6647709Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6647771Z Traceback (most recent call last): 2025-12-04T12:10:20.6647950Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6648007Z method(*args, **kwargs) 2025-12-04T12:10:20.6648180Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6648238Z method(*args, **kwargs) 2025-12-04T12:10:20.6648407Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6648461Z with policy(): 
2025-12-04T12:10:20.6648629Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6648689Z raise RuntimeError(msg) 2025-12-04T12:10:20.6649088Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.6649091Z 2025-12-04T12:10:20.6649183Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6649476Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6649479Z 2025-12-04T12:10:20.6649585Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6649683Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6649744Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6649818Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6650360Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6650477Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6650531Z graph_break [] 2025-12-04T12:10:20.6650609Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 
2025-12-04T12:10:20.6650698Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6651197Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6651278Z current_size = base.storage().size() 2025-12-04T12:10:20.6651337Z Autotune Choices Stats: 2025-12-04T12:10:20.6651717Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.6651791Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6651858Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6651995Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6652244Z triton_mm_3 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6652482Z triton_mm_2 0.0064 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6652720Z triton_mm_0 0.0080 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6652959Z triton_mm_1 0.0096 ms 65.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6653019Z _scaled_mm 0.0256 ms 24.3% 2025-12-04T12:10:20.6653161Z SingleProcess AUTOTUNE benchmarking takes 0.0294 seconds and 0.1831 seconds precompiling for 5 choices 2025-12-04T12:10:20.6653252Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6653312Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6653386Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6653519Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6654034Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6654091Z graph_break [] 2025-12-04T12:10:20.6654167Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6654259Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6654316Z Autotune Choices Stats: 2025-12-04T12:10:20.6654692Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006719999946653843, "best_triton_pos": 
0} 2025-12-04T12:10:20.6654765Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6654831Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6654965Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6655213Z triton_mm_7 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6655461Z triton_mm_6 0.0082 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6655701Z triton_mm_5 0.0100 ms 67.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6655937Z triton_mm_4 0.0117 ms 57.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6655994Z _scaled_mm 0.0278 ms 24.1% 2025-12-04T12:10:20.6656137Z SingleProcess AUTOTUNE benchmarking takes 0.0307 seconds and 0.1474 seconds precompiling for 5 choices 2025-12-04T12:10:20.6656225Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6656285Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6656357Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6656472Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6656965Z inductor [('triton_bundler_save_kernel', 40), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6657020Z graph_break [] 2025-12-04T12:10:20.6657098Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6657186Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6657244Z Autotune Choices Stats: 2025-12-04T12:10:20.6657621Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:20.6657716Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6657782Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6657933Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6658180Z triton_mm_11 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6658423Z triton_mm_10 0.0072 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6658662Z triton_mm_9 0.0104 ms 62.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6658900Z triton_mm_8 0.0116 ms 56.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6658959Z _scaled_mm 0.0284 ms 23.0% 2025-12-04T12:10:20.6659101Z SingleProcess AUTOTUNE benchmarking takes 0.0289 seconds and 0.1559 seconds precompiling for 5 choices 2025-12-04T12:10:20.6659317Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2f735ed63f420198.xml - 2025-12-04T12:10:20.6659393Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6659986Z FAILED [0.7170s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.6659990Z 2025-12-04T12:10:20.6660080Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6660387Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6660390Z 2025-12-04T12:10:20.6660493Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6660570Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.6660657Z ================= 1 failed, 101 deselected, 2 rerun in 36.28s ================== 2025-12-04T12:10:20.6660712Z Got exit code 1 2025-12-04T12:10:20.6660770Z Retrying single test... 2025-12-04T12:10:20.6660930Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-aeb441ee4d80ee1b.xml 2025-12-04T12:10:20.6661005Z ============================= test session starts ============================== 2025-12-04T12:10:20.6661132Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6661192Z cachedir: .pytest_cache 2025-12-04T12:10:20.6661366Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6661430Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6661488Z configfile: pytest.ini 2025-12-04T12:10:20.6661670Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6661779Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.6662057Z stepcurrent: skipping 101 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6662117Z Running 1 items in this shard 2025-12-04T12:10:20.6662131Z 2025-12-04T12:10:20.6662360Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [30.4069s] [100%] 2025-12-04T12:10:20.6662582Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.6830s] [100%] 2025-12-04T12:10:20.6662781Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.6549s] [100%] 2025-12-04T12:10:20.6662784Z 2025-12-04T12:10:20.6662852Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6663007Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6663071Z Traceback (most recent call last): 2025-12-04T12:10:20.6663244Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6663302Z method(*args, **kwargs) 2025-12-04T12:10:20.6663483Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6663540Z method(*args, **kwargs) 2025-12-04T12:10:20.6663706Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6663761Z with policy(): 2025-12-04T12:10:20.6663929Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6663987Z raise RuntimeError(msg) 
2025-12-04T12:10:20.6664388Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1023410176. 2025-12-04T12:10:20.6664393Z 2025-12-04T12:10:20.6664484Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6664754Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6664756Z 2025-12-04T12:10:20.6664857Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6664949Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6665007Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6665083Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6665578Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6665693Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6665746Z graph_break [] 2025-12-04T12:10:20.6665824Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6665913Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6666432Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6666508Z current_size = base.storage().size() 2025-12-04T12:10:20.6666566Z Autotune Choices Stats: 2025-12-04T12:10:20.6666950Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:20.6667022Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6667089Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6667223Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6667473Z triton_mm_3 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6667714Z triton_mm_1 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6667961Z triton_mm_2 0.0076 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6668197Z triton_mm_0 0.0079 ms 76.6% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6668256Z _scaled_mm 0.0264 ms 22.9% 2025-12-04T12:10:20.6668399Z SingleProcess AUTOTUNE benchmarking takes 0.0288 seconds and 0.1735 seconds precompiling for 5 choices 2025-12-04T12:10:20.6668553Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6668617Z Traceback (most recent call last): 2025-12-04T12:10:20.6668791Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6668848Z method(*args, **kwargs) 2025-12-04T12:10:20.6669015Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6669072Z method(*args, **kwargs) 2025-12-04T12:10:20.6669237Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6669290Z with policy(): 2025-12-04T12:10:20.6669458Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6669516Z raise RuntimeError(msg) 2025-12-04T12:10:20.6669916Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1023410176 and is now 1059061760. 
2025-12-04T12:10:20.6669920Z 2025-12-04T12:10:20.6670009Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6670316Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6670331Z 2025-12-04T12:10:20.6670434Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6670535Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6670594Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6670679Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6671171Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6671288Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6671343Z graph_break [] 2025-12-04T12:10:20.6671422Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6671510Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6672008Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6672085Z current_size = base.storage().size() 2025-12-04T12:10:20.6672143Z Autotune Choices Stats: 2025-12-04T12:10:20.6672520Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:20.6672592Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6672660Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6672795Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6673049Z triton_mm_3 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6673291Z triton_mm_1 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6673529Z triton_mm_2 0.0076 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6673768Z triton_mm_0 0.0079 ms 76.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6673827Z _scaled_mm 0.0264 ms 22.9% 2025-12-04T12:10:20.6673969Z SingleProcess 
AUTOTUNE benchmarking takes 0.0288 seconds and 0.1735 seconds precompiling for 5 choices 2025-12-04T12:10:20.6674058Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6674119Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6674191Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6674306Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6674806Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6674885Z graph_break [] 2025-12-04T12:10:20.6674964Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6675053Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6675114Z Autotune Choices Stats: 2025-12-04T12:10:20.6675487Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006560000125318766, "best_triton_pos": 0} 2025-12-04T12:10:20.6675560Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6675626Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6675761Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6676008Z triton_mm_7 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6676258Z triton_mm_4 0.0080 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6676496Z triton_mm_5 0.0090 ms 72.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6676734Z triton_mm_6 0.0092 ms 71.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6676791Z _scaled_mm 0.0263 ms 25.0% 2025-12-04T12:10:20.6676933Z SingleProcess AUTOTUNE benchmarking takes 0.0267 seconds and 0.1297 seconds precompiling for 5 choices 2025-12-04T12:10:20.6677005Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6677160Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6677223Z Traceback (most recent call last): 2025-12-04T12:10:20.6677394Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6677452Z method(*args, **kwargs) 2025-12-04T12:10:20.6677621Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6677678Z method(*args, **kwargs) 2025-12-04T12:10:20.6677843Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6677900Z with policy(): 
2025-12-04T12:10:20.6678066Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6678126Z raise RuntimeError(msg) 2025-12-04T12:10:20.6678525Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.6678539Z 2025-12-04T12:10:20.6678628Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6678907Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6678910Z 2025-12-04T12:10:20.6679022Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6679113Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6679172Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6679246Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6679747Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6679865Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6679918Z graph_break [] 2025-12-04T12:10:20.6679995Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 
2025-12-04T12:10:20.6680084Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6680619Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6680699Z current_size = base.storage().size() 2025-12-04T12:10:20.6680756Z Autotune Choices Stats: 2025-12-04T12:10:20.6681138Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:20.6681210Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6681276Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6681410Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6681657Z triton_mm_3 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6681896Z triton_mm_1 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6682136Z triton_mm_2 0.0076 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6682373Z triton_mm_0 0.0079 ms 76.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6682431Z _scaled_mm 0.0264 ms 22.9% 2025-12-04T12:10:20.6682573Z SingleProcess AUTOTUNE benchmarking takes 0.0288 seconds and 0.1735 seconds precompiling for 5 choices 2025-12-04T12:10:20.6682663Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6682722Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6682814Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6682933Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6683448Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6683504Z graph_break [] 2025-12-04T12:10:20.6683580Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6683670Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6683726Z Autotune Choices Stats: 2025-12-04T12:10:20.6684102Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006560000125318766, "best_triton_pos": 
0} 2025-12-04T12:10:20.6684174Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6684239Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6684374Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6684629Z triton_mm_7 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6684869Z triton_mm_4 0.0080 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6685109Z triton_mm_5 0.0090 ms 72.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6685348Z triton_mm_6 0.0092 ms 71.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6685405Z _scaled_mm 0.0263 ms 25.0% 2025-12-04T12:10:20.6685549Z SingleProcess AUTOTUNE benchmarking takes 0.0267 seconds and 0.1297 seconds precompiling for 5 choices 2025-12-04T12:10:20.6685639Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6685696Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6685773Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6685888Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6686381Z inductor [('triton_bundler_save_kernel', 40), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6686435Z graph_break [] 2025-12-04T12:10:20.6686511Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6686599Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6686656Z Autotune Choices Stats: 2025-12-04T12:10:20.6687032Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:20.6687129Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6687195Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6687341Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6687588Z triton_mm_9 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6687828Z triton_mm_10 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6688070Z triton_mm_11 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6688306Z triton_mm_8 0.0080 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6688364Z _scaled_mm 0.0268 ms 22.7% 2025-12-04T12:10:20.6688517Z SingleProcess AUTOTUNE benchmarking takes 0.0269 seconds and 0.1477 seconds precompiling for 5 choices 2025-12-04T12:10:20.6688724Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-aeb441ee4d80ee1b.xml - 2025-12-04T12:10:20.6688799Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6689388Z FAILED [0.6549s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.6689392Z 2025-12-04T12:10:20.6689482Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6689750Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6689753Z 2025-12-04T12:10:20.6689856Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6689934Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.6690020Z ================= 1 failed, 187 deselected, 2 rerun in 31.76s ================== 2025-12-04T12:10:20.6690074Z Got exit code 1 2025-12-04T12:10:20.6690169Z Retrying single test... 2025-12-04T12:10:20.6690330Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2635fa5d6b4aa588.xml 2025-12-04T12:10:20.6690404Z ============================= test session starts ============================== 2025-12-04T12:10:20.6690531Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6690590Z cachedir: .pytest_cache 2025-12-04T12:10:20.6690763Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6690826Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6690884Z configfile: pytest.ini 2025-12-04T12:10:20.6691062Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6691176Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.6691451Z stepcurrent: skipping 101 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6691523Z Running 1 items in this shard 2025-12-04T12:10:20.6691526Z 2025-12-04T12:10:20.6691751Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [8.5276s] [100%] 2025-12-04T12:10:20.6691974Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7838s] [100%] 2025-12-04T12:10:20.6692173Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.9692s] [100%] 2025-12-04T12:10:20.6692176Z 2025-12-04T12:10:20.6692245Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6692399Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6692462Z Traceback (most recent call last): 2025-12-04T12:10:20.6692636Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6692706Z method(*args, **kwargs) 2025-12-04T12:10:20.6692873Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6692929Z method(*args, **kwargs) 2025-12-04T12:10:20.6693095Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6693149Z with policy(): 2025-12-04T12:10:20.6693317Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6693374Z raise RuntimeError(msg) 
2025-12-04T12:10:20.6693773Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1023410176. 2025-12-04T12:10:20.6693776Z 2025-12-04T12:10:20.6693865Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6694132Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6694134Z 2025-12-04T12:10:20.6694236Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6694327Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6694385Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6694460Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6694955Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6695069Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6695122Z graph_break [] 2025-12-04T12:10:20.6695199Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6695288Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6695809Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6695874Z current_size = base.storage().size() 2025-12-04T12:10:20.6695931Z Autotune Choices Stats: 2025-12-04T12:10:20.6696310Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007919000461697578, "best_triton_pos": 0} 2025-12-04T12:10:20.6696383Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6696449Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6696586Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6696833Z triton_mm_0 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6697076Z triton_mm_1 0.0093 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6697327Z triton_mm_3 0.0093 ms 85.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6697568Z triton_mm_2 0.0099 ms 80.2% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6697627Z _scaled_mm 0.0249 ms 31.8% 2025-12-04T12:10:20.6697772Z SingleProcess AUTOTUNE benchmarking takes 0.0280 seconds and 0.1702 seconds precompiling for 5 choices 2025-12-04T12:10:20.6697929Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6697994Z Traceback (most recent call last): 2025-12-04T12:10:20.6698164Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6698221Z method(*args, **kwargs) 2025-12-04T12:10:20.6698388Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6698445Z method(*args, **kwargs) 2025-12-04T12:10:20.6698610Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6698663Z with policy(): 2025-12-04T12:10:20.6698833Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6698890Z raise RuntimeError(msg) 2025-12-04T12:10:20.6699290Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1023410176 and is now 1059061760. 
2025-12-04T12:10:20.6699293Z 2025-12-04T12:10:20.6699382Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6699649Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6699662Z 2025-12-04T12:10:20.6699775Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6699865Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6699935Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6700008Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6700619Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6700735Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6700790Z graph_break [] 2025-12-04T12:10:20.6700866Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6700956Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6701450Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6701533Z current_size = base.storage().size() 2025-12-04T12:10:20.6701590Z Autotune Choices Stats: 2025-12-04T12:10:20.6701965Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007919000461697578, "best_triton_pos": 0} 2025-12-04T12:10:20.6702039Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6702105Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6702241Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6702485Z triton_mm_0 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6702727Z triton_mm_1 0.0093 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6702967Z triton_mm_3 0.0093 ms 85.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6703209Z triton_mm_2 0.0099 ms 80.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6703267Z _scaled_mm 0.0249 ms 31.8% 2025-12-04T12:10:20.6703410Z SingleProcess 
AUTOTUNE benchmarking takes 0.0280 seconds and 0.1702 seconds precompiling for 5 choices 2025-12-04T12:10:20.6703500Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6703559Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6703632Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6703746Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6704271Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6704325Z graph_break [] 2025-12-04T12:10:20.6704402Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6704492Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6704550Z Autotune Choices Stats: 2025-12-04T12:10:20.6704920Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.6704993Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6705059Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6705194Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6705439Z triton_mm_6 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6705697Z triton_mm_5 0.0069 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6705935Z triton_mm_7 0.0074 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6706174Z triton_mm_4 0.0078 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6706234Z _scaled_mm 0.0265 ms 23.6% 2025-12-04T12:10:20.6706375Z SingleProcess AUTOTUNE benchmarking takes 0.0266 seconds and 0.1275 seconds precompiling for 5 choices 2025-12-04T12:10:20.6706446Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6706599Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6706662Z Traceback (most recent call last): 2025-12-04T12:10:20.6706833Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6706891Z method(*args, **kwargs) 2025-12-04T12:10:20.6707059Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6707117Z method(*args, **kwargs) 2025-12-04T12:10:20.6707284Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6707339Z with policy(): 
2025-12-04T12:10:20.6707508Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6707567Z raise RuntimeError(msg) 2025-12-04T12:10:20.6707968Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.6707984Z 2025-12-04T12:10:20.6708074Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6708356Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6708358Z 2025-12-04T12:10:20.6708473Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6708563Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6708624Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6708696Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6709191Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6709306Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6709360Z graph_break [] 2025-12-04T12:10:20.6709437Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 
2025-12-04T12:10:20.6709527Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6710021Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6710132Z current_size = base.storage().size() 2025-12-04T12:10:20.6710191Z Autotune Choices Stats: 2025-12-04T12:10:20.6710566Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007919000461697578, "best_triton_pos": 0} 2025-12-04T12:10:20.6710639Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6710705Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6710842Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6711086Z triton_mm_0 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6711325Z triton_mm_1 0.0093 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6711565Z triton_mm_3 0.0093 ms 85.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6711803Z triton_mm_2 0.0099 ms 80.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6711861Z _scaled_mm 0.0249 ms 31.8% 2025-12-04T12:10:20.6712004Z SingleProcess AUTOTUNE benchmarking takes 0.0280 seconds and 0.1702 seconds precompiling for 5 choices 2025-12-04T12:10:20.6712093Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6712166Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6712238Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6712366Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6712869Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6712924Z graph_break [] 2025-12-04T12:10:20.6713001Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6713089Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6713146Z Autotune Choices Stats: 2025-12-04T12:10:20.6713517Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 
0} 2025-12-04T12:10:20.6713666Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6713733Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6713868Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6714126Z triton_mm_6 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6714368Z triton_mm_5 0.0069 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6714609Z triton_mm_7 0.0074 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6714846Z triton_mm_4 0.0078 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6714904Z _scaled_mm 0.0265 ms 23.6% 2025-12-04T12:10:20.6715045Z SingleProcess AUTOTUNE benchmarking takes 0.0266 seconds and 0.1275 seconds precompiling for 5 choices 2025-12-04T12:10:20.6715135Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6715193Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6716253Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6716374Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6716864Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6716919Z graph_break [] 2025-12-04T12:10:20.6716995Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:20.6717084Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6717143Z Autotune Choices Stats: 2025-12-04T12:10:20.6717543Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006639000028371811, "best_triton_pos": 0} 2025-12-04T12:10:20.6717625Z AUTOTUNE scaled_mm(3x1024, 1024x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6717690Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6717825Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6718071Z triton_mm_9 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6718310Z triton_mm_8 0.0080 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6718557Z triton_mm_10 0.0084 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6718796Z triton_mm_11 0.0096 ms 68.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6718854Z _scaled_mm 0.0225 ms 29.5% 2025-12-04T12:10:20.6719007Z SingleProcess AUTOTUNE benchmarking takes 0.0354 seconds and 0.4058 seconds precompiling for 5 choices 2025-12-04T12:10:20.6719213Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2635fa5d6b4aa588.xml - 2025-12-04T12:10:20.6719290Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6719878Z FAILED [0.9692s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1059061760 and is now 1094713344. 2025-12-04T12:10:20.6719881Z 2025-12-04T12:10:20.6719971Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6720291Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6720293Z 2025-12-04T12:10:20.6720396Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6720534Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.6720621Z ================= 1 failed, 187 deselected, 2 rerun in 10.30s ================== 2025-12-04T12:10:20.6720675Z Got exit code 1 2025-12-04T12:10:20.6720893Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6721035Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.6721193Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-cb630f725eb82239.xml 2025-12-04T12:10:20.6721269Z ============================= test session starts ============================== 2025-12-04T12:10:20.6721395Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6721453Z cachedir: .pytest_cache 2025-12-04T12:10:20.6721627Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6721704Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6721761Z configfile: pytest.ini 2025-12-04T12:10:20.6721962Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6722055Z collecting ... collected 188 items / 102 deselected / 86 selected 2025-12-04T12:10:20.6722129Z stepcurrent: skipping 102 already run items. 
2025-12-04T12:10:20.6722191Z Running 86 items in this shard 2025-12-04T12:10:20.6722193Z 2025-12-04T12:10:20.6722424Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.5967s] [ 1%] 2025-12-04T12:10:20.6722649Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0754s] [ 1%] 2025-12-04T12:10:20.6722857Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda FAILED [0.9663s] [ 1%] 2025-12-04T12:10:20.6722860Z 2025-12-04T12:10:20.6722929Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6723084Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6723147Z Traceback (most recent call last): 2025-12-04T12:10:20.6723335Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6723394Z method(*args, **kwargs) 2025-12-04T12:10:20.6723561Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6723618Z method(*args, **kwargs) 2025-12-04T12:10:20.6723788Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6723844Z with policy(): 2025-12-04T12:10:20.6724011Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6724069Z raise RuntimeError(msg) 2025-12-04T12:10:20.6724470Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:20.6724474Z 2025-12-04T12:10:20.6724564Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6724851Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6724856Z 2025-12-04T12:10:20.6724958Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6725049Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6725107Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6725180Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6725683Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6725798Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6725864Z graph_break [] 2025-12-04T12:10:20.6725945Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6726034Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6726540Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6726605Z current_size = base.storage().size() 2025-12-04T12:10:20.6726665Z Autotune Choices Stats: 2025-12-04T12:10:20.6727055Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0069599999114871025, "best_triton_pos": 0} 2025-12-04T12:10:20.6727137Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6727207Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6727341Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6727592Z triton_mm_16 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6727848Z triton_mm_17 0.0071 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6728092Z triton_mm_7 0.0071 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6728333Z triton_mm_12 0.0072 ms 97.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6728573Z triton_mm_10 0.0088 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6728811Z triton_mm_14 0.0089 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6729058Z triton_mm_5 0.0090 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6729303Z triton_mm_6 0.0092 ms 75.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6729546Z triton_mm_18 0.0109 ms 63.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6729784Z triton_mm_2 0.0116 ms 60.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6729930Z SingleProcess AUTOTUNE benchmarking takes 0.0985 seconds and 0.4244 seconds precompiling for 20 choices 2025-12-04T12:10:20.6730133Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6730212Z Traceback (most recent call last): 2025-12-04T12:10:20.6730385Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6730443Z method(*args, **kwargs) 2025-12-04T12:10:20.6730609Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6730668Z method(*args, **kwargs) 2025-12-04T12:10:20.6730833Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6730888Z with policy(): 2025-12-04T12:10:20.6731054Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6731114Z raise RuntimeError(msg) 2025-12-04T12:10:20.6731516Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1121976320. 
2025-12-04T12:10:20.6731521Z 2025-12-04T12:10:20.6731610Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6731881Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6731894Z 2025-12-04T12:10:20.6731997Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6732088Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6732150Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6732223Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6732724Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6732840Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6732892Z graph_break [] 2025-12-04T12:10:20.6732972Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6733062Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6733575Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6733643Z current_size = base.storage().size() 2025-12-04T12:10:20.6733700Z Autotune Choices Stats: 2025-12-04T12:10:20.6734084Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0069599999114871025, "best_triton_pos": 0} 2025-12-04T12:10:20.6734166Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6734233Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6734382Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6734642Z triton_mm_16 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6734886Z triton_mm_17 0.0071 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6735128Z triton_mm_7 0.0071 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6735365Z triton_mm_12 0.0072 ms 97.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6735602Z triton_mm_10 0.0088 ms 79.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6735838Z triton_mm_14 0.0089 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6736086Z triton_mm_5 0.0090 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6736329Z triton_mm_6 0.0092 ms 75.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6736573Z triton_mm_18 0.0109 ms 63.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6736810Z triton_mm_2 0.0116 ms 60.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6736956Z SingleProcess AUTOTUNE benchmarking takes 0.0985 seconds and 0.4244 seconds precompiling for 20 choices 2025-12-04T12:10:20.6737045Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6737104Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6737176Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6737305Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6737806Z inductor 
[('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6737861Z graph_break [] 2025-12-04T12:10:20.6737940Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6738030Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6738088Z Autotune Choices Stats: 2025-12-04T12:10:20.6738467Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:20.6738577Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6738644Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6738779Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6739026Z triton_mm_35 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6739268Z triton_mm_36 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6739508Z triton_mm_26 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, 
BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6739745Z triton_mm_31 0.0072 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6739982Z triton_mm_29 0.0086 ms 71.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6740274Z triton_mm_24 0.0089 ms 69.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6740517Z triton_mm_28 0.0090 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6740758Z triton_mm_25 0.0092 ms 67.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6740998Z triton_mm_30 0.0103 ms 59.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6741236Z triton_mm_33 0.0105 ms 58.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6741395Z SingleProcess AUTOTUNE benchmarking 
takes 0.1203 seconds and 0.3372 seconds precompiling for 20 choices 2025-12-04T12:10:20.6741466Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6741621Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6741685Z Traceback (most recent call last): 2025-12-04T12:10:20.6741857Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6741916Z method(*args, **kwargs) 2025-12-04T12:10:20.6742083Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6742140Z method(*args, **kwargs) 2025-12-04T12:10:20.6742305Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6742375Z with policy(): 2025-12-04T12:10:20.6742541Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6742613Z raise RuntimeError(msg) 2025-12-04T12:10:20.6743016Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 
2025-12-04T12:10:20.6743021Z 2025-12-04T12:10:20.6743111Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6743384Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6743386Z 2025-12-04T12:10:20.6743491Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6743582Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6743642Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6743718Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6744219Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6744346Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6744399Z graph_break [] 2025-12-04T12:10:20.6744480Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6744571Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6745067Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6745132Z current_size = base.storage().size() 2025-12-04T12:10:20.6745190Z Autotune Choices Stats: 2025-12-04T12:10:20.6745575Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0069599999114871025, "best_triton_pos": 0} 2025-12-04T12:10:20.6745667Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6745735Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6745869Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6746117Z triton_mm_16 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6746359Z triton_mm_17 0.0071 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6746599Z triton_mm_7 0.0071 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6746859Z triton_mm_12 0.0072 ms 97.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6747098Z triton_mm_10 0.0088 ms 79.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6747337Z triton_mm_14 0.0089 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6747577Z triton_mm_5 0.0090 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6747820Z triton_mm_6 0.0092 ms 75.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6748062Z triton_mm_18 0.0109 ms 63.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6748310Z triton_mm_2 0.0116 ms 60.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6748455Z SingleProcess AUTOTUNE benchmarking takes 0.0985 seconds and 0.4244 seconds precompiling for 20 choices 2025-12-04T12:10:20.6748545Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6748607Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6748680Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6748798Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6749295Z inductor 
[('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6749351Z graph_break [] 2025-12-04T12:10:20.6749429Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6749517Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6749577Z Autotune Choices Stats: 2025-12-04T12:10:20.6749962Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:20.6750042Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6750144Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6750281Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6750529Z triton_mm_35 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6750772Z triton_mm_36 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6751043Z triton_mm_26 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, 
BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6751280Z triton_mm_31 0.0072 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6751518Z triton_mm_29 0.0086 ms 71.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6751755Z triton_mm_24 0.0089 ms 69.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6751999Z triton_mm_28 0.0090 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6752239Z triton_mm_25 0.0092 ms 67.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6752492Z triton_mm_30 0.0103 ms 59.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6752730Z triton_mm_33 0.0105 ms 58.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6752875Z SingleProcess AUTOTUNE benchmarking 
takes 0.1203 seconds and 0.3372 seconds precompiling for 20 choices 2025-12-04T12:10:20.6752965Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6753024Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6753099Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6753214Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6753733Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6753788Z graph_break [] 2025-12-04T12:10:20.6753868Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6753957Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6754014Z Autotune Choices Stats: 2025-12-04T12:10:20.6754391Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_54", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:20.6754470Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6754536Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6754673Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6754942Z triton_mm_54 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6755184Z triton_mm_55 0.0073 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6755423Z triton_mm_50 0.0075 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6755669Z triton_mm_44 0.0085 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6755912Z triton_mm_47 0.0087 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6756150Z triton_mm_43 0.0090 ms 71.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6756397Z triton_mm_48 0.0090 ms 71.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6756639Z triton_mm_45 0.0092 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6756880Z triton_mm_52 0.0094 ms 68.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6757121Z triton_mm_56 0.0110 ms 58.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6757265Z SingleProcess AUTOTUNE benchmarking takes 0.1520 seconds and 0.2817 seconds precompiling for 20 choices 2025-12-04T12:10:20.6757468Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-cb630f725eb82239.xml - 2025-12-04T12:10:20.6757546Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6758155Z FAILED [0.9663s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 2025-12-04T12:10:20.6758159Z 2025-12-04T12:10:20.6758249Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6758520Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6758522Z 2025-12-04T12:10:20.6758624Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6758715Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.6758799Z ================== 1 failed, 102 deselected, 2 rerun in 4.66s ================== 2025-12-04T12:10:20.6758864Z Got exit code 1 2025-12-04T12:10:20.6758921Z Retrying single test... 2025-12-04T12:10:20.6759081Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-597d69a84c145524.xml 2025-12-04T12:10:20.6759155Z ============================= test session starts ============================== 2025-12-04T12:10:20.6759284Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6759341Z cachedir: .pytest_cache 2025-12-04T12:10:20.6759515Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6759578Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6759635Z configfile: pytest.ini 2025-12-04T12:10:20.6759815Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6759908Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.6760215Z stepcurrent: skipping 102 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6760277Z Running 1 items in this shard 2025-12-04T12:10:20.6760279Z 2025-12-04T12:10:20.6760522Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [34.5477s] [100%] 2025-12-04T12:10:20.6760746Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.2096s] [100%] 2025-12-04T12:10:20.6760949Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.2299s] [100%] 2025-12-04T12:10:20.6760952Z 2025-12-04T12:10:20.6761020Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6761176Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6761238Z Traceback (most recent call last): 2025-12-04T12:10:20.6761411Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6761470Z method(*args, **kwargs) 2025-12-04T12:10:20.6761638Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6761695Z method(*args, **kwargs) 2025-12-04T12:10:20.6761862Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6761930Z with policy(): 2025-12-04T12:10:20.6762100Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6762233Z raise RuntimeError(msg) 
2025-12-04T12:10:20.6762634Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:20.6762638Z 2025-12-04T12:10:20.6762728Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6763000Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6763018Z 2025-12-04T12:10:20.6763123Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6763212Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6763286Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6763362Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6763862Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6763977Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6764032Z graph_break [] 2025-12-04T12:10:20.6764111Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6764202Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6764700Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6764774Z current_size = base.storage().size() 2025-12-04T12:10:20.6764832Z Autotune Choices Stats: 2025-12-04T12:10:20.6765215Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007799999788403511, "best_triton_pos": 0} 2025-12-04T12:10:20.6765298Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6765365Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6765501Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6765749Z triton_mm_16 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6765992Z triton_mm_17 0.0083 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6766246Z triton_mm_6 0.0090 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6766489Z triton_mm_10 0.0091 ms 
85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6766728Z triton_mm_14 0.0096 ms 81.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6766969Z triton_mm_7 0.0104 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6767207Z triton_mm_12 0.0105 ms 74.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6767466Z triton_mm_5 0.0108 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6767709Z triton_mm_11 0.0108 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6767954Z triton_mm_18 0.0112 ms 69.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6768097Z SingleProcess AUTOTUNE benchmarking takes 0.1014 seconds and 0.4412 seconds precompiling for 20 choices 2025-12-04T12:10:20.6768254Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6768317Z Traceback (most recent call last): 2025-12-04T12:10:20.6768489Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6768546Z method(*args, **kwargs) 2025-12-04T12:10:20.6768715Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6768772Z method(*args, **kwargs) 2025-12-04T12:10:20.6768956Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6769011Z with policy(): 2025-12-04T12:10:20.6769179Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6769236Z raise RuntimeError(msg) 2025-12-04T12:10:20.6769643Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1121976320. 
2025-12-04T12:10:20.6769645Z 2025-12-04T12:10:20.6769736Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6770006Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6770010Z 2025-12-04T12:10:20.6770156Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6770245Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6770305Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6770394Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6770897Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6771012Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6771066Z graph_break [] 2025-12-04T12:10:20.6771146Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6771235Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6771745Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6771821Z current_size = base.storage().size() 2025-12-04T12:10:20.6771879Z Autotune Choices Stats: 2025-12-04T12:10:20.6772260Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007799999788403511, "best_triton_pos": 0} 2025-12-04T12:10:20.6772342Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6772408Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6772544Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6772795Z triton_mm_16 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6773036Z triton_mm_17 0.0083 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6773278Z triton_mm_6 0.0090 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6773531Z triton_mm_10 0.0091 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6773772Z triton_mm_14 0.0096 ms 81.2% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6774012Z triton_mm_7 0.0104 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6774248Z triton_mm_12 0.0105 ms 74.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6774485Z triton_mm_5 0.0108 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6774736Z triton_mm_11 0.0108 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6774980Z triton_mm_18 0.0112 ms 69.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6775124Z SingleProcess AUTOTUNE benchmarking takes 0.1014 seconds and 0.4412 seconds precompiling for 20 choices 2025-12-04T12:10:20.6775216Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6775276Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6775348Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6775465Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6775982Z inductor 
[('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6776037Z graph_break [] 2025-12-04T12:10:20.6776115Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6776206Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6776263Z Autotune Choices Stats: 2025-12-04T12:10:20.6776639Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007120000198483467, "best_triton_pos": 0} 2025-12-04T12:10:20.6776719Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6776787Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6776922Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6777169Z triton_mm_35 0.0071 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6777424Z triton_mm_24 0.0088 ms 80.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6777666Z triton_mm_36 0.0096 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, 
BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6777908Z triton_mm_26 0.0102 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6778144Z triton_mm_29 0.0108 ms 66.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6778383Z triton_mm_31 0.0110 ms 65.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6778634Z triton_mm_25 0.0114 ms 62.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6778878Z triton_mm_28 0.0117 ms 61.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6779115Z triton_mm_33 0.0117 ms 60.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6779351Z triton_mm_32 0.0120 ms 59.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6779496Z SingleProcess AUTOTUNE benchmarking 
takes 0.0934 seconds and 0.2362 seconds precompiling for 20 choices 2025-12-04T12:10:20.6779576Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6779741Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6779804Z Traceback (most recent call last): 2025-12-04T12:10:20.6779977Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6780035Z method(*args, **kwargs) 2025-12-04T12:10:20.6780245Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6780303Z method(*args, **kwargs) 2025-12-04T12:10:20.6780466Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6780521Z with policy(): 2025-12-04T12:10:20.6780689Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6780748Z raise RuntimeError(msg) 2025-12-04T12:10:20.6781147Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 
2025-12-04T12:10:20.6781149Z 2025-12-04T12:10:20.6781254Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6781525Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6781527Z 2025-12-04T12:10:20.6781630Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6781721Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6781781Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6781854Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6782356Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6782473Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6782526Z graph_break [] 2025-12-04T12:10:20.6782604Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6782692Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6783203Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6783268Z current_size = base.storage().size() 2025-12-04T12:10:20.6783326Z Autotune Choices Stats: 2025-12-04T12:10:20.6783708Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007799999788403511, "best_triton_pos": 0} 2025-12-04T12:10:20.6783789Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6783870Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6784004Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6784272Z triton_mm_16 0.0078 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6784515Z triton_mm_17 0.0083 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6784762Z triton_mm_6 0.0090 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6785000Z triton_mm_10 0.0091 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6785239Z triton_mm_14 0.0096 ms 81.2% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6785477Z triton_mm_7 0.0104 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6785725Z triton_mm_12 0.0105 ms 74.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6785965Z triton_mm_5 0.0108 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6786205Z triton_mm_11 0.0108 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6786447Z triton_mm_18 0.0112 ms 69.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6786592Z SingleProcess AUTOTUNE benchmarking takes 0.1014 seconds and 0.4412 seconds precompiling for 20 choices 2025-12-04T12:10:20.6786681Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6786740Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6786828Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6786943Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6787444Z inductor 
[('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6787499Z graph_break [] 2025-12-04T12:10:20.6787576Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6787667Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6787724Z Autotune Choices Stats: 2025-12-04T12:10:20.6788115Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007120000198483467, "best_triton_pos": 0} 2025-12-04T12:10:20.6788204Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6788273Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6788408Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6788655Z triton_mm_35 0.0071 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6788894Z triton_mm_24 0.0088 ms 80.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6789135Z triton_mm_36 0.0096 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, 
BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6789376Z triton_mm_26 0.0102 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6789626Z triton_mm_29 0.0108 ms 66.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6789867Z triton_mm_31 0.0110 ms 65.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6790152Z triton_mm_25 0.0114 ms 62.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6790393Z triton_mm_28 0.0117 ms 61.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6790633Z triton_mm_33 0.0117 ms 60.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6790884Z triton_mm_32 0.0120 ms 59.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6791030Z SingleProcess AUTOTUNE benchmarking 
takes 0.0934 seconds and 0.2362 seconds precompiling for 20 choices 2025-12-04T12:10:20.6791119Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6791177Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6791250Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6791365Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6791870Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6791938Z graph_break [] 2025-12-04T12:10:20.6792017Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6792118Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6792176Z Autotune Choices Stats: 2025-12-04T12:10:20.6792550Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_55", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:20.6792630Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6792695Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6792831Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6793081Z triton_mm_55 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6793322Z triton_mm_54 0.0071 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6793559Z triton_mm_50 0.0073 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6793813Z triton_mm_44 0.0081 ms 77.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6794055Z triton_mm_52 0.0084 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6794291Z triton_mm_48 0.0086 ms 73.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6794528Z triton_mm_43 0.0094 ms 67.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6794770Z triton_mm_47 0.0103 ms 61.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6795020Z triton_mm_45 0.0105 ms 59.9% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6795263Z triton_mm_56 0.0108 ms 57.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6795407Z SingleProcess AUTOTUNE benchmarking takes 0.1500 seconds and 0.4570 seconds precompiling for 20 choices 2025-12-04T12:10:20.6795610Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-597d69a84c145524.xml - 2025-12-04T12:10:20.6795688Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6796299Z FAILED [1.2299s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 2025-12-04T12:10:20.6796311Z 2025-12-04T12:10:20.6796401Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6796671Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6796674Z 2025-12-04T12:10:20.6796777Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6796856Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.6796943Z ================= 1 failed, 187 deselected, 2 rerun in 37.01s ================== 2025-12-04T12:10:20.6796998Z Got exit code 1 2025-12-04T12:10:20.6797056Z Retrying single test... 2025-12-04T12:10:20.6797216Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-758dcb63c92ea375.xml 2025-12-04T12:10:20.6797290Z ============================= test session starts ============================== 2025-12-04T12:10:20.6797419Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6797490Z cachedir: .pytest_cache 2025-12-04T12:10:20.6797664Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6797727Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6797785Z configfile: pytest.ini 2025-12-04T12:10:20.6797963Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6798057Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.6798324Z stepcurrent: skipping 102 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6798386Z Running 1 items in this shard 2025-12-04T12:10:20.6798388Z 2025-12-04T12:10:20.6798614Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.6250s] [100%] 2025-12-04T12:10:20.6798839Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1130s] [100%] 2025-12-04T12:10:20.6799040Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.0280s] [100%] 2025-12-04T12:10:20.6799054Z 2025-12-04T12:10:20.6799122Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6799278Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6799341Z Traceback (most recent call last): 2025-12-04T12:10:20.6799514Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6799572Z method(*args, **kwargs) 2025-12-04T12:10:20.6799740Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6799797Z method(*args, **kwargs) 2025-12-04T12:10:20.6799964Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6800017Z with policy(): 2025-12-04T12:10:20.6800235Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6800292Z raise RuntimeError(msg) 
2025-12-04T12:10:20.6800708Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1054867456. 2025-12-04T12:10:20.6800710Z 2025-12-04T12:10:20.6800801Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6801074Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6801076Z 2025-12-04T12:10:20.6801178Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6801270Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6801330Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6801404Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6801905Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6802031Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6802085Z graph_break [] 2025-12-04T12:10:20.6802163Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6802253Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6802753Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6802818Z current_size = base.storage().size() 2025-12-04T12:10:20.6802874Z Autotune Choices Stats: 2025-12-04T12:10:20.6803263Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.6803345Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6803424Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6803562Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6803811Z triton_mm_17 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6804054Z triton_mm_16 0.0066 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6804295Z triton_mm_7 0.0070 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6804549Z triton_mm_12 0.0072 ms 
86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6804802Z triton_mm_6 0.0080 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6805043Z triton_mm_9 0.0084 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6805285Z triton_mm_10 0.0086 ms 72.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6805525Z triton_mm_5 0.0089 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6805766Z triton_mm_14 0.0093 ms 67.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6806006Z triton_mm_11 0.0109 ms 57.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6806161Z SingleProcess AUTOTUNE benchmarking takes 0.0817 seconds and 0.4168 seconds precompiling for 20 choices 2025-12-04T12:10:20.6806318Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6806382Z Traceback (most recent call last): 2025-12-04T12:10:20.6806558Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6806616Z method(*args, **kwargs) 2025-12-04T12:10:20.6806783Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6806840Z method(*args, **kwargs) 2025-12-04T12:10:20.6807007Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6807061Z with policy(): 2025-12-04T12:10:20.6807229Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6807286Z raise RuntimeError(msg) 2025-12-04T12:10:20.6807698Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1054867456 and is now 1121976320. 
2025-12-04T12:10:20.6807703Z 2025-12-04T12:10:20.6807793Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6808066Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6808069Z 2025-12-04T12:10:20.6808173Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6808261Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6808321Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6808394Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6808907Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6809031Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6809085Z graph_break [] 2025-12-04T12:10:20.6809164Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6809254Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6809751Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6809820Z current_size = base.storage().size() 2025-12-04T12:10:20.6809879Z Autotune Choices Stats: 2025-12-04T12:10:20.6810293Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.6810392Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6810458Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6810594Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6810843Z triton_mm_17 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6811088Z triton_mm_16 0.0066 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6811328Z triton_mm_7 0.0070 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6811568Z triton_mm_12 0.0072 ms 86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6811829Z triton_mm_6 0.0080 ms 78.0% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6812071Z triton_mm_9 0.0084 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6812310Z triton_mm_10 0.0086 ms 72.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6812547Z triton_mm_5 0.0089 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6812785Z triton_mm_14 0.0093 ms 67.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6813056Z triton_mm_11 0.0109 ms 57.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6813200Z SingleProcess AUTOTUNE benchmarking takes 0.0817 seconds and 0.4168 seconds precompiling for 20 choices 2025-12-04T12:10:20.6813294Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6813353Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6813426Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6813540Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6814038Z 
inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6814092Z graph_break [] 2025-12-04T12:10:20.6814171Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6814259Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6814329Z Autotune Choices Stats: 2025-12-04T12:10:20.6814707Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:20.6814789Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6814856Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6814991Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6815239Z triton_mm_36 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6815479Z triton_mm_35 0.0065 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6815743Z triton_mm_25 0.0079 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6815982Z triton_mm_31 0.0080 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6816223Z triton_mm_28 0.0084 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6816461Z triton_mm_29 0.0094 ms 67.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6816698Z triton_mm_33 0.0094 ms 67.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6816960Z triton_mm_26 0.0103 ms 61.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6817197Z triton_mm_30 0.0104 ms 61.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6817440Z triton_mm_37 0.0108 ms 58.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6817584Z SingleProcess 
AUTOTUNE benchmarking takes 0.1106 seconds and 0.3667 seconds precompiling for 20 choices 2025-12-04T12:10:20.6817654Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6817811Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6817874Z Traceback (most recent call last): 2025-12-04T12:10:20.6818046Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6818104Z method(*args, **kwargs) 2025-12-04T12:10:20.6818271Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6818338Z method(*args, **kwargs) 2025-12-04T12:10:20.6818504Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6818558Z with policy(): 2025-12-04T12:10:20.6818724Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6818784Z raise RuntimeError(msg) 2025-12-04T12:10:20.6819187Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 
2025-12-04T12:10:20.6819190Z 2025-12-04T12:10:20.6819280Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6819554Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6819557Z 2025-12-04T12:10:20.6819659Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6819747Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6819818Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6819891Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6820434Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6820547Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6820601Z graph_break [] 2025-12-04T12:10:20.6820679Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6820769Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6821279Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6821357Z current_size = base.storage().size() 2025-12-04T12:10:20.6821416Z Autotune Choices Stats: 2025-12-04T12:10:20.6821799Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.6821880Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6821946Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6822083Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6822331Z triton_mm_17 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6822572Z triton_mm_16 0.0066 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6822827Z triton_mm_7 0.0070 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6823065Z triton_mm_12 0.0072 ms 86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6823308Z triton_mm_6 0.0080 ms 78.0% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6823551Z triton_mm_9 0.0084 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6823792Z triton_mm_10 0.0086 ms 72.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6824041Z triton_mm_5 0.0089 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6824281Z triton_mm_14 0.0093 ms 67.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6824519Z triton_mm_11 0.0109 ms 57.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6824663Z SingleProcess AUTOTUNE benchmarking takes 0.0817 seconds and 0.4168 seconds precompiling for 20 choices 2025-12-04T12:10:20.6824754Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6824812Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6824885Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6825010Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6825515Z 
inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6825569Z graph_break [] 2025-12-04T12:10:20.6825648Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6825738Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6825794Z Autotune Choices Stats: 2025-12-04T12:10:20.6826172Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:20.6826253Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6826320Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6826455Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6826704Z triton_mm_36 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6826958Z triton_mm_35 0.0065 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6827206Z triton_mm_25 0.0079 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6827446Z triton_mm_31 0.0080 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6827687Z triton_mm_28 0.0084 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6827925Z triton_mm_29 0.0094 ms 67.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6828175Z triton_mm_33 0.0094 ms 67.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6828416Z triton_mm_26 0.0103 ms 61.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6828656Z triton_mm_30 0.0104 ms 61.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6828898Z triton_mm_37 0.0108 ms 58.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6829057Z SingleProcess 
AUTOTUNE benchmarking takes 0.1106 seconds and 0.3667 seconds precompiling for 20 choices 2025-12-04T12:10:20.6829155Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6829215Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6829288Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6829403Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6829896Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6829952Z graph_break [] 2025-12-04T12:10:20.6830031Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:20.6830158Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6830215Z Autotune Choices Stats: 2025-12-04T12:10:20.6830590Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_54", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.6830690Z AUTOTUNE scaled_mm(3x1024, 1024x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6830756Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6830891Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6831139Z triton_mm_54 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6831385Z triton_mm_55 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6831623Z triton_mm_45 0.0071 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6831867Z triton_mm_44 0.0077 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6832119Z triton_mm_47 0.0083 ms 74.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6832357Z triton_mm_48 0.0084 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6832596Z triton_mm_43 0.0089 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6832834Z triton_mm_52 0.0090 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6833072Z triton_mm_50 0.0104 ms 59.6% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6833339Z triton_mm_56 0.0109 ms 56.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6833483Z SingleProcess AUTOTUNE benchmarking takes 0.1417 seconds and 0.3209 seconds precompiling for 20 choices 2025-12-04T12:10:20.6833689Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-758dcb63c92ea375.xml - 2025-12-04T12:10:20.6833764Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6834359Z FAILED [1.0280s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1189085184. 2025-12-04T12:10:20.6834363Z 2025-12-04T12:10:20.6834454Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6834726Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6834739Z 2025-12-04T12:10:20.6834842Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6834921Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.6835004Z ================== 1 failed, 187 deselected, 2 rerun in 4.79s ================== 2025-12-04T12:10:20.6835060Z Got exit code 1 2025-12-04T12:10:20.6835281Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6835423Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.6835582Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-69b5d04694ba2e9d.xml 2025-12-04T12:10:20.6835655Z ============================= test session starts ============================== 2025-12-04T12:10:20.6835787Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6835844Z cachedir: .pytest_cache 2025-12-04T12:10:20.6836018Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6836081Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6836150Z configfile: pytest.ini 2025-12-04T12:10:20.6836328Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6836423Z collecting ... collected 188 items / 103 deselected / 85 selected 2025-12-04T12:10:20.6836494Z stepcurrent: skipping 103 already run items. 
2025-12-04T12:10:20.6836557Z Running 85 items in this shard 2025-12-04T12:10:20.6836559Z 2025-12-04T12:10:20.6836784Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8468s] [ 1%] 2025-12-04T12:10:20.6837004Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3472s] [ 1%] 2025-12-04T12:10:20.6837202Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3235s] [ 1%] 2025-12-04T12:10:20.6837214Z 2025-12-04T12:10:20.6837281Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6837449Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6837512Z Traceback (most recent call last): 2025-12-04T12:10:20.6837685Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6837744Z method(*args, **kwargs) 2025-12-04T12:10:20.6837912Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6837968Z method(*args, **kwargs) 2025-12-04T12:10:20.6838135Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6838188Z with policy(): 2025-12-04T12:10:20.6838357Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6838414Z raise RuntimeError(msg) 2025-12-04T12:10:20.6838809Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.6838821Z 2025-12-04T12:10:20.6838912Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6839178Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6839181Z 2025-12-04T12:10:20.6839285Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6839377Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6839437Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6839510Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6839593Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6839706Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6839760Z graph_break [] 2025-12-04T12:10:20.6839836Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6839990Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6840052Z Traceback (most recent call last): 2025-12-04T12:10:20.6840251Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6840307Z method(*args, **kwargs) 2025-12-04T12:10:20.6840487Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6840544Z method(*args, **kwargs) 2025-12-04T12:10:20.6840710Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6840765Z with policy(): 2025-12-04T12:10:20.6840933Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6840990Z raise RuntimeError(msg) 2025-12-04T12:10:20.6841383Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.6841386Z 2025-12-04T12:10:20.6841490Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6841771Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6841773Z 2025-12-04T12:10:20.6841877Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6841965Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6842026Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6842100Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6842184Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6842298Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6842352Z graph_break [] 2025-12-04T12:10:20.6842427Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6842519Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6842577Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6842650Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6842762Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6842842Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6842895Z graph_break [] 2025-12-04T12:10:20.6842969Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6843053Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6843211Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6843274Z Traceback (most recent call last): 2025-12-04T12:10:20.6843442Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6843501Z method(*args, **kwargs) 2025-12-04T12:10:20.6843667Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6843726Z method(*args, **kwargs) 2025-12-04T12:10:20.6843889Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6843945Z with policy(): 2025-12-04T12:10:20.6844111Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6844171Z raise RuntimeError(msg) 2025-12-04T12:10:20.6844560Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.6844574Z 2025-12-04T12:10:20.6844665Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6844929Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6844932Z 2025-12-04T12:10:20.6845036Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6845126Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6845187Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6845260Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6845341Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6845456Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6845509Z graph_break [] 2025-12-04T12:10:20.6845598Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6845688Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6845765Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6845836Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6845948Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6846028Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6846082Z graph_break [] 2025-12-04T12:10:20.6846156Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6846246Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6846303Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6846374Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6846484Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6846566Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6846618Z graph_break [] 2025-12-04T12:10:20.6846693Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6846897Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-69b5d04694ba2e9d.xml - 2025-12-04T12:10:20.6846974Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6847552Z FAILED [0.3235s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.6847567Z 2025-12-04T12:10:20.6847657Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6847922Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6847925Z 2025-12-04T12:10:20.6848026Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6848104Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.6848188Z ================== 1 failed, 103 deselected, 2 rerun in 2.54s ================== 2025-12-04T12:10:20.6848244Z Got exit code 1 2025-12-04T12:10:20.6848301Z Retrying single test... 
2025-12-04T12:10:20.6848462Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d793d2c5ce42804a.xml 2025-12-04T12:10:20.6848535Z ============================= test session starts ============================== 2025-12-04T12:10:20.6848673Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6848730Z cachedir: .pytest_cache 2025-12-04T12:10:20.6848904Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6848966Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6849024Z configfile: pytest.ini 2025-12-04T12:10:20.6849201Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6849294Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.6849562Z stepcurrent: skipping 103 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6849624Z Running 1 items in this shard 2025-12-04T12:10:20.6849636Z 2025-12-04T12:10:20.6849871Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [37.3879s] [100%] 2025-12-04T12:10:20.6850123Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1260s] [100%] 2025-12-04T12:10:20.6850320Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda FAILED [1.0135s] [100%] 2025-12-04T12:10:20.6850323Z 2025-12-04T12:10:20.6850390Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6850541Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6850603Z Traceback (most recent call last): 2025-12-04T12:10:20.6850778Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6850836Z method(*args, **kwargs) 2025-12-04T12:10:20.6851008Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6851064Z method(*args, **kwargs) 2025-12-04T12:10:20.6851229Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6851282Z with policy(): 2025-12-04T12:10:20.6851466Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6851523Z raise RuntimeError(msg) 
2025-12-04T12:10:20.6851918Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.6851921Z 2025-12-04T12:10:20.6852012Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6852278Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6852281Z 2025-12-04T12:10:20.6852383Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6852476Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6852535Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6852607Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6852688Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6852801Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6852872Z graph_break [] 2025-12-04T12:10:20.6852947Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6853100Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6853163Z Traceback (most recent call last): 2025-12-04T12:10:20.6853331Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6853387Z method(*args, **kwargs) 2025-12-04T12:10:20.6853554Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.6853610Z method(*args, **kwargs) 2025-12-04T12:10:20.6853774Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6853827Z with policy(): 2025-12-04T12:10:20.6853994Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6854066Z raise RuntimeError(msg) 2025-12-04T12:10:20.6854468Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.6854470Z 2025-12-04T12:10:20.6854562Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6854826Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6854828Z 2025-12-04T12:10:20.6854931Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6855023Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6855082Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6855154Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6855236Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6855349Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6855403Z graph_break [] 2025-12-04T12:10:20.6855477Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6855577Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.6855635Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6855707Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6855819Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6855900Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6855954Z graph_break [] 2025-12-04T12:10:20.6856029Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6856098Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6856254Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6856316Z Traceback (most recent call last): 2025-12-04T12:10:20.6856483Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6856541Z method(*args, **kwargs) 2025-12-04T12:10:20.6856706Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6856763Z method(*args, **kwargs) 2025-12-04T12:10:20.6856928Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6856993Z with policy(): 2025-12-04T12:10:20.6857160Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6857219Z raise RuntimeError(msg) 2025-12-04T12:10:20.6857609Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.6857612Z 2025-12-04T12:10:20.6857702Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6857966Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6857969Z 2025-12-04T12:10:20.6858083Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6858173Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6859852Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6859931Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6860014Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6860181Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6860236Z graph_break [] 2025-12-04T12:10:20.6860314Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6860403Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6860462Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6860533Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6860645Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6860728Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6860782Z graph_break [] 2025-12-04T12:10:20.6860856Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6860948Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6861005Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6861076Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6861186Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6861287Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6861342Z graph_break [] 2025-12-04T12:10:20.6861415Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6861623Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d793d2c5ce42804a.xml - 2025-12-04T12:10:20.6861701Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6862286Z FAILED [1.0135s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.6862291Z 2025-12-04T12:10:20.6862380Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6862648Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6862650Z 2025-12-04T12:10:20.6862752Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6862845Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.6862932Z ================= 1 failed, 187 deselected, 2 rerun in 39.55s ================== 2025-12-04T12:10:20.6862988Z Got exit code 1 2025-12-04T12:10:20.6863044Z Retrying single test... 
2025-12-04T12:10:20.6863206Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-544858fe93f35194.xml 2025-12-04T12:10:20.6863279Z ============================= test session starts ============================== 2025-12-04T12:10:20.6863408Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6863465Z cachedir: .pytest_cache 2025-12-04T12:10:20.6863638Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6863702Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6863782Z configfile: pytest.ini 2025-12-04T12:10:20.6863963Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6864068Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.6864332Z stepcurrent: skipping 103 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6864393Z Running 1 items in this shard 2025-12-04T12:10:20.6864397Z 2025-12-04T12:10:20.6864620Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5842s] [100%] 2025-12-04T12:10:20.6864838Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2604s] [100%] 2025-12-04T12:10:20.6865036Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2407s] [100%] 2025-12-04T12:10:20.6865039Z 2025-12-04T12:10:20.6865106Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6865258Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6865320Z Traceback (most recent call last): 2025-12-04T12:10:20.6865492Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6865560Z method(*args, **kwargs) 2025-12-04T12:10:20.6865727Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6865783Z method(*args, **kwargs) 2025-12-04T12:10:20.6865949Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6866003Z with policy(): 2025-12-04T12:10:20.6866171Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6866228Z raise RuntimeError(msg) 
2025-12-04T12:10:20.6866626Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.6866630Z 2025-12-04T12:10:20.6866719Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6866989Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6866992Z 2025-12-04T12:10:20.6867104Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6867195Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6867255Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6867327Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6867408Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6867522Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6867576Z graph_break [] 2025-12-04T12:10:20.6867650Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6867802Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6867865Z Traceback (most recent call last): 2025-12-04T12:10:20.6868035Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6868103Z method(*args, **kwargs) 2025-12-04T12:10:20.6868282Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.6868338Z method(*args, **kwargs) 2025-12-04T12:10:20.6868503Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6868556Z with policy(): 2025-12-04T12:10:20.6868723Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6868781Z raise RuntimeError(msg) 2025-12-04T12:10:20.6869171Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.6869175Z 2025-12-04T12:10:20.6869265Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6869531Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6869533Z 2025-12-04T12:10:20.6869635Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6869737Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6869796Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6869868Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6869950Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6870064Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6870157Z graph_break [] 2025-12-04T12:10:20.6870232Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6870323Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.6870381Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6870453Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6870564Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6870644Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6870698Z graph_break [] 2025-12-04T12:10:20.6870772Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6870841Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6870992Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6871055Z Traceback (most recent call last): 2025-12-04T12:10:20.6871238Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6871297Z method(*args, **kwargs) 2025-12-04T12:10:20.6871462Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6871519Z method(*args, **kwargs) 2025-12-04T12:10:20.6871684Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6871739Z with policy(): 2025-12-04T12:10:20.6871908Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6871967Z raise RuntimeError(msg) 2025-12-04T12:10:20.6872360Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.6872377Z 2025-12-04T12:10:20.6872479Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6872744Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6872747Z 2025-12-04T12:10:20.6872849Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6872938Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6872997Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6873069Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6873149Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6873263Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6873316Z graph_break [] 2025-12-04T12:10:20.6873392Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6873482Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6873540Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6873611Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6873726Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6873823Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6873877Z graph_break [] 2025-12-04T12:10:20.6873953Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6874046Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6874104Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6874176Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6874287Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6874371Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6874424Z graph_break [] 2025-12-04T12:10:20.6874500Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:20.6874709Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-544858fe93f35194.xml - 2025-12-04T12:10:20.6874787Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6875373Z FAILED [0.2407s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.6875378Z 2025-12-04T12:10:20.6875468Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6875735Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6875737Z 2025-12-04T12:10:20.6875837Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6875918Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.6876001Z ================== 1 failed, 187 deselected, 2 rerun in 2.10s ================== 2025-12-04T12:10:20.6876055Z Got exit code 1 2025-12-04T12:10:20.6876268Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6876422Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.6876589Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b35ebf7d979786e3.xml 2025-12-04T12:10:20.6876663Z ============================= test session starts ============================== 2025-12-04T12:10:20.6876791Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6876850Z cachedir: .pytest_cache 2025-12-04T12:10:20.6877023Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6877087Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6877143Z configfile: pytest.ini 2025-12-04T12:10:20.6877321Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6877416Z collecting ... collected 188 items / 104 deselected / 84 selected 2025-12-04T12:10:20.6877487Z stepcurrent: skipping 104 already run items. 
2025-12-04T12:10:20.6877548Z Running 84 items in this shard 2025-12-04T12:10:20.6877551Z 2025-12-04T12:10:20.6877779Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [34.1519s] [ 1%] 2025-12-04T12:10:20.6878004Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3749s] [ 1%] 2025-12-04T12:10:20.6878212Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3321s] [ 1%] 2025-12-04T12:10:20.6878214Z 2025-12-04T12:10:20.6878281Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6878436Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6878500Z Traceback (most recent call last): 2025-12-04T12:10:20.6878672Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6878729Z method(*args, **kwargs) 2025-12-04T12:10:20.6878895Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6878953Z method(*args, **kwargs) 2025-12-04T12:10:20.6879118Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6879173Z with policy(): 2025-12-04T12:10:20.6879340Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6879400Z raise RuntimeError(msg) 2025-12-04T12:10:20.6879820Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.6879822Z 2025-12-04T12:10:20.6879912Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6880216Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6880219Z 2025-12-04T12:10:20.6880321Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6880411Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6880469Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6880561Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6880642Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6880767Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6880821Z graph_break [] 2025-12-04T12:10:20.6880899Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6881050Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6881113Z Traceback (most recent call last): 2025-12-04T12:10:20.6881280Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6881337Z method(*args, **kwargs) 2025-12-04T12:10:20.6881502Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6881559Z method(*args, **kwargs) 2025-12-04T12:10:20.6881725Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6881779Z with policy(): 2025-12-04T12:10:20.6881946Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6882003Z raise RuntimeError(msg) 2025-12-04T12:10:20.6882398Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.6882415Z 2025-12-04T12:10:20.6882505Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6882772Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6882775Z 2025-12-04T12:10:20.6882877Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6882966Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6883025Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6883098Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6883178Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6883293Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6883346Z graph_break [] 2025-12-04T12:10:20.6883423Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6883511Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6883570Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6883655Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6883768Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6883849Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6883902Z graph_break [] 2025-12-04T12:10:20.6883977Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6884045Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6884198Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6884260Z Traceback (most recent call last): 2025-12-04T12:10:20.6884427Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6884485Z method(*args, **kwargs) 2025-12-04T12:10:20.6884650Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6884716Z method(*args, **kwargs) 2025-12-04T12:10:20.6884893Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6884947Z with policy(): 2025-12-04T12:10:20.6885116Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6885173Z raise RuntimeError(msg) 2025-12-04T12:10:20.6885571Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.6885574Z 2025-12-04T12:10:20.6885662Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6885932Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6885935Z 2025-12-04T12:10:20.6886037Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6886125Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6886183Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6886254Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6886345Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6886458Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6886512Z graph_break [] 2025-12-04T12:10:20.6886586Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6886676Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6886735Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6886806Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6886919Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6886999Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6887052Z graph_break [] 2025-12-04T12:10:20.6887125Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6887214Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6887274Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6887343Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6887454Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6887532Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6887585Z graph_break [] 2025-12-04T12:10:20.6887670Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6887877Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b35ebf7d979786e3.xml - 2025-12-04T12:10:20.6887952Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6888536Z FAILED [0.3321s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.6888539Z 2025-12-04T12:10:20.6888628Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6888895Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6888907Z 2025-12-04T12:10:20.6889021Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6889099Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.6889184Z ================= 1 failed, 104 deselected, 2 rerun in 34.88s ================== 2025-12-04T12:10:20.6889238Z Got exit code 1 2025-12-04T12:10:20.6889295Z Retrying single test... 
2025-12-04T12:10:20.6889458Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-817c3c856da3c695.xml 2025-12-04T12:10:20.6889532Z ============================= test session starts ============================== 2025-12-04T12:10:20.6889659Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6889719Z cachedir: .pytest_cache 2025-12-04T12:10:20.6889891Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6889955Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6890012Z configfile: pytest.ini 2025-12-04T12:10:20.6890230Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6890321Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.6890609Z stepcurrent: skipping 104 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6890670Z Running 1 items in this shard 2025-12-04T12:10:20.6890672Z 2025-12-04T12:10:20.6890898Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7852s] [100%] 2025-12-04T12:10:20.6891122Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3328s] [100%] 2025-12-04T12:10:20.6891319Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2967s] [100%] 2025-12-04T12:10:20.6891322Z 2025-12-04T12:10:20.6891390Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6891543Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6891606Z Traceback (most recent call last): 2025-12-04T12:10:20.6891777Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6891835Z method(*args, **kwargs) 2025-12-04T12:10:20.6892014Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6892072Z method(*args, **kwargs) 2025-12-04T12:10:20.6892238Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6892292Z with policy(): 2025-12-04T12:10:20.6892458Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6892517Z raise RuntimeError(msg) 
2025-12-04T12:10:20.6892916Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.6892919Z 2025-12-04T12:10:20.6893020Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6893301Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6893303Z 2025-12-04T12:10:20.6893405Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6893496Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6893556Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6893628Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6893708Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6893822Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6893875Z graph_break [] 2025-12-04T12:10:20.6893951Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6894105Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6894169Z Traceback (most recent call last): 2025-12-04T12:10:20.6894337Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6894394Z method(*args, **kwargs) 2025-12-04T12:10:20.6894565Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.6894637Z method(*args, **kwargs) 2025-12-04T12:10:20.6894803Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6894856Z with policy(): 2025-12-04T12:10:20.6895024Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6895081Z raise RuntimeError(msg) 2025-12-04T12:10:20.6895483Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.6895485Z 2025-12-04T12:10:20.6895573Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6895840Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6895843Z 2025-12-04T12:10:20.6895944Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6896035Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6896093Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6896177Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6896258Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6896372Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6896425Z graph_break [] 2025-12-04T12:10:20.6896502Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6896591Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.6896649Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6896720Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6896831Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6896911Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6896964Z graph_break [] 2025-12-04T12:10:20.6897040Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6897122Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6897286Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6897348Z Traceback (most recent call last): 2025-12-04T12:10:20.6897516Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6897573Z method(*args, **kwargs) 2025-12-04T12:10:20.6897740Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6897797Z method(*args, **kwargs) 2025-12-04T12:10:20.6897962Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6898015Z with policy(): 2025-12-04T12:10:20.6898183Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6898241Z raise RuntimeError(msg) 2025-12-04T12:10:20.6898635Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.6898637Z 2025-12-04T12:10:20.6898725Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6899002Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6899004Z 2025-12-04T12:10:20.6899107Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6899197Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6899257Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6899328Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6899411Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6899525Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6899578Z graph_break [] 2025-12-04T12:10:20.6899653Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6899742Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6899800Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6899871Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6899987Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6900068Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6900152Z graph_break [] 2025-12-04T12:10:20.6900245Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6900338Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6900398Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6900468Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6900580Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6900660Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6900716Z graph_break [] 2025-12-04T12:10:20.6900789Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6900997Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-817c3c856da3c695.xml - 2025-12-04T12:10:20.6901073Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6901671Z FAILED [0.2967s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.6901686Z 2025-12-04T12:10:20.6901775Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6902041Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6902043Z 2025-12-04T12:10:20.6902144Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6902221Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.6902308Z ================== 1 failed, 187 deselected, 2 rerun in 2.43s ================== 2025-12-04T12:10:20.6902362Z Got exit code 1 2025-12-04T12:10:20.6902421Z Retrying single test... 
2025-12-04T12:10:20.6902579Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-dfa8998f4866b528.xml 2025-12-04T12:10:20.6902652Z ============================= test session starts ============================== 2025-12-04T12:10:20.6902778Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6902848Z cachedir: .pytest_cache 2025-12-04T12:10:20.6903019Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6903081Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6903138Z configfile: pytest.ini 2025-12-04T12:10:20.6903319Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6903410Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.6903675Z stepcurrent: skipping 104 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6903736Z Running 1 items in this shard 2025-12-04T12:10:20.6903738Z 2025-12-04T12:10:20.6903963Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [33.9509s] [100%] 2025-12-04T12:10:20.6904188Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0616s] [100%] 2025-12-04T12:10:20.6904395Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda FAILED [1.0435s] [100%] 2025-12-04T12:10:20.6904398Z 2025-12-04T12:10:20.6904466Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6904620Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6904686Z Traceback (most recent call last): 2025-12-04T12:10:20.6904857Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6904916Z method(*args, **kwargs) 2025-12-04T12:10:20.6905084Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6905141Z method(*args, **kwargs) 2025-12-04T12:10:20.6905306Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6905370Z with policy(): 2025-12-04T12:10:20.6905538Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6905594Z raise RuntimeError(msg) 
2025-12-04T12:10:20.6906003Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:20.6906006Z 2025-12-04T12:10:20.6906096Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6906364Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6906366Z 2025-12-04T12:10:20.6906467Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6906558Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6906615Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6906688Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6906769Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6906882Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6906934Z graph_break [] 2025-12-04T12:10:20.6907021Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6907174Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6907237Z Traceback (most recent call last): 2025-12-04T12:10:20.6907405Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6907463Z method(*args, **kwargs) 2025-12-04T12:10:20.6907629Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.6907685Z method(*args, **kwargs) 2025-12-04T12:10:20.6907850Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6907903Z with policy(): 2025-12-04T12:10:20.6908070Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6908128Z raise RuntimeError(msg) 2025-12-04T12:10:20.6908521Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:20.6908525Z 2025-12-04T12:10:20.6908624Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6908894Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6908896Z 2025-12-04T12:10:20.6908998Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6909087Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6909146Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6909218Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6909301Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6909412Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6909466Z graph_break [] 2025-12-04T12:10:20.6909540Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6909640Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.6909698Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6909786Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6909899Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6909979Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6910031Z graph_break [] 2025-12-04T12:10:20.6910146Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6910215Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6910368Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6910430Z Traceback (most recent call last): 2025-12-04T12:10:20.6910599Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6910656Z method(*args, **kwargs) 2025-12-04T12:10:20.6910822Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6910877Z method(*args, **kwargs) 2025-12-04T12:10:20.6911042Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6911095Z with policy(): 2025-12-04T12:10:20.6911266Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6911339Z raise RuntimeError(msg) 2025-12-04T12:10:20.6911738Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:20.6911741Z 2025-12-04T12:10:20.6911829Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6912096Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6912098Z 2025-12-04T12:10:20.6912200Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6912289Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6912348Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6912419Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6912500Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6912611Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6912679Z graph_break [] 2025-12-04T12:10:20.6912755Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6912845Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6912902Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6912973Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6913083Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6913163Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6913217Z graph_break [] 2025-12-04T12:10:20.6913291Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6913378Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6913437Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6913506Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6913618Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6913710Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6913764Z graph_break [] 2025-12-04T12:10:20.6913849Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:20.6914055Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-dfa8998f4866b528.xml - 2025-12-04T12:10:20.6914131Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6914715Z FAILED [1.0435s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:20.6914718Z 2025-12-04T12:10:20.6914807Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6915074Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6915076Z 2025-12-04T12:10:20.6915179Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6915256Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.6915353Z ================= 1 failed, 187 deselected, 2 rerun in 36.08s ================== 2025-12-04T12:10:20.6915406Z Got exit code 1 2025-12-04T12:10:20.6915623Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6915767Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.6915927Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ce9f2d489b9baa9b.xml 2025-12-04T12:10:20.6916000Z ============================= test session starts ============================== 2025-12-04T12:10:20.6916129Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6916187Z cachedir: .pytest_cache 2025-12-04T12:10:20.6916362Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6916426Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6916483Z configfile: pytest.ini 2025-12-04T12:10:20.6916659Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6916760Z collecting ... collected 188 items / 105 deselected / 83 selected 2025-12-04T12:10:20.6916833Z stepcurrent: skipping 105 already run items. 
2025-12-04T12:10:20.6916893Z Running 83 items in this shard 2025-12-04T12:10:20.6916895Z 2025-12-04T12:10:20.6917121Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [37.9011s] [ 1%] 2025-12-04T12:10:20.6917340Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1975s] [ 1%] 2025-12-04T12:10:20.6917538Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda PASSED [1.2512s] [ 1%] 2025-12-04T12:10:20.6917758Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.4776s] [ 2%] 2025-12-04T12:10:20.6917987Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.4688s] [ 2%] 2025-12-04T12:10:20.6918195Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda FAILED [1.5225s] [ 2%] 2025-12-04T12:10:20.6918197Z 2025-12-04T12:10:20.6918265Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6918418Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6918481Z Traceback (most recent call last): 2025-12-04T12:10:20.6918653Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6918709Z method(*args, **kwargs) 2025-12-04T12:10:20.6918878Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6918934Z 
method(*args, **kwargs) 2025-12-04T12:10:20.6919100Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6919154Z with policy(): 2025-12-04T12:10:20.6919320Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6919377Z raise RuntimeError(msg) 2025-12-04T12:10:20.6919771Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1017118720. 2025-12-04T12:10:20.6919784Z 2025-12-04T12:10:20.6919873Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6920173Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6920176Z 2025-12-04T12:10:20.6920278Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6920367Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6920425Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6920497Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6920998Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6921130Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6921185Z graph_break [] 2025-12-04T12:10:20.6921261Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:20.6921350Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6921850Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6921919Z current_size = base.storage().size() 2025-12-04T12:10:20.6921978Z Autotune Choices Stats: 2025-12-04T12:10:20.6922376Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:20.6922459Z AUTOTUNE scaled_mm(3x32, 32x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6922524Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6922663Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6922913Z triton_mm_0 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6922975Z _scaled_mm 0.0252 ms 27.3% 2025-12-04T12:10:20.6923118Z SingleProcess AUTOTUNE benchmarking takes 0.0137 seconds and 0.0779 seconds precompiling for 2 choices 2025-12-04T12:10:20.6923273Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6923335Z Traceback (most recent call last): 2025-12-04T12:10:20.6923508Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6923565Z method(*args, **kwargs) 2025-12-04T12:10:20.6923734Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6923790Z method(*args, **kwargs) 2025-12-04T12:10:20.6923969Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6924023Z with policy(): 2025-12-04T12:10:20.6924189Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6924247Z raise RuntimeError(msg) 2025-12-04T12:10:20.6924644Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1017118720 and is now 1124073472. 
2025-12-04T12:10:20.6924647Z 2025-12-04T12:10:20.6924736Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6925002Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:20.6925005Z 2025-12-04T12:10:20.6925107Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6925196Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6925258Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6925330Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6925840Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6925957Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6926012Z graph_break [] 2025-12-04T12:10:20.6926089Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:20.6926176Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6926676Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6926769Z current_size = base.storage().size() 2025-12-04T12:10:20.6926827Z Autotune Choices Stats: 2025-12-04T12:10:20.6927204Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:20.6927276Z AUTOTUNE scaled_mm(3x32, 32x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6927339Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6927478Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6927729Z triton_mm_0 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6927789Z _scaled_mm 0.0252 ms 27.3% 2025-12-04T12:10:20.6927932Z SingleProcess AUTOTUNE benchmarking takes 0.0137 seconds and 0.0779 seconds precompiling for 2 choices 2025-12-04T12:10:20.6928021Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6928080Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6928162Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6928277Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6928702Z inductor [('triton_bundler_save_kernel', 8), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:20.6928757Z graph_break [] 2025-12-04T12:10:20.6928834Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:20.6928923Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6928980Z Autotune Choices Stats: 2025-12-04T12:10:20.6929457Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "_scaled_mm", "best_time": 0.006639999803155661, "best_triton_pos": 1, "best_triton_time": 0.007240000180900097, "best_triton_kernel": "triton_mm_1", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1"} 2025-12-04T12:10:20.6929526Z AUTOTUNE scaled_mm(3x32, 32x16, 3x1, 1x16, 16) 2025-12-04T12:10:20.6929589Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6929737Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6929796Z _scaled_mm 0.0066 ms 100.0% 2025-12-04T12:10:20.6930042Z triton_mm_1 0.0072 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6930220Z SingleProcess AUTOTUNE benchmarking takes 0.0117 seconds and 0.0671 seconds precompiling for 2 choices 2025-12-04T12:10:20.6930377Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6930439Z Traceback (most recent call last): 2025-12-04T12:10:20.6930610Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6930667Z method(*args, **kwargs) 2025-12-04T12:10:20.6930850Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6930907Z method(*args, **kwargs) 2025-12-04T12:10:20.6931088Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6931142Z with policy(): 2025-12-04T12:10:20.6931314Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6931372Z raise RuntimeError(msg) 2025-12-04T12:10:20.6931769Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1073741824 and is now 1117782016. 2025-12-04T12:10:20.6931772Z 2025-12-04T12:10:20.6931863Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6932133Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6932136Z 2025-12-04T12:10:20.6932238Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6932328Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6932387Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6932473Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6932589Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6933082Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), 
('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6933139Z graph_break [] 2025-12-04T12:10:20.6933216Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6933305Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6933362Z Autotune Choices Stats: 2025-12-04T12:10:20.6933744Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_10", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.6933821Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6933885Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6934033Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6934280Z triton_mm_10 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6934521Z triton_mm_9 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6934761Z triton_mm_7 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6934997Z triton_mm_5 
0.0066 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6935255Z triton_mm_6 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6935490Z triton_mm_8 0.0067 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6935550Z _scaled_mm 0.0070 ms 88.6% 2025-12-04T12:10:20.6935787Z triton_mm_3 0.0070 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6936025Z triton_mm_4 0.0072 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6936169Z SingleProcess AUTOTUNE benchmarking takes 0.0535 seconds and 0.3576 seconds precompiling for 9 choices 2025-12-04T12:10:20.6936328Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6936390Z Traceback (most recent call last): 2025-12-04T12:10:20.6936572Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6936630Z method(*args, **kwargs) 2025-12-04T12:10:20.6936796Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:20.6936853Z method(*args, **kwargs) 2025-12-04T12:10:20.6937018Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6937073Z with policy(): 2025-12-04T12:10:20.6937240Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6937298Z raise RuntimeError(msg) 2025-12-04T12:10:20.6937693Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1140850688. 2025-12-04T12:10:20.6937697Z 2025-12-04T12:10:20.6937787Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6938062Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6938076Z 2025-12-04T12:10:20.6938179Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6938269Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6938328Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6938400Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6938515Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6939010Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 
1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6939074Z graph_break [] 2025-12-04T12:10:20.6939151Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6939240Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6939308Z Autotune Choices Stats: 2025-12-04T12:10:20.6939686Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_10", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.6939764Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6939828Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6939965Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6940272Z triton_mm_10 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6940513Z triton_mm_9 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6940752Z triton_mm_7 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6941006Z triton_mm_5 0.0066 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6941248Z triton_mm_6 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6941485Z triton_mm_8 0.0067 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6941544Z _scaled_mm 0.0070 ms 88.6% 2025-12-04T12:10:20.6941781Z triton_mm_3 0.0070 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6942018Z triton_mm_4 0.0072 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6942179Z SingleProcess AUTOTUNE benchmarking takes 0.0535 seconds and 0.3576 seconds precompiling for 9 choices 2025-12-04T12:10:20.6942269Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6942329Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6942401Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6942515Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6943005Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), 
('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6943062Z graph_break [] 2025-12-04T12:10:20.6943138Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6943242Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6943299Z Autotune Choices Stats: 2025-12-04T12:10:20.6943688Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:20.6943766Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6943829Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6943966Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6944212Z triton_mm_12 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6944452Z triton_mm_18 0.0062 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6944689Z triton_mm_17 0.0062 ms 94.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6944939Z triton_mm_15 0.0070 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6945177Z triton_mm_13 0.0072 ms 81.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6945416Z triton_mm_14 0.0076 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6945657Z triton_mm_11 0.0077 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6945895Z triton_mm_16 0.0094 ms 62.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6945953Z _scaled_mm 0.0280 ms 21.0% 2025-12-04T12:10:20.6946095Z SingleProcess AUTOTUNE benchmarking takes 0.0600 seconds and 0.3242 seconds precompiling for 9 choices 2025-12-04T12:10:20.6946183Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6946341Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6946404Z Traceback (most recent call last): 2025-12-04T12:10:20.6946576Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6946633Z method(*args, **kwargs) 2025-12-04T12:10:20.6946802Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in 
wrapper 2025-12-04T12:10:20.6946858Z method(*args, **kwargs) 2025-12-04T12:10:20.6947025Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6947078Z with policy(): 2025-12-04T12:10:20.6947247Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6947315Z raise RuntimeError(msg) 2025-12-04T12:10:20.6947722Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1140850688 and is now 1163919360. 2025-12-04T12:10:20.6947725Z 2025-12-04T12:10:20.6947815Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6948088Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6948090Z 2025-12-04T12:10:20.6948191Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6948282Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6948340Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6948412Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6948527Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6949018Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), 
('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6949086Z graph_break [] 2025-12-04T12:10:20.6949163Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6949252Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6949310Z Autotune Choices Stats: 2025-12-04T12:10:20.6949687Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_10", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:20.6949761Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6949827Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6949963Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6950245Z triton_mm_10 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6950506Z triton_mm_9 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6950746Z triton_mm_7 0.0064 ms 96.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6950982Z triton_mm_5 0.0066 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6951221Z triton_mm_6 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6951458Z triton_mm_8 0.0067 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6951543Z _scaled_mm 0.0070 ms 88.6% 2025-12-04T12:10:20.6951781Z triton_mm_3 0.0070 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6952019Z triton_mm_4 0.0072 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6952163Z SingleProcess AUTOTUNE benchmarking takes 0.0535 seconds and 0.3576 seconds precompiling for 9 choices 2025-12-04T12:10:20.6952253Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6952312Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6952385Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6952500Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6952990Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), 
('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6953056Z graph_break [] 2025-12-04T12:10:20.6953131Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6953219Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6953277Z Autotune Choices Stats: 2025-12-04T12:10:20.6953653Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:20.6953727Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6953792Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6953927Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6954175Z triton_mm_12 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6954426Z triton_mm_18 0.0062 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6954669Z triton_mm_17 0.0062 ms 94.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6954905Z triton_mm_15 0.0070 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6955143Z triton_mm_13 0.0072 ms 81.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6955382Z triton_mm_14 0.0076 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6955641Z triton_mm_11 0.0077 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6955878Z triton_mm_16 0.0094 ms 62.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6955937Z _scaled_mm 0.0280 ms 21.0% 2025-12-04T12:10:20.6956081Z SingleProcess AUTOTUNE benchmarking takes 0.0600 seconds and 0.3242 seconds precompiling for 9 choices 2025-12-04T12:10:20.6956169Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6956227Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6956301Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6956419Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6956912Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), 
('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6956986Z graph_break [] 2025-12-04T12:10:20.6957065Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6957154Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6957211Z Autotune Choices Stats: 2025-12-04T12:10:20.6957586Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_23", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:20.6957661Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6957724Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6957860Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6958104Z triton_mm_23 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6958352Z triton_mm_25 0.0071 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6958595Z triton_mm_22 0.0073 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6958834Z triton_mm_20 0.0074 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6959074Z triton_mm_21 0.0074 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6959313Z triton_mm_24 0.0076 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6959577Z triton_mm_19 0.0079 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6959815Z triton_mm_26 0.0080 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6959874Z _scaled_mm 0.0256 ms 27.5% 2025-12-04T12:10:20.6960016Z SingleProcess AUTOTUNE benchmarking takes 0.0585 seconds and 0.3398 seconds precompiling for 9 choices 2025-12-04T12:10:20.6960254Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ce9f2d489b9baa9b.xml - 2025-12-04T12:10:20.6960331Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6960922Z FAILED [1.5225s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1140850688 and is now 1163919360. 2025-12-04T12:10:20.6960925Z 2025-12-04T12:10:20.6961014Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6961300Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6961304Z 2025-12-04T12:10:20.6961406Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6961487Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.6961577Z ============ 1 failed, 1 passed, 105 deselected, 4 rerun in 44.84s ============= 2025-12-04T12:10:20.6961634Z Got exit code 1 2025-12-04T12:10:20.6961690Z Retrying single test... 2025-12-04T12:10:20.6961851Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-bb8a35a6acbc624f.xml 2025-12-04T12:10:20.6961923Z ============================= test session starts ============================== 2025-12-04T12:10:20.6962053Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6962109Z cachedir: .pytest_cache 2025-12-04T12:10:20.6962285Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.6962347Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.6962404Z configfile: pytest.ini 2025-12-04T12:10:20.6962597Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.6962690Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.6962955Z stepcurrent: skipping 106 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6963014Z Running 1 items in this shard 2025-12-04T12:10:20.6963016Z 2025-12-04T12:10:20.6963241Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [15.1097s] [100%] 2025-12-04T12:10:20.6963464Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8787s] [100%] 2025-12-04T12:10:20.6963664Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.8368s] [100%] 2025-12-04T12:10:20.6963681Z 2025-12-04T12:10:20.6963762Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.6963917Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6963980Z Traceback (most recent call last): 2025-12-04T12:10:20.6964153Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6964211Z method(*args, **kwargs) 2025-12-04T12:10:20.6964383Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6964440Z method(*args, **kwargs) 2025-12-04T12:10:20.6964696Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6964752Z with policy(): 2025-12-04T12:10:20.6964920Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6964978Z raise RuntimeError(msg) 
2025-12-04T12:10:20.6965383Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:20.6965398Z 2025-12-04T12:10:20.6965489Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6965758Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6965761Z 2025-12-04T12:10:20.6965864Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6965958Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6966018Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6966090Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6966586Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6966700Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6966754Z graph_break [] 2025-12-04T12:10:20.6966830Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6966931Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6967431Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6967495Z current_size = base.storage().size() 2025-12-04T12:10:20.6967554Z Autotune Choices Stats: 2025-12-04T12:10:20.6967933Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.6968019Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6968083Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6968232Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6968479Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6968723Z triton_mm_3 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6968965Z triton_mm_6 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6969203Z triton_mm_7 0.0074 ms 84.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6969441Z triton_mm_0 0.0075 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6969692Z triton_mm_4 0.0077 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6969930Z triton_mm_2 0.0078 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6970199Z triton_mm_5 0.0078 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6970257Z _scaled_mm 0.0269 ms 23.2% 2025-12-04T12:10:20.6970400Z SingleProcess AUTOTUNE benchmarking takes 0.0402 seconds and 0.2023 seconds precompiling for 9 choices 2025-12-04T12:10:20.6970555Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6970620Z Traceback (most recent call last): 2025-12-04T12:10:20.6970789Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6970849Z method(*args, **kwargs) 2025-12-04T12:10:20.6971015Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6971087Z 
method(*args, **kwargs) 2025-12-04T12:10:20.6971254Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6971309Z with policy(): 2025-12-04T12:10:20.6971476Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6971534Z raise RuntimeError(msg) 2025-12-04T12:10:20.6971935Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:20.6971938Z 2025-12-04T12:10:20.6972029Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6972299Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6972313Z 2025-12-04T12:10:20.6972433Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6972523Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6972582Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6972655Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6973151Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6973268Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6973321Z graph_break [] 2025-12-04T12:10:20.6973400Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6973489Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6973989Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6974067Z current_size = base.storage().size() 2025-12-04T12:10:20.6974123Z Autotune Choices Stats: 2025-12-04T12:10:20.6974503Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.6974579Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6974643Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6974779Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6975031Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6975271Z triton_mm_3 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=4 2025-12-04T12:10:20.6975523Z triton_mm_6 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6975762Z triton_mm_7 0.0074 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6976000Z triton_mm_0 0.0075 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6976241Z triton_mm_4 0.0077 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6976503Z triton_mm_2 0.0078 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6976744Z triton_mm_5 0.0078 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6976802Z _scaled_mm 0.0269 ms 23.2% 2025-12-04T12:10:20.6976947Z SingleProcess AUTOTUNE benchmarking takes 0.0402 seconds and 0.2023 seconds precompiling for 9 choices 2025-12-04T12:10:20.6977037Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6977095Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6977168Z stats [('calls_captured', 1), ('unique_graphs', 1)] 
2025-12-04T12:10:20.6977284Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6977778Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6977831Z graph_break [] 2025-12-04T12:10:20.6977918Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6978006Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6978064Z Autotune Choices Stats: 2025-12-04T12:10:20.6978436Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006599999964237213, "best_triton_pos": 0} 2025-12-04T12:10:20.6978512Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6978576Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6978711Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6978955Z triton_mm_9 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6979198Z triton_mm_12 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6979450Z triton_mm_13 0.0067 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6979690Z triton_mm_10 0.0068 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6979928Z triton_mm_14 0.0068 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6979986Z _scaled_mm 0.0070 ms 94.8% 2025-12-04T12:10:20.6980262Z triton_mm_15 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6980530Z triton_mm_11 0.0071 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6980768Z triton_mm_8 0.0074 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6980911Z SingleProcess AUTOTUNE benchmarking takes 0.0375 seconds and 0.1339 seconds precompiling for 9 choices 2025-12-04T12:10:20.6980981Z =================================== FAILURES =================================== 2025-12-04T12:10:20.6981137Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.6981199Z Traceback (most recent call last): 2025-12-04T12:10:20.6981372Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6981429Z method(*args, **kwargs) 2025-12-04T12:10:20.6981598Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.6981654Z method(*args, **kwargs) 2025-12-04T12:10:20.6981823Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.6981877Z with policy(): 2025-12-04T12:10:20.6982060Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.6982117Z raise RuntimeError(msg) 2025-12-04T12:10:20.6982518Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.6982522Z 2025-12-04T12:10:20.6982612Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6982881Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6982884Z 2025-12-04T12:10:20.6982986Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6983076Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6983135Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6983207Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6983714Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6983829Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6983883Z graph_break [] 2025-12-04T12:10:20.6983959Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6984048Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6984549Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.6984624Z current_size = base.storage().size() 2025-12-04T12:10:20.6984681Z Autotune Choices Stats: 2025-12-04T12:10:20.6985069Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:20.6985145Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6985210Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6985345Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6985590Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6985832Z triton_mm_3 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6986071Z triton_mm_6 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6986307Z triton_mm_7 0.0074 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6986556Z triton_mm_0 0.0075 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6986794Z triton_mm_4 0.0077 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6987032Z triton_mm_2 0.0078 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6987268Z triton_mm_5 0.0078 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6987326Z _scaled_mm 0.0269 ms 23.2% 2025-12-04T12:10:20.6987470Z SingleProcess AUTOTUNE benchmarking takes 0.0402 seconds and 0.2023 seconds precompiling for 9 choices 2025-12-04T12:10:20.6987574Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6987635Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6987707Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6987823Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6988312Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6988368Z graph_break [] 2025-12-04T12:10:20.6988444Z 
aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6988535Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6988611Z Autotune Choices Stats: 2025-12-04T12:10:20.6988997Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006599999964237213, "best_triton_pos": 0} 2025-12-04T12:10:20.6989072Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6989136Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6989271Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6989515Z triton_mm_9 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6989758Z triton_mm_12 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6989996Z triton_mm_13 0.0067 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6990269Z triton_mm_10 0.0068 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6990522Z triton_mm_14 0.0068 ms 97.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6990579Z _scaled_mm 0.0070 ms 94.8% 2025-12-04T12:10:20.6990819Z triton_mm_15 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6991056Z triton_mm_11 0.0071 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6991295Z triton_mm_8 0.0074 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6991436Z SingleProcess AUTOTUNE benchmarking takes 0.0375 seconds and 0.1339 seconds precompiling for 9 choices 2025-12-04T12:10:20.6991526Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.6991600Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.6991675Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.6991790Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.6992281Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.6992336Z graph_break [] 
2025-12-04T12:10:20.6992412Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.6992502Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.6992559Z Autotune Choices Stats: 2025-12-04T12:10:20.6995324Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:20.6995409Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.6995474Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.6995609Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.6995862Z triton_mm_16 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6996099Z triton_mm_21 0.0062 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6996338Z triton_mm_18 0.0065 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6996577Z triton_mm_20 0.0067 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6996825Z triton_mm_17 0.0067 ms 89.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.6997065Z triton_mm_23 0.0068 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.6997304Z triton_mm_19 0.0068 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.6997540Z triton_mm_22 0.0094 ms 63.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.6997599Z _scaled_mm 0.0279 ms 21.5% 2025-12-04T12:10:20.6997740Z SingleProcess AUTOTUNE benchmarking takes 0.0753 seconds and 0.2314 seconds precompiling for 9 choices 2025-12-04T12:10:20.6997945Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-bb8a35a6acbc624f.xml - 2025-12-04T12:10:20.6998032Z =========================== short test summary info ============================ 2025-12-04T12:10:20.6998622Z FAILED [0.8368s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.6998626Z 2025-12-04T12:10:20.6998714Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.6998983Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.6998985Z 2025-12-04T12:10:20.6999088Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.6999176Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.6999271Z ================= 1 failed, 187 deselected, 2 rerun in 16.85s ================== 2025-12-04T12:10:20.6999327Z Got exit code 1 2025-12-04T12:10:20.6999385Z Retrying single test... 2025-12-04T12:10:20.6999542Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6c0360169ef96203.xml 2025-12-04T12:10:20.6999615Z ============================= test session starts ============================== 2025-12-04T12:10:20.6999742Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.6999799Z cachedir: .pytest_cache 2025-12-04T12:10:20.6999973Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.7000036Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.7000128Z configfile: pytest.ini 2025-12-04T12:10:20.7000311Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.7000403Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.7000668Z stepcurrent: skipping 106 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.7000726Z Running 1 items in this shard 2025-12-04T12:10:20.7000740Z 2025-12-04T12:10:20.7000967Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.4238s] [100%] 2025-12-04T12:10:20.7001187Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8306s] [100%] 2025-12-04T12:10:20.7001388Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.8663s] [100%] 2025-12-04T12:10:20.7001390Z 2025-12-04T12:10:20.7001461Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.7001615Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.7001679Z Traceback (most recent call last): 2025-12-04T12:10:20.7001853Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7001913Z method(*args, **kwargs) 2025-12-04T12:10:20.7002080Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7002137Z method(*args, **kwargs) 2025-12-04T12:10:20.7002315Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.7002374Z with policy(): 2025-12-04T12:10:20.7002542Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.7002601Z raise RuntimeError(msg) 
2025-12-04T12:10:20.7002998Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1031798784. 2025-12-04T12:10:20.7003003Z 2025-12-04T12:10:20.7003091Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.7003361Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.7003375Z 2025-12-04T12:10:20.7003477Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.7003580Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7003639Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7003713Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7004208Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7004324Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7004377Z graph_break [] 2025-12-04T12:10:20.7004456Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.7004545Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7005043Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.7005121Z current_size = base.storage().size() 2025-12-04T12:10:20.7005178Z Autotune Choices Stats: 2025-12-04T12:10:20.7005564Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:20.7005638Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.7005704Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.7005841Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.7006088Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7006327Z triton_mm_7 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.7006581Z triton_mm_6 0.0068 ms 88.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7006821Z triton_mm_5 0.0069 ms 87.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7007058Z triton_mm_3 0.0075 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7007297Z triton_mm_0 0.0078 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7007534Z triton_mm_2 0.0078 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7007790Z triton_mm_1 0.0079 ms 76.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7007848Z _scaled_mm 0.0246 ms 24.6% 2025-12-04T12:10:20.7007990Z SingleProcess AUTOTUNE benchmarking takes 0.0585 seconds and 0.2689 seconds precompiling for 9 choices 2025-12-04T12:10:20.7008145Z _ TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.7008207Z Traceback (most recent call last): 2025-12-04T12:10:20.7008378Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7008436Z method(*args, **kwargs) 2025-12-04T12:10:20.7008604Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7008661Z 
method(*args, **kwargs) 2025-12-04T12:10:20.7008829Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.7008882Z with policy(): 2025-12-04T12:10:20.7009049Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.7009106Z raise RuntimeError(msg) 2025-12-04T12:10:20.7009517Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1031798784 and is now 1075838976. 2025-12-04T12:10:20.7009520Z 2025-12-04T12:10:20.7009609Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.7009882Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.7009884Z 2025-12-04T12:10:20.7009988Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.7010076Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7010175Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7010252Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7010761Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7010878Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7010932Z graph_break [] 2025-12-04T12:10:20.7011009Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.7011098Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7011590Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.7011657Z current_size = base.storage().size() 2025-12-04T12:10:20.7011715Z Autotune Choices Stats: 2025-12-04T12:10:20.7012105Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:20.7012192Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.7012257Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.7012395Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.7012641Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7012881Z triton_mm_7 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:20.7013121Z triton_mm_6 0.0068 ms 88.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7013357Z triton_mm_5 0.0069 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7013607Z triton_mm_3 0.0075 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7013845Z triton_mm_0 0.0078 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7014086Z triton_mm_2 0.0078 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7014322Z triton_mm_1 0.0079 ms 76.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7014382Z _scaled_mm 0.0246 ms 24.6% 2025-12-04T12:10:20.7014523Z SingleProcess AUTOTUNE benchmarking takes 0.0585 seconds and 0.2689 seconds precompiling for 9 choices 2025-12-04T12:10:20.7014612Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7014670Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7014743Z stats [('calls_captured', 1), ('unique_graphs', 1)] 
2025-12-04T12:10:20.7014869Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7015364Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7015419Z graph_break [] 2025-12-04T12:10:20.7015494Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.7015584Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7015641Z Autotune Choices Stats: 2025-12-04T12:10:20.7016020Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:20.7016115Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.7016180Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.7016316Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.7016566Z triton_mm_12 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7016806Z triton_mm_9 0.0063 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7017045Z triton_mm_13 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7017281Z triton_mm_15 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.7017517Z triton_mm_14 0.0067 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7017768Z triton_mm_11 0.0082 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7018010Z triton_mm_10 0.0093 ms 64.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7018247Z triton_mm_8 0.0099 ms 60.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7018305Z _scaled_mm 0.0245 ms 24.5% 2025-12-04T12:10:20.7018449Z SingleProcess AUTOTUNE benchmarking takes 0.0787 seconds and 0.2704 seconds precompiling for 9 choices 2025-12-04T12:10:20.7018519Z =================================== FAILURES =================================== 2025-12-04T12:10:20.7018674Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.7018736Z Traceback (most recent call last): 2025-12-04T12:10:20.7018917Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7018976Z method(*args, **kwargs) 2025-12-04T12:10:20.7019144Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7019201Z method(*args, **kwargs) 2025-12-04T12:10:20.7019366Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.7019424Z with policy(): 2025-12-04T12:10:20.7019591Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.7019650Z raise RuntimeError(msg) 2025-12-04T12:10:20.7020048Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.7020064Z 2025-12-04T12:10:20.7020200Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.7020471Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.7020473Z 2025-12-04T12:10:20.7020576Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.7020665Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7020723Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7020798Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7021295Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7021410Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7021463Z graph_break [] 2025-12-04T12:10:20.7021540Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.7021648Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7022148Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.7022214Z current_size = base.storage().size() 2025-12-04T12:10:20.7022270Z Autotune Choices Stats: 2025-12-04T12:10:20.7022652Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:20.7022727Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.7022792Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.7022926Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.7023185Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7023426Z triton_mm_7 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.7023667Z triton_mm_6 0.0068 ms 88.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7023905Z triton_mm_5 0.0069 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7024143Z triton_mm_3 0.0075 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7024411Z triton_mm_0 0.0078 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7024649Z triton_mm_2 0.0078 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7024888Z triton_mm_1 0.0079 ms 76.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7024947Z _scaled_mm 0.0246 ms 24.6% 2025-12-04T12:10:20.7025088Z SingleProcess AUTOTUNE benchmarking takes 0.0585 seconds and 0.2689 seconds precompiling for 9 choices 2025-12-04T12:10:20.7025179Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7025238Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7025311Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7025425Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7025916Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7025979Z graph_break [] 2025-12-04T12:10:20.7026056Z 
aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.7026145Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7026203Z Autotune Choices Stats: 2025-12-04T12:10:20.7026578Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:20.7026653Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.7026720Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.7026854Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.7027101Z triton_mm_12 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7027352Z triton_mm_9 0.0063 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7027590Z triton_mm_13 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7027827Z triton_mm_15 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.7028065Z triton_mm_14 0.0067 ms 89.3% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7028324Z triton_mm_11 0.0082 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7028560Z triton_mm_10 0.0093 ms 64.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7028799Z triton_mm_8 0.0099 ms 60.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7028856Z _scaled_mm 0.0245 ms 24.5% 2025-12-04T12:10:20.7028999Z SingleProcess AUTOTUNE benchmarking takes 0.0787 seconds and 0.2704 seconds precompiling for 9 choices 2025-12-04T12:10:20.7029088Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7029148Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7029219Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7029335Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7029821Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7029886Z graph_break [] 
2025-12-04T12:10:20.7029964Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:20.7030053Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7030142Z Autotune Choices Stats: 2025-12-04T12:10:20.7030522Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_19", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:20.7030596Z AUTOTUNE scaled_mm(3x32, 32x2048, 3x1, 1x2048, 2048) 2025-12-04T12:10:20.7030659Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:20.7030796Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:20.7031041Z triton_mm_19 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7031292Z triton_mm_21 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7031533Z triton_mm_22 0.0068 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7031768Z triton_mm_23 0.0070 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:20.7032006Z triton_mm_18 0.0079 ms 77.8% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7032244Z triton_mm_16 0.0091 ms 67.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7032512Z triton_mm_17 0.0096 ms 64.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:20.7032751Z triton_mm_20 0.0098 ms 62.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7032808Z _scaled_mm 0.0273 ms 22.5% 2025-12-04T12:10:20.7032950Z SingleProcess AUTOTUNE benchmarking takes 0.0691 seconds and 0.2520 seconds precompiling for 9 choices 2025-12-04T12:10:20.7033153Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6c0360169ef96203.xml - 2025-12-04T12:10:20.7033230Z =========================== short test summary info ============================ 2025-12-04T12:10:20.7033816Z FAILED [0.8663s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1119879168. 
2025-12-04T12:10:20.7033834Z 2025-12-04T12:10:20.7033923Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.7034191Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.7034195Z 2025-12-04T12:10:20.7034297Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.7034377Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.7034462Z ================== 1 failed, 187 deselected, 2 rerun in 4.14s ================== 2025-12-04T12:10:20.7034519Z Got exit code 1 2025-12-04T12:10:20.7034736Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:20.7034879Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.7035036Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-735b825c75091166.xml 2025-12-04T12:10:20.7035110Z ============================= test session starts ============================== 2025-12-04T12:10:20.7035246Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.7035306Z cachedir: .pytest_cache 2025-12-04T12:10:20.7035478Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.7035541Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.7035598Z configfile: pytest.ini 2025-12-04T12:10:20.7035777Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.7035872Z collecting ... 
collected 188 items / 107 deselected / 81 selected 2025-12-04T12:10:20.7035942Z stepcurrent: skipping 107 already run items. 2025-12-04T12:10:20.7036003Z Running 81 items in this shard 2025-12-04T12:10:20.7036005Z 2025-12-04T12:10:20.7036963Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmp3b1x28sb/bu/cbuuzm2viaiv5nmviavrp5nmecwrl3yqy5smpjhgakd34iwlkbed.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7037144Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7037379Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7037554Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7037860Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7038011Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7038283Z E1204 11:13:01.080000 811051 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7038448Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7038718Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7038890Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7039174Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7039322Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7039614Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7039833Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7040191Z E1204 11:13:01.080000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7040943Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmp3b1x28sb/3k/c3kxod6ilq6ubrfj5fnkjkxkkslxqgcec42vickndloclrpjosf5.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7041106Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7041360Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7041531Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7041831Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7041978Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7042250Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7042404Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7042671Z E1204 11:13:01.129000 811051 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7042843Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7043142Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7043289Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7043581Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7043788Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7044118Z E1204 11:13:01.129000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7044878Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmp3b1x28sb/vf/cvfmuzex7rv6wbxqxrkwmeaquhcmau3tgmikjjnugpfnzr5bx6go.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7045040Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7045268Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7045445Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7045746Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7045910Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7046179Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7046330Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7046599Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7046769Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7047051Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7047200Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7047488Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7047703Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7048031Z E1204 11:13:01.142000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7048775Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmp3b1x28sb/yh/cyhpxsxflgxt6nqs6ahc5soxw3ltnrmywhxypn65iyovg6yp3ont.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7048938Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7049175Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7049347Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7049648Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7049793Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7050062Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7050262Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7050554Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7050722Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7051004Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7051152Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7051441Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7051649Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7051976Z E1204 11:13:01.176000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7052728Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmp3b1x28sb/qc/cqc4eyd6o7wr33nr3us7ayg4boddpxlgd6t3ag24ccquyxuhfe3u.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7052890Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7053116Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7053285Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7053583Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7053741Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7054011Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7054161Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7054429Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7054598Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7054885Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7055059Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7055347Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7055554Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7055881Z E1204 11:13:01.182000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7056625Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmp3b1x28sb/o7/co7zkv32udemyplpnvfenp6mnv7sb5a5mzqj23arfu6glbalhm2t.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7056797Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7057025Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7057195Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7057494Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7057639Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7057906Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7058056Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7058333Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7058505Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7058786Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7058934Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7059224Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7059430Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7059775Z E1204 11:13:01.191000 811051 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7059844Z ('RERUN', {'yellow': True}) [3.9400s] [ 1%] 2025-12-04T12:10:20.7060233Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda E1204 11:13:03.153000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7060545Z E1204 11:13:03.153000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7060687Z E1204 11:13:03.153000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.7060847Z E1204 11:13:03.157000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7061153Z E1204 11:13:03.157000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7061307Z E1204 11:13:03.157000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7061464Z E1204 11:13:03.159000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7061773Z E1204 11:13:03.159000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7061915Z E1204 11:13:03.159000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7062072Z E1204 11:13:03.217000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7062378Z E1204 11:13:03.217000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7062518Z E1204 11:13:03.217000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.7062676Z E1204 11:13:03.219000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7062997Z E1204 11:13:03.219000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7063139Z E1204 11:13:03.219000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7063295Z E1204 11:13:03.221000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7063602Z E1204 11:13:03.221000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7063743Z E1204 11:13:03.221000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7063820Z ('RERUN', {'yellow': True}) [1.8282s] [ 1%] 2025-12-04T12:10:20.7064186Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda E1204 11:13:04.782000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7064492Z E1204 11:13:04.782000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7064634Z E1204 11:13:04.782000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.7064790Z E1204 11:13:04.784000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7065099Z E1204 11:13:04.784000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7065241Z E1204 11:13:04.784000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7065398Z E1204 11:13:04.787000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7065705Z E1204 11:13:04.787000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7065853Z E1204 11:13:04.787000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7066011Z E1204 11:13:04.824000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7066318Z E1204 11:13:04.824000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7066458Z E1204 11:13:04.824000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.7066614Z E1204 11:13:04.826000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7066922Z E1204 11:13:04.826000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7067067Z E1204 11:13:04.826000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7068705Z E1204 11:13:04.828000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7069016Z E1204 11:13:04.828000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.7069156Z E1204 11:13:04.828000 811051 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.7069215Z FAILED [1.5949s] [ 1%] 2025-12-04T12:10:20.7069217Z 2025-12-04T12:10:20.7069289Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.7069466Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.7069530Z Traceback (most recent call last): 2025-12-04T12:10:20.7069716Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7069774Z method(*args, **kwargs) 2025-12-04T12:10:20.7069952Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7070009Z method(*args, **kwargs) 2025-12-04T12:10:20.7070214Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.7070269Z with policy(): 2025-12-04T12:10:20.7070438Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.7070496Z raise RuntimeError(msg) 2025-12-04T12:10:20.7070931Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1954545664. 
2025-12-04T12:10:20.7070935Z 2025-12-04T12:10:20.7071030Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.7071320Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.7071323Z 2025-12-04T12:10:20.7071448Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.7071540Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7071602Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7071677Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7072253Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7072370Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7072425Z graph_break [] 2025-12-04T12:10:20.7072509Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.7072599Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7073116Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.7073182Z current_size = base.storage().size() 2025-12-04T12:10:20.7073243Z Autotune Choices Stats: 2025-12-04T12:10:20.7073636Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008919999934732914, "best_triton_pos": 0} 2025-12-04T12:10:20.7073717Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.7073782Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.7073899Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.7074157Z triton_mm_33 0.0089 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7074434Z triton_mm_34 0.0091 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7074676Z triton_mm_29 0.0105 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7074917Z triton_mm_16 0.0106 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7075157Z triton_mm_22 0.0110 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7075399Z triton_mm_30 0.0113 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7075640Z triton_mm_21 0.0119 ms 74.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7075894Z triton_mm_23 0.0120 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7076137Z triton_mm_15 0.0122 ms 73.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7076382Z triton_mm_31 0.0126 ms 70.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7076528Z SingleProcess AUTOTUNE benchmarking takes 0.1739 seconds and 1.3458 seconds precompiling for 33 choices 2025-12-04T12:10:20.7076702Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.7076766Z Traceback (most recent call last): 2025-12-04T12:10:20.7076939Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7076999Z method(*args, 
**kwargs) 2025-12-04T12:10:20.7077175Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7077232Z method(*args, **kwargs) 2025-12-04T12:10:20.7077397Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.7077451Z with policy(): 2025-12-04T12:10:20.7077617Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.7077677Z raise RuntimeError(msg) 2025-12-04T12:10:20.7078104Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:20.7078107Z 2025-12-04T12:10:20.7078198Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.7078508Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.7078510Z 2025-12-04T12:10:20.7078614Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.7078703Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7078765Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7078837Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7079403Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 
6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7079520Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7079573Z graph_break [] 2025-12-04T12:10:20.7079654Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.7079743Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7080279Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.7080358Z current_size = base.storage().size() 2025-12-04T12:10:20.7080418Z Autotune Choices Stats: 2025-12-04T12:10:20.7080805Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008919999934732914, "best_triton_pos": 0} 2025-12-04T12:10:20.7080884Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.7080948Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.7081064Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.7081317Z triton_mm_33 0.0089 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7081573Z triton_mm_34 0.0091 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7081819Z triton_mm_29 0.0105 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7082061Z triton_mm_16 0.0106 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7082303Z triton_mm_22 0.0110 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7082545Z triton_mm_30 0.0113 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7082810Z triton_mm_21 0.0119 ms 74.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7083054Z triton_mm_23 0.0120 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7083299Z triton_mm_15 0.0122 ms 73.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7083544Z triton_mm_31 0.0126 ms 70.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7083693Z SingleProcess AUTOTUNE benchmarking takes 0.1739 seconds and 1.3458 seconds precompiling for 33 choices 2025-12-04T12:10:20.7083783Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7083842Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7083914Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7084039Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7084542Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7084597Z graph_break [] 2025-12-04T12:10:20.7084678Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.7084768Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7084825Z Autotune Choices Stats: 2025-12-04T12:10:20.7085205Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00863999966531992, "best_triton_pos": 0} 
2025-12-04T12:10:20.7085281Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.7085346Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.7085471Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.7085722Z triton_mm_72 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7085965Z triton_mm_71 0.0088 ms 98.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7086205Z triton_mm_67 0.0108 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7086447Z triton_mm_60 0.0115 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7086708Z triton_mm_68 0.0115 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7086950Z triton_mm_59 0.0117 ms 73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7087192Z triton_mm_54 0.0118 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7087435Z triton_mm_61 0.0120 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7087678Z triton_mm_69 0.0121 ms 71.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7087920Z triton_mm_53 0.0124 ms 69.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7088082Z SingleProcess AUTOTUNE benchmarking takes 0.2619 seconds and 0.8163 seconds precompiling for 39 choices 2025-12-04T12:10:20.7088153Z =================================== FAILURES =================================== 2025-12-04T12:10:20.7088327Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.7088391Z Traceback (most recent call last): 2025-12-04T12:10:20.7088561Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7088620Z method(*args, **kwargs) 2025-12-04T12:10:20.7088786Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.7088844Z method(*args, **kwargs) 2025-12-04T12:10:20.7089008Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.7089063Z with policy(): 2025-12-04T12:10:20.7089230Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.7089289Z raise RuntimeError(msg) 2025-12-04T12:10:20.7089721Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:20.7089725Z 2025-12-04T12:10:20.7089817Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.7090152Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.7090157Z 2025-12-04T12:10:20.7090259Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.7090350Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7090408Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7090481Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7091067Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7091193Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7091247Z graph_break [] 2025-12-04T12:10:20.7091327Z aten_mm_info 
[('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.7091416Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7091914Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.7091980Z current_size = base.storage().size() 2025-12-04T12:10:20.7092037Z Autotune Choices Stats: 2025-12-04T12:10:20.7092420Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008919999934732914, "best_triton_pos": 0} 2025-12-04T12:10:20.7092516Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.7092581Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.7092698Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.7092951Z triton_mm_33 0.0089 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7093197Z triton_mm_34 0.0091 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7093440Z triton_mm_29 0.0105 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7093681Z triton_mm_16 0.0106 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7093930Z triton_mm_22 0.0110 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7094173Z triton_mm_30 0.0113 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7094412Z triton_mm_21 0.0119 ms 74.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7094656Z triton_mm_23 0.0120 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7094912Z triton_mm_15 0.0122 ms 73.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7095162Z triton_mm_31 0.0126 ms 70.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7095308Z SingleProcess AUTOTUNE benchmarking takes 0.1739 
seconds and 1.3458 seconds precompiling for 33 choices 2025-12-04T12:10:20.7095398Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7095457Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7095530Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7095645Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7096143Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7096198Z graph_break [] 2025-12-04T12:10:20.7096278Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.7096377Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7096434Z Autotune Choices Stats: 2025-12-04T12:10:20.7096816Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00863999966531992, "best_triton_pos": 0} 2025-12-04T12:10:20.7096893Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.7096957Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.7097075Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.7097324Z triton_mm_72 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7097569Z triton_mm_71 0.0088 ms 98.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7097828Z triton_mm_67 0.0108 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7098071Z triton_mm_60 0.0115 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7098311Z triton_mm_68 0.0115 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7098551Z triton_mm_59 0.0117 ms 73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7098792Z triton_mm_54 0.0118 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7099055Z triton_mm_61 0.0120 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7099299Z triton_mm_69 0.0121 ms 71.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7099545Z triton_mm_53 0.0124 ms 69.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7099688Z SingleProcess AUTOTUNE benchmarking takes 0.2619 seconds and 0.8163 seconds precompiling for 39 choices 2025-12-04T12:10:20.7099780Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.7099838Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.7099911Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.7100026Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.7100558Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.7100626Z graph_break [] 2025-12-04T12:10:20.7100707Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.7100795Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.7100855Z Autotune Choices Stats: 2025-12-04T12:10:20.7101236Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_110", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", 
"best_time": 0.00851999968290329, "best_triton_pos": 0} 2025-12-04T12:10:20.7101313Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.7101378Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.7101493Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.7101748Z triton_mm_110 0.0085 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7102008Z triton_mm_109 0.0092 ms 93.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7102069Z _scaled_mm 0.0094 ms 90.6% 2025-12-04T12:10:20.7102312Z triton_mm_105 0.0107 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7102555Z triton_mm_92 0.0108 ms 78.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7102794Z triton_mm_98 0.0112 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7103070Z triton_mm_97 0.0112 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7103315Z triton_mm_106 0.0114 ms 75.0% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.7103557Z triton_mm_99 0.0116 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7103800Z triton_mm_91 0.0120 ms 71.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.7103945Z SingleProcess AUTOTUNE benchmarking takes 0.2369 seconds and 0.6486 seconds precompiling for 39 choices 2025-12-04T12:10:20.7104153Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-735b825c75091166.xml - 2025-12-04T12:10:20.7104229Z =========================== short test summary info ============================ 2025-12-04T12:10:20.7104868Z FAILED [1.5949s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 
2025-12-04T12:10:20.7104886Z 2025-12-04T12:10:20.7104978Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.7105265Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.7105267Z 2025-12-04T12:10:20.7105371Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.7105449Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.7105535Z ================== 1 failed, 107 deselected, 2 rerun in 7.38s ================== 2025-12-04T12:10:20.7105589Z Got exit code 1 2025-12-04T12:10:20.7105646Z Retrying single test... 2025-12-04T12:10:20.7105805Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3246e62b044bc4be.xml 2025-12-04T12:10:20.7105879Z ============================= test session starts ============================== 2025-12-04T12:10:20.7106019Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.7106077Z cachedir: .pytest_cache 2025-12-04T12:10:20.7106254Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.7106316Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.7106374Z configfile: pytest.ini 2025-12-04T12:10:20.7106554Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.7106648Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.7106933Z stepcurrent: skipping 107 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.7106994Z Running 1 items in this shard 2025-12-04T12:10:20.7107007Z 2025-12-04T12:10:20.7107384Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:13:16.568880304 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7107387Z 2025-12-04T12:10:20.7107718Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7108033Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7108182Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7108683Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7108951Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7109202Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone 
.cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7109428Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7109644Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7109889Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7110157Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7110400Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7110646Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7110888Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7111121Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7111365Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 
2025-12-04T12:10:20.7111599Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7111870Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7112103Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7112343Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7112579Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7112824Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7113057Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7113264Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7113510Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7113754Z E1204 
11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7113990Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7114196Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7114429Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7114669Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7114911Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7115153Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7115385Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7115602Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7115829Z E1204 11:13:23.738000 816968 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7116005Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7116219Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7116765Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpci9a37e9/vf/cvfmuzex7rv6wbxqxrkwmeaquhcmau3tgmikjjnugpfnzr5bx6go.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7116927Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7117157Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7117330Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7117631Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7117781Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7118064Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7118218Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7118487Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7118662Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7118945Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7119094Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7119384Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7119601Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7119931Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7120280Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7120427Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7120937Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7121215Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7121458Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7121678Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7121895Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7122138Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7122372Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7122626Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7122860Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7123106Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7123339Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7123580Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7123813Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7124065Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7124300Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7124539Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7124773Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7125013Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7125257Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7125470Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7125703Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7125946Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7126179Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7126382Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7126613Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7126853Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7127095Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7127336Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7127570Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7127785Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7128013Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7128185Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7128391Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7128511Z E1204 11:13:23.738000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.7128684Z [W1204 11:13:23.294196611 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.7128687Z 2025-12-04T12:10:20.7129011Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7129320Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7129467Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7129975Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7130281Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7130523Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7130744Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7130960Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7131201Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7131449Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7131690Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7131925Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7132166Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7132396Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7132637Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7132881Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7133124Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7133355Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7133598Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7133831Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7134085Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7134331Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7134536Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7134769Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7135011Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7135248Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7135452Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7135684Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7135936Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7136170Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7136412Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7136645Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7136861Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7137086Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7137273Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7137472Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7138016Z E1204 11:13:23.834000 816968 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpci9a37e9/bu/cbuuzm2viaiv5nmviavrp5nmecwrl3yqy5smpjhgakd34iwlkbed.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7138177Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7138407Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7138587Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7138899Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7139045Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7139315Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7139483Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7139753Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7139933Z E1204 11:13:23.834000 816968 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7140257Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7140421Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7140709Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7140917Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7141271Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7141575Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7141721Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7142231Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7142498Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7142739Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7142959Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7143200Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7143441Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7143676Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7143919Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7144153Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7144401Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7144633Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7144873Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7145117Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7145377Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7145611Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7145854Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7146088Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7146327Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7146575Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7146781Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7147014Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7147256Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7147488Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7147717Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7147949Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7148190Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7148421Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7148666Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7148911Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7149131Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7149377Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7149549Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7149743Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7149861Z E1204 11:13:23.834000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.7150033Z [W1204 11:13:23.325983690 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.7150036Z 2025-12-04T12:10:20.7150401Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7150705Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7150866Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7151354Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7151628Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7151866Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7152106Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7152338Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7152578Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7152814Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7153055Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7153290Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7153529Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7153764Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7154031Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7154264Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7154507Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7154738Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7154981Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7155214Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7155470Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7155709Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7155913Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7156146Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7156390Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7156646Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7156849Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7157087Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7157329Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7157561Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7157803Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7158034Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7158250Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7158496Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7158670Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7158864Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7159404Z E1204 11:13:23.868000 816968 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpci9a37e9/3k/c3kxod6ilq6ubrfj5fnkjkxkkslxqgcec42vickndloclrpjosf5.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7159572Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7159814Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7159986Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7160340Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7160486Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7160757Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7160909Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7161215Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7161384Z E1204 11:13:23.868000 
816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7161665Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7161815Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7162108Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7162316Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7162641Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7162963Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7163106Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7163604Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7163871Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7164111Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7164346Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7164562Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7164804Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7165038Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7165280Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7165514Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7165780Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7166021Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7166262Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7166494Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7166739Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7166975Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7167216Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7167463Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7167704Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7167938Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7168142Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7168382Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7168624Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7168875Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7169083Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7169316Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7169558Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7169790Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7170032Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7170336Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7170553Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7170783Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7170959Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7171152Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7171271Z E1204 11:13:23.868000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.7171444Z [W1204 11:13:23.334193273 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.7171447Z 2025-12-04T12:10:20.7171614Z [W1204 11:13:23.342372908 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7171641Z 2025-12-04T12:10:20.7171968Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7172274Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7172420Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7172908Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7173172Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7173428Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7173647Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7173863Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7174110Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7174349Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7174625Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7174857Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7175098Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7175332Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7175573Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7175808Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7176049Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7176299Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7176540Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7176784Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7177024Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7177259Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7177463Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7177694Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7177952Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7178184Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7178388Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7178622Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7178864Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7179122Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7179362Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7179594Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7179810Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7180035Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7180252Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7180454Z E1204 
11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7180997Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpci9a37e9/o7/co7zkv32udemyplpnvfenp6mnv7sb5a5mzqj23arfu6glbalhm2t.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7181175Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7181405Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7181577Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7181877Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7182025Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7182315Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7182470Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7182737Z E1204 11:13:23.873000 816968 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7182906Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7183193Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7183345Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7183662Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7183869Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7184195Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7184500Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7184646Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7185135Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7185415Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7185655Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7185878Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7186093Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7186334Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7186569Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7186832Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7187067Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7187314Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7187546Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7187787Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7188041Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7188295Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7188528Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7188770Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7189003Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7189252Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7189485Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7189688Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7189938Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7190224Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7190457Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7190661Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7190893Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7191134Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7191386Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7191632Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7191874Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7192091Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7192322Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7192512Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7192720Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7192837Z E1204 11:13:23.873000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.7193160Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7193466Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7193612Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7194105Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7194393Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7194630Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7194852Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7195068Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7195309Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7195543Z E1204 11:13:23.882000 816968 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7195799Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7196033Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7196275Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7196512Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7196758Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7196991Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7197254Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7197487Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7197728Z E1204 11:13:23.882000 816968 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7197968Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7198213Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7198445Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7198650Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7198896Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7199138Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7199371Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7199577Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7199810Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7200052Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7200342Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7200590Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7200824Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7201041Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7201268Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7201460Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7201667Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7202217Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpci9a37e9/yh/cyhpxsxflgxt6nqs6ahc5soxw3ltnrmywhxypn65iyovg6yp3ont.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7202381Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7202612Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7202783Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7203095Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7203261Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7203531Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7203684Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7203953Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7204122Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7204405Z E1204 11:13:23.882000 816968 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7204554Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7204863Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7205071Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7205398Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7205703Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7205853Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7206367Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7206640Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7206880Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7207104Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7207324Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7207564Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7207800Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7208061Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7208294Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7208539Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7208778Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7209022Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7209260Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7209517Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7209753Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7209992Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7210269Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7210510Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7210771Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7210978Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7211211Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7211454Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7211686Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7211892Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7212123Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7212379Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7212611Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7212859Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7213091Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7213306Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7213531Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7213705Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7213917Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7214036Z E1204 11:13:23.882000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.7214206Z [W1204 11:13:23.349352607 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.7214208Z 2025-12-04T12:10:20.7214529Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7214834Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7214994Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7215507Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7215774Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7216017Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7216237Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7216452Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7216692Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7216949Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7217191Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7217424Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7217664Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7217896Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7218143Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7218393Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7218639Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7218872Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7219112Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7219345Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7219608Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7219840Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7220042Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7220333Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7220575Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7220808Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7221014Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7221246Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7221510Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7221747Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7221989Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7222220Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7222436Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7222660Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7222859Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7223053Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7223587Z E1204 11:13:23.888000 816968 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpci9a37e9/qc/cqc4eyd6o7wr33nr3us7ayg4boddpxlgd6t3ag24ccquyxuhfe3u.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.7223747Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.7223981Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.7224177Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.7224483Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.7224630Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.7224899Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.7225053Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.7225321Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.7225490Z E1204 11:13:23.888000 
816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.7225772Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.7225935Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.7226223Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.7226434Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.7226771Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7227075Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7227219Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7227727Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7228001Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7228242Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7228463Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7228701Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7228947Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7229182Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7229424Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7229658Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7229901Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7230175Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7230444Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7230675Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7230918Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7231150Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7231392Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7231626Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7231884Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7232118Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7232321Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7232553Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7232794Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7233033Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7233267Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7233498Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7233743Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7233974Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7234216Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7234450Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7234665Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.7234903Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.7235075Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.7235269Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.7235385Z E1204 11:13:23.888000 816968 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.7235455Z ('RERUN', {'yellow': True}) [11.6266s] [100%] 2025-12-04T12:10:20.7235820Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:13:25.362539073 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7235825Z 2025-12-04T12:10:20.7235987Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7236320Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.7236626Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7236770Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7237258Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7237535Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7237797Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7238016Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7238231Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7238474Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7238713Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7238954Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7239187Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7239446Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7239688Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7239929Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7240199Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7240442Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7240673Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7240906Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7241133Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7241348Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7241590Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7241822Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7242044Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7242293Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame 
from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7242538Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7242772Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7242975Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7243209Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7243424Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7243632Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7243878Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7244090Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7244294Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7244525Z E1204 
11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7244766Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7244998Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7245256Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7245489Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7245704Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7245928Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7246141Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7246383Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7246645Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7246886Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7247125Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7247366Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7247604Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7247844Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7248077Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7248340Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7248575Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7248817Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7249048Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7249288Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7249526Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7249780Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7250014Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7250287Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7250522Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7250765Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7251036Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7251277Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7251509Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7251725Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7251936Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.7252184Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7252417Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7252659Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7252908Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7253149Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7253382Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7253624Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7253858Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7254098Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7254348Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7254590Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7254823Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7255035Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7255241Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7255517Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7255727Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7255950Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7256164Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7256405Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7256640Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7256852Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7257055Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7257299Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7257513Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7257716Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7257954Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7258194Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7258427Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7258685Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7258920Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7259131Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7259359Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.7259572Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7259818Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7260080Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7260338Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7260552Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7260773Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7261017Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7261254Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7261496Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7261750Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7261993Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7262227Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7262469Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7262702Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7262949Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7263201Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7263416Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7263624Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7263862Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7264078Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7264305Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7264541Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7264782Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7265018Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7265260Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7265495Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7265742Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7265975Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7266238Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7266476Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7266718Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7266953Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7267170Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7267383Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7267603Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7267830Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7268044Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7268286Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7268522Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7268754Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7268977Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7269197Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7269440Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7269674Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7269916Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7270185Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7270426Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7270679Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7270921Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7271161Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7271402Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7271642Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7271883Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7272146Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7272390Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7272623Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7272865Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7273099Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7273373Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7273611Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7273852Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7274088Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7274301Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7274506Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7274741Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7274987Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7275236Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7275477Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7275713Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7275957Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7276194Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7276436Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7276687Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7276928Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7277161Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7277368Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7277604Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7277870Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7278106Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7278347Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7278593Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7278820Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7279037Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7279250Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7279479Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7279721Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7279956Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7280227Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7280443Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7280657Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7280873Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7281132Z E1204 
11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7281367Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7281582Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7281801Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7282015Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7282201Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.7282451Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7282655Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7282890Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7283134Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7283373Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7283586Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7283790Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7284045Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7284257Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7284468Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7284701Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7284907Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7285142Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7285362Z E1204 11:13:25.922000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7285602Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7285806Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7286042Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7286285Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7286520Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7286792Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7287027Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7287270Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7287502Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7287744Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7287984Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7288228Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7288488Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7288702Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7288908Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7289142Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7289375Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7289592Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7289825Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7290042Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7290323Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7290556Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7290799Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7291036Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7291273Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7291507Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7291748Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7291982Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7292225Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7292459Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7292692Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7292933Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7293148Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7293359Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7293564Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7293801Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7294043Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7294293Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7294536Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7294770Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7294974Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7295209Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7295456Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7295717Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7295959Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7296193Z 
E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7296399Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7296636Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7296878Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7297111Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7297367Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7297601Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7297829Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7298051Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7298265Z E1204 11:13:25.922000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7298481Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7298740Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7298981Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7299185Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7299417Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7299661Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7299896Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7300200Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7300435Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7300665Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7300881Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7301096Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7301318Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7301562Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7301815Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7302058Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7302292Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7302533Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7302766Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7302982Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7303236Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7303478Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7303719Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7303958Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7304194Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7304421Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7304673Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7304888Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7305104Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7305347Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7305585Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7305829Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7306063Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7306329Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7306565Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7306806Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7307040Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7307287Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7307523Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7307754Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.7307978Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.7308185Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.7308394Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.7308625Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.7308845Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.7309083Z E1204 
11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.7309289Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.7309502Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.7309691Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.7309833Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.7309997Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.7310154Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.7310297Z E1204 11:13:25.922000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7310470Z [W1204 11:13:25.388494987 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7310498Z 2025-12-04T12:10:20.7310658Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7310967Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7311276Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7311423Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7311914Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7312200Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7312440Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7312665Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7312883Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7313131Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7313384Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7313640Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7313873Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7314114Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7314347Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7314588Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7314820Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7315062Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7315309Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7315531Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7315754Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7315966Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7316207Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7316438Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7316659Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7316893Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7317133Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7317369Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7317570Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7317830Z E1204 
11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7318057Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7318260Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7318493Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7318704Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7318911Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7319144Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7319386Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7319633Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7319873Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7320153Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7320372Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7320593Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7320808Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7321069Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7321303Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7321545Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7321776Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7322021Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7322255Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7322529Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7322768Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7323011Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7323248Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7323489Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7323723Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7323970Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7324219Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7324461Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7324695Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7324935Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7325166Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7325409Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7325658Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7325900Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7326132Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7326354Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7326565Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.7326820Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7327065Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7327305Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7327544Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7327792Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7328028Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7328268Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7328500Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7328759Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7328992Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7329234Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7329465Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7329678Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7329881Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7330173Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7330388Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7330609Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.7330825Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7331067Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7331312Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7331540Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7331743Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7331975Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7332187Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7332395Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7332640Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7332881Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7333132Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7333373Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7333605Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7333817Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7334038Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7334254Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7334515Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7334753Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7334971Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7335185Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7335401Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7335642Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7335913Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7336156Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7336391Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7336634Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7336870Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7337113Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7337348Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7337611Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7337848Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7338066Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7338272Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7338505Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7338725Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7338962Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7339186Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7339427Z 
E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7339661Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7339904Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7340182Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7340461Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7340695Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7340936Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7341170Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7341413Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7341653Z 
E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7341869Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7342099Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7342308Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7342541Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7342758Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7343001Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7343236Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7343467Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7343683Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7343898Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7344140Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7344374Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7344619Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7344879Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7345128Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7345364Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7345605Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7345841Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7346082Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7346317Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7346572Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7346807Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7347050Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7347284Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7347540Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7347777Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7348033Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7348270Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7348511Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7348748Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7348961Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7349181Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7349426Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7349669Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7349915Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7350196Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7350432Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7350672Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7350906Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7351165Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7351400Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7351643Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7351876Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7352083Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7352329Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7352593Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7352827Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7353068Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7353303Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7353530Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7353784Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7354006Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7358393Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7358641Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7358876Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7359107Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7359324Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7359538Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7359788Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7360032Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7360330Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7360546Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7360762Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.7360973Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7361161Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.7361406Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7361613Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7361851Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7362095Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7362330Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7362572Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7362778Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7363013Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7363226Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7363432Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7363668Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7363873Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7364127Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7364336Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7364577Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7364783Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7365017Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7365260Z E1204 11:13:25.927000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7365498Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7365757Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7365994Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7366241Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7366476Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7366720Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7366981Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7367231Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7367465Z E1204 11:13:25.927000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7367678Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7367883Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7368118Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7368346Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7368579Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7368795Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7369015Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7369264Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7369504Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7369746Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7369998Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7370253Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7370487Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7370727Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7370962Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7371208Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7371475Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7371710Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7371932Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7372145Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7372353Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7372560Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7372796Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7373053Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7373287Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7373528Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7373765Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7373968Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7374203Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7374445Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7374697Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7374940Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7375175Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7375380Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7375619Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7375899Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7376141Z E1204 11:13:25.927000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7376382Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7376617Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7376845Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7377063Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7377277Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7377511Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7377757Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7378000Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7378204Z E1204 11:13:25.927000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7378437Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7378681Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7378913Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7379177Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7379411Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7379639Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7379863Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7380077Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7380367Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7380614Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7380849Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7381100Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7381335Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7381579Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7381812Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7382035Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7382275Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7382522Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7382757Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7383003Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7383238Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7383483Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7383706Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7383919Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7384134Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7384379Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7384612Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7384881Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7385115Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7385358Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7385592Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7385837Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7386079Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7386329Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7386578Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7386790Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.7387010Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.7387215Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.7387428Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.7387658Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.7387880Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.7388109Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.7388315Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.7388524Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.7388720Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.7388862Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.7389026Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.7389159Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.7389312Z E1204 11:13:25.927000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7389485Z [W1204 11:13:25.391080843 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7389488Z 2025-12-04T12:10:20.7389650Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7389962Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7390313Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7390463Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7390958Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7391256Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7391497Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7391722Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7391940Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7392183Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7392443Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7392687Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7392921Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7393162Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7393394Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7393662Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7393915Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7394159Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7394391Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7394603Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7394833Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7395048Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7395293Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7395541Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7395746Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7395983Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7396225Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7396457Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7396666Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7396917Z E1204 
11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7397130Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7397332Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7397564Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7397775Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7397978Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7398251Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7398495Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7398726Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7398967Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7399199Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7399415Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7399637Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7399866Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7400153Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7400386Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7400630Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7400874Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7401116Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7401366Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7401609Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7401842Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7402081Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7402316Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7402560Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7402825Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7403067Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7403309Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7403550Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7403786Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7404032Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7404264Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7404523Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7404757Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7404998Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7405232Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7405448Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7405667Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.7405926Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7406164Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7406411Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7406643Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7406884Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7407131Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7407383Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7407615Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7407857Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7408097Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7408341Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7408575Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7408785Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7409008Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7409240Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7409452Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7409674Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.7409887Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7410167Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7410424Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7410644Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7410846Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7411078Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7411289Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7411491Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7411767Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7412007Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7412244Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7412484Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7412718Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7412931Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7413155Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7413387Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7413631Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7413868Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7414090Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7414303Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7414519Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7414778Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7415016Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7415259Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7415504Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7415746Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7415982Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7416247Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7416487Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7416730Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7416963Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7417180Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7417384Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7417619Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7417859Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7418071Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7418288Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7418532Z 
E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7418767Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7419015Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7419265Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7419509Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7419743Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7419986Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7420261Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7420505Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7420772Z 
E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7420989Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7421203Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7421411Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7421641Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7421856Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7422098Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7422359Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7422575Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7422789Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7423005Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7423253Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7423488Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7423750Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7423990Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7424231Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7424465Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7424706Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7424948Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7425219Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7425454Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7425699Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7425935Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7426186Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7426419Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7426661Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7426917Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7427160Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7427401Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7427642Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7427876Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7428094Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7428316Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7428556Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7428797Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7429033Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7429274Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7429525Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7429776Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7430017Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7430298Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7430532Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7430780Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7431018Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7431223Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7431476Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7431717Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7431954Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7432194Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7432428Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7432655Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7432891Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7433113Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7433327Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7433576Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7433809Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7434064Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7434280Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7434500Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7434716Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7434958Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7435195Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7435411Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7435625Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.7435848Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7436018Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.7436256Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7436460Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7436699Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7436946Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7437194Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7437409Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7437616Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7437852Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7438067Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7438279Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7438540Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7438745Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7438978Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7439187Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7439426Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7439632Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7439867Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7440159Z E1204 11:13:25.930000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7440414Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7440660Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7440896Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7441138Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7441371Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7441613Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7441866Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7442114Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7442347Z E1204 11:13:25.930000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7442562Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7442769Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7443061Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7443290Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7443506Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7443722Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7443938Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7444185Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7444426Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7444680Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7444915Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7445120Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7445360Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7445607Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7445842Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7446084Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7446330Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7446563Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7446778Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7446993Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7447199Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7447428Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7447666Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7447917Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7448153Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7448394Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7448630Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7448833Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7449068Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7449319Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7449552Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7449802Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7450035Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7450269Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7450504Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7450762Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7450998Z E1204 11:13:25.930000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7451242Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7451477Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7451704Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7451947Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7452160Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7452375Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7452619Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7452865Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7453071Z E1204 11:13:25.930000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7453304Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7453551Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7453802Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7454043Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7454278Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7454505Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7454722Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7454934Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7455163Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7455408Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7455641Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7455890Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7456124Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7456387Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7456620Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7456825Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7457064Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7457312Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7457552Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7457794Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7458038Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7458266Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7458484Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7458698Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7458918Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7459162Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7459396Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7459651Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7459888Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7460166Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7460407Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7460648Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7460913Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7461154Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7461391Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7461603Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.7461820Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.7462034Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.7462249Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.7462494Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.7462714Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.7462928Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.7463135Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.7463343Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.7463531Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.7463672Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.7463834Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.7463967Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.7464111Z E1204 11:13:25.930000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7464284Z [W1204 11:13:25.440024740 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7464286Z 2025-12-04T12:10:20.7464446Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7464758Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7465067Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7465240Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7465730Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7466001Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7466245Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7466466Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7466681Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7466943Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7467179Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7467424Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7467662Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7467903Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7468138Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7468394Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7468628Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7468869Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7469102Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7469315Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7469547Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7469789Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7470033Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7470300Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7470505Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7470738Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7470979Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7471210Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7471426Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7471667Z E1204 
11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7471880Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7472082Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7472315Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7472534Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7472741Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7472992Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7473233Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7473464Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7473705Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7473940Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7474181Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7474405Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7474619Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7474864Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7475097Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7475340Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7475571Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7475829Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7476061Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7476303Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7476541Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7476780Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7477017Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7477273Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7477507Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7477747Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7477979Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7478230Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7478463Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7478725Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7478958Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7479200Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7479432Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7479672Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7479912Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7480166Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7480390Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.7480631Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7480866Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7481108Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7481342Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7481587Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7481837Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7482081Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7482313Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7482556Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7482789Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7483044Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7483299Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7483514Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7483718Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7483950Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7484165Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7484391Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.7484606Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7484862Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7485096Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7485309Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7485512Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7485745Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7485960Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7486172Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7486413Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7486660Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7486893Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7487133Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7487366Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7487598Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7487821Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7488034Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7488279Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7488513Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7488731Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7488943Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7489172Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7489414Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7489659Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7489901Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7490177Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7490420Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7490668Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7490911Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7491145Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7491392Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7491634Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7491855Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7492088Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7492322Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7492539Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7492753Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7492970Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7493214Z 
E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7493449Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7493710Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7493950Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7494193Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7494428Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7494669Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7494903Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7495161Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7495400Z 
E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7495616Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7495829Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7496037Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7496265Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7496500Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7496743Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7496985Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7497201Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7497417Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7497633Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7497877Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7498122Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7498367Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7498610Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7498852Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7499092Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7499333Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7499580Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7499823Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7500058Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7500368Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7500601Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7500843Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7501113Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7501361Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7501595Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7501840Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7502079Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7502332Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7502566Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7502795Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7503000Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7503238Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7503484Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7503718Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7503959Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7504208Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7504452Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7504690Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7504933Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7505166Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7505420Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7505672Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7505878Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7506113Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7506355Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7506592Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7506833Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7507069Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7507307Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7507526Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7507739Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7507955Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7508197Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7508432Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7508675Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7508898Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7509112Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7509328Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7509577Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7509831Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7510064Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7510327Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.7510533Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7510696Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.7510939Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7511145Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7511380Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7511637Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7511871Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7512088Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7512295Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7512529Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7512743Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7512947Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7513198Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7513404Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7513638Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7513850Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7514083Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7514315Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7514553Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7514795Z E1204 11:13:25.979000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7515029Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7515271Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7515507Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7515749Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7515983Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7516238Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7516473Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7516723Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7516969Z E1204 11:13:25.979000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7517185Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7517390Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7517635Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7517865Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7518082Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7518296Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7518511Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7518777Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7519011Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7519253Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7519490Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7519697Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7519933Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7520214Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7520448Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7520703Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7520937Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7521166Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7521383Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7521599Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7521805Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7522024Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7522266Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7522508Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7522743Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7522989Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7523248Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7523453Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7523689Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7523931Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7524166Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7524417Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7524651Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7524856Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7525102Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7525345Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7525579Z E1204 11:13:25.979000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7525820Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7526056Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7526284Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7526511Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7526732Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7526953Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7527196Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7527430Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7527663Z E1204 11:13:25.979000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7527897Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7528138Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7528378Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7528622Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7528857Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7529083Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7529301Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7529529Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7529753Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7529995Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7530268Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7530510Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7530745Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7531007Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7531241Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7531446Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7531681Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7531923Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7532182Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7532423Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7532658Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7532888Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7533115Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7533335Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7533550Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7533793Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7534039Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7534281Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7534515Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7534757Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7534993Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7535238Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7535490Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7535733Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7535968Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7536186Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.7536404Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.7536632Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.7536843Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.7537073Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.7537295Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.7537511Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.7537722Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.7537928Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.7538116Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.7538269Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.7538435Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.7538554Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.7538696Z E1204 11:13:25.979000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7538868Z [W1204 11:13:25.442393409 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7538870Z 2025-12-04T12:10:20.7539029Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7539336Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7539653Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7539802Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7540336Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7540614Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7540857Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7541108Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7541323Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7541564Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7541803Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7542045Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7542281Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7542521Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7542767Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7543014Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7543250Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7543492Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7543730Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7543943Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7544184Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7544400Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7544643Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7544874Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7545079Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7545314Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7545577Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7545810Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7546014Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7546247Z E1204 
11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7546459Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7546664Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7546904Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7547128Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7547330Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7547562Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7547808Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7548040Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7548281Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7548513Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7548737Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7548959Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7549178Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7549425Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7549658Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7549923Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7550208Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7550454Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7550687Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7550928Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7551168Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7551407Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7551654Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7551893Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7552127Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7552366Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7552602Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7552848Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7553091Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7553334Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7553565Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7553806Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7554039Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7554286Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7554545Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7554761Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7554973Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.7555216Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7555454Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7555695Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7555928Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7556178Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7556417Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7556659Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7556890Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7557132Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7557365Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7557617Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7557855Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7558067Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7558271Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7558503Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7558715Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7558957Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.7559173Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7559417Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7559655Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7559871Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7560077Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7560364Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7560597Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7560799Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7561034Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7561275Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7561507Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7561750Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7561982Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7562210Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7562433Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7562657Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7562904Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7563139Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7563384Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7563596Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7563812Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7564057Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7564296Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7564539Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7564772Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7565039Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7565273Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7565516Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7565750Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7565992Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7566228Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7566443Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7566666Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7566902Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7567119Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7567332Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7567547Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7567808Z 
E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7568042Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7568290Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7568530Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7568775Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7569011Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7569254Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7569498Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7569739Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7569976Z 
E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7570229Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7570449Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7570656Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7570880Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7571107Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7571351Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7571588Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7571806Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7572020Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7572260Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7572509Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7572742Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7572985Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7573220Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7573461Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7573699Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7573958Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7574194Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7574438Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7574674Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7574915Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7575149Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7575401Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7575636Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7575884Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7576121Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7576363Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7576599Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7576867Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7577103Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7577316Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7577523Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7577765Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7578007Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7578242Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7578495Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7578733Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7578976Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7579210Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7579451Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7579685Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7579944Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7580218Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7580425Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7580661Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7580908Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7581158Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7581411Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7581647Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7581876Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7582091Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7582307Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7582522Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7582764Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7583014Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7583250Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7583468Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7583681Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7583896Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7584139Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7584389Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7584607Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7584821Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.7585026Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7585197Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.7585432Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7585663Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7585900Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7586141Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7586376Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7586589Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7586795Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7587029Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7587252Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7587463Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7587697Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7587903Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7588140Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7588347Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7588581Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7588798Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7589035Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7589276Z E1204 11:13:25.981000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7589510Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7589751Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7590008Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7590290Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7590536Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7590781Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7591015Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7591259Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7591491Z E1204 11:13:25.981000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7591704Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7591927Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7592161Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7592390Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7592611Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7592825Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7593042Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7593299Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7593536Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7593777Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7594013Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7594218Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7594476Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7594732Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7594966Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7595210Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7595449Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7595691Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7595908Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7596123Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7596340Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7596546Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7596782Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7597024Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7597258Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7597499Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7597747Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7597955Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7598195Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7598437Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7598670Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7598937Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7599170Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7599375Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7599608Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7599851Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7600088Z E1204 11:13:25.981000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7600373Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7600607Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7600849Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7601067Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7601283Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7601498Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7601741Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7601981Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7602202Z E1204 11:13:25.981000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7602438Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7602682Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7602918Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7603160Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7603423Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7603650Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7603867Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7604080Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7604296Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7604538Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7604772Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7605019Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7605268Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7605519Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7605754Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7605961Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7606198Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7606439Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7606690Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7606933Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7607167Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7607394Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7607610Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7607858Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7608073Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7608315Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7608549Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7608791Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7609026Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7609268Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7609508Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7609762Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7609998Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7610282Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7610516Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7610729Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.7610953Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.7611181Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.7611393Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.7611628Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.7611851Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.7612063Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.7612287Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.7612506Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.7612693Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.7612835Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.7612997Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.7613116Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.7613258Z E1204 11:13:25.981000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7613429Z [W1204 11:13:25.444471872 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7613432Z 2025-12-04T12:10:20.7613590Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7613903Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7614233Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7614382Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7614878Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7615147Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7615385Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7615621Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7615837Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7616079Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7616322Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7616566Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7616826Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7617068Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7617301Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7617543Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7617776Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7618017Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7618249Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7618475Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7618696Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7618916Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7619158Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7619390Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7619594Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7619831Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7620129Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7620361Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7620565Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7620798Z E1204 
11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7621009Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7621249Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7621482Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7621694Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7621896Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7622131Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7622380Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7622611Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7622852Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7623103Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7623315Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7623538Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7623754Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7623997Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7624228Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7624488Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7624721Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7624962Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7625194Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7625436Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7625701Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7625941Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7626175Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7626416Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7626649Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7626890Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7627123Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7627386Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7627617Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7627859Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7628090Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7628336Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7628572Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7628829Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7629065Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7629280Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7629494Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.7629735Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7629968Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7630273Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7630506Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7630749Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7630981Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7631224Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7631455Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7631704Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7631955Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7632196Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7632431Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7632641Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7632847Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7633081Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7633308Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7633533Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.7633746Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7633987Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7634226Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7634438Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7634667Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7634901Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7635112Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7635316Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7635553Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7635795Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7636027Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7636287Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7636519Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7636733Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7636955Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7637168Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7637414Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7637649Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7637882Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7638097Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7638313Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7638556Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7638795Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7639061Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7639296Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7639537Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7639773Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7640015Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7640299Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7640541Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7640798Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7641011Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7641218Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7641455Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7641672Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7641886Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7642102Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7642363Z 
E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7642605Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7642846Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7643081Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7643323Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7643602Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7643845Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7644077Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7644320Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7644556Z 
E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7644774Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7644988Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7645208Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7645433Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7645648Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7649018Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7649261Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7649480Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7649694Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7649936Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7650226Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7650468Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7650721Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7650956Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7651241Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7651476Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7651721Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7651961Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7652203Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7652441Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7652690Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7652946Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7653187Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7653422Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7653668Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7653908Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7654155Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7654408Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7654651Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7654888Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7655110Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7655324Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7655560Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7655826Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7656062Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7656305Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7656544Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7656787Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7657021Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7657262Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7657515Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7657762Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7657996Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7658209Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7658444Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7658688Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7658940Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7659183Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7659418Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7659647Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7659867Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7660085Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7660386Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7660634Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7660870Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7661104Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7661323Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7661538Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7661753Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7662010Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7662248Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7662473Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7662694Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.7662902Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7663071Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.7663306Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7663527Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7663763Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7664005Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7664240Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7664453Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7664689Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7664927Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7665142Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7665347Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7665584Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7665799Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7666033Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7666236Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7666485Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7666696Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7666945Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7667196Z E1204 11:13:25.983000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7667434Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7667683Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7667941Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7668186Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7668423Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7668668Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7668900Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7669163Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7669396Z E1204 11:13:25.983000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7669611Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7669816Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7670053Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7670341Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7670559Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7670774Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7671006Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7671250Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7671486Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7671727Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7671962Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7672167Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7672412Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7672656Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7672895Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7673145Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7673379Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7673619Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7673849Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7674063Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7674270Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7674474Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7674711Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7674955Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7675189Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7675443Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7675678Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7675883Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7676118Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7676361Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7676595Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7676854Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7677090Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7677296Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7677532Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7677774Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7678019Z E1204 11:13:25.983000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7678270Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7678504Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7678732Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7678949Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7679163Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7679378Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7679622Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7679870Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7680075Z E1204 11:13:25.983000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7680355Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7680601Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7680834Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7681077Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7681332Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7681560Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7681779Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7681993Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7682210Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7682470Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7682718Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7682967Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7683201Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7683444Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7683680Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7683885Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7684118Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7684373Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7684611Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7684853Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7685087Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7685313Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7685530Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7685754Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7685970Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7686213Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7686452Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7686695Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7686945Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7687200Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7687434Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7687676Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7687912Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7688154Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7688387Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7688599Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.7688827Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.7689039Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.7689252Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.7689483Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.7689704Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.7689919Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.7690175Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.7690385Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.7690573Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.7690715Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.7690882Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.7691004Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.7691146Z E1204 11:13:25.983000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7691230Z ('RERUN', {'yellow': True}) [1.7244s] [100%] 2025-12-04T12:10:20.7691614Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:13:27.865495910 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7691618Z 2025-12-04T12:10:20.7691781Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7692095Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7692406Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7692555Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7693055Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7693334Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7693575Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7693798Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7694012Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7694257Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7694492Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7694747Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7694986Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7695228Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7695461Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7695702Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7695955Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7696197Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7696430Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7696642Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7696865Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7697082Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7697322Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7697584Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7697790Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7698024Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7698267Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7698500Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7698704Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7698935Z E1204 
11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7699156Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7699359Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7699591Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7699803Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7700008Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7700375Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7700630Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7700862Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7701107Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7701345Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7701558Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7701780Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7701995Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7702249Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7702484Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7702727Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7702959Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7703199Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7703432Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7703692Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7703925Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7704166Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7704398Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7704639Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7704896Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7705139Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7705371Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7705612Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7705845Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7706086Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7706317Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7706566Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7706798Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7707040Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7707277Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7707498Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7707710Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.7707952Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7708194Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7708437Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7708670Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7708913Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7709147Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7709408Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7709643Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7709890Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7710160Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7710403Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7710635Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7710847Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7711065Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7711297Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7711510Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7711733Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.7711949Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7712194Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7712431Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7712657Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7712860Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7713197Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7713408Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7713610Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7713875Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7714117Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7714349Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7714593Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7714831Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7715047Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7715270Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7715483Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7715738Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7715973Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7716191Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7716404Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7716620Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7716864Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7717111Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7717356Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7717589Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7717833Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7718068Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7718328Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7718563Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7718808Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7719049Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7719263Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7719470Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7719706Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7719926Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7720197Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7720411Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7720655Z 
E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7720889Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7721133Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7721369Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7721626Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7721862Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7722104Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7722344Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7722586Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7722848Z 
E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7723065Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7723278Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7723489Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7723713Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7723932Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7724172Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7724407Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7724638Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7724850Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7725066Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7725308Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7725543Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7725788Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7726039Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7726283Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7726516Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7726758Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7726993Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7727254Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7727488Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7727731Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7727966Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7728208Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7728446Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7728689Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7728933Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7729176Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7729414Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7729656Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7729890Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7730139Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7730344Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7730595Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7730839Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7731073Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7731317Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7731556Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7731833Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7732067Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7732309Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7732543Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7732785Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7733020Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7733226Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7733476Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7733716Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7733953Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7734194Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7734427Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7734660Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7734892Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7735109Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7735323Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7735566Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7735803Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7736031Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7736267Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7736481Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7736697Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7736938Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7737174Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7737393Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7737605Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.7737827Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7737995Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.7738233Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7738439Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7738673Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7738916Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7739150Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7739374Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7739579Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7739814Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7740026Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7740264Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7740514Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7740732Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7740967Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7741173Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7741410Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7741615Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7741850Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7742093Z E1204 11:13:27.404000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7742338Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7742582Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7742821Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7743063Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7743296Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7743543Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7743792Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7744038Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7744273Z E1204 11:13:27.404000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7744487Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7744693Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7744939Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7745178Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7745396Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7745613Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7745831Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7746075Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7746310Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7746550Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7746796Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7747002Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7747239Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7747481Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7747714Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7747957Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7748212Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7748440Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7748656Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7748870Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7749077Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7749281Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7749540Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7749784Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7750018Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7750291Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7750527Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7750731Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7750965Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7751221Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7751462Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7751704Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7751940Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7752144Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7752379Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7752633Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7752870Z E1204 11:13:27.404000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7753113Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7753348Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7753576Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7753807Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7754032Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7754249Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7754496Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7754733Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7754943Z E1204 11:13:27.404000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7755181Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7755421Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7755666Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7755909Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7756145Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7756372Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7756589Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7756832Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7757057Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7757304Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7757539Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7757782Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7758018Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7758271Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7758515Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7758719Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7758955Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7759196Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7759432Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7759675Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7759908Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7760186Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7760403Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7760622Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7760842Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7761084Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7761320Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7761575Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7761811Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7762054Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7762290Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7762533Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7762780Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7763043Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7763278Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7763492Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.7763708Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.7763916Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.7764129Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.7764358Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.7764600Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.7764815Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.7765024Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.7765229Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.7765419Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.7765563Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0
2025-12-04T12:10:20.7765725Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0
2025-12-04T12:10:20.7765850Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] .
2025-12-04T12:10:20.7766019Z E1204 11:13:27.404000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice.
2025-12-04T12:10:20.7766194Z [W1204 11:13:27.867746891 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...
2025-12-04T12:10:20.7766197Z
2025-12-04T12:10:20.7766355Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning:
2025-12-04T12:10:20.7766666Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.
2025-12-04T12:10:20.7766975Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7767134Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7767636Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7767906Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7768147Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7768369Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7768583Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7768825Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7769069Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7769317Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7769555Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7769796Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7770029Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7770310Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7770555Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7770796Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7771029Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7771241Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7771464Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7771708Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7771953Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7772192Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7772396Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7772632Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7772875Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7773107Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7773309Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7773554Z E1204 
11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7773766Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7773972Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7774209Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7774419Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7774621Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7774863Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7775108Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7775341Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7775583Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7775815Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7776037Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7776270Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7776483Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7776726Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7776962Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7777206Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7777439Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7777681Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7777925Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7778167Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7778402Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7778642Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7778876Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7779117Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7779365Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7779624Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7779856Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7780136Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7780371Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7780642Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7780877Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7781117Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7781354Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7781598Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7781833Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7782049Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7782271Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.7782512Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7782745Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7782995Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7783228Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7783471Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7783705Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7783965Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7784199Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7784440Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7784674Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7784914Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7785176Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7785388Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7785589Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7785823Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7786036Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7786269Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.7786482Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7786724Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7786965Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7787176Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7787381Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7787614Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7787826Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7788028Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7788270Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7788521Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7788755Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7788997Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7789229Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7789452Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7789682Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7789898Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7790181Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7790417Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7790638Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7790852Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7791068Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7791322Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7791557Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7791802Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7792040Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7792282Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7792518Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7792777Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7793014Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7793257Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7793493Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7793706Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7793927Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7794177Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7794396Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7794610Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7794824Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7795070Z 
E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7795304Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7795546Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7795794Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7796036Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7796276Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7796519Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7796754Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7796995Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7797256Z 
E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7797473Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7797686Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7797896Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7798122Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7798358Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7798609Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7798844Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7799060Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7799274Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7799495Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7799737Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7799973Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7800267Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7800502Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7800748Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7800984Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7801235Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7801469Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7801723Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7801959Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7802203Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7802439Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7802683Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7802944Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7803187Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7803422Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7803664Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7803903Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7804151Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7804385Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7804613Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7804817Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7805054Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7805297Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7805532Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7805777Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7806010Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7806262Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7806499Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7806744Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7806978Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7807221Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7807476Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7807697Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7807936Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7808179Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7808415Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7808658Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7808892Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7809130Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7809346Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7809560Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7809775Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7810017Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7810292Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7810547Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7810781Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7810996Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7811212Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7811456Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7811692Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7811931Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7812145Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.7812352Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7812516Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.7812757Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7812966Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7813200Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7813442Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7813693Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7813909Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7814115Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7814350Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7814561Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7814768Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7815021Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7815233Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7815468Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7815671Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7815906Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7816114Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7816370Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7816611Z E1204 11:13:27.406000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7816846Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7817088Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7817322Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7817564Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7817800Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7818051Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7818285Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7818535Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7818768Z E1204 11:13:27.406000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7818981Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7819187Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7819431Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7819662Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7819878Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7820142Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7820359Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7820606Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7820875Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7821117Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7821352Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7821555Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7821796Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7822038Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7822271Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7822527Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7822762Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7822990Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7823208Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7823421Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7823629Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7823845Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7824082Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7824324Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7824559Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7824805Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7825043Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7825269Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7825503Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7825748Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7825981Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7826226Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7826461Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7826665Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7826909Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7827152Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7827388Z E1204 11:13:27.406000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7827629Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7827863Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7828100Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7828326Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7828542Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7828757Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7829000Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7829235Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7829442Z E1204 11:13:27.406000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7829696Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7829938Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7830222Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7830468Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7830703Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7830931Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7831150Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7831388Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7831603Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7831847Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7832080Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7832322Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7832557Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7832815Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7833052Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7833257Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7833491Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7833733Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7833968Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7834234Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7834469Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7834698Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7834914Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7835136Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7835352Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7835594Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7835837Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7836081Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7836317Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7836558Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7836792Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7837033Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7837279Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7837523Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7837759Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7837976Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.7838196Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.7838413Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.7838636Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.7838866Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.7839089Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.7839304Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.7839516Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.7839723Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.7839909Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.7840062Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.7840265Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.7840383Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.7840526Z E1204 11:13:27.406000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7840698Z [W1204 11:13:27.870934060 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7840702Z 2025-12-04T12:10:20.7840860Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7841170Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7841476Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7841635Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7842128Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7842397Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7842643Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7842876Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7843104Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7843346Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7843582Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7843824Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7844061Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7844303Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7844549Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7844807Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7845040Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7845283Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7845514Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7845728Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7845958Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7846182Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7846425Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7846657Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7846862Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7847096Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7847368Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7847600Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7847803Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7848036Z E1204 
11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7848249Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7848458Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7848689Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7848900Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7849114Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7849346Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7849590Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7849826Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7850068Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7850340Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7850572Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7850798Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7851011Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7851253Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7851487Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7851728Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7851985Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7852226Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7852460Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7852699Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7852934Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7853174Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7853410Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7853669Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7853904Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7854146Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7854378Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7854620Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7854853Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7855103Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7855340Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7855582Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7855816Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7856059Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7856315Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7856531Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7856743Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.7856995Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7857227Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7857469Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7857700Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7857941Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7858182Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7858423Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7858657Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7858898Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7859131Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7859374Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7859617Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7859829Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7860031Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7860300Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7860515Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7860766Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.7860979Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7861220Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7861453Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7861667Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7861872Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7862103Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7862317Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7862534Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7862770Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7863013Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7863246Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7863487Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7863719Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7863943Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7864166Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7864380Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7864626Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7864865Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7865106Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7865328Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7865545Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7865790Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7866025Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7866269Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7866504Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7866747Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7866990Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7867233Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7867468Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7867709Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7867942Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7868165Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7868383Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7868619Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7868838Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7869051Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7869268Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7869511Z 
E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7869769Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7870013Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7870286Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7870533Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7870770Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7871013Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7871264Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7871520Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7871756Z 
E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7871974Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7872186Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7872393Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7872624Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7872856Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7873101Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7873339Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7873560Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7873778Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7873996Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7874267Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7874503Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7874746Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7874982Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7875227Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7875461Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7875702Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7875951Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7876194Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7876430Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7876677Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7876914Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7877156Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7877402Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7877646Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7877882Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7878124Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7878360Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7878621Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7878856Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7879069Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7879276Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7879512Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7879756Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7879994Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7880292Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7880544Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7880791Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7881027Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7881269Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7881504Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7881766Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7882005Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7882216Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7882456Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7882698Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7882932Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7883207Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7883450Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7883676Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7883894Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7884123Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7884341Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7884583Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7884830Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7885059Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7885277Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7885493Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7885710Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7885956Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7886190Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7886418Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7886633Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.7886840Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7887004Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.7887239Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7887455Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7887700Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7887944Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7888181Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7888398Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7888604Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7888841Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7889053Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7889270Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7889505Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7889711Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7889944Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7890196Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7890431Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7890646Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7890882Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7892045Z E1204 11:13:27.410000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7892282Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7892526Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7892770Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7893029Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7893276Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7893518Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7893754Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7893997Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7894230Z E1204 11:13:27.410000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7894448Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7894666Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7894900Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7895133Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7895352Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7895566Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7895782Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7896035Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7896271Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7896557Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7896792Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7896998Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7897234Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7897488Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7897727Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7897975Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7898209Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7898457Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7898675Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7898890Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7899116Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7899321Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7899556Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7899804Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7900039Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7900320Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7900574Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7900781Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7901017Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7901284Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7901519Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7901761Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7902008Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7902216Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7902451Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7902692Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7902925Z E1204 11:13:27.410000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7903167Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7903404Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7903643Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7903865Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7904081Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7904298Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7904542Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7904779Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7904995Z E1204 11:13:27.410000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7905229Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7905484Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7905718Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7905963Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7906197Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7906439Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7906662Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7906877Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7907109Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7907352Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7907588Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7907830Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7908076Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7908318Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7908553Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7908758Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7908994Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7909238Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7909482Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7909731Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7909984Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7910247Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7910473Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7910687Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7910918Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7911164Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7911400Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7911642Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7911878Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7912122Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7912356Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7912611Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7912849Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7913092Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7913327Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7913542Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.7913761Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.7913980Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.7914193Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.7914435Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.7914655Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.7914870Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.7915077Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.7915299Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.7915485Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.7915628Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.7915789Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.7915910Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.7916054Z E1204 11:13:27.410000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7916231Z [W1204 11:13:27.919818367 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7916233Z 2025-12-04T12:10:20.7916393Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7916704Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7917023Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7917167Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7917661Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7917930Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7918170Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7918405Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7918620Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7918875Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7919116Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7919356Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7919589Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7919839Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7920076Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7920359Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7920593Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7920836Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7921072Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7921286Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7921521Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7921743Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7921984Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7922220Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7922425Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7922657Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7922912Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7923149Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7923366Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7923599Z E1204 
11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7923811Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7924020Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7924264Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7924477Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7924681Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7924916Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7925158Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7925392Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7925636Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7925877Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7926088Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7926310Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7926532Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7926773Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7927006Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7927260Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7927493Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7927751Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7927984Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7928226Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7928460Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7928710Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7928944Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7929185Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7929417Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7929657Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7929893Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7930176Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7930426Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7930673Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7930906Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7931149Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7931383Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7931625Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7931870Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7932089Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7932319Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.7932560Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7932796Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7933039Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7933293Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7933534Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7933769Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7934011Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7934243Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7934486Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7934717Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7934970Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7935204Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7935415Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7935628Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7935860Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7936070Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7936301Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.7936517Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7936769Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7937004Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7937215Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7937417Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7937662Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7937955Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7938163Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7938400Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7938642Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7938875Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7939115Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7939359Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7939572Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7939795Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7940009Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7940294Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7940529Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7940759Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7940975Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7941208Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7941452Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7941688Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7941939Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7942186Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7942428Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7942664Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7942905Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7943140Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7943382Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7943617Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7943845Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7944051Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7944289Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7944509Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7944722Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7944938Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7945189Z 
E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7945427Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7945681Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7945915Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7946159Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7946393Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7946644Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7946881Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7947125Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7947359Z 
E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7947577Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7947789Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7947996Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7948232Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7948449Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7948692Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7948925Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7949143Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7949355Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7949600Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7949844Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7952272Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7952526Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7952763Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7953009Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7953274Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7953517Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7953753Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7953993Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7954228Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7954468Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7954704Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7954958Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7955194Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7955437Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7955673Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7955917Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7956152Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7956410Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7956645Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7956876Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7957084Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7957319Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7957573Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7957806Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7958050Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7958286Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7958529Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7958766Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7959007Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7959243Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7959494Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7959728Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7959934Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7960205Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7960447Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7960683Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7960939Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7961173Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7961416Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7961634Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7961849Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7962078Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7962320Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7962555Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7962783Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7963002Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7963216Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7963432Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7963673Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7963919Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7964137Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7964350Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.7964558Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7964724Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.7964959Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7965174Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7965411Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7965665Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7965899Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7966113Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7966319Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7966563Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7966778Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7966982Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7967217Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7967422Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7967660Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7967868Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7968119Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7968322Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7968557Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7968801Z E1204 11:13:27.459000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7969034Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7969277Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7969523Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7969767Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7970018Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7970302Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7970538Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7970779Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7971025Z E1204 11:13:27.459000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7971240Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7971444Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7971678Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7971905Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7972122Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7972338Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7972571Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7972814Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7973049Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7973292Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7973526Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7973730Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7973977Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7974221Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7974458Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7974712Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7974950Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7975178Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7975405Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7975620Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7975829Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.7976034Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7976268Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7976512Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7976747Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7976999Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7977234Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.7977441Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7977679Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7977922Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7978159Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7978411Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7978647Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7978850Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7979095Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7979340Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7979574Z E1204 11:13:27.459000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7979829Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7980065Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7980327Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7980543Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7980757Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7980974Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7981216Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7981466Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7981669Z E1204 11:13:27.459000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7981904Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7982148Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7982382Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7982623Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7982868Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7983096Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7983313Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7983540Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7983756Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7984000Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7984255Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7984501Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7984737Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7984979Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7985215Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7985420Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7985654Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7985906Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7986138Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7986381Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7986614Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7986845Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.7987066Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.7987288Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.7987506Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7987749Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7987993Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7988236Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7988471Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7988723Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7988959Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7989201Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7989437Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7989681Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7989917Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7990203Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.7990435Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.7990639Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.7990852Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.7991082Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.7991306Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.7991519Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.7991725Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.7991944Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.7992131Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.7992287Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.7992448Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.7992569Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.7992710Z E1204 11:13:27.459000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.7992883Z [W1204 11:13:27.922352184 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.7992886Z 2025-12-04T12:10:20.7993058Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.7993374Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.7993687Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.7993832Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.7994333Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.7994601Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.7994852Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.7995074Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.7995288Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7995531Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.7995767Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7996010Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7996252Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7996495Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7996738Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7996979Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7997211Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7997451Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7997693Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7997906Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.7998131Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.7998346Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.7998588Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7998821Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7999025Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.7999270Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7999512Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.7999747Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.7999953Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8000227Z E1204 
11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8000440Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8000655Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8000890Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8001123Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8001326Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8001562Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8001805Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8002054Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8002295Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8002530Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8002742Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8002967Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8003182Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8003423Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8003669Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8003910Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8004146Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8004387Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8004623Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8004864Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8005106Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8005348Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8005591Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8005831Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8006064Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8006306Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8006551Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8006792Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8007024Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8007263Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8007497Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8007737Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8007972Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8008225Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8008458Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8008674Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8008889Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.8009131Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8009364Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8009615Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8009850Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8010140Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8010373Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8010613Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8010846Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8011103Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8011336Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8011580Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8011811Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8012030Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8012234Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8012468Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8012695Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8012916Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.8013131Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8013373Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8013607Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8013817Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8014033Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8014268Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8014490Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8014694Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8014927Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8015169Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8015410Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8015653Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8015887Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8016097Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8016320Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8016534Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8016782Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8017026Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8017243Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8017456Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8017671Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8017915Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8018149Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8018408Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8018642Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8018896Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8019133Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8019375Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8019609Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8019860Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8020129Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8020344Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8020549Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8020784Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8021000Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8021213Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8021443Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8021689Z 
E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8021924Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8022166Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8022404Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8022646Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8022891Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8023134Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8023385Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8023626Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8023861Z 
E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8024080Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8024306Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8024515Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8024739Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8024954Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8025196Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8025431Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8025649Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8025874Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8026088Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8026329Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8026566Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8026809Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8027044Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8027299Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8027534Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8027786Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8028019Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8028263Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8028498Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8028749Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8028985Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8029227Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8029464Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8029706Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8029939Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8030217Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8030466Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8030709Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8030942Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8031159Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8031363Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8031600Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8031853Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8032089Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8032345Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8032579Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8032820Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8033065Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8033309Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8033544Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8033786Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8034021Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8034225Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8034461Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8034701Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8034953Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8035195Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8035428Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8035659Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8035876Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8036089Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8036314Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8036556Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8036801Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8037028Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8037246Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8037470Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8037685Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8037928Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8038167Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8038384Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8038596Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.8038805Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8038966Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.8039212Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8039416Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8039651Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8039897Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8040193Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8040406Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8040624Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8040861Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8041089Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8041293Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8041528Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8041737Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8041984Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8042188Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8042426Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8042629Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8042865Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8043111Z E1204 11:13:27.461000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8043349Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8043605Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8043838Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8044081Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8044317Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8044560Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8044797Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8045049Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8045284Z E1204 11:13:27.461000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8045506Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8045710Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8045947Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8046175Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8046401Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8046616Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8046831Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8047074Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8047310Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8047554Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8047788Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8048002Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8048237Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8048481Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8048715Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8048959Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8049194Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8049421Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8049648Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8049861Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8050077Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8050324Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8050560Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8050821Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8051053Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8051301Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8051537Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8051742Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8051975Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8052219Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8052465Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8052705Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8052939Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8053143Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8053377Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8053621Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8053856Z E1204 11:13:27.461000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8054111Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8054344Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8054584Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8054800Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8055014Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8055238Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8055481Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8055715Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8055921Z E1204 11:13:27.461000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8056156Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8056398Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8056632Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8056883Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8057117Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8057345Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8057561Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8057777Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8057992Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8058236Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8058481Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8058724Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8058970Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8059212Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8059448Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8059664Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8059900Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8060178Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8060412Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8060655Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8060892Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8061120Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8061348Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8061561Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8061777Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8062019Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8062254Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8062495Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8062742Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8062984Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8063220Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8063476Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8063711Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8063954Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8064200Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8064413Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.8064630Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.8064836Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.8065046Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.8065275Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.8065499Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.8065723Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.8065929Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.8066134Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.8066321Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.8066463Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.8066621Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.8066741Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.8066880Z E1204 11:13:27.461000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.8067053Z [W1204 11:13:27.924480787 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8067065Z 2025-12-04T12:10:20.8067225Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.8067535Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8067861Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8068008Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8068511Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8068780Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8069023Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8069242Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8069458Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8069702Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8069937Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8070232Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8070467Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8070709Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8070941Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8071186Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8071420Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8071674Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8071913Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8072137Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8072363Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8072576Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8072820Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8073066Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8073270Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8073505Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8073745Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8073977Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8074180Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8074416Z E1204 
11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8074641Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8074841Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8075075Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8075288Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8075494Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8075727Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8075968Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8076219Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8076459Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8076703Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8076914Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8077137Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8077361Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8077605Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8077842Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8078085Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8078318Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8078558Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8078793Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8079043Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8079275Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8079517Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8079751Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8079996Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8080272Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8080525Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8080758Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8081012Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8081244Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8081484Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8081717Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8081973Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8082207Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8082448Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8082682Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8082899Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8083111Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.8083354Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8083601Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8083842Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8084075Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8084315Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8084547Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8084788Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8085043Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8085287Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8085530Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8085772Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8086004Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8086218Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8086430Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8086665Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8086876Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8087097Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.8087312Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8087555Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8087787Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8088007Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8088209Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8088443Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8088653Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8088856Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8089088Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8089339Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8089571Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8089825Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8090058Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8090315Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8090537Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8090765Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8091011Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8091244Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8091461Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8091675Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8091894Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8092138Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8092387Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8092629Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8092863Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8093105Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8093338Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8093581Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8093827Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8094069Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8094317Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8094531Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8094737Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8094970Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8095197Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8095412Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8095625Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8095869Z 
E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8096104Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8096347Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8096581Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8096835Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8097069Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8097310Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8097545Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8097786Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8098023Z 
E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8098247Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8098463Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8098682Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8098906Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8099124Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8099367Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8099611Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8099830Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8100042Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8100302Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8100543Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8100779Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8101021Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8101275Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8101518Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8101754Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8101996Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8102231Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8102475Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8102723Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8102968Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8103218Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8103458Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8103694Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8103937Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8104183Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8104426Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8104663Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8104904Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8105139Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8105353Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8105558Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8105803Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8106044Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8106281Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8106524Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8106759Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8107002Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8107245Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8107487Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8107731Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8107973Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8108207Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8108412Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8108659Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8108901Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8109137Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8109380Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8109613Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8109841Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8110057Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8110320Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8110534Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8110778Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8111013Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8111243Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8111459Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8111684Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8111900Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8112158Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8112395Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8112612Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8112824Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.8113043Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8113206Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.8113444Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8113649Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8113884Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8114126Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8114361Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8114587Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8114790Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8115026Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8115238Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8115444Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8115680Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8115885Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8116130Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8116334Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8116580Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8116783Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8117018Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8117279Z E1204 11:13:27.463000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8117514Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8117760Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8117995Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8118241Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8118477Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8118723Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8118970Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8119211Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8119446Z E1204 11:13:27.463000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8119659Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8119864Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8120137Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8120366Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8120597Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8120811Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8121038Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8121281Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8121516Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8121772Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8122007Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8122212Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8122447Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8122693Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8122927Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8123169Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8123414Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8123641Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8123859Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8124072Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8124279Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8124484Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8124720Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8124973Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8125211Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8125463Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8125697Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8125900Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8126151Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8126394Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8126628Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8126870Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8127105Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8127311Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8127547Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8127788Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8128032Z E1204 11:13:27.463000 
816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8128273Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8128506Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8128733Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8128949Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8129162Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8129386Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8129632Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8129878Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8130083Z E1204 11:13:27.463000 816968 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8130354Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8130609Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8130842Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8131083Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8131317Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8131544Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8131761Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8131976Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8132192Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8132446Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8132679Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8132921Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8133155Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8133396Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8133629Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8133845Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8134080Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8134340Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8134576Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8134818Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8135060Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8135290Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8135506Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8135722Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8135937Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8136179Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8136413Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8136653Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8136899Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8137140Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8137375Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8137617Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8137854Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8138112Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8138349Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8138561Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.8138789Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.8138996Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.8139208Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.8139445Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.8139667Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.8139880Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.8140086Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.8140332Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.8140519Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.8140659Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.8140819Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.8140936Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.8141094Z E1204 11:13:27.463000 816968 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.8141153Z FAILED [1.6170s] [100%] 2025-12-04T12:10:20.8141156Z 2025-12-04T12:10:20.8141230Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.8141409Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.8141473Z Traceback (most recent call last): 2025-12-04T12:10:20.8141652Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.8141712Z method(*args, **kwargs) 2025-12-04T12:10:20.8141879Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.8141937Z method(*args, **kwargs) 2025-12-04T12:10:20.8142103Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.8142160Z with policy(): 2025-12-04T12:10:20.8142328Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.8142385Z raise RuntimeError(msg) 2025-12-04T12:10:20.8142825Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. 
CUDA driver allocated memory was 807403520 and is now 1954545664. 2025-12-04T12:10:20.8142842Z 2025-12-04T12:10:20.8142935Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.8143228Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.8143231Z 2025-12-04T12:10:20.8143338Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.8143432Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.8143495Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.8143571Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.8144154Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.8144273Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.8144331Z graph_break [] 2025-12-04T12:10:20.8144413Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.8144508Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.8145010Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.8145077Z current_size = base.storage().size() 2025-12-04T12:10:20.8145136Z Autotune Choices Stats: 2025-12-04T12:10:20.8145529Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009960000403225422, "best_triton_pos": 0} 2025-12-04T12:10:20.8145621Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.8145687Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.8145804Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.8146062Z triton_mm_34 0.0100 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8146308Z triton_mm_29 0.0104 ms 95.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8146549Z triton_mm_16 0.0116 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8146801Z triton_mm_21 0.0118 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8147044Z triton_mm_22 0.0118 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8147299Z triton_mm_23 0.0118 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8147548Z triton_mm_15 0.0120 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8147793Z triton_mm_33 0.0120 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8148045Z triton_mm_30 0.0120 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8148289Z triton_mm_31 0.0128 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8148437Z SingleProcess AUTOTUNE benchmarking takes 0.1678 seconds and 8.5806 seconds precompiling for 33 choices 2025-12-04T12:10:20.8148613Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.8148675Z Traceback (most recent call last): 2025-12-04T12:10:20.8148848Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.8148906Z method(*args, 
**kwargs) 2025-12-04T12:10:20.8149073Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.8149130Z method(*args, **kwargs) 2025-12-04T12:10:20.8149296Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.8149360Z with policy(): 2025-12-04T12:10:20.8149529Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.8149587Z raise RuntimeError(msg) 2025-12-04T12:10:20.8150012Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:20.8150015Z 2025-12-04T12:10:20.8150146Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.8150434Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.8150437Z 2025-12-04T12:10:20.8150542Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.8150631Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.8150694Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.8150768Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.8151360Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 
6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.8151490Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.8151544Z graph_break [] 2025-12-04T12:10:20.8151626Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.8151714Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.8152214Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.8152277Z current_size = base.storage().size() 2025-12-04T12:10:20.8152348Z Autotune Choices Stats: 2025-12-04T12:10:20.8152735Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009960000403225422, "best_triton_pos": 0} 2025-12-04T12:10:20.8152814Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.8152879Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.8152994Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.8153247Z triton_mm_34 0.0100 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8153493Z triton_mm_29 0.0104 ms 95.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8153733Z triton_mm_16 0.0116 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8153988Z triton_mm_21 0.0118 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8154230Z triton_mm_22 0.0118 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8154474Z triton_mm_23 0.0118 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8154717Z triton_mm_15 0.0120 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8154962Z triton_mm_33 0.0120 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8155213Z triton_mm_30 0.0120 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8155461Z triton_mm_31 0.0128 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8155620Z SingleProcess AUTOTUNE benchmarking takes 0.1678 seconds and 8.5806 seconds precompiling for 33 choices 2025-12-04T12:10:20.8155711Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.8155769Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.8155843Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.8155958Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.8156467Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.8156528Z graph_break [] 2025-12-04T12:10:20.8156609Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.8156698Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.8157077Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.8157185Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.8157242Z Autotune Choices Stats: 2025-12-04T12:10:20.8157629Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00791999977082014, "best_triton_pos": 0} 2025-12-04T12:10:20.8157706Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.8157773Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.8157902Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.8158154Z triton_mm_72 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8158402Z triton_mm_71 0.0095 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8158462Z _scaled_mm 0.0096 ms 82.5% 2025-12-04T12:10:20.8158702Z triton_mm_67 0.0106 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8158943Z triton_mm_54 0.0112 ms 70.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8159186Z triton_mm_60 0.0113 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8159436Z triton_mm_68 0.0116 ms 68.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8159677Z triton_mm_59 0.0116 ms 68.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8159929Z triton_mm_61 0.0121 ms 65.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8160203Z triton_mm_53 0.0122 ms 64.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8160350Z SingleProcess AUTOTUNE benchmarking takes 0.2646 seconds and 0.8321 seconds precompiling for 39 choices 2025-12-04T12:10:20.8160436Z =================================== FAILURES =================================== 2025-12-04T12:10:20.8160615Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.8160681Z Traceback (most recent call last): 2025-12-04T12:10:20.8160854Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.8160912Z method(*args, **kwargs) 2025-12-04T12:10:20.8161083Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in 
wrapper 2025-12-04T12:10:20.8161139Z method(*args, **kwargs) 2025-12-04T12:10:20.8161306Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.8161362Z with policy(): 2025-12-04T12:10:20.8161529Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.8161590Z raise RuntimeError(msg) 2025-12-04T12:10:20.8162010Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:20.8162026Z 2025-12-04T12:10:20.8162120Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.8162407Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.8162410Z 2025-12-04T12:10:20.8162514Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.8162603Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.8162663Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.8162737Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.8163301Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.8163416Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.8163468Z graph_break [] 2025-12-04T12:10:20.8163563Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.8163651Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.8164151Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.8164229Z current_size = base.storage().size() 2025-12-04T12:10:20.8164288Z Autotune Choices Stats: 2025-12-04T12:10:20.8164673Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009960000403225422, "best_triton_pos": 0} 2025-12-04T12:10:20.8164754Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.8164829Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.8164944Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.8165198Z triton_mm_34 0.0100 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8165439Z triton_mm_29 0.0104 ms 95.4% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8165680Z triton_mm_16 0.0116 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8165918Z triton_mm_21 0.0118 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8166157Z triton_mm_22 0.0118 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8166417Z triton_mm_23 0.0118 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8166660Z triton_mm_15 0.0120 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8166905Z triton_mm_33 0.0120 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8167148Z triton_mm_30 0.0120 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8167392Z 
triton_mm_31 0.0128 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8167537Z SingleProcess AUTOTUNE benchmarking takes 0.1678 seconds and 8.5806 seconds precompiling for 33 choices 2025-12-04T12:10:20.8167638Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.8167698Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.8167773Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.8167898Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.8168395Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.8168454Z graph_break [] 2025-12-04T12:10:20.8168533Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.8168622Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.8169011Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.8169120Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.8169177Z Autotune Choices Stats: 2025-12-04T12:10:20.8169564Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00791999977082014, "best_triton_pos": 0} 2025-12-04T12:10:20.8169646Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.8169713Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.8169828Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.8170077Z triton_mm_72 0.0079 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8170366Z triton_mm_71 0.0095 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8170438Z _scaled_mm 0.0096 ms 82.5% 2025-12-04T12:10:20.8170679Z triton_mm_67 0.0106 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8170918Z triton_mm_54 0.0112 ms 70.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8171163Z triton_mm_60 0.0113 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8171402Z triton_mm_68 0.0116 ms 68.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8171643Z triton_mm_59 0.0116 ms 68.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8171898Z triton_mm_61 0.0121 ms 65.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8172143Z triton_mm_53 0.0122 ms 64.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8172305Z SingleProcess AUTOTUNE benchmarking takes 0.2646 seconds and 0.8321 seconds precompiling for 39 choices 2025-12-04T12:10:20.8172396Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.8172456Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.8172532Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.8172646Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.8173155Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 
1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.8173210Z graph_break [] 2025-12-04T12:10:20.8173292Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.8173384Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.8173441Z Autotune Choices Stats: 2025-12-04T12:10:20.8173826Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_109", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009200000204145908, "best_triton_pos": 0} 2025-12-04T12:10:20.8173902Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.8173966Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.8174082Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.8174335Z triton_mm_109 0.0092 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8174605Z triton_mm_110 0.0092 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8174849Z triton_mm_105 0.0109 ms 84.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8175088Z triton_mm_97 0.0117 ms 78.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8175329Z triton_mm_106 0.0117 ms 78.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8175572Z triton_mm_98 0.0119 ms 77.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8175826Z triton_mm_92 0.0120 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.8176073Z triton_mm_99 0.0120 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8176327Z triton_mm_91 0.0126 ms 73.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8176570Z triton_mm_107 0.0132 ms 69.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.8176712Z SingleProcess AUTOTUNE benchmarking takes 0.2780 seconds and 0.6345 seconds precompiling for 39 choices 2025-12-04T12:10:20.8176927Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3246e62b044bc4be.xml - 2025-12-04T12:10:20.8177012Z 
=========================== short test summary info ============================ 2025-12-04T12:10:20.8177652Z FAILED [1.6170s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:20.8177657Z 2025-12-04T12:10:20.8177750Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.8178041Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.8178043Z 2025-12-04T12:10:20.8178148Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.8178226Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:20.8178312Z ================= 1 failed, 187 deselected, 2 rerun in 14.99s ================== 2025-12-04T12:10:20.8178367Z Got exit code 1 2025-12-04T12:10:20.8178438Z Retrying single test... 
2025-12-04T12:10:20.8178598Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f15a9b444f5a4e4e.xml 2025-12-04T12:10:20.8178678Z ============================= test session starts ============================== 2025-12-04T12:10:20.8178812Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.8178871Z cachedir: .pytest_cache 2025-12-04T12:10:20.8179046Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.8179111Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.8179171Z configfile: pytest.ini 2025-12-04T12:10:20.8179353Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.8179445Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.8179729Z stepcurrent: skipping 107 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.8179790Z Running 1 items in this shard 2025-12-04T12:10:20.8179792Z 2025-12-04T12:10:20.8180216Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:13:38.630865948 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8180219Z 2025-12-04T12:10:20.8180393Z [W1204 11:13:45.259141205 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.8180416Z 2025-12-04T12:10:20.8180748Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.8181059Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8181210Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8181729Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8181999Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8182244Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8182468Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8182687Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8182931Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8183182Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8183424Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8183660Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8183901Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8184135Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8184378Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8184621Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8184864Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8185107Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8185348Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8185583Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8185824Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8186070Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8186277Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8186511Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8186753Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8186992Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8187197Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8187431Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8187682Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8187919Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8188162Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8188395Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8188614Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8188840Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8189024Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8189221Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8189766Z E1204 11:13:45.686000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp8buzzrwz/vf/cvfmuzex7rv6wbxqxrkwmeaquhcmau3tgmikjjnugpfnzr5bx6go.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.8189943Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.8190218Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.8190390Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.8190708Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.8190859Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.8191132Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.8191286Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.8191556Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.8191728Z E1204 11:13:45.686000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.8192010Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.8192175Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.8192464Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.8192676Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.8193007Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8193315Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8193461Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8193967Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8194246Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8194486Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8194708Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8194925Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8195179Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8195417Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8195659Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8195895Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8196136Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8196370Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8196611Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8196854Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8197099Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8197332Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8197575Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8197810Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8198051Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8198293Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8198497Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8198746Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8198986Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8199219Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8199432Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8199665Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8199908Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8200172Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8200414Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8200647Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8200865Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8201103Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8201279Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8201478Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8201595Z E1204 11:13:45.686000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.8201919Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.8202224Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8202370Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8202870Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8203149Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8203391Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8203608Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8203823Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8204077Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8204312Z E1204 11:13:45.816000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8204556Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8204789Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8205032Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8205266Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8205507Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8205750Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8205992Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8206225Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8206466Z E1204 11:13:45.816000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8206699Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8206940Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8207188Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8207395Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8207636Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8207882Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8208115Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8208329Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8208560Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8208802Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8209034Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8209275Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8209509Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8209727Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8209951Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8210172Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8210370Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8210911Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp8buzzrwz/bu/cbuuzm2viaiv5nmviavrp5nmecwrl3yqy5smpjhgakd34iwlkbed.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.8211075Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.8211305Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.8211476Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.8211792Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.8211952Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.8212224Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.8212383Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.8212652Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.8212822Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.8213124Z E1204 11:13:45.816000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.8213275Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.8213565Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.8213771Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.8214101Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8214408Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8214566Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8215054Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8215322Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8215560Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8215782Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8216016Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8216259Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8216496Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8216751Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8216985Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8217226Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8217466Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8217707Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8217940Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8218181Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8218414Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8218655Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8218890Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8219139Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8219374Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8219577Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8219810Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8220050Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8220328Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8220545Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8220778Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8221029Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8221260Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8221504Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8221736Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8221966Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8222192Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8222366Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8222560Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8222677Z E1204 11:13:45.816000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.8222848Z [W1204 11:13:45.297205312 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.8222852Z 2025-12-04T12:10:20.8223173Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.8223478Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8223636Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8224126Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8224392Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8224631Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8224851Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8225075Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8225317Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8225562Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8225803Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8226038Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8226290Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8226523Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8226764Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8226997Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8227240Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8227472Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8227714Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8227955Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8228196Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8228434Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8228637Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8228871Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8229113Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8229357Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8229560Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8229793Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8230045Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8230309Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8230550Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8230797Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8231015Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8231239Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8231411Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8231605Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8232145Z E1204 11:13:45.838000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp8buzzrwz/yh/cyhpxsxflgxt6nqs6ahc5soxw3ltnrmywhxypn65iyovg6yp3ont.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.8232307Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.8232553Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.8232724Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.8233024Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.8233172Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.8233443Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.8233595Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.8233878Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.8234049Z E1204 11:13:45.838000 
822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.8234333Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.8234493Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.8234783Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.8234989Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.8235325Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8235631Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8235776Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8236266Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8236533Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8236773Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8237004Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8237219Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8237461Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8237695Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8237936Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8238170Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8238419Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8238652Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8238903Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8239138Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8239378Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8239610Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8239862Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8240125Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8240367Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8240599Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8240805Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8241041Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8241282Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8241529Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8241733Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8241965Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8242206Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8242439Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8242680Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8242924Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8243141Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8243378Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8243552Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8243744Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8243862Z E1204 11:13:45.838000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.8244033Z [W1204 11:13:45.318436967 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.8244049Z 2025-12-04T12:10:20.8245648Z [W1204 11:13:45.319155487 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8245653Z 2025-12-04T12:10:20.8245825Z [W1204 11:13:45.324324540 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8245829Z 2025-12-04T12:10:20.8246153Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.8246461Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8246608Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8247101Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8247385Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8247625Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8247850Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call 
from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8248066Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8248310Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8248558Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8248803Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8249047Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8249288Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8249521Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8249762Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8250013Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8250284Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8250516Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8250757Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8250990Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8251231Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8251465Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8251683Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8251916Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8252157Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8252390Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8252594Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8252828Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8253078Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8253311Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8253567Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8253800Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8254018Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8254242Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 
2025-12-04T12:10:20.8254429Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8254626Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8255173Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp8buzzrwz/qc/cqc4eyd6o7wr33nr3us7ayg4boddpxlgd6t3ag24ccquyxuhfe3u.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.8255337Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.8255568Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.8255741Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.8256044Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.8256203Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.8256473Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 
2025-12-04T12:10:20.8256628Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.8256899Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.8257071Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.8257357Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.8257507Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.8257807Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.8258025Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.8258354Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8258661Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8258808Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8259309Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8259577Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8259819Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8260043Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8260303Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8260548Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8260797Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8261040Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8261273Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8261514Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8261848Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8262089Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8262337Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8262580Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8262825Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8263066Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8263302Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8263553Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8263785Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8263991Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8264224Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8264466Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8264699Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8264902Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8265134Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8265384Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8265625Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8265864Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8266097Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8266315Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8266540Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8266725Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8266918Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8267051Z E1204 11:13:45.858000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.8267373Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.8267681Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8267828Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8268328Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8268595Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8268833Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8269053Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8269268Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8269510Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8269755Z E1204 11:13:45.864000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8269997Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8270286Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8270529Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8270763Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8271003Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8271251Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8271492Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8271736Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8271978Z E1204 11:13:45.864000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8272209Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8272464Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8272700Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8272906Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8273139Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8273379Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8273611Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8273814Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8274045Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8274298Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8274530Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8274772Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8275003Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8275221Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8275445Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8275627Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8275822Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8276372Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp8buzzrwz/3k/c3kxod6ilq6ubrfj5fnkjkxkkslxqgcec42vickndloclrpjosf5.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.8276536Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.8276763Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.8276942Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.8277241Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.8277390Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.8277666Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.8277818Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.8278087Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.8278256Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.8278539Z E1204 11:13:45.864000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.8278700Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.8278989Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.8279196Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.8279527Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8279833Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8279976Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8280517Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8280796Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8281038Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8281261Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8281491Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8281734Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8281967Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8282211Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8282444Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8282687Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8282921Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8283175Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8283408Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8283651Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8283883Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8284125Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8284357Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8284612Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8284845Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8285059Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8285293Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8285534Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8285767Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8285981Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8286215Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8286456Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8286689Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8286934Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8287166Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8287388Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8287623Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8287797Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8287990Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8288108Z E1204 11:13:45.864000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.8288433Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.8288738Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8288882Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8289385Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8289662Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8289900Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8290154Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8290389Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8290632Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8290869Z E1204 11:13:45.868000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8291111Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8291345Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8291586Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8291819Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8292085Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8292316Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8292557Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8292789Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8293030Z E1204 11:13:45.868000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8293268Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8293536Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8293770Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8293990Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8294222Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8294461Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8294693Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8294910Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8295143Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8295385Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8295616Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8295858Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8296090Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8296307Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8296543Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8296715Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8296907Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8297451Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp8buzzrwz/o7/co7zkv32udemyplpnvfenp6mnv7sb5a5mzqj23arfu6glbalhm2t.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.8297613Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.8297840Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.8298023Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.8298324Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.8298481Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.8298749Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.8298902Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.8299170Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.8299350Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.8299633Z E1204 11:13:45.868000 822892 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.8299783Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.8300070Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.8300316Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.8300645Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8300950Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8301123Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8301616Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8301883Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8302122Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8302344Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8302571Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8302817Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8303069Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8303311Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8303543Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8303783Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8304035Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8304275Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8304508Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8304750Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8304983Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8305227Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8305460Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8305713Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8305945Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8306150Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8306381Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8306622Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8306853Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8307064Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8307297Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8307548Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8307784Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8308024Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8308266Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8308483Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.8308708Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.8308882Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.8309076Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.8309193Z E1204 11:13:45.868000 822892 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.8309264Z ('RERUN', {'yellow': True}) [11.4837s] [100%] 2025-12-04T12:10:20.8309631Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:13:47.329022566 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8309635Z 2025-12-04T12:10:20.8309795Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.8310150Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.8310459Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8310603Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8311093Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8311364Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8311617Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8311837Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8312065Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8312308Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8312542Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8312799Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8313032Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8313271Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8313504Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8313745Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8313979Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8314219Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8314464Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8314677Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8314899Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8315115Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8315356Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8315589Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8315792Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8316035Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame 
from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8316276Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8316519Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8316724Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8316956Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8317185Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8317388Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8317619Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8317830Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8318032Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8318265Z E1204 
11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8318506Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8318739Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8318991Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8319224Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8319436Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8319658Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8319874Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8320153Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8320402Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8320644Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8320895Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8321137Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8321371Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8321626Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8321857Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8322099Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8322332Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8322572Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8322805Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8323046Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8323291Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8323533Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8323766Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8324009Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8324242Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8324484Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8324725Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8324968Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8325214Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8325429Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8325640Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.8325881Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8326126Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8326367Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8326600Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8326840Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8327073Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8327315Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8327548Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8327798Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8328029Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8328271Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8328505Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8328716Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8328919Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8329160Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8329372Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8329604Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8329818Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8330060Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8330332Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8330558Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8330762Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8330998Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8331210Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8331415Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8331648Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8331888Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8332133Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8332373Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8332608Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8332819Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8333042Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.8333256Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8333501Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8333756Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8333974Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8334208Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8334422Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8334665Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8334911Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8335152Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8335387Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8335631Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8335866Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8336111Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8336346Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8336597Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8336832Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8337046Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8337253Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8337491Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8337707Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8337921Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8338150Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8338394Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8338641Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8338884Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8339123Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8339377Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8339615Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8339858Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8340138Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8340380Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8340614Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8340833Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8341060Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8341267Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8341492Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8341707Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8341951Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8342186Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8342404Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8342628Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8342843Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8343097Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8343333Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8343576Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8343820Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8344062Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8344297Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8344539Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8344774Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8345015Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8345251Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8345503Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8345738Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8345980Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8346214Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8346456Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8346690Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8346941Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8347175Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8347428Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8347662Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8347876Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8348083Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8348328Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8348572Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8348805Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8349047Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8349281Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8349522Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8349757Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8350008Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8350283Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8350528Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8350762Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8350966Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8351200Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8351460Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8351694Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8351951Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8352186Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8352413Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8352631Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8352857Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8353077Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8353319Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8353553Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8353780Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8353996Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8354209Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8354436Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8354678Z E1204 
11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8354913Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8355134Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8355348Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8355554Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8355717Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.8355970Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8356180Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8356425Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8356668Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8356903Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8357126Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8357332Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8357567Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8357780Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8357987Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8358224Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8358434Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8358673Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8358895Z E1204 11:13:47.881000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8359130Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8359337Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8359572Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8359861Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8360206Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8360524Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8360760Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8361033Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8361266Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8361508Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8361756Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8361999Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8362234Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8362455Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8362661Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8362895Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8363124Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8363339Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8363566Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8363783Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8364033Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8364269Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8364510Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8364749Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8364966Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8365209Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8365467Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8365703Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8365943Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8366193Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8366431Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8366653Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8366872Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8367084Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8367287Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8367522Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8367792Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8368054Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8368295Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8368529Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8368734Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8368968Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8369227Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8369470Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8369713Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8369957Z 
E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8370323Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8370556Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8370811Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8371057Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8371298Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8371533Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8371761Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8371978Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8372192Z E1204 11:13:47.881000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8372406Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8372662Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8372897Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8373103Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8373338Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8373579Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8373814Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8374069Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8374305Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8374558Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8374777Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8374988Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8375204Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8375456Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8375691Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8375933Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8376167Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8376412Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8376647Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8376851Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8377095Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8377336Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8377570Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8377811Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8378044Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8378270Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8378496Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8378712Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8378954Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8379195Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8379428Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8379671Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8379917Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8380219Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8380454Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8380695Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8380930Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8381172Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8381409Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8381637Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.8381855Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.8382063Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.8382279Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.8382511Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.8382737Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.8382966Z E1204 
11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.8383181Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.8383403Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.8383593Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.8383738Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.8383906Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.8384027Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.8384198Z E1204 11:13:47.881000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.8384371Z [W1204 11:13:47.348394475 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8384375Z 2025-12-04T12:10:20.8384535Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.8384844Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8385151Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8385297Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8385788Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8386067Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8386307Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8386526Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8386744Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8386987Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8387221Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8387471Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8387705Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8387956Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8388189Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8388429Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8388671Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8388927Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8389159Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8389372Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8389596Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8389810Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8390052Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8390345Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8390550Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8390783Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8391025Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8391260Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8391463Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8391698Z E1204 
11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8391938Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8392143Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8392396Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8392609Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8392815Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8393048Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8393305Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8393538Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8393778Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8394011Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8394224Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8394446Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8394661Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8394922Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8395154Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8395395Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8395627Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8395868Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8396101Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8396370Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8396616Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8396869Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8397102Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8397342Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8397598Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8397840Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8398072Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8398313Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8398544Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8398785Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8399020Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8399271Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8399504Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8399747Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8399979Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8400224Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8400437Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.8400680Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8400928Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8401168Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8401424Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8401667Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8401899Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8402159Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8402391Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8402634Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8402866Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8403106Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8403339Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8403565Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8403785Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8404020Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8404231Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8404453Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.8404667Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8404917Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8405148Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8405377Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8405585Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8405835Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8406055Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8406261Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8406514Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8406758Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8406995Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8407236Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8407467Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8407679Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8407900Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8408115Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8408375Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8408612Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8408830Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8409043Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8409259Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8409499Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8409745Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8409987Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8410271Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8410515Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8410762Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8411034Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8411269Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8411512Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8411749Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8411963Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8412170Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8412403Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8412620Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8412845Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8413063Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8413306Z 
E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8413541Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8413785Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8414018Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8414274Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8414508Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8414764Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8415000Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8415242Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8415486Z 
E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8415704Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8415918Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8416126Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8416351Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8416567Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8416809Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8417043Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8417270Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8417483Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8417698Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8417941Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8418178Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8418419Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8418671Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8418912Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8419163Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8419404Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8419639Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8419890Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8420175Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8420417Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8420652Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8420894Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8421127Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8421370Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8421618Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8421858Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8422092Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8422333Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8422568Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8422782Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8422988Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8423244Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8423486Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8423736Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8423977Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8424211Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8424463Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8424699Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8424940Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8425171Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8425413Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8425648Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8425854Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8426101Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8426341Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8426575Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8426815Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8427050Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8427277Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8427504Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8427719Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8427944Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8428185Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8428418Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8428646Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8428871Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8429084Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8429300Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8429541Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8429775Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8429991Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8430243Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.8430463Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8430624Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.8430859Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8431068Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8431302Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8431543Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8431777Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8432004Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8432209Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8432457Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8432669Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8432873Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8433107Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8433325Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8433559Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8433764Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8433997Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8434203Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8434439Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8434680Z E1204 11:13:47.887000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8434929Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8435172Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8435406Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8435647Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8435879Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8436123Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8436363Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8436607Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8436857Z E1204 11:13:47.887000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8437071Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8437277Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8437511Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8437753Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8437970Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8438183Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8438399Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8438643Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8438878Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8439121Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8439365Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8439571Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8439806Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8440048Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8440324Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8440567Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8440815Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8441045Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8441273Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8441487Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8441692Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8441896Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8442145Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8442386Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8442622Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8442863Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8443099Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8443304Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8443539Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8443804Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8444039Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8444281Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8444514Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8444721Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8444954Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8445204Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8445440Z E1204 11:13:47.887000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8445693Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8445928Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8446154Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8446372Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8446596Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8446813Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8447055Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8447287Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8447493Z E1204 11:13:47.887000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8447727Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8447970Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8448216Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8448459Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8448693Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8448919Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8449135Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8449349Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8449574Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8449817Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8450060Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8450331Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8450565Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8450807Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8451062Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8451268Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8451504Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8451747Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8451984Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8452230Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8452463Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8452701Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8452919Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8453133Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8453349Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8453592Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8453825Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8454078Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8454312Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8454573Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8454810Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8455051Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8455286Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8455537Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8455772Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8455985Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.8456203Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.8456409Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.8456619Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.8456850Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.8457082Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.8457294Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.8457500Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.8457705Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.8457891Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.8458032Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.8458194Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.8458312Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.8458463Z E1204 11:13:47.887000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.8458634Z [W1204 11:13:47.351065610 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8458646Z 2025-12-04T12:10:20.8458804Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.8459111Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8459423Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8459568Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8460067Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8460378Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8460616Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8460836Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8461051Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8461291Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8461538Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8461784Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8462018Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8462259Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8462492Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8462734Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8462978Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8463219Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8463464Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8463677Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8463899Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8464126Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8464371Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8464603Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8464806Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8465039Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8465281Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8465514Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8465717Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8465959Z E1204 
11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8466171Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8466390Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8466905Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8467401Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8467853Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8468332Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8468891Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8469422Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8469941Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8470493Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8470978Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8471466Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8471954Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8472470Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8473059Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8473572Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8474081Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8474592Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8475123Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8475632Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8476146Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8476657Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8477168Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8477676Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8478196Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8478708Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8479234Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8479744Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8480289Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8480810Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8481318Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8481830Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8482345Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8482857Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8483368Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8483853Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8484331Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.8484815Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8485329Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8485842Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8486353Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8486863Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8487374Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8487901Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8488410Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8488935Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8489444Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8489952Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8490520Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8491000Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8491447Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8491919Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8492398Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8492870Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.8493344Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8493834Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8494357Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8494840Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8495293Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8495766Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8496245Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8496690Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8497172Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8497685Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8498211Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8498723Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8499232Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8499713Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8500232Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8500710Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8501204Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8501717Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8502205Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8502671Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8503140Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8503647Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8504160Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8504673Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8505186Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8505699Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8506214Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8506746Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8507260Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8507787Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8508302Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8508788Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8509243Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8509726Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8510249Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8510714Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8511179Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8511674Z 
E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8512187Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8512698Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8513226Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8513740Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8514253Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8514770Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8515282Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8515794Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8516318Z 
E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8516809Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8517287Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8517742Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8518207Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8518682Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8519185Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8519702Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8520227Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8520692Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8521156Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8521648Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8522161Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8522688Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8523198Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8523714Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8524225Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8524738Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8525256Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8525781Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8526293Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8526819Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8527332Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8527846Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8528378Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8528888Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8529405Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8529916Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8530460Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8530974Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8531484Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8531980Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8532432Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8532908Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8533425Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8533937Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8534450Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8534962Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8535485Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8535998Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8536522Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8537032Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8537546Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8538070Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8538546Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8539022Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8539535Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8540046Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8540598Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8541109Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8541630Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8542113Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8546097Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8546598Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8547103Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8547631Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8548181Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8548668Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8549135Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8549632Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8550178Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8550705Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8551221Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8551700Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.8552160Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8552569Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.8553020Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8553505Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8554000Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8554532Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8555076Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8555565Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8556033Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8556518Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8557012Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8557474Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8557973Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8558450Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8558943Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8559422Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8559899Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8560412Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8560900Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8561421Z E1204 11:13:47.890000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8561938Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8562449Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8562965Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8563476Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8563988Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8564515Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8565029Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8565542Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8566055Z E1204 11:13:47.890000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8566541Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8566996Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8567483Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8567983Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8568477Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8568943Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8569407Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8569905Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8570480Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8570998Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8571514Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8571991Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8572470Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8572986Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8573501Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8574028Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8574540Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8575042Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8575525Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8575990Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8576456Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8576925Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8577403Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8577930Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8578444Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8578960Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8579472Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8579967Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8580503Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8581021Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8581535Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8582049Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8582562Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8583037Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8583526Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8584041Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8584555Z E1204 11:13:47.890000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8585070Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8585582Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8586086Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8586580Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8587052Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8587528Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8588022Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8588537Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8589014Z E1204 11:13:47.890000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8589500Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8590014Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8590575Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8591089Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8591606Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8592108Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8592590Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8593072Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8593537Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8594034Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8594548Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8595062Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8595575Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8596101Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8596614Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8597110Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8597588Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8598103Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8598620Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8599146Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8599662Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8600222Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8600703Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8601174Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8601640Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8602135Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8602665Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8603182Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8603695Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8604209Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8604723Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8605237Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8605763Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8606278Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8606805Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8607290Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.8607758Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.8608218Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.8608684Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.8609161Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.8609651Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.8610170Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.8610625Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.8611076Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.8611508Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.8611884Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.8612227Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.8612541Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.8612839Z E1204 11:13:47.890000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.8613190Z [W1204 11:13:47.392649592 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8613397Z 2025-12-04T12:10:20.8613559Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.8614068Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8614731Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8615230Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8615917Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8616729Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8617270Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8617764Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8618247Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8618739Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8619252Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8619764Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8620309Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8620819Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8621330Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8621855Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8622369Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8622879Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8623391Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8623874Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8624345Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8624830Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8625323Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8625845Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8626319Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8626790Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8627300Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8627826Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8628305Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8628781Z E1204 
11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8629260Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8629710Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8630222Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8630702Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8631163Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8631636Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8632145Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8632659Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8633168Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8633676Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8634170Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8634639Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8635124Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8635614Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8636124Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8636636Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8637160Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8637672Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8638188Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8638697Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8639208Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8639719Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8640256Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8640782Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8641290Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8641802Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8642311Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8642821Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8643332Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8643859Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8644372Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8644892Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8645401Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8645909Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8646431Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8646915Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8647376Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.8647863Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8648375Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8648888Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8649397Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8649907Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8650461Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8650972Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8651485Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8651993Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8652503Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8653011Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8653546Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8654026Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8654490Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8654961Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8655440Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8655919Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.8656392Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8656884Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8657396Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8657876Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8658324Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8658796Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8659275Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8659735Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8660238Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8660748Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8661259Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8661767Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8662277Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8662771Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8663241Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8663728Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8664221Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8664733Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8665223Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8665700Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8666165Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8666659Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8667171Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8667685Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8668198Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8668712Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8669238Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8669753Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8670315Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8670830Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8671345Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8671829Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8672294Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8672770Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8673270Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8673739Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8674205Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8674699Z 
E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8675224Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8675738Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8676255Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8676764Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8677275Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8677785Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8678297Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8678827Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8679341Z 
E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8679830Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8680337Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8680796Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8681263Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8681754Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8682247Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8682773Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8683260Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8683724Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8684190Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8684695Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8685209Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8685722Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8686235Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8686749Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8687260Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8687771Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8688302Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8688814Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8689327Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8689839Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8690395Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8690906Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8691432Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8691945Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8692469Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8692990Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8693503Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8694028Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8694540Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8695026Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8695477Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8695950Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8696462Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8696975Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8697490Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8698015Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8698526Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8699041Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8699553Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8700063Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8700691Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8701214Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8701687Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8702175Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8702687Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8703202Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8703727Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8704241Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8704738Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8705220Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8705689Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8706163Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8706664Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8707196Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8707702Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8708193Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8708669Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8709144Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8709644Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8710194Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8710693Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8711168Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.8711640Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8712045Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.8712477Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8712951Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8713438Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8713953Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8714469Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8714959Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8715411Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8715885Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8716367Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8716833Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8717306Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8717782Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8718254Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8718729Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8719203Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8719691Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8720201Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8720712Z E1204 11:13:47.931000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8720962Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8721204Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8721438Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8721698Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8721934Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8722176Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8722411Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8722655Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8722888Z E1204 11:13:47.931000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8723102Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8723320Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8723554Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8723782Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8724000Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8724216Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8724431Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8724688Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8724924Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8725178Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8725411Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8725616Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8725851Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8726102Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8726337Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8726579Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8726816Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8727045Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8727267Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8727485Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8727702Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8727906Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8728140Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8728383Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8728617Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8728860Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8729102Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8729307Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8729541Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8729793Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8730028Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8730309Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8730558Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8730763Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8730998Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8731240Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8731473Z E1204 11:13:47.931000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8731716Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8731951Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8732192Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8732409Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8732623Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8732839Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8733081Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8733317Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8733533Z E1204 11:13:47.931000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8733767Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8734009Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8734255Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8734498Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8734737Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8734974Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8735193Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8735407Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8735623Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8735864Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8736101Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8736344Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8736589Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8736832Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8737068Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8737272Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8737506Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8737748Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8737992Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8738234Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8738486Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8738714Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8738933Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8739146Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8739370Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8739613Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8739849Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8740117Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8740351Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8740593Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8740827Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8741087Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8741321Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8741565Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8741800Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8742011Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.8742230Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.8742446Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.8742660Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.8742901Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.8743124Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.8743338Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.8743545Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.8743764Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.8743953Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.8744096Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.8744256Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.8744377Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.8744519Z E1204 11:13:47.931000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.8744693Z [W1204 11:13:47.394744424 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8744695Z 2025-12-04T12:10:20.8744854Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.8745164Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8745481Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8745626Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8746124Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8746395Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8746635Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8746864Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8747079Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8747331Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8747565Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8747813Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8748050Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8748302Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8748542Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8748782Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8749016Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8749256Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8749490Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8749701Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8749934Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8750189Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8750430Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8750672Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8750875Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8751109Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8751363Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8751595Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8751813Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8752045Z E1204 
11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8752260Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8752466Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8752713Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8752930Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8753134Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8753368Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8753610Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8753848Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8754093Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8754339Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8754551Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8754775Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8754991Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8755236Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8755473Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8755732Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8755966Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8756221Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8756453Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8756697Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8756934Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8757186Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8757420Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8757660Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8757893Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8758133Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8758367Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8758608Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8758855Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8759096Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8759328Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8759569Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8759800Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8760042Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8760328Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8760545Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8760768Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.8761010Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8761242Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8761482Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8761728Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8761969Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8762203Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8762443Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8762675Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8762916Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8763148Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8763403Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8763637Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8763851Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8764056Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8764288Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8764499Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8764730Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.8764946Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8765198Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8765434Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8765646Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8765849Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8766091Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8766301Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8766505Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8766736Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8766978Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8767210Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8767450Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8767694Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8767909Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8768131Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8768345Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8768591Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8768830Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8769057Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8769271Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8769496Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8769738Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8769972Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8770263Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8770516Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8770760Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8770995Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8771236Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8771481Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8771726Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8771965Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8772205Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8772414Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8772657Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8772877Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8773099Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8773315Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8773582Z 
E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8773827Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8774084Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8774323Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8774570Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8774813Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8775071Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8775308Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8775553Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8775788Z 
E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8776007Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8776220Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8776427Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8776664Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8776879Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8777123Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8777357Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8777576Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8777790Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8778017Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8778260Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8778505Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8778746Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8778980Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8779223Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8779466Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8779710Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8779947Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8780237Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8780473Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8780713Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8780948Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8781201Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8781439Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8781683Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8781918Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8782161Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8782396Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8782651Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8782886Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8783113Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8783320Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8783555Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8783813Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8784049Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8784292Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8784527Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8784772Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8785006Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8785248Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8785482Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8785733Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8785970Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8786177Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8786411Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8786654Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8786886Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8787140Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8787373Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8787618Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8787837Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8788051Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8788286Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8788529Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8788766Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8788993Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8789212Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8789429Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8789644Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8789886Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8790169Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8790389Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8790603Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.8790813Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8790977Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.8791213Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8791434Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8791671Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8791928Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8792164Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8792381Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8792586Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8792832Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8793046Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8793250Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8793488Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8793693Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8793931Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8794137Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8794386Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8794589Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8794824Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8795067Z E1204 11:13:47.933000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8795301Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8795545Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8795790Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8796034Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8796270Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8796523Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8796761Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8797004Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8797250Z E1204 11:13:47.933000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8797464Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8797668Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8797902Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8798133Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8798352Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8798565Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8798795Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8799040Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8799275Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8799517Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8799751Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8799960Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8800230Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8800485Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8800719Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8800971Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8801205Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8801434Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8801665Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8801880Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8802086Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8802291Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8802525Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8802769Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8803003Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8803260Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8803492Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8803699Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8803935Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8804177Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8804413Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8804664Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8804901Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8805104Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8805354Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8805597Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8805831Z E1204 11:13:47.933000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8806084Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8806323Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8806553Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8806769Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8806984Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8807202Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8807445Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8807689Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8807894Z E1204 11:13:47.933000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8808129Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8808372Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8808609Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8808852Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8809095Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8809323Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8809540Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8809763Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8809978Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8810259Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8810509Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8810753Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8810988Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8811233Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8811468Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8811673Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8811907Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8812164Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8812399Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8812641Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8812878Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8813109Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8813325Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8813549Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8813766Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8814007Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8814256Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8814497Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8814732Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8814984Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8815220Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8815463Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8815696Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8815938Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8816172Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8816385Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.8816612Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.8816816Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.8817031Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.8817260Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.8817481Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.8817696Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.8817904Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.8818120Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.8818307Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.8818458Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.8818617Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.8818737Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.8818877Z E1204 11:13:47.933000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.8819049Z [W1204 11:13:47.396784468 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8819051Z 2025-12-04T12:10:20.8819219Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.8819531Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8819841Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8819986Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8820517Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8820784Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8821038Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8821260Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8821475Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8821718Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8821954Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8822197Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8822448Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8822691Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8822938Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8823179Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8823411Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8823652Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8823897Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8824109Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8824332Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8824548Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8824795Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8825028Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8825232Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8825476Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8825716Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8825948Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8826152Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8826383Z E1204 
11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8826598Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8826814Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8827049Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8827270Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8827473Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8827707Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8827948Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8828190Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8828431Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8828667Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8828877Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8829099Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8829314Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8829558Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8829803Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8830043Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8830324Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8830566Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8830800Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8831041Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8831285Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8831527Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8831778Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8832021Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8832254Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8832495Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8832744Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8832987Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8833220Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8833460Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8833693Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8833933Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8834166Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8834420Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8834651Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8834869Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8835079Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.8835321Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8835555Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8835804Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8836038Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8836287Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8836521Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8836762Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8836996Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8837247Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8837479Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8837720Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8837952Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8838165Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8838368Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8838601Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8838830Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8839052Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.8839269Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8839511Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8839743Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8839955Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8840214Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8840448Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8840670Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8840872Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8841104Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8841346Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8841594Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8841837Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8842070Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8842281Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8842503Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8842717Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8842964Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8843213Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8843429Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8843643Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8843859Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8844102Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8844336Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8844589Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8844825Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8845078Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8845314Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8845556Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8845791Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8846042Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8846278Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8846491Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8846696Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8846930Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8847147Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8847359Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8847583Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8847829Z 
E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8848063Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8848306Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8848542Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8848783Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8849027Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8849269Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8849514Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8849755Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8849991Z 
E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8850245Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8850473Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8850682Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8850908Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8851122Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8851364Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8851600Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8851817Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8852042Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8852257Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8852501Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8852737Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8852979Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8853215Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8853470Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8853705Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8853959Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8854192Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8854435Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8854670Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8854928Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8855164Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8855406Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8855640Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8855882Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8856117Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8856360Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8856604Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8856846Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8857080Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8857297Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8857501Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8857736Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8857989Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8858224Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8858477Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8858712Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8858954Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8859198Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8859440Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8859677Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8859919Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8860196Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8860402Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8860637Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8860878Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8861125Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8861368Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8861602Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8861830Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8862048Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8862261Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8862488Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8862733Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8865239Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8865477Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8865697Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8865911Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8866151Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8866396Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8866635Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8866855Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8867068Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.8867275Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8867439Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.8867690Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8867895Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8868133Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8868378Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8868613Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8868830Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8869046Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8869286Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8869509Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8869712Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8869950Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8870201Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8870451Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8870657Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8870891Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8871095Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8871333Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8871578Z E1204 11:13:47.935000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8871814Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8872070Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8872303Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8872551Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8872785Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8873029Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8873265Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8873519Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8873754Z E1204 11:13:47.935000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8873968Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8874192Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8874426Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8874655Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8874885Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8875099Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8875316Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8875560Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8875797Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8876039Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8876276Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8876494Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8876727Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8876971Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8877205Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8877449Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8877688Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8877915Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8878144Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8878358Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8878576Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8878780Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8879016Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8879267Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8879502Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8879748Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8879982Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8880229Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8880463Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8880707Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8880962Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8881202Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8881438Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8881642Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8881877Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8882120Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8882357Z E1204 11:13:47.935000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8882613Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8882847Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8883089Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8883306Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8883520Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8883746Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8883990Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8884229Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8884436Z E1204 11:13:47.935000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8884671Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8884914Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8885150Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8885401Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8885635Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8885864Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8886082Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8886296Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8886512Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8886758Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8887002Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8887245Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8887489Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8887733Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8887971Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8888184Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8888422Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8888663Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8888901Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8889145Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8889379Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8889608Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8889834Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8890048Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8890296Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8890541Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8890778Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8891019Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8891255Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8891518Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8891754Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8892009Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8892247Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8892489Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8892734Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8892949Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.8893166Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.8893372Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.8893583Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.8893814Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.8894039Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.8894265Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.8894471Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.8894681Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.8894870Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.8895013Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.8895175Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.8895296Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.8895437Z E1204 11:13:47.935000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.8895507Z ('RERUN', {'yellow': True}) [1.8259s] [100%] 2025-12-04T12:10:20.8895886Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:13:49.958659109 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8895889Z 2025-12-04T12:10:20.8896061Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.8896372Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8896684Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8896831Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8897337Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8897609Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8897849Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8898071Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8898285Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8898529Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8898781Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8899023Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8899257Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8899497Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8899732Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8899973Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8900248Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8900489Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8900734Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8900949Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8901174Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8901389Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8901647Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8901880Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8902085Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8902317Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8902561Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8902793Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8903001Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8903249Z E1204 
11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8903461Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8903665Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8903898Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8904110Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8904313Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8904547Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8904797Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8905030Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8905283Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8905515Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8905727Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8905959Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8906175Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8906415Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8906648Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8906889Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8907120Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8907361Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8907609Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8907850Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8908086Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8908328Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8908562Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8908802Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8909045Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8909286Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8909529Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8909768Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8910000Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8910276Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8910520Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8910765Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8910999Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8911241Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8911474Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8911688Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8911899Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.8912154Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8912387Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8912629Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8912864Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8913104Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8913336Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8913589Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8913822Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8914075Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8914308Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8914548Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8914781Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8915008Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8915214Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8915447Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8915658Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8915880Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.8916095Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8916337Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8916581Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8916793Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8916995Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8917228Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8917439Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8917645Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8917886Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8918128Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8918370Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8918610Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8918843Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8919054Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8919286Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8919502Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8919745Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8919983Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8920236Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8920450Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8920665Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8920927Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8921161Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8921403Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8921638Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8921880Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8922115Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8922371Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8922606Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8922860Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8923093Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8923306Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8923511Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8923765Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8923984Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8924196Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8924412Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8924655Z 
E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8924893Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8925137Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8925382Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8925625Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8925860Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8926102Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8926336Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8926580Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8926822Z 
E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8927043Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8927269Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8927477Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8927702Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8927917Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8928170Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8928406Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8928624Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8928838Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8929053Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8929296Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8929533Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8929784Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8930017Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8930291Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8930526Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8930766Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8931001Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8931255Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8931490Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8931746Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8931982Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8932224Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8932459Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8932724Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8932959Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8933202Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8933435Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8933678Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8933912Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8934125Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8934345Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8934580Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8934821Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8935056Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8935296Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8935530Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8935781Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8936017Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8936268Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8936503Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8936747Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8936990Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8937196Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8937430Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8937672Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8937906Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8938149Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8938383Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8938610Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8938837Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8939051Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8939268Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8939510Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8939745Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8939973Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8940234Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8940449Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8940682Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8940925Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8941159Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8941378Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8941607Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.8941814Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8941978Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.8942211Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8942416Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8942651Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8942893Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8943140Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8943352Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8943558Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8943795Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8944009Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8944214Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8944450Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8944665Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8944900Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8945114Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8945348Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8945551Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8945795Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8946038Z E1204 11:13:49.497000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8946276Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8946518Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8946752Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8946995Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8947229Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8947480Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8947713Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8947955Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8948188Z E1204 11:13:49.497000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8948401Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8948607Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8948841Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8949079Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8949300Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8949525Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8949740Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8949982Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8950262Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8950505Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8950738Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8950945Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8951181Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8951423Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8951659Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8951914Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8952148Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8952375Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8952593Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8952808Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8953016Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.8953350Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8953602Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8953847Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8954094Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8954337Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8954572Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.8954788Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8955023Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8955264Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8955501Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8955744Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8955983Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8956191Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8956424Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8956676Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8956909Z E1204 11:13:49.497000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8957152Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8957386Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8957614Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8957831Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8958063Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8958279Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8958531Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8958766Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8958971Z E1204 11:13:49.497000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8959214Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8959459Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8959693Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8959935Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8960204Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8960435Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8960659Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8960872Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8961101Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8961343Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8961580Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8961822Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8962059Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8962302Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8962547Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8962752Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8963000Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8963243Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8963476Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8963731Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8963967Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8964267Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.8964486Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8964699Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8964915Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8965156Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8965393Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8965648Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8965881Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8966122Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8966357Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8966600Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8966833Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8967084Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8967320Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8967542Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.8967761Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.8967967Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.8968188Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.8968417Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.8968640Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.8968853Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.8969059Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.8969267Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.8969453Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.8969597Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.8969767Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.8969888Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.8970029Z E1204 11:13:49.497000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.8970252Z [W1204 11:13:49.961355264 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.8970255Z 2025-12-04T12:10:20.8970415Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.8970725Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.8971034Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.8971179Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.8971688Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.8971969Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.8972209Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.8972431Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.8972659Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8972904Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8973140Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8973382Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8973616Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8973857Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8974089Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8974346Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8974579Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8974820Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8975054Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8975268Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8975492Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8975718Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8975959Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8976203Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8976406Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8976641Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8976884Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8977125Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8977333Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8977565Z E1204 
11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8977776Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8977979Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8978212Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8978424Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8978635Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8978869Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8979110Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8979345Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8979587Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8979823Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8980034Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8980316Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8980532Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8980785Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8981019Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8981262Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8981508Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8981750Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8981982Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8982224Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8982457Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8982698Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8982934Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8983185Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8983417Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8983657Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8983891Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8984131Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8984366Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8984618Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8984850Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8985100Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8985331Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8985572Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8985807Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8986033Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8986244Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.8986485Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8986717Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8986959Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8987193Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8987433Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8987678Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8987922Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8988153Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8988394Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8988626Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8988867Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8989109Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8989322Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8989538Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8989770Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8989981Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8990237Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.8990475Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8990718Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8990952Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8991164Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8991366Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8991600Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8991810Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8992026Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8992257Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8992501Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8992734Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8992974Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8993207Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8993417Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8993651Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.8993865Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8994126Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8994363Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8994580Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8994802Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8995018Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8995260Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8995496Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8995738Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8995973Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8996215Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8996462Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8996705Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8996940Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8997181Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8997416Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8997632Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.8997835Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.8998080Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8998295Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.8998519Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.8998734Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.8998978Z 
E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.8999227Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8999470Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.8999705Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.8999947Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9000225Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9000468Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9000702Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9000960Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9001195Z 
E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9001415Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9001630Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9001840Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9002066Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9002280Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9002537Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9002773Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9003003Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9003218Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9003435Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9003695Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9003935Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9004177Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9004412Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9004654Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9004888Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9005130Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9005377Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9005618Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9005919Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9006161Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9006398Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9006640Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9006885Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9007129Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9007383Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9007624Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9007859Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9008101Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9008346Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9008561Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9008768Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9009002Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9009245Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9009478Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9009721Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9009964Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9010252Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9010488Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9010730Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9010965Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9011209Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9011457Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9011664Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9011910Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9012154Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9012388Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9012631Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9012877Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9013107Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9013325Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9013539Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9013756Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9013997Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9014232Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9014471Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9014688Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9014905Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9015120Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9015362Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9015597Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9015825Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9016039Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.9016257Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9016421Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.9016657Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9016864Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9017108Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9017353Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9017586Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9017802Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9018009Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9018245Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9018458Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9018672Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9018907Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9019111Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9019347Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9019551Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9019787Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9019990Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9020270Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9020516Z E1204 11:13:49.500000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9020764Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9021007Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9021241Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9021497Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9021733Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9021976Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9022211Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9022453Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9022689Z E1204 11:13:49.500000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9022903Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9023110Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9023361Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9023589Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9023808Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9024022Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9024238Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9024482Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9024731Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9024975Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9025218Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9025426Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9025659Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9025914Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9026150Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9026392Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9026629Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9026856Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9027073Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9027286Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9027492Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9027707Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9027943Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9028185Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9028420Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9028663Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9028895Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9029112Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9029347Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9029598Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9029833Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9030076Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9030355Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9030560Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9030795Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9031040Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9031272Z E1204 11:13:49.500000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9031515Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9031748Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9031977Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9032207Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9032422Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9032639Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9032882Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9033120Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9033325Z E1204 11:13:49.500000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9033572Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9033816Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9034065Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9034308Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9034540Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9034777Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9034996Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9035211Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9035426Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9035670Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9035907Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9036148Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9036382Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9036633Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9036867Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9037081Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9037316Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9037558Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9037791Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9038042Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9038277Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9038514Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9038731Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9038943Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9039168Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9039410Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9039647Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9039889Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9040156Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9040399Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9040633Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9040874Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9041126Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9041369Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9041603Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9041817Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.9042036Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.9042243Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.9042465Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.9042694Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.9042931Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.9043143Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.9043349Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.9043556Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.9043753Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.9043895Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.9044055Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.9044175Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.9044315Z E1204 11:13:49.500000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9044488Z [W1204 11:13:49.964817339 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9044490Z 2025-12-04T12:10:20.9044649Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9044958Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9045266Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9045421Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9045913Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9046181Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9046421Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9046640Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9046866Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9047109Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9047353Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9047596Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9047828Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9048078Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9048312Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9048552Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9048785Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9049025Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9049260Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9049471Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9049704Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9049918Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9050204Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9050437Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9050641Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9050874Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9051126Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9051360Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9051566Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9051813Z E1204 
11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9052025Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9052228Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9052472Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9052683Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9052884Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9053118Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9053359Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9053592Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9053832Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9054065Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9054288Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9054510Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9054725Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9054965Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9055199Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9055439Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9055687Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9055928Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9056172Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9056415Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9056648Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9056909Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9057141Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9057382Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9057615Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9057857Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9058089Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9058329Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9058574Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9058814Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9059048Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9059290Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9059523Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9059765Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9060007Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9060259Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9060483Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.9060726Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9060960Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9061201Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9061447Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9061688Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9061921Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9062161Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9062395Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9062636Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9062869Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9063126Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9063360Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9063572Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9063775Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9064013Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9064224Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9064458Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.9064676Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9064916Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9065159Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9065371Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9065574Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9065817Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9066028Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9066231Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9066464Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9066706Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9066938Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9067179Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9067425Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9067634Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9067857Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9068071Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9068318Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9068555Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9068773Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9068996Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9069212Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9069466Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9069699Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9069942Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9070219Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9070461Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9070696Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9070939Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9071174Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9071416Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9071652Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9071879Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9072084Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9072319Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9072535Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9072749Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9072965Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9073210Z 
E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9073457Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9073701Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9073953Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9074196Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9074430Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9074681Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9074918Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9075158Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9075392Z 
E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9075612Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9075825Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9076033Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9076267Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9076481Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9076723Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9076959Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9077176Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9077388Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9077603Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9077858Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9078095Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9078346Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9078581Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9078829Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9079071Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9079314Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9079547Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9079789Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9080023Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9080308Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9080547Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9080801Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9081036Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9081278Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9081512Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9081756Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9081990Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9082243Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9082477Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9082702Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9082908Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9083145Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9083388Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9083634Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9083876Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9084111Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9084353Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9084586Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9084828Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9085064Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9085317Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9085551Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9085757Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9085992Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9086235Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9086471Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9086724Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9086960Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9087200Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9087419Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9087638Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9087856Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9088111Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9088354Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9088582Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9088800Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9089013Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9089231Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9089475Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9089721Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9089943Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9090203Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.9090410Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9090572Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.9090810Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9091014Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9091270Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9091519Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9091770Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9091987Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9092191Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9092443Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9092658Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9092865Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9093103Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9093310Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9093547Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9093751Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9093986Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9094202Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9094439Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9094687Z E1204 11:13:49.503000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9094924Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9095167Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9095403Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9095663Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9095898Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9096152Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9096388Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9096629Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9096875Z E1204 11:13:49.503000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9097089Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9097296Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9097532Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9097761Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9097983Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9098196Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9098411Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9098665Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9098904Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9099148Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9099383Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9099594Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9099830Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9100085Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9100358Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9100614Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9100851Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9101078Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9101297Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9101523Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9101734Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9101941Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9102178Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9102424Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9102658Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9102902Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9103149Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9103353Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9103588Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9103834Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9104070Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9104314Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9104560Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9104765Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9105012Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9105253Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9105487Z E1204 11:13:49.503000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9105730Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9105972Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9106202Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9106419Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9106635Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9106851Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9107095Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9107331Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9107550Z E1204 11:13:49.503000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9107785Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9108027Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9108261Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9108502Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9108738Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9108978Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9109197Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9109423Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9109637Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9109881Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9110153Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9110408Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9110644Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9110885Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9111120Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9111324Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9111561Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9111806Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9112054Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9112295Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9112530Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9112758Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9112974Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9113187Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9113417Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9113661Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9113912Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9114154Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9114389Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9114642Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9114877Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9115120Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9115354Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9115598Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9115831Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9116045Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.9116263Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.9116481Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.9116693Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.9116923Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.9117146Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.9117357Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.9117564Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.9117779Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.9117966Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.9118118Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0
2025-12-04T12:10:20.9118278Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0
2025-12-04T12:10:20.9118397Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] .
2025-12-04T12:10:20.9118538Z E1204 11:13:49.503000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice.
2025-12-04T12:10:20.9118713Z [W1204 11:13:49.005740909 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...
2025-12-04T12:10:20.9118715Z
2025-12-04T12:10:20.9118876Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning:
2025-12-04T12:10:20.9119200Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.
2025-12-04T12:10:20.9119509Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9119655Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9120183Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9120451Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9120706Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9120925Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9121142Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9121385Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9121618Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9121862Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9122095Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9122348Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9122580Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9122835Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9123068Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9123309Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9123564Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9123776Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9123999Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9124214Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9124457Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9124692Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9124896Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9125139Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9125378Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9125611Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9125815Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9126050Z E1204 
11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9126263Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9126464Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9126710Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9126922Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9127134Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9127368Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9127609Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9127842Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9128092Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9128327Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9128538Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9128763Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9128977Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9129221Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9129454Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9129704Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9129938Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9130219Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9130453Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9130695Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9130927Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9131181Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9131413Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9131668Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9131900Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9132141Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9132384Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9132626Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9132859Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9133099Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9133332Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9133572Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9133807Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9134062Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9134294Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9134510Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9134720Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.9134964Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9135196Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9135436Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9135681Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9135923Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9136167Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9136409Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9136643Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9136893Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9137128Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9137370Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9137602Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9137815Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9138017Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9139160Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9139389Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9139615Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.9139829Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9140074Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9140348Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9140561Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9140763Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9140994Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9141205Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9141432Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9141667Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9141910Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9142158Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9142403Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9142637Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9142852Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9143074Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9143289Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9143535Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9143816Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9144047Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9144260Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9144478Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9144720Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9144957Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9145199Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9145434Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9145679Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9145924Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9146167Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9146401Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9146654Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9146891Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9147106Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9147311Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9147545Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9147763Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9147980Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9148214Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9148466Z 
E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9148701Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9148944Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9149179Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9149422Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9149657Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9149898Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9150183Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9150438Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9150674Z 
E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9150890Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9151116Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9151324Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9151551Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9151767Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9152010Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9152249Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9152466Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9152692Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9152925Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9153169Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9153405Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9153648Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9153884Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9154125Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9154359Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9154601Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9154849Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9155091Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9155326Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9155578Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9155815Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9156058Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9156292Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9156532Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9156766Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9157007Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9157271Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9157512Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9157748Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9157962Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9158168Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9158405Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9158646Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9158880Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9159120Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9159367Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9159609Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9159843Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9160132Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9160368Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9160611Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9160844Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9161050Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9161286Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9161541Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9161790Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9162032Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9162267Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9162494Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9162714Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9162928Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9163142Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9163388Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9163635Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9163863Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9164079Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9164293Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9164517Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9164759Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9164994Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9165211Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9165423Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.9165629Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9165791Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.9166038Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9166252Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9166487Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9166728Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9166962Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9167177Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9167384Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9167619Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9169523Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9169748Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9169984Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9170240Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9170475Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9170698Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9170937Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9171144Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9171379Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9171624Z E1204 11:13:49.544000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9171859Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9172117Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9172372Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9172617Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9172856Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9173100Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9173337Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9173589Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9173828Z E1204 11:13:49.544000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9174051Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9174270Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9174505Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9174736Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9174952Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9175178Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9175396Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9175646Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9175885Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9176128Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9176365Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9176591Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9176835Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9177077Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9177314Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9177562Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9177797Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9178026Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9178244Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9178463Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9178686Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9178893Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9179130Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9179372Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9179620Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9179864Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9180150Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9180355Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9180590Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9180836Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9181088Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9181344Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9181578Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9181788Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9182021Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9182264Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9182499Z E1204 11:13:49.544000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9182739Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9182973Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9183213Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9183434Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9183647Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9183864Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9184122Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9184358Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9184565Z E1204 11:13:49.544000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9184798Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9185041Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9185274Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9185530Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9185775Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9186002Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9186220Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9186437Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9186658Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9186904Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9187140Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9187385Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9187631Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9187877Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9188110Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9188315Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9188559Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9188803Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9189041Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9189281Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9189518Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9189744Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9189973Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9190254Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9190468Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9190711Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9190946Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9191189Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9191429Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9191673Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9191909Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9192165Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9192400Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9192727Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9192985Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9193197Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.9193417Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.9193626Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.9193836Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.9194068Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.9194289Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.9194517Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.9194737Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.9194946Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.9195133Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.9195276Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.9195438Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.9195560Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.9195703Z E1204 11:13:49.544000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9195874Z [W1204 11:13:49.008026099 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9195878Z 2025-12-04T12:10:20.9196037Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9196350Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9196672Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9196819Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9197316Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9197602Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9197844Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9198067Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9198282Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9198527Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9198766Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9199021Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9199263Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9199504Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9199737Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9199979Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9200244Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9200484Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9200717Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9200932Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9201174Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9201391Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9201635Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9201882Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9202086Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9202323Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9202564Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9202795Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9202999Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9203232Z E1204 
11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9203459Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9203675Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9203906Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9204119Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9204321Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9204556Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9204799Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9205029Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9205272Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9205514Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9205727Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9205953Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9206169Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9206422Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9206657Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9206899Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9207130Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9207370Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9207600Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9207853Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9208097Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9208343Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9208577Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9208817Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9209051Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9209290Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9209522Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9209764Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9210016Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9210304Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9210534Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9210792Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9211027Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9211268Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9211501Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9211716Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9211930Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.9212171Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9212431Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9212675Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9212908Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9213150Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9213382Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9213626Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9213858Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9214098Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9214331Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9214584Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9214818Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9215032Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9215246Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9215479Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9215690Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9215914Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.9216127Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9216369Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9216600Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9216841Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9217043Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9217275Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9217490Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9217695Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9217928Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9218169Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9218402Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9218643Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9218890Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9219104Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9219325Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9219539Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9219797Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9220040Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9220297Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9220512Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9220728Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9220969Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9221217Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9221470Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9221707Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9221949Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9222183Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9222432Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9222667Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9222909Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9223142Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9223369Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9223573Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9223807Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9224023Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9224247Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9224465Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9224709Z 
E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9224946Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9225187Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9225422Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9225681Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9225932Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9226179Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9226413Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9226656Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9226891Z 
E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9227110Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9227323Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9227529Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9227765Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9227980Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9228224Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9228461Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9228688Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9228904Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9229118Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9229361Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9229595Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9229837Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9230082Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9230377Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9230615Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9230859Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9231094Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9231336Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9231570Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9231810Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9232044Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9232302Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9232536Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9232777Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9233024Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9233271Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9233508Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9233749Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9233983Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9234197Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9234403Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9234665Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9234908Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9235142Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9235384Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9235623Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9235867Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9236101Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9236343Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9236588Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9236832Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9237066Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9237270Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9237514Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9237758Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9237993Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9238238Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9238473Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9238699Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9238927Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9239149Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9239364Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9239604Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9239839Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9240068Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9240320Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9240534Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9240752Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9240993Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9241241Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9241458Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9241671Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.9241889Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9242054Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.9242289Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9242495Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9242731Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9242974Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9243208Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9243438Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9243654Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9243888Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9244101Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9244304Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9244541Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9244747Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9244993Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9245200Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9245444Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9245649Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9245883Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9246126Z E1204 11:13:49.547000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9246370Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9246614Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9246849Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9247090Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9247326Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9247570Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9247816Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9248069Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9248303Z E1204 11:13:49.547000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9248516Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9248720Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9248955Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9249184Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9249400Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9249615Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9249842Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9250087Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9250374Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9250619Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9250868Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9251073Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9251309Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9251549Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9251784Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9252026Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9252276Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9252521Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9252738Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9252952Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9253229Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9253436Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9253670Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9253913Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9254148Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9254406Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9254645Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9254851Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9255085Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9255339Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9255575Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9255818Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9256051Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9256255Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9256489Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9256743Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9256990Z E1204 11:13:49.547000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9257235Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9257470Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9257697Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9257914Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9258127Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9258342Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9258583Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9258827Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9259032Z E1204 11:13:49.547000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9259268Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9259513Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9259764Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9260007Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9260274Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9260502Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9260721Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9260935Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9261169Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9261424Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9261657Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9261903Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9262140Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9262383Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9262618Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9262822Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9263057Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9263313Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9263547Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9263789Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9264023Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9264262Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9264484Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9264698Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9264913Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9265154Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9265391Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9265644Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9265887Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9266129Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9266363Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9266608Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9266845Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9267087Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9267322Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9267534Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.9267760Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.9267966Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.9268177Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.9268405Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.9268638Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.9268854Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.9269062Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.9269268Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.9269454Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.9269597Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.9269756Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.9269876Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.9270038Z E1204 11:13:49.547000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9270247Z [W1204 11:13:49.010080413 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9270249Z 2025-12-04T12:10:20.9270409Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9270718Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9271027Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9271174Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9271675Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9271943Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9272196Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9272417Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9272631Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9272873Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9273120Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9273364Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9273598Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9273841Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9274074Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9274314Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9274560Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9274813Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9275044Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9275256Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9275479Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9275694Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9275935Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9276173Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9276378Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9276626Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9276868Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9277100Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9277303Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9277544Z E1204 
11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9277757Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9277960Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9278191Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9278403Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9278608Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9278851Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9279101Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9279332Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9279573Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9279804Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9280016Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9280270Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9280484Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9280725Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9280974Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9281216Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9281448Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9281688Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9281932Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9282175Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9282408Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9282649Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9282887Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9283128Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9283374Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9283628Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9283860Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9284101Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9284334Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9284577Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9284809Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9285049Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9285281Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9285535Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9285767Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9285983Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9286204Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.9286445Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9286681Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9286921Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9287155Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9287395Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9287629Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9287884Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9288127Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9288369Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9288604Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9288845Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9289078Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9289289Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9289493Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9289725Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9289948Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9290211Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.9290428Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9290685Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9290921Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9291134Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9291337Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9291570Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9291781Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9291985Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9292230Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9292490Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9292731Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9292972Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9293205Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9293416Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9293639Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9293852Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9294100Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9294348Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9294565Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9294780Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9294994Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9295251Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9295486Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9295730Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9295964Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9296208Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9296442Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9296693Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9296939Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9297181Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9297417Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9297635Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9297840Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9298076Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9298293Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9298507Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9298734Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9298978Z 
E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9299212Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9299452Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9299698Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9299942Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9300221Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9300463Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9300697Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9300939Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9301187Z 
E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9301417Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9301628Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9301836Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9302063Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9302280Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9302525Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9302757Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9302976Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9303202Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9303420Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9303663Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9303896Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9304150Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9304388Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9304630Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9304864Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9305106Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9305341Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9305593Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9305838Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9306078Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9306312Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9306553Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9306789Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9307031Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9307263Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9307505Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9307750Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9307992Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9308225Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9308448Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9308653Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9308888Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9309133Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9309369Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9309612Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9309848Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9310159Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9310393Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9310634Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9310868Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9311110Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9311345Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9311551Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9311788Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9312030Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9312277Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9312519Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9312751Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9312990Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9313209Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9313421Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9313637Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9313878Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9314118Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9314345Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9314586Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9314799Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9315014Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9315258Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9315493Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9315712Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9315925Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.9316132Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9316297Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.9316547Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9316754Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9316990Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9317234Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9317479Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9317696Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9317904Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9318139Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9318351Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9318556Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9318805Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9319019Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9319255Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9319459Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9319693Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9319901Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9320171Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9320417Z E1204 11:13:49.549000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9320650Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9320907Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9321143Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9321388Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9321623Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9321877Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9322112Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9322355Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9322588Z E1204 11:13:49.549000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9322801Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9323005Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9323254Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9323492Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9323714Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9323928Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9324145Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9324388Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9324623Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9324864Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9325097Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9325316Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9325550Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9325792Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9326027Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9326287Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9326522Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9326750Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9326966Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9327177Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9327385Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9327589Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9327845Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9328088Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9328324Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9328569Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9328805Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9329009Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9329243Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9329485Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9329730Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9329972Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9330253Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9330458Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9330706Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9330953Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9331187Z E1204 11:13:49.549000 
822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9331428Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9331661Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9331890Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9332119Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9332344Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9332559Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9332800Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9333035Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9333242Z E1204 11:13:49.549000 822892 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9333478Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9333719Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9333952Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9334207Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9334440Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9334669Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9334884Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9335107Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9335326Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9335569Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9335811Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9336054Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9336291Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9336542Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9336791Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9336996Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9337229Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9337471Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9337706Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9337949Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9338186Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9338416Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9338646Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9338860Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9339075Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9339316Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9339560Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9339802Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9340039Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9340316Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9340548Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9340793Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9341042Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9341295Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9341530Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9341742Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.9341960Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.9342165Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.9342376Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.9342604Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.9342826Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.9343067Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.9343276Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.9343483Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.9343669Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.9343810Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.9343982Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.9344103Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.9344244Z E1204 11:13:49.549000 822892 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9344306Z FAILED [1.6498s] [100%] 2025-12-04T12:10:20.9344308Z 2025-12-04T12:10:20.9344383Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.9344560Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.9344623Z Traceback (most recent call last): 2025-12-04T12:10:20.9344803Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9344862Z method(*args, **kwargs) 2025-12-04T12:10:20.9345030Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9345088Z method(*args, **kwargs) 2025-12-04T12:10:20.9345253Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.9345330Z with policy(): 2025-12-04T12:10:20.9345499Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.9345557Z raise RuntimeError(msg) 2025-12-04T12:10:20.9345988Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. 
CUDA driver allocated memory was 807403520 and is now 1954545664. 2025-12-04T12:10:20.9345990Z 2025-12-04T12:10:20.9346087Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.9346379Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.9346383Z 2025-12-04T12:10:20.9346491Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.9346585Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9346647Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9346722Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9347295Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9347421Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9347476Z graph_break [] 2025-12-04T12:10:20.9347560Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9347650Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9348157Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.9348223Z current_size = base.storage().size() 2025-12-04T12:10:20.9348291Z Autotune Choices Stats: 2025-12-04T12:10:20.9348690Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00848000030964613, "best_triton_pos": 0} 2025-12-04T12:10:20.9348772Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9348837Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9348952Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9349211Z triton_mm_34 0.0085 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9349459Z triton_mm_33 0.0089 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9349713Z triton_mm_30 0.0113 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9349964Z triton_mm_16 0.0113 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9350241Z triton_mm_21 0.0115 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9350484Z triton_mm_22 0.0118 ms 71.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9350727Z triton_mm_23 0.0124 ms 68.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9350971Z triton_mm_15 0.0124 ms 68.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9351213Z triton_mm_31 0.0128 ms 66.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9351453Z triton_mm_29 0.0136 ms 62.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9351616Z SingleProcess AUTOTUNE benchmarking takes 0.1783 seconds and 8.8670 seconds precompiling for 33 choices 2025-12-04T12:10:20.9351791Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.9351855Z Traceback (most recent call last): 2025-12-04T12:10:20.9352027Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9352085Z method(*args, 
**kwargs) 2025-12-04T12:10:20.9352252Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9352312Z method(*args, **kwargs) 2025-12-04T12:10:20.9352488Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.9352544Z with policy(): 2025-12-04T12:10:20.9352711Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.9352774Z raise RuntimeError(msg) 2025-12-04T12:10:20.9353195Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:20.9353198Z 2025-12-04T12:10:20.9353289Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.9353579Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.9353582Z 2025-12-04T12:10:20.9353686Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.9353778Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9353861Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9353936Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9354498Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 
6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9354616Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9354671Z graph_break [] 2025-12-04T12:10:20.9354752Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9354842Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9355345Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.9355410Z current_size = base.storage().size() 2025-12-04T12:10:20.9355467Z Autotune Choices Stats: 2025-12-04T12:10:20.9355856Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00848000030964613, "best_triton_pos": 0} 2025-12-04T12:10:20.9355943Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9356010Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9356125Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9356378Z triton_mm_34 0.0085 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9356621Z triton_mm_33 0.0089 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9356875Z triton_mm_30 0.0113 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9357121Z triton_mm_16 0.0113 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9357364Z triton_mm_21 0.0115 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9357608Z triton_mm_22 0.0118 ms 71.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9357849Z triton_mm_23 0.0124 ms 68.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9358108Z triton_mm_15 0.0124 ms 68.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9358366Z triton_mm_31 0.0128 ms 66.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9358606Z triton_mm_29 0.0136 ms 62.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9358752Z SingleProcess AUTOTUNE benchmarking takes 0.1783 seconds and 8.8670 seconds precompiling for 33 choices 2025-12-04T12:10:20.9358843Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9358903Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9358978Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9359094Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9359594Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9359648Z graph_break [] 2025-12-04T12:10:20.9359728Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9359827Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9360257Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.9360366Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.9360423Z Autotune Choices Stats: 2025-12-04T12:10:20.9360809Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009080000221729279, "best_triton_pos": 0} 2025-12-04T12:10:20.9360899Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9360965Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9361080Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9361330Z triton_mm_72 0.0091 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9361392Z _scaled_mm 0.0093 ms 97.8% 2025-12-04T12:10:20.9361637Z triton_mm_71 0.0093 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9361880Z triton_mm_67 0.0108 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9362122Z triton_mm_54 0.0114 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9362388Z triton_mm_68 0.0115 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9362629Z triton_mm_60 0.0115 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9362870Z triton_mm_61 0.0119 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9363114Z triton_mm_53 0.0120 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9363354Z triton_mm_59 0.0120 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9363497Z SingleProcess AUTOTUNE benchmarking takes 0.2554 seconds and 0.8407 seconds precompiling for 39 choices 2025-12-04T12:10:20.9363568Z =================================== FAILURES =================================== 2025-12-04T12:10:20.9363740Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.9363806Z Traceback (most recent call last): 2025-12-04T12:10:20.9363992Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9364051Z method(*args, **kwargs) 2025-12-04T12:10:20.9364220Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in 
wrapper 2025-12-04T12:10:20.9364278Z method(*args, **kwargs) 2025-12-04T12:10:20.9364448Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.9364502Z with policy(): 2025-12-04T12:10:20.9364670Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.9364728Z raise RuntimeError(msg) 2025-12-04T12:10:20.9365161Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:20.9365167Z 2025-12-04T12:10:20.9365258Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.9365547Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.9365549Z 2025-12-04T12:10:20.9365651Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.9365740Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9365799Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9365873Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9366445Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9366573Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9366626Z graph_break [] 2025-12-04T12:10:20.9366706Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9366796Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9367295Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.9367362Z current_size = base.storage().size() 2025-12-04T12:10:20.9367419Z Autotune Choices Stats: 2025-12-04T12:10:20.9367805Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00848000030964613, "best_triton_pos": 0} 2025-12-04T12:10:20.9367885Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9367951Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9368064Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9368317Z triton_mm_34 0.0085 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9368578Z triton_mm_33 0.0089 ms 95.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9368820Z triton_mm_30 0.0113 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9369061Z triton_mm_16 0.0113 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9369313Z triton_mm_21 0.0115 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9369556Z triton_mm_22 0.0118 ms 71.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9369800Z triton_mm_23 0.0124 ms 68.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9370040Z triton_mm_15 0.0124 ms 68.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9370325Z triton_mm_31 0.0128 ms 66.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9370579Z 
triton_mm_29 0.0136 ms 62.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9370739Z SingleProcess AUTOTUNE benchmarking takes 0.1783 seconds and 8.8670 seconds precompiling for 33 choices 2025-12-04T12:10:20.9370828Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9370890Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9370963Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9371079Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9371582Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9371638Z graph_break [] 2025-12-04T12:10:20.9371717Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9371806Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9372185Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:20.9372294Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:20.9372368Z Autotune Choices Stats: 2025-12-04T12:10:20.9372753Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009080000221729279, "best_triton_pos": 0} 2025-12-04T12:10:20.9372833Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9372897Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9373013Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9373264Z triton_mm_72 0.0091 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9373335Z _scaled_mm 0.0093 ms 97.8% 2025-12-04T12:10:20.9373582Z triton_mm_71 0.0093 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9373829Z triton_mm_67 0.0108 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9374070Z triton_mm_54 0.0114 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9374310Z triton_mm_68 0.0115 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9374555Z triton_mm_60 0.0115 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9374827Z triton_mm_61 0.0119 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9375068Z triton_mm_53 0.0120 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9375308Z triton_mm_59 0.0120 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9375452Z SingleProcess AUTOTUNE benchmarking takes 0.2554 seconds and 0.8407 seconds precompiling for 39 choices 2025-12-04T12:10:20.9375542Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9375603Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9375678Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9375791Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9376288Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 
1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9376342Z graph_break [] 2025-12-04T12:10:20.9376422Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9376521Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9376578Z Autotune Choices Stats: 2025-12-04T12:10:20.9376964Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_110", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008799999952316284, "best_triton_pos": 0} 2025-12-04T12:10:20.9377041Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9377105Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9377218Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9377481Z triton_mm_110 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9377727Z triton_mm_105 0.0107 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9377968Z triton_mm_92 0.0108 ms 81.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9378209Z triton_mm_98 0.0112 ms 78.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9378451Z triton_mm_106 0.0115 ms 76.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9378703Z triton_mm_97 0.0116 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9378960Z triton_mm_109 0.0116 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9379204Z triton_mm_99 0.0120 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9379448Z triton_mm_91 0.0121 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9379693Z triton_mm_107 0.0126 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9379838Z SingleProcess AUTOTUNE benchmarking takes 0.2749 seconds and 0.6698 seconds precompiling for 39 choices 2025-12-04T12:10:20.9380046Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f15a9b444f5a4e4e.xml - 2025-12-04T12:10:20.9380155Z 
=========================== short test summary info ============================ 2025-12-04T12:10:20.9380797Z FAILED [1.6498s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:20.9380820Z 2025-12-04T12:10:20.9380910Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.9381198Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.9381201Z 2025-12-04T12:10:20.9381303Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.9381383Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.9381481Z ================= 1 failed, 187 deselected, 2 rerun in 14.98s ================== 2025-12-04T12:10:20.9381539Z Got exit code 1 2025-12-04T12:10:20.9381773Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:20.9381918Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:20.9382077Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1c67a793923764a0.xml 2025-12-04T12:10:20.9382151Z ============================= test session starts ============================== 2025-12-04T12:10:20.9382280Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.9382339Z cachedir: .pytest_cache 2025-12-04T12:10:20.9382514Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.9382580Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.9382641Z configfile: pytest.ini 2025-12-04T12:10:20.9382824Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.9382944Z collecting ... collected 188 items / 108 deselected / 80 selected 2025-12-04T12:10:20.9383015Z stepcurrent: skipping 108 already run items. 2025-12-04T12:10:20.9383076Z Running 80 items in this shard 2025-12-04T12:10:20.9383078Z 2025-12-04T12:10:20.9384019Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmpeforlo77/xc/cxc2anr2xernse3b4gn5u4tsll3tzesipkn3uljpaeetr7cv5ujy.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9384191Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9384425Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9384597Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9384900Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9387762Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9388039Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9388195Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9388466Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in 
precompile 2025-12-04T12:10:20.9388648Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9388930Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9389081Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9389370Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9389579Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9389908Z E1204 11:14:00.860000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9390710Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpeforlo77/h4/ch47pvrzjzy5l5g2miliwosvjhxn2mhzlg2mjae43rcj5ww67u2e.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9390885Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9391114Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9391284Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9391583Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9391730Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9392002Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9392156Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9392422Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9392604Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9392888Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9393035Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9393350Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9393560Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9393885Z E1204 11:14:00.932000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9394628Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpeforlo77/o7/co7nrawcubelamwpu3mcromhl3rhimqzkugscfu76wuqi2vn7vou.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9394788Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9395025Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9395204Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9395504Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9395657Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9395928Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9396084Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9396350Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9396520Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9396802Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9396961Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9397252Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9397458Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9397782Z E1204 11:14:00.940000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9398532Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpeforlo77/qc/cqciw5xrx6t6adzlfw6exhkjy4a5qtvq355pcgigeez7m5gbvgsq.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9398693Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9398924Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9399094Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9399395Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9399560Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9399829Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9399980Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9400296Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9400474Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9400759Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9400907Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9401194Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9401400Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9401738Z E1204 11:14:00.988000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9402473Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpeforlo77/d4/cd4i7s52vinet5572rcfxtekz764iiph45k3npnsrojp3frclnvc.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9402644Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9402872Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9403042Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9403344Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9403488Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9403758Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9403907Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9404189Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9404370Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9404651Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9404799Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9405086Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9405293Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9405621Z E1204 11:14:00.996000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9406362Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpeforlo77/js/cjsnt5ybembe2fnsixw3iunffuzd23eh7q4zydss5y55mal5cucl.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9406532Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9406759Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9406927Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9407236Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9407381Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9407650Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9407803Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9408072Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9408241Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9408525Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9408693Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9408981Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9409185Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9409511Z E1204 11:14:01.000000 828814 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9409581Z ('RERUN', {'yellow': True}) [3.9390s] [ 1%] 2025-12-04T12:10:20.9409930Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda E1204 11:14:02.901000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9410279Z E1204 11:14:02.901000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9410422Z E1204 11:14:02.901000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.9410583Z E1204 11:14:02.904000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9410912Z E1204 11:14:02.904000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9411055Z E1204 11:14:02.904000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9411213Z E1204 11:14:02.906000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9411518Z E1204 11:14:02.906000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9411671Z E1204 11:14:02.906000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9411828Z E1204 11:14:02.963000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9412134Z E1204 11:14:02.963000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9412275Z E1204 11:14:02.963000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.9412432Z E1204 11:14:02.965000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9412736Z E1204 11:14:02.965000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9412878Z E1204 11:14:02.965000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9413048Z E1204 11:14:02.967000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9413364Z E1204 11:14:02.967000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9413503Z E1204 11:14:02.967000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9413569Z ('RERUN', {'yellow': True}) [1.8223s] [ 1%] 2025-12-04T12:10:20.9413920Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda E1204 11:14:04.524000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9414226Z E1204 11:14:04.524000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9414367Z E1204 11:14:04.524000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.9414523Z E1204 11:14:04.527000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9414827Z E1204 11:14:04.527000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9414967Z E1204 11:14:04.527000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9415133Z E1204 11:14:04.530000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9415441Z E1204 11:14:04.530000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9415580Z E1204 11:14:04.530000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9415737Z E1204 11:14:04.571000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9416050Z E1204 11:14:04.571000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9416193Z E1204 11:14:04.571000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.9416350Z E1204 11:14:04.573000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9416654Z E1204 11:14:04.573000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9416794Z E1204 11:14:04.573000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9416950Z E1204 11:14:04.575000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9417254Z E1204 11:14:04.575000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:20.9417393Z E1204 11:14:04.575000 828814 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:20.9417473Z FAILED [1.5419s] [ 1%] 2025-12-04T12:10:20.9417475Z 2025-12-04T12:10:20.9417546Z ==================================== RERUNS ==================================== 2025-12-04T12:10:20.9417719Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.9417784Z Traceback (most recent call last): 2025-12-04T12:10:20.9417956Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9418015Z method(*args, **kwargs) 2025-12-04T12:10:20.9418181Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9418240Z method(*args, **kwargs) 2025-12-04T12:10:20.9418404Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.9418460Z with policy(): 2025-12-04T12:10:20.9418627Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.9418685Z raise RuntimeError(msg) 2025-12-04T12:10:20.9419104Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1954545664. 
2025-12-04T12:10:20.9419107Z 2025-12-04T12:10:20.9419201Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.9419487Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:20.9419500Z 2025-12-04T12:10:20.9419605Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.9419698Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9419758Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9419834Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9420451Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9420569Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9420623Z graph_break [] 2025-12-04T12:10:20.9420707Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9420796Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9421296Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.9421360Z current_size = base.storage().size() 2025-12-04T12:10:20.9421420Z Autotune Choices Stats: 2025-12-04T12:10:20.9421821Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00887999963015318, "best_triton_pos": 0} 2025-12-04T12:10:20.9421912Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9421978Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9422092Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9422345Z triton_mm_33 0.0089 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9422589Z triton_mm_34 0.0090 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9422830Z triton_mm_16 0.0110 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9423069Z triton_mm_30 0.0118 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9423309Z triton_mm_23 0.0119 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9423550Z triton_mm_15 0.0122 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9423800Z triton_mm_31 0.0127 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9424038Z triton_mm_21 0.0129 ms 68.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9424276Z triton_mm_29 0.0131 ms 67.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9424529Z triton_mm_35 0.0138 ms 64.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9424677Z SingleProcess AUTOTUNE benchmarking takes 0.1748 seconds and 1.4396 seconds precompiling for 33 choices 2025-12-04T12:10:20.9424851Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.9424917Z Traceback (most recent call last): 2025-12-04T12:10:20.9425089Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9425147Z method(*args, **kwargs) 
2025-12-04T12:10:20.9425314Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9425372Z method(*args, **kwargs) 2025-12-04T12:10:20.9425537Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.9425593Z with policy(): 2025-12-04T12:10:20.9425759Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.9425818Z raise RuntimeError(msg) 2025-12-04T12:10:20.9426263Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:20.9426267Z 2025-12-04T12:10:20.9426357Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.9426644Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:20.9426647Z 2025-12-04T12:10:20.9426750Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.9426840Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9426900Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9426978Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9427540Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), 
('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9427659Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9427711Z graph_break [] 2025-12-04T12:10:20.9427803Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9427891Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9428391Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.9428458Z current_size = base.storage().size() 2025-12-04T12:10:20.9428515Z Autotune Choices Stats: 2025-12-04T12:10:20.9428909Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00887999963015318, "best_triton_pos": 0} 2025-12-04T12:10:20.9428987Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9429054Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9429170Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9429418Z triton_mm_33 0.0089 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9429658Z triton_mm_34 0.0090 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9429897Z triton_mm_16 0.0110 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9430187Z triton_mm_30 0.0118 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9430440Z triton_mm_23 0.0119 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9430679Z triton_mm_15 0.0122 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9430919Z triton_mm_31 0.0127 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9431158Z triton_mm_21 0.0129 ms 68.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9431394Z triton_mm_29 0.0131 ms 67.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9431635Z triton_mm_35 0.0138 ms 64.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9431781Z SingleProcess AUTOTUNE benchmarking takes 0.1748 seconds and 1.4396 seconds precompiling for 33 choices 2025-12-04T12:10:20.9431881Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9431941Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9432013Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9432130Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9432628Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9432683Z graph_break [] 2025-12-04T12:10:20.9432766Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9432868Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9432928Z Autotune Choices Stats: 2025-12-04T12:10:20.9433304Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008960000239312649, "best_triton_pos": 0} 
2025-12-04T12:10:20.9433382Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9433446Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9433560Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9433807Z triton_mm_72 0.0090 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9433867Z _scaled_mm 0.0096 ms 93.0% 2025-12-04T12:10:20.9434116Z triton_mm_71 0.0099 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9434362Z triton_mm_60 0.0106 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9434601Z triton_mm_67 0.0108 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9434836Z triton_mm_54 0.0110 ms 81.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9435074Z triton_mm_68 0.0120 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9435312Z triton_mm_61 0.0120 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9435551Z triton_mm_53 0.0121 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9435787Z triton_mm_59 0.0124 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9435941Z SingleProcess AUTOTUNE benchmarking takes 0.2556 seconds and 0.8124 seconds precompiling for 39 choices 2025-12-04T12:10:20.9436010Z =================================== FAILURES =================================== 2025-12-04T12:10:20.9436182Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:20.9436246Z Traceback (most recent call last): 2025-12-04T12:10:20.9436418Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9436477Z method(*args, **kwargs) 2025-12-04T12:10:20.9436642Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:20.9436700Z method(*args, **kwargs) 2025-12-04T12:10:20.9436872Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:20.9436929Z with policy(): 2025-12-04T12:10:20.9437095Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:20.9437155Z raise RuntimeError(msg) 2025-12-04T12:10:20.9437573Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:20.9437576Z 2025-12-04T12:10:20.9437666Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.9437953Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:20.9437956Z 2025-12-04T12:10:20.9438058Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.9438147Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9438228Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9438303Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9438863Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9438978Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9439033Z graph_break [] 2025-12-04T12:10:20.9439112Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9439200Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9439699Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:20.9439763Z current_size = base.storage().size() 2025-12-04T12:10:20.9439820Z Autotune Choices Stats: 2025-12-04T12:10:20.9440239Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00887999963015318, "best_triton_pos": 0} 2025-12-04T12:10:20.9440333Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9440399Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9440512Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9440761Z triton_mm_33 0.0089 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9441001Z triton_mm_34 0.0090 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9441252Z triton_mm_16 0.0110 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9441494Z triton_mm_30 0.0118 ms 75.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9441734Z triton_mm_23 0.0119 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9441973Z triton_mm_15 0.0122 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9442211Z triton_mm_31 0.0127 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9442468Z triton_mm_21 0.0129 ms 68.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9442716Z triton_mm_29 0.0131 ms 67.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9442956Z triton_mm_35 0.0138 ms 64.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9443103Z SingleProcess AUTOTUNE benchmarking takes 0.1748 seconds and 1.4396 seconds precompiling for 33 choices 2025-12-04T12:10:20.9443192Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:20.9443251Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9443324Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9443440Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9443938Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9443992Z graph_break [] 2025-12-04T12:10:20.9444072Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9444169Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9444227Z Autotune Choices Stats: 2025-12-04T12:10:20.9444603Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008960000239312649, "best_triton_pos": 0} 2025-12-04T12:10:20.9444681Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9444744Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9444858Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9445113Z triton_mm_72 0.0090 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9445175Z _scaled_mm 0.0096 ms 93.0% 2025-12-04T12:10:20.9445415Z 
triton_mm_71 0.0099 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9445654Z triton_mm_60 0.0106 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9445892Z triton_mm_67 0.0108 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9446128Z triton_mm_54 0.0110 ms 81.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9446374Z triton_mm_68 0.0120 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9446621Z triton_mm_61 0.0120 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9446861Z triton_mm_53 0.0121 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9447098Z triton_mm_59 0.0124 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:20.9447241Z SingleProcess AUTOTUNE benchmarking takes 0.2556 seconds and 0.8124 seconds precompiling for 39 choices 2025-12-04T12:10:20.9447331Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:20.9447390Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:20.9447463Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:20.9447576Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:20.9448072Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:20.9448136Z graph_break [] 2025-12-04T12:10:20.9448216Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:20.9448304Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:20.9448363Z Autotune Choices Stats: 2025-12-04T12:10:20.9448741Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_110", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00827999971807003, "best_triton_pos": 0} 2025-12-04T12:10:20.9448818Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:20.9448887Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:20.9449009Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:20.9449259Z triton_mm_110 0.0083 ms 100.0% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9449503Z triton_mm_109 0.0092 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9449563Z _scaled_mm 0.0095 ms 87.3% 2025-12-04T12:10:20.9449799Z triton_mm_105 0.0110 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9450037Z triton_mm_98 0.0114 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9450312Z triton_mm_92 0.0114 ms 72.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9450575Z triton_mm_97 0.0114 ms 72.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9450815Z triton_mm_106 0.0120 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:20.9451053Z triton_mm_99 0.0124 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9451295Z triton_mm_107 0.0127 ms 65.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:20.9451438Z SingleProcess AUTOTUNE benchmarking takes 0.2817 seconds and 0.6451 seconds precompiling for 39 choices 2025-12-04T12:10:20.9451644Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1c67a793923764a0.xml - 2025-12-04T12:10:20.9451719Z =========================== short test summary info ============================ 2025-12-04T12:10:20.9452354Z FAILED [1.5419s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:20.9452372Z 2025-12-04T12:10:20.9452463Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:20.9452749Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:20.9452752Z 2025-12-04T12:10:20.9456295Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:20.9456378Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:20.9456472Z ================== 1 failed, 108 deselected, 2 rerun in 7.32s ================== 2025-12-04T12:10:20.9456558Z Got exit code 1 2025-12-04T12:10:20.9456618Z Retrying single test... 2025-12-04T12:10:20.9456819Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-fe0eb0f1af3fb618.xml 2025-12-04T12:10:20.9456915Z ============================= test session starts ============================== 2025-12-04T12:10:20.9457048Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:20.9457108Z cachedir: .pytest_cache 2025-12-04T12:10:20.9457283Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:20.9457347Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:20.9457405Z configfile: pytest.ini 2025-12-04T12:10:20.9457584Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:20.9457678Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:20.9457963Z stepcurrent: skipping 108 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:20.9458026Z Running 1 items in this shard 2025-12-04T12:10:20.9458041Z 2025-12-04T12:10:20.9458415Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:14:15.712442003 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.9458418Z 2025-12-04T12:10:20.9458589Z [W1204 11:14:22.256545039 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9458592Z 2025-12-04T12:10:20.9458759Z [W1204 11:14:22.267400628 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9458762Z 2025-12-04T12:10:20.9458924Z [W1204 11:14:22.268168528 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9458926Z 2025-12-04T12:10:20.9459257Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9459567Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9459715Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9460257Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9460541Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9460781Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous 
namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9461028Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9461246Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9461488Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9461726Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9461968Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9462200Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9462441Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9462697Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9462940Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9463171Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9463413Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9463649Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9463888Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9464121Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9464359Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9464607Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9464811Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9465045Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9465284Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9465526Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9465733Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9465968Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9466209Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9466440Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9466680Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9466922Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9467149Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from 
/usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9467377Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9467554Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9467750Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9468286Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp_s8vyksi/xc/cxc2anr2xernse3b4gn5u4tsll3tzesipkn3uljpaeetr7cv5ujy.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9468448Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9468680Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9468850Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9469162Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9469312Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
choice.precompile() 2025-12-04T12:10:20.9469583Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9469737Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9470014Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9470236Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9470517Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9470668Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9470956Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9471166Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9471511Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9471828Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9471973Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9472467Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9472737Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9472977Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9473198Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9473414Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9473667Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9473902Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9474142Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9474376Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9474627Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9474862Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9475104Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9475335Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9475576Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9475807Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9476058Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9476301Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9476539Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9476770Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9476975Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9477209Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9477447Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9477678Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9477883Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9478133Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9478376Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9478606Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9478864Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9479095Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9479313Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9479539Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9479711Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9479903Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9480021Z E1204 11:14:22.764000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.9480399Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9480720Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9480869Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9481358Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9481624Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9481865Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9482083Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9482298Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9482551Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9482784Z E1204 11:14:22.830000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9483027Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9483259Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9483510Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9483743Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9483984Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9484215Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9484455Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9484686Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9484935Z E1204 11:14:22.830000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9485179Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9485420Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9485653Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9485858Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9486089Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9486330Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9486560Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9486764Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9487003Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9487244Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9487476Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9487717Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9487963Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9488180Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9488405Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9488576Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9488771Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9489311Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp_s8vyksi/o7/co7nrawcubelamwpu3mcromhl3rhimqzkugscfu76wuqi2vn7vou.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9489484Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9489713Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9489881Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9490220Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9490369Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9490641Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9490795Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9491061Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9491233Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9491526Z E1204 11:14:22.830000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9491676Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9491962Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9492169Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9492509Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9492816Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9492961Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9493449Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9493715Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9493966Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9494209Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9494423Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9494664Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9494899Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9495143Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9495376Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9495619Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9495850Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9496102Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9496334Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9496575Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9496816Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9497056Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9497288Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9497530Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9497763Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9497966Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9498198Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9498458Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9498690Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9498892Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9499125Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9499369Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9499601Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9499840Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9500075Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9500326Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9500567Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9500741Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9500935Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9501052Z E1204 11:14:22.830000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.9501385Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9501694Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9501838Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9502323Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9502589Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9502843Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9503076Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9503289Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9503530Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9503764Z E1204 11:14:22.837000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9504006Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9504237Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9504477Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9504709Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9504963Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9505194Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9505432Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9505674Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9505914Z E1204 11:14:22.837000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9506145Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9506387Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9506617Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9506820Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9507051Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9507315Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9507547Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9507749Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9507981Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9508221Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9508453Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9508691Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9508923Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9509137Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9509371Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9509545Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9509738Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9510317Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp_s8vyksi/h4/ch47pvrzjzy5l5g2miliwosvjhxn2mhzlg2mjae43rcj5ww67u2e.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9510479Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9510708Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9510876Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9511177Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9511323Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9511610Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9511775Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9512043Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9512215Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9512496Z E1204 11:14:22.837000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9512644Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9512934Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9513140Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9513467Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9513770Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9513929Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9514418Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9514698Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9514938Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9515158Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9515372Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9515612Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9515846Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9516098Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9516339Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9516578Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9516809Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9517051Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9517285Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9517524Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9517756Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9517996Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9518238Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9518478Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9518709Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9518913Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9519154Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9519397Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9519629Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9519832Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9520063Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9520342Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9520588Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9520840Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9521071Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9521286Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9521510Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9521683Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9521879Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9521994Z E1204 11:14:22.837000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.9522165Z [W1204 11:14:22.314952851 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.9522167Z 2025-12-04T12:10:20.9522490Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9522807Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9522952Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9523449Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9523714Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9523953Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9524172Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9524388Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9524628Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9524864Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9525125Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9525357Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9525596Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9525828Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9526069Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9526300Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9526540Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9526774Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9527024Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9527256Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9527496Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9527727Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9527948Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9528181Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9528422Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9528652Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9528855Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9529088Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9529341Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9529582Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9529822Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9530056Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9530282Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9530507Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9530679Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9530870Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9531398Z E1204 11:14:22.853000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp_s8vyksi/js/cjsnt5ybembe2fnsixw3iunffuzd23eh7q4zydss5y55mal5cucl.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9531577Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9531804Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9531970Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9532281Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9532426Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9532699Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9532851Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9533120Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9533291Z E1204 11:14:22.853000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9533571Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9533721Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9534034Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9534239Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9534565Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9534868Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9535013Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9535503Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9535768Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9536016Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9536238Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9536452Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9536692Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9536934Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9537176Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9537409Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9537648Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9537882Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9538123Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9538364Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9538617Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9538848Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9539088Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9539320Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9539560Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9539791Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9539993Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9540256Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9540510Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9540743Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9540946Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9541189Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9541432Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9541665Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9541906Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9542137Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9542353Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9542578Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9542763Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9542967Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9543083Z E1204 11:14:22.853000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.9543405Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9543707Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9543854Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9544340Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9544603Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9544859Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9545080Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9545294Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9545534Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9545775Z E1204 11:14:22.888000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9546017Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9546249Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9546489Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9546720Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9546959Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9547203Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9547454Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9547686Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9547926Z E1204 11:14:22.888000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9548159Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9548400Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9548634Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9548837Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9549068Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9549320Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9549551Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9549754Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9549993Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9550283Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9550516Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9550756Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9550987Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9551202Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9551426Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9551609Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9551814Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9552344Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp_s8vyksi/qc/cqciw5xrx6t6adzlfw6exhkjy4a5qtvq355pcgigeez7m5gbvgsq.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9552504Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9552733Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9552901Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9553204Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9553350Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9553620Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9553784Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9554052Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9554221Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9554512Z E1204 11:14:22.888000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9554661Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9554951Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9555159Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9555485Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9555789Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9555934Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9556429Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9556705Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9556944Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9557164Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9557380Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9557620Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9557854Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9558094Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9558336Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9558577Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9558807Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9559058Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9559290Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9559530Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9559762Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9560004Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9560276Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9560535Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9560785Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9560988Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9561220Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9561459Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9561693Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9561897Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9562129Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9562371Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9562617Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9562858Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9563093Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9563307Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9563543Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9563717Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9563910Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9564026Z E1204 11:14:22.888000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.9564195Z [W1204 11:14:22.372678862 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:20.9564198Z 2025-12-04T12:10:20.9564518Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9564826Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9564980Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9565478Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9565745Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9565983Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9566202Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9566417Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9566658Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9566892Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9567145Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9567379Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9567619Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9567859Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9568102Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9568334Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9568575Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9568806Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9569048Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9569280Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9569542Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9569777Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9569978Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9570239Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9570480Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9570711Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9570915Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9571146Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9571385Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9571632Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9571875Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9572107Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9572336Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9572564Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9572736Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9572929Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9573458Z E1204 11:14:22.912000 834730 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp_s8vyksi/d4/cd4i7s52vinet5572rcfxtekz764iiph45k3npnsrojp3frclnvc.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:20.9573620Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:20.9573859Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:20.9574039Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:20.9574344Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:20.9574489Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:20.9574762Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:20.9574914Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:20.9575183Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:20.9575352Z E1204 11:14:22.912000 
834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:20.9575635Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:20.9575788Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:20.9576086Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:20.9576292Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:20.9576616Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9576931Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9577076Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9577565Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9577832Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9578069Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9578306Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9578528Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9578769Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9579004Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9579246Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9579480Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9579720Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9579952Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9580227Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9580472Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9580715Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9580945Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9581187Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9581432Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9581676Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9581907Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9582112Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9582344Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9582582Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9582825Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9583039Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9583270Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9583510Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9583743Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9583986Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9584218Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9584435Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:20.9584658Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:20.9584842Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:20.9585035Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:20.9585152Z E1204 11:14:22.912000 834730 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:20.9585222Z ('RERUN', {'yellow': True}) [11.5067s] [100%] 2025-12-04T12:10:20.9585586Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:14:24.409413254 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9585597Z 2025-12-04T12:10:20.9585759Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9586068Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:20.9586374Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9586518Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9587005Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9587281Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9587529Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9587748Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9587961Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9588203Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9588438Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9588677Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9588910Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9589150Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9589392Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9589633Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9589865Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9590152Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9590385Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9590597Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9590819Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9591034Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9591277Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9591511Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9591738Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9591971Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame 
from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9592210Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9592442Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9592646Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9592878Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9593092Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9593294Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9593525Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9593748Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9593950Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9594181Z E1204 
11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9594421Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9594671Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9594917Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9595149Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9595362Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9595584Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9595798Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9596048Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9596290Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9596531Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9596762Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9597003Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9597235Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9597477Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9597710Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9597954Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9598196Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9598435Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9598667Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9598916Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9599150Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9599392Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9599624Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9599864Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9600140Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9600385Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9600646Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9600887Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9601118Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9601333Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9601546Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.9601786Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9602016Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9602255Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9602486Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9602740Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9602971Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9603211Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9603453Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9603696Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9603928Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9604169Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9604402Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9604613Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9604815Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9605064Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9605276Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9605497Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9605712Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9605956Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9606188Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9606398Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9606599Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9606831Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9607050Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9607252Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9607486Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9607735Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9607969Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9608211Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9608442Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9608651Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9608873Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.9609086Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9609338Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9609587Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9609802Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9610018Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9610273Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9610516Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9610751Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9610991Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9611227Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9611486Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9611721Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9611961Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9612206Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9612451Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9612686Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9612902Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9613106Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9613341Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9613557Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9613785Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9614012Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9614253Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9614487Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9614730Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9614964Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9615204Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9615440Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9615682Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9615927Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9616168Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9616400Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9616624Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9616837Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9617045Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9617273Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9617486Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9617729Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9617963Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9618188Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9618416Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9618629Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9618871Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9619105Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9619349Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9619583Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9619826Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9620061Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9620351Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9620586Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9620825Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9621072Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9621312Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9621547Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9621791Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9622029Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9622271Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9622504Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9622770Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9623003Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9623245Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9623479Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9623693Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9623899Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9624132Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9624379Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9624611Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9624866Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9625101Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9625341Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9625586Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9625828Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9626062Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9626303Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9626538Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9626746Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9626990Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9627251Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9627483Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9627726Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9627960Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9628187Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9628404Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9628615Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9628829Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9629084Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9629324Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9629553Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9629769Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9629992Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9630239Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9630488Z E1204 
11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9630724Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9630940Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9631153Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9631358Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9631546Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.9631780Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9631984Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9632217Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9632458Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9632693Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9632905Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9633112Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9633349Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9633575Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9633779Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9634016Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9634218Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9634464Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9634669Z E1204 11:14:24.961000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9634903Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9635106Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9635338Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9635585Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9635832Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9636082Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9636315Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9636556Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9636790Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9637030Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9637265Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9637507Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9637741Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9637964Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9638169Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9638404Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9638630Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9638857Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9639072Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9639286Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9639528Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9639761Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9640003Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9640290Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9640510Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9640744Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9640986Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9641220Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9641462Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9641695Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9641921Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9642137Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9642362Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9642568Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9642774Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9643007Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9643261Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9643496Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9643738Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9643972Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9644174Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9644408Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9644667Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9644911Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9645153Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9645388Z 
E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9645592Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9645825Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9646067Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9646299Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9646541Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9646783Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9647011Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9647227Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9647441Z E1204 11:14:24.961000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9647669Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9647912Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9648146Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9648348Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9648582Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9648824Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9649067Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9649317Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9649549Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9649776Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9649995Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9650265Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9650481Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9650721Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9650956Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9651210Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9651445Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9651687Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9651921Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9652138Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9652375Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9652618Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9652850Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9653093Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9653327Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9653566Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9653796Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9654012Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9654226Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9654468Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9654703Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9654946Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9655178Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9655421Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9655665Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9655909Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9656142Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9656385Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9656628Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9656841Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.9657059Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.9657265Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.9657478Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.9657705Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.9657940Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.9658163Z E1204 
11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.9658367Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.9658576Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.9658761Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.9658904Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:20.9659065Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.9659184Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.9659324Z E1204 11:14:24.961000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9659498Z [W1204 11:14:24.427959383 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9659501Z 2025-12-04T12:10:20.9659662Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9659979Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9660330Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9660474Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9660984Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9661254Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9661494Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9661715Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9661930Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9662175Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9662421Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9662677Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9662909Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9663150Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9663383Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9663624Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9663858Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9664098Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9664332Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9664559Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9664782Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9664996Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9665243Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9665475Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9665679Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9665911Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9666151Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9666383Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9666586Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9666829Z E1204 
11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9667051Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9667253Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9667486Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9667698Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9667899Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9668133Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9668373Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9668606Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9668856Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9669090Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9669303Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9669525Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9669749Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9669990Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9670257Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9670499Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9670731Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9670971Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9671215Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9671467Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9671701Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9671942Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9672173Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9672413Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9672645Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9672884Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9673116Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9673368Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9673602Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9673841Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9674089Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9674331Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9674563Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9674804Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9675035Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9675251Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9675462Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.9675711Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9675954Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9676194Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9676428Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9676670Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9676903Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9677143Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9677376Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9677617Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9677867Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9678108Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9678339Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9678561Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9678765Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9678998Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9679209Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9679430Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.9679646Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9679887Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9680168Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9680395Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9680597Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9680830Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9681041Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9681247Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9681479Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9681721Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9681956Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9682211Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9682446Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9682657Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9682880Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9683105Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9683351Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9683587Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9683804Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9684018Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9684232Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9684488Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9684734Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9684979Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9685215Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9685456Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9685692Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9685933Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9686170Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9686412Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9686659Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9686874Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9687078Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9687313Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9687536Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9687754Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9687969Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9688212Z 
E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9688449Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9688694Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9688941Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9689192Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9689427Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9689667Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9689903Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9690175Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9690408Z 
E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9690624Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9690838Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9691064Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9691290Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9691505Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9691747Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9692000Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9692219Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9692432Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9692647Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9692889Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9693127Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9693384Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9693636Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9693880Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9694115Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9694358Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9694594Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9694835Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9695068Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9695309Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9695554Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9695796Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9696032Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9696285Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9696518Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9696760Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9696995Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9697236Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9697470Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9697685Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9697910Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9698145Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9698388Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9698622Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9698865Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9699099Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9699341Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9699575Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9699814Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9700058Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9700404Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9700641Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9700860Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9701095Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9701336Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9701570Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9701811Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9702043Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9702283Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9702510Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9702725Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9702941Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9703184Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9703421Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9703648Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9703863Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9704075Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9704289Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9704543Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9704777Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9704994Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9705215Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.9705424Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9705587Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.9705823Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9706028Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9706262Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9706503Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9706747Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9706972Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9707176Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9707411Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9707623Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9707829Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9708065Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9708268Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9708502Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9708716Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9709051Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9709255Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9709489Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9709744Z E1204 11:14:24.967000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9710033Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9710311Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9710547Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9710792Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9711028Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9711295Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9711541Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9711781Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9712015Z E1204 11:14:24.967000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9712228Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9712431Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9712666Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9712895Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9713113Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9713338Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9713553Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9713796Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9714032Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9714288Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9714522Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9714727Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9714960Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9715204Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9715439Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9715698Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9715941Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9716167Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9716385Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9716597Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9716807Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9717014Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9717248Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9717492Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9717740Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9717983Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9718216Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9718420Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9718664Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9718907Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9719142Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9719382Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9719619Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9719823Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9720073Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9720364Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9720596Z E1204 11:14:24.967000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9720839Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9721071Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9721299Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9721514Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9721728Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9721943Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9722199Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9722438Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9722645Z E1204 11:14:24.967000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9722879Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9723131Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9723367Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9723610Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9723842Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9724070Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9724285Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9724510Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9724738Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9724983Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9725217Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9725457Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9725692Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9725933Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9726168Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9726373Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9726618Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9726861Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9727095Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9727343Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9727594Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9727822Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9728038Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9728251Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9728468Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9728709Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9728954Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9729205Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9729438Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9729683Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9729918Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9730197Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9730430Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9730671Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9730904Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9731131Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.9731349Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.9731554Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.9731767Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.9732007Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.9732233Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.9732446Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.9732652Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.9732860Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.9733046Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.9733190Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.9733359Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.9733489Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.9733628Z E1204 11:14:24.967000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9733801Z [W1204 11:14:24.430665248 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9733804Z 2025-12-04T12:10:20.9733961Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9734273Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9734583Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9734728Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9735222Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9735499Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9735738Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9735957Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9736171Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9736425Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9736660Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9736906Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9737140Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9737382Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9737617Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9737866Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9738110Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9738348Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9738581Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9738792Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9739015Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9739232Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9739475Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9739708Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9739921Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9740206Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9740446Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9740679Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9740897Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9741130Z E1204 
11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9741342Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9741544Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9741778Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9741988Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9742202Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9742447Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9742687Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9742920Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9743161Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9743394Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9743606Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9743828Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9744042Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9744305Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9744539Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9744779Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9745010Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9745259Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9745492Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9745734Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9745964Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9746206Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9746437Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9746692Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9746937Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9747180Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9747412Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9747654Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9747887Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9748125Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9748357Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9748598Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9748846Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9749090Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9749321Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9749545Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9749755Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.9749999Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9750248Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9750489Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9750721Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9750961Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9751209Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9751463Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9751696Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9751938Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9752172Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9752414Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9752646Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9752857Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9753061Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9753309Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9753520Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9753740Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.9753955Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9754208Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9754441Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9754652Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9754853Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9755085Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9755294Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9755506Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9755749Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9755990Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9756223Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9756467Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9756698Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9756909Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9757130Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9757343Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9757598Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9757833Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9758050Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9758262Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9758494Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9758741Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9758974Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9759216Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9759449Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9759690Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9759935Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9760240Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9761793Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9762040Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9762275Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9762490Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9762698Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9762934Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9763151Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9763390Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9763606Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9763849Z 
E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9764081Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9764336Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9764574Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9764820Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9765057Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9765298Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9765537Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9765793Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9766043Z 
E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9766260Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9766473Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9766680Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9766904Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9767121Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9767364Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9767599Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9767826Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9768040Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9768253Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9768494Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9768740Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9768983Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9769218Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9769459Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9769696Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9769939Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9770224Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9770477Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9770710Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9770953Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9771192Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9771434Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9771669Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9771909Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9772146Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9772399Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9772634Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9772878Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9773122Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9773335Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9773540Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9773777Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9774018Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9774254Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9774500Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9774743Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9774995Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9775229Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9775471Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9775707Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9775948Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9776183Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9776391Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9776625Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9776882Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9777120Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9777366Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9777611Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9777839Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9778056Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9778269Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9778481Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9778724Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9778958Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9779210Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9779428Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9779641Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9779855Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9780131Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9780366Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9780581Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9780792Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.9780998Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9781172Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.9781408Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9781614Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9781854Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9782111Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9782346Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9782560Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9782762Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9782996Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9783208Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9783427Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9783676Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9783878Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9784116Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9784321Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9784555Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9784758Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9784994Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9785236Z E1204 11:14:24.969000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9785468Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9785721Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9785954Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9786195Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9786438Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9786684Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9786920Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9787161Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9787393Z E1204 11:14:24.969000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9787607Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9787822Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9788063Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9788291Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9788507Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9788724Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9788943Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9789186Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9789420Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9789661Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9789908Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9790160Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9790394Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9790636Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9790883Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9791126Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9791360Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9791588Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9791804Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9792017Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9792223Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9792452Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9792688Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9792929Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9793163Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9793406Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9793639Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9793849Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9794083Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9794325Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9794570Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9794812Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9795045Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9795262Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9795497Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9795738Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9795972Z E1204 11:14:24.969000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9796216Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9796449Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9796688Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9796915Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9797127Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9797342Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9797584Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9797818Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9798022Z E1204 11:14:24.969000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9798256Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9798499Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9798747Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9798987Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9799221Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9799447Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9799673Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9799888Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9800121Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9800363Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9800596Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9800840Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9801092Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9801345Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9801578Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9801783Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9802017Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9802259Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9802493Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9802737Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9802969Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9803208Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9803425Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9803640Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9803854Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9804106Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9804341Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9804583Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9804817Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9805058Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9805292Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9805545Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9805791Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9806033Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9806266Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9806478Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.9806694Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.9806901Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.9807110Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.9807340Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.9807573Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.9807785Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.9807992Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.9808201Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.9808398Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.9808540Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.9808703Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.9808822Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.9808965Z E1204 11:14:24.969000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9809139Z [W1204 11:14:25.472059011 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9809142Z 2025-12-04T12:10:20.9809300Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9809611Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9809930Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9810114Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9810611Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9810879Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9811121Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9811340Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9811557Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9811800Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9812059Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9812302Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9812533Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9812778Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9813027Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9813269Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9813502Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9813743Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9813975Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9814187Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9814424Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9814649Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9814889Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9815121Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9815327Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9815561Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9815801Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9816033Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9816236Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9816480Z E1204 
11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9816691Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9816892Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9817124Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9817343Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9817547Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9817782Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9818022Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9818253Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9818493Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9818736Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9818956Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9819178Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9819389Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9819631Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9819868Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9820160Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9820392Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9820634Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9820879Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9821121Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9821353Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9821593Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9821835Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9822080Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9822312Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9822552Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9822783Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9823025Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9823271Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9823522Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9823758Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9823998Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9824233Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9824473Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9824706Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9824922Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9825133Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.9825385Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9825618Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9825857Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9826090Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9826343Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9826577Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9826816Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9827049Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9827289Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9827522Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9827771Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9828018Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9828231Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9828434Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9828665Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9828876Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9829097Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.9829310Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9829551Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9829793Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9830004Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9830238Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9830469Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9830693Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9830894Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9831127Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9831367Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9831598Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9831838Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9832081Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9832312Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9832532Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9832746Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9832990Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9833224Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9833440Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9833652Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9833867Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9834121Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9834362Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9834604Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9834836Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9835087Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9835322Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9835564Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9835796Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9836038Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9836275Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9836499Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9836718Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9836952Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9837170Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9837382Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9837599Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9837841Z 
E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9838074Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9838319Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9838563Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9838804Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9839038Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9839279Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9839523Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9839765Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9839999Z 
E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9840251Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9840468Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9840675Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9840912Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9841139Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9841379Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9841615Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9841830Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9842043Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9842259Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9842502Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9842736Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9842988Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9843222Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9843463Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9843696Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9843949Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9844184Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9844427Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9844660Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9844902Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9845137Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9845393Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9845636Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9845876Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9846110Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9846352Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9846592Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9846832Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9847067Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9847281Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9847496Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9847730Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9847970Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9848212Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9848455Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9848692Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9848935Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9849167Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9849408Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9849641Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9849893Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9850169Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9850373Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9850611Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9850852Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9851087Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9851327Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9851564Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9851792Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9852021Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9852235Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9852449Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9852708Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9852941Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9853169Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9853385Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9853598Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9853814Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9854055Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9854303Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9854531Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9854748Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.9854956Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9855118Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.9855354Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9855557Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9855791Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9856032Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9856279Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9856494Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9856700Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9856934Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9857153Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9857357Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9857592Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9857796Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9858029Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9858233Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9858468Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9858696Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9858930Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9859170Z E1204 11:14:25.011000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9859404Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9859646Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9859879Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9860164Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9860397Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9860640Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9860890Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9861131Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9861364Z E1204 11:14:25.011000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9861594Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9861799Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9862032Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9862260Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9862474Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9862691Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9862908Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9863173Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9863407Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9863647Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9863881Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9864086Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9864319Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9864561Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9864798Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9865039Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9865282Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9865509Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9865724Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9865947Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9866155Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9866357Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9866593Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9866837Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9867071Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9867313Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9867565Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9867769Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9868002Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9868243Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9868478Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9868722Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9868956Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9869160Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9869392Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9869655Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9869889Z E1204 11:14:25.011000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9870160Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9870405Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9870632Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9870851Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9871065Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9871279Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9871521Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9871754Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9871983Z E1204 11:14:25.011000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9872216Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9872456Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9872690Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9872934Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9873169Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9873394Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9873610Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9873821Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9874048Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9874292Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9874524Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9874776Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9875012Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9875252Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9875486Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9875689Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9875923Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9876164Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9876417Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9876657Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9876893Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9877121Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9877337Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9877551Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9877764Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9878006Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9878239Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9878497Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9878732Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9878975Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9879219Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9879460Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9879695Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9879935Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9880204Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9880416Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.9880646Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.9880863Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.9881078Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.9881307Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.9881528Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.9881742Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.9881949Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.9882154Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.9882341Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.9882483Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.9882656Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.9882776Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.9882921Z E1204 11:14:25.011000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9883092Z [W1204 11:14:25.474131844 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9883095Z 2025-12-04T12:10:20.9883254Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9883575Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9883884Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9884033Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9884521Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9884790Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9885044Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9885273Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9885489Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9885730Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9885964Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9886206Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9886440Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9886680Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9886917Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9887170Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9887403Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9887645Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9887876Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9888097Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9888321Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9888534Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9888774Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9889008Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9889212Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9889455Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9889707Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9889939Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9890166Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9890398Z E1204 
11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9890609Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9890812Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9891046Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9891258Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9891474Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9891707Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9891948Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9892178Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9892431Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9892663Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9892875Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9893100Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9893312Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9893554Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9893786Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9894056Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9894288Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9894528Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9894761Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9895001Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9895240Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9895480Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9895714Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9895962Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9896194Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9896436Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9896667Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9896918Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9897153Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9897397Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9897628Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9897870Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9898102Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9898352Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9898599Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9898813Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9899024Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.9899270Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9899502Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9899743Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9899975Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9900247Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9900501Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9900742Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9900974Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9901214Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9901459Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9901700Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9901932Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9902143Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9902346Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9902576Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9902811Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9903049Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.9903262Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9903509Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9903741Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9903951Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9904154Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9904385Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9904597Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9904815Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9905047Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9905288Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9905522Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9905774Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9906006Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9906217Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9906438Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9906653Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9906897Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9907144Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9907375Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9907590Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9907806Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9908049Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9908285Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9908526Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9908761Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9909003Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9909248Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9909496Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9909730Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9909972Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9910262Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9910476Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9910683Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9910917Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9911132Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9911346Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9911567Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9911855Z 
E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9912092Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9912335Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9912567Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9912811Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9913046Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9913288Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9913523Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9913784Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9914017Z 
E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9914233Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9914446Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9914662Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9914888Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9915103Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9915345Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9915581Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9915797Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9916020Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9916243Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9916485Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9916718Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9916960Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9917196Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9917438Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9917674Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9917915Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9918158Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9918399Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9918633Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9918874Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9919115Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9919359Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9919595Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9919840Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9920075Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9920364Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9920614Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9920866Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9921100Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9921313Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9921518Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9921755Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9921997Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9922231Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9922471Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9922726Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9922966Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9923201Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9923458Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9923691Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9923936Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9924169Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9924376Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9924610Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9924851Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9925096Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9925346Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9925579Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9925808Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9926026Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9926240Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9926454Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9926696Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9926930Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9927171Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9927387Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9927602Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9927819Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9928080Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9928315Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9928531Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9928743Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:20.9928949Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9929110Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:20.9929353Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9929568Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9929805Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9930049Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9930315Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9930527Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9930732Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9930964Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9931177Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9931381Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9931628Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9931835Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9932069Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9932286Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9932520Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9932725Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9932959Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9933199Z E1204 11:14:25.013000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9933432Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9933674Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9933935Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9934177Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9934410Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9934654Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9934886Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9935131Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9935363Z E1204 11:14:25.013000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9935576Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9935778Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9936026Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9936254Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9936469Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9936691Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9936907Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9937149Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9937382Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9937623Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9937857Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9938062Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9938316Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9938557Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9938791Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9939033Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9939267Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9939495Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9939709Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9939925Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9940160Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9940379Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9940616Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9940860Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9941106Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9941346Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9941581Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:20.9941784Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9942021Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9942264Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9942497Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9942763Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9942996Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9943200Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9943432Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9943674Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9943910Z E1204 11:14:25.013000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9944154Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9944388Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9944614Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9944848Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9945061Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9945276Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9945527Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9945762Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9945965Z E1204 11:14:25.013000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9946200Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9946443Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9946676Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9946921Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9947173Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9947399Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9947614Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9947828Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9948047Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9948293Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9948528Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9948769Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9949003Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9949253Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9949486Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9949689Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9949931Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9950211Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9950446Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9950688Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9950923Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9951149Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:20.9951366Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9951602Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9951817Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9952058Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9952294Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9952539Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9952773Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9953015Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9953251Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9953490Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9953740Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9953981Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9954218Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9954442Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:20.9954660Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:20.9954865Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:20.9955077Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:20.9955307Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:20.9955530Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:20.9955743Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:20.9955958Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:20.9956174Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:20.9956445Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:20.9956588Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:20.9956749Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:20.9956868Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:20.9957009Z E1204 11:14:25.013000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:20.9957179Z [W1204 11:14:25.476150587 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:20.9957182Z 2025-12-04T12:10:20.9957340Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:20.9957648Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:20.9957956Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:20.9958115Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:20.9958608Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:20.9958890Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:20.9959129Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:20.9959350Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:20.9959564Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9959805Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9960041Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9960317Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9960570Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9960811Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9961043Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9961284Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9961516Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9961757Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9961988Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9962201Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9962439Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9962656Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9962897Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9963129Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9963346Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9963580Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9963822Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9964054Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9964259Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9964496Z E1204 
11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9964716Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9964929Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9965161Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9965371Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9965571Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9965804Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9966045Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9966275Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9966518Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9966749Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9966972Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9967194Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9967407Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9967656Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9967888Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9968129Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9968362Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9968605Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9968836Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9969077Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9969327Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9969567Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9969799Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9970039Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9970309Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9970552Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9970785Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9971027Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9971271Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9971511Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9971744Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9971984Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9972228Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9972473Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9972709Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9972923Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9973135Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:20.9973375Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9973625Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9973878Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9974110Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9974351Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9974585Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9974829Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9975061Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9975305Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9975538Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9975788Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9976021Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9976230Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9976433Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9976681Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9976896Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9977118Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:20.9977332Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9977573Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9977805Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9978032Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9978244Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9978477Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9978688Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9978891Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9979124Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9979364Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9979595Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9979835Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9980076Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9980325Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9980547Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9980764Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9981027Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9981264Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9981479Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9981693Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9981908Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9982152Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9982387Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9982651Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9982890Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9983132Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9983367Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9983611Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9983844Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9984086Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9984320Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9984545Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9984749Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9984988Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9985204Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9985424Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9985642Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9985883Z 
E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9986117Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9986357Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9986591Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9986843Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9987087Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9987334Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9987568Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9987810Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9988044Z 
E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9988262Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9988476Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9988682Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:20.9988911Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:20.9989136Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9989378Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9989610Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9989835Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:20.9990050Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:20.9990303Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:20.9990547Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:20.9990782Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9991027Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9991262Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9991532Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9991766Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9992007Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9992243Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9992484Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9992719Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9992962Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9993197Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9993454Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9993688Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9993931Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9994163Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9994423Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9994660Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9994900Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9995139Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9995353Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:20.9995557Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9995803Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9996057Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9996291Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9996532Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9996768Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9997012Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9997248Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9997488Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9997721Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9997977Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9998211Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9998418Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:20.9998650Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9998901Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9999140Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9999383Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:20.9999616Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:20.9999843Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0000063Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0000334Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0000563Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0000806Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0001041Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0001270Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0001486Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0001698Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0001911Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0002153Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0002412Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0002631Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0002844Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.0003050Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0003234Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0003469Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0003675Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0003908Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0004149Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0004385Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0004598Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0004827Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0005061Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0005277Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0005480Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0005716Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0005920Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0006155Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0006359Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0006592Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0006807Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0007042Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0007286Z E1204 11:14:25.015000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0007532Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0007774Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0008010Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0008251Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0008485Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0008728Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0008963Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0009232Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0009470Z E1204 11:14:25.015000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0009684Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0009887Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0010162Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0010389Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0010607Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0010823Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0011037Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0011305Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0011541Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0011782Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0012032Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0012237Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0012474Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0012718Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0012952Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0013193Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0013430Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0013696Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0013912Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0014126Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0014331Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0014536Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0014772Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0015016Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0015248Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0015494Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0015740Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0015944Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0016177Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0016428Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0016663Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0016907Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0017141Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0017347Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0017583Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0017825Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0018067Z E1204 11:14:25.015000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0018317Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0018550Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0018777Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0018996Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0019211Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0019429Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0019670Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0019904Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0020164Z E1204 11:14:25.015000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0020397Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0020639Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0020885Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0021127Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0021362Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0021594Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0021811Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0022024Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0022240Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0022497Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0022748Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0022988Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0023222Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0023463Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0023699Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0023903Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0024136Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0024378Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0024625Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0024868Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0025102Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0025341Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0025559Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0025772Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0025990Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0026229Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0026466Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0026710Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0026953Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0027212Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0027445Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0027689Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0027923Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0028165Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0028398Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0028610Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0028826Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0029041Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0029252Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0029478Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0029709Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0029924Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0030169Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0030379Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0030564Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0030704Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.0030863Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0030982Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0031121Z E1204 11:14:25.015000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0031216Z ('RERUN', {'yellow': True}) [1.6980s] [100%] 2025-12-04T12:10:21.0031581Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:14:26.928732830 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0031584Z 2025-12-04T12:10:21.0031742Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0032053Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0032361Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0032511Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0033000Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0033266Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0033519Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0033740Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0033955Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0034208Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0034444Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0034687Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0034917Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0035158Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0035390Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0035642Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0035883Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0036124Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0036356Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0036567Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0036790Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0037004Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0037244Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0037477Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0037690Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0037924Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0038165Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0038398Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0038616Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0038849Z E1204 
11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0039061Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0039262Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0039494Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0039703Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0039905Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0040182Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0040434Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0040665Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0040906Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0041141Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0041351Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0041573Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0041786Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0042027Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0042273Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0042514Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0042745Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0042997Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0043230Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0043470Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0043702Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0043941Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0044176Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0044417Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0044681Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0044920Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0045151Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0045391Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0045625Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0045867Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0046098Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0046340Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0046583Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0046822Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0047055Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0047269Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0047489Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0047731Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0047964Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0048205Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0048437Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0048678Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0048921Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0049169Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0049401Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0049640Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0049872Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0050146Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0050381Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0050591Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0050793Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0051038Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0051248Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0051470Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0051682Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0051935Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0052168Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0052381Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0052583Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0052814Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0053024Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0053226Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0053484Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0053724Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0053955Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0054195Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0054428Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0054639Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0054860Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0055074Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0055319Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0055562Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0055779Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0055990Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0056217Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0056460Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0056695Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0056937Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0057170Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0057413Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0057646Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0057908Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0058141Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0058384Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0058620Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0058834Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0059039Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0059271Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0059487Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0059698Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0059922Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0060204Z 
E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0060438Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0060700Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0060934Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0061175Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0061409Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0061650Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0061884Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0062139Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0062385Z 
E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0064176Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0064396Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0064603Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0064829Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0065045Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0065285Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0065522Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0065736Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0065971Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0066188Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0066429Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0066674Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0066917Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0067153Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0067394Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0067629Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0067872Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0068106Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0068370Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0068602Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0068843Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0069079Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0069320Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0069554Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0069794Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0070028Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0070320Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0070557Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0070798Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0071032Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0071259Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0071465Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0071701Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0071941Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0072178Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0072421Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0072669Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0072923Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0073156Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0073398Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0073632Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0073876Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0074113Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0074319Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0074555Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0074808Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0075045Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0075286Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0075521Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0075759Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0075977Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0076190Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0076405Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0076649Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0076882Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0077120Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0077346Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0077560Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0077775Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0078017Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0078251Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0078469Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0078682Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.0078889Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0079060Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0079297Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0079501Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0079737Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0079988Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0080278Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0080494Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0080697Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0080930Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0081142Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0081348Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0081601Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0081818Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0082053Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0082259Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0082495Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0082699Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0082935Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0083177Z E1204 11:14:26.467000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0083411Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0083668Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0083902Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0084143Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0084389Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0084634Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0084867Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0085110Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0085343Z E1204 11:14:26.467000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0085556Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0085761Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0086003Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0086242Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0086459Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0086675Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0086891Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0087133Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0087368Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0087609Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0087843Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0088057Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0088291Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0088534Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0088778Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0089020Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0089253Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0089484Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0089701Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0089915Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0090165Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0090381Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0090629Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0090871Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0091106Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0091347Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0091584Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0091789Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0092022Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0092263Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0092509Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0092754Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0092988Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0093191Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0093438Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0093682Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0093917Z E1204 11:14:26.467000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0094157Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0094393Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0094620Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0094846Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0095069Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0095284Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0095526Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0095759Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0095964Z E1204 11:14:26.467000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0096197Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0096438Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0096674Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0096926Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0097161Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0097387Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0097621Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0097833Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0098051Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0098294Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0098526Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0098769Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0099004Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0099254Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0099502Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0099705Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0099940Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0100217Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0100452Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0100693Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0100929Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0101156Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0101389Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0101605Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0101822Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0102079Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0102312Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0102555Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0102792Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0103033Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0103269Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0103511Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0103771Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0104011Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0104245Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0104459Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0104675Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0104882Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0105092Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0105322Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0105543Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0105767Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0105973Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0106178Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0106365Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0106516Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.0106679Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0106798Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0106942Z E1204 11:14:26.467000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0107112Z [W1204 11:14:26.931119289 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0107115Z 2025-12-04T12:10:21.0107274Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0107584Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0107891Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0108056Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0108549Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0108820Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0109061Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0109282Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0109496Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0109736Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0109970Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0110255Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0110488Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0110728Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0110977Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0111220Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0111453Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0111695Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0111927Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0112140Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0112363Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0112601Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0112842Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0113074Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0113278Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0113510Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0113752Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0113984Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0114186Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0114418Z E1204 
11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0114647Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0114850Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0115082Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0115304Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0115506Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0115738Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0115979Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0116209Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0116450Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0116680Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0116905Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0117138Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0117350Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0117591Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0117822Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0118064Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0118295Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0118535Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0118767Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0119018Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0119253Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0119492Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0119735Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0119976Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0120246Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0120486Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0120716Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0120957Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0121189Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0121456Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0121687Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0121927Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0122160Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0122401Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0122634Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0122847Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0123058Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0123301Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0123545Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0123786Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0124016Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0124268Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0124499Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0124741Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0124973Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0125214Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0125447Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0125696Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0125938Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0126148Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0126351Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0126582Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0126792Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0127016Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0127230Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0127472Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0127703Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0127926Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0128128Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0128360Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0128579Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0128781Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0129014Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0129259Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0129492Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0129733Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0129965Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0130229Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0130450Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0130662Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0130907Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0131142Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0131362Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0131575Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0131790Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0132032Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0132280Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0132522Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0132756Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0133010Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0133245Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0133489Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0133723Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0133964Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0134198Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0134413Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0134641Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0134879Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0135095Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0135309Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0135526Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0135769Z 
E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0136003Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0136244Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0136478Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0136731Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0136966Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0137207Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0137452Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0137695Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0137931Z 
E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0138148Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0138361Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0138568Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0138794Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0139018Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0139273Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0139510Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0139727Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0139941Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0140200Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0140444Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0140677Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0140920Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0141168Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0141411Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0141647Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0141901Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0142137Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0142380Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0142614Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0142854Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0143088Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0143330Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0143589Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0143922Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0144157Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0144398Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0144634Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0144875Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0145110Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0145324Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0145541Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0145779Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0146022Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0146256Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0146507Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0146742Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0146984Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0147218Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0147460Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0147696Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0147956Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0148200Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0148405Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0148641Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0148883Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0149119Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0149360Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0149594Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0149825Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0150053Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0150308Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0150524Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0150765Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0151012Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0151247Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0151466Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0151679Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0151893Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0152137Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0152385Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0152611Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0152824Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.0153029Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0153192Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0153426Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0153631Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0153868Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0154110Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0154346Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0154571Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0154779Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0155013Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0155239Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0155442Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0155677Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0155884Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0156118Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0156322Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0156554Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0156768Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0157011Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0157251Z E1204 11:14:26.470000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0157486Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0157728Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0157965Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0158206Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0158439Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0158683Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0158926Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0159168Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0159400Z E1204 11:14:26.470000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0159623Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0159826Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0160064Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0160332Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0160547Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0160761Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0160975Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0161236Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0161483Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0161723Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0161962Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0162168Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0162404Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0162646Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0162880Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0163122Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0163368Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0163595Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0163811Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0164025Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0164249Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0164457Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0164694Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0164935Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0165169Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0165409Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0165654Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0165868Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0166109Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0166351Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0166584Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0166826Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0167059Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0167262Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0167495Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0167748Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0167983Z E1204 11:14:26.470000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0168225Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0168459Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0168699Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0168917Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0169129Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0169344Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0169586Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0169818Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0170035Z E1204 11:14:26.470000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0170313Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0170554Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0170786Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0171031Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0171265Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0171492Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0171709Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0171922Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0172152Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0172396Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0172629Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0172870Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0173115Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0173358Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0173592Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0173797Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0174031Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0174277Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0174525Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0174776Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0175009Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0175236Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0175455Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0175669Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0175883Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0176124Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0176360Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0176613Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0176846Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0177087Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0177331Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0177572Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0177808Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0178048Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0178287Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0178500Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0178717Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0178931Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0179156Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0179384Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0179604Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0179820Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0180025Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0180277Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0180463Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0180606Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.0180767Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0180909Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0181053Z E1204 11:14:26.470000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0181227Z [W1204 11:14:26.934463946 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0181229Z 2025-12-04T12:10:21.0181387Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0181707Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0182013Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0182161Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0182655Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0182925Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0183164Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0183409Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0183624Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0183864Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0184100Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0184346Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0184580Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0184820Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0185054Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0185294Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0185536Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0185779Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0186009Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0186235Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0186462Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0186677Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0186920Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0187151Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0187355Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0187587Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0187846Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0188079Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0188281Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0188515Z E1204 
11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0188726Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0188929Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0189161Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0189373Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0189574Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0189818Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0190060Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0190326Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0190581Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0190813Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0191027Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0191248Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0191461Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0191704Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0191935Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0192187Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0192432Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0192675Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0192907Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0193148Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0193381Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0193620Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0193852Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0194092Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0194337Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0194582Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0194814Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0195065Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0195297Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0195539Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0195771Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0196013Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0196246Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0196488Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0196751Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0196966Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0197178Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0197418Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0197654Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0197894Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0198128Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0198370Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0198605Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0198862Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0199095Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0199334Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0199581Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0199823Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0200061Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0200303Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0200506Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0200741Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0200954Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0201201Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0201416Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0201657Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0201890Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0202103Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0202306Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0202538Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0202750Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0202955Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0203201Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0203442Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0203675Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0203928Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0204162Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0204375Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0204598Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0204813Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0205057Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0205293Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0205517Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0205739Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0205952Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0206195Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0206429Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0206673Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0206912Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0207152Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0207388Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0207641Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0207876Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0208117Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0208360Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0208574Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0208783Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0209023Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0209240Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0209455Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0209672Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0209926Z 
E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0210205Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0210445Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0210680Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0210925Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0211160Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0211402Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0211634Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0211877Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0212128Z 
E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0212346Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0212559Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0212777Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0213003Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0213220Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0213462Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0213696Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0213913Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0214126Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0214358Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0214614Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0214847Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0215093Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0215326Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0215569Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0215803Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0216043Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0216281Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0216536Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0216771Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0217012Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0217257Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0217501Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0217735Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0217978Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0218211Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0218453Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0218689Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0218949Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0219186Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0219399Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0219605Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0219840Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0220082Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0220338Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0220580Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0220814Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0221075Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0221312Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0221552Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0221799Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0222042Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0222276Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0222481Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0222714Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0222957Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0223206Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0223461Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0223696Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0223923Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0224140Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0224354Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0224572Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0224814Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0225050Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0225293Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0225509Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0225722Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0225935Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0226186Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0226420Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0226638Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0226853Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.0227060Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0227225Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0227461Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0227677Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0227921Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0228163Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0228398Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0228611Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0228817Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0229050Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0229267Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0229470Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0229715Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0229921Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0230194Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0230403Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0230658Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0230864Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0231099Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0231344Z E1204 11:14:26.473000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0231578Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0231819Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0232065Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0232317Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0232552Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0232793Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0233028Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0233273Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0233511Z E1204 11:14:26.473000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0233724Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0233929Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0234178Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0234407Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0234625Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0234839Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0235062Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0235310Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0235546Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0235788Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0236022Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0236226Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0236473Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0236723Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0236957Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0237198Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0237438Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0237666Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0237883Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0238097Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0238303Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0238518Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0238753Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0238995Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0239228Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0239484Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0239719Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0239922Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0240220Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0240461Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0240695Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0240952Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0241197Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0241403Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0241638Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0241880Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0242112Z E1204 11:14:26.473000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0242355Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0242588Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0242815Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0243044Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0243258Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0243475Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0243716Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0243965Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0244172Z E1204 11:14:26.473000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0244404Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0244645Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0244879Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0245121Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0245365Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0245605Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0245821Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0246035Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0246250Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0246492Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0246726Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0246966Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0247200Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0247459Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0247697Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0247901Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0248134Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0248386Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0248620Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0248862Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0249096Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0249323Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0249541Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0249769Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0249993Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0250270Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0250505Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0250747Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0250980Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0251223Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0251456Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0251701Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0251949Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0252191Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0252425Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0252635Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0252864Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0253069Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0253280Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0253508Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0253732Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0253946Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0254165Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0254384Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0254569Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0254710Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.0254870Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0254988Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0255129Z E1204 11:14:26.473000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0255301Z [W1204 11:14:26.984677844 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0255304Z 2025-12-04T12:10:21.0255462Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0255775Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0256083Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0256236Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0256729Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0257007Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0257245Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0257466Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0257683Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0257927Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0258162Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0258403Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0258646Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0258895Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0259127Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0259367Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0259600Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0259845Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0260076Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0260329Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0260552Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0260782Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0261024Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0261260Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0261484Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0261715Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0261961Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0262195Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0262399Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0262631Z E1204 
11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0262844Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0263058Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0263312Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0263523Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0263724Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0263961Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0264202Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0264435Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0264675Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0264909Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0265131Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0265353Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0265567Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0265808Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0266054Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0266296Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0266528Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0266768Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0266999Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0267240Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0267482Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0267733Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0267970Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0268210Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0268443Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0268685Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0268916Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0269156Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0269388Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0269641Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0269874Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0270153Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0270398Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0270639Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0270870Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0271086Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0271297Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0271538Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0271774Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0272029Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0272276Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0272518Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0272751Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0272994Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0273226Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0273468Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0273700Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0273943Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0274192Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0274405Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0274608Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0274848Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0275062Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0275283Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0275498Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0275737Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0275974Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0276190Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0276403Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0276645Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0276854Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0277055Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0277287Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0277528Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0277761Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0278000Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0278234Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0278456Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0278680Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0278893Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0279137Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0279381Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0279599Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0279812Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0280026Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0280310Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0280545Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0280811Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0281059Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0281300Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0281534Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0281775Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0282010Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0282253Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0282489Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0282706Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0282926Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0283161Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0283376Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0283587Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0283814Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0284059Z 
E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0284294Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0284535Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0284772Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0285013Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0285261Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0285520Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0285754Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0285998Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0286233Z 
E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0286454Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0286667Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0286875Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0287100Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0287326Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0287572Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0287806Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0288024Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0288245Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0288465Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0288709Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0288943Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0289187Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0289420Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0289674Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0289918Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0290197Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0290434Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0290677Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0290914Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0291154Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0291390Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0291631Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0291880Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0292121Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0292355Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0292610Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0292845Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0293090Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0293325Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0293540Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0293748Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0293984Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0294258Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0294494Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0294737Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0294972Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0295214Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0295448Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0295688Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0295921Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0296161Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0296407Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0296618Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0296851Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0297113Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0297347Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0297591Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0297824Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0298052Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0298276Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0298501Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0298731Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0298971Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0299207Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0299434Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0299652Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0299866Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0300080Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0300360Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0300594Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0300828Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0301041Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.0301249Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0301411Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0301657Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0301864Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0302099Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0302342Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0302577Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0302794Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0303010Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0303257Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0303469Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0303672Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0303906Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0304110Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0304347Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0304553Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0304790Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0305006Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0305241Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0305486Z E1204 11:14:26.523000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0305718Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0305969Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0306204Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0306446Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0306679Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0306923Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0307156Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0307407Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0307652Z E1204 11:14:26.523000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0307866Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0308069Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0308304Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0308532Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0308751Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0308965Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0309181Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0309434Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0309668Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0309913Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0310181Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0310399Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0310633Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0310879Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0311114Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0311354Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0311589Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0311829Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0312060Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0312273Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0312482Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0312686Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0312925Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0313168Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0313403Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0313647Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0313898Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0314105Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0314340Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0314580Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0314825Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0315070Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0315308Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0315514Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0315749Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0315991Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0316235Z E1204 11:14:26.523000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0316486Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0316719Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0316949Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0317173Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0317387Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0317603Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0317844Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0318079Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0318296Z E1204 11:14:26.523000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0318531Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0318774Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0319009Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0319267Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0319504Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0319732Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0319947Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0320192Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0320409Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0320664Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0320912Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0321156Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0321393Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0321634Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0321869Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0322075Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0322308Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0322552Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0322797Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0323042Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0323277Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0323503Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0323731Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0323945Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0324161Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0324401Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0324634Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0325792Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0326041Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0326300Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0326533Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0326776Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0327017Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0327261Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0327495Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0327707Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0327924Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0328134Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0328346Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0328574Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0328796Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0329017Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0329224Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0329432Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0329620Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0329760Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.0329921Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0330042Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0330274Z E1204 11:14:26.523000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0330476Z [W1204 11:14:26.987155932 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0330495Z 2025-12-04T12:10:21.0330655Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0330962Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0331269Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0331418Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0331909Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0332178Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0332418Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0332638Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0332854Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0333094Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0333326Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0333582Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0333816Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0334060Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0334296Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0334540Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0334795Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0335044Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0335287Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0335498Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0335721Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0335936Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0336178Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0336413Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0336615Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0336848Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0337089Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0337322Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0337523Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0337757Z E1204 
11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0337980Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0338185Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0338422Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0338632Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0338833Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0339064Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0339328Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0339573Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0339812Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0340046Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0340305Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0340534Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0340750Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0340993Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0341225Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0341465Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0341699Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0341938Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0342168Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0342427Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0342664Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0342906Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0343137Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0343379Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0343625Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0343879Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0344123Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0344364Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0344600Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0344840Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0345074Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0345314Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0345547Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0345789Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0346023Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0346240Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0346453Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0346709Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0346942Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0347183Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0347415Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0347659Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0347893Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0348152Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0348393Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0348648Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0348881Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0349123Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0349356Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0349568Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0349769Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0350001Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0350251Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0350477Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0350696Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0350937Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0351180Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0351393Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0351596Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0351827Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0352039Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0352240Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0352488Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0352747Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0352992Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0353233Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0353464Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0353675Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0353900Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0354114Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0354358Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0354592Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0354811Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0355025Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0355239Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0355481Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0355726Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0355970Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0356204Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0356445Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0356681Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0356937Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0357184Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0357434Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0357667Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0357880Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0358090Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0358325Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0358543Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0358763Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0358978Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0359221Z 
E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0359458Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0359700Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0359932Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0360236Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0360472Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0360712Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0360949Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0361192Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0361443Z 
E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0361673Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0361899Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0362123Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0362348Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0362562Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0362804Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0363041Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0363258Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0363473Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0363688Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0363930Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0364164Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0364404Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0364661Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0364907Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0365145Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0365387Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0365623Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0365879Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0368046Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0368316Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0368554Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0368803Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0369044Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0369287Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0369522Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0369763Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0369999Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0370303Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0370538Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0370751Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0370980Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0371221Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0371466Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0371703Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0371948Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0372185Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0372453Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0372722Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0372965Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0373202Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0373448Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0373686Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0373892Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0374129Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0374372Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0374610Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0374852Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0375091Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0375328Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0375559Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0375777Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0375992Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0376239Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0376474Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0376723Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0376959Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0377200Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0377420Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0377663Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0377903Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0378122Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0378338Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.0378545Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0378711Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0378949Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0379156Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0379400Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0379644Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0379880Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0380152Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0380360Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0380600Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0380814Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0381020Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0381258Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0381510Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0381779Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0381985Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0382226Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0382433Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0382676Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0382921Z E1204 11:14:26.526000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0383159Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0383406Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0383647Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0383897Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0384132Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0384379Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0384629Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0384878Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0385118Z E1204 11:14:26.526000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0385333Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0385542Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0385780Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0386045Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0386276Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0386494Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0386725Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0386971Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0387213Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0387457Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0387703Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0387910Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0388149Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0388399Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0388634Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0388883Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0389133Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0389369Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0389588Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0389813Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0390027Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0390269Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0390531Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0390817Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0391057Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0391300Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0391542Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0391753Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0391997Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0392246Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0392483Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0392729Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0392972Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0393179Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0393420Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0393680Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0393927Z E1204 11:14:26.526000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0394176Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0394420Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0394655Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0394875Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0395123Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0395353Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0395601Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0395839Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0396054Z E1204 11:14:26.526000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0396296Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0396542Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0396783Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0397026Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0397266Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0397495Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0397719Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0397938Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0398172Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0398423Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0398662Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0398909Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0399145Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0399394Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0399675Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0399895Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0400192Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0400438Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0400681Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0400924Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0401165Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0401396Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0401617Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0401838Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0402058Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0402312Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0402550Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0402814Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0403055Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0403300Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0403541Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0403787Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0404103Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0404362Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0404623Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0404845Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0405062Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0405274Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0405487Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0405724Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0405946Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0406167Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0406382Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0406594Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0406790Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0406933Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.0407101Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0407235Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0407385Z E1204 11:14:26.526000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0407560Z [W1204 11:14:26.989240775 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0407570Z 2025-12-04T12:10:21.0407734Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0408052Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0408365Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0408533Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0409065Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0409352Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0409601Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0409824Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0410047Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0410327Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0410568Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0410817Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0411057Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0411305Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0411540Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0411788Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0412041Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0412289Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0412522Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0412743Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0412979Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0413217Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0413476Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0413728Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0413940Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0414177Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0414423Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0414663Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0414872Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0415112Z E1204 
11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0415327Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0415535Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0415770Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0415988Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0416196Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0416447Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0416697Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0416935Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0417183Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0417417Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0417639Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0417907Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0418136Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0418385Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0418622Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0418869Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0419109Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0419356Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0419595Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0419838Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0420079Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0420426Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0420665Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0420909Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0421160Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0421411Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0421645Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0421890Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0422126Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0422395Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0422650Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0422906Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0423143Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0423390Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0423631Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0423850Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0424065Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0424311Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0424547Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0424795Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0425030Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0425276Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0425525Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0425773Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0426011Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0426252Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0426491Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0426733Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0426984Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0427210Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0427430Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0427679Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0427894Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0428123Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0428340Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0428586Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0428820Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0429037Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0429247Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0429483Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0429708Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0429912Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0430221Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0430466Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0430704Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0430953Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0431187Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0431417Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0431668Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0431905Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0432152Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0432392Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0432617Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0432832Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0433054Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0433299Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0433539Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0433784Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0434027Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0434276Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0434513Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0434782Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0435020Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0435270Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0435507Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0435726Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0435950Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0436199Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0436433Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0436648Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0436868Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0437113Z 
E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0437353Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0437605Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0437840Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0438097Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0438334Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0438582Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0438821Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0439062Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0439313Z 
E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0439535Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0439755Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0439962Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0440244Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0440466Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0440739Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0440991Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0441208Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0441423Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0441643Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0441892Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0442133Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0442379Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0442620Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0442864Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0443103Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0443346Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0443586Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0443845Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0444081Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0444334Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0444570Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0444830Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0445076Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0445333Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0445584Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0445827Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0446067Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0446312Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0446555Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0446770Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0446980Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0447223Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0447469Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0447709Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0447952Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0448204Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0448452Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0448693Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0448940Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0449174Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0449420Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0449668Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0449908Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0450190Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0450434Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0450678Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0450922Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0451163Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0451394Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0451616Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0451835Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0452053Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0452301Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0452535Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0452785Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0453004Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0453222Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0453443Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0453685Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0453923Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0454162Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0454392Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.0454612Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0454781Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0455021Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0455229Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0455470Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0455714Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0455951Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0456166Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0456376Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0456617Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0456833Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0457043Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0457290Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0457499Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0457735Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0457945Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0458182Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0458388Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0458638Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0458907Z E1204 11:14:26.528000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0459146Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0459389Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0459629Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0459877Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0460158Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0460404Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0460638Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0460888Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0461127Z E1204 11:14:26.528000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0461345Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0461554Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0461804Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0462037Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0462256Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0462475Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0462691Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0462939Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0463191Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0463471Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0463711Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0463917Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0464155Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0464399Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0464640Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0464885Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0465126Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0465358Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0465577Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0465796Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0466002Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0466222Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0466461Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0466704Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0466943Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0467189Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0467426Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0467642Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0467914Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0468159Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0468393Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0468638Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0468874Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0469083Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0469320Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0469566Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0469802Z E1204 11:14:26.528000 
834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0470046Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0470322Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0470548Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0470781Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0470996Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0471218Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0471465Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0471699Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0471906Z E1204 11:14:26.528000 834730 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0472155Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0472427Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0472665Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0472906Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0473144Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0473378Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0473598Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0473812Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0474032Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0474278Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0474513Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0474757Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0474990Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0475247Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0475487Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0475695Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0475932Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0476173Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0476410Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0476684Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0476932Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0477158Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0477381Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0477601Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0477817Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0478062Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0478298Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0478542Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0478778Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0479023Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0479260Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0479505Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0479753Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0479996Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0480270Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0480482Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0480701Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0480910Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0481138Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0481398Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0481623Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0481838Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0482044Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0482254Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0482443Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0482584Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.0482747Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0482866Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0483009Z E1204 11:14:26.528000 834730 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0483070Z FAILED [1.6078s] [100%] 2025-12-04T12:10:21.0483074Z 2025-12-04T12:10:21.0483151Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.0483330Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.0483403Z Traceback (most recent call last): 2025-12-04T12:10:21.0483589Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.0483654Z method(*args, **kwargs) 2025-12-04T12:10:21.0483822Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.0483882Z method(*args, **kwargs) 2025-12-04T12:10:21.0484062Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.0484122Z with policy(): 2025-12-04T12:10:21.0484289Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.0484352Z raise RuntimeError(msg) 2025-12-04T12:10:21.0484784Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. 
CUDA driver allocated memory was 807403520 and is now 1954545664. 2025-12-04T12:10:21.0484788Z 2025-12-04T12:10:21.0484885Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.0485182Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.0485206Z 2025-12-04T12:10:21.0485316Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.0485421Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.0485485Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.0485590Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.0486160Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.0486282Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.0486346Z graph_break [] 2025-12-04T12:10:21.0486431Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.0486529Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.0487032Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.0487106Z current_size = base.storage().size() 2025-12-04T12:10:21.0487167Z Autotune Choices Stats: 2025-12-04T12:10:21.0487568Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008559999987483025, "best_triton_pos": 0} 2025-12-04T12:10:21.0487652Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.0487727Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.0487847Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.0488108Z triton_mm_34 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0488357Z triton_mm_33 0.0097 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0488608Z triton_mm_29 0.0106 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0488853Z triton_mm_22 0.0112 ms 76.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0489093Z triton_mm_21 0.0119 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0489341Z triton_mm_30 0.0120 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0489587Z triton_mm_23 0.0121 ms 70.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0489852Z triton_mm_15 0.0123 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0490141Z triton_mm_31 0.0123 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0490381Z triton_mm_16 0.0129 ms 66.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0490534Z SingleProcess AUTOTUNE benchmarking takes 0.1712 seconds and 8.8538 seconds precompiling for 33 choices 2025-12-04T12:10:21.0490710Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.0490780Z Traceback (most recent call last): 2025-12-04T12:10:21.0490958Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.0491020Z method(*args, **kwargs) 
2025-12-04T12:10:21.0491196Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.0491256Z method(*args, **kwargs) 2025-12-04T12:10:21.0491427Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.0491484Z with policy(): 2025-12-04T12:10:21.0491657Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.0491719Z raise RuntimeError(msg) 2025-12-04T12:10:21.0492145Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:21.0492149Z 2025-12-04T12:10:21.0492244Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.0492537Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.0492539Z 2025-12-04T12:10:21.0492646Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.0492756Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.0492826Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.0492902Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.0493474Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), 
('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.0493592Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.0493653Z graph_break [] 2025-12-04T12:10:21.0493736Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.0493831Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.0494357Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.0494444Z current_size = base.storage().size() 2025-12-04T12:10:21.0494504Z Autotune Choices Stats: 2025-12-04T12:10:21.0494894Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008559999987483025, "best_triton_pos": 0} 2025-12-04T12:10:21.0494978Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.0495047Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.0495168Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.0495420Z triton_mm_34 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0495668Z triton_mm_33 0.0097 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0495906Z triton_mm_29 0.0106 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0496149Z triton_mm_22 0.0112 ms 76.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0496392Z triton_mm_21 0.0119 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0496632Z triton_mm_30 0.0120 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0496876Z triton_mm_23 0.0121 ms 70.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0497128Z triton_mm_15 0.0123 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0497375Z triton_mm_31 0.0123 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0497618Z triton_mm_16 0.0129 ms 66.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0497766Z SingleProcess AUTOTUNE benchmarking takes 0.1712 seconds and 8.8538 seconds precompiling for 33 choices 2025-12-04T12:10:21.0497861Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.0497923Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.0498003Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.0498135Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.0498651Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.0498720Z graph_break [] 2025-12-04T12:10:21.0498806Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.0498897Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.0499282Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:21.0499394Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.0499459Z Autotune Choices Stats: 2025-12-04T12:10:21.0499845Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009159999899566174, "best_triton_pos": 0} 2025-12-04T12:10:21.0499925Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.0499996Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.0500162Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.0500416Z triton_mm_72 0.0092 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0500479Z _scaled_mm 0.0097 ms 94.2% 2025-12-04T12:10:21.0500726Z triton_mm_71 0.0109 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0500968Z triton_mm_67 0.0113 ms 80.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0501228Z triton_mm_68 0.0115 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0501470Z triton_mm_59 0.0118 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0501707Z triton_mm_60 0.0121 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0501949Z triton_mm_53 0.0127 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0502189Z triton_mm_69 0.0128 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0502445Z triton_mm_54 0.0130 ms 70.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0502628Z SingleProcess AUTOTUNE benchmarking takes 0.2590 seconds and 0.7998 seconds precompiling for 39 choices 2025-12-04T12:10:21.0502715Z =================================== FAILURES =================================== 2025-12-04T12:10:21.0502892Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.0502957Z Traceback (most recent call last): 2025-12-04T12:10:21.0503134Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.0503195Z method(*args, **kwargs) 2025-12-04T12:10:21.0503368Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in 
wrapper 2025-12-04T12:10:21.0503428Z method(*args, **kwargs) 2025-12-04T12:10:21.0503598Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.0503656Z with policy(): 2025-12-04T12:10:21.0503830Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.0503890Z raise RuntimeError(msg) 2025-12-04T12:10:21.0504317Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:21.0504319Z 2025-12-04T12:10:21.0504412Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.0504704Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.0504707Z 2025-12-04T12:10:21.0504817Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.0504909Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.0504973Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.0505049Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.0505630Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.0505747Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.0505807Z graph_break [] 2025-12-04T12:10:21.0505890Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.0505985Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.0506483Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.0506555Z current_size = base.storage().size() 2025-12-04T12:10:21.0506618Z Autotune Choices Stats: 2025-12-04T12:10:21.0507015Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008559999987483025, "best_triton_pos": 0} 2025-12-04T12:10:21.0507118Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.0507184Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.0507302Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.0507552Z triton_mm_34 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0507800Z triton_mm_33 0.0097 ms 88.4% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0508039Z triton_mm_29 0.0106 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0508280Z triton_mm_22 0.0112 ms 76.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0508523Z triton_mm_21 0.0119 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0508761Z triton_mm_30 0.0120 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0509005Z triton_mm_23 0.0121 ms 70.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0509245Z triton_mm_15 0.0123 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0509487Z triton_mm_31 0.0123 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0509740Z 
triton_mm_16 0.0129 ms 66.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0509886Z SingleProcess AUTOTUNE benchmarking takes 0.1712 seconds and 8.8538 seconds precompiling for 33 choices 2025-12-04T12:10:21.0509980Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.0510041Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.0510168Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.0510284Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.0510786Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.0510857Z graph_break [] 2025-12-04T12:10:21.0510944Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.0511033Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.0511436Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:21.0511563Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.0511622Z Autotune Choices Stats: 2025-12-04T12:10:21.0512004Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009159999899566174, "best_triton_pos": 0} 2025-12-04T12:10:21.0512084Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.0512165Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.0512281Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.0512534Z triton_mm_72 0.0092 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0512596Z _scaled_mm 0.0097 ms 94.2% 2025-12-04T12:10:21.0512844Z triton_mm_71 0.0109 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0513086Z triton_mm_67 0.0113 ms 80.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0513324Z triton_mm_68 0.0115 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0513567Z triton_mm_59 0.0118 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0513804Z triton_mm_60 0.0121 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0514065Z triton_mm_53 0.0127 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0514311Z triton_mm_69 0.0128 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0514549Z triton_mm_54 0.0130 ms 70.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0514697Z SingleProcess AUTOTUNE benchmarking takes 0.2590 seconds and 0.7998 seconds precompiling for 39 choices 2025-12-04T12:10:21.0514788Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.0514853Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.0514939Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.0515058Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.0515564Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), 
('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.0515641Z graph_break [] 2025-12-04T12:10:21.0515723Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.0515818Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.0515877Z Autotune Choices Stats: 2025-12-04T12:10:21.0516261Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_110", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008479000069200993, "best_triton_pos": 0} 2025-12-04T12:10:21.0516345Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.0516411Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.0516529Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.0516780Z triton_mm_110 0.0085 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0517028Z triton_mm_109 0.0090 ms 94.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0517089Z _scaled_mm 0.0093 ms 91.0% 2025-12-04T12:10:21.0517330Z triton_mm_92 0.0108 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0517568Z triton_mm_105 0.0111 ms 76.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0517810Z triton_mm_106 0.0112 ms 75.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0518058Z triton_mm_97 0.0116 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0518297Z triton_mm_98 0.0116 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.0518538Z triton_mm_99 0.0122 ms 69.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0518776Z triton_mm_91 0.0125 ms 67.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.0518924Z SingleProcess AUTOTUNE benchmarking takes 0.2795 seconds and 0.6476 seconds precompiling for 39 choices 2025-12-04T12:10:21.0519147Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-fe0eb0f1af3fb618.xml - 2025-12-04T12:10:21.0519224Z =========================== short test summary info ============================ 2025-12-04T12:10:21.0519882Z FAILED [1.6078s] 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:21.0519896Z 2025-12-04T12:10:21.0519987Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.0520304Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.0520307Z 2025-12-04T12:10:21.0520412Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.0520495Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.0520584Z ================= 1 failed, 187 deselected, 2 rerun in 14.83s ================== 2025-12-04T12:10:21.0520641Z Got exit code 1 2025-12-04T12:10:21.0520701Z Retrying single test... 
2025-12-04T12:10:21.0520862Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-fe7e9aac91275213.xml 2025-12-04T12:10:21.0520941Z ============================= test session starts ============================== 2025-12-04T12:10:21.0521071Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.0521134Z cachedir: .pytest_cache 2025-12-04T12:10:21.0521309Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.0521376Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.0521436Z configfile: pytest.ini 2025-12-04T12:10:21.0521625Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.0521719Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.0522007Z stepcurrent: skipping 108 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.0522069Z Running 1 items in this shard 2025-12-04T12:10:21.0522071Z 2025-12-04T12:10:21.0522451Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:14:37.135979718 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0522455Z 2025-12-04T12:10:21.0522789Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0523101Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0523256Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0523761Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0524075Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0524321Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0524547Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0524768Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0525012Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0525255Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0525499Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0525740Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0525987Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0526221Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0526464Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0526699Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0526951Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0527186Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0527431Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0527665Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0527910Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0528156Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0528370Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0528618Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0528858Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0529093Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0529302Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0529535Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0529779Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0530015Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0530297Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0530531Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0530751Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0530982Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0531156Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0531371Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0531912Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfnmqf382/d4/cd4i7s52vinet5572rcfxtekz764iiph45k3npnsrojp3frclnvc.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.0532077Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.0532311Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.0532483Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.0532803Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.0532987Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.0533265Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.0533420Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.0533693Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.0533866Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.0534156Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.0534310Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.0534598Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.0534808Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.0535139Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0535450Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0535598Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0536100Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0536373Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0536614Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0536836Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0537056Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0537309Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0537559Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0537812Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0538047Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0538288Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0538526Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0538771Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0539005Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0539250Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0539484Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0539729Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0539962Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0540293Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0540548Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0540756Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0540994Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0541237Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0541474Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0541680Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0541932Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0542207Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0542442Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0542694Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0542928Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0543150Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0543377Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0543556Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0543757Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0543877Z E1204 11:14:45.516000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.0544056Z [W1204 11:14:45.032850797 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.0544058Z 2025-12-04T12:10:21.0544385Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.0544699Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0544847Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0545353Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0545626Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0545865Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0546090Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0546320Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0546578Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0546828Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0547070Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0547309Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0547550Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0547791Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0548031Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0548268Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0548513Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0548750Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0548997Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0549231Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0549520Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0549755Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0549967Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0550249Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0550492Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0550729Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0550959Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0551211Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0551466Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0551704Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0551950Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0552183Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0552405Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0552631Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0552810Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0553006Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0553551Z E1204 11:14:45.573000 840652 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfnmqf382/js/cjsnt5ybembe2fnsixw3iunffuzd23eh7q4zydss5y55mal5cucl.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.0553718Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.0553949Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.0554136Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.0554440Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.0554592Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.0554867Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.0555021Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.0555297Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.0555481Z E1204 11:14:45.573000 840652 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.0555787Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.0555948Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.0556240Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.0556448Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.0556782Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0557091Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0557238Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0557741Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0558013Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0558255Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0558482Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0558708Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0558956Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0559191Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0559439Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0559678Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0559921Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0560204Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0560459Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0560693Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0560935Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0561172Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0561418Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0561654Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0561900Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0562135Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0562344Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0562578Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0562827Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0563065Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0563283Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0563521Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0563767Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0564004Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0564246Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0564485Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0564732Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0564969Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0565147Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0565342Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0565467Z E1204 11:14:45.573000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.0565640Z [W1204 11:14:45.053760605 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.0565642Z 2025-12-04T12:10:21.0565970Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.0566280Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0566426Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0567140Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0567417Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0567661Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0567915Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0568136Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0568383Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0568620Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0568865Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0569098Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0569358Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0569620Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0569862Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0570135Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0570379Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0570617Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0570859Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0571098Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0571343Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0571577Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0571784Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0572018Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0572270Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0572520Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0572731Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0572966Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0573212Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0573450Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0573693Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0573989Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0574224Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0574455Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0574633Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0574828Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0575370Z E1204 11:14:45.594000 840652 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfnmqf382/h4/ch47pvrzjzy5l5g2miliwosvjhxn2mhzlg2mjae43rcj5ww67u2e.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.0575532Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.0575764Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.0575939Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.0576242Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.0576396Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.0576667Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.0576825Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.0577104Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.0577280Z E1204 11:14:45.594000 840652 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.0577563Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.0577717Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.0578012Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.0578221Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.0578579Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0578896Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0579046Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0579542Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0579809Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0580052Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0580310Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0580534Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0580786Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0581023Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0581269Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0581503Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0581764Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0582003Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0582266Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0582502Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0582750Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0583002Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0583257Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0583525Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0583766Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0584003Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0584212Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0584447Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0584692Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0584929Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0585137Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0585371Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0585617Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0585852Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0586093Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0586340Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0586559Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0586791Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0586968Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0587163Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0587285Z E1204 11:14:45.594000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.0587467Z [W1204 11:14:45.080853523 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.0587469Z 2025-12-04T12:10:21.0587654Z [W1204 11:14:45.090341790 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0587666Z 2025-12-04T12:10:21.0588003Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.0588314Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0588463Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0588959Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0589230Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0589470Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0589696Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0589913Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0590247Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0590486Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0590744Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0590986Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0591231Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0591468Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0591709Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0591947Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0592222Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0592468Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0592715Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0592950Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0593199Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0593434Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0593648Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0593884Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0594126Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0594363Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0594570Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0594809Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0595052Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0595308Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0595558Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0595792Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0596013Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0596240Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0596419Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0596625Z E1204 
11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0597185Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfnmqf382/o7/co7nrawcubelamwpu3mcromhl3rhimqzkugscfu76wuqi2vn7vou.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.0597353Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.0597583Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.0597759Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.0598061Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.0598213Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.0598488Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.0598642Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.0598915Z E1204 11:14:45.626000 840652 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.0599089Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.0599380Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.0599530Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.0599834Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.0600044Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.0600422Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0600731Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0600878Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0601390Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0601696Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0601938Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0602164Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0602381Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0602630Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0602867Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0603114Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0603353Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0603597Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0603841Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0604083Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0604437Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0604680Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0604920Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0605166Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0605398Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0605645Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0605893Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0606124Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0606359Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0606604Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0606844Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0607051Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0607290Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0607533Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0607769Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0608016Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0608255Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0608477Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0608705Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0608884Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0609089Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0609212Z E1204 11:14:45.626000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.0609537Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.0609844Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0609992Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0610537Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0610836Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0611076Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0611300Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0611523Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0611765Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0612004Z E1204 11:14:45.639000 840652 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0612249Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0612485Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0612729Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0612968Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0613213Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0613460Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0613707Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0613942Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0614190Z E1204 11:14:45.639000 840652 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0614425Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0614672Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0614921Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0615136Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0615385Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0615627Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0615864Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0616069Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0616312Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0616556Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0616788Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0617035Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0617269Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0617491Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0617716Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0617895Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0618100Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0618641Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfnmqf382/xc/cxc2anr2xernse3b4gn5u4tsll3tzesipkn3uljpaeetr7cv5ujy.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.0618807Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.0619034Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.0619209Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.0619570Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.0619728Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.0620003Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.0620194Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.0620469Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.0620643Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.0620931Z E1204 11:14:45.639000 840652 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.0621082Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.0621375Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.0621586Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.0621915Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0622223Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0622369Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0622882Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0623156Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0623396Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0623621Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0623841Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0624101Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0624362Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0624609Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0624848Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0625093Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0625332Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0625573Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0625811Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0626058Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0626292Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0626539Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0626778Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0627024Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0627269Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0627479Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0627718Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0627960Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0628196Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0628400Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0628659Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0628914Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0629152Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0629397Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0629632Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0629855Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0630082Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0630295Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0630488Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0630612Z E1204 11:14:45.639000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.0630783Z [W1204 11:14:45.113037445 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.0630790Z 2025-12-04T12:10:21.0631117Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.0631426Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0631573Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0632079Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0632351Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0632590Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0632816Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0633054Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0633316Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0633565Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0633812Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0634053Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0634295Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0634533Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0634775Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0635014Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0635258Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0635496Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0635742Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0635974Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0636237Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0636471Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0636682Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0636916Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0637163Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0637401Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0637615Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0637876Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0638118Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0638354Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0638596Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0638835Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0639057Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0639286Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0639463Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0639659Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0640239Z E1204 11:14:45.652000 840652 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpfnmqf382/qc/cqciw5xrx6t6adzlfw6exhkjy4a5qtvq355pcgigeez7m5gbvgsq.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.0640405Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.0640634Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.0640819Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.0641121Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.0641272Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.0641546Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.0641700Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.0641970Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.0642156Z E1204 11:14:45.652000 840652 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.0642464Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.0642613Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.0642904Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.0643111Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.0643443Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0643754Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0643899Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0644392Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0644659Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0644904Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0645125Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0645352Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0645601Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0645838Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0646082Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0646314Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0646572Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0646818Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0647070Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0647305Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0647547Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0647784Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0648025Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0648260Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0648504Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0648736Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0648943Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0649176Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0649419Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0649653Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0649872Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0650142Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0650383Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0650619Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0650860Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0651112Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0651342Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.0651589Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.0651765Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.0651959Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.0652079Z E1204 11:14:45.652000 840652 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.0652150Z ('RERUN', {'yellow': True}) [11.9978s] [100%] 2025-12-04T12:10:21.0652516Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:14:47.096213845 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0652520Z 2025-12-04T12:10:21.0652679Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0652988Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.0653297Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0653443Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0653937Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0654214Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0654459Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0654680Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0654897Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0655141Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0655377Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0655631Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0655887Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0656133Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0656367Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0656608Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0656843Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0657084Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0657319Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0657534Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0657760Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0657981Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0658227Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0658462Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0658676Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0658911Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame 
from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0659152Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0659387Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0659593Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0659825Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0660051Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0660315Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0660563Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0660774Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0660979Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0661215Z E1204 
11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0661458Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0661694Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0661936Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0662173Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0662386Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0662612Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0662828Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0663069Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0663320Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0663562Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0663797Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0664037Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0664276Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0664533Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0664776Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0665029Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0665261Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0665504Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0665740Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0665985Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0666220Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0666465Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0666700Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0666941Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0670948Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0671203Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0671464Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0671711Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0671947Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0672168Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0672380Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0672626Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0672874Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0673155Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0673393Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0673634Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0673872Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0674115Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0674357Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0674598Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0674836Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0675081Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0675318Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0675534Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0675737Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0675989Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0676207Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0676438Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0676656Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0676897Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0677134Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0677358Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0677586Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0677819Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0678033Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0678239Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0678478Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0678724Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0678960Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0679207Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0679442Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0679659Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0679885Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0680140Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0680411Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0680650Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0680872Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0681084Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0681304Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0681553Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0681801Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0682060Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0682307Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0682553Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0682793Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0683038Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0683278Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0683520Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0683757Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0683975Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0684183Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0684418Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0684644Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0684873Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0685090Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0685338Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0685574Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0685819Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0686054Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0686310Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0686565Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0686819Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0687055Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0687296Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0687536Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0687758Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0687970Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0688181Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0688408Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0688629Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0688873Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0689110Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0689341Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0689555Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0689774Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0690017Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0690301Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0690544Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0690808Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0691074Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0691321Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0691566Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0691801Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0692047Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0692282Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0692527Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0692766Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0693014Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0693253Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0693495Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0693734Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0693989Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0694228Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0694475Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0694708Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0694929Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0695135Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0695391Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0695655Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0695892Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0696137Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0696374Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0696621Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0696855Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0697104Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0697344Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0697587Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0697828Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0698037Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0698274Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0698527Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0698767Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0699019Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0699255Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0699485Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0699716Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0699943Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0700322Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0700564Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0700801Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0701031Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0701251Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0701466Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0701685Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0701927Z E1204 
11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0702167Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0702390Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0702604Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0702815Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0702994Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0703235Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0703443Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0703680Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0703926Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0704161Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0704391Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0704608Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0704866Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0705078Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0705288Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0705527Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0705732Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0705971Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0706175Z E1204 11:14:47.650000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0706414Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0706618Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0706859Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0707105Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0707342Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0707607Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0707844Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0708089Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0708325Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0708570Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0708819Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0709072Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0709321Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0709537Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0709746Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0709982Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0710250Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0710471Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0710685Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0710903Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0711146Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0711387Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0711636Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0711873Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0712096Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0712333Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0712580Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0712817Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0713062Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0713309Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0713556Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0713793Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0714006Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0714220Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0714426Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0714666Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0714910Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0715147Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0715393Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0715631Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0715842Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0716077Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0716322Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0716566Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0716813Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0717051Z 
E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0717256Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0717494Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0717751Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0718001Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0718255Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0718492Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0718724Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0718941Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0719160Z E1204 11:14:47.650000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0719376Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0719621Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0719859Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0720067Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0720342Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0720587Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0720824Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0721080Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0721318Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0721551Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0721768Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0721986Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0722215Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0722472Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0722727Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0722970Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0723207Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0723449Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0723689Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0723895Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0724131Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0724373Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0724612Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0724857Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0725092Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0725321Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0725550Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0725768Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0725987Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0726232Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0726469Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0726721Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0726969Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0727221Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0727458Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0727700Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0727939Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0728187Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0728422Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0728636Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0728853Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0729062Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0729274Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0729508Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0729731Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0729961Z E1204 
11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0730210Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0730420Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0730611Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0730756Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:21.0730920Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0731058Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0731203Z E1204 11:14:47.650000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0731391Z [W1204 11:14:47.116656879 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0731406Z 2025-12-04T12:10:21.0731567Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0731879Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0732191Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0732340Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0732833Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0733104Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0733347Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0733569Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0733787Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0734030Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0734285Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0734530Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0734765Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0735010Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0735242Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0735486Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0735730Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0735994Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0736234Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0736446Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0736671Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0736886Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0737131Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0737364Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0737573Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0737810Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0738053Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0738291Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0738494Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0738740Z E1204 
11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0738956Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0739163Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0739402Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0739613Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0739818Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0740063Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0740374Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0740619Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0740866Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0741101Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0741313Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0741539Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0741755Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0742000Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0742234Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0742485Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0742721Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0742963Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0743197Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0743454Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0743691Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0743934Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0744169Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0744415Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0744664Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0744917Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0745160Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0745402Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0745637Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0745879Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0746114Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0746356Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0746594Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0746837Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0747072Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0747300Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0747513Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0747769Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0748004Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0748250Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0748484Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0748733Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0748971Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0749224Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0749488Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0749731Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0749967Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0750246Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0750485Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0750702Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0750908Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0751146Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0751359Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0751584Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0751801Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0752047Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0752301Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0752514Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0752724Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0752960Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0753175Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0753379Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0753636Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0753897Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0754143Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0754387Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0754620Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0754837Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0755061Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0755278Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0755526Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0755762Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0755983Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0756198Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0756416Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0756659Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0756910Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0757157Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0757392Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0757636Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0757873Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0758141Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0758385Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0758640Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0758879Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0759097Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0759304Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0759542Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0759762Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0759976Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0760225Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0760474Z 
E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0760711Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0760959Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0761199Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0761463Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0761700Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0761948Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0762189Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0762434Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0762690Z 
E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0762921Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0763156Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0763364Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0763595Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0763815Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0764059Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0764298Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0764514Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0764730Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0764946Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0765196Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0765434Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0765676Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0765925Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0766169Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0766406Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0766650Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0766885Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0767145Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0767398Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0767653Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0767887Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0768133Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0768371Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0768612Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0768849Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0769091Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0769333Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0769575Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0769811Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0770028Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0770286Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0770525Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0770768Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0771006Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0771248Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0771489Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0771747Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0772007Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0772251Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0772487Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0772734Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0772969Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0773176Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0773415Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0773661Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0773898Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0774139Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0774376Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0774606Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0774840Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0775058Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0775274Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0775626Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0775861Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0776094Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0776336Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0776562Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0776781Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0777025Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0777264Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0777483Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0777702Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.0777910Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0778078Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0778320Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0778527Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0778767Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0779011Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0779262Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0779478Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0779692Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0779935Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0780185Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0780395Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0780645Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0780870Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0781120Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0781331Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0781571Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0781780Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0782023Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0782269Z E1204 11:14:47.655000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0782508Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0782753Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0782994Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0783245Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0783480Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0783728Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0783979Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0784228Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0784468Z E1204 11:14:47.655000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0784684Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0784892Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0785141Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0785384Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0785612Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0785830Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0786053Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0786298Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0786538Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0786783Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0787023Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0787229Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0787470Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0787718Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0787956Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0788205Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0788450Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0788685Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0788904Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0789123Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0789336Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0789542Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0789801Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0790066Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0790343Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0790585Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0790824Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0791033Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0791268Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0791514Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0791749Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0791996Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0792236Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0792445Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0792683Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0792942Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0793182Z E1204 11:14:47.655000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0793426Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0793664Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0793894Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0794131Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0794364Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0794594Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0794841Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0795077Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0795287Z E1204 11:14:47.655000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0795524Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0795772Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0796011Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0796257Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0796499Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0796732Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0796954Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0797169Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0797398Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0797647Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0797883Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0798131Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0798370Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0798629Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0798876Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0799099Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0799339Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0799583Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0799824Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0800069Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0800349Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0800585Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0800804Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0801023Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0801241Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0801489Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0801723Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0801983Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0802225Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0802471Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0802712Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0802955Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0803207Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0803464Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0803718Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0803934Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0804155Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0804366Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0804582Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0804816Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0805038Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0805255Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0805468Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0805677Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0805871Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0806014Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.0806179Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0806311Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0806460Z E1204 11:14:47.655000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0806640Z [W1204 11:14:47.119241886 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0806643Z 2025-12-04T12:10:21.0806809Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0807121Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0807430Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0807589Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0808099Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0808383Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0808630Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0808853Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0809072Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0809315Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0809555Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0809799Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0810039Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0810321Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0810555Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0810819Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0811056Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0811305Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0811541Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0811758Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0811987Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0812222Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0812482Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0812728Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0812942Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0813177Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0813422Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0813660Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0813866Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0814103Z E1204 
11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0814317Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0814523Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0814759Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0814979Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0815186Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0815431Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0815677Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0815912Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0816158Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0816392Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0816621Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0816855Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0817084Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0817330Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0817564Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0817809Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0818042Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0818288Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0818617Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0818861Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0819100Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0819346Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0819583Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0819825Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0820073Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0820356Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0820589Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0820832Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0821067Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0821333Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0821581Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0821839Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0822074Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0822318Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0822557Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0822776Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0822990Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0823232Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0823476Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0823723Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0823965Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0824209Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0824456Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0824703Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0824941Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0825185Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0825423Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0825670Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0825928Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0826165Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0826373Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0826612Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0826824Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0827053Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0827271Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0827521Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0827756Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0827972Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0828180Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0828415Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0828632Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0828835Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0829085Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0829330Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0829570Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0829816Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0830051Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0830315Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0830553Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0830785Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0831031Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0831273Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0831495Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0831714Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0831935Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0832178Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0832420Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0832664Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0832906Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0833152Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0833388Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0833649Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0833889Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0834139Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0834375Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0834593Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0834818Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0835063Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0835294Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0835508Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0835729Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0835977Z 
E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0836218Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0836467Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0836702Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0836950Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0837188Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0837436Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0837673Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0837923Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0838173Z 
E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0838393Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0838613Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0838821Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0839052Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0839282Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0839539Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0839787Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0840009Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0840259Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0840477Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0840725Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0840962Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0841209Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0841449Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0841692Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0841934Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0842178Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0842417Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0842688Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0842928Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0843178Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0843412Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0843659Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0843908Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0844170Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0844422Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0844664Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0844902Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0845145Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0845382Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0845595Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0845802Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0846041Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0846289Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0846527Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0846770Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0847017Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0847260Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0847497Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0847742Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0847977Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0848224Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0848472Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0848703Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0848939Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0849184Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0849421Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0849664Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0849903Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0850162Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0850386Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0850601Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0850821Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0851067Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0851301Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0851551Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0851770Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0851986Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0852202Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0852451Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0852689Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0852918Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0853158Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.0853368Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0853533Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0853769Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0853978Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0854215Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0854461Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0854699Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0854913Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0855121Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0855358Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0855575Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0855782Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0856028Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0856236Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0856475Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0856683Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0856917Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0857125Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0857383Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0857644Z E1204 11:14:47.658000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0857880Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0858123Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0858361Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0858609Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0858848Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0859092Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0859328Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0859572Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0859808Z E1204 11:14:47.658000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0860024Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0860269Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0860520Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0860757Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0860976Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0861192Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0861408Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0861654Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0861915Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0862173Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0862411Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0862619Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0862858Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0863101Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0863340Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0863581Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0863819Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0864049Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0864267Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0864483Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0864693Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0864912Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0865149Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0865398Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0865639Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0865883Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0866122Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0866363Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0866614Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0866864Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0867101Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0867350Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0867586Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0867796Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0868031Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0868278Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0868519Z E1204 11:14:47.658000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0868764Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0869006Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0869236Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0869470Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0869687Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0869909Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0870188Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0870425Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0870636Z E1204 11:14:47.658000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0870907Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0871173Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0871409Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0871657Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0871896Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0872127Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0872349Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0872564Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0872785Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0873033Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0873273Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0873520Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0873755Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0874016Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0874253Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0874465Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0874701Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0874948Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0875193Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0875463Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0875715Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0875944Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0876167Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0876384Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0876606Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0876854Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0877091Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0877344Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0877581Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0877829Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0878066Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0878313Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0878563Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0878808Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0879048Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0879263Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0879486Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0879709Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0879932Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0880210Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0880433Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0880651Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0880861Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0881074Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0881265Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0881414Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.0881580Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0881701Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0881847Z E1204 11:14:47.658000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0882021Z [W1204 11:14:47.161127221 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0882023Z 2025-12-04T12:10:21.0882190Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0882501Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0882813Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0882975Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0883481Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0883755Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0883998Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0884237Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0884466Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0884726Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0884967Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0885211Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0885455Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0885699Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0885937Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0886182Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0886416Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0886661Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0886894Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0887110Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0887334Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0887568Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0887821Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0888057Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0888267Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0888504Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0888763Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0889007Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0889224Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0889461Z E1204 
11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0889679Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0889888Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0890160Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0890377Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0890580Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0890816Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0891061Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0891299Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0891546Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0891784Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0892026Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0892252Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0892472Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0892718Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0892952Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0893199Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0893470Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0893730Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0893968Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0894215Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0894454Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0894697Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0894938Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0895181Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0895419Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0895662Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0895904Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0896150Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0896386Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0896641Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0896877Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0897129Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0897362Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0897608Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0897858Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0898098Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0898326Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0898568Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0898807Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0899050Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0899291Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0899536Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0899771Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0900020Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0900303Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0900556Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0900791Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0901067Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0901305Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0901520Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0901728Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0901961Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0902179Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0902416Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0902653Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0902912Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0903147Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0903364Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0903569Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0903806Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0904019Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0904230Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0904469Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0904713Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0904951Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0905193Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0905430Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0905658Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0905888Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0906108Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0906358Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0906597Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0906829Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0907058Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0907285Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0907534Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0907772Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0908017Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0908256Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0908501Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0908742Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0908989Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0909226Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0909475Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0909709Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0909925Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0910205Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0910451Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0910674Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0910886Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0911108Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0911353Z 
E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0911651Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0911907Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0912145Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0912393Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0912632Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0912879Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0913116Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0913361Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0913597Z 
E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0913818Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0914036Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0914243Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0914472Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0914704Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0914954Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0915191Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0915411Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0915629Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0915848Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0916119Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0916367Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0916614Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0916963Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0917295Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0917535Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0917782Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0918018Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0918266Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0918504Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0918754Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0918995Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0919240Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0919498Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0919742Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0919985Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0920266Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0920507Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0920776Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0921025Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0921256Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0921463Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0921705Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0921949Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0922191Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0922438Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0922675Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0922922Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0923161Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0923412Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0923648Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0923906Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0924146Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0924353Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0924594Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0924839Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0925077Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0925334Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0925603Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0925837Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0926054Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0926275Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0926493Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0926742Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0926978Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0927211Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0927438Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0927654Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0927876Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0928121Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0928369Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0928588Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0928808Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.0929022Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0929188Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.0929431Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0929651Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0929900Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0930205Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0930446Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0930664Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0930870Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0931111Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0931327Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0931541Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0931779Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0931992Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0932233Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0932438Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0932677Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0932903Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0933144Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0933390Z E1204 11:14:47.700000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0933631Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0933878Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0934113Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0934394Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0934646Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0934894Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0935130Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0935379Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0935621Z E1204 11:14:47.700000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0935840Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0936049Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0936286Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0936520Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0936740Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0936959Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0937178Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0937431Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0937672Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0937923Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0938161Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0938368Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0938608Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0938877Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0939123Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0939370Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0939606Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0939841Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0940063Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0940329Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0940541Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0940749Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0940991Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0941234Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0941473Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0941721Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0941974Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.0942188Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0942423Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0942669Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0942904Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0943152Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0943423Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0943643Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0943880Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0944126Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0944367Z E1204 11:14:47.700000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0944612Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0944850Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0945081Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0945300Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0945517Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0945736Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0945985Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0946224Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0946445Z E1204 11:14:47.700000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0946685Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0946929Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0947168Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0947411Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0947649Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0947901Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0948135Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0948354Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0948570Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0948817Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0949054Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0949302Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0949537Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0949783Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0950024Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0950273Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0950514Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0950758Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0951008Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0951251Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0951492Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0951724Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.0951942Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0952161Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0952395Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0952671Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0952906Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0953153Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0953391Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0953636Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0953873Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0954115Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0954359Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0954609Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0954845Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0955060Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.0955278Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.0955501Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.0955715Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.0955949Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.0956174Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.0956392Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.0956604Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.0956823Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.0957036Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.0957178Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.0957342Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.0957464Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.0957610Z E1204 11:14:47.700000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.0957784Z [W1204 11:14:47.163239344 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.0957789Z 2025-12-04T12:10:21.0957951Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.0958268Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.0958582Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.0958732Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.0959227Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.0959502Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.0959747Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.0959980Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.0960234Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0960481Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0960722Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0960972Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0961228Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0961488Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0961736Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0961982Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0962218Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0962467Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0962709Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0962922Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0963148Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0963364Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0963609Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0963845Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0964054Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0964291Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0964548Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0964788Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0964993Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0965230Z E1204 
11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0965442Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0965649Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0965910Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0966134Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0966343Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0966577Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0966827Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0967063Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0967309Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0967546Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0967760Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0967988Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0968203Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0968452Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0968686Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0968944Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0969184Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0969428Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0969667Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0969909Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0970186Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0970457Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0970712Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0970960Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0971195Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0971440Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0971676Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0971922Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0972157Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0972401Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0972641Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0972887Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0973125Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0973368Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0973622Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0973841Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0974058Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.0974303Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0974539Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0974796Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0975044Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0975305Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0975537Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0975786Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0976023Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0976267Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0976503Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0976746Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0976982Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0981014Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0981230Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0981470Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0981686Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0981953Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.0982172Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0982418Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0982653Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0982868Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0983093Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0983340Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0983568Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0983770Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0984008Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0984251Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0984488Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0984734Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0984968Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0985190Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0985414Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0985631Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0985878Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0986119Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0986349Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0986564Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0986788Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0987032Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0987273Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0987517Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0987777Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0988034Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0988268Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0988513Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0988750Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0988996Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0989234Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0989454Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.0989664Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.0989902Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0990154Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0990368Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0990587Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0990845Z 
E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0991083Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0991329Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0991566Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0991813Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0992049Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0992322Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0992573Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0992816Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0993055Z 
E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0993273Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0993494Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0993704Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.0993933Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.0994151Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0994395Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0994634Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0994851Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.0995069Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.0995293Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.0995544Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.0995783Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0996025Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0996262Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0996504Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0996764Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0997023Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0997260Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0997511Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0997747Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0997991Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0998226Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0998471Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0998708Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0998953Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0999193Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0999435Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.0999676Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.0999929Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1000203Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1000418Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1000626Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1000866Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1001128Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1001378Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1001633Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1001873Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1002120Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1002355Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1002602Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1002837Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1003081Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1003316Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1003527Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1003768Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1004011Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1004260Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1004504Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1004741Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1004970Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1005191Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1005407Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1005632Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1005890Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1006140Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1006370Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1006588Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1006805Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1007025Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1007268Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1007506Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1007725Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1007947Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.1008156Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1008322Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.1008560Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1008776Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1009015Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1009260Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1009499Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1009713Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1009923Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1010231Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1010457Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1010665Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1010902Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1011112Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1011351Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1011560Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1011799Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1012008Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1012250Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1012496Z E1204 11:14:47.702000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1012733Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1012975Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1013226Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1013472Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1013709Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1013954Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1014190Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1014436Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1014700Z E1204 11:14:47.702000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1014929Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1015137Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1015371Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1015603Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1015823Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1016040Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1016258Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1016505Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1016743Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1016987Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1017225Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1017429Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1017677Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1017922Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1018163Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1018408Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1018642Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1018874Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1019103Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1019341Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1019548Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1019756Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1019997Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1020297Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1020537Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1020781Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1021022Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1021228Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1021468Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1021714Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1021949Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1022210Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1022449Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1022657Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1022896Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1023140Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1023380Z E1204 11:14:47.702000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1023637Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1023901Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1024130Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1024353Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1024572Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1024794Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1025039Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1025273Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1025481Z E1204 11:14:47.702000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1025716Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1025962Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1026200Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1026445Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1026695Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1026924Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1027145Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1027361Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1027579Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1027833Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1028087Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1028352Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1028591Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1028838Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1029077Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1029287Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1029525Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1029767Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1030004Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1030284Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1030522Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1030751Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1030970Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1031200Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1031417Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1031663Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1031899Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1032144Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1032379Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1032661Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1032912Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1033154Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1033391Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1033634Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1033874Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1034086Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.1034307Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.1034515Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.1034730Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.1034964Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.1035185Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.1035402Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.1035618Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.1035829Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.1036021Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.1036164Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.1036328Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.1036447Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.1036591Z E1204 11:14:47.702000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1036766Z [W1204 11:14:47.165236508 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1036782Z 2025-12-04T12:10:21.1036944Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1037270Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1037592Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1037742Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1038238Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1038510Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1038752Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1038976Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1039196Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1039440Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1039681Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1039922Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1040213Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1040457Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1040697Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1040944Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1041178Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1041439Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1041686Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1041916Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1042140Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1042360Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1042605Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1042840Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1043048Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1043282Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1043527Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1043760Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1043968Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1044204Z E1204 
11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1044417Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1044634Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1044868Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1045087Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1045291Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1045529Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1045775Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1046021Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1046290Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1046524Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1046740Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1046964Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1047187Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1047436Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1047670Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1047916Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1048150Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1048395Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1048631Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1048876Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1049138Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1049382Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1049618Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1049859Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1050140Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1050386Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1050652Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1050907Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1051140Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1051391Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1051625Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1051870Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1052106Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1052348Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1052583Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1052801Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1053016Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.1053257Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1053496Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1053753Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1053988Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1054233Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1054465Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1054709Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1054952Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1055207Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1055457Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1055699Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1055934Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1056148Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1056354Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1056595Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1056810Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1057040Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.1057258Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1057508Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1057742Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1057958Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1058172Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1058410Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1058626Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1058830Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1059067Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1059311Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1059572Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1059826Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1060062Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1060301Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1060526Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1060745Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1060992Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1061233Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1061453Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1061675Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1061896Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1062141Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1062380Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1062640Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1062879Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1063125Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1063365Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1063612Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1063849Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1064122Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1064378Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1064597Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1064804Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1065046Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1065267Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1065482Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1065700Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1065950Z 
E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1066190Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1066433Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1066673Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1066919Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1067164Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1067412Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1067649Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1067899Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1068137Z 
E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1068356Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1068595Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1068815Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1069043Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1069259Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1069507Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1069748Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1069972Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1070225Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1070444Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1070692Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1070928Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1071176Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1071416Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1071671Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1071910Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1072162Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1072400Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1072643Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1072881Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1073152Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1073400Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1073646Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1073881Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1074130Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1074366Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1074613Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1074852Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1075096Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1075335Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1075551Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1075761Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1075995Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1076254Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1076496Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1076739Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1076979Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1077220Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1077469Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1077721Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1077968Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1078217Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1078453Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1078660Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1078897Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1079140Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1079376Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1079618Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1079854Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1080083Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1080350Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1080563Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1080796Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1081043Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1081279Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1081510Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1081728Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1081957Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1082189Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1082449Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1082688Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1082905Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1083121Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.1083328Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1083494Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.1083729Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1083938Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1084174Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1084420Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1084657Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1084871Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1085090Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1085328Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1085544Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1085749Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1085988Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1086195Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1086439Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1086666Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1086901Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1087107Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1087345Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1087593Z E1204 11:14:47.704000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1087829Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1088072Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1088308Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1088551Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1088787Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1089031Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1089265Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1089519Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1089756Z E1204 11:14:47.704000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1089973Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1090231Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1090469Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1090702Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1090936Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1091177Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1091393Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1091637Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1091872Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1092117Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1092356Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1092560Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1092797Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1093039Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1093276Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1093520Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1093756Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1093998Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1094216Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1094433Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1094643Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1094851Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1095086Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1095341Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1095588Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1095840Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1096076Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1096282Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1096519Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1096762Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1096999Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1097245Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1097482Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1097690Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1097928Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1098178Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1098422Z E1204 11:14:47.704000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1098671Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1098912Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1099141Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1099363Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1099579Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1099823Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1100078Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1100371Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1100579Z E1204 11:14:47.704000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1100815Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1101061Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1101298Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1101546Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1101782Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1102018Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1102242Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1102458Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1102676Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1102934Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1103174Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1103418Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1103656Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1103992Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1104228Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1104454Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1104703Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1104963Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1105201Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1105444Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1105684Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1105912Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1106136Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1106350Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1106572Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1106821Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1107060Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1107306Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1107551Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1107798Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1108034Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1108280Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1108519Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1108761Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1109021Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1109244Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.1109466Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.1109673Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.1109888Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.1110143Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.1110367Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.1110584Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.1110791Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.1111002Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.1111190Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.1111340Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.1111502Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.1111626Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.1111771Z E1204 11:14:47.704000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1111842Z ('RERUN', {'yellow': True}) [1.7395s] [100%] 2025-12-04T12:10:21.1112231Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:14:49.647870021 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1112235Z 2025-12-04T12:10:21.1112397Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1112712Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1113022Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1113174Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1113698Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1113979Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1114224Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1114449Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1114669Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1114916Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1115153Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1115401Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1115637Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1115882Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1116116Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1116361Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1116614Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1116860Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1117098Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1117313Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1117540Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1117756Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1118023Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1118269Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1118477Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1118713Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1118954Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1119191Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1119396Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1119630Z E1204 
11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1119846Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1120050Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1120318Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1120531Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1120737Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1120986Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1121232Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1121470Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1121716Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1121953Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1122167Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1122406Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1122648Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1122895Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1123130Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1123372Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1123610Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1123852Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1124091Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1124340Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1124578Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1124823Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1125056Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1125300Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1125543Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1125791Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1126026Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1126270Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1126509Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1126753Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1127014Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1127266Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1127503Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1127748Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1127982Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1128201Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1128413Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.1128658Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1128892Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1129138Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1129377Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1129621Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1129858Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1130144Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1130381Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1130622Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1130857Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1131101Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1131347Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1131575Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1131799Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1132036Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1132249Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1132476Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.1132695Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1132941Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1133179Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1133391Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1133599Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1133834Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1134051Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1134257Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1134512Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1134759Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1134993Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1135242Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1135475Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1135691Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1135928Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1136165Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1136416Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1136654Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1136877Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1137092Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1137312Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1137560Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1137796Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1138042Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1138278Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1138524Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1138762Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1139020Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1139260Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1139505Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1139742Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1139956Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1140204Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1140454Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1140698Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1140913Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1141127Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1141375Z 
E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1141612Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1141856Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1142090Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1142334Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1142572Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1142814Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1143051Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1143292Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1143541Z 
E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1143761Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1143978Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1144187Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1144411Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1144631Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1144883Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1145143Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1145361Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1145578Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1145799Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1146047Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1146289Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1146531Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1146770Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1147013Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1147253Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1147499Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1147734Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1147990Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1148229Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1148479Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1148722Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1148967Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1149207Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1149477Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1149725Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1149967Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1150246Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1150490Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1150726Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1150943Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1151151Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1151389Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1151632Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1151871Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1152117Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1152352Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1152609Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1152845Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1153090Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1153324Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1153572Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1153823Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1154042Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1154290Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1154533Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1154772Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1155014Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1155252Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1155482Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1155698Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1155915Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1156132Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1156378Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1156613Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1156842Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1157073Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1157288Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1157506Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1157748Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1157986Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1158215Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1158444Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.1158663Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1158826Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.1159064Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1159270Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1159508Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1159751Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1159989Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1160235Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1160441Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1160679Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1160894Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1161101Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1161350Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1161557Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1161796Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1162001Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1162240Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1162446Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1162697Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1162951Z E1204 11:14:49.187000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1163202Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1163450Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1163686Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1163935Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1164172Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1164420Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1164654Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1164902Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1165141Z E1204 11:14:49.187000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1165357Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1165568Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1165816Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1166048Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1166270Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1166485Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1166704Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1166948Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1167203Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1167457Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1167710Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1167919Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1168160Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1168409Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1168645Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1168891Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1169125Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1169359Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1169578Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1169794Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1170005Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1170229Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1170486Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1170733Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1170973Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1171218Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1171455Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1171678Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1171926Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1172184Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1172419Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1172668Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1172910Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1173119Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1173358Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1173601Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1173841Z E1204 11:14:49.187000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1174085Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1174324Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1174556Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1174786Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1175005Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1175223Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1175472Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1175708Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1175918Z E1204 11:14:49.187000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1176167Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1176422Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1176673Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1176915Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1177154Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1177384Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1177609Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1177828Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1178046Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1178294Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1178534Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1178780Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1179015Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1179277Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1179517Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1179730Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1179972Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1180262Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1180506Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1180763Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1181016Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1181262Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1181479Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1181696Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1181914Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1182162Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1182402Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1182643Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1182881Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1183124Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1183362Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1183605Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1183867Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1184115Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1184352Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1184569Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.1184786Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.1184997Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.1185225Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.1185474Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.1185708Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.1185922Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.1186153Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.1186363Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.1186555Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.1186698Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.1186869Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.1186989Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.1187136Z E1204 11:14:49.187000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1187311Z [W1204 11:14:49.650580225 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1187317Z 2025-12-04T12:10:21.1187476Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1187793Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1188101Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1188263Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1188756Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1189026Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1189270Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1189492Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1189724Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1189990Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1190285Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1190549Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1190784Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1191033Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1191268Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1191513Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1191746Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1191994Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1192232Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1192446Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1192675Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1192909Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1193157Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1193392Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1193602Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1193839Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1194081Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1194357Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1194574Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1194810Z E1204 
11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1195024Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1195234Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1195473Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1195685Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1195893Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1196128Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1196374Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1196608Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1196854Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1197089Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1197312Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1197543Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1197759Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1198006Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1198239Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1198486Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1198732Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1198988Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1199241Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1199499Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1199737Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1199999Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1200295Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1200541Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1200780Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1201024Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1201258Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1201503Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1201736Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1201995Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1202237Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1202481Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1202725Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1202969Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1203221Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1203479Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1203705Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.1203953Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1204190Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1204441Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1204675Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1204920Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1205156Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1205399Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1205635Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1205879Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1206118Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1206360Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1206609Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1206825Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1207031Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1207271Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1207483Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1207711Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.1207948Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1208204Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1208440Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1208652Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1208860Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1209094Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1209309Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1209512Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1209749Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1209998Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1210265Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1210511Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1210745Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1210975Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1211202Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1211423Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1211674Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1211911Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1212133Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1212366Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1212601Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1212857Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1213095Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1213344Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1213581Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1213828Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1214063Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1214310Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1214546Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1214793Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1215034Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1215247Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1215468Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1215706Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1215929Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1216144Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1216382Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1216637Z 
E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1216884Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1217149Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1217399Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1217644Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1217884Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1218128Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1218366Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1218609Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1218848Z 
E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1219066Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1219283Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1219495Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1219725Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1219961Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1220240Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1220478Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1220696Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1220912Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1221131Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1221387Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1221636Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1221894Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1222151Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1222397Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1222636Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1222883Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1223117Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1223364Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1223599Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1223851Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1224093Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1224338Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1224591Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1224839Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1225080Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1225322Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1225565Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1225808Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1226069Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1226298Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1226505Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1226746Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1226996Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1227238Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1227484Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1227723Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1227975Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1228213Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1228461Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1228700Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1228945Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1229218Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1229427Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1229668Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1229910Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1230187Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1230455Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1230709Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1230953Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1231172Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1231391Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1231609Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1231856Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1232095Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1232328Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1232549Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1232766Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1232987Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1233232Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1233470Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1233704Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1233922Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.1234133Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1234297Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.1234535Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1234742Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1234997Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1235264Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1235502Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1235720Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1235926Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1236166Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1236381Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1236588Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1236823Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1237033Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1237272Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1237478Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1237716Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1237923Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1238171Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1238416Z E1204 11:14:49.189000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1238654Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1238899Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1239134Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1239392Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1239639Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1239895Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1240165Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1240410Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1240649Z E1204 11:14:49.189000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1240862Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1241069Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1241305Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1241538Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1241756Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1241975Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1242194Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1242438Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1242690Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1242936Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1243177Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1243382Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1243620Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1243880Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1244129Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1244390Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1244625Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1244857Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1245077Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1245296Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1245507Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1245711Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1245950Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1246195Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1246435Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1246683Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1246924Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1247144Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1247382Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1247629Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1247864Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1248112Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1248358Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1248583Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1248832Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1249076Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1249317Z E1204 11:14:49.189000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1249560Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1249800Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1250031Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1250285Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1250506Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1250723Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1250970Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1251206Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1251415Z E1204 11:14:49.189000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1251678Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1251924Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1252167Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1252409Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1252646Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1252889Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1253122Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1253355Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1253571Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1253817Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1254054Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1254302Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1254537Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1254784Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1255023Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1255230Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1255468Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1255710Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1255955Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1256198Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1256436Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1256667Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1256883Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1257099Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1257326Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1257580Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1257825Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1258068Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1258304Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1258546Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1258783Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1259026Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1259263Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1259507Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1259747Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1259961Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.1260213Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.1260434Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.1260647Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.1260877Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.1261099Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.1261315Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.1261526Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.1261745Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.1261946Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.1262100Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0
2025-12-04T12:10:21.1262261Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0
2025-12-04T12:10:21.1262380Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] .
2025-12-04T12:10:21.1262524Z E1204 11:14:49.189000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice.
2025-12-04T12:10:21.1262696Z [W1204 11:14:49.653827173 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...
2025-12-04T12:10:21.1262699Z
2025-12-04T12:10:21.1262861Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning:
2025-12-04T12:10:21.1263173Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.
2025-12-04T12:10:21.1263480Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1263628Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1264122Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1264393Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1264636Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1264867Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1265086Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1265328Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1265565Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1265808Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1266044Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1266300Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1266559Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1266803Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1267035Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1267277Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1267512Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1267727Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1267953Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1268168Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1268412Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1268646Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1268856Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1269089Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1269342Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1269578Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1269782Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1270019Z E1204 
11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1270264Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1270471Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1270717Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1270944Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1271162Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1271394Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1271639Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1271871Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1272116Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1272348Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1272564Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1272790Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1273005Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1273248Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1273482Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1273739Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1273973Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1274216Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1274450Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1274691Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1274926Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1275178Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1275423Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1275675Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1275910Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1276155Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1276389Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1276631Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1276863Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1277106Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1277341Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1277582Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1277817Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1278057Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1278303Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1278524Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1278738Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.1278983Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1279215Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1279458Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1279701Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1279964Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1280248Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1280491Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1280727Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1280969Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1281205Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1285392Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1285642Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1285856Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1286064Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1286302Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1286513Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1286772Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.1286988Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1287232Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1287468Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1287681Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1287886Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1288136Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1288383Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1288586Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1288820Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1289064Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1289299Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1289543Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1289779Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1289993Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1290252Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1290470Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1290716Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1290952Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1291185Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1291400Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1291619Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1291862Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1292099Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1292346Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1292596Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1292851Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1293100Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1293342Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1293578Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1293824Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1294059Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1294274Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1294483Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1294723Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1294943Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1295157Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1295373Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1295625Z 
E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1295863Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1296107Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1296342Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1296586Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1296821Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1297078Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1297333Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1297576Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1297815Z 
E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1298032Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1298247Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1298454Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1298678Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1298892Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1299137Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1299376Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1299593Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1299807Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1300023Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1300317Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1300552Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1300795Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1301030Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1301273Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1301521Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1301778Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1302030Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1302272Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1302509Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1302754Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1302989Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1303234Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1303470Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1303714Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1303950Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1304194Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1304431Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1304683Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1304922Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1305139Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1305346Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1305587Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1305829Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1306094Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1306344Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1306581Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1306826Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1307066Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1307309Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1307546Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1307790Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1308026Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1308236Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1308471Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1308718Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1308955Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1309212Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1309453Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1309682Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1309901Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1310295Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1310516Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1310791Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1311039Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1311269Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1311487Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1311707Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1311925Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1312170Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1312407Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1312625Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1312841Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.1313048Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1313217Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.1313453Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1313674Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1313913Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1314155Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1314393Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1314605Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1314811Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1315056Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1315282Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1315498Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1315733Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1315941Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1316176Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1316385Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1316623Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1316829Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1317067Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1317309Z E1204 11:14:49.192000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1317547Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1317788Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1318026Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1318276Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1318513Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1318759Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1318995Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1319240Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1319485Z E1204 11:14:49.192000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1319710Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1319923Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1320194Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1320424Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1320642Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1320860Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1321075Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1321320Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1321557Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1321801Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1322039Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1322243Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1322477Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1322744Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1322982Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1323225Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1323461Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1323691Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1323922Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1324152Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1324370Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1324576Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1324811Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1325058Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1325294Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1325540Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1325775Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1325980Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1326219Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1326470Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1326704Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1326949Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1327193Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1327401Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1327636Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1327885Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1328122Z E1204 11:14:49.192000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1328375Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1328620Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1328861Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1329080Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1329297Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1329515Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1329759Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1329995Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1330239Z E1204 11:14:49.192000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1330477Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1330722Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1330958Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1331203Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1331440Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1331681Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1331900Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1332116Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1332334Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1332576Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1332825Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1333082Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1333331Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1333576Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1333812Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1334022Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1334256Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1334502Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1334738Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1334980Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1335221Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1335449Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1335668Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1335882Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1336110Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1336356Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1336591Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1336834Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1337068Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1337322Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1337565Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1337823Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1338060Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1338301Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1338539Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1338755Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.1338974Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.1339179Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.1339392Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.1339623Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.1339846Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.1340059Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.1340294Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.1340516Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.1340704Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.1340850Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.1341014Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.1341132Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.1341275Z E1204 11:14:49.192000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1341448Z [W1204 11:14:49.695210805 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1341465Z 2025-12-04T12:10:21.1341626Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1341951Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1342275Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1342421Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1342921Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1343193Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1343433Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1343655Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1343870Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1344114Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1344352Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1344594Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1344840Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1345082Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1345315Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1345556Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1345791Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1346033Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1346276Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1346510Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1346733Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1346948Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1347189Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1347425Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1347631Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1347863Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1348106Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1348340Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1348545Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1348779Z E1204 
11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1348991Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1349194Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1349439Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1349652Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1349854Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1350127Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1350368Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1350618Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1350874Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1351117Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1351329Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1351554Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1351770Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1352010Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1352244Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1352486Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1352718Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1352962Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1353195Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1353436Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1353669Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1353923Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1354157Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1354398Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1354631Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1354872Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1355116Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1355373Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1355616Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1355857Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1356089Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1356333Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1356565Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1356810Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1357043Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1357258Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1357469Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.1357711Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1357946Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1358196Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1358429Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1358672Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1358904Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1359144Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1359375Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1359627Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1359880Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1360160Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1360396Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1360608Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1360812Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1361045Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1361260Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1361483Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.1361702Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1361947Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1362182Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1362395Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1362611Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1362849Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1363062Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1363268Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1363503Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1363745Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1363995Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1364246Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1364493Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1364706Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1364932Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1365148Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1365395Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1365632Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1365850Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1366067Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1366281Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1366528Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1366765Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1367008Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1367257Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1367501Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1367741Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1367981Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1368218Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1368471Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1368713Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1368937Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1369142Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1369381Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1369597Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1369813Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1370030Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1370309Z 
E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1370544Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1370786Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1371022Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1371263Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1371509Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1371751Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1371985Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1372227Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1372463Z 
E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1372681Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1372913Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1373131Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1373369Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1373585Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1373831Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1374068Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1374287Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1374504Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1374720Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1374972Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1375207Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1375450Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1375684Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1375939Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1376176Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1376417Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1376652Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1376893Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1377131Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1377385Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1377631Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1377883Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1378117Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1378360Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1378594Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1378836Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1379069Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1379312Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1379549Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1379765Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1379972Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1380247Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1380502Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1380736Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1380979Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1381214Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1381455Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1381689Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1381955Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1382202Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1382445Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1382678Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1382883Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1383117Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1383361Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1383593Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1383836Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1384071Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1384298Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1384517Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1384731Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1384956Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1385198Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1385434Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1385663Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1385878Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1386093Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1386316Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1386578Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1386813Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1387032Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1387246Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.1387453Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1387618Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.1387852Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1388057Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1388291Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1388534Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1388770Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1388983Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1389188Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1389439Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1389654Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1389859Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1390140Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1390345Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1390595Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1390815Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1391061Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1391267Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1391502Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1391748Z E1204 11:14:49.234000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1391985Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1392227Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1392461Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1392705Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1392942Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1393185Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1393422Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1393664Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1393913Z E1204 11:14:49.234000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1394130Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1394336Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1394571Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1394799Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1395028Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1395252Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1395478Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1395720Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1395954Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1396198Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1396432Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1396641Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1396880Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1397122Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1397357Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1397600Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1397836Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1398064Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1398294Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1398512Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1398718Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1398925Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1399160Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1399406Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1399660Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1399914Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1400190Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1400395Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1400631Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1400872Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1401108Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1401351Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1401593Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1401802Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1402037Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1402284Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1402520Z E1204 11:14:49.234000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1402780Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1403016Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1403245Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1403463Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1403677Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1403910Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1404166Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1404421Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1404626Z E1204 11:14:49.234000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1404861Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1405106Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1405341Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1405586Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1405820Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1406051Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1406272Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1406488Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1406704Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1406948Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1407194Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1407437Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1407673Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1407918Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1408151Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1408369Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1408615Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1408872Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1409105Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1409353Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1409591Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1409817Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1410036Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1410300Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1410517Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1410760Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1410996Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1411243Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1411475Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1411732Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1411968Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1412213Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1412446Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1412690Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1412939Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1413163Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.1413397Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.1413601Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.1413815Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.1414043Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.1414269Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.1414484Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.1414690Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.1414900Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.1415086Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.1415231Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.1415392Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.1415512Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.1415653Z E1204 11:14:49.234000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1415829Z [W1204 11:14:49.697782822 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1415831Z 2025-12-04T12:10:21.1416004Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1416316Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1416625Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1416770Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1417264Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1417554Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1417805Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1418025Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1418240Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1418484Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1418720Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1418964Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1419200Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1419441Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1419676Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1419917Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1420185Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1420439Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1420676Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1420891Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1421113Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1421327Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1421569Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1421817Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1422042Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1422287Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1422528Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1422763Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1422969Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1423203Z E1204 
11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1423418Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1423619Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1423856Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1424069Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1424271Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1424504Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1424745Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1424995Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1425237Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1425471Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1425685Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1425908Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1426135Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1426386Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1426630Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1426869Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1427105Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1427349Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1427583Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1427827Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1428059Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1428306Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1428540Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1428783Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1429016Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1429256Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1429500Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1429740Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1429978Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1430253Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1430487Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1430749Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1430995Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1431253Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1431488Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1431712Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1431932Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.1432176Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1432416Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1432660Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1432901Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1433145Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1433385Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1433632Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1433880Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1434126Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1434358Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1434599Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1434830Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1435042Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1435257Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1435511Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1435724Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1435945Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.1436161Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1436402Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1436637Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1436852Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1437053Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1437288Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1437499Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1437705Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1437939Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1438183Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1438428Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1438670Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1438907Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1439118Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1439345Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1439576Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1439834Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1440081Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1440344Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1440561Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1440777Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1441023Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1441259Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1441503Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1441739Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1441982Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1442221Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1442463Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1442701Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1442958Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1443195Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1443412Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1443617Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1443856Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1444085Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1444312Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1444541Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1444787Z 
E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1445023Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1445268Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1445505Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1445749Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1445989Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1446229Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1446467Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1446711Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1446946Z 
E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1447165Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1447390Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1447602Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1447828Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1448042Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1448286Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1448530Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1448769Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1448993Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1449211Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1449455Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1449692Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1449939Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1450210Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1450455Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1450689Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1450934Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1451173Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1451414Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1451667Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1451910Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1452146Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1452388Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1452626Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1452870Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1453117Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1453372Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1453623Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1453867Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1454103Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1454319Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1454527Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1454761Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1455006Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1457158Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1457407Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1457642Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1457886Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1458136Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1458399Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1458635Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1458880Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1459114Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1459322Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1459571Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1459826Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1460061Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1460342Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1460581Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1460809Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1461026Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1461239Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1461457Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1461785Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1462021Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1462251Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1462468Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1462694Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1462914Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1463156Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1463393Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1463608Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1463823Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.1464048Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1464211Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.1464462Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1464666Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1464903Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1465146Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1465382Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1465596Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1465799Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1466035Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1466263Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1466470Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1466706Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1466912Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1467157Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1467364Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1467599Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1467802Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1468038Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1468280Z E1204 11:14:49.236000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1468528Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1468781Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1469016Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1469260Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1469495Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1469739Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1469972Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1470252Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1470487Z E1204 11:14:49.236000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1470718Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1470924Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1471158Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1471387Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1471618Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1471835Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1472052Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1472295Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1472533Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1472775Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1473025Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1473250Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1473484Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1473727Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1473963Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1474207Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1474441Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1474669Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1474887Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1475112Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1475320Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1475524Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1475759Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1476010Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1476248Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1476494Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1476727Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1476932Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1477166Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1477419Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1477664Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1477907Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1478143Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1478349Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1478585Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1478830Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1479064Z E1204 11:14:49.236000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1479306Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1479554Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1479784Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1480002Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1480252Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1480484Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1480732Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1480967Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1481182Z E1204 11:14:49.236000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1481420Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1481662Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1481912Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1482172Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1482409Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1482640Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1482862Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1483078Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1483296Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1483543Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1483779Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1484038Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1484274Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1484520Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1484759Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1484974Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1485213Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1485458Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1485696Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1485939Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1486178Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1486419Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1486646Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1486861Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1487078Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1487326Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1487560Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1487805Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1488042Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1488286Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1488538Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1488781Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1489019Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1489264Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1489509Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1489724Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.1489942Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.1490196Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.1490408Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.1490641Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.1490888Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.1491115Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.1491325Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.1491532Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.1491722Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.1491865Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.1492027Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.1492148Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.1492292Z E1204 11:14:49.236000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1492463Z [W1204 11:14:49.699973603 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1492468Z 2025-12-04T12:10:21.1492629Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1492957Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1493267Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1493417Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1493919Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1494191Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1494434Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1494655Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1494873Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1495133Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1495372Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1495628Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1495863Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1496107Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1496341Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1496584Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1496816Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1497060Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1497295Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1497521Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1497747Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1497961Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1498206Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1498451Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1498660Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1498896Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1499136Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1499373Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1499587Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1499822Z E1204 
11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1500045Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1500281Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1500517Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1500730Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1500931Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1501164Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1501408Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1501639Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1501896Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1502131Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1502343Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1502568Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1502794Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1503040Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1503272Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1503516Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1503751Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1503992Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1504242Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1504495Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1504728Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1504969Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1505206Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1505449Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1505683Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1505925Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1506159Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1506414Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1506648Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1506890Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1507124Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1507384Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1507620Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1507862Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1508098Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1508314Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1508537Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.1508781Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1509028Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1509270Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1509503Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1509749Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1509981Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1510238Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1510474Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1510714Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1510968Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1511210Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1511445Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1511669Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1511874Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1512108Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1512320Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1512543Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.1512760Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1513016Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1513249Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1513475Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1513680Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1513913Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1514127Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1514332Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1514567Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1514812Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1515047Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1515305Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1515538Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1515753Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1515975Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1516200Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1516451Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1516686Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1516906Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1517120Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1517342Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1517599Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1517849Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1518095Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1518331Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1518579Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1518813Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1519057Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1519295Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1519538Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1519789Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1520008Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1520254Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1520488Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1520725Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1520942Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1521158Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1521403Z 
E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1521638Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1521898Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1522132Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1522396Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1522637Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1522880Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1523118Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1523361Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1523599Z 
E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1523817Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1524036Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1524271Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1524496Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1524714Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1524959Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1525207Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1525426Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1525647Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1525865Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1526109Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1526348Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1526604Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1526854Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1527097Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1527337Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1527584Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1527819Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1528064Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1528298Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1528544Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1528792Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1529039Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1529277Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1529533Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1529777Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1530019Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1530315Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1530556Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1530793Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1531027Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1531231Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1531483Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1531725Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1531962Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1532210Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1532447Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1532692Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1532927Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1533200Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1533437Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1533682Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1533918Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1534138Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1534378Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1534622Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1534862Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1535104Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1535344Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1535592Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1535824Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1536040Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1536257Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1536504Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1536742Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1536974Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1537197Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1537410Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1537629Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1537885Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1538123Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1538340Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1538567Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.1538780Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1538943Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.1539184Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1539393Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1539633Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1539891Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1540193Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1540430Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1540635Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1540874Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1541089Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1541297Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1541534Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1541742Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1541981Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1542201Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1542438Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1542644Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1542883Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1543139Z E1204 11:14:49.239000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1543378Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1543625Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1543863Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1544107Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1544344Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1544604Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1544851Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1545095Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1545333Z E1204 11:14:49.239000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1545548Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1545756Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1545992Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1546223Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1546440Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1546673Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1546895Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1547140Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1547376Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1547629Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1547870Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1548076Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1548314Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1548558Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1548792Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1549049Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1549298Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1549529Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1549744Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1549963Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1550207Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1550414Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1550652Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1550895Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1551148Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1551393Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1551633Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1551842Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1552093Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1552342Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1552578Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1552826Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1553060Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1553270Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1553528Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1553783Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1554022Z E1204 11:14:49.239000 
840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1554269Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1554508Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1554739Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1554959Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1555175Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1555391Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1555648Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1555884Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1556094Z E1204 11:14:49.239000 840652 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1556330Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1556586Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1556827Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1557070Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1557306Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1557533Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1557754Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1557987Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1558215Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1558460Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1558696Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1558944Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1559182Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1559428Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1559666Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1559873Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1560208Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1560451Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1560691Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1560934Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1561186Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1561421Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1561643Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1561860Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1562078Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1562324Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1562573Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1562833Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1563075Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1563321Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1563561Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1563805Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1564046Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1564288Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1564526Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1564761Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.1564978Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.1565187Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.1565399Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.1565643Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.1565868Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.1566087Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.1566298Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.1566507Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.1566697Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.1566852Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.1567014Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.1567145Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.1567289Z E1204 11:14:49.239000 840652 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1567351Z FAILED [1.6013s] [100%] 2025-12-04T12:10:21.1567353Z 2025-12-04T12:10:21.1567432Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.1567608Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.1567677Z Traceback (most recent call last): 2025-12-04T12:10:21.1567857Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.1567919Z method(*args, **kwargs) 2025-12-04T12:10:21.1568090Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.1568149Z method(*args, **kwargs) 2025-12-04T12:10:21.1568317Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.1568373Z with policy(): 2025-12-04T12:10:21.1568544Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.1568603Z raise RuntimeError(msg) 2025-12-04T12:10:21.1569034Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. 
CUDA driver allocated memory was 807403520 and is now 1954545664. 2025-12-04T12:10:21.1569049Z 2025-12-04T12:10:21.1569146Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.1569439Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.1569442Z 2025-12-04T12:10:21.1569549Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.1569646Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1569708Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1569785Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1570400Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1570520Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1570578Z graph_break [] 2025-12-04T12:10:21.1570661Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1570754Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1571258Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.1571342Z current_size = base.storage().size() 2025-12-04T12:10:21.1571403Z Autotune Choices Stats: 2025-12-04T12:10:21.1571800Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00872000027447939, "best_triton_pos": 0} 2025-12-04T12:10:21.1571897Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.1571965Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.1572085Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.1572342Z triton_mm_34 0.0087 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1572591Z triton_mm_33 0.0090 ms 96.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1572833Z triton_mm_16 0.0107 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1573075Z triton_mm_22 0.0112 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1573317Z triton_mm_29 0.0114 ms 76.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1573568Z triton_mm_30 0.0118 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1573812Z triton_mm_21 0.0119 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1574054Z triton_mm_23 0.0120 ms 72.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1574312Z triton_mm_15 0.0125 ms 69.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1574557Z triton_mm_31 0.0126 ms 69.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1574706Z SingleProcess AUTOTUNE benchmarking takes 0.1747 seconds and 9.1826 seconds precompiling for 33 choices 2025-12-04T12:10:21.1574885Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.1574951Z Traceback (most recent call last): 2025-12-04T12:10:21.1575128Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.1575188Z method(*args, **kwargs) 
2025-12-04T12:10:21.1575371Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.1575430Z method(*args, **kwargs) 2025-12-04T12:10:21.1575601Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.1575667Z with policy(): 2025-12-04T12:10:21.1575837Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.1575895Z raise RuntimeError(msg) 2025-12-04T12:10:21.1576323Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:21.1576325Z 2025-12-04T12:10:21.1576419Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.1576710Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.1576713Z 2025-12-04T12:10:21.1576822Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.1576914Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1576978Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1577054Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1577625Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), 
('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1577753Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1577813Z graph_break [] 2025-12-04T12:10:21.1577894Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1577987Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1578489Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.1578566Z current_size = base.storage().size() 2025-12-04T12:10:21.1578628Z Autotune Choices Stats: 2025-12-04T12:10:21.1579015Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00872000027447939, "best_triton_pos": 0} 2025-12-04T12:10:21.1579098Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.1579163Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.1579282Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.1579533Z triton_mm_34 0.0087 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1579780Z triton_mm_33 0.0090 ms 96.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1580034Z triton_mm_16 0.0107 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1580323Z triton_mm_22 0.0112 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1580562Z triton_mm_29 0.0114 ms 76.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1580802Z triton_mm_30 0.0118 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1581046Z triton_mm_21 0.0119 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1581289Z triton_mm_23 0.0120 ms 72.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1581531Z triton_mm_15 0.0125 ms 69.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1581788Z triton_mm_31 0.0126 ms 69.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1581936Z SingleProcess AUTOTUNE benchmarking takes 0.1747 seconds and 9.1826 seconds precompiling for 33 choices 2025-12-04T12:10:21.1582030Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1582089Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1582164Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1582281Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1582798Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1582854Z graph_break [] 2025-12-04T12:10:21.1582938Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1583030Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1583415Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:21.1583525Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.1583583Z Autotune Choices Stats: 2025-12-04T12:10:21.1583969Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008840000256896019, "best_triton_pos": 0} 2025-12-04T12:10:21.1584064Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.1584131Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.1584261Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.1584515Z triton_mm_72 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1584755Z triton_mm_71 0.0092 ms 96.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1587748Z triton_mm_67 0.0103 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1587989Z triton_mm_54 0.0110 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1588227Z triton_mm_60 0.0111 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1588467Z triton_mm_59 0.0112 ms 78.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1588723Z triton_mm_68 0.0119 ms 74.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1588965Z triton_mm_61 0.0122 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1589211Z triton_mm_69 0.0128 ms 69.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1589449Z triton_mm_53 0.0128 ms 68.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1589609Z SingleProcess AUTOTUNE benchmarking takes 0.2684 seconds and 0.7538 seconds precompiling for 39 choices 2025-12-04T12:10:21.1589685Z =================================== FAILURES =================================== 2025-12-04T12:10:21.1589860Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.1589925Z Traceback (most recent call last): 2025-12-04T12:10:21.1590150Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.1590211Z method(*args, **kwargs) 2025-12-04T12:10:21.1590380Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.1590436Z method(*args, **kwargs) 2025-12-04T12:10:21.1590607Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.1590684Z with policy(): 2025-12-04T12:10:21.1590857Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.1590915Z raise RuntimeError(msg) 2025-12-04T12:10:21.1591339Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 
2025-12-04T12:10:21.1591356Z 2025-12-04T12:10:21.1591449Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.1591737Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.1591740Z 2025-12-04T12:10:21.1591847Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.1591940Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1592002Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1592077Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1592648Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1592763Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1592818Z graph_break [] 2025-12-04T12:10:21.1592901Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1592990Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1593521Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.1593722Z current_size = base.storage().size() 2025-12-04T12:10:21.1593783Z Autotune Choices Stats: 2025-12-04T12:10:21.1594183Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00872000027447939, "best_triton_pos": 0} 2025-12-04T12:10:21.1594268Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.1594335Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.1594452Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.1594705Z triton_mm_34 0.0087 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1594945Z triton_mm_33 0.0090 ms 96.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1595184Z triton_mm_16 0.0107 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1595435Z triton_mm_22 0.0112 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1595688Z triton_mm_29 0.0114 ms 76.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1595923Z triton_mm_30 0.0118 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1596160Z triton_mm_21 0.0119 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1596400Z triton_mm_23 0.0120 ms 72.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1596638Z triton_mm_15 0.0125 ms 69.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1596878Z triton_mm_31 0.0126 ms 69.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1597022Z SingleProcess AUTOTUNE benchmarking takes 0.1747 seconds and 9.1826 seconds precompiling for 33 choices 2025-12-04T12:10:21.1597116Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1597176Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1597264Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1597379Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1597883Z inductor 
[('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1597938Z graph_break [] 2025-12-04T12:10:21.1598018Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1598118Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1598498Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:21.1598608Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.1598666Z Autotune Choices Stats: 2025-12-04T12:10:21.1599047Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008840000256896019, "best_triton_pos": 0} 2025-12-04T12:10:21.1599125Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.1599194Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.1599321Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.1599572Z triton_mm_72 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:21.1599825Z triton_mm_71 0.0092 ms 96.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1600064Z triton_mm_67 0.0103 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1600338Z triton_mm_54 0.0110 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1600575Z triton_mm_60 0.0111 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1600813Z triton_mm_59 0.0112 ms 78.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1601049Z triton_mm_68 0.0119 ms 74.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1601285Z triton_mm_61 0.0122 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1601541Z triton_mm_69 0.0128 ms 69.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1601781Z triton_mm_53 0.0128 ms 68.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1601927Z SingleProcess AUTOTUNE benchmarking takes 0.2684 seconds and 0.7538 seconds precompiling for 39 choices 2025-12-04T12:10:21.1602016Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1602079Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1602165Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1602283Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1602780Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1602835Z graph_break [] 2025-12-04T12:10:21.1602916Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1603004Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1603062Z Autotune Choices Stats: 2025-12-04T12:10:21.1603440Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_110", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00839999970048666, "best_triton_pos": 0} 2025-12-04T12:10:21.1603532Z AUTOTUNE 
scaled_mm(1024x1024, 1024x512, 1024x1, 1x512) 2025-12-04T12:10:21.1603612Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1] 2025-12-04T12:10:21.1603725Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.1603972Z triton_mm_110 0.0084 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1604217Z triton_mm_109 0.0092 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1604280Z _scaled_mm 0.0096 ms 87.9% 2025-12-04T12:10:21.1604517Z triton_mm_98 0.0112 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1604761Z triton_mm_105 0.0112 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1604996Z triton_mm_106 0.0114 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1605235Z triton_mm_92 0.0116 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1605488Z triton_mm_97 0.0118 ms 70.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1605729Z triton_mm_99 0.0120 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1605969Z triton_mm_91 0.0123 ms 68.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1606114Z SingleProcess AUTOTUNE benchmarking takes 0.2658 seconds and 0.6021 seconds precompiling for 39 choices 2025-12-04T12:10:21.1606333Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-fe7e9aac91275213.xml - 2025-12-04T12:10:21.1606413Z =========================== short test summary info ============================ 2025-12-04T12:10:21.1607052Z FAILED [1.6013s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 
2025-12-04T12:10:21.1607056Z 2025-12-04T12:10:21.1607149Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.1607436Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.1607449Z 2025-12-04T12:10:21.1607557Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.1607635Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.1607743Z ================= 1 failed, 187 deselected, 2 rerun in 15.36s ================== 2025-12-04T12:10:21.1607797Z Got exit code 1 2025-12-04T12:10:21.1608032Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.1608172Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.1608332Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-448591afe34a5d7d.xml 2025-12-04T12:10:21.1608407Z ============================= test session starts ============================== 2025-12-04T12:10:21.1608539Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.1608600Z cachedir: .pytest_cache 2025-12-04T12:10:21.1608775Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.1608841Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.1608899Z configfile: pytest.ini 2025-12-04T12:10:21.1609080Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 
2025-12-04T12:10:21.1609172Z collecting ... collected 188 items / 109 deselected / 79 selected 2025-12-04T12:10:21.1609244Z stepcurrent: skipping 109 already run items. 2025-12-04T12:10:21.1609306Z Running 79 items in this shard 2025-12-04T12:10:21.1609309Z 2025-12-04T12:10:21.1610342Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmpw7rzybr8/gd/cgdjqihwb6wei5vogddrxsr3uj6k3mkwcpwmeop433zt6scy7oby.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1610511Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1610757Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1610934Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1611239Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1611391Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1611665Z E1204 11:15:06.248000 
846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1611820Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1612093Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1612277Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1612574Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1612722Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1613013Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1613223Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1613554Z E1204 11:15:06.248000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1614302Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmpw7rzybr8/xl/cxlpsyvwnrk56nfsyrgh7nfpknbn6y5wlvuils5tr5kvs5vihhvw.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1614475Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1614705Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1614873Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1615173Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1615331Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1615603Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1615755Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1616022Z E1204 11:15:06.300000 846575 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1616190Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1616472Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1616631Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1616922Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1617141Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1617468Z E1204 11:15:06.300000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1618211Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpw7rzybr8/yg/cyggx77l4i7zrf7ivjowg6o77wwwtockbmpdrehaz4wylhwiljfg.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1618376Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1618604Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1618772Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1619081Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1619227Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1619500Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1619650Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1619927Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1620142Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1620426Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1620574Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1620863Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1621070Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1621408Z E1204 11:15:06.304000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1622161Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpw7rzybr8/65/c65tpzsr225cayjsrgcktcqiksvzthry7o5t3fk3ni7hl7gdse5m.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1622328Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1622555Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1622725Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1623021Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1623165Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1623436Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1623600Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1623868Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1624037Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1624319Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1624480Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1624769Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1624976Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1625302Z E1204 11:15:06.325000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1626037Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpw7rzybr8/ni/cnin4lnfsblbx6fcfua3ufkr54syid3fvofi5mninn5ugvpfcwvr.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1626225Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1626453Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1626623Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1626924Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1627073Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1627341Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1627493Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1627758Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1627928Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1628222Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1628369Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1628658Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1628863Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1629199Z E1204 11:15:06.330000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1629938Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpw7rzybr8/wn/cwnqq7dy3yhlz2ui4gxvx23k3somra6vvymz23td3natl7ln76kt.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1630134Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1630363Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1630544Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1630841Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1631000Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1631268Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1631420Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1631687Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1631858Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1632140Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1632287Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1632575Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1632795Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1633121Z E1204 11:15:06.343000 846575 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1633192Z ('RERUN', {'yellow': True}) [8.0322s] [ 1%] 2025-12-04T12:10:21.1633541Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda E1204 11:15:08.422000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1633862Z E1204 11:15:08.422000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1634009Z E1204 11:15:08.422000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:21.1634168Z E1204 11:15:08.425000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1634478Z E1204 11:15:08.425000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1634621Z E1204 11:15:08.425000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1634779Z E1204 11:15:08.427000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1635095Z E1204 11:15:08.427000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1635248Z E1204 11:15:08.427000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1635406Z E1204 11:15:08.488000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1635709Z E1204 11:15:08.488000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1635853Z E1204 11:15:08.488000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:21.1636011Z E1204 11:15:08.490000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1636315Z E1204 11:15:08.490000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1636460Z E1204 11:15:08.490000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1636617Z E1204 11:15:08.492000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1636923Z E1204 11:15:08.492000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1637065Z E1204 11:15:08.492000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1637143Z ('RERUN', {'yellow': True}) [1.8614s] [ 1%] 2025-12-04T12:10:21.1637488Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda E1204 11:15:10.078000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1637795Z E1204 11:15:10.078000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1637935Z E1204 11:15:10.078000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:21.1638101Z E1204 11:15:10.080000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1638406Z E1204 11:15:10.080000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1638546Z E1204 11:15:10.080000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1638704Z E1204 11:15:10.082000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1639010Z E1204 11:15:10.082000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1639152Z E1204 11:15:10.082000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1639319Z E1204 11:15:10.136000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1639624Z E1204 11:15:10.136000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1639777Z E1204 11:15:10.136000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:21.1639933Z E1204 11:15:10.138000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1640280Z E1204 11:15:10.138000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1640421Z E1204 11:15:10.138000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1640580Z E1204 11:15:10.140000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1640883Z E1204 11:15:10.140000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.1641025Z E1204 11:15:10.140000 846575 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:21.1641082Z FAILED [1.7096s] [ 1%] 2025-12-04T12:10:21.1641084Z 2025-12-04T12:10:21.1641156Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.1641331Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.1641396Z Traceback (most recent call last): 2025-12-04T12:10:21.1641587Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.1641646Z method(*args, **kwargs) 2025-12-04T12:10:21.1641813Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.1641869Z method(*args, **kwargs) 2025-12-04T12:10:21.1642036Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.1642090Z with policy(): 2025-12-04T12:10:21.1642257Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.1642314Z raise RuntimeError(msg) 2025-12-04T12:10:21.1642757Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1954545664. 
2025-12-04T12:10:21.1642761Z 2025-12-04T12:10:21.1642854Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.1643142Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.1643144Z 2025-12-04T12:10:21.1643249Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.1643340Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1643402Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1643477Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1644061Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1644191Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1644247Z graph_break [] 2025-12-04T12:10:21.1644328Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1644418Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1644916Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.1644984Z current_size = base.storage().size() 2025-12-04T12:10:21.1645043Z Autotune Choices Stats: 2025-12-04T12:10:21.1645435Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009440000168979168, "best_triton_pos": 0} 2025-12-04T12:10:21.1645519Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.1645588Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.1645728Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.1645993Z triton_mm_34 0.0094 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1646055Z _scaled_mm 0.0096 ms 97.9% 2025-12-04T12:10:21.1646304Z triton_mm_33 0.0108 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1646547Z triton_mm_16 0.0110 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1646798Z triton_mm_29 0.0112 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.1647041Z triton_mm_21 0.0118 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1647286Z triton_mm_23 0.0122 ms 77.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1647524Z triton_mm_22 0.0124 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1647766Z triton_mm_30 0.0125 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1648019Z triton_mm_15 0.0126 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1648179Z SingleProcess AUTOTUNE benchmarking takes 0.1756 seconds and 1.5159 seconds precompiling for 33 choices 2025-12-04T12:10:21.1648351Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.1648414Z Traceback (most recent call last): 2025-12-04T12:10:21.1648587Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.1648645Z method(*args, **kwargs) 2025-12-04T12:10:21.1648819Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.1648876Z method(*args, **kwargs) 2025-12-04T12:10:21.1649042Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.1649097Z with policy(): 2025-12-04T12:10:21.1649265Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.1649323Z raise RuntimeError(msg) 2025-12-04T12:10:21.1649741Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:21.1649744Z 2025-12-04T12:10:21.1649834Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.1650202Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.1650204Z 2025-12-04T12:10:21.1650308Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.1650398Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1650459Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1650533Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1651108Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1651226Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1651281Z graph_break [] 2025-12-04T12:10:21.1651360Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1651451Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1651944Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.1652010Z current_size = base.storage().size() 2025-12-04T12:10:21.1652068Z Autotune Choices Stats: 2025-12-04T12:10:21.1652467Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009440000168979168, "best_triton_pos": 0} 2025-12-04T12:10:21.1652562Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.1652629Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.1652765Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.1653015Z triton_mm_34 0.0094 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1653075Z _scaled_mm 0.0096 ms 97.9% 
2025-12-04T12:10:21.1653319Z triton_mm_33 0.0108 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1653562Z triton_mm_16 0.0110 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1653807Z triton_mm_29 0.0112 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1654048Z triton_mm_21 0.0118 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1654301Z triton_mm_23 0.0122 ms 77.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1654541Z triton_mm_22 0.0124 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1654782Z triton_mm_30 0.0125 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1655027Z triton_mm_15 0.0126 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1655182Z SingleProcess AUTOTUNE benchmarking takes 0.1756 seconds and 1.5159 seconds precompiling for 33 choices 2025-12-04T12:10:21.1655277Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1655337Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1655413Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1655528Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1656029Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1656084Z graph_break [] 2025-12-04T12:10:21.1656164Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1656263Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1656324Z Autotune Choices Stats: 2025-12-04T12:10:21.1656707Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_71", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009080000221729279, "best_triton_pos": 0} 2025-12-04T12:10:21.1659120Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.1659188Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.1659323Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 
2025-12-04T12:10:21.1659574Z triton_mm_71 0.0091 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1659633Z _scaled_mm 0.0096 ms 94.2% 2025-12-04T12:10:21.1659880Z triton_mm_72 0.0099 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1660163Z triton_mm_67 0.0109 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1660408Z triton_mm_60 0.0114 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1660654Z triton_mm_54 0.0116 ms 78.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1660912Z triton_mm_59 0.0120 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1661157Z triton_mm_68 0.0121 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1661399Z triton_mm_61 0.0122 ms 74.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1661655Z triton_mm_53 0.0123 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1661803Z SingleProcess AUTOTUNE benchmarking takes 0.2610 seconds and 0.8597 seconds precompiling for 39 choices 2025-12-04T12:10:21.1661878Z =================================== FAILURES =================================== 2025-12-04T12:10:21.1662050Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.1662114Z Traceback (most recent call last): 2025-12-04T12:10:21.1662290Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.1662347Z method(*args, **kwargs) 2025-12-04T12:10:21.1662518Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.1662587Z method(*args, **kwargs) 2025-12-04T12:10:21.1662756Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.1662809Z with policy(): 2025-12-04T12:10:21.1662979Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.1663048Z raise RuntimeError(msg) 2025-12-04T12:10:21.1663465Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. 
CUDA driver allocated memory was 2921332736 and is now 3007315968. 2025-12-04T12:10:21.1663468Z 2025-12-04T12:10:21.1663557Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.1663845Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.1663848Z 2025-12-04T12:10:21.1663954Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.1664047Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1664107Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1664181Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1664741Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1664856Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1664922Z graph_break [] 2025-12-04T12:10:21.1665002Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1665095Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1665591Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.1665660Z current_size = base.storage().size() 2025-12-04T12:10:21.1665719Z Autotune Choices Stats: 2025-12-04T12:10:21.1666114Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009440000168979168, "best_triton_pos": 0} 2025-12-04T12:10:21.1666199Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.1666266Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.1666404Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.1666653Z triton_mm_34 0.0094 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1666712Z _scaled_mm 0.0096 ms 97.9% 2025-12-04T12:10:21.1666957Z triton_mm_33 0.0108 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1667213Z triton_mm_16 0.0110 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1667466Z triton_mm_29 0.0112 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.1667705Z triton_mm_21 0.0118 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1667948Z triton_mm_23 0.0122 ms 77.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1668187Z triton_mm_22 0.0124 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1668428Z triton_mm_30 0.0125 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1668673Z triton_mm_15 0.0126 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1668819Z SingleProcess AUTOTUNE benchmarking takes 0.1756 seconds and 1.5159 seconds precompiling for 33 choices 2025-12-04T12:10:21.1668914Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1668990Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1669065Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1669179Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1669678Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), 
('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1669732Z graph_break [] 2025-12-04T12:10:21.1669811Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1669910Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1669973Z Autotune Choices Stats: 2025-12-04T12:10:21.1670394Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_71", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009080000221729279, "best_triton_pos": 0} 2025-12-04T12:10:21.1670478Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.1670544Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.1670679Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.1670932Z triton_mm_71 0.0091 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1671005Z _scaled_mm 0.0096 ms 94.2% 2025-12-04T12:10:21.1671250Z triton_mm_72 0.0099 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1671502Z triton_mm_67 0.0109 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1671742Z triton_mm_60 0.0114 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1671981Z triton_mm_54 0.0116 ms 78.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1672222Z triton_mm_59 0.0120 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1672462Z triton_mm_68 0.0121 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1672701Z triton_mm_61 0.0122 ms 74.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1672945Z triton_mm_53 0.0123 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1673100Z SingleProcess AUTOTUNE benchmarking takes 0.2610 seconds and 0.8597 seconds precompiling for 39 choices 2025-12-04T12:10:21.1673192Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.1673252Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.1673325Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.1673439Z 
aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.1673944Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.1674001Z graph_break [] 2025-12-04T12:10:21.1674078Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.1674173Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.1674230Z Autotune Choices Stats: 2025-12-04T12:10:21.1674613Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_110", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00872000027447939, "best_triton_pos": 0} 2025-12-04T12:10:21.1674692Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.1674759Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.1674893Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.1675155Z triton_mm_110 0.0087 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1675400Z triton_mm_109 0.0093 ms 93.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1675473Z _scaled_mm 0.0095 ms 92.0% 2025-12-04T12:10:21.1675713Z triton_mm_105 0.0110 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1675953Z triton_mm_92 0.0112 ms 77.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1676195Z triton_mm_97 0.0117 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1676438Z triton_mm_106 0.0119 ms 73.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1676680Z triton_mm_98 0.0121 ms 71.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.1676922Z triton_mm_99 0.0122 ms 71.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1677174Z triton_mm_91 0.0128 ms 68.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.1677320Z SingleProcess AUTOTUNE benchmarking takes 0.2866 seconds and 0.6719 seconds 
precompiling for 39 choices 2025-12-04T12:10:21.1677523Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-448591afe34a5d7d.xml - 2025-12-04T12:10:21.1677601Z =========================== short test summary info ============================ 2025-12-04T12:10:21.1678243Z FAILED [1.7096s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3007315968. 2025-12-04T12:10:21.1678249Z 2025-12-04T12:10:21.1678338Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.1678623Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.1678625Z 2025-12-04T12:10:21.1678727Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.1678806Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.1678891Z ================= 1 failed, 109 deselected, 2 rerun in 11.62s ================== 2025-12-04T12:10:21.1678947Z Got exit code 1 2025-12-04T12:10:21.1679014Z Retrying single test... 
2025-12-04T12:10:21.1679175Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-925efe9ecb873b90.xml 2025-12-04T12:10:21.1679250Z ============================= test session starts ============================== 2025-12-04T12:10:21.1679389Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.1679447Z cachedir: .pytest_cache 2025-12-04T12:10:21.1679621Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.1679684Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.1679743Z configfile: pytest.ini 2025-12-04T12:10:21.1679921Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.1680014Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.1680327Z stepcurrent: skipping 109 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.1680386Z Running 1 items in this shard 2025-12-04T12:10:21.1680389Z 2025-12-04T12:10:21.1680750Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:15:49.994728083 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1680753Z 2025-12-04T12:10:21.1680922Z [W1204 11:15:57.907980653 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1680924Z 2025-12-04T12:10:21.1681253Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1681578Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1681727Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1682223Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1682503Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1682747Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1682970Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1683185Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1683429Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1683678Z E1204 11:15:57.411000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1683922Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1684167Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1684408Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1684641Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1684882Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1685116Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1685355Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1685588Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1685839Z E1204 11:15:57.411000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1686074Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1686317Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1686549Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1686764Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1686999Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1687240Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1687471Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1687676Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1687910Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1688160Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1688410Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1688653Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1688886Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1689102Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1689329Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1689504Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1689697Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1690279Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpi6mgk073/ni/cnin4lnfsblbx6fcfua3ufkr54syid3fvofi5mninn5ugvpfcwvr.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1690452Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1690684Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1690858Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1691162Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1691323Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1691598Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1691754Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1692022Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1692192Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1692476Z E1204 11:15:57.411000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1692648Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1692938Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1693155Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1693487Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1693796Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1693943Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1694438Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1694705Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1694958Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1695180Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1695399Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1695641Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1695889Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1696139Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1696374Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1696616Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1696848Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1697101Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1697334Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1697585Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1697820Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1698062Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1698297Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1698539Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1698775Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1698979Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1699212Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1699462Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1699695Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1699901Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1700245Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1700499Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1700735Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1700979Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1701213Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1701432Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1701671Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1701844Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1702056Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1702173Z E1204 11:15:57.411000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.1702497Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1702806Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1702952Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1703441Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1703705Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1703958Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1704179Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1704394Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1704637Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1704881Z E1204 11:15:57.490000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1705125Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1705357Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1705597Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1705830Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1706072Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1706316Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1706566Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1706798Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1707040Z E1204 11:15:57.490000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1707277Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1707519Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1707844Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1708048Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1708282Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1708536Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1708772Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1708974Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1709207Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1709462Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1709700Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1709940Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1710207Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1710426Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1710664Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1710839Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1711042Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1711582Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpi6mgk073/xl/cxlpsyvwnrk56nfsyrgh7nfpknbn6y5wlvuils5tr5kvs5vihhvw.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1711744Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1711973Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1712145Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1712444Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1712591Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1712862Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1713030Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1713299Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1713471Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1713753Z E1204 11:15:57.490000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1713919Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1714210Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1714416Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1714744Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1715049Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1715205Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1715698Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1715974Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1716215Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1716436Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1716652Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1716894Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1717127Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1717370Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1717612Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1717855Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1718087Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1718339Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1718575Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1718814Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1719050Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1719289Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1719525Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1719775Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1720019Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1720259Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1720493Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1720738Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1720970Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1721174Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1721405Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1721648Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1721896Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1722136Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1722370Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1722585Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1722824Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1722999Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1723192Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1723311Z E1204 11:15:57.490000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.1723481Z [W1204 11:15:57.972209956 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.1723484Z 2025-12-04T12:10:21.1723651Z [W1204 11:15:57.978895438 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1723652Z 2025-12-04T12:10:21.1723975Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1724293Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1724450Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1724938Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1725206Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1725445Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1725668Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1725882Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1726128Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1726381Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1726623Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1726858Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1727098Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1727341Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1727584Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1727821Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1728064Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1728296Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1728551Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1728782Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1729034Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1729269Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1729473Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1729708Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1729948Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1730222Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1730425Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1730660Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1730915Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1731147Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1731387Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1731629Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1731848Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1732073Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1732248Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1732443Z E1204 
11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1732983Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpi6mgk073/65/c65tpzsr225cayjsrgcktcqiksvzthry7o5t3fk3ni7hl7gdse5m.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1733165Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1733406Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1733575Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1733874Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1734024Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1734296Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1734450Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1734719Z E1204 11:15:57.511000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1734888Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1735173Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1735331Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1735619Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1735827Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1736163Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1736471Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1736615Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1737103Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1737370Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1737620Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1737850Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1738064Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1738307Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1738542Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1738787Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1739022Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1739262Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1739495Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1739746Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1739981Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1740262Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1740496Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1740749Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1740983Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1741226Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1741458Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1741663Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1741896Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1742154Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1742413Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1742616Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1742848Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1743090Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1743321Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1743562Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1743792Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1744009Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1744245Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1744420Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1744612Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1744730Z E1204 11:15:57.511000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.1745062Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1745368Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1745513Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1745998Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1746265Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1746512Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1746764Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1746978Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1747219Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1747455Z E1204 11:15:57.518000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1747696Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1747931Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1748170Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1748404Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1748662Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1748893Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1749135Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1749365Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1749615Z E1204 11:15:57.518000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1749852Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1750119Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1750353Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1750555Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1750790Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1751042Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1751287Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1751490Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1751721Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1751963Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1752197Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1752439Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1752670Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1752887Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1753123Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1753295Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1753488Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1754038Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpi6mgk073/yg/cyggx77l4i7zrf7ivjowg6o77wwwtockbmpdrehaz4wylhwiljfg.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1754201Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1754428Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1754599Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1754900Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1755046Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1755317Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1755480Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1755757Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1755926Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1756208Z E1204 11:15:57.518000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1756359Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1756650Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1756857Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1757185Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1757493Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1757646Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1758135Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1758401Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1758651Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1758874Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1759088Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1759334Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1759572Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1759830Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1760063Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1760349Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1760583Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1760825Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1761062Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1761303Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1761537Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1761778Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1762013Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1762269Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1762502Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1762706Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1762940Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1763191Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1763426Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1763629Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1763865Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1764105Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1764355Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1764601Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1764844Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1765061Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1765286Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1765463Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1765655Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1765775Z E1204 11:15:57.518000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.1765948Z [W1204 11:15:57.980877533 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.1765950Z 2025-12-04T12:10:21.1766273Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1766589Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1766734Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1767224Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1767497Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1767736Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1767959Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1768173Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1768414Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1768648Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1768902Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1769149Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1769393Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1769629Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1769872Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1770146Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1770390Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1770623Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1770866Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1771113Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1771357Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1771591Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1771797Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1772042Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1772283Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1772519Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1772721Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1772955Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1773208Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1773440Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1773693Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1773925Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1774146Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1774371Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1774547Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1774740Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1775276Z E1204 11:15:57.520000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpi6mgk073/wn/cwnqq7dy3yhlz2ui4gxvx23k3somra6vvymz23td3natl7ln76kt.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1775439Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1775676Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1775848Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1776146Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1776294Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1776583Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1776737Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1777005Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1777174Z E1204 11:15:57.520000 
852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1777458Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1777617Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1777908Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1778127Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1778456Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1778763Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1778908Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1779398Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1779662Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1779906Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1780174Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1780390Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1780632Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1780876Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1781121Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1781353Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1781599Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1781832Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1782074Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1782327Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1782567Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1782812Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1783053Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1783289Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1783532Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1783766Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1783973Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1784208Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1784460Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1784695Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1784901Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1785134Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1785383Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1785619Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1785963Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1786199Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1786417Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1786646Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1786833Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1787026Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1787155Z E1204 11:15:57.520000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.1787328Z [W1204 11:15:57.987127181 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.1787330Z 2025-12-04T12:10:21.1787652Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1787957Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1788102Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1788591Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1788860Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1789111Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1789330Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1789544Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1789787Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1790032Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1790297Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1790530Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1790771Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1791003Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1791262Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1791494Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1791747Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1791980Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1792221Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1792456Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1792696Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1792929Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1793135Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1793368Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1793632Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1793866Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1794069Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1794314Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1794558Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1794790Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1795032Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1795265Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1795482Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1795719Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1795892Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1796096Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1796636Z E1204 11:15:57.526000 852494 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpi6mgk073/gd/cgdjqihwb6wei5vogddrxsr3uj6k3mkwcpwmeop433zt6scy7oby.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.1796799Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.1797027Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.1797196Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.1797497Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.1797646Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.1797926Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.1798085Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.1798352Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.1798525Z E1204 11:15:57.526000 
852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.1798816Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.1798968Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.1799256Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.1799462Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.1799788Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1800124Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1800281Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1800767Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1801050Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1801292Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1801514Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1801729Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1801970Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1802206Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1802460Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1802692Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1802935Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1803167Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1803421Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1803656Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1803898Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1804134Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1804375Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1804610Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1804859Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1805104Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1805307Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1805541Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1805788Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1806023Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1806228Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1806459Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1806701Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1806943Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1807186Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1807421Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1807638Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.1807873Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.1808050Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.1808243Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.1808361Z E1204 11:15:57.526000 852494 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.1808433Z ('RERUN', {'yellow': True}) [38.3878s] [100%] 2025-12-04T12:10:21.1808801Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:15:59.988075121 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1808805Z 2025-12-04T12:10:21.1808975Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1809284Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.1809604Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1809748Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1810273Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1810540Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1810782Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1811002Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1811219Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1811475Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1811711Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1811953Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1812185Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1812442Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1812676Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1812919Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1813151Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1813395Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1813641Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1813853Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1814088Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1814303Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1814547Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1814781Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1814989Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1815225Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame 
from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1815465Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1815702Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1815916Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1816150Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1816361Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1816564Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1816811Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1817024Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1817227Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1817459Z E1204 
11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1817701Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1817934Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1818191Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1818434Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1818645Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1818870Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1819087Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1819327Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1819558Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1819802Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1820037Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1820330Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1820566Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1820805Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1821037Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1821289Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1821525Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1821770Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1822001Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1822247Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1822494Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1822735Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1822981Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1823221Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1823457Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1823698Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1823934Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1824174Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1824408Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1824626Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1824846Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.1825090Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1825323Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1825578Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1825813Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1826056Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1826292Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1826534Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1826767Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1827023Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1827266Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1827506Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1827740Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1827956Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1828160Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1828396Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1828606Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1828830Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1829044Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1829296Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1829531Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1829740Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1829947Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1830239Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1830454Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1830657Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1830889Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1831132Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1831377Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1831619Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1831863Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1832076Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1832300Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.1832517Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1832766Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1833001Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1833219Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1833433Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1833662Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1833905Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1834142Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1834384Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1834627Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1834873Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1835109Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1835351Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1835585Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1835839Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1836074Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1836296Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1836502Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1836736Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1836956Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1837168Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1837385Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1837631Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1837867Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1838121Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1838355Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1838599Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1838833Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1839085Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1839321Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1839563Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1839799Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1840017Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1840276Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1840482Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1840723Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1840940Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1841183Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1841419Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1841636Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1841851Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1842066Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1842312Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1842560Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1842801Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1843036Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1843278Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1843533Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1843780Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1844016Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1844258Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1844494Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1844757Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1844992Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1845245Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1845483Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1845726Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1845965Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1846206Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1846440Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1846681Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1846918Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1847144Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1847351Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1847588Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1847840Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1848082Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1848323Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1848559Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1848801Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1849035Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1849288Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1849534Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1849780Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1850013Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1850258Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1850497Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1850741Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1850976Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1851220Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1851469Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1851698Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1851916Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1852133Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1852359Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1852605Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1852839Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1853069Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1853288Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1853502Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1853730Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1853972Z E1204 
11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1854220Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1854438Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1854654Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1854861Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1855026Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.1855262Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1855465Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1855702Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1855953Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1856190Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1856401Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1856606Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1856853Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1857067Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1857272Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1857505Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1857708Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1857943Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1858161Z E1204 11:15:59.541000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1858405Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1858607Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1858841Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1859083Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1859324Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1859567Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1859800Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1860042Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1860334Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1860577Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1860810Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1861053Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1861298Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1861513Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1861720Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1861954Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1862182Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1862399Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1862625Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1862852Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1863093Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1863327Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1863570Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1863805Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1864011Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1864248Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1864491Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1864735Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1864977Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1865211Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1865439Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1865665Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1865883Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1866090Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1866293Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1866528Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1866770Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1867015Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1867264Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1867498Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1867702Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1867935Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1868178Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1868412Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1868653Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1868889Z 
E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1869103Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1869337Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1869578Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1869811Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1870061Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1870331Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1870561Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1870779Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1870993Z E1204 11:15:59.541000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1871210Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1871467Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1871715Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1871919Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1872152Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1872395Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1872630Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1872873Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1873107Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1873334Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1873571Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1873785Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1874002Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1874243Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1874488Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1874731Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1874964Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1875207Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1875440Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1875719Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1875968Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1876230Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1876464Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1876706Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1876943Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1877169Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1877386Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1877599Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1877814Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1878067Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1878301Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1878544Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1878781Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1879031Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1879267Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1879509Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1879743Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1879983Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1882658Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1882876Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.1883114Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.1883320Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.1883532Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.1883765Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.1883987Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.1884202Z E1204 
11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.1884411Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.1884619Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.1884807Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.1884962Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:21.1885129Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.1885248Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.1885391Z E1204 11:15:59.541000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1885562Z [W1204 11:15:59.006444491 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1885565Z 2025-12-04T12:10:21.1885743Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1886060Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1886375Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1886523Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1887018Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1887303Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1887551Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1887773Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1887988Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1888233Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1888471Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1888715Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1888950Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1889194Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1889439Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1889681Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1889913Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1890199Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1890448Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1890662Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1890888Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1891102Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1891345Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1891590Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1891794Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1892038Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1892280Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1892514Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1892719Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1892953Z E1204 
11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1893168Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1893372Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1893604Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1893832Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1894036Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1894267Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1894508Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1894748Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1894992Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1895223Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1895436Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1895660Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1895875Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1896133Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1896374Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1896616Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1896848Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1897092Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1897324Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1897563Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1897796Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1898039Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1898282Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1898521Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1898754Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1898995Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1899235Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1899478Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1899712Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1899953Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1900233Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1900488Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1900722Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1900973Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1901205Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1901421Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1901635Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.1901875Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1902110Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1902350Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1902581Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1902835Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1903070Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1903311Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1903552Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1903797Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1904031Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1904271Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1904502Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1904713Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1904926Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1905156Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1905380Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1905601Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.1905815Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1906058Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1906292Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1906504Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1906707Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1906942Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1907165Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1907367Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1907601Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1907844Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1908088Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1908329Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1908563Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1908774Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1908994Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1909210Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1909464Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1909709Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1909924Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1910171Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1910389Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1910630Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1910868Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1911109Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1911344Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1911598Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1911833Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1912076Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1912309Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1912571Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1912808Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1913023Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1913229Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1913462Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1913680Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1913905Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1914132Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1914373Z 
E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1914610Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1914854Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1915089Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1915331Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1915564Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1915808Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1916050Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1916294Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1916529Z 
E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1916744Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1916969Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1917177Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1917404Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1917618Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1917860Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1918095Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1918322Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1918545Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1918758Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1919001Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1919235Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1919476Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1919713Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1919956Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1920235Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1920488Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1920725Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1920966Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1921203Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1921455Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1921690Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1921934Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1922169Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1922416Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1922664Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1922909Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1923157Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1923397Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1923632Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1923846Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1924053Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1924288Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1924530Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1924769Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1925019Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1925256Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1925495Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1925739Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1925983Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1926216Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1926459Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1926692Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1926898Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1927145Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1927387Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1927631Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1927873Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1928109Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1928338Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1928558Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1928771Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1928987Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1929232Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1929482Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1929711Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1929927Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1930195Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1930411Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1930653Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1930889Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1931106Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1931318Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.1931536Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1931700Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.1931949Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1932156Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1932392Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1932636Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1932869Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1933081Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1933286Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1933520Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1933748Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1933953Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1934188Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1934397Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1934641Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1934848Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1935080Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1935285Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1935518Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1935760Z E1204 11:15:59.545000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1936005Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1936258Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1936492Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1936734Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1936968Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1937211Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1937444Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1937685Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1937919Z E1204 11:15:59.545000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1938142Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1938347Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1938583Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1938811Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1939038Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1939255Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1939469Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1939711Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1939944Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1940237Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1940486Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1940701Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1940934Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1941174Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1941408Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1941653Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1941889Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1942116Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1942332Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1942559Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1942765Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1942971Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1943205Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1943460Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1943697Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1943938Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1944173Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.1944376Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1944610Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1944864Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1945118Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1945359Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1945592Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1945797Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1946031Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1946274Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1946511Z E1204 11:15:59.545000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1946752Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1946998Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1947225Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1947442Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1947654Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1947880Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1948124Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1948357Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1948562Z E1204 11:15:59.545000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1948796Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1949040Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1949289Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1949540Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1949773Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1949999Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1950250Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1950465Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1950681Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1950922Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1951159Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1951418Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1951652Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1951894Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1952127Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1952343Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1952578Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1952819Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1953054Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1953294Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1953530Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1953771Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.1954001Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1954214Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1954430Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1954672Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1954908Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1955150Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1955382Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1955624Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1955871Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1956115Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1956351Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1956590Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1956835Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1957048Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.1957264Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.1957467Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.1957677Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.1957908Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.1958141Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.1958370Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.1958576Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.1958785Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.1958972Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.1959115Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.1959275Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.1959395Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.1959538Z E1204 11:15:59.545000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.1959709Z [W1204 11:15:59.008845070 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.1959712Z 2025-12-04T12:10:21.1959873Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.1960233Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.1960541Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.1960689Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.1961197Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.1961468Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.1961710Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.1961932Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.1962145Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1962391Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1962646Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1962899Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1963134Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1963380Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1963616Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1963856Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1964090Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1964331Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1964650Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1964877Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1965102Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1965315Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1965555Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1965801Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1966008Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1966241Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1966483Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1966714Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1966928Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1967160Z E1204 
11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1967382Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1967584Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1967815Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1968027Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1968230Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1968464Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1968704Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1968937Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1969190Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1969422Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1969635Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1969856Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1970080Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1970368Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1970601Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1970844Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1971076Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1971317Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1971563Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1971817Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1972051Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1972292Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1972526Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1972766Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1973001Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1973243Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1973478Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1973733Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1973965Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1974206Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1974436Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1974688Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1974922Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1975164Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1975397Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1975614Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1975836Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.1976076Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1976318Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1976558Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1976790Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1977031Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1977263Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1977505Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1977737Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1977980Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1978221Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1978463Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1978696Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1978906Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1979124Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1979356Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1979568Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1979788Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.1980003Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1980297Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1980530Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1980755Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1980956Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1981189Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1981400Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1981602Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1981835Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1982075Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1982310Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1982561Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1982795Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1983007Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1983229Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1983454Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1983701Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1983936Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1984151Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1984366Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1984580Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1984837Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1985082Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1985326Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1985559Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1985803Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1986037Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1986279Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1986514Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1986756Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1987001Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1987216Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1987422Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1987660Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1987886Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1988102Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1988318Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1988560Z 
E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1988794Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1989036Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1989285Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1989537Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1989774Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1990021Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1990309Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1990551Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1990784Z 
E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1991000Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1991215Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1991435Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.1991661Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.1991875Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1992119Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1992366Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1992586Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.1992800Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.1993015Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.1993258Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.1993492Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1993747Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1993994Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1994236Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1994472Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1994713Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1994951Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1995192Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1995426Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1995666Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1995916Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1996160Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1996394Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1996637Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1996881Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1997129Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1997365Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1997610Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1997846Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1998071Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.1998276Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.1998523Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1998770Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1999005Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1999251Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1999486Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.1999730Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.1999967Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2000248Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2000497Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2000741Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2000974Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2001197Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2001434Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2001676Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2001911Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2002157Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2002393Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2002634Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2002851Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2003076Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2003293Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2003535Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2003771Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2004000Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2004216Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2004430Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2004649Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2004903Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2005139Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2005355Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2005571Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.2005790Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2005956Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2006192Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2006399Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2006633Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2006880Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2007129Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2007355Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2007559Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2007793Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2008008Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2008214Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2008451Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2008658Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2008895Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2009101Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2009349Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2009557Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2009791Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2010045Z E1204 11:15:59.548000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2010316Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2010558Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2010792Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2011033Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2011271Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2011527Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2011785Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2012029Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2012262Z E1204 11:15:59.548000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2012475Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2012679Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2012914Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2013140Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2013360Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2013575Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2013803Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2014048Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2014284Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2014543Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2014778Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2014984Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2015219Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2015461Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2015696Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2015952Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2016199Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2016426Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2016646Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2016862Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2017070Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2017277Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2017511Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2017753Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2017987Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2018239Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2018475Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2018679Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2018921Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2019167Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2019401Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2019643Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2019880Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2020085Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2020385Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2020641Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2020873Z E1204 11:15:59.548000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2021116Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2021349Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2021584Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2021802Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2022015Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2022230Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2022471Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2022717Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2022923Z E1204 11:15:59.548000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2023158Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2023411Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2023646Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2023889Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2024124Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2024353Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2024572Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2024796Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2025011Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2025262Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2025497Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2025737Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2025974Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2026217Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2026451Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2026657Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2026900Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2027144Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2027379Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2027625Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2027871Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2028100Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2028317Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2028530Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2028746Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2028990Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2029241Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2029493Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2029725Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2029968Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2030241Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2030486Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2030722Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2030966Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2031202Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2031430Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2031646Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2031852Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2032064Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2032304Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2032529Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2032741Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2032947Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2033155Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2033344Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2033499Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.2033661Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2033803Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2033943Z E1204 11:15:59.548000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2034119Z [W1204 11:15:59.050832533 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2034121Z 2025-12-04T12:10:21.2034280Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2034589Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2034898Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2035044Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2035536Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2035823Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2036065Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2036289Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2036502Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2036756Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2036992Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2037237Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2037473Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2037714Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2037947Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2038205Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2038448Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2038689Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2038926Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2039140Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2039362Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2039580Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2039820Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2040057Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2040311Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2040547Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2040790Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2041027Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2041243Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2041479Z E1204 
11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2041691Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2041892Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2042125Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2042338Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2042552Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2042797Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2043037Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2043273Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2043516Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2043752Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2043966Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2044188Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2044402Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2044654Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2044887Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2045128Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2045362Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2045621Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2045858Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2046098Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2046330Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2046572Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2046803Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2047055Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2047297Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2047537Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2047771Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2048012Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2048247Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2048489Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2048721Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2048961Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2049203Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2049446Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2049678Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2049894Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2050156Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2050399Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2050634Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2050874Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2051107Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2051362Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2051595Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2051849Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2052081Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2052322Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2052554Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2052796Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2053030Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2053244Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2053448Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2053695Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2053907Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2054128Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2054343Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2054592Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2054827Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2055039Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2055240Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2055474Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2055684Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2055897Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2056138Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2056380Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2056612Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2056852Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2057086Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2057296Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2057518Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2057733Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2057991Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2058226Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2058442Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2058655Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2058879Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2059124Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2059357Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2059601Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2059835Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2060077Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2060361Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2060616Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2060851Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2061095Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2061330Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2061543Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2061748Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2061983Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2062200Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2062431Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2062646Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2062890Z 
E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2063124Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2063378Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2063614Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2063855Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2064092Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2064333Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2064569Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2064823Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2065067Z 
E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2065284Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2065497Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2065705Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2065930Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2066149Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2066392Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2066627Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2066853Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2067066Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2067283Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2067525Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2067769Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2068016Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2068249Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2068491Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2068723Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2068964Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2069215Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2069465Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2069698Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2069940Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2070213Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2070454Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2070690Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2070931Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2071165Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2071420Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2071655Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2071896Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2072129Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2072355Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2072563Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2072798Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2073040Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2073273Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2073528Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2073761Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2074016Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2074250Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2074491Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2074727Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2074969Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2075205Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2075409Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2075645Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2075899Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2076134Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2076377Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2076619Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2076848Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2077066Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2077282Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2077497Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2077739Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2077985Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2078213Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2078445Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2078658Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2078874Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2079116Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2079350Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2079574Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2079788Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.2079997Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2080207Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2080446Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2080652Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2080885Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2081142Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2081380Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2081592Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2081798Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2082038Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2082255Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2082471Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2082707Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2082925Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2083160Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2083365Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2083602Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2083807Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2084043Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2084285Z E1204 11:15:59.590000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2084521Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2084777Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2085010Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2085252Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2085497Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2085740Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2085974Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2086216Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2086451Z E1204 11:15:59.590000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2086666Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2086881Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2087115Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2087351Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2087568Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2087780Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2087998Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2088242Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2088477Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2088717Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2088951Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2089167Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2089403Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2089647Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2089892Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2090172Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2090407Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2090635Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2090852Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2091065Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2091290Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2091496Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2091743Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2091984Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2092218Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2092463Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2092699Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2092904Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2093139Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2093381Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2093627Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2093871Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2094108Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2094323Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2094559Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2094802Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2095036Z E1204 11:15:59.590000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2095277Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2095510Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2095754Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2095969Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2096193Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2096408Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2096652Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2096889Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2097095Z E1204 11:15:59.590000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2097330Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2097571Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2097807Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2098059Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2098293Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2098520Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2098750Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2098967Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2099181Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2099424Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2099658Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2099900Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2100178Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2100419Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2100668Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2100871Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2101107Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2101353Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2101587Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2101831Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2102066Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2102308Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2102526Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2102740Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2102956Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2103210Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2103447Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2103694Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2103930Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2104174Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2104412Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2104669Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2104914Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2105157Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2105389Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2105603Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2105826Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2106032Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2106242Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2106469Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2106691Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2106911Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2107119Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2107325Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2107510Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2107662Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.2107823Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2107946Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2108088Z E1204 11:15:59.590000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2108263Z [W1204 11:15:59.052927996 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2108265Z 2025-12-04T12:10:21.2108427Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2108737Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2109057Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2109214Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2109705Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2109971Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2110247Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2110469Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2110684Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2110933Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2111182Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2111425Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2111659Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2111901Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2112152Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2112394Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2112629Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2112873Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2113106Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2113320Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2113557Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2113785Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2114025Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2114260Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2114465Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2114698Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2114940Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2115173Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2115377Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2115619Z E1204 
11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2115832Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2116034Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2116266Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2116486Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2116694Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2116929Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2117171Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2117403Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2117644Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2117888Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2118112Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2118334Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2118551Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2118791Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2119025Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2119264Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2119498Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2119738Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2119983Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2120260Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2120495Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2120739Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2120988Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2121231Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2121464Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2121705Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2121936Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2122176Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2122423Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2122676Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2122913Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2123156Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2123393Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2123635Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2123867Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2124086Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2124296Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2124554Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2124785Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2125027Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2125262Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2125513Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2125747Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2125986Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2126220Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2126464Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2126706Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2126949Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2127190Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2127403Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2127606Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2127844Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2128056Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2128278Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2128492Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2128733Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2128985Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2129196Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2129401Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2129634Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2129853Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2130059Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2130335Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2130578Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2130810Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2131053Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2131300Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2131521Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2131743Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2131956Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2132202Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2132438Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2132658Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2132874Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2133088Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2133333Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2133579Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2133822Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2134056Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2134310Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2134549Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2134790Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2135026Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2135271Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2135510Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2135733Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2135948Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2136184Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2136399Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2136613Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2136829Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2137072Z 
E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2137306Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2137553Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2137799Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2138042Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2138278Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2138518Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2138763Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2139007Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2139243Z 
E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2139463Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2139676Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2139889Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2140166Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2140395Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2140635Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2140872Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2141090Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2141304Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2141520Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2141762Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2141997Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2142255Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2142494Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2142736Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2142969Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2143223Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2143459Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2143701Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2143936Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2144179Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2144415Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2144671Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2144924Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2145164Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2145400Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2145645Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2145879Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2146121Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2146355Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2146571Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2146785Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2147023Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2147267Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2147501Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2147752Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2147989Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2148232Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2148465Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2148707Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2148951Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2149192Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2149437Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2149644Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2149881Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2150160Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2150395Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2150638Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2150869Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2151100Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2151330Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2151544Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2151758Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2152001Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2152253Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2152484Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2152701Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2152912Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2153127Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2153369Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2153618Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2153846Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2154058Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.2154266Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2154429Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2154667Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2154871Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2155107Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2155350Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2155598Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2155811Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2156016Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2156253Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2156473Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2156680Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2156921Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2157124Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2157361Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2157566Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2157812Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2158016Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2158263Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2158508Z E1204 11:15:59.592000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2158741Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2158986Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2159225Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2159470Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2159703Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2159946Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2160238Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2160481Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2160715Z E1204 11:15:59.592000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2160941Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2161151Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2161384Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2161615Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2161836Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2162050Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2162286Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2162528Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2162776Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2163018Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2163254Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2163462Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2163695Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2163938Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2164173Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2164417Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2164660Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2164890Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2165107Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2165320Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2165538Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2165743Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2165982Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2166225Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2166462Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2166718Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2166951Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2167167Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2167400Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2167645Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2167882Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2168124Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2168358Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2168562Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2168798Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2169051Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2169287Z E1204 11:15:59.592000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2169529Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2169772Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2170005Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2170257Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2170475Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2170690Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2170938Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2171187Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2171394Z E1204 11:15:59.592000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2171643Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2171884Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2172123Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2172365Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2172602Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2172833Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2173049Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2173265Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2173492Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2173741Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2173977Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2174221Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2174470Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2174712Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2174947Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2175150Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2175386Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2175638Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2175873Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2176151Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2176387Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2176617Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2176835Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2177051Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2177266Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2177511Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2177746Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2177996Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2178232Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2178472Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2178725Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2178968Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2179203Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2179448Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2179681Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2179897Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2180163Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2180369Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2180592Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2180821Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2181047Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2181263Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2181470Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2181677Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2181863Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2182004Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.2182165Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2182297Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2182437Z E1204 11:15:59.592000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2182610Z [W1204 11:15:59.054972649 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2182612Z 2025-12-04T12:10:21.2182773Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2183097Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2183405Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2183553Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2184050Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2186154Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2186416Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2186649Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2186865Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2187106Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2187342Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2187586Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2187820Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2188063Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2188296Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2188548Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2188780Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2189023Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2189258Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2189481Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2189706Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2189920Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2190203Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2190439Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2190646Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2190895Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2191147Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2191380Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2191583Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2191818Z E1204 
11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2192030Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2192234Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2192467Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2192677Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2192882Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2193128Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2193371Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2193602Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2193856Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2194091Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2194301Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2194527Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2194739Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2194981Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2195229Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2195472Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2195716Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2195955Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2196188Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2196430Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2196663Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2196903Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2197134Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2197393Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2197625Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2197867Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2198097Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2198348Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2198584Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2198824Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2199057Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2199297Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2199530Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2199784Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2200030Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2200289Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2200499Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2200742Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2200978Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2201219Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2201451Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2201694Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2201943Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2202183Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2202417Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2202659Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2202903Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2203145Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2203379Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2203591Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2203793Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2204026Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2204248Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2204485Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2204698Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2204943Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2205178Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2205389Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2205593Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2205824Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2206035Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2206237Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2206481Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2206724Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2206955Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2207206Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2207441Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2207653Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2207874Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2208088Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2208334Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2208578Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2208795Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2209017Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2209232Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2209476Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2209716Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2209962Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2210231Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2210474Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2210709Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2210965Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2211200Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2211442Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2211687Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2211903Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2212110Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2212345Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2212562Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2212775Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2213005Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2213247Z 
E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2213499Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2213741Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2213975Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2214219Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2214456Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2214702Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2214937Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2215189Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2215424Z 
E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2215640Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2215855Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2216071Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2216297Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2216512Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2216754Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2216993Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2217209Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2217435Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2217649Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2217900Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2218134Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2218376Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2218612Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2218854Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2219088Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2219333Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2219577Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2219820Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2220054Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2220337Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2220584Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2220829Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2221068Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2221310Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2221545Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2221789Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2222038Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2222291Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2222525Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2222739Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2222945Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2223179Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2223422Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2223658Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2223900Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2224153Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2224398Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2224633Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2224874Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2225119Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2225362Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2225598Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2225803Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2226036Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2226278Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2226527Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2226782Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2227015Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2227245Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2227462Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2227674Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2227890Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2228132Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2228366Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2228605Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2228823Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2229038Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2229253Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2229505Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2229745Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2229960Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2230210Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.2230417Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2230580Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2230834Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2231052Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2231287Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2231531Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2231767Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2231983Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2232187Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2232421Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2232632Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2232836Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2233082Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2233288Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2233521Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2233727Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2233979Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2234184Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2234420Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2234661Z E1204 11:15:59.594000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2234895Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2235150Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2235383Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2235637Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2235870Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2236113Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2236351Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2236595Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2236829Z E1204 11:15:59.594000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2237042Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2237248Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2237491Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2237722Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2237938Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2238152Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2238378Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2238624Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2238863Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2239104Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2239338Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2239552Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2239785Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2240037Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2240314Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2240557Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2240793Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2241023Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2241241Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2241455Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2241662Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2241878Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2242115Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2242355Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2242590Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2242844Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2243080Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2243287Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2243523Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2243768Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2244015Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2244257Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2244502Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2244707Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2244944Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2245186Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2245421Z E1204 11:15:59.594000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2245663Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2245899Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2246127Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2246354Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2246570Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2246784Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2247027Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2247276Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2247484Z E1204 11:15:59.594000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2247717Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2247960Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2248195Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2248451Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2248685Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2248924Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2249142Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2249355Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2249573Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2249815Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2250048Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2250330Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2250564Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2250821Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2251057Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2251262Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2251497Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2251750Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2251987Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2252228Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2252464Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2252692Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2252921Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2253135Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2253368Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2253611Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2253846Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2254095Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2254328Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2254570Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2254804Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2255046Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2255290Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2255532Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2255771Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2255992Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2256210Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2256420Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2256631Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2256860Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2257080Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2257305Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2257511Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2257726Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2257913Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2258055Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.2258216Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2258339Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2258481Z E1204 11:15:59.594000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2258557Z ('RERUN', {'yellow': True}) [1.7974s] [100%] 2025-12-04T12:10:21.2258923Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:16:01.599968290 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2258926Z 2025-12-04T12:10:21.2259087Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2259394Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2259713Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2259859Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2260397Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2260668Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2260909Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2261130Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2261344Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2261588Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2261834Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2262077Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2262321Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2262563Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2262797Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2263041Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2263280Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2263519Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2263753Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2263983Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2264205Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2264421Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2264662Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2264903Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2265108Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2265341Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2265584Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2265816Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2266019Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2266262Z E1204 
11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2266473Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2266684Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2266918Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2267131Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2267335Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2267569Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2267812Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2268046Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2268288Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2268531Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2268745Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2268966Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2269182Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2269432Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2269668Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2269910Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2270187Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2270432Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2270678Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2270918Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2271162Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2271403Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2271635Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2271879Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2272112Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2272353Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2272586Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2272840Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2273074Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2273314Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2273546Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2273798Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2274032Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2274274Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2274506Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2274723Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2274935Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2275189Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2275434Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2275674Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2275906Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2276147Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2276382Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2276624Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2276855Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2277096Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2277337Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2277579Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2277812Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2278024Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2278237Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2278472Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2278682Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2278905Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2279120Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2279360Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2279609Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2279839Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2280043Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2280309Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2280520Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2280723Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2280957Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2281199Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2281433Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2281675Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2281919Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2282131Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2282354Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2282569Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2282828Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2283063Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2283280Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2283493Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2283708Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2283964Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2284198Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2284452Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2284686Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2284932Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2285168Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2285411Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2285644Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2285886Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2286123Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2286347Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2286553Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2286787Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2287016Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2287233Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2287450Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2287693Z 
E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2287928Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2288170Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2288417Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2288659Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2288903Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2289144Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2289379Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2289624Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2289861Z 
E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2290079Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2290323Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2290531Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2290767Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2290985Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2291227Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2291472Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2291691Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2291903Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2292123Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2292365Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2292601Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2292856Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2293090Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2293343Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2293577Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2293820Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2294056Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2294297Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2294534Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2294776Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2295021Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2295262Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2295499Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2295743Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2295986Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2296230Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2296464Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2296706Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2296943Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2297158Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2297391Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2297638Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2297880Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2298114Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2298357Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2298592Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2298835Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2299068Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2299311Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2299559Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2299800Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2300035Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2300272Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2300519Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2300764Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2300998Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2301240Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2301475Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2301705Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2301937Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2302166Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2302383Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2302624Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2302861Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2303087Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2303305Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2303517Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2303734Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2303992Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2304231Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2304449Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2304661Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.2304878Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2305043Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2305280Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2305486Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2305720Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2305965Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2306213Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2306428Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2306642Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2306878Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2307090Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2307295Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2307531Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2307736Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2307974Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2308179Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2308425Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2308634Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2308869Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2309114Z E1204 11:16:01.139000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2309358Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2309605Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2309839Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2310084Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2310362Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2310617Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2310854Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2311108Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2311343Z E1204 11:16:01.139000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2311558Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2311763Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2311998Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2312225Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2312442Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2312657Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2312887Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2313130Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2313365Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2313606Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2313860Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2314069Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2314303Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2314547Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2314783Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2315035Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2315269Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2315508Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2315725Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2315938Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2316149Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2316358Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2316595Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2316838Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2317072Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2317324Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2317559Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2317764Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2317999Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2318252Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2318490Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2318733Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2318968Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2319173Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2319421Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2319664Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2319911Z E1204 11:16:01.139000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2320189Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2320423Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2320654Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2320871Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2321087Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2321305Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2321548Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2321802Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2322007Z E1204 11:16:01.139000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2322243Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2322484Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2322730Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2322977Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2323211Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2323441Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2323661Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2323888Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2324103Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2324358Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2324594Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2324835Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2325071Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2325312Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2325549Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2325753Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2325992Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2326246Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2326480Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2326722Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2326955Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2327191Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2327410Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2327623Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2327838Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2328082Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2328331Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2328573Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2328819Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2329060Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2329295Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2329538Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2329772Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2330014Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2330285Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2330501Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2330742Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2330951Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2331165Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2331395Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2331628Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2331844Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2332054Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2332263Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2332448Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2332590Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.2332765Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2332891Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2333046Z E1204 11:16:01.139000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2333220Z [W1204 11:16:01.602459648 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2333223Z 2025-12-04T12:10:21.2333382Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2333691Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2334000Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2334146Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2334639Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2334907Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2335158Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2335379Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2335596Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2335852Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2336089Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2336332Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2336566Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2336810Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2337044Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2337297Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2337532Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2337782Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2338016Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2338231Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2338455Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2338670Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2338910Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2339143Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2339346Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2339592Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2339834Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2340067Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2340317Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2340554Z E1204 
11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2340769Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2340973Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2341206Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2341416Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2341634Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2341868Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2342121Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2342353Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2342594Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2342833Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2343047Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2343271Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2343488Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2343729Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2343976Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2344218Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2344450Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2344691Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2344934Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2345178Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2345414Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2345656Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2345890Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2346143Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2346375Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2346632Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2346865Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2347107Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2347342Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2347582Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2347816Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2348057Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2348291Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2348542Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2348776Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2348993Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2349214Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2349458Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2349691Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2349934Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2350199Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2350443Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2350690Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2350942Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2351175Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2351414Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2351647Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2351890Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2352123Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2352337Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2352541Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2352789Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2353004Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2353226Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2353441Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2353693Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2353929Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2354139Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2354342Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2354575Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2354786Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2355002Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2355237Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2355492Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2355723Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2355964Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2356198Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2356409Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2356631Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2356846Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2357092Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2357339Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2357559Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2357775Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2357989Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2358242Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2358478Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2358722Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2358955Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2359198Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2359443Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2359684Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2359935Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2360224Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2360460Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2360676Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2360882Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2361117Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2361333Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2361547Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2361773Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2362017Z 
E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2362251Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2362495Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2362742Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2362985Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2363221Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2363463Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2363700Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2363963Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2364198Z 
E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2364429Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2364641Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2364852Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2365078Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2365295Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2365538Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2365772Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2365989Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2366211Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2366428Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2366669Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2366903Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2367154Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2367394Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2367637Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2367871Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2368114Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2368359Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2368605Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2368847Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2369091Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2369328Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2369574Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2369812Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2370053Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2370324Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2370580Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2370815Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2371060Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2371295Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2371522Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2371729Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2371964Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2372215Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2372450Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2372695Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2372941Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2373195Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2373427Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2373669Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2373904Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2374146Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2374381Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2374587Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2374822Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2375073Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2375399Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2375644Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2375880Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2376117Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2376337Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2376552Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2376766Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2377017Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2377254Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2377494Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2377723Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2377935Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2378153Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2378397Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2378634Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2378851Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2379065Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.2379275Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2379441Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2379690Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2379897Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2380177Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2380419Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2380673Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2380887Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2381093Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2381329Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2381541Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2381758Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2381996Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2382213Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2382448Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2382651Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2382888Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2383091Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2383327Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2383569Z E1204 11:16:01.141000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2383804Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2384059Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2384297Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2384546Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2384779Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2385033Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2385270Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2385513Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2385746Z E1204 11:16:01.141000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2385959Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2386175Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2386409Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2386655Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2386872Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2387085Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2387303Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2387545Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2387784Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2388026Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2388262Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2388478Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2388712Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2388957Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2389193Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2389444Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2389681Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2389908Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2390162Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2390375Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2390582Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2390801Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2391048Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2391290Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2391526Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2391772Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2392005Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2392210Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2392442Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2392687Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2392934Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2393174Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2393411Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2393615Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2393865Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2394111Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2394347Z E1204 11:16:01.141000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2394588Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2394821Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2395059Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2395275Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2395500Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2395715Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2395958Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2396197Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2396403Z E1204 11:16:01.141000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2396641Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2396882Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2397117Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2397376Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2397611Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2397841Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2398058Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2398280Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2398497Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2398743Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2398978Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2399222Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2399457Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2399708Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2399953Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2400197Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2400432Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2400675Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2400912Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2401159Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2401396Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2401624Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2401853Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2402072Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2402288Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2402533Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2402780Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2403023Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2403261Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2403502Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2403741Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2403998Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2404232Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2404486Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2404719Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2404932Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2405150Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2405356Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2405567Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2405794Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2406019Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2406242Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2406451Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2406659Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2406846Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2406988Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.2407160Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2407285Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2407425Z E1204 11:16:01.141000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2407598Z [W1204 11:16:01.604647529 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2407600Z 2025-12-04T12:10:21.2407760Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2408069Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2408387Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2408533Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2409038Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2409307Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2409548Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2409769Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2409984Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2410260Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2410498Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2410754Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2410990Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2411234Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2411478Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2411723Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2411956Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2412197Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2412430Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2412644Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2412883Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2413096Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2413362Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2413597Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2413804Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2414040Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2414280Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2414515Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2414716Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2414949Z E1204 
11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2415172Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2415375Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2415609Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2415824Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2416041Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2416276Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2416520Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2416754Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2416994Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2417246Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2417458Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2417693Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2417906Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2418149Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2418388Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2418628Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2418862Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2419101Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2419337Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2419591Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2419834Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2420075Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2420350Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2420605Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2420840Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2421083Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2421315Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2421556Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2421803Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2422047Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2422293Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2422535Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2422770Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2423011Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2423250Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2423467Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2423679Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2423921Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2424167Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2424410Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2424642Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2424893Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2425131Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2425371Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2425607Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2425848Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2426082Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2426334Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2426577Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2426790Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2426991Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2427224Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2427436Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2427659Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2427872Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2428120Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2428355Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2428578Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2428783Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2429015Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2429226Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2429436Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2429673Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2429918Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2430183Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2430429Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2430684Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2430897Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2431132Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2431346Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2431597Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2431832Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2432050Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2432264Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2432478Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2432721Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2432971Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2433216Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2433449Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2433691Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2433939Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2434186Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2434421Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2434662Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2434898Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2435123Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2435333Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2435579Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2435796Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2436012Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2436229Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2436474Z 
E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2436708Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2436952Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2437187Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2437444Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2437683Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2437926Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2438165Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2438417Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2438652Z 
E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2438868Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2439081Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2439289Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2439526Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2439742Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2439994Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2440266Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2440485Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2440702Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2440917Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2441160Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2441395Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2441638Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2441886Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2442128Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2442365Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2442609Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2442857Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2443103Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2443336Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2443578Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2443812Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2444070Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2444303Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2444556Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2444791Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2445035Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2445271Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2445513Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2445747Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2445960Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2446166Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2446413Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2446656Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2446893Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2447158Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2447397Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2447640Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2447873Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2448115Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2448349Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2448604Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2448849Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2449053Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2449288Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2449530Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2449768Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2450011Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2450286Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2450516Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2450750Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2450965Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2451181Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2451423Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2451670Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2451899Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2452117Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2452334Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2452551Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2452793Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2453043Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2453273Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2453486Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.2453693Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2453856Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2454094Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2454300Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2454536Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2454779Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2455014Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2455239Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2455446Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2455681Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2455894Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2456108Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2456344Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2456550Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2456785Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2456990Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2457231Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2457445Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2457690Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2457933Z E1204 11:16:01.143000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2458168Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2458414Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2458646Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2458890Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2459123Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2459368Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2459613Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2459856Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2460123Z E1204 11:16:01.143000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2460336Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2460553Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2460789Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2461018Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2461234Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2461448Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2461665Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2461923Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2462172Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2462413Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2462649Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2462854Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2463088Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2463331Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2463565Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2463807Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2464058Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2464290Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2464509Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2464723Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2464939Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2465146Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2465380Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2465622Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2465857Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2466098Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2466343Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2466559Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2466795Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2467038Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2467274Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2467518Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2467754Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2467958Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2468193Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2468445Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2468680Z E1204 11:16:01.143000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2468923Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2469159Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2469398Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2469616Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2469830Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2470044Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2470320Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2470554Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2470773Z E1204 11:16:01.143000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2471020Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2471261Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2471498Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2471746Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2471980Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2472208Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2472425Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2472640Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2472867Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2473111Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2473346Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2473588Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2473834Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2474081Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2474315Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2474518Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2474752Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2474995Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2475241Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2475492Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2475726Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2475955Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2476172Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2476386Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2476602Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2476845Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2477078Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2477329Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2477564Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2477805Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2478039Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2478290Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2478527Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2478771Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2479007Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2479219Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2479448Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2479654Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2479882Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2480148Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2480369Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2480584Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2480790Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2480996Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2481184Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2481324Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.2481486Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2481606Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2481759Z E1204 11:16:01.143000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2481934Z [W1204 11:16:01.646349085 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2481937Z 2025-12-04T12:10:21.2482095Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2482404Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2482726Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2482874Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2483363Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2483634Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2483892Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2484112Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2484340Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2484581Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2484818Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2485061Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2485295Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2485537Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2485768Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2486011Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2486255Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2486497Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2486728Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2486941Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2487173Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2487389Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2487632Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2487864Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2489872Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2490157Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2490401Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2490654Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2490856Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2491089Z E1204 
11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2491301Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2491503Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2491735Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2491946Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2492147Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2492392Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2492635Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2492869Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2493113Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2493357Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2493572Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2493795Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2494008Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2494251Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2494484Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2494741Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2494982Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2495224Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2495457Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2495699Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2495931Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2496172Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2496405Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2496645Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2496889Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2497135Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2497366Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2497608Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2497850Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2498094Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2498327Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2498566Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2498800Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2499057Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2499291Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2499518Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2499729Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2499972Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2500248Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2500490Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2500723Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2500964Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2501197Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2501452Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2501686Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2501925Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2502168Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2502410Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2502646Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2502856Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2503058Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2503291Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2503514Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2503737Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2503964Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2504206Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2504439Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2504650Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2504854Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2505087Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2505299Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2505500Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2505742Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2505983Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2506216Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2506457Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2506696Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2506910Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2507132Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2507347Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2507590Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2507824Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2508053Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2508278Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2508493Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2508737Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2508973Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2509215Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2509450Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2509693Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2509927Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2510218Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2510452Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2510693Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2510926Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2511152Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2511364Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2511600Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2511817Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2512029Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2512245Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2512501Z 
E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2512747Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2512988Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2513223Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2513466Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2513698Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2513941Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2514174Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2514417Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2514662Z 
E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2514880Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2515092Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2515300Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2515540Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2515757Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2516000Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2516234Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2516449Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2516662Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2516891Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2517143Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2517376Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2517621Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2517857Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2518098Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2518335Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2518576Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2518811Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2519064Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2519300Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2519543Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2519777Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2520032Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2520307Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2520552Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2520785Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2521027Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2521278Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2521519Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2521769Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2521983Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2522190Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2522426Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2522668Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2522902Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2523143Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2523380Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2523634Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2523869Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2524112Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2524358Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2524601Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2524835Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2525042Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2525275Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2525516Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2525761Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2526011Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2526244Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2526472Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2526692Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2526907Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2527121Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2527366Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2527599Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2527828Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2528054Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2528270Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2528484Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2528736Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2528974Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2529190Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2529405Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.2529612Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2529777Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2530023Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2530263Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2530513Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2530754Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2530989Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2531203Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2531408Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2531643Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2531855Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2532060Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2532391Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2532595Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2532882Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2533087Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2533333Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2533539Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2533775Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2534019Z E1204 11:16:01.185000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2534253Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2534495Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2534742Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2534992Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2535226Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2535469Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2535706Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2535948Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2536181Z E1204 11:16:01.185000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2536398Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2536604Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2536846Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2537074Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2537291Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2537505Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2537728Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2537974Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2538209Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2538452Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2538687Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2538890Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2539135Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2539386Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2539620Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2539861Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2540135Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2540362Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2540579Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2540793Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2541000Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2541219Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2541455Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2541697Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2541931Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2542184Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2542421Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2542625Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2542859Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2543101Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2543335Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2543594Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2543842Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2544045Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2544278Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2544521Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2544756Z E1204 11:16:01.185000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2544998Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2545231Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2545459Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2545687Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2545901Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2546118Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2546360Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2546605Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2546811Z E1204 11:16:01.185000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2547045Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2547288Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2547521Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2547764Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2548008Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2548252Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2548472Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2548684Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2548901Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2549144Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2549379Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2549620Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2549854Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2550147Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2550381Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2550586Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2550822Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2551077Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2551314Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2551555Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2551788Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2552015Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2552232Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2552460Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2552687Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2552928Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2553163Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2553406Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2553640Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2553883Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2554117Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2554359Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2554603Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2554844Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2555079Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2555290Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2555518Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2555726Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2555937Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2556165Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2556386Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2556601Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2556817Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2557033Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2557219Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2557359Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.2557521Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2557639Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2557784Z E1204 11:16:01.185000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2557955Z [W1204 11:16:01.648425708 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2557959Z 2025-12-04T12:10:21.2558119Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2558427Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2558737Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2558892Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2559383Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler<std::shared_ptr<c10::LazyValue<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > const> (), c10::SetStackTraceFetcher(std::function<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2559652Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) from ??:0 2025-12-04T12:10:21.2559900Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2560165Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2560383Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2560625Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2560859Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2561114Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2561346Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2561597Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2561830Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2562072Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2562306Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2562546Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2562779Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2562992Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2563214Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2563440Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2563682Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2563913Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2564117Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2564360Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2564602Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2564835Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2565038Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2565274Z E1204 
11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2565485Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2565707Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2565950Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2566162Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2566363Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2566596Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2566838Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2567070Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2567310Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2567543Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2567767Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2567988Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2568203Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2568445Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2568688Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2568931Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2569165Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2569406Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2569638Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2569880Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2570163Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2570414Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2570646Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2570887Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2571122Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2571362Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2571595Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2571834Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2572067Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2572322Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2572556Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2572796Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2573027Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2573284Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2573519Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2573735Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2573945Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2574186Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2574432Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2574672Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2574915Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2575155Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2575388Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2575631Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2575862Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2576102Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2576335Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2576575Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2576818Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2577032Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2577237Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2577468Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2577687Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2577912Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2578127Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2578369Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2578601Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2578830Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2579030Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2579273Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2579483Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2579687Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2579921Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2580195Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2580430Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2580669Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2580905Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2581134Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2581355Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2581570Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2581813Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2582072Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2582291Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2582504Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2582718Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2582960Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2583195Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2583451Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2583698Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2583938Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2584172Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2584417Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2584654Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2584897Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2585131Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2585344Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2585560Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2585795Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2586012Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2586224Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2586448Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2586692Z 
E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2586930Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2587170Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2587404Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2587645Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2587890Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2588141Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2588375Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2588617Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2588852Z 
E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2589069Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2589284Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2589489Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2589714Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2589938Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2590440Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2590675Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2590892Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2591118Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2591334Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2591577Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2591813Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2592057Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2592291Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2592544Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2592791Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2593031Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2593266Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2593507Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2593741Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2593983Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2594218Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2594461Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2594708Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2594950Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2595183Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2595425Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2595668Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2595911Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2596146Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2596358Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2596563Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2596808Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2597052Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2597299Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2597539Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2597776Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2598017Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2598252Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2598492Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2598727Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2598969Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2599222Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2599430Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2599663Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2599915Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2600185Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2600426Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2600662Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2600888Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2601106Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2601333Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2601550Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2601803Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2602037Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2602266Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2602484Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2602697Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2602910Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2603152Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2603386Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2603618Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2603832Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.2604041Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2604203Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2604452Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2604659Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2604894Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2605138Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2605371Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2605583Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2605798Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2606047Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2606262Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2606469Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2606706Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2606913Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2607149Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2607357Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2607590Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2607797Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2608041Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2608284Z E1204 11:16:01.187000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2608517Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2608769Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2609006Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2609246Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2609483Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2609724Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2609960Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2610245Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2610492Z E1204 11:16:01.187000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2610705Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2610908Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2611144Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2611375Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2611592Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2611806Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2612021Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2612278Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2612513Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2612757Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2612989Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2613207Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2613443Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2613686Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2613921Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2614161Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2614395Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2614637Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2614867Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2615081Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2615285Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2615489Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2615726Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2615972Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2616207Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2616448Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2616692Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2616896Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2617131Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2617372Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2617614Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2617857Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2618090Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2618296Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2618531Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2618773Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2619017Z E1204 11:16:01.187000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2619268Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2619502Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2619728Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2619944Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2620194Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2620410Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2620650Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2620886Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2621093Z E1204 11:16:01.187000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2621339Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2621583Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2621816Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2622070Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2622305Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2622533Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2622750Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2622961Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2623177Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2623431Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2623679Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2623919Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2624154Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2624398Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2624633Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2624839Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2625072Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2625314Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2625559Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2625802Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2626037Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2626264Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2626489Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2626703Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2626917Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2627159Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2627393Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2627635Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2627879Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2628133Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2628366Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2628607Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2628842Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2629083Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2629318Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2629529Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2629747Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2629961Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2630217Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2630451Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2630673Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2630898Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2631106Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2631311Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2631496Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2631638Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.2631796Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2631916Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2632074Z E1204 11:16:01.187000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2632247Z [W1204 11:16:01.650442342 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2632263Z 2025-12-04T12:10:21.2632422Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2632730Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2633040Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2633185Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2633675Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2633944Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2634182Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2634416Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2634630Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2634873Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2635108Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2635362Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2635599Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2635840Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2636072Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2636311Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2636553Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2636794Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2637036Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2637249Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2637471Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2637690Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2637931Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2638164Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2638367Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2638599Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2638851Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2639084Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2639288Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2639520Z E1204 
11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2639743Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2639949Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2640230Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2640444Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2640646Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2640880Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2641138Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2641384Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2641624Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2641856Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2642067Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2642292Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2642508Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2642748Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2642981Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2643237Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2643470Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2643712Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2643943Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2644196Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2644429Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2644668Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2644902Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2645141Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2645373Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2645625Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2645866Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2646106Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2646341Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2648309Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2648541Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2648781Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2649013Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2649944Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2650257Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2650475Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2650687Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2650930Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2651181Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2651424Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2651655Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2651895Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2652130Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2652374Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2652606Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2652847Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2653083Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2653323Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2653614Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2653825Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2654028Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2654260Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2654491Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2654731Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2654946Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2655190Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2655422Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2655645Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2655849Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2656080Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2656290Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2656491Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2656726Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2656970Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2657204Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2657444Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2657675Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2657888Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2658127Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2658342Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2658586Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2658821Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2659069Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2659282Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2659498Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2659740Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2659987Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2660272Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2660505Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2660748Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2660980Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2661225Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2661459Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2661701Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2661939Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2662151Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2662358Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2662610Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2662828Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2663039Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2663255Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2663528Z 
E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2663763Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2664006Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2664240Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2664503Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2664741Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2664981Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2665214Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2665454Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2665690Z 
E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2665908Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2666121Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2666328Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2666553Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2666771Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2667032Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2667266Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2667481Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2667695Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2667933Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2668176Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2668411Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2668651Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2668895Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2669140Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2669374Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2669615Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2669848Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2670126Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2670361Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2670603Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2670835Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2671079Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2671333Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2671575Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2671809Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2672049Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2672300Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2672554Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2672791Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2673004Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2673207Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2673461Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2673705Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2673940Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2674183Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2674417Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2674659Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2674893Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2675137Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2675370Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2675612Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2675860Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2676064Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2676299Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2676542Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2676788Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2677040Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2677275Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2677502Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2677717Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2677943Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2678159Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2678400Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2678633Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2678864Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2679082Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2679294Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2679508Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2679749Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2679983Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2680250Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2680463Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.2680669Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2680831Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2681068Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2681304Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2681539Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2681782Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2682016Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2682249Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2682454Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2682688Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2682900Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2683103Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2683337Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2683543Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2683779Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2683982Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2684216Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2684420Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2684671Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2684914Z E1204 11:16:01.189000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2685150Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2685392Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2685637Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2685892Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2686129Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2686370Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2686614Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2686859Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2687093Z E1204 11:16:01.189000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2687305Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2687509Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2687743Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2687973Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2688189Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2688406Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2688623Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2688867Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2689118Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2689360Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2689599Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2689802Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2690048Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2690324Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2690558Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2690802Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2691049Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2691280Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2691496Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2691712Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2691918Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2692123Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2692360Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2692601Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2692836Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2693077Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2693315Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2693618Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2693852Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2694094Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2694327Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2694584Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2694828Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2695034Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2695269Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2695519Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2695758Z E1204 11:16:01.189000 
852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2695999Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2696233Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2696460Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2696678Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2696894Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2697109Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2697351Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2697584Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2697791Z E1204 11:16:01.189000 852494 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2698038Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2698282Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2698517Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2698757Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2699008Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2699249Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2699467Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2699683Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2699908Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2700263Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2700498Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2700742Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2700976Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2701220Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2701458Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2701663Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2701897Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2702140Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2702376Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2702632Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2702869Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2703100Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2703316Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2703545Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2703772Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2704017Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2704251Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2704506Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2704742Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2704983Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2705218Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2705460Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2705698Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2705939Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2706174Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2706386Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2706602Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2706809Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2707030Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2707261Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2707482Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2707697Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2707919Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2708136Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2708323Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2708464Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.2708623Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2708741Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2708893Z E1204 11:16:01.189000 852494 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2708954Z FAILED [1.4813s] [100%] 2025-12-04T12:10:21.2708956Z 2025-12-04T12:10:21.2709033Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.2709205Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.2709270Z Traceback (most recent call last): 2025-12-04T12:10:21.2709447Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.2709508Z method(*args, **kwargs) 2025-12-04T12:10:21.2709676Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.2709733Z method(*args, **kwargs) 2025-12-04T12:10:21.2709899Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.2709955Z with policy(): 2025-12-04T12:10:21.2710154Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.2710213Z raise RuntimeError(msg) 2025-12-04T12:10:21.2710641Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. 
CUDA driver allocated memory was 807403520 and is now 1954545664. 2025-12-04T12:10:21.2710643Z 2025-12-04T12:10:21.2710737Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.2711028Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.2711031Z 2025-12-04T12:10:21.2711154Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.2711250Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.2711312Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.2711389Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.2711959Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.2712076Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.2712151Z graph_break [] 2025-12-04T12:10:21.2712234Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.2712338Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.2712840Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.2712906Z current_size = base.storage().size() 2025-12-04T12:10:21.2712965Z Autotune Choices Stats: 2025-12-04T12:10:21.2713374Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009560000151395798, "best_triton_pos": 0} 2025-12-04T12:10:21.2713461Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.2713529Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.2713667Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.2713923Z triton_mm_33 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2714172Z triton_mm_34 0.0108 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2714415Z triton_mm_22 0.0112 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2714659Z triton_mm_30 0.0114 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2714900Z triton_mm_21 0.0121 ms 78.9% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2715146Z triton_mm_15 0.0123 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2715389Z triton_mm_16 0.0124 ms 77.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2715642Z triton_mm_29 0.0124 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2715883Z triton_mm_23 0.0129 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2716125Z triton_mm_31 0.0138 ms 69.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2716293Z SingleProcess AUTOTUNE benchmarking takes 0.1771 seconds and 8.9941 seconds precompiling for 33 choices 2025-12-04T12:10:21.2716476Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.2716539Z Traceback (most recent call last): 2025-12-04T12:10:21.2716712Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.2716770Z method(*args, **kwargs) 2025-12-04T12:10:21.2716938Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.2716995Z method(*args, **kwargs) 2025-12-04T12:10:21.2717162Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.2717219Z with policy(): 2025-12-04T12:10:21.2717399Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.2717459Z raise RuntimeError(msg) 2025-12-04T12:10:21.2717883Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 
2025-12-04T12:10:21.2717885Z 2025-12-04T12:10:21.2717976Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.2718263Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.2718265Z 2025-12-04T12:10:21.2718373Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.2718464Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.2718526Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.2718600Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.2719164Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.2719278Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.2719333Z graph_break [] 2025-12-04T12:10:21.2719413Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.2719506Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.2720028Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.2720139Z current_size = base.storage().size() 2025-12-04T12:10:21.2720198Z Autotune Choices Stats: 2025-12-04T12:10:21.2720584Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009560000151395798, "best_triton_pos": 0} 2025-12-04T12:10:21.2720684Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.2720752Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.2720901Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.2721152Z triton_mm_33 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2721396Z triton_mm_34 0.0108 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2721639Z triton_mm_22 0.0112 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2721896Z triton_mm_30 0.0114 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2722140Z triton_mm_21 0.0121 ms 78.9% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2722384Z triton_mm_15 0.0123 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2722624Z triton_mm_16 0.0124 ms 77.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2722866Z triton_mm_29 0.0124 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2723108Z triton_mm_23 0.0129 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2723350Z triton_mm_31 0.0138 ms 69.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2723496Z SingleProcess AUTOTUNE benchmarking takes 0.1771 seconds and 8.9941 seconds precompiling for 33 choices 2025-12-04T12:10:21.2723588Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.2723648Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.2723743Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.2723859Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 
2025-12-04T12:10:21.2724360Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.2724415Z graph_break [] 2025-12-04T12:10:21.2724494Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.2724583Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.2724986Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:21.2725096Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.2725155Z Autotune Choices Stats: 2025-12-04T12:10:21.2725541Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00827999971807003, "best_triton_pos": 0} 2025-12-04T12:10:21.2725622Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.2725690Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.2725835Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.2726089Z triton_mm_72 0.0083 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2726332Z triton_mm_71 0.0098 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2726391Z _scaled_mm 0.0102 ms 80.9% 2025-12-04T12:10:21.2726632Z triton_mm_60 0.0112 ms 74.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2726875Z triton_mm_67 0.0113 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2727119Z triton_mm_54 0.0116 ms 71.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2727360Z triton_mm_61 0.0118 ms 69.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2727600Z triton_mm_59 0.0120 ms 68.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2727843Z triton_mm_68 0.0121 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2728096Z triton_mm_53 0.0122 ms 
68.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2728241Z SingleProcess AUTOTUNE benchmarking takes 0.2600 seconds and 0.7879 seconds precompiling for 39 choices 2025-12-04T12:10:21.2728311Z =================================== FAILURES =================================== 2025-12-04T12:10:21.2728482Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.2728544Z Traceback (most recent call last): 2025-12-04T12:10:21.2728716Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.2728786Z method(*args, **kwargs) 2025-12-04T12:10:21.2728956Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.2729026Z method(*args, **kwargs) 2025-12-04T12:10:21.2729192Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.2729247Z with policy(): 2025-12-04T12:10:21.2729416Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.2729473Z raise RuntimeError(msg) 2025-12-04T12:10:21.2729893Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3965714432. 
2025-12-04T12:10:21.2729906Z 2025-12-04T12:10:21.2730000Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.2730339Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.2730342Z 2025-12-04T12:10:21.2730447Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.2730537Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.2730598Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.2730671Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.2731236Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.2731352Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.2731405Z graph_break [] 2025-12-04T12:10:21.2731485Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.2731574Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.2732073Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.2732139Z current_size = base.storage().size() 2025-12-04T12:10:21.2732223Z Autotune Choices Stats: 2025-12-04T12:10:21.2732608Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009560000151395798, "best_triton_pos": 0} 2025-12-04T12:10:21.2732690Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.2732757Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.2732893Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.2733144Z triton_mm_33 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2733415Z triton_mm_34 0.0108 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2733660Z triton_mm_22 0.0112 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2733901Z triton_mm_30 0.0114 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2734160Z triton_mm_21 0.0121 ms 78.9% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2734405Z triton_mm_15 0.0123 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2734647Z triton_mm_16 0.0124 ms 77.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2734885Z triton_mm_29 0.0124 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2735126Z triton_mm_23 0.0129 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2735371Z triton_mm_31 0.0138 ms 69.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2735516Z SingleProcess AUTOTUNE benchmarking takes 0.1771 seconds and 8.9941 seconds precompiling for 33 choices 2025-12-04T12:10:21.2735607Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.2735665Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.2735739Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.2735853Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 
2025-12-04T12:10:21.2736357Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.2736429Z graph_break [] 2025-12-04T12:10:21.2736507Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.2736596Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.2736972Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T12:10:21.2737079Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.2737136Z Autotune Choices Stats: 2025-12-04T12:10:21.2737546Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00827999971807003, "best_triton_pos": 0} 2025-12-04T12:10:21.2737627Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.2737695Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.2737831Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.2738079Z triton_mm_72 0.0083 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2738334Z triton_mm_71 0.0098 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2738397Z _scaled_mm 0.0102 ms 80.9% 2025-12-04T12:10:21.2738638Z triton_mm_60 0.0112 ms 74.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2738877Z triton_mm_67 0.0113 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2739116Z triton_mm_54 0.0116 ms 71.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2739358Z triton_mm_61 0.0118 ms 69.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2739599Z triton_mm_59 0.0120 ms 68.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2739838Z triton_mm_68 0.0121 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2740078Z triton_mm_53 0.0122 ms 
68.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2740278Z SingleProcess AUTOTUNE benchmarking takes 0.2600 seconds and 0.7879 seconds precompiling for 39 choices 2025-12-04T12:10:21.2740368Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.2740443Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.2740519Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.2740634Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.2741100Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.2741153Z graph_break [] 2025-12-04T12:10:21.2741233Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.2741337Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.2741395Z Autotune Choices Stats: 2025-12-04T12:10:21.2741891Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.009359999559819698, "best_triton_pos": 1, "best_triton_time": 0.010479999706149101, "best_triton_kernel": "triton_mm_92", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2"} 2025-12-04T12:10:21.2741972Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 
2025-12-04T12:10:21.2742038Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.2742173Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.2742233Z _scaled_mm 0.0094 ms 100.0% 2025-12-04T12:10:21.2742490Z triton_mm_92 0.0105 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2742737Z triton_mm_106 0.0115 ms 81.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2742976Z triton_mm_98 0.0116 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2743223Z triton_mm_110 0.0116 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2743466Z triton_mm_105 0.0119 ms 78.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2743709Z triton_mm_97 0.0122 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.2743953Z triton_mm_109 0.0122 ms 76.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2744192Z triton_mm_99 0.0123 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2744437Z triton_mm_91 0.0124 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.2744594Z SingleProcess AUTOTUNE benchmarking takes 0.2697 seconds and 0.6267 seconds precompiling for 39 choices 2025-12-04T12:10:21.2744802Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-925efe9ecb873b90.xml - 2025-12-04T12:10:21.2744879Z =========================== short test summary info ============================ 2025-12-04T12:10:21.2745516Z FAILED [1.4813s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3965714432. 
2025-12-04T12:10:21.2745531Z 2025-12-04T12:10:21.2745622Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.2745920Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.2745923Z 2025-12-04T12:10:21.2746027Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.2746104Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.2746190Z ================= 1 failed, 187 deselected, 2 rerun in 41.69s ================== 2025-12-04T12:10:21.2746246Z Got exit code 1 2025-12-04T12:10:21.2746305Z Retrying single test... 2025-12-04T12:10:21.2746481Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-20658da356ef03da.xml 2025-12-04T12:10:21.2746557Z ============================= test session starts ============================== 2025-12-04T12:10:21.2746686Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.2746747Z cachedir: .pytest_cache 2025-12-04T12:10:21.2746921Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.2746985Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.2747043Z configfile: pytest.ini 2025-12-04T12:10:21.2747223Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.2747315Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.2747600Z stepcurrent: skipping 109 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.2747661Z Running 1 items in this shard 2025-12-04T12:10:21.2747664Z 2025-12-04T12:10:21.2748027Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:16:11.197846185 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2748030Z 2025-12-04T12:10:21.2748359Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.2748668Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2748828Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2749323Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2749592Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2749837Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone 
.cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2750082Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2750335Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2750578Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2750814Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2751077Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2751312Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2751555Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2751787Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2752027Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 
2025-12-04T12:10:21.2752263Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2752503Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2752736Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2752977Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2753211Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2753467Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2753699Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2753908Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2754139Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2754396Z E1204 
11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2754641Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2754848Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2755081Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2755325Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2755569Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2755811Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2756044Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2756259Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2756486Z E1204 11:16:19.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2756662Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2756860Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2757405Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpij0fki26/wn/cwnqq7dy3yhlz2ui4gxvx23k3somra6vvymz23td3natl7ln76kt.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.2757566Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.2757802Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.2757986Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.2758292Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.2758441Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.2758712Z E1204 11:16:19.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.2758880Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.2759160Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.2759332Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.2759614Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.2759763Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.2760062Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.2760314Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.2760643Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2760948Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2761093Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2761585Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2761855Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2762094Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2762318Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2762558Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2762799Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2763034Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2763276Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2763524Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2763785Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2764016Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2764258Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2764503Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2764748Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2764983Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2765223Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2765456Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2765697Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2765936Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2766139Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2766373Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2766615Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2766850Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2767068Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2767300Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2767543Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2767775Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2768038Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2768272Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2768486Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2768712Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2768895Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2769092Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2769211Z E1204 11:16:19.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.2769385Z [W1204 11:16:19.078178804 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.2769387Z 2025-12-04T12:10:21.2769555Z [W1204 11:16:19.081987554 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2769557Z 2025-12-04T12:10:21.2769882Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.2770231Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2770378Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2770868Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2771135Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2771397Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2771617Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2771831Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2772072Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2772326Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2772581Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2772815Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2773056Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2773288Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2773542Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2773776Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2774015Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2774246Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2774488Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2774723Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2774964Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2775196Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2775400Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2775634Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2775886Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2776117Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2776319Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2776552Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2776803Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2777048Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2777288Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2777520Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2777745Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2777970Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2778147Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2778338Z E1204 
11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2778878Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpij0fki26/xl/cxlpsyvwnrk56nfsyrgh7nfpknbn6y5wlvuils5tr5kvs5vihhvw.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.2779039Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.2779267Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.2779437Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.2779738Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.2779885Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.2780194Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.2780369Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.2780637Z E1204 11:16:19.617000 858404 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.2780806Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.2781088Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.2781256Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.2781558Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.2781765Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.2782094Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2782413Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2782560Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2783049Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2785260Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2785510Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2785736Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2785952Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2786194Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2786429Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2786674Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2786928Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2787169Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2787405Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2787647Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2787902Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2788144Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2788378Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2788618Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2788866Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2789110Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2789345Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2789550Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2789782Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2790025Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2790295Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2790500Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2790732Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2790975Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2791209Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2791471Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2791704Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2791918Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2792144Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2792347Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2792542Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2792662Z E1204 11:16:19.617000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.2792983Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.2793303Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2793452Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2793948Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2794215Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2794454Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2794675Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2794889Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2795131Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2795366Z E1204 11:16:19.627000 858404 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2795609Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2795857Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2796099Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2796332Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2796574Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2796831Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2797071Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2797302Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2797543Z E1204 11:16:19.627000 858404 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2797787Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2798031Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2798263Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2798467Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2798701Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2798942Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2799177Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2799378Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2799611Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2799851Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2800084Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2800413Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2800645Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2800861Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2801089Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2801292Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2801485Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2802026Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpij0fki26/ni/cnin4lnfsblbx6fcfua3ufkr54syid3fvofi5mninn5ugvpfcwvr.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.2802188Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.2802431Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.2802602Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.2802902Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.2803050Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.2803320Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.2803476Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.2803754Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.2803924Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.2804205Z E1204 11:16:19.627000 858404 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.2804353Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.2804643Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.2804862Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.2805190Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2805495Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2805652Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2806159Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2806425Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2806665Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2806898Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2807116Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2807358Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2807591Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2807834Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2808068Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2808313Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2808548Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2808787Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2809019Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2809276Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2809511Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2809752Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2809983Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2810285Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2810545Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2810751Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2810986Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2811226Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2811480Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2811683Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2811917Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2812156Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2812389Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2812630Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2812865Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2813083Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2813309Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2813483Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2813697Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2813814Z E1204 11:16:19.627000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.2813985Z [W1204 11:16:19.094513031 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.2813988Z 2025-12-04T12:10:21.2814310Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.2814616Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2814792Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2815278Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2815543Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2815804Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2816025Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2816238Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2816478Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2816710Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2816954Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2817187Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2817428Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2817661Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2817901Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2818147Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2818386Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2818619Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2818858Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2819104Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2819354Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2819586Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2819789Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2820020Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2820313Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2820549Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2820753Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2820986Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2821227Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2821461Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2821702Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2821935Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2822149Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2822374Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2822564Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2822759Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2823304Z E1204 11:16:19.634000 858404 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpij0fki26/gd/cgdjqihwb6wei5vogddrxsr3uj6k3mkwcpwmeop433zt6scy7oby.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.2823464Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.2823709Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.2823893Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.2824194Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.2824341Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.2824621Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.2824776Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.2825044Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.2825214Z E1204 11:16:19.634000 
858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.2825498Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.2825650Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.2825940Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.2826147Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.2826474Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2826779Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2826924Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2827428Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2827694Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2827936Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2828168Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2828393Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2828636Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2828871Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2829124Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2829358Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2829599Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2829830Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2830072Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2830344Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2830584Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2830816Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2831056Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2831290Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2831543Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2831797Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2832010Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2832245Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2832500Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2832772Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2832982Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2833212Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2833452Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2833696Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2833940Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2834173Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2834389Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2834613Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2834789Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2834986Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2835104Z E1204 11:16:19.634000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.2835275Z [W1204 11:16:19.098045345 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.2835277Z 2025-12-04T12:10:21.2835600Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.2835905Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2836064Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2836549Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2836818Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2837079Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2837310Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2837529Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2837771Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2838005Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2838258Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2838493Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2838736Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2838967Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2839213Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2839447Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2839689Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2839923Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2840204Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2840437Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2840695Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2840927Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2841129Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2841362Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2841619Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2841867Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2842071Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2842303Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2842560Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2842796Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2843038Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2843271Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2843488Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2843713Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2843887Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2844082Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2844620Z E1204 11:16:19.637000 858404 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpij0fki26/yg/cyggx77l4i7zrf7ivjowg6o77wwwtockbmpdrehaz4wylhwiljfg.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.2844784Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.2845018Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.2845196Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.2845496Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.2845642Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.2845915Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.2846082Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.2846366Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.2846538Z E1204 11:16:19.637000 
858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.2846818Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.2846970Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.2847274Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.2847487Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.2847814Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2848122Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2848272Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2848763Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2849032Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2849273Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2849497Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2849731Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2849974Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2850245Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2850487Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2850762Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2851006Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2851243Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2851489Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2851737Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2851983Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2852215Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2852460Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2852696Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2852939Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2853177Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2853382Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2853621Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2853864Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2854118Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2854326Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2854562Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2854806Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2855039Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2855307Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2855543Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2855764Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2855992Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2856180Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2856380Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2856498Z E1204 11:16:19.637000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.2856675Z [W1204 11:16:19.101976804 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.2856678Z 2025-12-04T12:10:21.2857002Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.2857315Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2857465Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2857955Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2858226Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2858468Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2858703Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2858924Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2859168Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2859404Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2859680Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2859917Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2860191Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2860427Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2860704Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2860940Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2861187Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2861420Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2861668Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2861904Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2862153Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2862392Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2862598Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2862837Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2863081Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2863333Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2863536Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2863784Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2864032Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2864296Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2864545Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2864781Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2865003Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2865243Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2865425Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2865624Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2866159Z E1204 11:16:19.641000 858404 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmpij0fki26/65/c65tpzsr225cayjsrgcktcqiksvzthry7o5t3fk3ni7hl7gdse5m.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.2866327Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.2866559Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.2866734Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.2867039Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.2867194Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.2867471Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.2867686Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.2867960Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.2868133Z E1204 11:16:19.641000 
858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.2868421Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.2868572Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.2868891Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.2869109Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.2869440Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2869754Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2869915Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2870444Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2870715Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2870957Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2871186Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2871404Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2871653Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2871890Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2872144Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2872403Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2872646Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2872884Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2873127Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2873379Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2873635Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2873876Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2874123Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2874366Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2874630Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2874866Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2875077Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2875316Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2875560Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2875803Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2876012Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2876256Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2876498Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2876739Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2876997Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2877231Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2877454Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.2877680Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.2877874Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.2878080Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.2878207Z E1204 11:16:19.641000 858404 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.2878283Z ('RERUN', {'yellow': True}) [11.7583s] [100%] 2025-12-04T12:10:21.2878657Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:16:21.014715612 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2878660Z 2025-12-04T12:10:21.2878830Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2879153Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.2879467Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2879612Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2880170Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2880444Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2880685Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2880910Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2881127Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2881379Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2881630Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2881877Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2882115Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2882357Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2882630Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2882873Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2883112Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2883358Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2883614Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2883835Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2884061Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2884283Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2884526Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2884767Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2884977Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2885210Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame 
from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2885454Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2885688Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2885896Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2886145Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2886361Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2886564Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2886800Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2887025Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2887239Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2887474Z E1204 
11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2887715Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2887964Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2888208Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2888444Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2888660Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2888883Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2889101Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2889345Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2889580Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2889821Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2890058Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2890344Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2890591Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2890835Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2891071Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2891317Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2891578Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2891834Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2892075Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2892316Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2892566Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2892811Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2893048Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2893292Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2893531Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2893779Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2894015Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2894260Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2894494Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2894716Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2894932Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2895187Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2895424Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2895666Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2895907Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2896183Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2896422Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2896665Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2896898Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2897156Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2897393Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2897638Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2897871Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2898087Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2898297Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2898534Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2898748Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2898970Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2899188Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2899431Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2899680Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2899894Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2900130Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2900369Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2900595Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2900825Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2901060Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2901305Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2901555Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2901800Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2902036Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2902249Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2902478Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2902697Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2902950Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2903190Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2903410Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2903625Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2903842Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2904106Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2904340Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2904588Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2904824Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2905078Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2905326Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2905568Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2905807Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2906061Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2906303Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2906524Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2906731Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2906972Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2907191Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2907411Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2907627Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2907877Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2908123Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2908370Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2908623Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2908866Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2909105Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2909352Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2909614Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2909863Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2910135Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2910357Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2910588Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2910805Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2911036Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2911252Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2911500Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2911737Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2911962Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2912176Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2912398Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2912645Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2912883Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2913148Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2913384Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2913631Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2913866Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2914128Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2914382Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2914625Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2914866Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2915119Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2915363Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2915612Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2915853Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2916100Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2916338Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2916587Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2916823Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2917071Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2917308Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2917530Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2917752Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2917990Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2918239Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2918476Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2918750Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2918987Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2919234Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2919474Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2919729Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2919970Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2920245Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2920487Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2920718Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2920959Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2921210Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2921447Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2921698Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2921935Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2922189Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2922411Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2922626Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2922848Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2923094Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2923363Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2923593Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2923815Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2924033Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2924263Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2924515Z E1204 
11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2924751Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2924974Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2925190Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2925404Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2925575Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.2925812Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2926022Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2926258Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2926507Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2926754Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2926973Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2927180Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2927415Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2927640Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2927859Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2928099Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2928304Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2928545Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2928765Z E1204 11:16:21.568000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2929004Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2929213Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2929449Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2929696Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2929932Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2930192Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2930437Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2930679Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2930918Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2931180Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2931419Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2931662Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2931902Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2932133Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2932351Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2932592Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2932822Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2933045Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2933274Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2933498Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2933745Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2933980Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2934227Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2934464Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2934672Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2934907Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2935154Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2935397Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2935658Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2935898Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2936124Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2936345Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2936571Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2936795Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2937004Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2937239Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.2937484Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2937733Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2937983Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2938217Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2938425Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2938664Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2938908Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2939146Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2939387Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2939627Z 
E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2939833Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2940155Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2940405Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2940640Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2940888Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2941142Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2941398Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2941621Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2941837Z E1204 11:16:21.568000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2942059Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2942315Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2942559Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2942766Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2943005Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2943254Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2943490Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2943739Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2943974Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2944205Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2944424Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2944657Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2944878Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2945125Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2945363Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2945617Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2945868Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2946111Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2946349Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2946557Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2946804Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2947055Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2947290Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2947541Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2947778Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2948012Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2948230Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2948443Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2948659Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2948902Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2949151Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2949393Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2949629Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2949875Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2950154Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2950412Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2950647Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2950892Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2951146Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2951364Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.2951583Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.2951789Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.2952002Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.2952232Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.2952460Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.2952673Z E1204 
11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.2952883Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.2953093Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.2953280Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.2953426Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:21.2953602Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.2953731Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.2953874Z E1204 11:16:21.568000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.2954052Z [W1204 11:16:21.032709317 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.2954055Z 2025-12-04T12:10:21.2954215Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.2954544Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.2954868Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.2955017Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.2955527Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.2955798Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.2956043Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.2956268Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.2956484Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2956733Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2956972Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2957220Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2957456Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2957702Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2957941Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2958195Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2958431Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2958671Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2958907Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2959141Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2959369Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2959586Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2959829Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2960077Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2960320Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2960555Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2960797Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2961032Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2961242Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2961476Z E1204 
11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2961690Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2961892Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2962128Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2962343Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2962567Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2962852Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2963094Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2963329Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2963589Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2963838Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2964050Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2964276Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2964493Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2964750Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2964990Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2965232Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2965466Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2965708Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2965946Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2966188Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2966419Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2966662Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2966895Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2967151Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2967385Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2967628Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2967863Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2968115Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2968367Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2968608Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2968842Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2969096Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2969330Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2969573Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2969810Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2970028Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2970277Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.2970524Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2970762Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2971005Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2971241Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2971484Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2971735Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2971977Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2972218Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2972464Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2972723Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2972970Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2973203Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2973420Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2973636Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2973876Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2974093Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2974316Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.2974538Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2974786Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2975026Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2975239Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2975448Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2975686Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2975900Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2976121Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2976354Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2976601Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2976835Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2977094Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2977343Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2977557Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2977784Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2977997Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2978258Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2978496Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2978718Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2978936Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2979154Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2979405Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2979644Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2979891Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2980169Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2980418Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2980677Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2980920Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2981159Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2981402Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2981657Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2981885Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2982099Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2982340Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2982558Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2982789Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2983008Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2983256Z 
E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2983491Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2983737Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2983979Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2984222Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2984462Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2984706Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2984948Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2985202Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2985441Z 
E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2985663Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2985876Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2986106Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.2986345Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.2986565Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2986812Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2987065Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2987288Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2987505Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2987726Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2987970Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2988211Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2988456Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2988695Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2988943Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2989178Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2989430Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2989676Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2989924Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2990200Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2990443Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2990697Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2990955Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2991194Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2991435Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2991687Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2991939Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2992175Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2992422Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2992659Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2992880Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.2993088Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2993329Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2993578Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2993814Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2994062Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2994314Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2994562Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2994799Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2995049Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2995312Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2995556Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2995796Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2996003Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.2996254Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2996501Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2996740Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2996988Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.2997225Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2997459Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2997679Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2997896Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2998116Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2998360Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.2998602Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.2998845Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.2999067Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.2999283Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.2999505Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.2999775Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3000012Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3000272Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3000487Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3000710Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3000878Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3001118Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3001324Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3001566Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3001816Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3002053Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3002273Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3002479Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3002719Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3002933Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3003177Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3003416Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3003620Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3003860Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3004082Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3004337Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3004544Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3004785Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3005032Z E1204 11:16:21.571000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3005279Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3005530Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3005765Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3006012Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3006250Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3006497Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3006740Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3006984Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3007220Z E1204 11:16:21.571000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3007435Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3007658Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3007897Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3008129Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3008349Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3008579Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3008814Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3009059Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3009298Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3009543Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3009788Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3010001Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3010276Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3010525Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3010763Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3011012Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3011251Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3011480Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3011704Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3011920Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3012150Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3012356Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3012597Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3012845Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3013083Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3013361Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3013597Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3013806Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3014042Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3014303Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3014543Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3014786Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3015025Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3015230Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3015472Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3015717Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3015954Z E1204 11:16:21.571000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3016200Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3016437Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3016685Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3016902Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3017122Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3017338Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3017588Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3017866Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3018076Z E1204 11:16:21.571000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3018334Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3018577Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3018837Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3019083Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3019323Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3019557Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3019775Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3019996Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3020250Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3020499Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3020739Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3020986Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3021241Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3021484Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3021724Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3021930Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3022184Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3022443Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3022680Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3022929Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3023164Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3023408Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3023628Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3023847Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3024067Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3024310Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3024554Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3024798Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3025037Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3025280Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3025522Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3025780Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3026016Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3026262Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3026498Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3026725Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3026955Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3027166Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3027381Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3027614Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3027859Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3028075Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3028287Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3028494Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3028685Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3028831Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.3028995Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.3029122Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.3029264Z E1204 11:16:21.571000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3029443Z [W1204 11:16:21.035101896 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.3029445Z 2025-12-04T12:10:21.3029605Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3029919Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.3030274Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.3030424Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.3030924Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.3031212Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.3031471Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.3031692Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.3031913Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3032172Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3032411Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3032658Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3032894Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3033141Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3033375Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3033624Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3033859Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3034101Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3034339Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3034554Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3034796Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3035012Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3035261Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3035501Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3035719Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3035974Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3036216Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3036453Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3036668Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3036908Z E1204 
11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3037126Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3037330Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3037568Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3037785Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3037995Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3038228Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3038474Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3038711Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3038955Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3039204Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3039415Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3039642Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3039858Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3040162Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3040416Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3040658Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3040895Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3041137Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3041388Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3041633Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3041870Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3042116Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3042348Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3042596Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3042831Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3043076Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3043310Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3043557Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3043811Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3044053Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3044290Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3044532Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3044783Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3045036Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3045275Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3045497Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3045719Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.3045967Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3046202Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3046449Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3046684Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3046925Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3047162Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3047404Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3047644Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3047886Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3048125Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3048380Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3048613Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3048829Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3049034Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3049292Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3049504Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3049732Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.3049952Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3050257Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3050497Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3050710Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3050916Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3051150Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3051368Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3051578Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3051812Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3052058Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3052290Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3052540Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3052789Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3053005Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3053230Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3053447Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3053716Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3053966Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3054187Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3054400Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3054619Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3054880Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3055121Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3055368Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3055603Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3055852Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3056090Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3056337Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3056578Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3056822Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3057061Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3057289Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3057501Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3057737Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3057959Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3058189Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3058416Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3058664Z 
E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3058900Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3059147Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3059393Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3059640Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3059881Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3060158Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3060402Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3060647Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3060886Z 
E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3061227Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3061449Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3061663Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3061907Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3062125Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3062372Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3062614Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3062845Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3063077Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3063298Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3063542Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3063781Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3064038Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3064279Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3064523Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3064765Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3065012Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3065248Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3065494Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3065729Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3065976Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3066216Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3066471Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3066711Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3066953Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3067192Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3067457Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3067698Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3067945Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3068179Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3068406Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3068613Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3068850Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3069092Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3069328Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3069573Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3069811Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3070059Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3070323Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3070565Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3070802Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3071066Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3071305Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3071513Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3071749Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3072020Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3072259Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3072501Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3072738Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3072982Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3073201Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3073418Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3073633Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3073877Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3074113Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3074345Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3074564Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3074781Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3075001Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3075246Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3075495Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3075711Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3075929Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3076137Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3076312Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3076560Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3076766Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3077008Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3077252Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3077502Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3077719Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3077924Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3078161Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3078374Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3078582Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3078819Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3079027Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3079264Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3079472Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3079711Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3079926Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3080204Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3080446Z E1204 11:16:21.574000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3080684Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3080955Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3081189Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3081434Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3081668Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3081930Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3082170Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3082416Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3082651Z E1204 11:16:21.574000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3082867Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3083075Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3083311Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3083542Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3083758Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3083976Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3084195Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3084458Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3084696Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3084939Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3085176Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3085403Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3085640Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3085885Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3086119Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3086380Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3086616Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3086851Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3087070Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3087284Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3087495Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3087701Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3087939Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3088181Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3088418Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3088665Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3088910Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3089117Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3089354Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3089598Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3089853Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3090163Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3090400Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3090605Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3090857Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3091101Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3091337Z E1204 11:16:21.574000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3091581Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3091821Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3092052Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3092271Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3092487Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3092702Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3092947Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3093183Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3093403Z E1204 11:16:21.574000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3093640Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3093881Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3098064Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3098360Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3098603Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3098829Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3099049Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3099275Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3099492Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3099735Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3099970Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3100246Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3100483Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3100733Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3100968Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3101173Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3101409Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3101671Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3101907Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3102147Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3102384Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3102627Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3102858Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3103078Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3103292Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3103539Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3103788Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3104033Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3104268Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3104509Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3104745Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3104990Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3105230Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3105471Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3105708Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3105922Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3106151Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3106358Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3106570Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3106801Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3107026Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3107271Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3107477Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3107684Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3107873Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3108016Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.3108191Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.3108316Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.3108462Z E1204 11:16:21.574000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3108634Z [W1204 11:16:21.077269936 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.3108639Z 2025-12-04T12:10:21.3108799Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3109111Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.3109423Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.3109572Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.3110067Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.3110383Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.3110641Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.3110860Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.3111076Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3111317Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3111566Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3111821Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3112056Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3112296Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3112527Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3112783Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3113017Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3113257Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3113490Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3113702Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3113928Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3114141Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3114381Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3114613Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3114823Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3115066Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3115309Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3115544Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3115746Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3115991Z E1204 
11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3116212Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3116416Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3116646Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3116861Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3117075Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3117309Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3117551Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3117790Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3118032Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3118266Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3118479Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3118702Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3118915Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3119159Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3119404Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3119646Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3119876Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3120161Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3120407Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3120661Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3120896Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3121136Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3121370Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3121625Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3121859Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3122102Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3122334Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3122579Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3122815Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3123059Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3123294Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3123534Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3123770Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3124024Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3124259Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3124474Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3124688Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.3124950Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3125193Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3125434Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3125667Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3125921Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3126155Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3126399Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3126637Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3126877Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3127111Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3127352Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3127589Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3127802Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3128008Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3128243Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3128464Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3128686Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.3128899Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3129141Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3129384Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3129608Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3129816Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3130051Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3130297Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3130513Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3130748Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3130992Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3131224Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3131468Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3131701Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3131913Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3132135Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3132354Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3132598Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3132850Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3133066Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3133279Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3133493Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3133736Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3133997Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3134239Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3134474Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3134719Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3134965Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3135210Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3135443Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3135685Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3135919Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3136135Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3136341Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3136575Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3136793Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3137006Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3137239Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3137482Z 
E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3137719Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3137961Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3138207Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3138462Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3138697Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3138942Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3139176Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3139427Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3139669Z 
E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3139884Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3140134Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3140341Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3140570Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3140786Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3141029Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3141264Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3141481Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3141718Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3141933Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3142179Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3142414Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3142658Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3142919Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3143162Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3143397Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3143638Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3143888Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3144131Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3144368Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3144616Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3144849Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3145094Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3145327Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3145571Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3145806Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3146048Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3146295Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3146537Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3146776Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3146994Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3147213Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3147463Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3147706Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3147942Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3148193Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3148430Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3148671Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3148905Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3149147Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3149384Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3149627Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3149859Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3150065Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3150334Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3150577Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3150827Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3151067Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3151300Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3151528Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3151775Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3151989Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3152203Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3152446Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3152709Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3152942Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3153158Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3153371Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3153585Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3153829Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3154068Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3154286Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3154499Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3154704Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3154869Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3155117Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3155321Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3155555Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3155798Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3156044Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3156267Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3156476Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3156714Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3156925Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3157151Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3157387Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3157591Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3157825Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3158030Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3158266Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3158471Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3158705Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3158949Z E1204 11:16:21.616000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3159184Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3159429Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3159678Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3159920Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3160196Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3160439Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3160698Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3160942Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3161175Z E1204 11:16:21.616000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3161389Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3161611Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3161847Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3162076Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3162292Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3162506Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3162720Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3162967Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3163201Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3163442Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3163678Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3163884Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3164133Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3164373Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3164607Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3164850Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3165104Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3165332Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3165547Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3165762Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3165978Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3166187Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3166425Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3166666Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3166899Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3167141Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3167376Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3167579Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3167814Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3168056Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3168397Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3168652Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3168886Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3169091Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3169323Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3169591Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3169825Z E1204 11:16:21.616000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3170066Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3170338Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3170578Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3170796Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3171008Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3171224Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3171467Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3171702Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3171908Z E1204 11:16:21.616000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3172145Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3172388Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3172623Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3172865Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3173111Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3173339Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3173558Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3173773Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3174026Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3174271Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3174505Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3174749Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3174995Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3175243Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3175476Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3175682Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3175919Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3176167Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3176405Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3176646Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3176881Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3177110Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3177331Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3177560Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3177775Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3178017Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3178251Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3178520Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3178753Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3178995Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3179231Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3179482Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3179718Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3179959Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3180228Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3180442Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3180663Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3180872Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3181083Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3181313Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3181533Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3181748Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3181969Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3182176Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3182363Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3182505Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.3182679Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.3182799Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.3182953Z E1204 11:16:21.616000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3183125Z [W1204 11:16:21.079341909 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.3183128Z 2025-12-04T12:10:21.3183288Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3183601Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.3183924Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.3184073Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.3184565Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.3184833Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.3185073Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.3185293Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.3185511Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3185755Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3185995Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3186252Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3186488Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3186728Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3186963Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3187217Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3187468Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3187711Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3187946Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3188163Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3188400Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3188618Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3188861Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3189094Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3189299Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3189533Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3189774Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3190005Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3190248Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3190480Z E1204 
11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3190708Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3190911Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3191142Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3191353Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3191556Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3191823Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3192065Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3192296Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3192536Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3192781Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3192996Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3193219Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3193434Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3193676Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3193908Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3194150Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3194381Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3194622Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3194852Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3195095Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3195338Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3195580Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3195812Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3196052Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3196305Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3196545Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3196778Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3197018Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3197264Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3197507Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3197738Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3197985Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3198220Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3198462Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3198694Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3198908Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3199120Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.3199360Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3199605Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3199846Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3200079Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3200371Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3200621Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3200874Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3201106Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3201347Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3201596Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3201838Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3202073Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3202312Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3202515Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3202747Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3202964Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3203189Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.3203403Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3203646Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3203879Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3204107Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3204310Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3204545Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3204756Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3204970Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3205215Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3205461Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3205697Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3205937Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3206181Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3206395Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3206617Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3206833Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3207077Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3207314Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3207531Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3207749Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3207967Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3208210Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3208447Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3208708Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3208944Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3209186Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3209423Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3209692Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3209926Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3210217Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3210450Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3210681Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3210888Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3211126Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3211345Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3211558Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3211777Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3212021Z 
E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3212256Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3212498Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3212741Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3213000Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3213233Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3213477Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3213711Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3213967Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3214215Z 
E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3214435Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3214651Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3214859Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3215098Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3215315Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3215561Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3215795Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3216011Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3216231Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3216446Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3216689Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3216923Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3217167Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3217408Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3217663Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3217898Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3218140Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3218387Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3218639Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3218876Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3219119Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3219352Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3219608Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3219845Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3220133Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3220368Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3220612Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3220850Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3221094Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3221332Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3221545Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3221754Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3222003Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3222251Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3222488Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3222730Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3222979Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3223233Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3223469Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3223709Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3223961Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3224206Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3224439Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3224648Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3224884Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3225128Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3225457Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3225700Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3225935Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3226161Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3226380Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3226606Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3226822Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3227066Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3227302Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3227545Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3227772Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3227987Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3228201Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3228455Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3228692Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3228909Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3229122Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3229327Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3229492Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3229732Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3229939Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3230204Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3230450Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3230688Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3230925Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3231133Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3231367Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3231579Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3231783Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3232050Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3232258Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3232491Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3232697Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3232945Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3233153Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3233386Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3233630Z E1204 11:16:21.618000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3233863Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3234105Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3234342Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3234585Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3234821Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3235063Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3235299Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3235552Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3235785Z E1204 11:16:21.618000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3235998Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3236204Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3236461Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3236691Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3236909Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3237124Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3237349Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3237597Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3237831Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3238076Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3238311Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3238517Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3238754Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3238997Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3239237Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3239481Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3239718Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3239957Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3240212Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3240426Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3240631Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3240851Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3241106Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3241351Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3241586Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3241841Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3242082Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3242285Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3242520Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3242761Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3242996Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3243242Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3243475Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3243684Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3243918Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3244164Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3244411Z E1204 11:16:21.618000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3244653Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3244888Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3245115Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3245356Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3245570Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3245786Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3246028Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3246273Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3246481Z E1204 11:16:21.618000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3246717Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3246958Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3247190Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3247433Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3247667Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3247895Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3248112Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3248326Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3248543Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3248796Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3249034Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3249279Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3249514Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3249782Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3250017Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3250262Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3250496Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3250758Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3250998Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3251241Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3251481Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3251710Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3251933Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3252150Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3252370Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3252616Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3252851Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3253100Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3253351Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3253596Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3253832Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3254084Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3254349Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3254592Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3254833Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3255047Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3255283Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3255492Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3255706Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3255940Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3256162Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3256386Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3256596Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3256809Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3256994Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3257144Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.3257305Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.3257433Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.3257590Z E1204 11:16:21.618000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3257763Z [W1204 11:16:21.081385132 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.3257765Z 2025-12-04T12:10:21.3257926Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3258236Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.3258548Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.3258721Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.3259221Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.3259492Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.3259743Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.3259970Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.3260221Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3260468Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3260707Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3260950Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3261188Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3261435Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3261674Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3261915Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3262170Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3262414Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3262648Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3262862Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3263099Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3263330Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3263572Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3263809Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3264020Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3264269Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3264516Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3264750Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3264957Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3265190Z E1204 
11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3265405Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3265613Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3265846Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3266060Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3266266Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3266508Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3266761Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3266997Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3267241Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3267476Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3267710Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3267933Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3268153Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3268396Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3268643Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3268892Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3269125Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3269372Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3269605Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3269850Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3270084Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3270362Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3270596Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3270837Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3271073Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3271331Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3271569Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3271809Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3272044Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3272313Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3272544Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3272786Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3273017Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3273273Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3273508Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3273729Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3273941Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.3274183Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3274420Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3274661Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3274895Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3275136Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3275368Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3275626Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3275858Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3276101Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3276333Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3276586Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3276829Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3277041Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3277245Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3277477Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3277700Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3277923Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.3278140Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3278384Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3278618Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3278830Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3279033Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3279267Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3279476Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3279679Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3279915Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3280214Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3280449Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3280689Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3280926Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3281165Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3281389Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3281604Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3281849Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3282097Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3282315Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3282529Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3282743Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3282987Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3283223Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3283471Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3283707Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3283948Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3284183Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3284426Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3284673Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3284915Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3285148Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3285363Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3285589Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3285828Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3286043Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3286257Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3286483Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3286727Z 
E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3286963Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3287204Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3287440Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3287683Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3287920Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3288168Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3288405Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3288649Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3288900Z 
E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3289118Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3289330Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3289538Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3289765Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3290000Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3290295Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3290531Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3290752Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3290991Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3291214Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3291460Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3291695Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3291939Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3292176Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3292424Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3292658Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3292905Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3293145Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3293389Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3293642Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3293884Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3294122Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3294380Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3294630Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3294875Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3295109Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3295355Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3295605Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3295854Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3296092Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3296307Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3296515Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3296751Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3296998Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3297238Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3297483Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3297720Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3297979Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3298215Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3298457Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3298696Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3298949Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3299199Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3299408Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3299643Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3299899Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3300170Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3300418Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3300656Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3300887Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3301108Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3301324Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3301542Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3301785Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3302024Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3302253Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3302488Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3302703Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3302920Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3303164Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3303414Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3303645Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3303860Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3304068Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3304233Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3304481Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3304691Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3304925Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3305169Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3305404Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3305622Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3305832Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3306067Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3306282Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3306485Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3306725Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3306942Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3307178Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3307387Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3307623Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3307845Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3308096Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3308344Z E1204 11:16:21.620000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3308577Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3308835Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3309075Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3309317Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3309554Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3309796Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3310038Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3310320Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3310558Z E1204 11:16:21.620000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3310775Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3310980Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3311220Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3311462Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3311680Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3311893Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3312112Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3312371Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3312620Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3312867Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3313100Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3313319Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3313555Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3313800Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3314038Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3314280Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3314519Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3314748Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3314968Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3315187Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3315394Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3315604Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3315851Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3316094Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3316327Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3316572Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3316821Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3317038Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3317276Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3317521Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3317770Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3318014Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3318253Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3318460Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3318695Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3318940Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3319174Z E1204 11:16:21.620000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3319419Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3319653Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3319885Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3320153Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3320384Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3320603Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3320847Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3321084Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3321304Z E1204 11:16:21.620000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3321555Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3321799Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3322032Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3322291Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3322528Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3322759Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3322975Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3323192Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3323410Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3323653Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3323888Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3324131Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3324366Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3324612Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3324860Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3325067Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3325302Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3325547Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3325811Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3326058Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3326295Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3326526Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3326759Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3326978Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3327200Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3327445Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3327680Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3327923Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3328164Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3328408Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3328643Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3328890Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3329126Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3329382Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3329624Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3329839Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3330059Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3330327Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3330540Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3330768Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3330993Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3331222Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3331433Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3331646Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3331831Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3331980Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.3332140Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.3332266Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.3332407Z E1204 11:16:21.620000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3332485Z ('RERUN', {'yellow': True}) [1.8201s] [100%] 2025-12-04T12:10:21.3332852Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda [W1204 11:16:23.648554717 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.3332858Z 2025-12-04T12:10:21.3333018Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3333330Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.3333641Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.3333804Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.3334292Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.3334565Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.3334832Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.3335053Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.3335270Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3335513Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3335761Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3336007Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3336241Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3336488Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3336721Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3336967Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3337205Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3337451Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3337687Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3337900Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3338138Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3338354Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3338599Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3338834Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3339043Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3339302Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3339547Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3339783Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3339987Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3340274Z E1204 
11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3340488Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3340695Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3340933Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3341147Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3341356Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3341591Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3341835Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3342071Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3342316Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3342554Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3342788Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3343012Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3343227Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3343473Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3343735Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3343982Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3344217Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3344545Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3344798Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3345044Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3345281Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3345524Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3345760Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3346007Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3346241Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3346486Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3346720Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3346967Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3347204Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3347460Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3347696Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3347937Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3348173Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3348435Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3348671Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3348893Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3349109Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.3349365Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3349603Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3349848Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3350080Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3350359Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3350594Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3350841Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3351076Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3351317Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3351552Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3351810Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3352049Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3352264Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3352468Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3352720Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3352946Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3353172Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.3353388Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3353633Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3353880Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3354098Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3354307Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3354545Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3354760Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3354964Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3355203Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3355443Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3355680Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3355924Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3356159Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3356385Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3356608Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3356829Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3357078Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3357341Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3357560Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3357774Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3357994Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3358253Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3358493Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3358736Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3358974Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3359220Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3359461Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3359711Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3359945Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3360222Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3360461Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3360677Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3360902Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3361136Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3361357Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3361571Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3361818Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3362069Z 
E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3362304Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3362550Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3362799Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3363049Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3363282Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3363528Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3363768Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3364012Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3364255Z 
E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3364472Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3364690Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3364898Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3365128Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3365357Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3365601Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3365841Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3366059Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3366296Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3366512Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3366761Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3367000Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3367253Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3367495Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3367736Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3367974Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3368218Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3368459Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3368707Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3368943Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3369194Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3369432Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3369692Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3369926Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3370204Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3370443Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3370698Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3370952Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3371195Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3371438Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3371657Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3371876Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3372116Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3372359Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3372597Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3372840Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3373082Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3373329Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3373563Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3373807Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3374043Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3374305Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3374540Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3374748Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3374986Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3375241Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3375496Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3375739Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3375977Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3376217Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3376441Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3376660Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3376877Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3377123Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3377359Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3377595Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3377812Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3378030Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3378249Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3378495Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3378746Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3378967Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3379183Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3379390Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3379568Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3379820Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3380026Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3380307Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3380551Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3380804Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3381020Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3381232Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3381473Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3381687Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3381898Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3382136Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3382346Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3382581Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3382791Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3383029Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3383247Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3383486Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3383733Z E1204 11:16:23.187000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3383973Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3384230Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3384481Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3384727Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3384961Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3385217Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3385454Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3385699Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3385934Z E1204 11:16:23.187000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3386153Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3386366Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3386602Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3386834Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3387051Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3387269Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3387488Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3387749Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3387988Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3388234Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3388472Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3388691Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3388940Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3389183Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3389422Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3389681Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3389919Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3390191Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3390409Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3390627Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3390835Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3391048Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3391291Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3391534Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3391772Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3392016Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3392271Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3392476Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3392717Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3392962Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3393217Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3393477Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3393715Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3393923Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3394174Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3394419Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3394656Z E1204 11:16:23.187000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3394898Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3395135Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3395364Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3395586Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3395804Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3396023Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3396268Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3396505Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3396726Z E1204 11:16:23.187000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3396961Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3397208Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3397444Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3397699Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3397950Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3398177Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3398398Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3398623Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3398843Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3399089Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3399324Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3399569Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3399805Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3400053Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3400331Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3400541Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3400780Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3401026Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3401281Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3401523Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3401762Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3401990Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3405309Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3405561Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3405779Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3406024Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3406273Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3406522Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3406758Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3407002Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3407236Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3407481Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3407718Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3407960Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3408196Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3408409Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3408629Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3408851Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3409063Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3409380Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3409602Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3409832Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3410050Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3410294Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3410482Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3410625Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.3410800Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.3410922Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.3411067Z E1204 11:16:23.187000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3411239Z [W1204 11:16:23.651106903 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.3411241Z 2025-12-04T12:10:21.3411406Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3411718Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.3412030Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.3412179Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.3412674Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.3412945Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.3413187Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.3413429Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.3413649Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3413893Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3414133Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3414401Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3414635Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3414875Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3415108Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3415359Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3415593Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3415834Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3416066Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3416283Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3416511Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3416729Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3416970Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3417202Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3417408Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3417643Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3417898Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3418129Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3418334Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3418568Z E1204 
11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3418802Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3419006Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3419236Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3419446Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3419657Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3419896Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3420175Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3420406Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3420648Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3420882Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3421095Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3421319Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3421537Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3421777Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3422011Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3422267Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3422498Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3422739Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3422970Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3423238Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3423471Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3423711Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3423945Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3424202Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3424437Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3424676Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3424909Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3425148Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3425381Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3425624Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3425855Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3426101Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3426336Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3426577Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3426821Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3427036Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3427248Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.3427489Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3427745Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3427986Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3428218Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3428460Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3428717Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3428962Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3429194Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3429436Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3429669Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3429911Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3430165Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3430377Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3430579Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3430812Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3431045Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3431268Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.3431483Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3431726Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3431958Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3432204Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3432406Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3432639Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3432850Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3433063Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3433301Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3433547Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3433782Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3434021Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3434256Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3434469Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3434691Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3434909Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3435152Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3435388Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3435615Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3435831Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3436047Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3436289Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3436535Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3436787Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3437023Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3437264Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3437511Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3437755Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3437990Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3438236Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3438470Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3438686Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3438891Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3439126Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3439342Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3439554Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3439771Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3440024Z 
E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3440294Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3440537Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3440776Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3441047Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3441282Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3441523Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3441757Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3442019Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3442255Z 
E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3442474Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3442687Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3442893Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3443122Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3443339Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3443582Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3443815Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3444034Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3444249Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3444479Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3444724Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3444959Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3445203Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3445470Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3445713Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3445947Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3446301Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3446548Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3446792Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3447027Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3447268Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3447501Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3447744Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3447982Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3448225Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3448458Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3448703Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3448954Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3449194Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3449428Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3449641Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3449854Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3450147Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3450394Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3450631Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3450874Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3451121Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3451365Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3451602Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3451843Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3452080Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3452323Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3452558Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3452767Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3453005Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3453248Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3453495Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3453739Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3453975Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3454201Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3454432Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3454657Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3454875Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3455119Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3455357Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3455598Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3455818Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3456034Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3456249Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3456492Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3456729Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3456945Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3457161Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3457366Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3457530Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3457769Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3457987Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3458220Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3458464Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3458699Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3458933Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3459140Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3459373Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3459586Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3459802Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3460041Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3460287Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3460520Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3460724Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3460958Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3461169Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3461404Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3461649Z E1204 11:16:23.190000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3461882Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3462126Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3462373Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3462617Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3462853Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3463096Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3463348Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3463603Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3463837Z E1204 11:16:23.190000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3464052Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3464267Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3464506Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3464735Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3464954Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3465168Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3465384Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3465630Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3465862Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3466105Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3466342Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3466547Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3466793Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3467035Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3467272Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3467515Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3467763Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3468003Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3468221Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3468435Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3468641Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3468857Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3469093Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3469337Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3469572Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3469817Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3470053Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3470288Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3470524Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3470765Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3471006Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3471263Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3471496Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3471700Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3471934Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3472193Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3472440Z E1204 11:16:23.190000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3472684Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3472917Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3473156Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3473374Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3473588Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3473803Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3474043Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3474280Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3474487Z E1204 11:16:23.190000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3474726Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3474970Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3475202Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3475448Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3475692Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3475919Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3476137Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3476350Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3476576Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3476832Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3477068Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3477310Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3477555Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3477798Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3478031Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3478238Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3478471Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3478720Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3478956Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3479196Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3479439Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3480017Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3480565Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3481064Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3481563Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3482061Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3482580Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3483111Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3483641Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3484159Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3484680Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3485227Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3485749Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3486271Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3486790Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3487287Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3487761Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3488225Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3488677Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3489171Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3489663Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3490229Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3490702Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3491243Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3491725Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3492089Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0
2025-12-04T12:10:21.3492428Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0
2025-12-04T12:10:21.3492766Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] .
2025-12-04T12:10:21.3493075Z E1204 11:16:23.190000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice.
2025-12-04T12:10:21.3493424Z [W1204 11:16:23.653291045 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...
2025-12-04T12:10:21.3493633Z
2025-12-04T12:10:21.3493793Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning:
2025-12-04T12:10:21.3494299Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.
2025-12-04T12:10:21.3494998Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.3495486Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.3496165Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.3496949Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.3497490Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.3497986Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.3498457Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3498959Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3499470Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3499982Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3500539Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3501051Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3501563Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3502074Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3502609Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3503119Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3503630Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3504113Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3504596Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3505072Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3505563Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3506075Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3506550Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3507026Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3507538Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3508048Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3508517Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3508990Z E1204 
11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3509472Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3509934Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3510444Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3510921Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3511371Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3511859Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3512384Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3512898Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3513413Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3513944Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3514428Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3514900Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3515377Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3515871Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3516385Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3516901Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3517414Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3517926Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3518436Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3518950Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3519477Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3519992Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3520547Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3521060Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3521601Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3522115Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3522627Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3523140Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3523666Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3524179Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3524690Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3525203Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3525716Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3526235Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3526749Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3527241Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3527707Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.3528196Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3528710Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3529235Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3529748Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3530293Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3530806Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3531347Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3531860Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3532379Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3532894Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3533421Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3533936Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3534424Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3534879Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3535351Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3535832Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3536302Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.3536781Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3537278Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3537794Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3538280Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3538745Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3539217Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3539697Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3540190Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3540675Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3541203Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3541716Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3542231Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3542760Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3543244Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3543714Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3544188Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3544684Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3545202Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3545695Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3546163Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3546627Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3547123Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3547639Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3548168Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3548682Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3549197Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3549711Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3550301Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3550819Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3551336Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3551853Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3552356Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3552820Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3553300Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3553795Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3554266Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3554731Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3555226Z 
E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3555740Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3556255Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3556770Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3557286Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3557813Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3558329Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3558845Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3559358Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3559894Z 
E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3560415Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3560881Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3561335Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3561815Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3562294Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3562787Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3563301Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3563792Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3564265Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3564735Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3565230Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3565746Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3566261Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3566779Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3567317Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3567830Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3568342Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3568855Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3569394Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3569907Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3570447Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3570961Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3571485Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3572001Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3572516Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3573029Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3573540Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3574055Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3574568Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3575084Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3575565Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3576017Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3576493Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3577020Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3577534Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3578053Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3578566Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3579105Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3579622Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3580169Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3580686Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3581212Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3581726Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3582201Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3582678Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3583198Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3583713Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3584231Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3584746Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3585253Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3585739Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3586226Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3586691Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3587185Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3587698Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3588218Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3588714Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3589183Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3589653Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3590179Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3590706Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3591197Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3591667Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3592127Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3592535Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3592972Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3593452Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3593934Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3594452Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3594969Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3595461Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3595934Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3596411Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3596898Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3597352Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3597843Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3598333Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3598810Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3599283Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3599762Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3600287Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3600767Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3601281Z E1204 11:16:23.192000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3601801Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3602315Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3602836Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3603349Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3603866Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3604383Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3604967Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3605505Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3606020Z E1204 11:16:23.192000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3606507Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3606960Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3607452Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3607965Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3608448Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3608915Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3609381Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3609887Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3610441Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3610960Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3611473Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3611956Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3612440Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3612959Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3613480Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3614015Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3614534Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3615047Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3615538Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3616010Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3616467Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3616936Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3617430Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3617947Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3618466Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3618984Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3619514Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3619993Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3620521Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3621039Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3621558Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3622075Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3622592Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3623069Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3623545Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3624063Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3624601Z E1204 11:16:23.192000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3625116Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3625631Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3626134Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3626635Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3627117Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3627582Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3628080Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3628595Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3629085Z E1204 11:16:23.192000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3629563Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3630077Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3630632Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3631150Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3631667Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3632175Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3632659Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3633125Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3633593Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3634109Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3634628Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3635146Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3635664Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3636193Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3636722Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3637199Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3637677Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3638193Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3638722Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3639237Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3639755Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3640295Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3640782Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3641254Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3641725Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3642223Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3642738Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3643253Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3643785Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3644301Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3644816Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3645331Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3645861Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3646387Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3646901Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3647387Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3647866Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3648328Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3648784Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3649260Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3649748Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3650264Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3650722Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3651177Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3651609Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3651977Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.3652320Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.3652640Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.3652937Z E1204 11:16:23.192000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3653303Z [W1204 11:16:23.695613573 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.3653512Z 2025-12-04T12:10:21.3653671Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3654179Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.3654854Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.3655356Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.3656045Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.3656833Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.3657396Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.3657898Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.3658372Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3658867Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3659383Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3659902Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3660457Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3660972Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3661482Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3661996Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3662514Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3663044Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3663555Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3664037Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3664511Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3665013Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3665512Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3666025Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3666501Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3666987Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3667502Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3668017Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3668494Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3668968Z E1204 
11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3669450Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3669908Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3670427Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3670913Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3671362Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3671839Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3672368Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3672883Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3673401Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3673918Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3674423Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3674911Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3675390Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3675884Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3676403Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3676930Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3677444Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3677958Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3678475Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3678991Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3679510Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3680024Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3680580Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3681091Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3681603Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3682136Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3682648Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3683160Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3683673Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3684214Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3684729Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3685243Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3685756Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3686285Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3686803Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3687290Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3687752Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.3688237Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3688749Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3689261Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3689771Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3690315Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3690824Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3691335Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3691860Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3692373Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3692889Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3693399Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3693944Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3694428Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3694879Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3695353Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3695848Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3696324Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.3696796Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3697289Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3697799Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3698282Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3698733Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3699205Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3699687Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3700244Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3700718Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3701244Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3701754Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3702264Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3702773Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3703266Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3703747Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3704222Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3704722Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3705254Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3705744Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3706212Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3706679Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3707173Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3707688Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3708203Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3708719Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3709236Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3709750Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3710303Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3710833Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3711345Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3711858Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3712343Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3712812Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3713300Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3713788Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3714256Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3714721Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3719348Z 
E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3719877Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3720439Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3720955Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3721473Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3721994Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3722512Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3723031Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3723543Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3724059Z 
E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3724577Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3725049Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3725507Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3725974Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3726470Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3726982Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3727500Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3727991Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3728458Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3728943Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3729439Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3729957Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3730524Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3731043Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3731558Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3732074Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3732588Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3733105Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3733620Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3734151Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3734669Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3735187Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3735430Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3735701Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3735944Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3736180Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3736424Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3736673Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3736918Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3737152Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3737369Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3737576Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3737814Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3738057Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3738296Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3738541Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3738775Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3739021Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3739267Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3739509Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3739745Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3739993Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3740315Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3740521Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3740756Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3740997Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3741247Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3741492Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3741725Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3741954Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3742172Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3742393Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3742611Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3742855Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3743090Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3743317Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3743538Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3743763Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3743980Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3744222Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3744460Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3744703Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3744921Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3745132Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3745298Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3745550Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3745756Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3745993Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3746239Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3746478Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3746693Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3746899Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3747137Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3747353Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3747559Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3747795Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3748015Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3748252Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3748458Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3748696Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3748912Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3749160Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3749403Z E1204 11:16:23.234000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3749640Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3749885Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3750173Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3750419Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3750653Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3750896Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3751134Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3751380Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3751617Z E1204 11:16:23.234000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3751830Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3752041Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3752280Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3752531Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3752749Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3752964Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3753181Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3753423Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3753684Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3753927Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3754165Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3754371Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3754619Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3754864Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3755097Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3755342Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3755577Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3755806Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3756024Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3756239Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3756447Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3756653Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3756894Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3757152Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3757389Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3757633Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3757882Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3758102Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3758337Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3758581Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3758814Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3759075Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3759315Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3759528Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3759766Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3760008Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3760301Z E1204 11:16:23.234000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3760546Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3760783Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3761013Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3761233Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3761470Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3761687Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3761936Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3762170Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3762395Z E1204 11:16:23.234000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3762644Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3762887Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3763123Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3763366Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3763617Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3763848Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3764068Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3764287Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3764503Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3764750Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3764985Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3765229Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3765466Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3765709Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3765973Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3766178Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3766414Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3766657Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3766908Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3767162Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3767398Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3767627Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3767843Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3768076Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3768295Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3768539Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3768775Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3769017Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3769258Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3769501Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3769738Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3769978Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3770269Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3770530Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3770765Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3770981Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3771198Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3771423Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3771650Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3771883Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3772106Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3773016Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3774064Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3774299Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3774494Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3774634Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.3774807Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.3774925Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.3775067Z E1204 11:16:23.234000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3775250Z [W1204 11:16:23.697682466 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.3775255Z 2025-12-04T12:10:21.3775412Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3775742Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.3776061Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.3776210Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.3776815Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.3777089Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.3777331Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.3777583Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.3777829Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3778061Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3778289Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3778534Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3778757Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3778991Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3779211Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3779443Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3779663Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3779898Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3780179Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3780379Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3780594Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3780802Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3781058Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3781279Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3781476Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3781700Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3781949Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3782192Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3782382Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3782605Z E1204 
11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3782803Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3783011Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3783243Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3783440Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3783632Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3783855Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3784090Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3784310Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3784540Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3784763Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3784962Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3785187Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3785388Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3785621Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3785841Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3786084Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3786316Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3786545Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3786767Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3786994Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3787227Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3787457Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3787682Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3787915Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3788136Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3788369Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3788588Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3788819Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3789038Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3789271Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3789505Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3789733Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3789958Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3790234Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3790507Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3790715Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3790913Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.3791145Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3791388Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3791622Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3791841Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3792071Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3792297Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3792531Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3792755Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3792982Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3793204Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3793430Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3793654Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3793870Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3794059Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3794282Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3794479Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3794705Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.3794916Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3795150Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3795372Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3795579Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3795775Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3795994Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3796194Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3796384Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3796722Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3796956Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3797178Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3797434Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3797657Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3797864Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3798085Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3798293Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3798531Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3798756Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3798980Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3799193Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3799406Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3799639Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3799868Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3800159Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3800384Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3800618Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3800841Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3801077Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3801301Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3801535Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3801767Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3801970Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3802168Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3802402Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3802610Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3802812Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3803018Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3803269Z 
E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3803504Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3803741Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3803965Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3804200Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3804432Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3804665Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3804890Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3805118Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3805343Z 
E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3805548Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3805752Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3805945Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3806160Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3806370Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3806623Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3806847Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3807050Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3807252Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3807464Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3807707Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3807931Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3808160Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3808383Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3808625Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3808855Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3809083Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3809309Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3809546Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3809772Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3810009Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3810271Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3810507Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3810738Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3810985Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3811218Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3811448Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3811678Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3811934Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3812164Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3812371Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3812564Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3812805Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3813038Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3813267Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3813502Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3813727Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3813960Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3814182Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3814413Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3814635Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3814867Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3815089Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3815295Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3815519Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3815752Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3815979Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3816229Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3816455Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3816670Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3816877Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3817089Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3817294Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3817527Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3817748Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3817968Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3818174Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3818379Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3818584Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3818813Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3819037Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3819242Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3819457Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3819651Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3819804Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3820029Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3820288Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3820530Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3820761Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3820987Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3821187Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3821400Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3821627Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3821826Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3822020Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3822242Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3822439Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3822665Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3822859Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3823084Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3823275Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3823502Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3823750Z E1204 11:16:23.236000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3823975Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3824204Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3824430Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3824686Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3824908Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3825146Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3825368Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3825615Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3825846Z E1204 11:16:23.236000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3826044Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3826241Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3826463Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3826688Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3826894Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3827103Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3827313Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3827554Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3827784Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3828024Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3828253Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3828445Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3828677Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3828932Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3829155Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3829389Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3829613Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3829846Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3830053Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3830327Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3830526Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3830718Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3830948Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3831180Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3831408Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3831640Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3831870Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3832071Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3832313Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3832547Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3832771Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3833009Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3833257Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3833455Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3833683Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3833913Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3834153Z E1204 11:16:23.236000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3834389Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3834617Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3834835Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3835045Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3835256Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3835463Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3835699Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3835922Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3836117Z E1204 11:16:23.236000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3836343Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3836590Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3836820Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3837051Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3837279Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3837520Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3837729Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3837930Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3838138Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3838383Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3838607Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3838843Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3839069Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3839304Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3839528Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3839727Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3839953Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3840223Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3840451Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3840695Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3840923Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3841142Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3841353Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3841577Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3841940Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3842180Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3842404Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3842637Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3842878Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3843111Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3843341Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3843571Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3843800Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3844033Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3844260Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3844467Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3844674Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3844873Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3845085Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3845304Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3845516Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3845720Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3845922Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3846141Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3846323Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3846453Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.3846606Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.3846715Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.3846851Z E1204 11:16:23.236000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3847022Z [W1204 11:16:23.699736789 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.3847026Z 2025-12-04T12:10:21.3847179Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3847483Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.3847781Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.3847921Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.3848416Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.3848681Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.3848914Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.3849124Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.3849343Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3849574Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3849801Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3850031Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3850313Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3850563Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3850787Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3851024Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3851255Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3851492Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3851715Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3851920Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3852137Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3852340Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3852575Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3852797Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3852996Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3853221Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3853457Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3853700Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3853892Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3854121Z E1204 
11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3854321Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3854537Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3854766Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3854967Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3855162Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3855385Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3855637Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3855860Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3856092Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3856311Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3856515Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3856733Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3856937Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3857172Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3857394Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3857631Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3857865Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3858098Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3858324Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3858554Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3858793Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3859033Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3859257Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3859487Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3859713Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3859956Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3860217Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3860451Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3860671Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3860907Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3861130Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3861362Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3861588Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3861817Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3862046Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3862263Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3862467Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.3862697Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3862925Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3863173Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3863407Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3863640Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3863860Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3864137Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3864361Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3864595Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3864820Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3865050Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3865279Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3865480Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3865677Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3865904Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3866103Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3866318Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.3866533Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3866766Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3866986Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3867192Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3867393Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3867629Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3867833Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3868022Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3868246Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3868486Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3868714Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3868943Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3869168Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3869373Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3869590Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3869798Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3870032Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3870296Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3870501Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3870730Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3870940Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3871171Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3871399Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3871642Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3871886Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3872125Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3872349Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3872586Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3872823Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3873061Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3873283Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3873489Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3873687Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3873911Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3874124Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3874327Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3874533Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3874765Z 
E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3875005Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3875242Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3875464Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3875699Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3875932Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3876178Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3876405Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3876642Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3876870Z 
E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3877086Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3877295Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3877489Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3877708Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.3877909Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3878146Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3878375Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3878579Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3878787Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3878991Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3879238Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3879460Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3879694Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3879922Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3880200Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3880440Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3880670Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3880897Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3881128Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3881368Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3881605Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3881827Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3882062Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3882285Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3882522Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3882749Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3882978Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3883205Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3883438Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3883679Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3883878Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3884077Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3884304Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3884547Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3884784Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3885014Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3885240Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3885484Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3885715Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3885952Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3886174Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3886409Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3886631Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3886830Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3887051Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3887284Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3887511Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3887740Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3887981Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3888199Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3888408Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3888609Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3888835Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3889071Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3889294Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3889513Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3889733Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3889939Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3890193Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3890427Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3890651Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3890856Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3891061Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.3891255Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3891407Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.3891629Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3891825Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3892065Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3892293Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3892518Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3892719Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3892927Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3893160Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3893362Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3893554Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3893775Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3893984Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3894206Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3894400Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3894620Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3894813Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3895041Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3895271Z E1204 11:16:23.238000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3895496Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3895723Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3895947Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3896177Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3896410Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3896641Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3896860Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3897092Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3897341Z E1204 11:16:23.238000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3897545Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.3897735Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3897958Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3898186Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3898391Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3898595Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3898795Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3899027Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3899249Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3899484Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3899710Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3899901Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3900159Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3900389Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3900626Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3900854Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3901078Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3901297Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3901530Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3901737Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3901931Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.3902126Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3902359Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3902594Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3905148Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3905383Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3906419Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.3906618Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3906844Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3907103Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3907326Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3907564Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3907785Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3907990Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3908216Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3908444Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3908668Z E1204 11:16:23.238000 
858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3908908Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3909133Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3909347Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3909556Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3909763Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3909966Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3910340Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3910561Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3910754Z E1204 11:16:23.238000 858404 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3911000Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3911232Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3911460Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3911687Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3911914Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3912132Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3912356Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3912559Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3912767Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3913004Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3913244Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3913480Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3913702Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3913937Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3914165Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3914360Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.3914607Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3914842Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3915082Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3915314Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3915543Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3915765Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.3915969Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.3916177Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.3916381Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.3916631Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.3916858Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3917094Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3917322Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3917563Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3917791Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3918022Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3918249Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3918480Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.3918708Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.3918927Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.3919135Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.3919357Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.3919557Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.3919782Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.3919993Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.3920240Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.3920440Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.3920637Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.3920833Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.3920963Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.3921118Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.3921228Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.3921363Z E1204 11:16:23.238000 858404 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3921424Z FAILED [1.6261s] [100%] 2025-12-04T12:10:21.3921428Z 2025-12-04T12:10:21.3921501Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.3921666Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.3921726Z Traceback (most recent call last): 2025-12-04T12:10:21.3921899Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.3921953Z method(*args, **kwargs) 2025-12-04T12:10:21.3922116Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.3922162Z method(*args, **kwargs) 2025-12-04T12:10:21.3922321Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.3922364Z with policy(): 2025-12-04T12:10:21.3922527Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.3922575Z raise RuntimeError(msg) 2025-12-04T12:10:21.3922995Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. 
CUDA driver allocated memory was 807403520 and is now 1954545664. 2025-12-04T12:10:21.3922997Z 2025-12-04T12:10:21.3923099Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.3923383Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.3923385Z 2025-12-04T12:10:21.3923499Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.3923595Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.3923646Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.3923719Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.3924283Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.3924395Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.3924444Z graph_break [] 2025-12-04T12:10:21.3924518Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.3924607Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.3925108Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.3925170Z current_size = base.storage().size() 2025-12-04T12:10:21.3925216Z Autotune Choices Stats: 2025-12-04T12:10:21.3925606Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00872000027447939, "best_triton_pos": 0} 2025-12-04T12:10:21.3925698Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.3925756Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.3925892Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.3926139Z triton_mm_34 0.0087 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3926193Z _scaled_mm 0.0094 ms 93.2% 2025-12-04T12:10:21.3926430Z triton_mm_33 0.0096 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3926666Z triton_mm_30 0.0112 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3926895Z triton_mm_22 0.0113 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.3927138Z triton_mm_16 0.0113 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3927371Z triton_mm_29 0.0114 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3927616Z triton_mm_23 0.0123 ms 71.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3927855Z triton_mm_15 0.0123 ms 70.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3928083Z triton_mm_21 0.0125 ms 69.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3928224Z SingleProcess AUTOTUNE benchmarking takes 0.1795 seconds and 9.0352 seconds precompiling for 33 choices 2025-12-04T12:10:21.3928391Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.3928441Z Traceback (most recent call last): 2025-12-04T12:10:21.3928606Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.3928660Z method(*args, **kwargs) 2025-12-04T12:10:21.3928819Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.3928862Z method(*args, **kwargs) 2025-12-04T12:10:21.3929019Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.3929060Z with policy(): 2025-12-04T12:10:21.3929221Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.3929267Z raise RuntimeError(msg) 2025-12-04T12:10:21.3929681Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:21.3929695Z 2025-12-04T12:10:21.3929778Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.3930058Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.3930060Z 2025-12-04T12:10:21.3930204Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.3930288Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.3930341Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.3930405Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.3930971Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.3931078Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.3931125Z graph_break [] 2025-12-04T12:10:21.3931207Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.3931290Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.3931784Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.3931843Z current_size = base.storage().size() 2025-12-04T12:10:21.3931888Z Autotune Choices Stats: 2025-12-04T12:10:21.3932267Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00872000027447939, "best_triton_pos": 0} 2025-12-04T12:10:21.3932342Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.3932396Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.3932528Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.3932768Z triton_mm_34 0.0087 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3932833Z _scaled_mm 0.0094 ms 93.2% 
2025-12-04T12:10:21.3933064Z triton_mm_33 0.0096 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3933301Z triton_mm_30 0.0112 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3933536Z triton_mm_22 0.0113 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3933779Z triton_mm_16 0.0113 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3934013Z triton_mm_29 0.0114 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3934243Z triton_mm_23 0.0123 ms 71.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3934479Z triton_mm_15 0.0123 ms 70.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3934708Z triton_mm_21 0.0125 ms 69.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3934847Z SingleProcess AUTOTUNE benchmarking takes 0.1795 seconds and 9.0352 seconds precompiling for 33 choices 2025-12-04T12:10:21.3934930Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.3934976Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.3935058Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.3935164Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.3935669Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.3935713Z graph_break [] 2025-12-04T12:10:21.3935785Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.3935860Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.3936230Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:21.3936330Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.3936379Z Autotune Choices Stats: 2025-12-04T12:10:21.3936759Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008840000256896019, "best_triton_pos": 0} 2025-12-04T12:10:21.3936840Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.3936899Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.3937024Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.3937266Z triton_mm_72 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3937500Z triton_mm_71 0.0093 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3937560Z _scaled_mm 0.0093 ms 94.8% 2025-12-04T12:10:21.3937789Z triton_mm_67 0.0114 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3938023Z triton_mm_60 0.0114 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3938255Z triton_mm_68 0.0115 ms 76.7% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3938484Z triton_mm_59 0.0116 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3938722Z triton_mm_54 0.0117 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3938972Z triton_mm_61 0.0122 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3939202Z triton_mm_53 0.0124 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3939341Z SingleProcess AUTOTUNE benchmarking takes 0.2603 seconds and 0.8009 seconds precompiling for 39 choices 2025-12-04T12:10:21.3939411Z =================================== FAILURES =================================== 2025-12-04T12:10:21.3939579Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.3939631Z Traceback (most recent call last): 2025-12-04T12:10:21.3939796Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.3939842Z method(*args, **kwargs) 2025-12-04T12:10:21.3940001Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.3940050Z method(*args, **kwargs) 2025-12-04T12:10:21.3940242Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.3940287Z with policy(): 2025-12-04T12:10:21.3940444Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.3940495Z raise RuntimeError(msg) 2025-12-04T12:10:21.3940923Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:21.3940926Z 2025-12-04T12:10:21.3941010Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.3941282Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.3941284Z 2025-12-04T12:10:21.3941382Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.3941476Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.3941526Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.3941586Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.3942143Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.3942250Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.3942290Z graph_break [] 2025-12-04T12:10:21.3942361Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.3942440Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.3942928Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.3942981Z current_size = base.storage().size() 2025-12-04T12:10:21.3943041Z Autotune Choices Stats: 2025-12-04T12:10:21.3943412Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00872000027447939, "best_triton_pos": 0} 2025-12-04T12:10:21.3943498Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.3943558Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.3943683Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.3943927Z triton_mm_34 0.0087 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3943971Z _scaled_mm 0.0094 ms 93.2% 2025-12-04T12:10:21.3944207Z triton_mm_33 0.0096 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3944439Z triton_mm_30 0.0112 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3944674Z triton_mm_22 0.0113 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3944921Z triton_mm_16 0.0113 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3945149Z triton_mm_29 0.0114 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3945382Z triton_mm_23 0.0123 ms 71.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3945622Z triton_mm_15 0.0123 ms 70.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3945855Z triton_mm_21 0.0125 ms 69.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3945987Z SingleProcess AUTOTUNE benchmarking takes 0.1795 seconds and 9.0352 seconds precompiling for 33 choices 2025-12-04T12:10:21.3946070Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.3946114Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.3946180Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.3946284Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.3946772Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.3946819Z graph_break [] 2025-12-04T12:10:21.3946899Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.3946986Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.3947363Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:21.3947467Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.3947511Z Autotune Choices Stats: 2025-12-04T12:10:21.3947888Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008840000256896019, "best_triton_pos": 0} 2025-12-04T12:10:21.3947962Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.3948016Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.3948147Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.3948384Z triton_mm_72 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3948632Z triton_mm_71 0.0093 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3948675Z _scaled_mm 0.0093 ms 94.8% 2025-12-04T12:10:21.3948908Z triton_mm_67 0.0114 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3949137Z triton_mm_60 0.0114 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3949369Z triton_mm_68 0.0115 ms 76.7% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3949617Z triton_mm_59 0.0116 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3949849Z triton_mm_54 0.0117 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3950086Z triton_mm_61 0.0122 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3950353Z triton_mm_53 0.0124 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3950489Z SingleProcess AUTOTUNE benchmarking takes 0.2603 seconds and 0.8009 seconds precompiling for 39 choices 2025-12-04T12:10:21.3950568Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.3950618Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.3950678Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.3950804Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.3951300Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.3951342Z graph_break [] 2025-12-04T12:10:21.3951418Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.3951497Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.3951548Z Autotune Choices Stats: 2025-12-04T12:10:21.3951919Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_110", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008559999987483025, "best_triton_pos": 0} 2025-12-04T12:10:21.3951991Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.3952046Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.3952178Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.3952420Z triton_mm_110 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3952482Z _scaled_mm 0.0092 ms 92.6% 2025-12-04T12:10:21.3952721Z triton_mm_109 0.0098 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3952950Z triton_mm_105 0.0111 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=2 2025-12-04T12:10:21.3953189Z triton_mm_92 0.0114 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3953430Z triton_mm_98 0.0114 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3953665Z triton_mm_106 0.0117 ms 73.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3953892Z triton_mm_97 0.0118 ms 72.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3954216Z triton_mm_99 0.0119 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3954452Z triton_mm_91 0.0120 ms 71.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3954584Z SingleProcess AUTOTUNE benchmarking takes 0.2738 seconds and 0.6399 seconds precompiling for 39 choices 2025-12-04T12:10:21.3954793Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-20658da356ef03da.xml - 2025-12-04T12:10:21.3954862Z =========================== short test summary info ============================ 2025-12-04T12:10:21.3955500Z FAILED 
[1.6261s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:21.3955505Z 2025-12-04T12:10:21.3955589Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.3955862Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.3955864Z 2025-12-04T12:10:21.3955960Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.3956027Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.3956111Z ================= 1 failed, 187 deselected, 2 rerun in 15.22s ================== 2025-12-04T12:10:21.3956154Z Got exit code 1 2025-12-04T12:10:21.3956381Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.3956523Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.3956674Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-bae73f6a5e89f809.xml 2025-12-04T12:10:21.3956738Z ============================= test session starts ============================== 2025-12-04T12:10:21.3956865Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.3956910Z cachedir: .pytest_cache 2025-12-04T12:10:21.3957075Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.3957140Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.3957187Z configfile: pytest.ini 2025-12-04T12:10:21.3957360Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.3957443Z collecting ... collected 188 items / 110 deselected / 78 selected 2025-12-04T12:10:21.3957507Z stepcurrent: skipping 110 already run items. 2025-12-04T12:10:21.3957556Z Running 78 items in this shard 2025-12-04T12:10:21.3957559Z 2025-12-04T12:10:21.3958486Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/tmpnam_gdpr/pk/cpkwh7kldh4znmktanpop3o2j6rkmavnhifjlnwie2xbt56zzixl.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.3958640Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.3958876Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.3959040Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.3959339Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.3959484Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.3959746Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.3959894Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.3960184Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in 
precompile 2025-12-04T12:10:21.3960348Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.3960624Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.3960775Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.3961059Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.3961257Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.3961582Z E1204 11:16:53.748000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.3962329Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpnam_gdpr/2q/c2q3bo3nhpj55xwdb4yaafdidijudkebr3goyjvcwdqwopfuldps.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.3962480Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.3962701Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.3962861Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.3963156Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.3963301Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.3963563Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.3963711Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.3963980Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.3964144Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.3964413Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.3964552Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.3964829Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.3965031Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.3965369Z E1204 11:16:53.843000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.3966095Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpnam_gdpr/sy/csyk6cjft33q2vmdakuxjvkuee6knwuff45z42j4qfh4l64sdxry.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.3966260Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.3966480Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.3966639Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.3966929Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.3967061Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.3967326Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.3967468Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.3967738Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.3967897Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] 
[0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.3968178Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.3968324Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.3968600Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.3968803Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.3969115Z E1204 11:16:53.852000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.3969834Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpnam_gdpr/r5/cr5ogh5aq2rbxtc3f74mkfjrfhlrj5e435w2lis7izdqcqqjijwt.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.3970001Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.3970268Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.3970433Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.3970736Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.3970874Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.3971135Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.3971277Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.3971538Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.3971696Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.3971974Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.3972120Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.3972401Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.3972605Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.3972925Z E1204 11:16:53.853000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.3973652Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpnam_gdpr/37/c37wm22win6ziqpocp4qlhkovxjslc37imuhelrpgvr5yu7mrpl5.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.3973801Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.3974022Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.3974193Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.3974479Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.3974617Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.3974873Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.3975023Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.3975279Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.3975444Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.3975714Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.3975855Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.3976135Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.3976330Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.3976952Z E1204 11:16:53.854000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.3977683Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
for benchmark choice TritonTemplateCaller(/tmp/tmpnam_gdpr/rg/crglecrd3vrrxbs2dqzmtlex2mzj6sygkctp3cl4xgr2yzt7yheh.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.3977836Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.3978061Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.3978218Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.3978509Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.3978654Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.3978915Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.3979059Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.3979313Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.3979475Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.3979753Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.3979896Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.3980221Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.3980420Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.3980739Z E1204 11:16:53.856000 864328 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.3980801Z ('RERUN', {'yellow': True}) [21.9780s] [ 1%] 2025-12-04T12:10:21.3981142Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda E1204 11:16:56.026000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3981452Z E1204 11:16:56.026000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3981587Z E1204 11:16:56.026000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:21.3981751Z E1204 11:16:56.029000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3982052Z E1204 11:16:56.029000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3982183Z E1204 11:16:56.029000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3982334Z E1204 11:16:56.031000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3982633Z E1204 11:16:56.031000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3982761Z E1204 11:16:56.031000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3982913Z E1204 11:16:56.094000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3983222Z E1204 11:16:56.094000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3983357Z E1204 11:16:56.094000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:21.3983501Z E1204 11:16:56.096000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3983800Z E1204 11:16:56.096000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3983948Z E1204 11:16:56.096000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3984093Z E1204 11:16:56.098000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3984390Z E1204 11:16:56.098000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3984519Z E1204 11:16:56.098000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3984583Z ('RERUN', {'yellow': True}) [1.8360s] [ 1%] 2025-12-04T12:10:21.3984914Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda E1204 11:16:57.676000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3985214Z E1204 11:16:57.676000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3985345Z E1204 11:16:57.676000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:21.3985503Z E1204 11:16:57.678000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3985802Z E1204 11:16:57.678000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3985940Z E1204 11:16:57.678000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3986088Z E1204 11:16:57.681000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3986382Z E1204 11:16:57.681000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3986516Z E1204 11:16:57.681000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3986661Z E1204 11:16:57.722000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3986960Z E1204 11:16:57.722000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3987092Z E1204 11:16:57.722000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:21.3987247Z E1204 11:16:57.724000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3987551Z E1204 11:16:57.724000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3987684Z E1204 11:16:57.724000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.3987834Z E1204 11:16:57.726000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.3988127Z E1204 11:16:57.726000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.. 2025-12-04T12:10:21.3988272Z E1204 11:16:57.726000 864328 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 
2025-12-04T12:10:21.3988317Z FAILED [2.3655s] [ 1%] 2025-12-04T12:10:21.3988320Z 2025-12-04T12:10:21.3988382Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.3988543Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.3988601Z Traceback (most recent call last): 2025-12-04T12:10:21.3988769Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.3988814Z method(*args, **kwargs) 2025-12-04T12:10:21.3988975Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.3989021Z method(*args, **kwargs) 2025-12-04T12:10:21.3989178Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.3989220Z with policy(): 2025-12-04T12:10:21.3989381Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.3989428Z raise RuntimeError(msg) 2025-12-04T12:10:21.3989848Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1954545664. 
2025-12-04T12:10:21.3989851Z 2025-12-04T12:10:21.3989932Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.3990259Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.3990262Z 2025-12-04T12:10:21.3990360Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.3990438Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.3990492Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.3990557Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.3991123Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.3991232Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.3991291Z graph_break [] 2025-12-04T12:10:21.3991361Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.3991446Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.3991933Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.3991991Z current_size = base.storage().size() 2025-12-04T12:10:21.3992036Z Autotune Choices Stats: 2025-12-04T12:10:21.3992413Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008840000256896019, "best_triton_pos": 0} 2025-12-04T12:10:21.3992508Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.3992562Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.3992701Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.3992940Z triton_mm_34 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3993180Z triton_mm_33 0.0092 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3993227Z _scaled_mm 0.0096 ms 92.5% 2025-12-04T12:10:21.3993457Z triton_mm_16 0.0108 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3993699Z triton_mm_30 0.0109 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3993934Z 
triton_mm_22 0.0113 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3994175Z triton_mm_29 0.0114 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3994402Z triton_mm_21 0.0115 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.3994634Z triton_mm_23 0.0122 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3994860Z triton_mm_15 0.0126 ms 70.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3995001Z SingleProcess AUTOTUNE benchmarking takes 0.1752 seconds and 1.1050 seconds precompiling for 33 choices 2025-12-04T12:10:21.3995165Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.3995226Z Traceback (most recent call last): 2025-12-04T12:10:21.3995392Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.3995436Z method(*args, **kwargs) 2025-12-04T12:10:21.3995596Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.3995638Z 
method(*args, **kwargs) 2025-12-04T12:10:21.3995797Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.3995839Z with policy(): 2025-12-04T12:10:21.3995999Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.3996055Z raise RuntimeError(msg) 2025-12-04T12:10:21.3996469Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:21.3996472Z 2025-12-04T12:10:21.3996552Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.3996829Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.3996831Z 2025-12-04T12:10:21.3996935Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.3997017Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.3997069Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.3997134Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.3997706Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), 
('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.3997812Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.3997859Z graph_break [] 2025-12-04T12:10:21.3997929Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.3998020Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.3998506Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.3998564Z current_size = base.storage().size() 2025-12-04T12:10:21.3998615Z Autotune Choices Stats: 2025-12-04T12:10:21.3998985Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008840000256896019, "best_triton_pos": 0} 2025-12-04T12:10:21.3999057Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.3999113Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.3999245Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.3999492Z triton_mm_34 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3999728Z triton_mm_33 0.0092 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, 
BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.3999773Z _scaled_mm 0.0096 ms 92.5% 2025-12-04T12:10:21.4000005Z triton_mm_16 0.0108 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4000282Z triton_mm_30 0.0109 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4000507Z triton_mm_22 0.0113 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4000736Z triton_mm_29 0.0114 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4000959Z triton_mm_21 0.0115 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4001192Z triton_mm_23 0.0122 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4001427Z triton_mm_15 0.0126 ms 70.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.4001573Z SingleProcess AUTOTUNE benchmarking takes 0.1752 seconds and 1.1050 seconds precompiling for 33 choices 2025-12-04T12:10:21.4006321Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.4006382Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.4006445Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.4006601Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.4007091Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.4007130Z graph_break [] 2025-12-04T12:10:21.4007199Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.4007273Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.4007315Z Autotune Choices Stats: 2025-12-04T12:10:21.4007680Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00875999964773655, "best_triton_pos": 0} 2025-12-04T12:10:21.4007767Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.4007818Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.4007943Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.4008181Z triton_mm_72 0.0088 ms 100.0% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4008223Z _scaled_mm 0.0094 ms 92.8% 2025-12-04T12:10:21.4008448Z triton_mm_67 0.0108 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4008690Z triton_mm_60 0.0114 ms 76.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4008912Z triton_mm_59 0.0117 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4009135Z triton_mm_68 0.0121 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4009362Z triton_mm_61 0.0121 ms 72.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4009588Z triton_mm_53 0.0122 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4009814Z triton_mm_71 0.0126 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4010047Z triton_mm_54 0.0128 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4010221Z SingleProcess AUTOTUNE benchmarking takes 0.2728 seconds and 0.8097 seconds precompiling for 39 choices 2025-12-04T12:10:21.4010280Z =================================== FAILURES =================================== 2025-12-04T12:10:21.4010451Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.4010502Z Traceback (most recent call last): 2025-12-04T12:10:21.4010661Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.4010705Z method(*args, **kwargs) 2025-12-04T12:10:21.4010857Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.4010902Z method(*args, **kwargs) 2025-12-04T12:10:21.4011051Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.4011088Z with policy(): 2025-12-04T12:10:21.4011239Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.4011279Z raise RuntimeError(msg) 2025-12-04T12:10:21.4011689Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3007315968. 
2025-12-04T12:10:21.4011706Z 2025-12-04T12:10:21.4011782Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.4012055Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.4012057Z 2025-12-04T12:10:21.4012143Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.4012221Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.4012278Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.4012337Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.4012890Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.4012994Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.4013032Z graph_break [] 2025-12-04T12:10:21.4013096Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.4013173Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.4013655Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.4013712Z current_size = base.storage().size() 2025-12-04T12:10:21.4013752Z Autotune Choices Stats: 2025-12-04T12:10:21.4014128Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008840000256896019, "best_triton_pos": 0} 2025-12-04T12:10:21.4014200Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.4014251Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.4014391Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.4014630Z triton_mm_34 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4014860Z triton_mm_33 0.0092 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4014901Z _scaled_mm 0.0096 ms 92.5% 2025-12-04T12:10:21.4015128Z triton_mm_16 0.0108 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4015353Z triton_mm_30 0.0109 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4015589Z 
triton_mm_22 0.0113 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4015811Z triton_mm_29 0.0114 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4016032Z triton_mm_21 0.0115 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4016259Z triton_mm_23 0.0122 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4016501Z triton_mm_15 0.0126 ms 70.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4016631Z SingleProcess AUTOTUNE benchmarking takes 0.1752 seconds and 1.1050 seconds precompiling for 33 choices 2025-12-04T12:10:21.4016709Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.4016753Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.4016814Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.4016917Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.4017402Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 
38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.4017441Z graph_break [] 2025-12-04T12:10:21.4017507Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.4017590Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.4017631Z Autotune Choices Stats: 2025-12-04T12:10:21.4018003Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00875999964773655, "best_triton_pos": 0} 2025-12-04T12:10:21.4018073Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.4018124Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.4018247Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.4018479Z triton_mm_72 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4018521Z _scaled_mm 0.0094 ms 92.8% 2025-12-04T12:10:21.4018745Z triton_mm_67 0.0108 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4018970Z triton_mm_60 0.0114 ms 76.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.4019205Z triton_mm_59 0.0117 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4019427Z triton_mm_68 0.0121 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4019653Z triton_mm_61 0.0121 ms 72.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4019881Z triton_mm_53 0.0122 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4020155Z triton_mm_71 0.0126 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4020382Z triton_mm_54 0.0128 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4020511Z SingleProcess AUTOTUNE benchmarking takes 0.2728 seconds and 0.8097 seconds precompiling for 39 choices 2025-12-04T12:10:21.4020586Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.4020629Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.4020688Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.4020787Z aot_autograd [('total', 1), ('autograd_cache_miss', 
1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.4021269Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.4021325Z graph_break [] 2025-12-04T12:10:21.4021391Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.4021465Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.4021505Z Autotune Choices Stats: 2025-12-04T12:10:21.4021883Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_110", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00860000029206276, "best_triton_pos": 0} 2025-12-04T12:10:21.4021952Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.4022003Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.4022127Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.4022361Z triton_mm_110 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4022589Z triton_mm_109 0.0098 ms 87.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.4022816Z triton_mm_92 0.0113 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4023053Z triton_mm_97 0.0115 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4023275Z triton_mm_98 0.0117 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4023500Z triton_mm_105 0.0118 ms 73.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4023736Z triton_mm_99 0.0122 ms 70.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4023962Z triton_mm_91 0.0124 ms 69.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4024186Z triton_mm_106 0.0125 ms 68.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4024413Z triton_mm_107 0.0130 ms 66.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4024541Z SingleProcess AUTOTUNE benchmarking takes 0.2742 seconds and 0.6461 seconds precompiling for 39 choices 2025-12-04T12:10:21.4024734Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-bae73f6a5e89f809.xml - 2025-12-04T12:10:21.4024799Z =========================== short test summary info ============================ 2025-12-04T12:10:21.4025429Z FAILED [2.3655s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3007315968. 2025-12-04T12:10:21.4025434Z 2025-12-04T12:10:21.4025520Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.4025791Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.4025794Z 2025-12-04T12:10:21.4025884Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.4025949Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.4026020Z ================= 1 failed, 110 deselected, 2 rerun in 26.20s ================== 2025-12-04T12:10:21.4026059Z Got exit code 1 2025-12-04T12:10:21.4026098Z Retrying single test... 
2025-12-04T12:10:21.4026244Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6e7f1d2f3686efb0.xml 2025-12-04T12:10:21.4026304Z ============================= test session starts ============================== 2025-12-04T12:10:21.4026422Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.4026472Z cachedir: .pytest_cache 2025-12-04T12:10:21.4026633Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.4026681Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.4026722Z configfile: pytest.ini 2025-12-04T12:10:21.4026887Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.4026965Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.4027229Z stepcurrent: skipping 110 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.4027283Z Running 1 items in this shard 2025-12-04T12:10:21.4027286Z 2025-12-04T12:10:21.4027633Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:17:09.664768799 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4027636Z 2025-12-04T12:10:21.4027950Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4028249Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4028383Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4028865Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4029129Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4029355Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4029574Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4029776Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4030008Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4030267Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4030499Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4030725Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4030970Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4031190Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4031415Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4031635Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4031876Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4032093Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4032321Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4032537Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4032766Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4032985Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4033176Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4033409Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4033635Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4033867Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4034056Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4034276Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4034502Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4034719Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4034947Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4035175Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4035378Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4035589Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4035749Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4035938Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4036466Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp7n3nwilj/pk/cpkwh7kldh4znmktanpop3o2j6rkmavnhifjlnwie2xbt56zzixl.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.4036611Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.4036826Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.4036982Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.4037273Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.4037418Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.4037675Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.4037815Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.4038084Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.4038241Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.4038511Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.4038646Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.4038920Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.4039114Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.4039437Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4039733Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4039862Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4040369Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4040634Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4040860Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4041069Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4041271Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4041498Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4041730Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4041961Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4042193Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4042420Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4042638Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4042866Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4043083Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4043309Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4043546Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4043773Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4043990Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4044218Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4044448Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4044640Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4044857Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4045083Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4045302Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4045489Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4045708Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4045946Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4046164Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4046411Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4046633Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4046835Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4047046Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4047205Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4047384Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4047489Z E1204 11:17:16.677000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.4047656Z [W1204 11:17:16.181231212 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.4047659Z 2025-12-04T12:10:21.4047969Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.4048258Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4048389Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4048875Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4049127Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4049350Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4049557Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4049757Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4049997Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4050247Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4050490Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4050708Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4050935Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4051157Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4051383Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4051600Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4051839Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4052055Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4052281Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4052497Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4052737Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4052955Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4053146Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4053362Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4053591Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4053809Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4053999Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4054227Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4054451Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4054680Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4054904Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4055122Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4055323Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4055531Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4055692Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4055871Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4056407Z E1204 11:17:16.720000 870247 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp7n3nwilj/rg/crglecrd3vrrxbs2dqzmtlex2mzj6sygkctp3cl4xgr2yzt7yheh.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.4056552Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.4056861Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.4057027Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.4057313Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.4057446Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.4057703Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.4057844Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.4058101Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.4058258Z E1204 11:17:16.720000 870247 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.4058534Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.4058667Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.4058950Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.4059143Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.4059457Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4059746Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4059876Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4060392Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4060662Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4060887Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4061092Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4061305Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4061534Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4061753Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4061979Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4062196Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4062423Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4062640Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4062888Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4063105Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4063341Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4063560Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4063786Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4064005Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4064230Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4064449Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4064648Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4064868Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4065097Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4065313Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4065512Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4065729Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4065955Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4066172Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4066398Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4066617Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4066818Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4067039Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4067199Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4067388Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4067494Z E1204 11:17:16.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.4067650Z [W1204 11:17:16.186666251 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.4067652Z 2025-12-04T12:10:21.4067959Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.4068247Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4068378Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4068851Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4069113Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4069337Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4069556Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4069757Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4069983Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4070215Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4070442Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4070659Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4070885Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4071116Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4071341Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4071573Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4071804Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4072030Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4072256Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4072476Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4072705Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4072936Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4073128Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4073346Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4073574Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4073803Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4073996Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4074216Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4074445Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4074662Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4074891Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4075113Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4075325Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4075537Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4075703Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4075885Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4076409Z E1204 11:17:16.726000 870247 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp7n3nwilj/r5/cr5ogh5aq2rbxtc3f74mkfjrfhlrj5e435w2lis7izdqcqqjijwt.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.4076556Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.4076776Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.4076931Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.4077229Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.4077361Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.4077618Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.4077759Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.4078025Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.4078184Z E1204 11:17:16.726000 870247 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.4078451Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.4078588Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.4078866Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.4079060Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.4079376Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4079675Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4079805Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4080332Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4080588Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4080816Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4081023Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4081228Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4081475Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4081700Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4081926Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4082148Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4082390Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4082608Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4082833Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4083050Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4083283Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4083505Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4083745Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4083965Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4084201Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4084422Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4084612Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4084831Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4085057Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4085276Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4085467Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4085697Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4085928Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4086145Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4086374Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4086602Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4086806Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4087021Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4087180Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4087361Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4087466Z E1204 11:17:16.726000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.4087625Z [W1204 11:17:16.189184938 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.4087627Z 2025-12-04T12:10:21.4087942Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.4088236Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4088376Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4088848Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4089101Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4089325Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4089532Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4089742Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4089969Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4090260Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4090496Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4090730Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4090957Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4091177Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4091404Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4091621Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4091848Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4092064Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4092301Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4092519Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4092760Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4092980Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4093169Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4093388Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4093614Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4093834Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4094042Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4094261Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4094487Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4094704Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4094943Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4095160Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4095362Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4095571Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4095731Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4095909Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4096439Z E1204 11:17:16.728000 870247 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp7n3nwilj/sy/csyk6cjft33q2vmdakuxjvkuee6knwuff45z42j4qfh4l64sdxry.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.4096585Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.4096808Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.4096964Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.4097253Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.4097384Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.4097639Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.4097776Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.4098030Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.4098194Z E1204 11:17:16.728000 
870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.4098462Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.4098596Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.4098870Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.4099073Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.4099388Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4099683Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4099811Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4100342Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4100607Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4100831Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4101049Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4101250Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4101478Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4101700Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4101927Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4102145Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4102383Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4102602Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4102828Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4103045Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4103284Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4103502Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4103729Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4103947Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4104175Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4104392Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4104582Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4104808Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4105033Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4105260Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4105451Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4105668Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4105894Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4106111Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4106341Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4106573Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4106774Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4106985Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4107144Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4107322Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4107440Z E1204 11:17:16.728000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.4107595Z [W1204 11:17:16.192131269 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.4107597Z 2025-12-04T12:10:21.4107749Z [W1204 11:17:16.193148756 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4107751Z 2025-12-04T12:10:21.4108060Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.4108350Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4108482Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4108969Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4109220Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4109461Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4109667Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4109870Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4110119Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4110340Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4110568Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4110802Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4111031Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4111249Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4111478Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4111711Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4111940Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4112163Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4112389Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4112612Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4112839Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4113072Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4113261Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4113494Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4113723Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4113941Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4114131Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4114348Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4114576Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4114805Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4115031Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4115252Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4115454Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4115668Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4115836Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4116017Z E1204 
11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4116539Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp7n3nwilj/2q/c2q3bo3nhpj55xwdb4yaafdidijudkebr3goyjvcwdqwopfuldps.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.4116686Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.4116903Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.4117059Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.4117356Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.4117486Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.4117757Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.4117900Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.4118153Z E1204 11:17:16.733000 870247 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.4118312Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.4118579Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.4118714Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.4118987Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.4119192Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.4119507Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4119796Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4119937Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4120466Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4120716Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4120944Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4121150Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4121354Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4121592Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4121815Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4122052Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4122274Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4122508Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4122727Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4122957Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4123176Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4123418Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4123635Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4123862Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4124083Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4124324Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4124546Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4124739Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4124959Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4125186Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4125407Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4125596Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4125827Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4126053Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4126279Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4126508Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4126727Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4126932Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4127147Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4127308Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4127501Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4127605Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.4127912Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.4128201Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4128341Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4128816Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4129064Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4129292Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4129500Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4129701Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4129939Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4130198Z E1204 11:17:16.733000 870247 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4130439Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4130658Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4130884Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4131103Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4131331Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4131550Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4131790Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4132011Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4132236Z E1204 11:17:16.733000 870247 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4132453Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4132690Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4132908Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4133099Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4133317Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4133545Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4133762Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4133954Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4134183Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4134410Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4134639Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4134867Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4135087Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4135288Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4135500Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4135659Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4135848Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4136375Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp7n3nwilj/37/c37wm22win6ziqpocp4qlhkovxjslc37imuhelrpgvr5yu7mrpl5.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.4136520Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.4136733Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.4136896Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.4137183Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.4137316Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.4137572Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.4137711Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.4137963Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.4138122Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.4138397Z E1204 11:17:16.733000 870247 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.4138535Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.4138820Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.4139016Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.4139331Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4139621Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4139754Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4140270Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4140536Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4140762Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4140969Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4141190Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4141419Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4141640Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4141868Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4142087Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4142316Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4142532Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4142770Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4142987Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4143234Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4143456Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4143681Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4143901Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4144127Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4144348Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4144547Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4144768Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4144996Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4145216Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4145420Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4145641Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4145873Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4146092Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4146321Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4146543Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4146754Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.4146969Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.4147127Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.4147322Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.4147429Z E1204 11:17:16.733000 870247 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.4147488Z ('RERUN', {'yellow': True}) [11.0506s] [100%] 2025-12-04T12:10:21.4147838Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:17:18.121682187 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4147844Z 2025-12-04T12:10:21.4147992Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4148291Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.4148591Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4148724Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4149197Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4149460Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4149689Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4149894Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4150137Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4150366Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4150590Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4150822Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4151053Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4151285Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4151516Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4151744Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4151961Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4152189Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4152409Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4152609Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4152832Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4153031Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4153261Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4153477Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4153681Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4153900Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame 
from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4154127Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4154347Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4154538Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4154760Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4154960Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4155160Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4155380Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4155584Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4155774Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4155991Z E1204 
11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4156221Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4156438Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4156666Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4156905Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4157102Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4157315Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4157516Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4157747Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4157976Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4158206Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4158426Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4158652Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4158875Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4159103Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4159333Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4159561Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4159789Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4160020Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4160281Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4160512Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4160729Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4160956Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4161187Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4161415Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4161636Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4161863Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4162095Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4162321Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4162541Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4162741Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4162939Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4163168Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4163384Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4163623Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4163841Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4164084Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4164308Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4164533Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4164752Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4164977Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4165196Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4165431Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4165651Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4165848Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4166037Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4166270Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4166469Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4166680Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4166880Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4167108Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4167327Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4167526Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4167727Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4167945Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4168151Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4168341Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4168567Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4168793Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4169011Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4169239Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4169466Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4169665Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4169872Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4170073Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4170349Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4170581Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4170788Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4170990Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4171191Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4171420Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4171641Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4171883Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4172102Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4172350Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4172571Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4172803Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4173025Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4173256Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4173482Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4173694Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4173889Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4174110Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4174315Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4174528Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4174733Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4174963Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4175184Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4175418Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4175639Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4175870Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4176104Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4176336Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4176568Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4176797Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4177020Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4177224Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4177424Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4177620Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4177843Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4178047Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4178278Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4178500Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4178712Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4178915Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4179112Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4179341Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4179563Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4179797Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4180023Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4180322Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4180546Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4180792Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4181016Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4181249Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4181470Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4181701Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4181922Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4182166Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4182392Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4182621Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4182844Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4183083Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4183306Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4183534Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4183757Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4183960Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4184152Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4184375Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4184616Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4184840Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4185076Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4185300Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4185529Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4185750Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4185982Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4186202Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4186442Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4186663Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4186859Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4187082Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4187319Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4187543Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4187772Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4187996Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4188211Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4188416Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4188618Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4188840Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4189070Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4189303Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4189519Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4189722Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4189922Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4190176Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4190407Z E1204 
11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4190645Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4190847Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4191047Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4191239Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4191409Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4191634Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4191824Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4192045Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4192273Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4192495Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4192696Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4192902Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4193126Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4193336Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4193529Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4193752Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4193947Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4194168Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4194359Z E1204 11:17:18.672000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4194583Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4194782Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4195005Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4195232Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4195456Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4195694Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4195920Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4196153Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4196372Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4196603Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4196824Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4197063Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4197282Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4197494Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4197686Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4197907Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4198125Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4198330Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4198532Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4198733Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4198975Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4199198Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4199427Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4199649Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4199850Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4200072Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4200336Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4200562Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4200797Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4201020Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4201251Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4201454Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4201653Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4201859Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4202054Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4202281Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4202509Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4202733Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4202962Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4203196Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4203389Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4203610Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4203842Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4204084Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4204316Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4204536Z 
E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4204728Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4204955Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4205189Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4205423Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4205651Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4205881Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4206095Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4206302Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4206504Z E1204 11:17:18.672000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4206703Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4206934Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4207154Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4207359Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4207582Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4207811Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4208035Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4208272Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4208497Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4208710Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4208914Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4209113Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4209316Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4209549Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4209784Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4210018Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4210295Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4210526Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4210746Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4210939Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4211167Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4211397Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4211638Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4211869Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4212104Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4212319Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4212542Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4212744Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4212945Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4213175Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4213397Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4213628Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4213861Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4214091Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4214330Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4214559Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4214781Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4215008Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4215230Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4215431Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4215648Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4215839Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4216036Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4216252Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4216462Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4216687Z E1204 
11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4216954Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4217163Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4217343Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4217471Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:21.4217630Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4217744Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4217882Z E1204 11:17:18.672000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4218051Z [W1204 11:17:18.136744140 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4218054Z 2025-12-04T12:10:21.4218206Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4218513Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4218815Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4218953Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4219435Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4219695Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4219944Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4220197Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4220408Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4220638Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4220882Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4221115Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4221345Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4221577Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4221801Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4222035Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4222256Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4222501Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4222721Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4222945Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4223160Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4223360Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4223593Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4223816Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4224015Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4224248Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4224481Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4224706Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4224896Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4225133Z E1204 
11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4225334Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4225531Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4225751Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4225957Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4226151Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4226376Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4226621Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4226841Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4227086Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4227308Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4227511Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4227727Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4227929Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4228162Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4228385Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4228629Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4228852Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4229086Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4229310Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4229549Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4229773Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4230002Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4230274Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4230503Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4230730Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4230975Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4231195Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4231441Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4231662Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4231895Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4232117Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4232350Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4232576Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4232817Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4233046Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4233252Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4233457Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4233698Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4233923Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4234159Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4234379Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4234614Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4234836Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4235070Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4235303Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4235539Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4235778Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4236008Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4236231Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4236431Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4236627Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4236850Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4237062Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4237277Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4237479Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4237788Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4238027Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4238229Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4238420Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4238648Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4238852Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4239045Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4239271Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4239509Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4239732Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4239975Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4240260Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4240466Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4240680Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4240887Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4241120Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4241346Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4241570Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4241777Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4241983Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4242214Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4242457Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4242687Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4242915Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4243145Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4243373Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4243607Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4244410Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4244643Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4244875Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4245083Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4245283Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4245507Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4245716Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4245917Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4246126Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4246367Z 
E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4246595Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4246831Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4247054Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4247297Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4247519Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4247753Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4247975Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4248210Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4248436Z 
E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4248655Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4248863Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4249067Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4249285Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4249488Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4249722Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4249953Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4250193Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4250399Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4250614Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4250851Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4251075Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4251309Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4251548Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4251778Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4252003Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4252237Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4252465Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4252697Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4252941Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4253175Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4253408Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4253643Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4253867Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4254103Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4254325Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4254562Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4254801Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4255033Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4255262Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4255463Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4255672Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4255893Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4256129Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4256355Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4256586Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4256814Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4257046Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4257284Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4257520Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4257751Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4257988Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4258209Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4258407Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4258630Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4258865Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4259103Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4259337Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4259565Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4259779Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4259998Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4260236Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4260445Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4260678Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4260899Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4261121Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4261326Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4261549Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4261753Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4262002Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4262232Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4262436Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4262641Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4262836Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4262992Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4263228Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4263424Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4263653Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4263887Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4264116Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4264330Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4264526Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4264748Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4264950Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4265146Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4265368Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4265562Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4265793Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4265987Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4266220Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4266415Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4266637Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4266865Z E1204 11:17:18.675000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4267087Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4267316Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4267546Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4267774Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4267995Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4268223Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4268459Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4268690Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4268909Z E1204 11:17:18.675000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4269109Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4269300Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4269522Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4269736Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4269946Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4270187Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4270400Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4270632Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4270854Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4271081Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4271303Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4271493Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4271728Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4271956Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4272178Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4272408Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4272642Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4272858Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4273062Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4273262Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4273454Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4273646Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4273865Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4274105Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4274327Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4274563Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4274787Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4274977Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4275220Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4275451Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4275673Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4275912Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4276131Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4276324Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4276543Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4276783Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4277003Z E1204 11:17:18.675000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4277234Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4277459Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4277674Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4277879Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4278077Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4278287Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4278517Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4278748Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4278943Z E1204 11:17:18.675000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4279162Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4279391Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4279614Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4279846Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4280076Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4280334Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4280538Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4280736Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4280951Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4281179Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4281402Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4281632Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4281852Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4282087Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4282307Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4282513Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4282734Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4282975Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4283197Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4283424Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4283645Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4283858Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4284063Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4284281Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4284487Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4284716Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4284934Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4285172Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4285392Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4285621Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4285840Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4286071Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4286292Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4286529Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4286754Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4286959Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4287165Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4287355Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4287554Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4287771Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4287978Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4288180Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4288381Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4288576Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4292837Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4292983Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0
2025-12-04T12:10:21.4293136Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0
2025-12-04T12:10:21.4293288Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] .
2025-12-04T12:10:21.4293417Z E1204 11:17:18.675000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice.
2025-12-04T12:10:21.4293578Z [W1204 11:17:18.139156318 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...
2025-12-04T12:10:21.4293581Z
2025-12-04T12:10:21.4293728Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning:
2025-12-04T12:10:21.4294024Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help.
2025-12-04T12:10:21.4294323Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first):
2025-12-04T12:10:21.4294456Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback:
2025-12-04T12:10:21.4294955Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0
2025-12-04T12:10:21.4295221Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0
2025-12-04T12:10:21.4295450Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0
2025-12-04T12:10:21.4295662Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552
2025-12-04T12:10:21.4295865Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215
2025-12-04T12:10:21.4296095Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112
2025-12-04T12:10:21.4296317Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4296558Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4296779Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4297006Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4297226Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4297471Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4297693Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4297921Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4298138Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4298336Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4298545Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4298750Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4298985Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4299204Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4299402Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4299622Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4299854Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4300073Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4300313Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4300531Z E1204 
11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4300728Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4300935Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4301155Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4301352Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4301539Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4301772Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4301999Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4302225Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4302456Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4302673Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4302875Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4303083Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4303295Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4303522Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4303764Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4303992Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4304211Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4304442Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4304662Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4304891Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4305120Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4305350Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4305568Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4305795Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4306022Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4306248Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4306469Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4306697Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4306918Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4307144Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4307374Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4307599Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4307826Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4308053Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4308272Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4308474Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4308671Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4308900Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4309132Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4309359Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4309580Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4309808Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4310036Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4310297Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4310515Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4310742Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4310960Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4311186Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4311410Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4311621Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4311809Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4312036Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4312236Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4312443Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4312645Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4312874Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4313094Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4313309Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4313497Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4313720Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4313916Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4314105Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4314337Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4314563Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4314782Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4315007Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4315229Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4315426Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4315644Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4315846Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4316077Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4316310Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4316515Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4316718Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4316919Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4317149Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4317369Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4317606Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4317828Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4318057Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4318282Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4318519Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4318741Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4318967Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4319185Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4319387Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4319577Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4319819Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4320021Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4320266Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4320485Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4320714Z 
E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4320935Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4321162Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4321384Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4321610Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4321846Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4322075Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4322294Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4322523Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4322755Z 
E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4322959Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4323156Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4323350Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4323561Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4323761Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4324003Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4324224Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4324425Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4324633Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4324836Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4325067Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4325287Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4325514Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4325735Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4325972Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4326190Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4326418Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4326639Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4326879Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4327100Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4327330Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4327551Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4327780Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4328000Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4328239Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4328460Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4328696Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4328916Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4329145Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4329369Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4329569Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4329762Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4329994Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4330259Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4330478Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4330707Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4330939Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4331168Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4331388Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4331613Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4331837Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4332064Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4332284Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4332485Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4332705Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4332945Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4333166Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4333393Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4333613Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4333828Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4334036Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4334247Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4334449Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4334675Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4334896Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4335127Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4335329Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4335529Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4335730Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4335963Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4336185Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4336392Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4336599Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4336791Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4336950Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4337170Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4337363Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4337583Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4337809Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4338028Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4338228Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4338428Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4338652Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4338849Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4339038Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4339269Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4339458Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4339677Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4339866Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4340085Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4340314Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4340532Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4340781Z E1204 11:17:18.678000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4341001Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4341239Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4341460Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4341687Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4341907Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4342134Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4342353Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4342592Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4342811Z E1204 11:17:18.678000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4343012Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4343202Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4343433Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4343646Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4343849Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4344045Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4344247Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4344476Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4344694Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4344932Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4345150Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4345351Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4345573Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4345800Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4346019Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4346245Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4346472Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4346692Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4346895Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4347094Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4347285Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4347485Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4347707Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4347935Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4348153Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4348381Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4348601Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4348790Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4349018Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4349244Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4349476Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4349704Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4349927Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4350151Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4350371Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4350599Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4350848Z E1204 11:17:18.678000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4351075Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4351293Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4351507Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4351722Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4351920Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4352122Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4352348Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4352568Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4352757Z E1204 11:17:18.678000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4352975Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4353213Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4353431Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4353669Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4353889Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4354102Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4354306Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4354509Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4354711Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4354947Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4355168Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4355394Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4355613Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4355848Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4356068Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4356258Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4356477Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4356708Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4356929Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4357156Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4357382Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4357596Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4357807Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4358006Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4358205Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4358432Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4358651Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4358887Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4359115Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4359343Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4359562Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4359791Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4360020Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4360291Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4360512Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4360708Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4360912Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4361105Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4361303Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4361529Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4361736Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4361945Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4362137Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4362329Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4362500Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4362625Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.4362769Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4362877Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4363013Z E1204 11:17:18.678000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4363173Z [W1204 11:17:18.181200379 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4363175Z 2025-12-04T12:10:21.4363322Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4363622Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4363917Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4364061Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4364544Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4364798Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4365027Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4365235Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4365446Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4365677Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4365916Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4366149Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4366370Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4366598Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4366821Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4367049Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4367277Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4367503Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4367722Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4367921Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4368134Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4368345Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4368571Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4368792Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4368981Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4369201Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4369431Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4369659Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4369848Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4370065Z E1204 
11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4370312Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4370503Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4370722Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4370919Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4371106Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4371327Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4371566Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4371786Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4372010Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4372229Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4372438Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4372651Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4372854Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4373079Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4373297Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4373524Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4373743Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4373981Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4374198Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4374435Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4374654Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4374886Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4375106Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4375333Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4375554Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4375789Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4376010Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4376236Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4376456Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4376694Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4376913Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4377142Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4377364Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4377596Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4377818Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4378030Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4378227Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4378468Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4378690Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4378916Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4379136Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4379366Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4379587Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4379825Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4380042Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4380305Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4380522Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4380762Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4380981Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4381178Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4381366Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4381585Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4381784Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4381993Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4382213Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4382439Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4382669Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4382867Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4383055Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4383273Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4383468Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4383657Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4383875Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4384119Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4384337Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4384564Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4384783Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4384989Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4385197Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4385397Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4385628Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4385850Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4386054Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4386256Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4386466Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4386697Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4386927Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4387156Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4387377Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4387604Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4387824Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4388051Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4388281Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4388512Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4388734Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4388933Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4389132Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4389353Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4389555Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4389753Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4389953Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4390217Z 
E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4390439Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4390678Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4390903Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4391142Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4391366Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4391593Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4391816Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4392047Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4392268Z 
E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4392483Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4392684Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4392878Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4393092Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4393308Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4393538Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4393760Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4393964Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4394162Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4394365Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4394594Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4394822Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4395051Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4395283Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4395515Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4395736Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4395963Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4396183Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4396413Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4396645Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4396873Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4397094Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4397322Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4397562Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4397791Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4398011Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4398240Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4398462Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4398690Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4398918Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4399120Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4399322Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4399544Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4399773Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4399995Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4400267Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4400486Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4400725Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4400952Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4401179Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4401400Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4401640Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4401865Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4402057Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4402282Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4402511Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4402732Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4402965Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4403197Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4403412Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4403626Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4403827Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4404029Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4404258Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4404482Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4404696Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4404909Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4405107Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4405308Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4405537Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4405767Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4405972Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4406173Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4406365Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4406513Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4406738Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4406933Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4407151Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4407391Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4407611Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4407822Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4408011Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4408233Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4408432Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4408621Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4408848Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4409049Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4409273Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4409463Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4409684Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4409883Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4410129Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4410358Z E1204 11:17:18.720000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4410577Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4410808Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4411027Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4411260Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4411492Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4411720Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4411952Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4412181Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4412404Z E1204 11:17:18.720000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4412601Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4412791Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4413013Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4413244Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4413449Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4413653Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4413858Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4414098Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4414319Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4414547Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4414765Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4414956Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4415175Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4415405Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4415633Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4415862Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4416093Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4416306Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4416507Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4416706Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4416900Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4417089Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4417319Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4417548Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4417767Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4417999Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4418229Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4418420Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4418642Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4418869Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4419091Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4419319Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4419539Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4419737Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4419958Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4420240Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4420464Z E1204 11:17:18.720000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4420695Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4420913Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4421125Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4421328Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4421540Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4421743Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4421971Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4422197Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4422397Z E1204 11:17:18.720000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4422624Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4422855Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4423076Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4423306Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4423526Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4423742Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4423955Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4424155Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4424363Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4424596Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4424821Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4425050Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4425270Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4425497Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4425726Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4425915Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4426137Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4426367Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4426595Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4426822Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4427043Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4427258Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4427459Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4427658Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4427859Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4428095Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4428315Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4428558Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4428781Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4429009Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4429232Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4429462Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4429682Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4429919Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4430177Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4430375Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4430578Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4430781Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4430981Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4431196Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4431407Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4431608Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4431802Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4431995Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4432177Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4432303Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.4432447Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4432566Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4432693Z E1204 11:17:18.720000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4432851Z [W1204 11:17:18.183234562 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4432854Z 2025-12-04T12:10:21.4432996Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4433290Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4433581Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4433713Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4434211Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4434464Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4434691Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4434906Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4435107Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4435336Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4435555Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4435787Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4436005Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4436244Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4436461Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4436699Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4436921Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4437147Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4437366Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4437563Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4437773Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4437976Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4438212Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4438433Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4438623Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4438842Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4439079Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4439299Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4439492Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4439708Z E1204 
11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4439907Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4440135Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4440355Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4440563Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4440754Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4440989Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4441219Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4441440Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4441665Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4441882Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4442080Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4442303Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4442502Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4442730Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4442948Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4443193Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4443413Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4443640Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4443860Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4444089Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4444309Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4444536Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4444778Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4445004Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4445231Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4445462Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4445683Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4445911Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4446132Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4446359Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4446587Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4446813Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4447031Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4447259Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4447486Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4447691Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4447887Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4448116Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4448335Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4448563Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4448791Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4449015Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4449246Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4449473Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4449693Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4449923Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4450178Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4450404Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4450636Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4450836Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4451023Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4451240Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4451437Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4451660Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4451859Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4452085Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4452308Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4452509Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4452702Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4452921Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4453148Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4453338Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4453573Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4453802Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4454020Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4454248Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4454468Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4454668Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4454885Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4455084Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4455314Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4455533Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4455748Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4455947Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4456150Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4456378Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4456598Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4456829Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4457048Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4457285Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4457505Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4457742Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4457964Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4458191Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4458411Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4458608Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4458801Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4459034Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4459236Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4459434Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4459633Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4459876Z 
E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4460129Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4460358Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4460578Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4460805Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4461027Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4461255Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4461489Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4461716Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4461950Z 
E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4462153Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4462351Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4462543Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4462752Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4462954Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4463193Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4463413Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4463617Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4463815Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4464027Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4464255Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4464475Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4464701Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4464922Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4465151Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4465369Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4465605Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4465826Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4466064Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4466284Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4466512Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4466732Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4466960Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4467182Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4467418Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4467639Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4467864Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4468086Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4468325Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4468543Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4468743Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4468931Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4469154Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4469380Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4469608Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4469836Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4470064Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4470331Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4470554Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4470783Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4471004Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4471231Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4471464Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4471653Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4471874Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4472100Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4472337Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4472568Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4472789Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4473002Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4473204Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4473402Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4473603Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4473843Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4474066Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4474289Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4474496Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4474695Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4474898Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4475127Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4475349Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4475568Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4475765Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4475959Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4476106Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4476328Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4476527Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4476748Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4476978Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4477200Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4477402Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4477592Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4477812Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4478018Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4478211Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4478445Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4478636Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4478856Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4479047Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4479272Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4479462Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4479698Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4479930Z E1204 11:17:18.722000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4480200Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4480430Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4480662Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4480894Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4481112Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4481341Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4481563Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4481794Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4482014Z E1204 11:17:18.722000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4482222Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4482413Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4482643Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4482858Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4483059Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4483258Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4483459Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4483688Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4483952Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4484183Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4484404Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4484593Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4484826Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4485056Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4485276Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4485504Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4485722Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4485939Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4486143Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4486351Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4486543Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4486740Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4486962Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4487189Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4487409Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4487638Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4487856Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4488059Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4488279Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4488514Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4488734Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4488974Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4489194Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4489383Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4489603Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4489830Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4490050Z E1204 11:17:18.722000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4490313Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4490548Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4490766Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4490987Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4491189Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4491388Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4491620Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4491842Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4492033Z E1204 11:17:18.722000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4492267Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4492494Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4492715Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4492943Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4493182Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4493399Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4493600Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4493800Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4494001Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4494230Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4494450Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4494685Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4494908Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4495144Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4495369Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4495558Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4495780Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4496006Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4496228Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4496465Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4496684Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4496898Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4497100Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4497308Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4497509Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4497740Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4497961Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4498191Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4498413Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4498639Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4498869Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4499096Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4499327Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4499558Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4499778Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4499979Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4500223Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4500418Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4500632Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4500850Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4501059Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4501255Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4501463Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4501655Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4501826Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4501951Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.4502098Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4502202Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4502331Z E1204 11:17:18.722000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4502488Z [W1204 11:17:18.185287245 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4502490Z 2025-12-04T12:10:21.4502631Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4502936Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4503227Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4503370Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4503846Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4504101Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4504327Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4504538Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4504750Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4504977Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4505200Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4505428Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4505655Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4505883Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4506101Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4506329Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4506547Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4506776Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4506997Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4507212Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4507422Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4507632Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4507861Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4508081Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4508269Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4508488Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4508714Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4508949Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4509138Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4509356Z E1204 
11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4509554Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4509749Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4509968Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4510191Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4510380Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4510597Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4510825Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4511044Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4511282Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4511500Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4511707Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4511916Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4512114Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4512341Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4512561Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4512789Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4513019Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4513244Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4513462Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4513691Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4513923Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4514151Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4514371Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4514596Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4514814Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4515042Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4515259Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4515497Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4515715Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4515954Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4516173Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4516399Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4516617Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4516844Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4517063Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4517273Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4517470Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4517696Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4517914Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4518152Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4518372Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4518600Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4518818Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4519044Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4519262Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4519498Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4519717Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4519951Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4520205Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4520405Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4520592Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4520812Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4521009Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4521219Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4521428Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4521656Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4521873Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4522068Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4522274Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4522492Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4522693Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4522880Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4523099Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4523325Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4523542Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4523781Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4523998Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4524206Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4524414Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4524617Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4524850Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4525071Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4525275Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4525483Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4525687Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4525915Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4526136Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4526373Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4526592Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4526820Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4527040Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4527271Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4527493Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4527720Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4527952Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4528149Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4528350Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4528570Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4528773Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4528972Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4529172Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4529407Z 
E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4529637Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4529865Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4530084Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4530343Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4530574Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4530800Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4531020Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4531246Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4531466Z 
E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4531671Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4531870Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4532075Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4532283Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4532499Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4532729Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4532948Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4533151Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4533348Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4533550Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4533790Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4534013Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4534241Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4534462Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4534698Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4534918Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4535145Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4535364Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4535591Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4535811Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4536040Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4536269Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4536497Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4536727Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4536955Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4537175Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4537401Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4537621Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4537848Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4538084Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4538286Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4538479Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4538701Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4538938Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4539160Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4539390Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4539609Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4539837Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4540056Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4540333Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4540552Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4540794Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4541016Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4541206Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4541425Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4541651Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4541872Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4542110Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4542329Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4542542Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4542743Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4542944Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4543158Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4543385Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4543604Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4543818Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4544022Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4544220Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4544431Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4544658Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4544888Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4545089Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4545290Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4545483Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4545629Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4545851Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4546041Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4546271Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4546499Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4546718Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4546917Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4547117Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4547338Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4547538Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4547727Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4547946Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4548136Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4548356Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4548554Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4548774Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4548971Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4549193Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4549421Z E1204 11:17:18.724000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4549643Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4549874Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4550128Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4550372Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4550594Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4550822Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4551041Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4551285Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4551508Z E1204 11:17:18.724000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4551706Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4551898Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4552121Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4552338Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4552540Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4552752Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4552953Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4553205Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4553427Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4553655Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4553877Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4554066Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4554290Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4554531Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4554751Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4554980Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4555199Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4555424Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4555626Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4555828Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4556019Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4556208Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4556432Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4556663Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4556900Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4557126Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4557354Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4557545Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4557763Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4557991Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4558208Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4558437Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4558668Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4558863Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4559082Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4559308Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4559538Z E1204 11:17:18.724000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4559764Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4559983Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4560241Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4560442Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4560642Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4560843Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4561087Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4561307Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4561512Z E1204 11:17:18.724000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4561732Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4561958Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4562179Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4562404Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4562625Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4562850Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4563054Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4563255Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4563455Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4563695Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4563914Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4564142Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4564360Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4564586Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4564810Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4565000Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4565230Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4565455Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4565687Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4565917Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4566136Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4566348Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4566548Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4566746Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4566957Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4567186Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4567406Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4567633Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4567868Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4568095Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4568318Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4568544Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4568764Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4568991Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4569214Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4569426Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4569629Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4569828Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4570027Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4570281Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4570488Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4570686Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4570879Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4571084Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4571256Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4571385Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.4571531Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4571635Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4571761Z E1204 11:17:18.724000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4571829Z ('RERUN', {'yellow': True}) [1.7897s] [100%] 2025-12-04T12:10:21.4572181Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:17:20.707763393 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4572184Z 2025-12-04T12:10:21.4572328Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4572626Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4572922Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4573052Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4573541Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4573794Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4574028Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4574235Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4574433Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4574661Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4574882Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4575111Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4575349Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4575575Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4577561Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4577795Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4578031Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4578259Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4578478Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4578676Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4578888Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4579090Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4579314Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4579544Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4579732Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4579961Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4580243Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4580462Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4580650Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4580867Z E1204 
11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4581068Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4581271Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4581493Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4581690Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4581875Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4582107Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4582334Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4582552Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4582776Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4582995Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4583193Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4583403Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4583616Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4583842Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4584070Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4584297Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4584515Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4584742Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4584960Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4585188Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4585415Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4585644Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4585863Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4586089Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4586330Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4586555Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4586773Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4586998Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4587216Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4587444Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4587660Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4587903Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4588121Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4588357Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4588574Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4588774Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4588969Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4589195Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4589413Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4589646Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4589866Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4590122Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4590343Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4590582Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4590799Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4591026Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4591242Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4591468Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4591685Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4591881Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4592081Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4592298Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4592506Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4592715Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4592915Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4593140Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4593359Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4593556Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4593755Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4593972Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4594167Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4594355Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4594582Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4594812Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4595030Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4595255Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4595472Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4595668Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4595876Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4596083Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4596315Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4596545Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4596747Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4596949Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4597153Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4597380Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4597600Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4597839Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4598059Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4598285Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4598506Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4598743Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4598965Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4599196Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4599420Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4599621Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4599811Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4600033Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4600283Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4600481Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4600691Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4600921Z 
E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4601142Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4601369Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4601590Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4601820Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4602059Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4602286Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4602505Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4602732Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4602962Z 
E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4603165Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4603362Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4603554Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4603767Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4603970Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4604199Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4604428Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4604629Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4604838Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4605040Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4605265Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4605487Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4605714Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4605934Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4606174Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4606393Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4606621Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4606840Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4607075Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4607295Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4607522Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4607741Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4607967Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4608189Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4608422Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4608653Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4608881Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4609109Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4609337Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4609555Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4609753Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4609943Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4610217Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4610458Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4610681Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4610911Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4611131Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4611369Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4611589Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4611815Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4612034Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4612261Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4612481Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4612680Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4612904Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4613144Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4613363Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4613591Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4613810Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4614023Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4614224Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4614423Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4614635Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4614861Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4615085Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4615299Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4615515Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4615713Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4615913Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4616140Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4616359Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4616562Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4616758Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4616959Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4617108Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4617347Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4617540Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4617762Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4617990Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4618208Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4618410Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4618607Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4618828Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4619025Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4619213Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4619446Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4619636Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4619857Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4620045Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4620298Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4620488Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4620708Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4620948Z E1204 11:17:20.246000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4621166Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4621408Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4621627Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4621859Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4622080Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4622306Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4622526Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4622764Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4622983Z E1204 11:17:20.246000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4623181Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4623370Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4623601Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4623814Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4624018Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4624217Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4624417Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4624645Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4624865Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4625101Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4625319Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4625518Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4625738Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4625966Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4626185Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4626416Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4626637Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4626858Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4627061Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4627258Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4627451Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4627642Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4627872Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4628101Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4628320Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4628550Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4628771Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4628960Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4629189Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4629414Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4629644Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4629871Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4630121Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4630311Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4630531Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4630758Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4630991Z E1204 11:17:20.246000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4631220Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4631443Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4631655Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4631869Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4632068Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4632268Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4632495Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4632713Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4632903Z E1204 11:17:20.246000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4633128Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4633374Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4633593Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4633831Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4634051Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4634264Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4634466Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4634664Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4634864Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4635103Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4635326Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4635555Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4635774Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4636012Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4636231Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4636420Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4636639Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4636866Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4637086Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4637315Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4637542Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4637759Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4637970Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4638169Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4638370Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4638597Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4638816Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4639042Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4639276Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4639505Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4639725Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4639955Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4640235Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4640463Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4640683Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4640883Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4641085Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4641276Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4641474Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4641701Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4641908Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4642116Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4642311Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4642504Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4642675Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4642801Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.4642945Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4643051Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4643176Z E1204 11:17:20.246000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4643346Z [W1204 11:17:20.710184451 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4643349Z 2025-12-04T12:10:21.4643491Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4643787Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4644081Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4644223Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4644713Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4644966Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4645194Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4645401Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4645600Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4645837Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4646056Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4646294Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4646513Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4646743Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4646964Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4647189Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4647409Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4647645Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4647864Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4648061Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4648269Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4648488Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4648714Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4648932Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4649121Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4649341Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4649568Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4649787Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4649984Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4650244Z E1204 
11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4650453Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4650641Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4650861Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4651057Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4651246Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4651468Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4651708Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4651926Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4652152Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4652369Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4652578Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4652787Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4652989Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4653215Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4653432Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4653663Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4653883Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4654119Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4654336Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4654572Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4654790Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4655017Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4655234Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4655460Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4655677Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4655915Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4656134Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4656359Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4656578Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4656812Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4657030Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4657257Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4657475Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4657705Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4657923Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4658126Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4658334Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4658560Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4658785Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4659014Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4659233Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4659459Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4659678Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4659905Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4660171Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4660399Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4660618Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4660845Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4661075Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4661272Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4661459Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4661678Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4661874Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4662081Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4662281Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4662518Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4662739Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4662946Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4663136Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4663354Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4663549Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4663737Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4663954Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4664199Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4664416Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4664644Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4664864Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4665073Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4665283Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4665484Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4665716Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4665935Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4666139Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4666339Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4666548Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4666777Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4667004Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4667236Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4667457Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4667687Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4667907Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4668135Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4668364Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4668591Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4668810Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4669007Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4669207Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4669432Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4669639Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4669838Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4670037Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4670293Z 
E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4670513Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4670752Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4670972Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4671209Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4671430Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4671659Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4671882Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4672108Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4672329Z 
E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4672543Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4672740Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4672931Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4673139Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4673352Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4673579Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4673799Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4674002Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4674200Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4674400Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4674627Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4674857Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4675082Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4675311Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4675540Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4675760Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4675987Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4676208Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4676439Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4676666Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4676894Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4677114Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4677340Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4677570Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4677801Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4678021Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4678249Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4678471Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4678701Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4678920Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4679127Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4679317Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4679553Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4679782Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4680001Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4680274Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4680494Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4680723Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4680958Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4681185Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4681405Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4681632Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4681868Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4682057Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4682278Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4682504Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4682724Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4682955Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4683188Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4683402Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4683603Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4683815Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4684016Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4684244Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4684464Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4684677Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4684881Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4685088Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4685290Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4685519Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4685842Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4686056Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4686253Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4686447Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4686592Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4686812Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4687001Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4687221Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4687460Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4687683Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4687894Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4688084Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4688304Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4688501Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4688691Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4688911Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4689109Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4689329Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4689517Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4689736Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4689928Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4690205Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4690433Z E1204 11:17:20.249000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4690652Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4690881Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4691100Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4691329Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4691560Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4691793Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4692026Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4692258Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4692478Z E1204 11:17:20.249000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4692677Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4692868Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4693087Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4693312Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4693513Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4693772Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4693975Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4694205Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4694444Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4694673Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4694893Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4695084Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4695303Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4695532Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4695766Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4695995Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4696225Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4696438Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4696640Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4696840Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4697031Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4697221Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4697443Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4697679Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4697899Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4698126Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4698345Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4698544Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4698762Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4698992Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4699213Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4699443Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4699663Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4699860Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4700079Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4700469Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4700690Z E1204 11:17:20.249000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4700917Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4701136Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4701350Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4701553Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4701765Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4701965Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4702196Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4702415Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4702606Z E1204 11:17:20.249000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4702838Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4703064Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4703285Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4703512Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4703737Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4703950Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4704163Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4704361Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4704570Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4704799Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4705020Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4705247Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4705466Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4705693Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4705929Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4706118Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4706337Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4706564Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4706784Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4707022Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4707241Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4707456Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4707656Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4707856Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4708059Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4708305Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4708526Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4708763Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4708983Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4709210Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4709429Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4709655Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4709874Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4710163Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4710382Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4710583Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4710786Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4710999Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4711195Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4711407Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4711616Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4711816Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4712011Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4712207Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4712377Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4712514Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.4712659Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4712764Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4712902Z E1204 11:17:20.249000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4713060Z [W1204 11:17:20.712981264 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4713064Z 2025-12-04T12:10:21.4713206Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4713500Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4713793Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4713924Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4714404Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4714671Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4714895Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4715113Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4715315Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4715542Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4715762Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4715990Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4716207Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4716434Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4716661Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4716887Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4717115Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4717346Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4717567Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4717764Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4717972Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4718173Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4718413Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4718632Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4718823Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4719041Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4719280Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4719501Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4719693Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4719913Z E1204 
11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4720150Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4720338Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4720558Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4720770Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4720957Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4721189Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4721417Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4721637Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4721866Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4722087Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4722285Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4722505Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4722704Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4722930Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4723151Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4723377Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4723608Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4723833Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4724052Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4724281Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4724499Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4724727Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4724952Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4725178Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4725404Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4725631Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4725849Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4726076Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4726294Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4726524Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4726758Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4726984Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4727203Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4727428Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4727655Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4727857Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4728052Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4728279Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4728497Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4728725Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4728943Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4729181Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4729400Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4729635Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4729855Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4730080Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4730342Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4730567Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4730787Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4731000Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4731190Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4731411Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4731608Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4731831Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4732030Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4732257Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4732475Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4732671Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4732863Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4733098Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4733312Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4733500Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4733734Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4733962Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4734179Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4734407Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4734623Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4734821Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4735039Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4735238Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4735471Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4735693Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4735907Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4736107Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4736307Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4736534Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4736755Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4736984Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4737203Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4737442Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4737662Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4737901Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4738123Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4738350Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4738570Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4738767Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4738959Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4739189Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4739390Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4739589Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4739788Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4740020Z 
E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4740291Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4740520Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4740739Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4740967Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4741189Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4741416Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4741650Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4741878Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4742119Z 
E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4742324Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4742527Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4742719Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4742928Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4743129Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4743368Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4743587Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4743789Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4743987Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4744188Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4744427Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4744650Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4744878Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4745098Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4745326Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4745545Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4745785Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4746005Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4746241Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4746462Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4746691Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4746913Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4747141Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4747364Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4747599Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4747820Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4748046Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4748267Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4748502Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4748723Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4748923Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4749115Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4749337Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4749566Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4749787Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4750025Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4750287Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4750526Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4750746Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4750974Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4751194Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4751423Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4751647Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4751857Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4752081Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4752307Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4752529Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4752769Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4752990Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4753205Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4753405Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4753607Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4753812Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4754039Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4754270Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4754485Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4754696Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4754896Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4755095Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4755322Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4755542Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4755744Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4755954Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4756147Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4756293Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4756514Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4756715Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4756938Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4757165Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4757384Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4757583Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4757774Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4757996Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4758213Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4758403Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4758633Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4758824Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4759044Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4759232Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4759452Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4759640Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4759861Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4760135Z E1204 11:17:20.252000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4760358Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4760587Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4760808Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4761051Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4761269Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4761497Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4761716Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4761946Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4762165Z E1204 11:17:20.252000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4762377Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4762568Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4762800Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4763017Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4763218Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4763417Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4763619Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4763848Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4764069Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4764307Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4764529Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4764718Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4764942Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4765181Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4765400Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4765627Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4765845Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4766062Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4766263Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4766470Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4766661Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4766849Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4767081Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4767313Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4767535Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4767762Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4767983Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4768173Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4768402Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4768630Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4768850Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4769077Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4769306Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4769498Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4769720Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4769946Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4770205Z E1204 11:17:20.252000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4770432Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4770667Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4770880Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4771093Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4771294Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4771494Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4771727Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4771949Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4772141Z E1204 11:17:20.252000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4772361Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4772598Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4772820Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4773047Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4773267Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4773497Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4773700Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4773903Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4774108Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4774336Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4774555Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4775949Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4776168Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4776404Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4776625Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4776816Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4777038Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4777266Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4777489Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4777718Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4777950Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4778166Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4778366Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4778568Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4778776Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4779004Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4779223Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4779452Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4779676Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4779906Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4780171Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4780398Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4780630Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4780858Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4781084Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4781282Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4781484Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4781675Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4781872Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4782101Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4782308Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4782507Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4782699Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4782903Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4783075Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4783200Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.4783344Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4783449Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4783577Z E1204 11:17:20.252000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4783734Z [W1204 11:17:20.755102834 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4783737Z 2025-12-04T12:10:21.4783880Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4784187Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4784482Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4784612Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4785099Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4785354Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4785580Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4785784Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4785994Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4786220Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4786441Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4786670Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4786900Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4787127Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4787347Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4787573Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4787792Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4788020Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4788237Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4788444Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4788652Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4788864Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4789093Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4789312Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4789503Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4789721Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4789947Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4790218Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4790405Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4790627Z E1204 
11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4790821Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4791010Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4791245Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4791441Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4791632Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4791848Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4792077Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4792295Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4792535Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4792752Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4792961Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4793171Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4793375Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4793607Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4793825Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4794054Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4794270Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4794508Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4794729Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4794954Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4795176Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4795411Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4795633Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4795861Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4796078Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4796305Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4796524Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4796760Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4796979Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4797216Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4797435Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4797661Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4797884Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4798116Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4798339Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4798549Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4798746Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4798977Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4799196Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4799432Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4799649Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4799877Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4800136Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4800368Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4800590Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4800817Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4801048Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4801275Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4801507Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4801706Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4801895Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4802114Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4802309Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4802522Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4802736Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4802963Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4803180Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4803377Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4803579Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4803798Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4803997Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4804184Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4804403Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4804630Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4804852Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4805091Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4805309Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4805524Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4805731Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4805931Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4806162Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4806382Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4806587Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4806786Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4806999Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4807230Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4807450Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4807678Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4807908Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4808136Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4808355Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4808584Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4808804Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4809033Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4809264Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4809465Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4809666Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4809887Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4810125Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4810325Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4810527Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4810756Z 
E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4810978Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4811217Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4811436Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4811668Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4811890Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4812132Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4812353Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4812583Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4812808Z 
E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4813011Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4813210Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4813414Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4813625Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4813838Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4814069Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4814295Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4814500Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4814703Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4814905Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4815134Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4815364Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4815591Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4815812Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4816040Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4816271Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4816498Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4816720Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4816949Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4817168Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4817399Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4817627Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4817857Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4818086Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4818317Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4818538Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4818765Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4818986Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4819215Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4819445Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4819644Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4819833Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4820054Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4820322Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4820542Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4820773Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4820994Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4821220Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4821442Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4821670Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4821909Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4822137Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4822366Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4822560Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4822781Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4823011Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4823234Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4823462Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4823701Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4823915Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4824117Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4824317Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4824527Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4824756Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4824976Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4825194Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4825396Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4825596Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4825798Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4826035Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4826256Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4826467Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4826666Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4826857Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4827007Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4827230Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4827425Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4827650Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4827889Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4828111Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4828308Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4828499Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4828732Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4828928Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4829117Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4829336Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4829527Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4829750Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4829940Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4830203Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4830392Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4830627Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4830855Z E1204 11:17:20.294000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4831079Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4831306Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4831525Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4831756Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4831989Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4832220Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4832440Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4832670Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4832904Z E1204 11:17:20.294000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4833101Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4833294Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4833516Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4833730Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4833933Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4834134Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4834346Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4834575Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4834805Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4835033Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4835255Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4835444Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4835664Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4835891Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4836120Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4836349Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4836570Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4836785Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4837005Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4837204Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4837396Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4837585Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4837806Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4838034Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4838255Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4838490Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4838712Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4838912Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4839133Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4839363Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4839581Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4839809Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4840029Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4840274Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4840494Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4840722Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4840945Z E1204 11:17:20.294000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4841186Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4841408Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4841621Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4841823Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4842022Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4842224Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4842452Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4842681Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4842871Z E1204 11:17:20.294000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4843102Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4843333Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4843553Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4843781Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4844001Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4844213Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4844428Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4844626Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4844827Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4845055Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4845286Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4845519Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4845740Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4845970Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4846190Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4846383Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4846602Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4846838Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4847057Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4847294Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4847514Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4847730Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4847933Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4848134Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4848334Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4848570Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4848789Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4849015Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4849235Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4849474Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4849693Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4849920Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4850178Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4850406Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4850626Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4850835Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4851036Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4851227Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4851434Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4851648Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4851855Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4852052Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4852246Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4852443Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4852638Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4852764Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.4852909Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4853013Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4853141Z E1204 11:17:20.294000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4853297Z [W1204 11:17:20.757194906 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4853311Z 2025-12-04T12:10:21.4853456Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4853748Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4854041Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4854173Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4854650Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4854916Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4855140Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4855354Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4855555Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4855782Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4856002Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4856228Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4856447Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4856682Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4856901Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4857131Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4857349Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4857585Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4857804Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4858004Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4858210Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4858410Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4858637Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4858855Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4859054Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4859274Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4859509Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4859728Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4859917Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4860169Z E1204 
11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4860366Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4860555Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4860772Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4860981Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4861168Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4862824Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4863056Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4863296Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4863523Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4863741Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4863938Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4864144Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4864346Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4864575Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4864805Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4865032Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4865268Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4865497Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4865717Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4865945Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4866164Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4866391Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4866619Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4866844Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4867062Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4867289Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4867522Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4867751Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4867969Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4868195Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4868413Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4868639Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4868865Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4869094Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4869328Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4869530Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4869729Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4869959Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4870219Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4870446Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4870678Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4870903Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4871123Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4871349Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4871578Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4871805Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4872025Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4872253Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4872471Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4872669Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4872858Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4873088Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4873285Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4873503Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4873702Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4873929Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4874147Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4874346Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4874538Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4874757Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4874962Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4875149Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4875367Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4875593Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4875823Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4876048Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4876266Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4876463Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4876673Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4876873Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4877121Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4877342Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4877542Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4877751Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4877952Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4878180Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4878400Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4878627Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4878852Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4879088Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4879310Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4879538Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4879757Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4879994Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4880250Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4880450Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4880638Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4880860Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4881065Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4881264Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4881478Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4881705Z 
E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4881938Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4882168Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4882391Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4882619Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4882840Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4883072Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4883307Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4883539Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4883761Z 
E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4883964Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4884184Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4884377Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4884591Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4884791Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4885021Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4885241Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4885463Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4885663Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4885863Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4886100Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4886321Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4886550Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4886768Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4886997Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4887218Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4887454Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4887674Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4887903Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4888124Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4888360Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4888580Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4888808Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4889028Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4889256Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4889475Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4889711Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4889933Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4890223Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4890445Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4890643Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4890836Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4891055Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4891286Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4891519Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4891746Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4891967Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4892195Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4892432Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4892666Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4892889Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4893118Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4893340Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4893532Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4893752Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4893991Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4894210Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4894446Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4894670Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4894885Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4895090Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4895287Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4895490Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4895728Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4895948Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4896160Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4896362Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4896569Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4896770Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4897001Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4897221Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4897426Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4897627Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4897819Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4897976Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4898198Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4898389Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4898618Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4898846Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4899068Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4899268Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4899459Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4899678Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4899887Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4900077Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4900337Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4900527Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4900766Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4900958Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4901177Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4901367Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4901591Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4901820Z E1204 11:17:20.296000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4902040Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4902280Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4902500Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4902740Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4902961Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4903189Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4903408Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4903636Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4903859Z E1204 11:17:20.296000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4904069Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4904258Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4904481Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4904695Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4904908Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4905108Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4905310Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4905541Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4905761Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4905993Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4906212Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4906413Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4906632Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4906868Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4907090Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4907318Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4907537Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4907750Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4907952Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4908160Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4908353Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4908544Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4908766Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4909004Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4909225Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4909453Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4909672Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4909861Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4910082Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4910370Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4910604Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4910833Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4911064Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4911255Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4911474Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4911702Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4911921Z E1204 11:17:20.296000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4912150Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4912386Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4912600Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4912806Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4913006Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4913221Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4913449Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4913669Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4913859Z E1204 11:17:20.296000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4914077Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4914310Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4914529Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4914766Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4914985Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4915209Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4915412Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4915610Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4915812Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4916039Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4916260Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4916501Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4916723Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4916951Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4917170Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4917372Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4917592Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4917820Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4918038Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4918266Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4918487Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4918698Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4918910Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4919109Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4919322Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4919553Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4919777Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4920008Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4920273Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4920502Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4920732Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4920963Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4921182Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4921410Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4921643Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4921842Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4922049Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4922242Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4922439Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4922653Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4922862Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4923071Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4923263Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4923469Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4923640Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4923767Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.4923912Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4924016Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4924142Z E1204 11:17:20.296000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4924299Z [W1204 11:17:20.759910941 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.4924303Z 2025-12-04T12:10:21.4924448Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.4924752Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.4925045Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.4925175Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.4925654Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.4925918Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.4926143Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.4926350Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.4926551Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4926781Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4927010Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4927239Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4927471Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4927698Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4927917Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4928143Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4928361Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4928588Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4928820Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4929018Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4929226Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4929430Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4929655Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4929883Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4930072Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4930327Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4930554Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4930772Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4930961Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4931193Z E1204 
11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4931392Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4931580Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4931815Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4932013Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4932201Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4932419Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4932645Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4932865Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4933103Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4933322Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4933521Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4933729Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4933942Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4934169Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4934389Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4934615Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4934833Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4935067Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4935286Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4935522Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4935746Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4935985Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4936204Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4936434Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4936652Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4936876Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4937094Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4937329Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4937548Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4937773Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4937992Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4938232Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4938448Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4938675Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4938890Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4939093Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4939289Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.4939523Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4939741Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4939977Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4940235Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4940464Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4940683Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4940909Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4941127Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4941365Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4941585Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4941813Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4942031Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4942239Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4942431Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4942649Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4942845Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4943052Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.4943254Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4943483Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4943714Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4943910Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4944097Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4944329Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4944525Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4944719Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4944939Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4945168Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4945386Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4945621Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4945839Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4946035Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4946243Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4946450Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4946683Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4946907Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4947111Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4947312Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4947513Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4947746Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4947980Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4948211Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4948442Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4948669Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4948889Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4949119Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4949342Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4949570Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4949798Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4949998Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4950227Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4950450Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4950664Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4950863Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4951063Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4951291Z 
E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4951516Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4951746Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4951967Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4952207Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4952427Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4952667Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4952888Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4953117Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4953335Z 
E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4953538Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4953740Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4953950Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4954159Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.4954359Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4954587Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4954815Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4955018Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4955217Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4955417Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4955645Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4955867Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4956097Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4956326Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4956554Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4956783Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4957011Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4957232Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4957458Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4957678Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4957905Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4958135Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4958366Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4958587Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4958815Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4959044Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4959273Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4959493Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4959720Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4959939Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4960170Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4960372Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4960594Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4960834Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4961054Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4961282Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4961504Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4961731Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4961952Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4962193Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4962416Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4962644Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4962868Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4963079Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4963301Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4963529Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4963748Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4963975Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4964194Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4964409Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4964620Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4964818Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4965117Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4965348Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4965571Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4965784Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4965986Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4966184Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4966397Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4966629Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4966849Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4967052Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4967259Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.4967453Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4967599Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.4967819Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4968010Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4968232Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4968462Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4968681Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4968888Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4969079Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4969309Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4969508Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4969699Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4969920Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4970143Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4970364Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4970566Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4970785Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4970976Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4971196Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4971436Z E1204 11:17:20.299000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4971657Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4971888Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4972110Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4972337Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4972559Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4972787Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4973025Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4973251Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4973485Z E1204 11:17:20.299000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4973685Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.4973874Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4974094Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4974312Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4974517Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4974725Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4974928Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4975156Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4975378Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4975616Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4975835Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4976028Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4976247Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4976479Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4976701Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4976927Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4977158Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4977369Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4977584Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4977785Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4977978Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.4978169Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4978389Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4978620Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4978861Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4979089Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4979308Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.4979498Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4979731Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4979959Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4980211Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4980437Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4980657Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4980847Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4981070Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4981311Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4981530Z E1204 11:17:20.299000 
870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4981774Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4981997Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4982211Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4982415Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4982612Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4982814Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4983055Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4983276Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4983466Z E1204 11:17:20.299000 870247 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4983691Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4983929Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4984149Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4984376Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4984594Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4984809Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4985012Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4985213Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4985425Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4985655Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4985884Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4986112Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4986334Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4986563Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4986783Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4986975Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.4987203Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4987431Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4987649Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4987881Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4988110Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4988324Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.4988527Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.4988725Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.4988926Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.4989154Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.4989375Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4989618Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4989837Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4990077Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4990329Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4990556Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4990776Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4991005Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.4991224Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.4991437Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.4991641Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.4991831Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.4992031Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.4992257Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.4992468Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.4992666Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.4992857Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.4993051Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.4993222Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.4993349Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.4993493Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.4993608Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.4993736Z E1204 11:17:20.299000 870247 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.4993780Z FAILED [2.3644s] [100%] 2025-12-04T12:10:21.4993782Z 2025-12-04T12:10:21.4993844Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.4994022Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.4994072Z Traceback (most recent call last): 2025-12-04T12:10:21.4994237Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.4994280Z method(*args, **kwargs) 2025-12-04T12:10:21.4994432Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.4994474Z method(*args, **kwargs) 2025-12-04T12:10:21.4994624Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.4994663Z with policy(): 2025-12-04T12:10:21.4994813Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.4994855Z raise RuntimeError(msg) 2025-12-04T12:10:21.4995272Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. 
CUDA driver allocated memory was 807403520 and is now 1954545664. 2025-12-04T12:10:21.4995285Z 2025-12-04T12:10:21.4995366Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.4995640Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.4995642Z 2025-12-04T12:10:21.4995734Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.4995817Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.4995863Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.4995935Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.4996494Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.4996603Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.4996643Z graph_break [] 2025-12-04T12:10:21.4996714Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.4996790Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.4997278Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.4997329Z current_size = base.storage().size() 2025-12-04T12:10:21.4997372Z Autotune Choices Stats: 2025-12-04T12:10:21.4997760Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00887999963015318, "best_triton_pos": 0} 2025-12-04T12:10:21.4997830Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.4997884Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.4998018Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.4998262Z triton_mm_34 0.0089 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4998495Z triton_mm_33 0.0091 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.4998724Z triton_mm_16 0.0108 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4998947Z triton_mm_29 0.0114 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4999180Z triton_mm_22 0.0116 ms 76.6% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4999404Z triton_mm_21 0.0117 ms 75.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4999626Z triton_mm_30 0.0118 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.4999850Z triton_mm_23 0.0120 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5000085Z triton_mm_15 0.0128 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5000348Z triton_mm_31 0.0130 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5000483Z SingleProcess AUTOTUNE benchmarking takes 0.1663 seconds and 8.4158 seconds precompiling for 33 choices 2025-12-04T12:10:21.5000642Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.5000690Z Traceback (most recent call last): 2025-12-04T12:10:21.5000850Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.5000895Z 
method(*args, **kwargs) 2025-12-04T12:10:21.5001047Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.5001091Z method(*args, **kwargs) 2025-12-04T12:10:21.5001241Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.5001279Z with policy(): 2025-12-04T12:10:21.5001444Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.5001486Z raise RuntimeError(msg) 2025-12-04T12:10:21.5001901Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:21.5001906Z 2025-12-04T12:10:21.5001988Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.5002259Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.5002262Z 2025-12-04T12:10:21.5002356Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.5002434Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.5002478Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.5002539Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.5003092Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), 
('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.5003211Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.5003250Z graph_break [] 2025-12-04T12:10:21.5003318Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.5003393Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.5003881Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.5003944Z current_size = base.storage().size() 2025-12-04T12:10:21.5003985Z Autotune Choices Stats: 2025-12-04T12:10:21.5004356Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00887999963015318, "best_triton_pos": 0} 2025-12-04T12:10:21.5004422Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.5004474Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.5004596Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.5004837Z triton_mm_34 0.0089 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5005065Z triton_mm_33 0.0091 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5005300Z triton_mm_16 0.0108 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5005526Z triton_mm_29 0.0114 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5005756Z triton_mm_22 0.0116 ms 76.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5005980Z triton_mm_21 0.0117 ms 75.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5006205Z triton_mm_30 0.0118 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5006431Z triton_mm_23 0.0120 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5006657Z triton_mm_15 0.0128 ms 69.2% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5006894Z triton_mm_31 0.0130 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5007024Z SingleProcess AUTOTUNE benchmarking takes 0.1663 seconds and 8.4158 seconds precompiling for 33 choices 2025-12-04T12:10:21.5007103Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.5007146Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.5007202Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.5007307Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.5007792Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.5007841Z graph_break [] 2025-12-04T12:10:21.5007906Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.5007979Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.5008345Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:21.5008440Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.5008485Z Autotune Choices Stats: 2025-12-04T12:10:21.5008854Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008558999747037888, "best_triton_pos": 0} 2025-12-04T12:10:21.5008922Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.5008973Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.5009112Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.5009346Z triton_mm_72 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5009389Z _scaled_mm 0.0096 ms 89.2% 2025-12-04T12:10:21.5009627Z triton_mm_71 0.0098 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5009853Z triton_mm_67 0.0112 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5010080Z triton_mm_60 0.0114 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5010345Z triton_mm_59 0.0116 ms 73.8% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5010570Z triton_mm_68 0.0116 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5010810Z triton_mm_54 0.0116 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5011036Z triton_mm_61 0.0122 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5011267Z triton_mm_53 0.0122 ms 69.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5011409Z SingleProcess AUTOTUNE benchmarking takes 0.2526 seconds and 0.8033 seconds precompiling for 39 choices 2025-12-04T12:10:21.5011470Z =================================== FAILURES =================================== 2025-12-04T12:10:21.5011626Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.5011676Z Traceback (most recent call last): 2025-12-04T12:10:21.5011834Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.5011877Z method(*args, **kwargs) 2025-12-04T12:10:21.5012029Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.5012070Z method(*args, **kwargs) 2025-12-04T12:10:21.5012221Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.5012261Z with policy(): 2025-12-04T12:10:21.5012413Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.5012456Z raise RuntimeError(msg) 2025-12-04T12:10:21.5012871Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:21.5012876Z 2025-12-04T12:10:21.5012957Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.5013228Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.5013244Z 2025-12-04T12:10:21.5013331Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.5013408Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.5013451Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.5013511Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.5014065Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.5014165Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.5014203Z graph_break [] 2025-12-04T12:10:21.5014270Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.5014355Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.5014840Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.5014891Z current_size = base.storage().size() 2025-12-04T12:10:21.5014932Z Autotune Choices Stats: 2025-12-04T12:10:21.5015301Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00887999963015318, "best_triton_pos": 0} 2025-12-04T12:10:21.5015376Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.5015427Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.5015547Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.5015782Z triton_mm_34 0.0089 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:21.5016009Z triton_mm_33 0.0091 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5016233Z triton_mm_16 0.0108 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5016458Z triton_mm_29 0.0114 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5016691Z triton_mm_22 0.0116 ms 76.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5016915Z triton_mm_21 0.0117 ms 75.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5017144Z triton_mm_30 0.0118 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5017372Z triton_mm_23 0.0120 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5017597Z triton_mm_15 0.0128 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5017821Z triton_mm_31 0.0130 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5017950Z SingleProcess AUTOTUNE benchmarking takes 0.1663 seconds and 8.4158 seconds precompiling for 33 choices 2025-12-04T12:10:21.5018025Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.5018081Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.5018137Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.5018236Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.5018719Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.5018757Z graph_break [] 2025-12-04T12:10:21.5018820Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.5018906Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.5019269Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:21.5019363Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.5019405Z Autotune Choices Stats: 2025-12-04T12:10:21.5019769Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008558999747037888, "best_triton_pos": 0} 2025-12-04T12:10:21.5019834Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.5019886Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.5020006Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.5020268Z triton_mm_72 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5020312Z _scaled_mm 0.0096 ms 89.2% 2025-12-04T12:10:21.5020551Z triton_mm_71 0.0098 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5020773Z triton_mm_67 0.0112 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5021010Z triton_mm_60 0.0114 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5021234Z triton_mm_59 0.0116 ms 73.8% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5021457Z triton_mm_68 0.0116 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5021680Z triton_mm_54 0.0116 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5021905Z triton_mm_61 0.0122 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5022144Z triton_mm_53 0.0122 ms 69.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5022273Z SingleProcess AUTOTUNE benchmarking takes 0.2526 seconds and 0.8033 seconds precompiling for 39 choices 2025-12-04T12:10:21.5022347Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.5022389Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.5022445Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.5022543Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.5023048Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 
1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.5023087Z graph_break [] 2025-12-04T12:10:21.5023149Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.5023223Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.5023263Z Autotune Choices Stats: 2025-12-04T12:10:21.5023629Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_110", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009519999846816063, "best_triton_pos": 0} 2025-12-04T12:10:21.5023694Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.5023744Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.5023863Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.5024106Z triton_mm_110 0.0095 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5024333Z triton_mm_109 0.0105 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5025769Z triton_mm_105 0.0109 ms 87.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5025996Z triton_mm_92 0.0120 ms 79.1% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5026219Z triton_mm_97 0.0122 ms 78.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5026448Z triton_mm_99 0.0122 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5026684Z triton_mm_106 0.0122 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5026919Z triton_mm_98 0.0128 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.5027146Z triton_mm_91 0.0129 ms 73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5027372Z triton_mm_107 0.0132 ms 72.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5027503Z SingleProcess AUTOTUNE benchmarking takes 0.2771 seconds and 0.6313 seconds precompiling for 39 choices 2025-12-04T12:10:21.5027694Z - generated xml file: 
/var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6e7f1d2f3686efb0.xml - 2025-12-04T12:10:21.5027758Z =========================== short test summary info ============================ 2025-12-04T12:10:21.5028375Z FAILED [2.3644s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:21.5028379Z 2025-12-04T12:10:21.5028453Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.5028727Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.5028734Z 2025-12-04T12:10:21.5028821Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.5028883Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.5028953Z ================= 1 failed, 187 deselected, 2 rerun in 15.23s ================== 2025-12-04T12:10:21.5029004Z Got exit code 1 2025-12-04T12:10:21.5029045Z Retrying single test... 
2025-12-04T12:10:21.5029189Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-48788c463bb75a05.xml 2025-12-04T12:10:21.5029247Z ============================= test session starts ============================== 2025-12-04T12:10:21.5029360Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.5029462Z cachedir: .pytest_cache 2025-12-04T12:10:21.5029624Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.5029673Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.5029716Z configfile: pytest.ini 2025-12-04T12:10:21.5029881Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.5029958Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.5030253Z stepcurrent: skipping 110 already run items. Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.5030299Z Running 1 items in this shard 2025-12-04T12:10:21.5030301Z 2025-12-04T12:10:21.5030650Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:17:47.920867931 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5030671Z 2025-12-04T12:10:21.5030826Z [W1204 11:17:55.716035052 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5030828Z 2025-12-04T12:10:21.5031143Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.5031438Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5031574Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5032060Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5032314Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5032542Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5032753Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5032957Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5033198Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5033421Z E1204 11:17:55.170000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5033666Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5033905Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5034137Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5034355Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5034584Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5034803Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5035039Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5035260Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5035485Z E1204 11:17:55.170000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5035704Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5035933Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5036153Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5036344Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5036566Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5036795Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5037014Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5037207Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5037434Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5037660Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5037885Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5038136Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5038355Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5038559Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5038772Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5038931Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5039122Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5039649Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp40ih1unk/pk/cpkwh7kldh4znmktanpop3o2j6rkmavnhifjlnwie2xbt56zzixl.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.5039796Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.5040013Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.5040205Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.5040498Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.5040631Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.5040890Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.5041030Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.5041285Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.5041444Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.5041724Z E1204 11:17:55.170000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.5041860Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.5042146Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.5042353Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.5042673Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5042969Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5043101Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5043583Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5043848Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5044075Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5044281Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5044482Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5044710Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5044931Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5045160Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5045381Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5045609Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5045828Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5046064Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5046281Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5046529Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5046748Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5046976Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5047196Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5047425Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5047646Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5047843Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5048063Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5048289Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5048509Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5048698Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5048916Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5049145Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5049362Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5049591Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5049810Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5050026Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5050271Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5050428Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5050634Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5050738Z E1204 11:17:55.170000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.5051049Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. 
OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.5051339Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5051469Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5051950Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5052215Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5052439Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5052645Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5052849Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5053076Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5053294Z E1204 11:17:55.269000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5053521Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5053738Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5053967Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5054186Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5054426Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5054646Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5054900Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5055119Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5055344Z E1204 11:17:55.269000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5055562Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5055788Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5056006Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5056204Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5056426Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5056655Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5056872Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5057062Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5057279Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 
#28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5057506Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5057727Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5057955Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5058175Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5058375Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5058600Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5058760Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5058961Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5059484Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp40ih1unk/r5/cr5ogh5aq2rbxtc3f74mkfjrfhlrj5e435w2lis7izdqcqqjijwt.py, 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.5059631Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.5059844Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.5060001Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.5060314Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.5060461Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.5060721Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.5060862Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.5061121Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.5061279Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.5061546Z E1204 11:17:55.269000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.5061680Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.5061954Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.5062147Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.5062463Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5062765Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5062894Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5063387Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5063654Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5063879Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5064084Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5064287Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5064515Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5064746Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5064974Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5065192Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5065423Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5065646Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5065872Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5066089Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5066316Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5066536Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5066761Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5066989Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5067214Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5067444Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5067645Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5067865Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5068098Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5068315Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5068508Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5068734Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5068962Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5069184Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5069410Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5069629Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5069831Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5070045Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5070238Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5070417Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5070521Z E1204 11:17:55.269000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.5070679Z [W1204 11:17:55.744104195 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.5070683Z 2025-12-04T12:10:21.5070992Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.5071307Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5071437Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5071935Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5072188Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5072417Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5072623Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5072825Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5073063Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5073288Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5073513Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5073732Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5073958Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5074174Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5074401Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5074618Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5074850Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5075068Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5075301Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5075522Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5075756Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5075986Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5076175Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5076393Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5076618Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5076836Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5077037Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5077256Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5077484Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5077701Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5077930Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5078146Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5078348Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5078557Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5078714Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5078894Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5079422Z E1204 11:17:55.283000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp40ih1unk/rg/crglecrd3vrrxbs2dqzmtlex2mzj6sygkctp3cl4xgr2yzt7yheh.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.5079578Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.5079793Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.5079967Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.5080285Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.5080416Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.5080671Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.5080808Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.5081062Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.5081232Z E1204 11:17:55.283000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.5081500Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.5081637Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.5081909Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.5082105Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.5082418Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5082709Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5082837Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5083312Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5083567Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5083807Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5084018Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5084243Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5084471Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5084692Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5084921Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5085138Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5085364Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5085599Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5085829Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5086049Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5086286Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5086505Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5086735Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5086951Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5087180Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5087399Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5087589Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5087807Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5088044Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5088271Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5088484Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5088704Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5088932Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5089148Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5089374Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5089593Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5089808Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5090018Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5090222Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5090403Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5090506Z E1204 11:17:55.283000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.5090665Z [W1204 11:17:55.747816797 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.5090667Z 2025-12-04T12:10:21.5090974Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.5091264Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5091392Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5091868Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5092141Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5092364Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5092581Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5092798Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5093026Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5093247Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5093473Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5093695Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5093942Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5094159Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5094386Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5094603Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5094830Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5095050Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5095278Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5095494Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5095723Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5095942Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5096132Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5096362Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5096588Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5096836Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5097025Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5097249Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5097477Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5097694Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5097921Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5098153Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5098356Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5098564Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5098721Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5098899Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5099428Z E1204 11:17:55.287000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp40ih1unk/sy/csyk6cjft33q2vmdakuxjvkuee6knwuff45z42j4qfh4l64sdxry.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.5099578Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.5099796Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.5099952Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.5100282Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.5100414Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.5100686Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.5100826Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.5101092Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.5101276Z E1204 11:17:55.287000 
876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.5101547Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.5101678Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.5101956Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.5102149Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.5102476Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5102768Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5102896Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5103378Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5103630Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5103856Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5104061Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5104265Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5104494Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5104712Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5104950Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5105167Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5105418Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5105638Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5105863Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5106082Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5106307Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5106529Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5106767Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5106985Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5107212Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5107431Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5107622Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5107838Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5108064Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5108282Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5108472Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5108696Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5108937Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5109157Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5109396Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5109624Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5109826Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5110035Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5110223Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5110400Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5110504Z E1204 11:17:55.287000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.5110682Z [W1204 11:17:55.750517641 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.5110685Z 2025-12-04T12:10:21.5110997Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.5111290Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5111418Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5111895Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5112150Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5112377Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5112582Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5112785Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5113016Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5113253Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5113482Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5113727Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5113954Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5114173Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5114397Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5114616Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5114842Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5115071Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5115298Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5115517Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5115746Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5115963Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5116155Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5116373Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5116602Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5116821Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5117012Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5117229Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5117463Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5117683Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5117934Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5118154Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5118356Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5118566Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5118724Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5118901Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5119435Z E1204 11:17:55.289000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp40ih1unk/37/c37wm22win6ziqpocp4qlhkovxjslc37imuhelrpgvr5yu7mrpl5.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.5119580Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.5119795Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.5119950Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.5120280Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.5120410Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.5120666Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.5120804Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.5121058Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.5121214Z E1204 11:17:55.289000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.5121482Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.5121632Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.5121905Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.5122119Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.5122438Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5122734Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5122863Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5123339Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5123605Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5123830Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5124037Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5124240Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5124466Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5124689Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5124920Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5125138Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5125371Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5125589Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5125827Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5126047Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5126282Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5126511Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5126739Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5126959Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5127188Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5127410Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5127609Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5127828Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5128060Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5128279Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5128471Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5128689Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5128915Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5129133Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5129363Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5129584Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5129786Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5130008Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5130192Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5130385Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5130500Z E1204 11:17:55.289000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.5130659Z [W1204 11:17:55.752252249 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 
2025-12-04T12:10:21.5130661Z 2025-12-04T12:10:21.5130971Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.5131259Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5131391Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5131872Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5132137Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5132362Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5132569Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5132770Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5132996Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5133217Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5133444Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5133662Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5133891Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5134126Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5134354Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5134581Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5134819Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5135040Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 
_PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5135266Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5135484Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5135710Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5135940Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5136129Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5136351Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5136577Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5136796Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5136986Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5137202Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5137430Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5137646Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5137873Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5138090Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5138306Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5140241Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5140406Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5140624Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5141155Z E1204 11:17:55.291000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] for benchmark choice TritonTemplateCaller(/tmp/tmp40ih1unk/2q/c2q3bo3nhpj55xwdb4yaafdidijudkebr3goyjvcwdqwopfuldps.py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4) 2025-12-04T12:10:21.5141301Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.5141519Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/thread.py", line 58, in run 2025-12-04T12:10:21.5141676Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] result = self.fn(*self.args, **self.kwargs) 2025-12-04T12:10:21.5141980Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 3255, in precompile_with_captured_stdout 2025-12-04T12:10:21.5142112Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] choice.precompile() 2025-12-04T12:10:21.5142372Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py", line 2289, in precompile 2025-12-04T12:10:21.5142511Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self.bmreq.precompile() 2025-12-04T12:10:21.5142767Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/autotune_process.py", line 677, in precompile 2025-12-04T12:10:21.5142924Z E1204 11:17:55.291000 876171 
site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] getattr(mod, self.kernel_name).precompile() 2025-12-04T12:10:21.5143190Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 444, in precompile 2025-12-04T12:10:21.5143324Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] self._make_launchers() 2025-12-04T12:10:21.5143596Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/triton_heuristics.py", line 613, in _make_launchers 2025-12-04T12:10:21.5143792Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] raise RuntimeError(f"No valid triton configs. {type(exc).__name__}: {exc}") 2025-12-04T12:10:21.5144109Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] RuntimeError: No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5144416Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5144546Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5145037Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5145303Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5145529Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5145735Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5145939Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5146176Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5146398Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5146625Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5146843Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5147071Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5147291Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5147518Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5147735Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5147964Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5148183Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5148411Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #19 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5148639Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5148864Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5149105Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5149297Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #23 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5149517Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5149742Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5149960Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5150185Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #27 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5150424Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5150650Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5150867Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5151094Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5151312Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5151515Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #33 thread_run from /usr/local/src/conda/python-3.10.14/Modules/_threadmodule.c:1100 2025-12-04T12:10:21.5151728Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #34 pythread_wrapper from /usr/local/src/conda/python-3.10.14/Python/thread_pthread.h:248 2025-12-04T12:10:21.5151886Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #35 start_thread from ./nptl/pthread_create.c:442 2025-12-04T12:10:21.5152065Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] #36 __clone3 from ./misc/../sysdeps/unix/sysv/linux/x86_64/clone3.S:81 2025-12-04T12:10:21.5152167Z E1204 11:17:55.291000 876171 site-packages/torch/_inductor/select_algorithm.py:3323] [0/0] 2025-12-04T12:10:21.5152222Z ('RERUN', {'yellow': True}) [24.7346s] [100%] 2025-12-04T12:10:21.5152571Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:17:57.870690232 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5152574Z 2025-12-04T12:10:21.5152732Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5153027Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 2025-12-04T12:10:21.5153346Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5153477Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5153955Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5154208Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5154433Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5154648Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5154849Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5155075Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5155296Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5155523Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5155741Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5155967Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5156188Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5156417Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5156635Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5156871Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5157087Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5157294Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5157513Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5157715Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5157943Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5158160Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5158350Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5158570Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame 
from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5158807Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5159025Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5159213Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5159434Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5159632Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5159822Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5160040Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5160273Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5160461Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5160684Z E1204 
11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5160914Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5161146Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5161372Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5161612Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5161810Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5162020Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5162220Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5162446Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5162666Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5162903Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5163122Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5163349Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5163567Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5163794Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5164012Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5164239Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5164457Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5164684Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5164902Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5165141Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5165362Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5165598Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5165834Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5166061Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5166278Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5166504Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5166721Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5166957Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5167174Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5167376Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5167573Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5167801Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5168019Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5168244Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5168462Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5168688Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5168906Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5169133Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5169359Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5169586Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5169813Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5170052Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5170314Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5170512Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5170699Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5170918Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5171133Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5171341Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5171539Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5171766Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5171984Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5172183Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5172370Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5172588Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5172783Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5172971Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5173190Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5173415Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5173646Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5173872Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5174117Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5174317Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5174530Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5174729Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5174961Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5175181Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5175394Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5175594Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5175795Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5176022Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5176243Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5176471Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5176693Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5176920Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5177141Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5177369Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5177597Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5177825Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5178054Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5178265Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5178454Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5178675Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5178879Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5179078Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5179279Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5179515Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5179736Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5179963Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5180224Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5180451Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5180671Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5180898Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5181122Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5181353Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5181577Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5181794Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5181993Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5182184Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5182424Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5182625Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5182856Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5183075Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5183278Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5183479Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5183690Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5183919Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5184138Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5184367Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5184588Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5184815Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5185035Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5185261Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5185483Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5185714Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5185943Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5186171Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5186400Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5186636Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5186856Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5187084Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5187302Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5187530Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5187760Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5187987Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5188208Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5188406Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5188598Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5188817Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5189046Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5189265Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5189493Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5189713Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5189941Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5190207Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5190437Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5190669Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5190909Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5191127Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5191318Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5191536Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5191765Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5191997Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5192223Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5192444Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5192658Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5192861Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5193059Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5193261Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5193489Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5193708Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5193921Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5194121Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5194333Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5194532Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5194771Z E1204 
11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5195001Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5195203Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5195402Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5195592Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5195741Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5195960Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5196164Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5196384Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5196611Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5196831Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5197033Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5197222Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5197443Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5197641Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5197830Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5198052Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5198240Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5198474Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5198664Z E1204 11:17:57.426000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5198905Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5199098Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5199322Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5199550Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5199769Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5199998Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5200271Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5200499Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5200718Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5200945Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5201165Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5201393Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5201615Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5201815Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5202004Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5202227Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5202440Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5202653Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5202851Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5203080Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5203309Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5203528Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5203757Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5203977Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5204167Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5204399Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5204627Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5204847Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5205074Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5205293Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5205506Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5205708Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5205909Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5206102Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5206292Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5206512Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5206748Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5206968Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5207217Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5207438Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5207626Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5207845Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5208072Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5208295Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5208533Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5208752Z 
E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5208941Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5209161Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5209388Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5209607Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5209835Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5210055Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5210299Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5210506Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5210703Z E1204 11:17:57.426000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5210916Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5211143Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5211390Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5211582Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5211800Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5212028Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5212246Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5212475Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5212719Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5212936Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5213138Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5213336Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5213536Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5213764Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5213984Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5214213Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5214433Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5214662Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5214883Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5215084Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5215303Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5215553Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5215773Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5215999Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5216219Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5216430Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5216634Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5216844Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5217046Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5217278Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5217498Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5217726Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5217944Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5218171Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5218390Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5218617Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5218840Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5219066Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5219392Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5219591Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5219821Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5220013Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5220244Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5220458Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5220663Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5220862Z E1204 
11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5221076Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5221271Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5221442Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5221568Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 _start from ??:0 2025-12-04T12:10:21.5221713Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5221818Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5221945Z E1204 11:17:57.426000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5222101Z [W1204 11:17:57.892062442 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5222104Z 2025-12-04T12:10:21.5222248Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5222543Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5222836Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5222968Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5223459Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5223712Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5223963Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5224172Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5224373Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5224601Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5224822Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5225049Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5225281Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5225508Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5225727Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5225953Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5226173Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5226402Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5226624Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5226821Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5227029Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5227232Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5227470Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5227689Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5227877Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5228123Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5228350Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5228570Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5228759Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5228977Z E1204 
11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5229174Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5229373Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5229590Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5229786Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5229975Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5230228Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5230454Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5230672Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5230902Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5231126Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5231325Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5231532Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5231745Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5231970Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5232214Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5232442Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5232658Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5232885Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5233102Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5233332Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5233564Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5233790Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5234008Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5234234Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5234451Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5234677Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5234896Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5235122Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5235342Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5235571Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5235804Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5236030Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5236256Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5236492Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5236710Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5236911Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5237106Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5237332Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5237550Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5237789Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5238008Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5238234Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5238453Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5238679Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5238895Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5239120Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5239337Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5239564Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5239783Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5239989Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5240213Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5240445Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5240653Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5240859Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5241059Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5241285Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5241502Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5241700Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5241899Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5242117Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5242315Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5242504Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5242722Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5242948Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5243166Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5243390Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5243610Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5243806Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5244013Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5244224Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5244454Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5244714Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5244915Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5245114Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5245314Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5245542Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5245762Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5246001Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5246220Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5246447Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5246667Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5246897Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5247118Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5247346Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5247565Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5247765Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5247954Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5248175Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5248387Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5248585Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5248808Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5249040Z 
E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5249261Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5249488Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5249708Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5249935Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5250234Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5250462Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5250680Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5250908Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5251127Z 
E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5251331Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5251530Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5251722Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5251933Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5252132Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5252361Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5252594Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5252797Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5253020Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5253222Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5253449Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5253671Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5253899Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5254119Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5254362Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5254582Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5254811Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5255031Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5255258Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5255480Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5255709Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5255933Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5256165Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5256386Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5256632Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5256851Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5257094Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5257323Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5257551Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5257774Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5257972Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5258167Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5258388Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5258626Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5258845Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5259072Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5259294Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5259520Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5259742Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5259970Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5260244Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5260474Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5260695Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5260905Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5261124Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5261366Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5261598Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5261827Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5262045Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5262258Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5262462Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5262673Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5262875Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5263104Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5263323Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5263538Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5263740Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5263939Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5264139Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5264369Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5264589Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5264791Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5265005Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5265199Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5265347Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5265588Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5265779Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5266000Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5266228Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5266446Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5266646Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5266848Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5267067Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5267266Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5267456Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5267676Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5267866Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5268085Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5268276Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5268495Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5268685Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5268905Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5269145Z E1204 11:17:57.431000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5269365Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5269606Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5269837Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5270064Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5270319Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5270544Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5270765Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5271007Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5271227Z E1204 11:17:57.431000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5271424Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5271614Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5271838Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5272052Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5272254Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5272452Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5272652Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5272881Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5273100Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5273341Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5273560Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5273778Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5274000Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5274227Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5274447Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5274673Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5274893Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5275122Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5275324Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5275523Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5275713Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5275904Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5276124Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5276355Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5276573Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5276801Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5277021Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5277211Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5277440Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5277667Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5277896Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5278135Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5278357Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5278549Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5278770Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5278998Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5279228Z E1204 11:17:57.431000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5279456Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5279675Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5279888Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5280124Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5280323Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5280523Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5280753Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5280977Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5281166Z E1204 11:17:57.431000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5281386Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5281628Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5281847Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5282088Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5282321Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5282534Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5282735Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5282937Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5283142Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5283382Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5283601Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5283828Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5284048Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5284276Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5284496Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5284685Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5284904Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5285131Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5285355Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5285584Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5285816Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5286028Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5286248Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5286447Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5286648Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5286877Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5287097Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5287325Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5287557Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5287786Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5288004Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5288232Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5288452Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5288679Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5288898Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5289094Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5289298Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5289487Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5289688Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5289915Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5290167Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5290377Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5290585Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5290777Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5290947Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5291072Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5291215Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5291320Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5291445Z E1204 11:17:57.431000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5291613Z [W1204 11:17:57.895099733 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5291615Z 2025-12-04T12:10:21.5291758Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5292058Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5292351Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5292480Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5292958Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5293212Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5293438Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5293644Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5293844Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5294084Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5294308Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5294555Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5294775Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5295001Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5295218Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5295443Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5295662Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5295897Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5296117Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5296317Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5296529Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5296732Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5296962Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5297183Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5297371Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5297592Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5297819Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5298036Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5298235Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5298452Z E1204 
11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5298678Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5298869Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5299088Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5299286Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5299472Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5299692Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5299930Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5300189Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5300415Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5300634Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5300832Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5301042Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5301243Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5301469Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5301687Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5301914Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5302133Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5302371Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5302588Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5302838Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5303055Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5303282Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5303503Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5303729Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5303948Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5304187Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5304405Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5304631Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5304849Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5305075Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5305294Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5305521Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5305742Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5305970Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5306187Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5306388Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5306594Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5306821Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5307067Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5307293Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5307511Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5307736Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5307957Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5308183Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5308412Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5308640Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5308856Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5309084Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5309301Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5309498Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5309685Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5309902Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5310136Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5310346Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5310546Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5310785Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5311005Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5311226Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5311414Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5311632Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5311826Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5312014Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5312233Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5312473Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5312691Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5312917Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5313135Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5313330Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5313538Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5313737Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5313967Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5314187Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5314391Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5314590Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5314806Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5315038Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5315278Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5315506Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5315726Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5315954Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5316178Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5316405Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5316635Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5316861Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5317087Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5317286Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5317475Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5317695Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5317896Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5318095Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5318295Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5318524Z 
E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5318744Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5318982Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5319204Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5319451Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5319672Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5319899Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5320148Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5320377Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5320596Z 
E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5320810Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5321008Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5321199Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5321410Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5321616Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5321845Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5322065Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5322266Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5322464Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5322664Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5322892Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5323130Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5323360Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5323604Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5323834Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5324053Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5324283Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5324503Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5324733Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5324962Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5325191Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5325413Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5325643Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5325864Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5326094Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5326317Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5326544Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5326764Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5326992Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5327221Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5327420Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5327611Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5327850Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5328079Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5328300Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5328528Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5328747Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5328975Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5329208Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5329434Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5329655Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5329882Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5330143Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5330333Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5330554Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5330787Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5331009Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5331237Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5331467Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5331681Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5331893Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5332104Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5332305Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5332533Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5332752Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5332966Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5333181Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5333377Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5333578Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5333805Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5334025Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5334229Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5334425Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5334617Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5334763Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5334986Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5335182Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5335401Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5335638Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5335857Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5336075Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5336265Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5336484Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5336683Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5336871Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5337093Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5337302Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5337524Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5337713Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5337933Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5338124Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5338343Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5338572Z E1204 11:17:57.434000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5338792Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5339021Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5339240Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5339469Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5339700Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5339927Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5340208Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5340449Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5340669Z E1204 11:17:57.434000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5340866Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5341055Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5341277Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5341504Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5341707Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5341909Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5342110Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5342338Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5342560Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5342790Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5343010Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5343202Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5343421Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5343650Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5343881Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5344111Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5344342Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5344564Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5344766Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5344965Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5345155Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5345344Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5345578Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5345805Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5346031Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5346258Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5346480Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5346671Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5346890Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5347118Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5347338Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5347566Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5347786Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5347984Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5348204Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5348441Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5348673Z E1204 11:17:57.434000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5348905Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5349125Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5349337Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5349539Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5349752Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5349953Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5350199Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5350418Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5350608Z E1204 11:17:57.434000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5350831Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5351062Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5351281Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5351509Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5351728Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5351942Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5352158Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5352358Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5352568Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5352816Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5353041Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5353272Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5353492Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5353720Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5353952Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5354141Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5354364Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5354593Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5354814Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5355043Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5355262Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5355476Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5355680Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5355882Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5356084Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5356322Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5356542Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5356777Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5357009Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5357234Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5357455Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5357686Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5357908Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5358148Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5358368Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5358566Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5358768Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5358958Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5359157Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5359370Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5359576Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5359773Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5359968Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5360206Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5360397Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5360524Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5360668Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5360770Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5360921Z E1204 11:17:57.434000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5361079Z [W1204 11:17:57.937772524 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5361081Z 2025-12-04T12:10:21.5361223Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5361517Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5361807Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5361939Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5362433Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5362684Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5362912Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5363117Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5363318Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5363547Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5363767Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5363994Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5364214Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5364444Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5364673Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5364898Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5365142Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5365369Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5365587Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5365785Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5365991Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5366193Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5366429Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5366651Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5366841Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5367061Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5367288Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5367507Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5367696Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5367913Z E1204 
11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5368109Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5368298Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5368517Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5368721Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5368912Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5369149Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5369388Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5369606Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5369833Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5370051Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5370291Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5370513Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5370711Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5370938Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5371156Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5371385Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5371604Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5371831Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5372049Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5372275Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5372492Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5372719Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5372949Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5373177Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5373419Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5373648Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5373870Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5374095Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5374313Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5374539Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5374768Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5374995Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5375214Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5375443Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5375661Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5375863Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5376060Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5376288Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5376505Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5376735Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5376953Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5377187Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5377405Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5377649Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5377868Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5378094Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5378315Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5378544Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5378762Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5378970Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5379160Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5379377Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5379573Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5379782Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5379983Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5380241Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5380462Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5380659Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5380847Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5381064Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5381275Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5381462Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5381693Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5381934Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5382150Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5382377Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5382596Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5382797Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5383016Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5383214Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5383444Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5383663Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5383867Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5384066Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5384269Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5384499Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5384720Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5384954Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5385174Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5385418Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5385637Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5385885Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5386105Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5386332Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5386554Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5386752Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5386943Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5387180Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5387383Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5387581Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5387782Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5388012Z 
E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5388234Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5388463Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5388683Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5388911Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5389133Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5389361Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5389595Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5389823Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5390064Z 
E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5390297Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5390495Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5390690Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5390898Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5391099Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5391336Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5391559Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5391763Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5391962Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5392164Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5392392Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5392612Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5392841Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5393061Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5393294Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5393514Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5393755Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5393976Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5394231Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5394450Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5394677Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5394897Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5395122Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5395344Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5395584Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5395806Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5396032Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5396254Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5396483Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5396702Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5396901Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5397090Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5397312Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5397539Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5397758Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5397997Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5398215Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5398462Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5398683Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5398910Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5399130Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5399358Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5399578Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5399782Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5400002Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5400259Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5400482Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5400710Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5400931Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5401145Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5401346Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5401546Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5401747Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5401986Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5402206Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5402429Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5402641Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5402840Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5403041Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5403268Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5403489Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5403692Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5403903Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5404095Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5404242Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5404462Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5404651Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5404872Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5405101Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5405320Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5405519Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5405709Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5405928Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5406134Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5406324Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5406554Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5406753Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5406975Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5407163Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5407385Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5407575Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5407797Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5408036Z E1204 11:17:57.476000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5408255Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5408483Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5408703Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5408931Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5409150Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5409379Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5409599Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5409828Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5410050Z E1204 11:17:57.476000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5410306Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5410495Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5410727Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5410951Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5411154Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5411352Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5411552Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5411779Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5412019Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5412247Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5412467Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5412656Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5412876Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5413104Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5413323Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5413550Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5413768Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5413982Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5414186Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5414395Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5414587Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5414784Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5415018Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5415245Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5415466Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5415693Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5415912Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5416115Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5416334Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5416565Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5416784Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5417012Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5417233Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5417421Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5417641Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5417866Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5418086Z E1204 11:17:57.476000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5418313Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5418541Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5418755Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5418967Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5419177Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5419378Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5419608Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5419828Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5420018Z E1204 11:17:57.476000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5420266Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5420509Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5420730Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5420956Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5421179Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5421393Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5421595Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5421794Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5421994Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5422222Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5422442Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5422682Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5422901Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5423141Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5423377Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5424859Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5425087Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5425314Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5425540Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5425786Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5426005Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5426219Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5426420Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5426625Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5426826Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5427057Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5427277Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5427505Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5427730Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5427962Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5428193Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5428419Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5428650Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5428890Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5429112Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5429314Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5429517Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5429709Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5429917Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5430166Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5430374Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5430571Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5430765Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5430957Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5431129Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5431258Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5431404Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5431506Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5431634Z E1204 11:17:57.476000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5431793Z [W1204 11:17:57.940347440 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5431797Z 2025-12-04T12:10:21.5431939Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5432252Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5432544Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5432709Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5433188Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5433444Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5433670Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5433878Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5434092Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5434320Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5434540Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5434768Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5434987Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5435215Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5435433Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5435659Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5435881Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5436109Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5436328Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5436536Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5436744Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5436964Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5437193Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5437410Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5437600Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5437819Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5438046Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5438277Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5438466Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5438683Z E1204 
11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5438879Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5439069Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5439288Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5439485Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5439672Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5439889Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5440158Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5440380Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5440621Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5440839Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5441047Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5441267Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5441467Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5441694Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5441911Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5442139Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5442373Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5442598Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5442821Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5443047Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5443266Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5443492Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5443711Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5443937Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5444155Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5444382Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5444599Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5444839Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5445060Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5445307Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5445526Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5445752Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5445970Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5446195Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5446414Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5446630Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5446826Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5447053Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5447275Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5447503Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5447724Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5447953Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5448171Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5448396Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5448617Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5448841Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5449075Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5449301Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5449540Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5449740Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5449929Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5450187Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5450382Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5450591Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5450805Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5451033Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5451250Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5451447Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5451636Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5451857Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5452055Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5452241Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5452460Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5452688Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5452906Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5453146Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5453364Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5453575Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5453794Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5453994Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5454227Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5454446Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5454650Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5454860Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5455060Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5455288Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5455507Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5455737Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5455959Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5456187Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5456408Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5456638Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5456860Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5457088Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5457320Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5457519Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5457720Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5457949Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5458152Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5458350Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5458551Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5458784Z 
E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5459016Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5459243Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5459463Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5459691Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5459911Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5460173Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5460394Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5460619Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5460843Z 
E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5461048Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5461250Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5461453Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5461668Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5461880Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5462121Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5462342Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5462544Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5462742Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5462944Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5463191Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5463413Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5463640Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5463860Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5464088Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5464308Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5464535Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5464755Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5464984Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5465203Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5465433Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5465665Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5465895Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5466138Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5466366Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5466586Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5466812Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5467032Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5467260Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5467492Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5467696Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5467886Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5468106Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5468334Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5468555Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5468782Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5469001Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5469229Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5469448Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5469675Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5469908Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5470181Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5470429Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5470620Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5470841Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5471070Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5471290Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5471517Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5471748Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5471961Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5472165Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5472369Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5472571Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5472798Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5473017Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5473230Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5473432Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5473633Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5473833Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5474070Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5474290Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5474513Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5474713Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5474905Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5475052Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5475272Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5475463Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5475692Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5475918Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5476138Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5476335Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5476526Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5476748Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5476946Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5477135Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5477354Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5477545Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5477764Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5477962Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5478181Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5478370Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5478614Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5478842Z E1204 11:17:57.479000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5479067Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5479294Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5479516Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5479744Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5479976Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5480237Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5480456Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5480685Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5480905Z E1204 11:17:57.479000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5481104Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5481295Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5481513Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5481728Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5481930Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5482143Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5482343Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5482583Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5482817Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5483045Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5483265Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5483457Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5483682Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5483912Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5484146Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5484375Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5484594Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5484807Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5485009Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5485207Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5485400Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5485588Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5485814Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5486042Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5486271Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5486498Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5486727Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5486925Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5487145Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5487373Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5487592Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5487820Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5488043Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5488245Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5488465Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5488692Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5488913Z E1204 11:17:57.479000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5489139Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5489359Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5489571Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5489772Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5489971Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5490205Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5490450Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5490669Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5490859Z E1204 11:17:57.479000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5491103Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5491331Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5491551Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5491777Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5491997Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5492210Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5492423Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5492625Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5492829Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5493057Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5493277Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5493503Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5493722Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5493950Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5494170Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5494365Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5494594Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5494822Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5495060Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5495299Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5495521Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5495733Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5495936Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5496135Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5496336Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5496573Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5496793Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5497024Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5497250Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5497478Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5497699Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5497925Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5498145Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5498372Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5498594Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5498804Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5499006Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5499209Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5499420Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5499636Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5499843Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5500040Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5500247Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5500440Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5500625Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5500750Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5500894Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5500996Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5501121Z E1204 11:17:57.479000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5501278Z [W1204 11:17:57.942927997 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5501281Z 2025-12-04T12:10:21.5501426Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5501723Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5502015Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5502145Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5502621Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5502888Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5503114Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5503330Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5503542Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5503773Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5503995Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5504222Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5504443Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5504679Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5504898Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5505124Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5505342Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5505569Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5505788Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5505988Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5506199Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5506399Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5506627Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5506845Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5507044Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5507262Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5507497Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5507724Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5507912Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5508130Z E1204 
11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5508329Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5508520Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5508747Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5508943Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5509134Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5509350Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5509579Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5509798Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5510026Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5510280Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5510477Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5510689Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5510889Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5511138Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5511355Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5511594Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5511824Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5512052Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5512271Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5512497Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5512715Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5512957Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5513175Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5513400Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5513618Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5513846Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5514064Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5514290Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5514508Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5514734Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5514953Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5515181Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5515412Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5515638Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5515875Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5516077Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5516272Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5516499Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5516717Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5516944Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5517172Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5517403Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5517622Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5517849Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5518066Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5518294Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5518512Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5518737Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5518955Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5519152Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5519340Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5519567Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5519765Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5519991Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5520220Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5520447Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5520665Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5520862Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5521052Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5521285Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5521481Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5521669Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5521889Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5522118Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5522338Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5522565Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5522783Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5522979Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5523187Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5523387Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5523637Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5523857Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5524072Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5524288Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5524492Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5524720Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5524940Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5525167Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5525402Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5525630Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5525849Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5526077Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5526297Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5526528Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5526750Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5526948Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5527138Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5527358Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5527561Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5527769Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5527970Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5528208Z 
E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5528440Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5528671Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5528892Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5529120Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5529341Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5529579Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5529798Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5530026Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5530278Z 
E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5530481Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5530680Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5530873Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5531087Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5531287Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5531517Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5531738Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5531953Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5532151Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5532363Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5532603Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5532823Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5533051Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5533273Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5533502Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5533735Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5533961Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5534182Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5534407Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5534631Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5534858Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5535079Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5535307Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5535528Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5535758Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5535978Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5536217Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5536438Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5536686Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5536907Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5537105Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5537296Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5537518Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5537749Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5537981Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5538208Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5538429Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5538657Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5538877Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5539107Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5539327Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5539554Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5539774Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5539967Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5540218Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5540461Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5540684Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5540943Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5541164Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5541378Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5541580Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5541779Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5541980Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5542220Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5542443Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5542658Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5542859Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5543058Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5543258Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5543487Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5543707Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5543909Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5544109Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5544300Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5544458Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5544680Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5544879Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5545111Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5545339Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5545559Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5545756Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5545947Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5546178Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5546376Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5546565Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5546786Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5546980Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5547201Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5547390Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5547609Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5547798Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5548019Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5548247Z E1204 11:17:57.482000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5548467Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5548705Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5548924Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5549173Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5549394Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5549622Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5549841Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5550070Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5550335Z E1204 11:17:57.482000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5550551Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5550741Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5550960Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5551175Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5551380Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5551580Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5551781Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5552011Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5552231Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5552460Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5552691Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5552880Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5553100Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5553352Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5553573Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5553805Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5554024Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5554238Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5554439Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5554652Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5554845Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5555035Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5555257Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5555485Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5555707Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5555938Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5556162Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5556351Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5556572Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5556799Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5557029Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5557257Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5557502Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5557694Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5557913Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5558145Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5558367Z E1204 11:17:57.482000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5558595Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5558827Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5559039Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5559241Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5559439Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5559641Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5559868Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5560087Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5560314Z E1204 11:17:57.482000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5560536Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5560764Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5560983Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5561224Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5561443Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5561680Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5561883Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5562082Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5562282Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5562510Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5562735Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5562981Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5563200Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5563428Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5563649Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5563839Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5564057Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5564285Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5564505Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5564733Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5564958Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5565185Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5565387Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5565584Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5565808Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5566040Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5566261Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5566491Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5566711Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5566940Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5567171Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5567403Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5567625Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5567854Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5568076Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5568274Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5568478Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5568669Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5568868Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5569082Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5569289Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5569498Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5569692Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5569902Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5570073Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5570235Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5570380Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5570481Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5570606Z E1204 11:17:57.482000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5570659Z ('RERUN', {'yellow': True}) [2.4968s] [100%] 2025-12-04T12:10:21.5571012Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda [W1204 11:17:59.175639563 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5571028Z 2025-12-04T12:10:21.5571172Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5571469Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5571763Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5571896Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5572374Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5572626Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5572853Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5573060Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5573262Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5573508Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5573728Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5573974Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5574207Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5574435Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5574653Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5574879Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5575099Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5575334Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5575556Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5575754Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5575962Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5576162Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5576391Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5576609Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5576797Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5577016Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5577242Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5577462Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5577660Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5577877Z E1204 
11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5578084Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5578282Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5578504Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5578702Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5578890Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5579108Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5579337Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5579567Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5579792Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5580010Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5580240Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5580449Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5580648Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5580877Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5581097Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5581323Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5581543Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5581779Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5581998Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5582235Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5582464Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5582691Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5582910Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5583137Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5583355Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5583594Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5583812Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5584038Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5584256Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5584484Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5584703Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5584928Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5585147Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5585378Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5585598Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5585804Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5586011Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5586240Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5586467Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5586704Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5586921Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5587147Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5587364Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5587593Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5587827Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5588053Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5588272Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5588498Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5588716Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5588913Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5589102Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5589319Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5589515Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5589722Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5589926Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5590202Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5590420Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5590627Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5590828Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5591046Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5591243Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5591430Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5591649Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5591877Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5592107Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5592338Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5592556Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5592757Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5592966Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5593163Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5593397Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5593617Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5593824Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5594024Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5594235Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5594466Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5594697Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5594934Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5595154Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5595384Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5595602Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5595833Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5596064Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5596292Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5596512Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5596713Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5596906Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5597126Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5597327Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5597526Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5597725Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5597955Z 
E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5598175Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5598413Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5598634Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5598868Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5599101Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5599328Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5599548Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5599774Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5599995Z 
E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5600250Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5600447Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5600639Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5600848Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5601049Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5601279Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5601500Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5601704Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5601902Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5602104Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5602331Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5602566Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5602792Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5603024Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5603268Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5603489Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5603720Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5603939Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5604167Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5604399Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5604626Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5604846Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5605071Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5605292Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5605520Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5605744Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5605972Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5606193Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5606420Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5606639Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5606850Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5607039Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5607267Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5607504Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5607724Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5607953Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5608174Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5608404Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5608633Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5608862Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5609083Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5609311Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5609533Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5609724Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5609945Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5610203Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5610428Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5610657Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5610878Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5611109Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5611311Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5611535Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5611736Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5611963Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5612187Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5612399Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5612607Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5612818Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5613018Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5613245Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5613467Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5613669Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5613867Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5614061Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5614209Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5614429Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5614620Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5614843Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5615085Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5615305Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5615514Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5615712Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5615935Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5616132Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5616321Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5616542Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5616732Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5616966Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5617159Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5617380Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5617569Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5617790Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5618019Z E1204 11:17:59.714000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5618239Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5618467Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5618687Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5618914Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5619133Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5619375Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5619597Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5619850Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5620072Z E1204 11:17:59.714000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5620331Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5620521Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5620741Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5620954Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5621175Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5621373Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5621576Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5621808Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5622029Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5622257Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5622477Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5622666Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5622886Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5623115Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5623333Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5623574Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5623794Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5624035Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5624240Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5624438Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5624630Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5624819Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5625041Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5625277Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5625498Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5625727Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5625947Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5626140Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5626361Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5626590Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5626809Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5627037Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5627258Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5627447Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5627675Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5627903Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5628152Z E1204 11:17:59.714000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5628381Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5628603Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5628817Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5629018Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5629217Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5629430Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5629659Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5629879Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5630070Z E1204 11:17:59.714000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5630342Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5630569Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5630792Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5631018Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5631241Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5631455Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5631655Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5631869Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5632069Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5632325Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5632545Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5632773Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5632995Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5633224Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5633445Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5633647Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5633867Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5634094Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5634313Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5634542Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5634760Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5634975Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5635180Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5635385Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5635586Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5635816Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5636055Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5636282Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5636527Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5636754Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5636976Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5637202Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5637424Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5637654Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5637883Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5638083Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5638285Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5638477Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5638673Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5638890Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5639098Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5639294Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5639487Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5639682Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5639854Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5639991Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5640178Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5640282Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5640425Z E1204 11:17:59.714000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5640595Z [W1204 11:17:59.177855644 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5640598Z 2025-12-04T12:10:21.5640740Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5641036Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5641326Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5641457Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5641936Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5642204Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5642429Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5642634Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5642835Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5643062Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5643283Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5643511Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5643730Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5643956Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5644186Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5644419Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5644648Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5644886Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5645104Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5645301Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5645508Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5645708Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5645945Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5646163Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5646353Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5646572Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5646799Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5647019Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5647205Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5647423Z E1204 
11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5647619Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5647808Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5648026Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5648232Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5648420Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5648637Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5648889Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5649108Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5649335Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5649553Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5649749Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5649957Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5650202Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5650429Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5650647Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5650875Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5651096Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5651323Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5651541Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5651766Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5651986Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5652212Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5652442Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5652668Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5652898Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5653137Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5653357Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5653587Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5653805Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5654030Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5654263Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5654488Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5654706Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5654931Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5655150Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5655351Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5655547Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5655776Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5655995Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5656223Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5656442Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5656678Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5656898Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5657134Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5657367Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5657593Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5657813Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5658040Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5658259Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5658467Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5658654Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5658873Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5659067Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5659277Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5659477Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5659706Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5659924Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5660159Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5660355Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5660575Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5660789Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5660976Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5661193Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5661446Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5661665Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5661892Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5662109Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5662307Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5662514Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5662728Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5662959Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5663184Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5663386Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5663583Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5663783Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5664015Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5664236Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5664466Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5664687Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5664925Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5665144Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5665380Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5665614Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5665841Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5666064Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5666261Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5666452Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5666671Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5666888Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5667088Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5667290Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5667518Z 
E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5667739Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5667968Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5668186Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5668413Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5668633Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5668862Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5669091Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5669319Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5669551Z 
E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5669765Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5669966Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5670188Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5670397Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5670598Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5670824Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5671070Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5671272Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5671474Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5671677Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5671905Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5672126Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5672353Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5672572Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5672800Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5673022Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5673272Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5673493Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5673744Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5673985Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5674213Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5674434Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5674661Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5674883Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5675121Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5675344Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5675571Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5675791Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5676021Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5676244Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5676444Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5676634Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5676855Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5677082Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5677302Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5677542Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5677763Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5678001Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5678233Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5678462Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5678683Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5678910Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5679131Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5679332Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5679553Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5679778Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5679999Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5680256Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5680478Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5680695Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5680897Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5681099Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5681301Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5681529Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5681859Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5682075Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5682311Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5682511Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5682710Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5682940Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5683161Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5683363Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5683575Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5683767Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5683913Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5684140Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5684332Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5684556Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5684783Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5685006Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5685209Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5685405Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5685628Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5685826Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5686029Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5686248Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5686462Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5686682Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5686873Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5687093Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5687281Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5687506Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5687748Z E1204 11:17:59.717000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5687968Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5688195Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5688416Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5688645Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5688866Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5689095Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5689314Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5689543Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5689767Z E1204 11:17:59.717000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5689966Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5690217Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5690437Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5690685Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5690889Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5691089Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5691290Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5691516Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5691737Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5691979Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5692208Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5692397Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5692619Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5692850Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5693068Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5693298Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5693515Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5693729Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5693932Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5694132Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5694338Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5694530Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5694773Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5695000Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5695220Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5695447Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5695666Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5695858Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5696093Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5696322Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5696545Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5696776Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5696997Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5697188Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5697411Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5697638Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5697859Z E1204 11:17:59.717000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5698087Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5698308Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5698565Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5698770Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5698992Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5699193Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5699430Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5699652Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5699841Z E1204 11:17:59.717000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5700064Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5700458Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5700678Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5700904Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5701124Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5701339Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5701543Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5701744Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5701944Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5702173Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5702395Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5702623Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5702867Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5703096Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5703346Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5703539Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5703765Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5703992Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5704213Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5704439Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5704676Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5704889Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5705090Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5705289Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5705491Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5705718Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5705939Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5706166Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5706387Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5706613Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5706833Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5707070Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5707290Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5707539Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5707762Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5709280Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5709488Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5709682Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5709879Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5710174Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5710381Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5710580Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5710773Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5710967Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5711143Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5711269Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5711416Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5711520Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5711648Z E1204 11:17:59.717000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5711810Z [W1204 11:17:59.179913017 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5711814Z 2025-12-04T12:10:21.5711959Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5712256Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5712573Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5712706Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5713219Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5713475Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5713702Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5713908Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5714111Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5714384Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5714606Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5714834Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5715058Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5715285Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5715503Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5715730Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5715947Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5716178Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5716398Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5716611Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5716821Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5717028Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5717268Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5717487Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5717681Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5717898Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5718124Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5718343Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5718542Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5718764Z E1204 
11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5718960Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5719149Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5719367Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5719563Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5719750Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5719967Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5720240Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5720458Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5720685Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5720920Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5721120Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5721353Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5721553Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5721780Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5721997Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5722224Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5722441Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5722684Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5722903Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5723133Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5723353Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5723581Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5723798Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5724023Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5724240Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5724472Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5724689Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5724927Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5725146Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5725381Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5725614Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5725846Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5726064Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5726291Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5726511Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5726723Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5726919Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5727145Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5727368Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5727596Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5727814Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5728045Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5728261Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5728488Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5728707Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5728933Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5729162Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5729387Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5729618Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5729836Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5730027Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5730283Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5730478Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5730686Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5730907Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5731138Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5731355Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5731552Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5731740Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5731960Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5732155Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5732347Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5732567Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5732795Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5733013Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5733253Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5733475Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5733674Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5733905Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5734105Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5734335Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5734557Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5734761Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5734959Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5735168Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5735399Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5735623Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5735851Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5736073Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5736302Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5736523Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5736752Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5736974Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5737202Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5737434Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5737633Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5737824Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5738062Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5738266Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5738465Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5738665Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5738893Z 
E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5739116Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5739354Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5739573Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5739800Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5740019Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5740283Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5740501Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5740731Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5740953Z 
E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5741156Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5741356Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5741548Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5741786Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5741986Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5742253Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5742476Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5742679Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5742879Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5743078Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5743307Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5743551Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5743781Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5744002Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5744229Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5744450Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5744677Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5744897Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5745125Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5745347Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5745578Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5745808Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5746041Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5746281Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5746524Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5746744Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5746970Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5747190Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5747419Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5747655Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5747852Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5748046Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5748273Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5748503Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5748726Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5748954Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5749174Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5749401Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5749622Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5749849Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5750080Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5750351Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5750582Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5750790Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5751010Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5751240Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5751461Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5751689Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5751933Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5752145Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5752350Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5752550Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5752753Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5752983Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5753202Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5753417Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5753618Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5753817Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5754016Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5754256Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5754478Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5754689Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5754902Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5755096Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5755246Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5755465Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5755655Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5755876Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5756115Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5756336Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5756533Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5756726Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5756949Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5757148Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5757345Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5757564Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5757753Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5757973Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5758164Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5758392Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5758583Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5758826Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5759054Z E1204 11:17:59.719000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5759275Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5759506Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5759728Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5759957Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5760223Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5760451Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5760669Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5760896Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5761115Z E1204 11:17:59.719000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5761316Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5761504Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5761727Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5761945Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5762147Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5762346Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5762575Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5762803Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5763052Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5763284Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5763503Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5763692Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5763913Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5764145Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5764379Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5764606Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5764827Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5765043Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5765244Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5765443Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5765634Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5765824Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5766045Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5766280Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5766503Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5766744Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5766964Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5767172Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5767394Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5767623Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5767843Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5768070Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5768290Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5768493Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5768715Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5768942Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5769163Z E1204 11:17:59.719000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5769391Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5769612Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5769824Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5770029Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5770264Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5770467Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5770696Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5770933Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5771124Z E1204 11:17:59.719000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5771370Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5771601Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5771819Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5772047Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5772267Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5772482Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5772697Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5772895Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5773097Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5773328Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5773548Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5773778Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5773997Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5774227Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5774446Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5774723Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5774942Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5775180Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5775400Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5775654Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5775878Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5776090Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5776295Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5776492Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5776698Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5776944Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5777163Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5777393Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5777613Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5777846Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5778066Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5778295Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5778515Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5778743Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5778963Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5779160Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5779375Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5779565Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5779783Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5780001Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5780250Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5780448Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5780642Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5780837Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5781027Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5781158Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5781304Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5781408Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5781533Z E1204 11:17:59.719000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5781689Z [W1204 11:17:59.221644290 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5781692Z 2025-12-04T12:10:21.5781836Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5782133Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5782432Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5782562Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5783041Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5783299Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5783545Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5783750Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5783978Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5784206Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5784428Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5784658Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5784879Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5785106Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5785338Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5785563Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5785784Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5786012Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5786230Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5786431Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5786638Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5786842Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5787074Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5787291Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5787481Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5787708Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5787937Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5788176Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5788367Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5788586Z E1204 
11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5788783Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5788972Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5789193Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5789400Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5789588Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5789808Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5790034Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5790709Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5790939Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5791157Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5791356Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5791563Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5791771Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5792000Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5792235Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5792461Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5792717Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5792948Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5793166Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5793393Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5793612Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5793838Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5794073Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5794300Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5794518Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5794745Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5794965Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5795191Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5795409Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5795635Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5795855Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5796083Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5796314Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5796543Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5796773Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5796986Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5797187Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5797413Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5797632Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5797860Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5798079Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5798315Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5798533Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5798763Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5798985Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5799213Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5799430Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5799655Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5799874Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5800071Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5800294Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5800525Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5800722Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5800944Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5801158Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5801385Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5801603Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5801800Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5801987Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5802208Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5802416Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5802605Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5802823Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5803050Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5803272Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5803500Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5803719Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5803915Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5804124Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5804326Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5804555Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5804784Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5804986Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5805204Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5805408Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5805639Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5805860Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5806088Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5806308Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5806546Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5806767Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5806993Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5807213Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5807442Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5807665Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5807866Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5808056Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5808277Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5808480Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5808676Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5808886Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5809114Z 
E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5809363Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5809592Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5809813Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5810044Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5810300Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5810529Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5810764Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5810993Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5811212Z 
E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5811414Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5811614Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5811805Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5812015Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5812218Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5812448Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5812668Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5812870Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5813081Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5813282Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5813543Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5813763Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5813991Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5814210Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5814438Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5814660Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5814899Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5815121Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5815347Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5815567Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5815795Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5816014Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5816243Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5816461Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5816690Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5816913Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5817150Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5817370Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5817608Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5817839Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5818038Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5818232Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5818452Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5818680Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5818899Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5819140Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5819362Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5819589Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5819810Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5820038Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5820302Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5820529Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5820751Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5820948Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5821170Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5821418Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5821638Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5821878Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5822112Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5822326Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5822531Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5822730Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5822931Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5823174Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5823395Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5823614Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5823820Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5824021Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5824221Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5824451Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5824670Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5824875Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5825075Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5825267Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5825418Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5825652Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5825847Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5826092Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5826321Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5826543Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5826742Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5826931Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5827152Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5827359Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5827548Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5827770Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5827960Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5828185Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5828376Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5828597Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5828787Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5829006Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5829238Z E1204 11:17:59.760000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5829462Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5832202Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5832434Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5832692Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5832921Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5833151Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5833375Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5833606Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5833829Z E1204 11:17:59.760000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5834044Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5834234Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5834453Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5834666Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5834872Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5835074Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5835278Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5835506Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5835728Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5835955Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5836175Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5836378Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5836600Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5836836Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5837070Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5837296Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5837521Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5837735Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5837939Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5838152Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5838344Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5838536Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5838755Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5838983Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5839202Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5839430Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5839651Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5839843Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5840068Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5840316Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5840549Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5840775Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5841007Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5841210Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5841428Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5841660Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5841880Z E1204 11:17:59.760000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5842111Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5842351Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5842566Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5842768Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5842967Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5843171Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5843398Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5843618Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5843807Z E1204 11:17:59.760000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5844028Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5844265Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5844487Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5844725Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5844944Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5845178Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5845381Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5845578Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5845778Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5846006Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5846229Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5846470Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5846692Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5846922Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5847143Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5847333Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5847552Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5847780Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5847997Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5848228Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5848449Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5848662Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5848877Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5849076Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5849298Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5849526Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5849746Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5849979Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5850236Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5850466Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5850697Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5850925Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5851149Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5851378Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5851599Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5851797Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5852001Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5852189Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5852386Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5852600Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5852807Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5853017Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5853211Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5853418Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5853600Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5853729Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5853876Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5853984Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5854109Z E1204 11:17:59.760000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5854268Z [W1204 11:17:59.223701324 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5854271Z 2025-12-04T12:10:21.5854415Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5854718Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5855010Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5855138Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5855622Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5855879Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5856105Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5856310Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5856511Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5856740Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5856960Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5857198Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5857416Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5857668Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5857895Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5858121Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5858340Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5858568Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5858786Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5858994Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5859203Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5859402Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5859627Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5859848Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5860043Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5860303Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5860529Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5860749Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5860938Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5861157Z E1204 
11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5861376Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5861563Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5861806Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5862005Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5862192Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5862416Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5862643Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5862864Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5863102Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5863321Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5863518Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5863727Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5863927Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5864153Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5864374Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5864602Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5864828Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5865055Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5865273Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5865511Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5865728Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5865979Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5866198Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5866425Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5866647Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5866875Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5867098Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5867336Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5867559Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5867788Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5868007Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5868237Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5868458Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5868687Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5868904Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5869109Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5869310Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5869536Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5869764Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5869988Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5870273Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5870499Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5870718Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5870944Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5871163Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5871389Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5871622Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5871849Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5872066Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5872265Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5872455Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5872671Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5872868Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5873074Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5873274Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5873501Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5873721Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5873933Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5874121Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5874369Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5874566Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5874755Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5874973Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5875199Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5875421Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5875657Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5875876Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5876073Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5876284Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5876483Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5876715Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5876936Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5877137Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5877336Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5877537Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5877766Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5877995Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5878225Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5878465Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5878694Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5878916Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5879143Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5879363Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5879591Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5879821Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5880021Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5880245Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5880466Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5880670Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5880870Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5881069Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5881298Z 
E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5881521Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5881748Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5881969Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5882207Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5882428Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5882680Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5882903Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5883133Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5883353Z 
E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5883556Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5883754Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5883959Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5884170Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5884372Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5884601Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5884820Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5885027Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5885227Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5885432Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5885659Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5885883Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5886113Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5886341Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5886570Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5886809Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5887040Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5887261Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5887489Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5887711Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5887939Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5888171Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5888397Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5888617Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5888844Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5889065Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5889295Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5889515Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5889750Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5889974Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5890212Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5890404Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5890643Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5890872Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5891116Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5891346Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5891566Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5891796Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5892021Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5892248Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5892482Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5892710Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5892933Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5893125Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5893350Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5893581Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5893802Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5894032Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5894255Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5894473Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5894687Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5894887Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5895099Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5895343Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5895567Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5895781Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5895983Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5896184Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5896385Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5896627Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5896848Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5897051Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5897252Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5897447Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5897598Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5897818Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5898010Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5898230Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5898458Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5898679Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5898891Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5899083Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5899323Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5899524Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5899713Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5899934Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5900162Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5900385Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5900587Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5900808Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5901002Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5901223Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5901453Z E1204 11:17:59.762000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5901672Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5901902Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5902124Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5902354Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5902575Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5902802Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5903032Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5903261Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5903509Z E1204 11:17:59.762000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5903710Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5903899Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5904121Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5904335Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5904540Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5904762Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5904965Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5905194Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5905412Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5905645Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5905866Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5906057Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5906277Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5906504Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5906724Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5906952Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5907185Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5907398Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5907609Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5907819Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5908012Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5908204Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5908423Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5908652Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5908885Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5909113Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5909333Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5909525Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5909748Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5909976Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5910232Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5910461Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5910683Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5910872Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5911093Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5911332Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5911552Z E1204 11:17:59.762000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5911794Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5912026Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5912239Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5912442Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5912642Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5912844Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5913082Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5913303Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5913493Z E1204 11:17:59.762000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5913713Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5913942Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5914166Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5914396Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5914618Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5914839Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5915040Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5915239Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5915448Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5915676Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5915906Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5916142Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5916363Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5916589Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5916813Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5917007Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5917239Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5917465Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5917684Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5917911Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5918132Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5918345Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5918547Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5918745Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5918946Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5919178Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5919401Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5919640Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5919864Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5920138Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5920374Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5920603Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5920822Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5921051Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5921273Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5921483Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5921687Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5921875Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5922072Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5922287Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5922494Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5922693Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5922885Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5923077Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5923249Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5923375Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5923522Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5923627Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5923765Z E1204 11:17:59.762000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5923921Z [W1204 11:17:59.225746157 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T12:10:21.5923924Z 2025-12-04T12:10:21.5924087Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Runtime error during autotuning: 2025-12-04T12:10:21.5924386Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] No valid triton configs. OutOfMemoryError: out of resource: triton_mm Required: 65536 Hardware limit:65536 Reducing block sizes or `num_stages` may help. 
2025-12-04T12:10:21.5924678Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Exception raised from loadKernel at /var/lib/jenkins/workspace/torch/csrc/inductor/static_cuda_launcher.cpp:147 (most recent call first): 2025-12-04T12:10:21.5924810Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] C++ CapturedTraceback: 2025-12-04T12:10:21.5925287Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T12:10:21.5925548Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T12:10:21.5925774Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #6 (anonymous namespace)::load_kernel(_object*, _object*) [clone .cold] from static_cuda_launcher.cpp:0 2025-12-04T12:10:21.5925982Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #7 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:552 2025-12-04T12:10:21.5926181Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #8 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5926410Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #9 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5926628Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #10 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5926858Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #11 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5927077Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #12 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5927306Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #13 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5927529Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #14 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5927755Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #15 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5927985Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #16 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5928215Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5928454Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5928653Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #19 PyVectorcall_Call from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5928862Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #20 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5929066Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #21 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5929292Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #22 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5929513Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #23 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5929713Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #24 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5929933Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #25 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5930189Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #26 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5930410Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5930600Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #28 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5930819Z E1204 
11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #29 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5931016Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #30 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5931203Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5931422Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5931618Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #33 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5931810Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5932045Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5932270Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #36 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5932500Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5932745Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #38 
_PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5932965Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5933161Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #40 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5933370Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #41 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5933573Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #42 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5933811Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5934029Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5934256Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5934475Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5934701Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #47 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5934923Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #48 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5935156Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #49 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5935374Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #50 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5935602Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #51 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5935821Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5936048Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #53 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5936276Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #54 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5936505Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #55 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5936751Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #56 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5936977Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5937196Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5937423Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5937641Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5937867Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #61 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5938098Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5938326Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5938542Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5938745Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #65 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5938942Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #66 type_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:1135 2025-12-04T12:10:21.5939172Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #67 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5939392Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #68 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5939618Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5939840Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5940067Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5940319Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5940560Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5940781Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #74 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5941031Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5941248Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5941477Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5941695Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5941894Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #79 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5942082Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5942314Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5942511Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #82 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5942717Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #83 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 
2025-12-04T12:10:21.5942918Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #84 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5943144Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5943362Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5943558Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #87 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5943748Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #88 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5943968Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #89 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5944166Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #90 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5944356Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #91 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5944582Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5944810Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] 
[0/0] #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5945048Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5945275Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #95 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5945497Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5945696Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #97 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5945906Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #98 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5946108Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #99 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5946350Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #100 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5946573Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #101 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5946778Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #102 _PyObject_Call_Prepend from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5946977Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5947179Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5947407Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5947627Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5947857Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5948077Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5948307Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5948530Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5948771Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #111 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5948992Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5949240Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5949465Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5949662Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #115 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5949855Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #116 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5950074Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #117 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5950317Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5950534Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5950734Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5950965Z 
E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5951184Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5951414Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5951634Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5951861Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5952081Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5952308Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5952530Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5952757Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5952997Z 
E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5953201Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #131 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5953427Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #132 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5953621Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #133 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5953831Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #134 partial_call from /usr/local/src/conda/python-3.10.14/Modules/_functoolsmodule.c:323 2025-12-04T12:10:21.5954032Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #135 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5954259Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #136 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5954480Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #137 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5954693Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #138 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5954890Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #139 
slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5955093Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #140 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5955322Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5955548Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5955777Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #143 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5956000Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #144 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5956228Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #145 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5956447Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #146 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5956677Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #147 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5956897Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #148 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5957135Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #149 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5957359Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #150 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5957605Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #151 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5957833Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #152 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5958063Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #153 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5958286Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #154 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5958513Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #155 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5958733Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #156 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5958974Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #157 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5959195Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #158 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5959421Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #159 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5959642Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #160 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5959845Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #161 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5960038Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #162 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5960301Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #163 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5960529Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #164 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5960750Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #165 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5960981Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #166 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5961200Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #167 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5961444Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #168 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5961666Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #169 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5961919Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #170 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5962140Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #171 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5962375Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #172 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5962599Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #173 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5962789Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #174 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5963013Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #175 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5963252Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #176 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5963475Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #177 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5963708Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #178 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5963928Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #179 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5964143Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #180 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5964342Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #181 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5964546Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #182 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5964749Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #183 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5964979Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #184 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5965204Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #185 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5965428Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #186 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5965630Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #187 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5965828Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #188 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5966049Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #189 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5966277Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #190 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5966498Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #191 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5966699Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #192 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5966902Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #193 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 
2025-12-04T12:10:21.5967099Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #194 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5967262Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #195 dynamo__custom_eval_frame from :0 2025-12-04T12:10:21.5967486Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #196 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5967676Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #197 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5967896Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #198 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5968125Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #199 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5968344Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #200 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5968544Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #201 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5968733Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #202 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5968954Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #203 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5969154Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #204 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5969345Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #205 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5969578Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #206 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5969768Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #207 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5970000Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #208 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5970230Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #209 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5970453Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #210 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5970643Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #211 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5970865Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #212 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5971096Z E1204 11:17:59.764000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #213 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5971331Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #214 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5971562Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #215 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5971785Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #216 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5972014Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #217 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5972235Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #218 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5972464Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #219 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5972685Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #220 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5972912Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #221 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5973133Z E1204 11:17:59.764000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #222 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5973331Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #223 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T12:10:21.5973524Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #224 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5973758Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #225 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5973973Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #226 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5974187Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #227 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5974397Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #228 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5974599Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #229 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5974827Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #230 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5975048Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #231 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5975277Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #232 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5975509Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #233 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5975699Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #234 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5975924Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #235 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5976152Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #236 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5976373Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #237 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5976602Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #238 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5976823Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #239 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5977036Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #240 _PyObject_FastCallDictTstate from 
/usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5977238Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #241 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5977436Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #242 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5977629Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #243 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T12:10:21.5977828Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #244 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5978049Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #245 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5978291Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #246 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5978521Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #247 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5978750Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #248 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5978970Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #249 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 
2025-12-04T12:10:21.5979160Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #250 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5979379Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #251 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5979618Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #252 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5979838Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #253 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5980064Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #254 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5980316Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #255 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5980512Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #256 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5980735Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #257 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5980962Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #258 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5981182Z E1204 11:17:59.764000 
876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #259 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5981411Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #260 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5981633Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #261 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5981849Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #262 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5982062Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #263 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5982262Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #264 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5982473Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #265 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5982724Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #266 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5982949Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #267 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5983139Z E1204 11:17:59.764000 876171 
site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #268 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5983359Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #269 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5983587Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #270 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5983821Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #271 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5984047Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #272 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5984269Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #273 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5984483Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #274 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5984686Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #275 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5984887Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #276 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5985089Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #277 
_PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5985318Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #278 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5985536Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #279 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5985766Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #280 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5985988Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #281 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5986226Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #282 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5986447Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #283 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5986645Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #284 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T12:10:21.5986873Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #285 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5987101Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #286 _PyObject_VectorcallTstate from 
/usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5987323Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #287 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5987551Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #288 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5987771Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #289 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5987993Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #290 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T12:10:21.5988193Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #291 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T12:10:21.5988391Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #292 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T12:10:21.5988590Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #293 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T12:10:21.5988820Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #294 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T12:10:21.5989041Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #295 _PyEval_EvalFrame from 
/usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5989270Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #296 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5989494Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #297 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5989722Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #298 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5989943Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #299 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5990211Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #300 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5990444Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #301 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5990673Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #302 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T12:10:21.5990903Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #303 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T12:10:21.5991116Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #304 PyEval_EvalCode from 
/usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T12:10:21.5991317Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #305 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T12:10:21.5991509Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #306 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T12:10:21.5991708Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #307 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T12:10:21.5991924Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #308 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T12:10:21.5992147Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #309 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T12:10:21.5992343Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #310 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T12:10:21.5992535Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #311 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T12:10:21.5992726Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #312 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T12:10:21.5992897Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #313 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T12:10:21.5993021Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #314 
_start from ??:0 2025-12-04T12:10:21.5993167Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] #315 from ??:0 2025-12-04T12:10:21.5993270Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] . 2025-12-04T12:10:21.5993396Z E1204 11:17:59.764000 876171 site-packages/torch/_inductor/select_algorithm.py:3696] [0/0] Ignoring this choice. 2025-12-04T12:10:21.5993440Z FAILED [1.6320s] [100%] 2025-12-04T12:10:21.5993442Z 2025-12-04T12:10:21.5993500Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.5993660Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.5993709Z Traceback (most recent call last): 2025-12-04T12:10:21.5993872Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.5993917Z method(*args, **kwargs) 2025-12-04T12:10:21.5994072Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.5995749Z method(*args, **kwargs) 2025-12-04T12:10:21.5995925Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.5995964Z with policy(): 2025-12-04T12:10:21.5996118Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.5996162Z raise RuntimeError(msg) 2025-12-04T12:10:21.5996585Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. 
CUDA driver allocated memory was 807403520 and is now 1954545664. 2025-12-04T12:10:21.5996605Z 2025-12-04T12:10:21.5996684Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.5996956Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.5996961Z 2025-12-04T12:10:21.5997052Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.5997130Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.5997176Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.5997233Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.5997794Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.5997906Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.5997950Z graph_break [] 2025-12-04T12:10:21.5998016Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.5998093Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.5998584Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.5998639Z current_size = base.storage().size() 2025-12-04T12:10:21.5998681Z Autotune Choices Stats: 2025-12-04T12:10:21.5999059Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00860000029206276, "best_triton_pos": 0} 2025-12-04T12:10:21.5999129Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.5999180Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.5999303Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.5999547Z triton_mm_34 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.5999595Z _scaled_mm 0.0096 ms 90.0% 2025-12-04T12:10:21.5999823Z triton_mm_33 0.0097 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6000057Z triton_mm_16 0.0113 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6000348Z triton_mm_22 0.0114 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6000584Z 
triton_mm_30 0.0118 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6000811Z triton_mm_23 0.0122 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6001032Z triton_mm_21 0.0123 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6001257Z triton_mm_15 0.0129 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6001481Z triton_mm_31 0.0134 ms 64.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6001629Z SingleProcess AUTOTUNE benchmarking takes 0.1760 seconds and 8.8180 seconds precompiling for 33 choices 2025-12-04T12:10:21.6001790Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6001838Z Traceback (most recent call last): 2025-12-04T12:10:21.6001996Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6002037Z method(*args, **kwargs) 2025-12-04T12:10:21.6002191Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6002232Z 
method(*args, **kwargs) 2025-12-04T12:10:21.6002383Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6002423Z with policy(): 2025-12-04T12:10:21.6002577Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6002618Z raise RuntimeError(msg) 2025-12-04T12:10:21.6003036Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1954545664 and is now 2921332736. 2025-12-04T12:10:21.6003039Z 2025-12-04T12:10:21.6003115Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6003385Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6003389Z 2025-12-04T12:10:21.6003480Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6003554Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6003600Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6003656Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6004228Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), 
('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6004335Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6004377Z graph_break [] 2025-12-04T12:10:21.6004441Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.6004514Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6004999Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6005046Z current_size = base.storage().size() 2025-12-04T12:10:21.6005091Z Autotune Choices Stats: 2025-12-04T12:10:21.6005465Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00860000029206276, "best_triton_pos": 0} 2025-12-04T12:10:21.6005544Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.6005593Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6005717Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6005952Z triton_mm_34 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6005997Z _scaled_mm 0.0096 ms 90.0% 2025-12-04T12:10:21.6006230Z triton_mm_33 0.0097 ms 88.5% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6006455Z triton_mm_16 0.0113 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6006679Z triton_mm_22 0.0114 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6006902Z triton_mm_30 0.0118 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6007128Z triton_mm_23 0.0122 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6007355Z triton_mm_21 0.0123 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6007588Z triton_mm_15 0.0129 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6007813Z triton_mm_31 0.0134 ms 64.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.6007952Z SingleProcess AUTOTUNE benchmarking takes 0.1760 seconds and 8.8180 seconds precompiling for 33 choices 2025-12-04T12:10:21.6008037Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6008083Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6008139Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6008237Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6008724Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6008761Z graph_break [] 2025-12-04T12:10:21.6008824Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.6008899Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6009264Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:21.6009366Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.6009407Z Autotune Choices Stats: 2025-12-04T12:10:21.6009777Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008678999729454517, "best_triton_pos": 0} 2025-12-04T12:10:21.6009840Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.6009893Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6010013Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6010279Z triton_mm_72 0.0087 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6010507Z triton_mm_71 0.0098 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6010549Z _scaled_mm 0.0098 ms 88.2% 2025-12-04T12:10:21.6010774Z triton_mm_60 0.0112 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6010999Z triton_mm_54 0.0114 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6011223Z triton_mm_67 0.0115 ms 75.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6011458Z triton_mm_59 0.0118 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6011682Z triton_mm_68 0.0118 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6011932Z triton_mm_61 0.0121 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6012157Z triton_mm_53 0.0123 ms 70.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6012287Z SingleProcess AUTOTUNE benchmarking takes 0.2638 seconds and 0.8231 seconds precompiling for 39 choices 2025-12-04T12:10:21.6012341Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6012501Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6012549Z Traceback (most recent call last): 2025-12-04T12:10:21.6012707Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6012763Z method(*args, **kwargs) 2025-12-04T12:10:21.6012916Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6012956Z method(*args, **kwargs) 2025-12-04T12:10:21.6013106Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6013145Z with policy(): 2025-12-04T12:10:21.6013300Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6013342Z raise RuntimeError(msg) 2025-12-04T12:10:21.6013748Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:21.6013753Z 2025-12-04T12:10:21.6013827Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6014098Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6014100Z 2025-12-04T12:10:21.6014188Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6014261Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6014306Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6014361Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6014910Z inductor [('triton_bundler_save_kernel', 312), ('generated_module_cache_miss', 38), ('benchmarking.InductorBenchmarker.benchmark_gpu', 33), ('select_algorithm_num_precompiles', 32), ('select_algorithm_num_precompilation_exceptions', 6), ('async_compile_cache_miss', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6015010Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6015047Z graph_break [] 2025-12-04T12:10:21.6015124Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.6015196Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6015691Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6015748Z current_size = base.storage().size() 2025-12-04T12:10:21.6015790Z Autotune Choices Stats: 2025-12-04T12:10:21.6016156Z {"num_choices": 33, "num_triton_choices": 32, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00860000029206276, "best_triton_pos": 0} 2025-12-04T12:10:21.6016221Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.6016270Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6016391Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6016626Z triton_mm_34 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:21.6016683Z _scaled_mm 0.0096 ms 90.0% 2025-12-04T12:10:21.6016910Z triton_mm_33 0.0097 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6017133Z triton_mm_16 0.0113 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6017356Z triton_mm_22 0.0114 ms 75.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6017580Z triton_mm_30 0.0118 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6017806Z triton_mm_23 0.0122 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6018029Z triton_mm_21 0.0123 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6018253Z triton_mm_15 0.0129 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6018481Z triton_mm_31 0.0134 ms 64.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, 
BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6018609Z SingleProcess AUTOTUNE benchmarking takes 0.1760 seconds and 8.8180 seconds precompiling for 33 choices 2025-12-04T12:10:21.6018683Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6018725Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6018790Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6018888Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6019383Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6019432Z graph_break [] 2025-12-04T12:10:21.6019495Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.6019568Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6019930Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/runtime/static_cuda_launcher.py:155: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 
2025-12-04T12:10:21.6020023Z (self.function, self.n_regs, self.n_spills) = _StaticCudaLauncher._load_kernel( 2025-12-04T12:10:21.6020063Z Autotune Choices Stats: 2025-12-04T12:10:21.6020472Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.008678999729454517, "best_triton_pos": 0} 2025-12-04T12:10:21.6020551Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.6020602Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6020722Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6020956Z triton_mm_72 0.0087 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6021182Z triton_mm_71 0.0098 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6021226Z _scaled_mm 0.0098 ms 88.2% 2025-12-04T12:10:21.6021450Z triton_mm_60 0.0112 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6021672Z triton_mm_54 0.0114 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6021896Z triton_mm_67 0.0115 ms 75.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6022118Z triton_mm_59 0.0118 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6022340Z triton_mm_68 0.0118 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6022567Z triton_mm_61 0.0121 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6022805Z triton_mm_53 0.0123 ms 70.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6022935Z SingleProcess AUTOTUNE benchmarking takes 0.2638 seconds and 0.8231 seconds precompiling for 39 choices 2025-12-04T12:10:21.6023033Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6023078Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6023134Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6023234Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6023715Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 
1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6023755Z graph_break [] 2025-12-04T12:10:21.6023817Z aten_mm_info [('aten._scaled_mm.default_1024_512_1024', 1)] 2025-12-04T12:10:21.6023889Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6023931Z Autotune Choices Stats: 2025-12-04T12:10:21.6024297Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_109", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009239999577403069, "best_triton_pos": 0} 2025-12-04T12:10:21.6024372Z AUTOTUNE scaled_mm(1024x1024, 1024x512, 1024x1, 1x512, 512) 2025-12-04T12:10:21.6024420Z strides: [1024, 1], [1, 1024], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6024541Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6024776Z triton_mm_109 0.0092 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6024820Z _scaled_mm 0.0093 ms 99.1% 2025-12-04T12:10:21.6025047Z triton_mm_106 0.0110 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6025271Z triton_mm_92 0.0113 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.6025495Z triton_mm_98 0.0115 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6025716Z triton_mm_97 0.0117 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6025945Z triton_mm_110 0.0118 ms 78.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6026170Z triton_mm_99 0.0122 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6026413Z triton_mm_105 0.0122 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6026640Z triton_mm_91 0.0125 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6026790Z SingleProcess AUTOTUNE benchmarking takes 0.2638 seconds and 0.6438 seconds precompiling for 39 choices 2025-12-04T12:10:21.6026982Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-48788c463bb75a05.xml - 2025-12-04T12:10:21.6027042Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6027661Z FAILED [1.6320s] 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 2921332736 and is now 3888119808. 2025-12-04T12:10:21.6027664Z 2025-12-04T12:10:21.6027738Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6028006Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6028029Z 2025-12-04T12:10:21.6028119Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6028182Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6028254Z ================= 1 failed, 187 deselected, 2 rerun in 28.88s ================== 2025-12-04T12:10:21.6028292Z Got exit code 1 2025-12-04T12:10:21.6028509Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6028635Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6028782Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3ca22cc451a1182b.xml 2025-12-04T12:10:21.6028840Z ============================= test session starts ============================== 2025-12-04T12:10:21.6028952Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6028997Z cachedir: .pytest_cache 2025-12-04T12:10:21.6029157Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6029204Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6029247Z configfile: pytest.ini 2025-12-04T12:10:21.6029412Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6029489Z collecting ... collected 188 items / 111 deselected / 77 selected 2025-12-04T12:10:21.6029546Z stepcurrent: skipping 111 already run items. 
2025-12-04T12:10:21.6029593Z Running 77 items in this shard 2025-12-04T12:10:21.6029595Z 2025-12-04T12:10:21.6029825Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8447s] [ 1%] 2025-12-04T12:10:21.6030046Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3572s] [ 1%] 2025-12-04T12:10:21.6030295Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda FAILED [0.3430s] [ 1%] 2025-12-04T12:10:21.6030297Z 2025-12-04T12:10:21.6030349Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6030510Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6030569Z Traceback (most recent call last): 2025-12-04T12:10:21.6030728Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6030770Z method(*args, **kwargs) 2025-12-04T12:10:21.6030921Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6030962Z method(*args, **kwargs) 2025-12-04T12:10:21.6031114Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6031151Z with policy(): 2025-12-04T12:10:21.6031304Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6031348Z raise RuntimeError(msg) 2025-12-04T12:10:21.6031752Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6031767Z 2025-12-04T12:10:21.6031842Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6032108Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6032110Z 2025-12-04T12:10:21.6032196Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6032269Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6032317Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6032373Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6032441Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6032539Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6032578Z graph_break [] 2025-12-04T12:10:21.6032639Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6032787Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6032833Z Traceback (most recent call last): 2025-12-04T12:10:21.6032987Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6033027Z method(*args, **kwargs) 2025-12-04T12:10:21.6033177Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6033217Z method(*args, **kwargs) 
2025-12-04T12:10:21.6033367Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6033407Z with policy(): 2025-12-04T12:10:21.6033558Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6033599Z raise RuntimeError(msg) 2025-12-04T12:10:21.6034002Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6034005Z 2025-12-04T12:10:21.6034081Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6034358Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6034371Z 2025-12-04T12:10:21.6034459Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6034531Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6034575Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6034631Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6034697Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6034795Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6034834Z graph_break [] 2025-12-04T12:10:21.6034894Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6034967Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6035012Z frames 
[('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6035069Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6035165Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6035241Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6035281Z graph_break [] 2025-12-04T12:10:21.6035338Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6035390Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6035538Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6035586Z Traceback (most recent call last): 2025-12-04T12:10:21.6035738Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6035779Z method(*args, **kwargs) 2025-12-04T12:10:21.6035928Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6035972Z method(*args, **kwargs) 2025-12-04T12:10:21.6036124Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6036162Z with policy(): 2025-12-04T12:10:21.6036313Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6036357Z raise RuntimeError(msg) 2025-12-04T12:10:21.6036752Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:21.6036754Z 2025-12-04T12:10:21.6036826Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6037090Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6037094Z 2025-12-04T12:10:21.6037178Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6037251Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6037293Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6037349Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6037421Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6037518Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6037556Z graph_break [] 2025-12-04T12:10:21.6037614Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6037705Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6037750Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6037804Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6037903Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6037967Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6038006Z graph_break [] 2025-12-04T12:10:21.6038064Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6038140Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6038182Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6038238Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6038331Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6038395Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6038433Z graph_break [] 2025-12-04T12:10:21.6038492Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6038681Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3ca22cc451a1182b.xml - 2025-12-04T12:10:21.6038756Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6039353Z FAILED [0.3430s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6039356Z 2025-12-04T12:10:21.6039427Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6039690Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6039694Z 2025-12-04T12:10:21.6039778Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6039840Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6039907Z ================== 1 failed, 111 deselected, 2 rerun in 2.56s ================== 2025-12-04T12:10:21.6039946Z Got exit code 1 2025-12-04T12:10:21.6039988Z Retrying single test... 
2025-12-04T12:10:21.6040169Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-74fc2b45d2af22c9.xml 2025-12-04T12:10:21.6040226Z ============================= test session starts ============================== 2025-12-04T12:10:21.6040339Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6040381Z cachedir: .pytest_cache 2025-12-04T12:10:21.6040541Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6040590Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6040632Z configfile: pytest.ini 2025-12-04T12:10:21.6040795Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6040890Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6041151Z stepcurrent: skipping 111 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6041196Z Running 1 items in this shard 2025-12-04T12:10:21.6041198Z 2025-12-04T12:10:21.6041452Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [36.7216s] [100%] 2025-12-04T12:10:21.6041674Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3727s] [100%] 2025-12-04T12:10:21.6041871Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda FAILED [0.3485s] [100%] 2025-12-04T12:10:21.6041873Z 2025-12-04T12:10:21.6041925Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6042072Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6042118Z Traceback (most recent call last): 2025-12-04T12:10:21.6042275Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6042318Z method(*args, **kwargs) 2025-12-04T12:10:21.6042485Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6042527Z method(*args, **kwargs) 2025-12-04T12:10:21.6042677Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6042715Z with policy(): 2025-12-04T12:10:21.6042867Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6042909Z raise RuntimeError(msg) 2025-12-04T12:10:21.6043309Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6043312Z 2025-12-04T12:10:21.6043386Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6043648Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6043651Z 2025-12-04T12:10:21.6043737Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6043811Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6043854Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6043909Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6043975Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6044072Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6044112Z graph_break [] 2025-12-04T12:10:21.6044171Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6044319Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6044365Z Traceback (most recent call last): 2025-12-04T12:10:21.6044516Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6044558Z method(*args, **kwargs) 2025-12-04T12:10:21.6044717Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6044757Z method(*args, **kwargs) 2025-12-04T12:10:21.6044907Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6044946Z with policy(): 2025-12-04T12:10:21.6045115Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6045158Z raise RuntimeError(msg) 2025-12-04T12:10:21.6045556Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6045559Z 2025-12-04T12:10:21.6045633Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6045897Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6045900Z 2025-12-04T12:10:21.6045984Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6046059Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6046111Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6046166Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6046231Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6046329Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6046366Z graph_break [] 2025-12-04T12:10:21.6046426Z aten_mm_info 
[('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6046498Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6046541Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6046595Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6046691Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6046755Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6046793Z graph_break [] 2025-12-04T12:10:21.6046850Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6046903Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6047050Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6047098Z Traceback (most recent call last): 2025-12-04T12:10:21.6047252Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6047292Z method(*args, **kwargs) 2025-12-04T12:10:21.6047442Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6047483Z method(*args, **kwargs) 2025-12-04T12:10:21.6047632Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6047673Z with policy(): 2025-12-04T12:10:21.6047824Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6047868Z raise RuntimeError(msg) 2025-12-04T12:10:21.6048277Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6048280Z 2025-12-04T12:10:21.6048353Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6048614Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6048635Z 2025-12-04T12:10:21.6048720Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6048794Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6048836Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6048893Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6048958Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6049057Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6049093Z graph_break [] 2025-12-04T12:10:21.6049152Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6049226Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6049272Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6049325Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6049422Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6049485Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6049534Z graph_break [] 2025-12-04T12:10:21.6049591Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6049665Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6049709Z frames [('total', 1), 
('ok', 1)] 2025-12-04T12:10:21.6049763Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6049858Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6049921Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6049957Z graph_break [] 2025-12-04T12:10:21.6050015Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6050244Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-74fc2b45d2af22c9.xml - 2025-12-04T12:10:21.6050305Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6050909Z FAILED [0.3485s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6050912Z 2025-12-04T12:10:21.6050983Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6051248Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6051252Z 2025-12-04T12:10:21.6051335Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6051398Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6051466Z ================= 1 failed, 187 deselected, 2 rerun in 37.46s ================== 2025-12-04T12:10:21.6051504Z Got exit code 1 2025-12-04T12:10:21.6051547Z Retrying single test... 2025-12-04T12:10:21.6051719Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7cef8e009421476d.xml 2025-12-04T12:10:21.6051776Z ============================= test session starts ============================== 2025-12-04T12:10:21.6051886Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6051927Z cachedir: .pytest_cache 2025-12-04T12:10:21.6052094Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6052157Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6052200Z configfile: pytest.ini 2025-12-04T12:10:21.6052362Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6052436Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6052698Z stepcurrent: skipping 111 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6052743Z Running 1 items in this shard 2025-12-04T12:10:21.6052745Z 2025-12-04T12:10:21.6052971Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [34.0599s] [100%] 2025-12-04T12:10:21.6053195Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1022s] [100%] 2025-12-04T12:10:21.6053415Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda FAILED [1.0524s] [100%] 2025-12-04T12:10:21.6053417Z 2025-12-04T12:10:21.6053469Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6053617Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6053665Z Traceback (most recent call last): 2025-12-04T12:10:21.6053820Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6053861Z method(*args, **kwargs) 2025-12-04T12:10:21.6054012Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6054056Z method(*args, **kwargs) 2025-12-04T12:10:21.6054209Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6054247Z with policy(): 2025-12-04T12:10:21.6054398Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6054440Z raise RuntimeError(msg) 2025-12-04T12:10:21.6054835Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6054838Z 2025-12-04T12:10:21.6054911Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6055175Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6055179Z 2025-12-04T12:10:21.6055263Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6055337Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6055381Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6055451Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6055516Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6055613Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6055649Z graph_break [] 2025-12-04T12:10:21.6055708Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6055882Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6055929Z Traceback (most recent call last): 2025-12-04T12:10:21.6056084Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6056124Z method(*args, **kwargs) 2025-12-04T12:10:21.6056274Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6056315Z method(*args, **kwargs) 2025-12-04T12:10:21.6056464Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6056502Z with policy(): 2025-12-04T12:10:21.6056652Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6056694Z raise RuntimeError(msg) 2025-12-04T12:10:21.6057088Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6057104Z 2025-12-04T12:10:21.6057176Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6057438Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6057441Z 2025-12-04T12:10:21.6057525Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6057597Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6057641Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6057698Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6057761Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6057860Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6057896Z graph_break [] 2025-12-04T12:10:21.6057954Z aten_mm_info 
[('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6058026Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6058069Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6058124Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6058220Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6058282Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6058321Z graph_break [] 2025-12-04T12:10:21.6058378Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6058432Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6058577Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6058625Z Traceback (most recent call last): 2025-12-04T12:10:21.6058776Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6058816Z method(*args, **kwargs) 2025-12-04T12:10:21.6058974Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6059017Z method(*args, **kwargs) 2025-12-04T12:10:21.6059165Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6059202Z with policy(): 2025-12-04T12:10:21.6059362Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6059415Z raise RuntimeError(msg) 2025-12-04T12:10:21.6059809Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6059811Z 2025-12-04T12:10:21.6059883Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6060183Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6060185Z 2025-12-04T12:10:21.6060270Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6060347Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6060389Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6060459Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6060523Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6060621Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6060659Z graph_break [] 2025-12-04T12:10:21.6060719Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6060792Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6060835Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6060889Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6060984Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6062144Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6062190Z graph_break [] 2025-12-04T12:10:21.6062247Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6062324Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6062367Z frames [('total', 1), 
('ok', 1)] 2025-12-04T12:10:21.6062423Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6062518Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6062582Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6062621Z graph_break [] 2025-12-04T12:10:21.6062679Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6062870Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7cef8e009421476d.xml - 2025-12-04T12:10:21.6062929Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6063535Z FAILED [1.0524s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6063539Z 2025-12-04T12:10:21.6063610Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6063894Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6063896Z 2025-12-04T12:10:21.6063982Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6064058Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6064126Z ================= 1 failed, 187 deselected, 2 rerun in 36.23s ================== 2025-12-04T12:10:21.6064166Z Got exit code 1 2025-12-04T12:10:21.6064375Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6064503Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6064648Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-da72f01c65b3639c.xml 2025-12-04T12:10:21.6064704Z ============================= test session starts ============================== 2025-12-04T12:10:21.6064817Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6064858Z cachedir: .pytest_cache 2025-12-04T12:10:21.6065016Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6065072Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6065114Z configfile: pytest.ini 2025-12-04T12:10:21.6065276Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6065352Z collecting ... collected 188 items / 112 deselected / 76 selected 2025-12-04T12:10:21.6065405Z stepcurrent: skipping 112 already run items. 
2025-12-04T12:10:21.6065452Z Running 76 items in this shard 2025-12-04T12:10:21.6065454Z 2025-12-04T12:10:21.6065678Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [36.6560s] [ 1%] 2025-12-04T12:10:21.6065948Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1062s] [ 1%] 2025-12-04T12:10:21.6066165Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda FAILED [1.0746s] [ 1%] 2025-12-04T12:10:21.6066170Z 2025-12-04T12:10:21.6066221Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6066371Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6066418Z Traceback (most recent call last): 2025-12-04T12:10:21.6066576Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6066617Z method(*args, **kwargs) 2025-12-04T12:10:21.6066768Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6066811Z method(*args, **kwargs) 2025-12-04T12:10:21.6066961Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6067000Z with policy(): 2025-12-04T12:10:21.6067152Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6067192Z raise RuntimeError(msg) 2025-12-04T12:10:21.6067603Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6067605Z 2025-12-04T12:10:21.6067678Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6067940Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6067953Z 2025-12-04T12:10:21.6068040Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6068112Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6068156Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6068211Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6068278Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6068375Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6068414Z graph_break [] 2025-12-04T12:10:21.6068472Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6068620Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6068668Z Traceback (most recent call last): 2025-12-04T12:10:21.6068820Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6068878Z method(*args, **kwargs) 2025-12-04T12:10:21.6069028Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6069070Z method(*args, **kwargs) 
2025-12-04T12:10:21.6069220Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6069261Z with policy(): 2025-12-04T12:10:21.6069413Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6069455Z raise RuntimeError(msg) 2025-12-04T12:10:21.6069871Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6069875Z 2025-12-04T12:10:21.6069947Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6070244Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6070248Z 2025-12-04T12:10:21.6070334Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6070406Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6070449Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6070504Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6070574Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6070672Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6070711Z graph_break [] 2025-12-04T12:10:21.6070771Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6070843Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6070886Z frames 
[('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6070941Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6071048Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6071114Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6071149Z graph_break [] 2025-12-04T12:10:21.6071207Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6071262Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6071425Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6071473Z Traceback (most recent call last): 2025-12-04T12:10:21.6071626Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6071666Z method(*args, **kwargs) 2025-12-04T12:10:21.6071816Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6071856Z method(*args, **kwargs) 2025-12-04T12:10:21.6072005Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6072042Z with policy(): 2025-12-04T12:10:21.6072192Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6072236Z raise RuntimeError(msg) 2025-12-04T12:10:21.6072628Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:21.6072645Z 2025-12-04T12:10:21.6072718Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6072978Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6072980Z 2025-12-04T12:10:21.6073067Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6073139Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6073199Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6073255Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6073323Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6073418Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6073457Z graph_break [] 2025-12-04T12:10:21.6073514Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6073588Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6073635Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6073690Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6073784Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6073848Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6073885Z graph_break [] 2025-12-04T12:10:21.6073945Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6074017Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6074063Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6074116Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6074211Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6074275Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6074314Z graph_break [] 2025-12-04T12:10:21.6074380Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6074568Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-da72f01c65b3639c.xml - 2025-12-04T12:10:21.6074628Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6075222Z FAILED [1.0746s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6075235Z 2025-12-04T12:10:21.6075308Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6075568Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6075571Z 2025-12-04T12:10:21.6075656Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6075717Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6075786Z ================= 1 failed, 112 deselected, 2 rerun in 38.86s ================== 2025-12-04T12:10:21.6075826Z Got exit code 1 2025-12-04T12:10:21.6075882Z Retrying single test... 
2025-12-04T12:10:21.6076028Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-db30e22dfe046622.xml 2025-12-04T12:10:21.6076085Z ============================= test session starts ============================== 2025-12-04T12:10:21.6076196Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6076239Z cachedir: .pytest_cache 2025-12-04T12:10:21.6076401Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6076447Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6076489Z configfile: pytest.ini 2025-12-04T12:10:21.6076661Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6076737Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6076997Z stepcurrent: skipping 112 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6077043Z Running 1 items in this shard 2025-12-04T12:10:21.6077045Z 2025-12-04T12:10:21.6077266Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [35.8309s] [100%] 2025-12-04T12:10:21.6077488Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0823s] [100%] 2025-12-04T12:10:21.6077684Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda FAILED [0.9787s] [100%] 2025-12-04T12:10:21.6077687Z 2025-12-04T12:10:21.6077737Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6077887Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6077934Z Traceback (most recent call last): 2025-12-04T12:10:21.6078088Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6078140Z method(*args, **kwargs) 2025-12-04T12:10:21.6078291Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6078332Z method(*args, **kwargs) 2025-12-04T12:10:21.6078482Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6078529Z with policy(): 2025-12-04T12:10:21.6078681Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6078722Z raise RuntimeError(msg) 2025-12-04T12:10:21.6079123Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6079126Z 2025-12-04T12:10:21.6079199Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6079459Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6079462Z 2025-12-04T12:10:21.6079550Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6079622Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6079677Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6079732Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6079796Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6079893Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6079933Z graph_break [] 2025-12-04T12:10:21.6079993Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6080182Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6080228Z Traceback (most recent call last): 2025-12-04T12:10:21.6080396Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6080438Z method(*args, **kwargs) 2025-12-04T12:10:21.6080588Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6080630Z method(*args, **kwargs) 2025-12-04T12:10:21.6080784Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6080823Z with policy(): 2025-12-04T12:10:21.6080976Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6081020Z raise RuntimeError(msg) 2025-12-04T12:10:21.6081412Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6081415Z 2025-12-04T12:10:21.6081489Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6081751Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6081753Z 2025-12-04T12:10:21.6081840Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6081927Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6081972Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6082027Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6082094Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6082190Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6082242Z graph_break [] 2025-12-04T12:10:21.6082301Z aten_mm_info 
[('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6082374Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6082416Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6082471Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6082565Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6082629Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6082664Z graph_break [] 2025-12-04T12:10:21.6082724Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6082777Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6082927Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6082977Z Traceback (most recent call last): 2025-12-04T12:10:21.6083130Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6083196Z method(*args, **kwargs) 2025-12-04T12:10:21.6083345Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6083387Z method(*args, **kwargs) 2025-12-04T12:10:21.6083536Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6083577Z with policy(): 2025-12-04T12:10:21.6083728Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6083776Z raise RuntimeError(msg) 2025-12-04T12:10:21.6084183Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6084187Z 2025-12-04T12:10:21.6084260Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6084520Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6084522Z 2025-12-04T12:10:21.6084609Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6084681Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6084723Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6084780Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6084845Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6084944Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6084983Z graph_break [] 2025-12-04T12:10:21.6085043Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6085114Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6085160Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6085215Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6085323Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6085388Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6085425Z graph_break [] 2025-12-04T12:10:21.6085483Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6085555Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6085595Z frames [('total', 1), ('ok', 
1)] 2025-12-04T12:10:21.6085661Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6085756Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6085820Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6085860Z graph_break [] 2025-12-04T12:10:21.6085917Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6086104Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-db30e22dfe046622.xml - 2025-12-04T12:10:21.6086164Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6086760Z FAILED [0.9787s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6086773Z 2025-12-04T12:10:21.6086845Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6087105Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6087107Z 2025-12-04T12:10:21.6087194Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6087256Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6087323Z ================= 1 failed, 187 deselected, 2 rerun in 37.91s ================== 2025-12-04T12:10:21.6087360Z Got exit code 1 2025-12-04T12:10:21.6087401Z Retrying single test... 2025-12-04T12:10:21.6087557Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7a5c67982ccba347.xml 2025-12-04T12:10:21.6087614Z ============================= test session starts ============================== 2025-12-04T12:10:21.6087726Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6087766Z cachedir: .pytest_cache 2025-12-04T12:10:21.6087923Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6087971Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6088013Z configfile: pytest.ini 2025-12-04T12:10:21.6088173Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6088248Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6088505Z stepcurrent: skipping 112 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6088551Z Running 1 items in this shard 2025-12-04T12:10:21.6088553Z 2025-12-04T12:10:21.6088781Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [50.5078s] [100%] 2025-12-04T12:10:21.6089011Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3402s] [100%] 2025-12-04T12:10:21.6089206Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda FAILED [0.3074s] [100%] 2025-12-04T12:10:21.6089208Z 2025-12-04T12:10:21.6089258Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6089420Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6089467Z Traceback (most recent call last): 2025-12-04T12:10:21.6089623Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6089664Z method(*args, **kwargs) 2025-12-04T12:10:21.6089816Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6089859Z method(*args, **kwargs) 2025-12-04T12:10:21.6090009Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6090046Z with policy(): 2025-12-04T12:10:21.6090233Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6090278Z raise RuntimeError(msg) 2025-12-04T12:10:21.6090673Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6090688Z 2025-12-04T12:10:21.6090761Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6091021Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6091024Z 2025-12-04T12:10:21.6091109Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6091182Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6091240Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6091296Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6091362Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6091460Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6091497Z graph_break [] 2025-12-04T12:10:21.6091555Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6091704Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6091752Z Traceback (most recent call last): 2025-12-04T12:10:21.6091904Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6091946Z method(*args, **kwargs) 2025-12-04T12:10:21.6092098Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6092140Z method(*args, **kwargs) 2025-12-04T12:10:21.6092289Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6092331Z with policy(): 2025-12-04T12:10:21.6092480Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6092524Z raise RuntimeError(msg) 2025-12-04T12:10:21.6092930Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6092932Z 2025-12-04T12:10:21.6093005Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6093279Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6093282Z 2025-12-04T12:10:21.6093368Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6093440Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6093486Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6093542Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6093607Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6093705Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6093745Z graph_break [] 2025-12-04T12:10:21.6093803Z aten_mm_info 
[('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6093877Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6093922Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6093977Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6094085Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6094148Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6094187Z graph_break [] 2025-12-04T12:10:21.6094243Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6094295Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6094442Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6094490Z Traceback (most recent call last): 2025-12-04T12:10:21.6094642Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6094684Z method(*args, **kwargs) 2025-12-04T12:10:21.6094843Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6094888Z method(*args, **kwargs) 2025-12-04T12:10:21.6095035Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6095075Z with policy(): 2025-12-04T12:10:21.6095225Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6095267Z raise RuntimeError(msg) 2025-12-04T12:10:21.6095657Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6095660Z 2025-12-04T12:10:21.6095734Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6095993Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6095997Z 2025-12-04T12:10:21.6096082Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6096156Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6096209Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6096265Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6096331Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6096428Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6096468Z graph_break [] 2025-12-04T12:10:21.6096548Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6096620Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6096665Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6096719Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6096814Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6096877Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6096915Z graph_break [] 2025-12-04T12:10:21.6096971Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6097044Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6097087Z frames [('total', 1), ('ok', 
1)] 2025-12-04T12:10:21.6097143Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6097237Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6097302Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6097339Z graph_break [] 2025-12-04T12:10:21.6097396Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6097595Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7a5c67982ccba347.xml - 2025-12-04T12:10:21.6097656Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6098249Z FAILED [0.3074s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6098254Z 2025-12-04T12:10:21.6098334Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6098599Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6098602Z 2025-12-04T12:10:21.6098687Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6098751Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6098819Z ================= 1 failed, 187 deselected, 2 rerun in 51.18s ================== 2025-12-04T12:10:21.6098858Z Got exit code 1 2025-12-04T12:10:21.6099066Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6099195Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6099339Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-59b429bc700bbea5.xml 2025-12-04T12:10:21.6099400Z ============================= test session starts ============================== 2025-12-04T12:10:21.6099510Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6099555Z cachedir: .pytest_cache 2025-12-04T12:10:21.6099724Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6099772Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6099815Z configfile: pytest.ini 2025-12-04T12:10:21.6099974Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6100054Z collecting ... collected 188 items / 113 deselected / 75 selected 2025-12-04T12:10:21.6100151Z stepcurrent: skipping 113 already run items. 
2025-12-04T12:10:21.6100197Z Running 75 items in this shard 2025-12-04T12:10:21.6100199Z 2025-12-04T12:10:21.6100424Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7908s] [ 1%] 2025-12-04T12:10:21.6100646Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3479s] [ 1%] 2025-12-04T12:10:21.6100841Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda FAILED [0.3116s] [ 1%] 2025-12-04T12:10:21.6100843Z 2025-12-04T12:10:21.6100896Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6101045Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6101092Z Traceback (most recent call last): 2025-12-04T12:10:21.6101247Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6101305Z method(*args, **kwargs) 2025-12-04T12:10:21.6101456Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6101497Z method(*args, **kwargs) 2025-12-04T12:10:21.6101648Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6101685Z with policy(): 2025-12-04T12:10:21.6101838Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6101879Z raise RuntimeError(msg) 2025-12-04T12:10:21.6102286Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6102291Z 2025-12-04T12:10:21.6102364Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6102626Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6102628Z 2025-12-04T12:10:21.6102712Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6102786Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6102830Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6102888Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6102953Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6103051Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6103089Z graph_break [] 2025-12-04T12:10:21.6103147Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6103295Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6103340Z Traceback (most recent call last): 2025-12-04T12:10:21.6103510Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6103552Z method(*args, **kwargs) 2025-12-04T12:10:21.6103704Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6103746Z method(*args, **kwargs) 
2025-12-04T12:10:21.6103912Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6103950Z with policy(): 2025-12-04T12:10:21.6104101Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6104142Z raise RuntimeError(msg) 2025-12-04T12:10:21.6104534Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6104537Z 2025-12-04T12:10:21.6104609Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6104869Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6104872Z 2025-12-04T12:10:21.6104959Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6105041Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6105088Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6105142Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6105208Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6105306Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6105345Z graph_break [] 2025-12-04T12:10:21.6105403Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6105478Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6105520Z frames 
[('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6105588Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6105683Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6105747Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6105783Z graph_break [] 2025-12-04T12:10:21.6105841Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6105893Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6106046Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6106095Z Traceback (most recent call last): 2025-12-04T12:10:21.6106249Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6106289Z method(*args, **kwargs) 2025-12-04T12:10:21.6106440Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6106482Z method(*args, **kwargs) 2025-12-04T12:10:21.6106631Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6106671Z with policy(): 2025-12-04T12:10:21.6106825Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6106865Z raise RuntimeError(msg) 2025-12-04T12:10:21.6107264Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:21.6107267Z 2025-12-04T12:10:21.6107340Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6107609Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6107612Z 2025-12-04T12:10:21.6107699Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6107770Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6107813Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6107868Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6107933Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6108029Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6108066Z graph_break [] 2025-12-04T12:10:21.6108124Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6108200Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6108243Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6108299Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6108405Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6108470Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6108507Z graph_break [] 2025-12-04T12:10:21.6108565Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6108638Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6108681Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6108735Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6108830Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6108892Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6108942Z graph_break [] 2025-12-04T12:10:21.6109001Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6109188Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-59b429bc700bbea5.xml - 2025-12-04T12:10:21.6109249Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6109840Z FAILED [0.3116s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6109843Z 2025-12-04T12:10:21.6109916Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6110222Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6110225Z 2025-12-04T12:10:21.6110310Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6110372Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6110441Z ================== 1 failed, 113 deselected, 2 rerun in 2.47s ================== 2025-12-04T12:10:21.6110501Z Got exit code 1 2025-12-04T12:10:21.6110545Z Retrying single test... 
2025-12-04T12:10:21.6110687Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6a2b8072b017fa67.xml 2025-12-04T12:10:21.6110746Z ============================= test session starts ============================== 2025-12-04T12:10:21.6110861Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6110918Z cachedir: .pytest_cache 2025-12-04T12:10:21.6111081Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6111128Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6111171Z configfile: pytest.ini 2025-12-04T12:10:21.6111333Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6111407Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6111666Z stepcurrent: skipping 113 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6111712Z Running 1 items in this shard 2025-12-04T12:10:21.6111714Z 2025-12-04T12:10:21.6111938Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [125.3895s] [100%] 2025-12-04T12:10:21.6112172Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0954s] [100%] 2025-12-04T12:10:21.6112366Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda FAILED [1.0898s] [100%] 2025-12-04T12:10:21.6112369Z 2025-12-04T12:10:21.6112422Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6112569Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6112614Z Traceback (most recent call last): 2025-12-04T12:10:21.6112784Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6112828Z method(*args, **kwargs) 2025-12-04T12:10:21.6112981Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6113022Z method(*args, **kwargs) 2025-12-04T12:10:21.6113173Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6113211Z with policy(): 2025-12-04T12:10:21.6113366Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6113408Z raise RuntimeError(msg) 2025-12-04T12:10:21.6113812Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6113815Z 2025-12-04T12:10:21.6113887Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6114148Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6114150Z 2025-12-04T12:10:21.6114237Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6114318Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6114366Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6114423Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6114492Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6114591Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6114638Z graph_break [] 2025-12-04T12:10:21.6114696Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6114845Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6114890Z Traceback (most recent call last): 2025-12-04T12:10:21.6115046Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6115086Z method(*args, **kwargs) 2025-12-04T12:10:21.6115239Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6115279Z method(*args, **kwargs) 2025-12-04T12:10:21.6115428Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6115465Z with policy(): 2025-12-04T12:10:21.6115620Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6115663Z raise RuntimeError(msg) 2025-12-04T12:10:21.6116067Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6116069Z 2025-12-04T12:10:21.6116146Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6116406Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6116408Z 2025-12-04T12:10:21.6116504Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6116582Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6116627Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6116683Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6116748Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6116844Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6116881Z graph_break [] 2025-12-04T12:10:21.6116941Z aten_mm_info 
[('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6117019Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6117061Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6117116Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6117213Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6117279Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6117316Z graph_break [] 2025-12-04T12:10:21.6117374Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6117426Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6117575Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6117620Z Traceback (most recent call last): 2025-12-04T12:10:21.6117783Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6117824Z method(*args, **kwargs) 2025-12-04T12:10:21.6117974Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6118013Z method(*args, **kwargs) 2025-12-04T12:10:21.6118163Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6118211Z with policy(): 2025-12-04T12:10:21.6118362Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6118407Z raise RuntimeError(msg) 2025-12-04T12:10:21.6118805Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6118807Z 2025-12-04T12:10:21.6118883Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6119142Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6119145Z 2025-12-04T12:10:21.6119234Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6119315Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6119358Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6119413Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6119479Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6119575Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6119617Z graph_break [] 2025-12-04T12:10:21.6119676Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6119749Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6119790Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6119849Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6119957Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6120022Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6120063Z graph_break [] 2025-12-04T12:10:21.6120162Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6120237Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6120280Z frames [('total', 1), ('ok', 
1)] 2025-12-04T12:10:21.6120335Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6120431Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6120497Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6120537Z graph_break [] 2025-12-04T12:10:21.6120597Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6120784Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6a2b8072b017fa67.xml - 2025-12-04T12:10:21.6120845Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6121453Z FAILED [1.0898s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6121456Z 2025-12-04T12:10:21.6121529Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6121789Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6121806Z 2025-12-04T12:10:21.6121890Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6121953Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6122026Z ============ 1 failed, 187 deselected, 2 rerun in 127.59s (0:02:07) ============ 2025-12-04T12:10:21.6122066Z Got exit code 1 2025-12-04T12:10:21.6122107Z Retrying single test... 2025-12-04T12:10:21.6122253Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-67c015d69a03c474.xml 2025-12-04T12:10:21.6122310Z ============================= test session starts ============================== 2025-12-04T12:10:21.6122423Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6122466Z cachedir: .pytest_cache 2025-12-04T12:10:21.6122624Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6122673Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6122718Z configfile: pytest.ini 2025-12-04T12:10:21.6122892Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6122970Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6123230Z stepcurrent: skipping 113 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6123276Z Running 1 items in this shard 2025-12-04T12:10:21.6123278Z 2025-12-04T12:10:21.6123501Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [36.0743s] [100%] 2025-12-04T12:10:21.6123749Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3450s] [100%] 2025-12-04T12:10:21.6123946Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda FAILED [0.3188s] [100%] 2025-12-04T12:10:21.6123949Z 2025-12-04T12:10:21.6124000Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6124149Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6124196Z Traceback (most recent call last): 2025-12-04T12:10:21.6124354Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6124396Z method(*args, **kwargs) 2025-12-04T12:10:21.6124549Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6124589Z method(*args, **kwargs) 2025-12-04T12:10:21.6124738Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6124779Z with policy(): 2025-12-04T12:10:21.6124932Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6124974Z raise RuntimeError(msg) 2025-12-04T12:10:21.6125378Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6125380Z 2025-12-04T12:10:21.6125454Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6125725Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6125729Z 2025-12-04T12:10:21.6125816Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6125887Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6125931Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6125987Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6126053Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6126150Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6126188Z graph_break [] 2025-12-04T12:10:21.6126247Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6126396Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6126441Z Traceback (most recent call last): 2025-12-04T12:10:21.6126605Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6126648Z method(*args, **kwargs) 2025-12-04T12:10:21.6126798Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6126841Z method(*args, **kwargs) 2025-12-04T12:10:21.6126991Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6127031Z with policy(): 2025-12-04T12:10:21.6127183Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6127224Z raise RuntimeError(msg) 2025-12-04T12:10:21.6127631Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6127635Z 2025-12-04T12:10:21.6127708Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6127967Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6127969Z 2025-12-04T12:10:21.6128057Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6128128Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6128173Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6128230Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6128296Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6128394Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6128435Z graph_break [] 2025-12-04T12:10:21.6128495Z aten_mm_info 
[('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6128568Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6128612Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6128680Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6128776Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6128842Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6128878Z graph_break [] 2025-12-04T12:10:21.6128937Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6128999Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6129146Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6129199Z Traceback (most recent call last): 2025-12-04T12:10:21.6129353Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6129396Z method(*args, **kwargs) 2025-12-04T12:10:21.6129546Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6129588Z method(*args, **kwargs) 2025-12-04T12:10:21.6129737Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6129781Z with policy(): 2025-12-04T12:10:21.6129934Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6129978Z raise RuntimeError(msg) 2025-12-04T12:10:21.6130410Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6130431Z 2025-12-04T12:10:21.6130506Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6130766Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6130768Z 2025-12-04T12:10:21.6130855Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6130942Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6130985Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6131042Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6131108Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6131208Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6131244Z graph_break [] 2025-12-04T12:10:21.6131306Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6131379Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6131424Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6131479Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6131574Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6131637Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6131676Z graph_break [] 2025-12-04T12:10:21.6131733Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6131807Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6131852Z frames [('total', 1), ('ok', 
1)] 2025-12-04T12:10:21.6131908Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6132001Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6132065Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6132102Z graph_break [] 2025-12-04T12:10:21.6132172Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6132360Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-67c015d69a03c474.xml - 2025-12-04T12:10:21.6132419Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6133028Z FAILED [0.3188s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6133034Z 2025-12-04T12:10:21.6133105Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6133367Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6133369Z 2025-12-04T12:10:21.6133455Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6133519Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6133588Z ================= 1 failed, 187 deselected, 2 rerun in 36.76s ================== 2025-12-04T12:10:21.6133636Z Got exit code 1 2025-12-04T12:10:21.6133844Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6133972Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6134117Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6dc5febea5fdc88a.xml 2025-12-04T12:10:21.6134175Z ============================= test session starts ============================== 2025-12-04T12:10:21.6134286Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6134330Z cachedir: .pytest_cache 2025-12-04T12:10:21.6134497Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6134545Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6134592Z configfile: pytest.ini 2025-12-04T12:10:21.6134756Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6134834Z collecting ... collected 188 items / 114 deselected / 74 selected 2025-12-04T12:10:21.6134890Z stepcurrent: skipping 114 already run items. 
2025-12-04T12:10:21.6134937Z Running 74 items in this shard 2025-12-04T12:10:21.6134940Z 2025-12-04T12:10:21.6135161Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [22.8222s] [ 1%] 2025-12-04T12:10:21.6135383Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3754s] [ 1%] 2025-12-04T12:10:21.6135578Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda FAILED [0.3553s] [ 1%] 2025-12-04T12:10:21.6135581Z 2025-12-04T12:10:21.6135635Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6135782Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6135831Z Traceback (most recent call last): 2025-12-04T12:10:21.6135998Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6136042Z method(*args, **kwargs) 2025-12-04T12:10:21.6136192Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6136243Z method(*args, **kwargs) 2025-12-04T12:10:21.6136394Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6136439Z with policy(): 2025-12-04T12:10:21.6136592Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6136636Z raise RuntimeError(msg) 2025-12-04T12:10:21.6137027Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6137030Z 2025-12-04T12:10:21.6137100Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6137365Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6137368Z 2025-12-04T12:10:21.6137468Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6137542Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6137586Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6137643Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6137708Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6137810Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6137846Z graph_break [] 2025-12-04T12:10:21.6137906Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6138054Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6138113Z Traceback (most recent call last): 2025-12-04T12:10:21.6138268Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6138313Z method(*args, **kwargs) 2025-12-04T12:10:21.6138462Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6138503Z method(*args, **kwargs) 
2025-12-04T12:10:21.6138652Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6138693Z with policy(): 2025-12-04T12:10:21.6138844Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6138885Z raise RuntimeError(msg) 2025-12-04T12:10:21.6139277Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6139281Z 2025-12-04T12:10:21.6139353Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6139613Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6139615Z 2025-12-04T12:10:21.6139709Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6139783Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6139826Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6139884Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6139951Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6140062Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6140140Z graph_break [] 2025-12-04T12:10:21.6140199Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6140272Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6140316Z frames 
[('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6140375Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6140471Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6140536Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6140573Z graph_break [] 2025-12-04T12:10:21.6140631Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6140685Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6140833Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6140880Z Traceback (most recent call last): 2025-12-04T12:10:21.6141055Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6141094Z method(*args, **kwargs) 2025-12-04T12:10:21.6141243Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6141284Z method(*args, **kwargs) 2025-12-04T12:10:21.6141434Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6141472Z with policy(): 2025-12-04T12:10:21.6141623Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6141665Z raise RuntimeError(msg) 2025-12-04T12:10:21.6142071Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 
2025-12-04T12:10:21.6142075Z 2025-12-04T12:10:21.6142149Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6142406Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6142409Z 2025-12-04T12:10:21.6142495Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6142566Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6142613Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6142670Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6142736Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6142834Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6142874Z graph_break [] 2025-12-04T12:10:21.6142932Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6143007Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6143051Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6143118Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6143213Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6143278Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6143317Z graph_break [] 2025-12-04T12:10:21.6143374Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6143461Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6143505Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6143560Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6143654Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6143718Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6143757Z graph_break [] 2025-12-04T12:10:21.6143813Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6144005Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6dc5febea5fdc88a.xml - 2025-12-04T12:10:21.6144064Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6144655Z FAILED [0.3553s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6144668Z 2025-12-04T12:10:21.6144740Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6145000Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6145002Z 2025-12-04T12:10:21.6145088Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6145149Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6145227Z ================= 1 failed, 114 deselected, 2 rerun in 23.57s ================== 2025-12-04T12:10:21.6145267Z Got exit code 1 2025-12-04T12:10:21.6145310Z Retrying single test... 
2025-12-04T12:10:21.6145456Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f08cf26280cf2c5e.xml 2025-12-04T12:10:21.6145515Z ============================= test session starts ============================== 2025-12-04T12:10:21.6145625Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6145668Z cachedir: .pytest_cache 2025-12-04T12:10:21.6145826Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6145876Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6145921Z configfile: pytest.ini 2025-12-04T12:10:21.6146081Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6146157Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6146411Z stepcurrent: skipping 114 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6146460Z Running 1 items in this shard 2025-12-04T12:10:21.6146462Z 2025-12-04T12:10:21.6146681Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8103s] [100%] 2025-12-04T12:10:21.6146909Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3465s] [100%] 2025-12-04T12:10:21.6147103Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda FAILED [0.3125s] [100%] 2025-12-04T12:10:21.6147117Z 2025-12-04T12:10:21.6147170Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6147316Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6147364Z Traceback (most recent call last): 2025-12-04T12:10:21.6147520Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6147563Z method(*args, **kwargs) 2025-12-04T12:10:21.6147717Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6147757Z method(*args, **kwargs) 2025-12-04T12:10:21.6147913Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6147952Z with policy(): 2025-12-04T12:10:21.6149652Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6149697Z raise RuntimeError(msg) 2025-12-04T12:10:21.6150139Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6150142Z 2025-12-04T12:10:21.6150217Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6150477Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6150479Z 2025-12-04T12:10:21.6150585Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6150662Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6150705Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6150763Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6150828Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6150932Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6150969Z graph_break [] 2025-12-04T12:10:21.6151028Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6151175Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6151220Z Traceback (most recent call last): 2025-12-04T12:10:21.6151379Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6151421Z method(*args, **kwargs) 2025-12-04T12:10:21.6151572Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6151617Z method(*args, **kwargs) 2025-12-04T12:10:21.6151765Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6151802Z with policy(): 2025-12-04T12:10:21.6151954Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6152010Z raise RuntimeError(msg) 2025-12-04T12:10:21.6152407Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6152432Z 2025-12-04T12:10:21.6152506Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6152765Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6152770Z 2025-12-04T12:10:21.6152855Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6152927Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6152974Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6153030Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6153095Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6153192Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6153230Z graph_break [] 2025-12-04T12:10:21.6153291Z aten_mm_info 
[('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6153365Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6153420Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6153476Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6153572Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6153636Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6153673Z graph_break [] 2025-12-04T12:10:21.6153731Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6153783Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6153929Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6153975Z Traceback (most recent call last): 2025-12-04T12:10:21.6154140Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6154182Z method(*args, **kwargs) 2025-12-04T12:10:21.6154332Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6154372Z method(*args, **kwargs) 2025-12-04T12:10:21.6154522Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6154560Z with policy(): 2025-12-04T12:10:21.6154714Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6154755Z raise RuntimeError(msg) 2025-12-04T12:10:21.6155146Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6155150Z 2025-12-04T12:10:21.6155226Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6155482Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6155485Z 2025-12-04T12:10:21.6155571Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6155654Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6155698Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6155753Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6155818Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6155915Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6155964Z graph_break [] 2025-12-04T12:10:21.6156022Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6156096Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6156139Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6156194Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6156289Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6156355Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6156391Z graph_break [] 2025-12-04T12:10:21.6156451Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6156522Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6156565Z frames [('total', 1), ('ok', 
1)] 2025-12-04T12:10:21.6156619Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6156715Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6156778Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6156828Z graph_break [] 2025-12-04T12:10:21.6156884Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6157078Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f08cf26280cf2c5e.xml - 2025-12-04T12:10:21.6157139Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6157738Z FAILED [0.3125s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6157742Z 2025-12-04T12:10:21.6157815Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6158071Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6158073Z 2025-12-04T12:10:21.6158158Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6158221Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6158289Z ================== 1 failed, 187 deselected, 2 rerun in 2.49s ================== 2025-12-04T12:10:21.6158328Z Got exit code 1 2025-12-04T12:10:21.6158370Z Retrying single test... 2025-12-04T12:10:21.6158514Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-9cf40d774e55554b.xml 2025-12-04T12:10:21.6158574Z ============================= test session starts ============================== 2025-12-04T12:10:21.6158686Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6158727Z cachedir: .pytest_cache 2025-12-04T12:10:21.6158884Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6158931Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6158975Z configfile: pytest.ini 2025-12-04T12:10:21.6159145Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6159220Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6159480Z stepcurrent: skipping 114 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6159536Z Running 1 items in this shard 2025-12-04T12:10:21.6159539Z 2025-12-04T12:10:21.6159758Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9513s] [100%] 2025-12-04T12:10:21.6159974Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3597s] [100%] 2025-12-04T12:10:21.6160214Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda FAILED [0.3183s] [100%] 2025-12-04T12:10:21.6160220Z 2025-12-04T12:10:21.6160271Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6160417Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6160464Z Traceback (most recent call last): 2025-12-04T12:10:21.6160634Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6160676Z method(*args, **kwargs) 2025-12-04T12:10:21.6160826Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6160867Z method(*args, **kwargs) 2025-12-04T12:10:21.6161018Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6161055Z with policy(): 2025-12-04T12:10:21.6161205Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6161247Z raise RuntimeError(msg) 2025-12-04T12:10:21.6161654Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1092616192. 2025-12-04T12:10:21.6161658Z 2025-12-04T12:10:21.6161730Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6161992Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6161994Z 2025-12-04T12:10:21.6162080Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6162152Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6162195Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6162251Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6162317Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6162414Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6162453Z graph_break [] 2025-12-04T12:10:21.6162511Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6162657Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6162704Z Traceback (most recent call last): 2025-12-04T12:10:21.6162869Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6162912Z method(*args, **kwargs) 2025-12-04T12:10:21.6163061Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6163102Z method(*args, **kwargs) 2025-12-04T12:10:21.6163268Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6163306Z with policy(): 2025-12-04T12:10:21.6163457Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6163499Z raise RuntimeError(msg) 2025-12-04T12:10:21.6163888Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1117782016. 2025-12-04T12:10:21.6163890Z 2025-12-04T12:10:21.6163962Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6164222Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6164225Z 2025-12-04T12:10:21.6164322Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6164394Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6164439Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6164494Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6164558Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6164656Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6164693Z graph_break [] 2025-12-04T12:10:21.6164752Z aten_mm_info 
[('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6164824Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6164866Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6164934Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6165029Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6165095Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6165132Z graph_break [] 2025-12-04T12:10:21.6165190Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6165241Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6165387Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6165435Z Traceback (most recent call last): 2025-12-04T12:10:21.6165587Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6165628Z method(*args, **kwargs) 2025-12-04T12:10:21.6165780Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6165824Z method(*args, **kwargs) 2025-12-04T12:10:21.6165972Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6166010Z with policy(): 2025-12-04T12:10:21.6166160Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6166203Z raise RuntimeError(msg) 2025-12-04T12:10:21.6166607Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6166610Z 2025-12-04T12:10:21.6166683Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6166953Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6166957Z 2025-12-04T12:10:21.6167045Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6167116Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6167159Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6167214Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6167280Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6167376Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6167416Z graph_break [] 2025-12-04T12:10:21.6167474Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6167548Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6167591Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6167646Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6167750Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6167813Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6167853Z graph_break [] 2025-12-04T12:10:21.6167909Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6167983Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6168027Z frames [('total', 1), ('ok', 
1)] 2025-12-04T12:10:21.6168082Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6168175Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6168238Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6168285Z graph_break [] 2025-12-04T12:10:21.6168343Z aten_mm_info [('aten._scaled_mm.default_16_32_16', 1)] 2025-12-04T12:10:21.6168530Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-9cf40d774e55554b.xml - 2025-12-04T12:10:21.6168591Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6169184Z FAILED [0.3183s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1117782016 and is now 1142947840. 2025-12-04T12:10:21.6169187Z 2025-12-04T12:10:21.6169258Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6169517Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6169523Z 2025-12-04T12:10:21.6169609Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6169670Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6169737Z ================== 1 failed, 187 deselected, 2 rerun in 2.65s ================== 2025-12-04T12:10:21.6169789Z Got exit code 1 2025-12-04T12:10:21.6169997Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6170163Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6170322Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-70e45ac4f59bc65d.xml 2025-12-04T12:10:21.6170379Z ============================= test session starts ============================== 2025-12-04T12:10:21.6170491Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6170534Z cachedir: .pytest_cache 2025-12-04T12:10:21.6170691Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6170740Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6170781Z configfile: pytest.ini 2025-12-04T12:10:21.6170942Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6171017Z collecting ... collected 188 items / 115 deselected / 73 selected 2025-12-04T12:10:21.6171072Z stepcurrent: skipping 115 already run items. 
2025-12-04T12:10:21.6171119Z Running 73 items in this shard 2025-12-04T12:10:21.6171121Z 2025-12-04T12:10:21.6171350Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8545s] [ 1%] 2025-12-04T12:10:21.6171586Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.4842s] [ 1%] 2025-12-04T12:10:21.6171783Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda FAILED [0.4145s] [ 1%] 2025-12-04T12:10:21.6171786Z 2025-12-04T12:10:21.6171837Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6171985Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6172056Z Traceback (most recent call last): 2025-12-04T12:10:21.6172211Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6172253Z method(*args, **kwargs) 2025-12-04T12:10:21.6172403Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6172446Z method(*args, **kwargs) 2025-12-04T12:10:21.6172595Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6172633Z with policy(): 2025-12-04T12:10:21.6172783Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6172825Z raise RuntimeError(msg) 2025-12-04T12:10:21.6173222Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.6173227Z 2025-12-04T12:10:21.6173299Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6173563Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6173565Z 2025-12-04T12:10:21.6173661Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6173734Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6173778Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6173835Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6174324Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6174435Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6174473Z graph_break [] 2025-12-04T12:10:21.6174532Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6174606Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6175091Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6175141Z current_size = base.storage().size() 2025-12-04T12:10:21.6175193Z Autotune Choices Stats: 2025-12-04T12:10:21.6175568Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006560000125318766, "best_triton_pos": 0} 2025-12-04T12:10:21.6175620Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6175666Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6175765Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6176012Z triton_mm_1 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6176248Z triton_mm_0 0.0096 ms 68.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6176293Z _scaled_mm 0.0226 ms 29.0% 2025-12-04T12:10:21.6176422Z SingleProcess AUTOTUNE benchmarking takes 0.0234 seconds and 0.0982 seconds precompiling for 3 choices 2025-12-04T12:10:21.6176572Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6176620Z Traceback (most recent call last): 2025-12-04T12:10:21.6176774Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6176815Z method(*args, **kwargs) 2025-12-04T12:10:21.6176966Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6177008Z method(*args, **kwargs) 2025-12-04T12:10:21.6177158Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6177195Z with policy(): 2025-12-04T12:10:21.6177347Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6177388Z raise RuntimeError(msg) 2025-12-04T12:10:21.6177791Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1050673152. 
2025-12-04T12:10:21.6177794Z 2025-12-04T12:10:21.6177877Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6178141Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6178145Z 2025-12-04T12:10:21.6178231Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6178305Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6178348Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6178405Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6178890Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6178989Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6179039Z graph_break [] 2025-12-04T12:10:21.6179098Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6179170Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6179655Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6179703Z current_size = base.storage().size() 2025-12-04T12:10:21.6179744Z Autotune Choices Stats: 2025-12-04T12:10:21.6180172Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006560000125318766, "best_triton_pos": 0} 2025-12-04T12:10:21.6180223Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6180268Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6180366Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6180600Z triton_mm_1 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6180829Z triton_mm_0 0.0096 ms 68.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6180872Z _scaled_mm 0.0226 ms 29.0% 2025-12-04T12:10:21.6181001Z SingleProcess AUTOTUNE benchmarking takes 0.0234 seconds and 0.0982 seconds precompiling for 3 choices 2025-12-04T12:10:21.6181074Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6181117Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6181174Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6181284Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6181767Z inductor [('triton_bundler_save_kernel', 
24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6181818Z graph_break [] 2025-12-04T12:10:21.6181876Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6181950Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6181991Z Autotune Choices Stats: 2025-12-04T12:10:21.6182352Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00583899999037385, "best_triton_pos": 0} 2025-12-04T12:10:21.6182401Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6182445Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6182542Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6182774Z triton_mm_3 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6183013Z triton_mm_2 0.0086 ms 68.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6183055Z _scaled_mm 0.0195 ms 30.0% 2025-12-04T12:10:21.6183183Z SingleProcess AUTOTUNE benchmarking takes 0.0186 seconds and 0.0741 seconds precompiling for 3 choices 
2025-12-04T12:10:21.6183235Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6183387Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6183433Z Traceback (most recent call last): 2025-12-04T12:10:21.6183603Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6183645Z method(*args, **kwargs) 2025-12-04T12:10:21.6183796Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6183837Z method(*args, **kwargs) 2025-12-04T12:10:21.6183986Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6184026Z with policy(): 2025-12-04T12:10:21.6184179Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6184220Z raise RuntimeError(msg) 2025-12-04T12:10:21.6184617Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 
2025-12-04T12:10:21.6184621Z 2025-12-04T12:10:21.6184693Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6184957Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6184959Z 2025-12-04T12:10:21.6185054Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6185126Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6185171Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6185226Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6185705Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6185814Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6185852Z graph_break [] 2025-12-04T12:10:21.6185911Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6185984Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6186467Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6186515Z current_size = base.storage().size() 2025-12-04T12:10:21.6186556Z Autotune Choices Stats: 2025-12-04T12:10:21.6186930Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006560000125318766, "best_triton_pos": 0} 2025-12-04T12:10:21.6186980Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6187023Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6187120Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6187361Z triton_mm_1 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6187592Z triton_mm_0 0.0096 ms 68.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6187635Z _scaled_mm 0.0226 ms 29.0% 2025-12-04T12:10:21.6187762Z SingleProcess AUTOTUNE benchmarking takes 0.0234 seconds and 0.0982 seconds precompiling for 3 choices 2025-12-04T12:10:21.6187833Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6187878Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6187933Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6188033Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6188513Z inductor [('triton_bundler_save_kernel', 
24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6188553Z graph_break [] 2025-12-04T12:10:21.6188612Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6188683Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6188723Z Autotune Choices Stats: 2025-12-04T12:10:21.6189092Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00583899999037385, "best_triton_pos": 0} 2025-12-04T12:10:21.6189147Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6189201Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6189299Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6189530Z triton_mm_3 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6189757Z triton_mm_2 0.0086 ms 68.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6189801Z _scaled_mm 0.0195 ms 30.0% 2025-12-04T12:10:21.6189927Z SingleProcess AUTOTUNE benchmarking takes 0.0186 seconds and 0.0741 seconds precompiling for 3 choices 
2025-12-04T12:10:21.6189998Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6190045Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6190131Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6190229Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6190728Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6190766Z graph_break [] 2025-12-04T12:10:21.6190826Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6190896Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6190937Z Autotune Choices Stats: 2025-12-04T12:10:21.6191310Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.6191362Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6191406Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6191504Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6191736Z triton_mm_4 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.6191961Z triton_mm_5 0.0064 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6192003Z _scaled_mm 0.0235 ms 25.0% 2025-12-04T12:10:21.6192129Z SingleProcess AUTOTUNE benchmarking takes 0.0162 seconds and 0.0819 seconds precompiling for 3 choices 2025-12-04T12:10:21.6192317Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-70e45ac4f59bc65d.xml - 2025-12-04T12:10:21.6192375Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6192995Z FAILED [0.4145s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 2025-12-04T12:10:21.6193010Z 2025-12-04T12:10:21.6193084Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6193348Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6193350Z 2025-12-04T12:10:21.6193436Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6193497Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6193564Z ================== 1 failed, 115 deselected, 2 rerun in 2.77s ================== 2025-12-04T12:10:21.6193601Z Got exit code 1 2025-12-04T12:10:21.6193644Z Retrying single test... 2025-12-04T12:10:21.6193787Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e808b55a254c14f7.xml 2025-12-04T12:10:21.6193845Z ============================= test session starts ============================== 2025-12-04T12:10:21.6193956Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6194015Z cachedir: .pytest_cache 2025-12-04T12:10:21.6194175Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6194225Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6194266Z configfile: pytest.ini 2025-12-04T12:10:21.6194430Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6194504Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6194774Z stepcurrent: skipping 115 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6194823Z Running 1 items in this shard 2025-12-04T12:10:21.6194825Z 2025-12-04T12:10:21.6195047Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.1348s] [100%] 2025-12-04T12:10:21.6195272Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.4487s] [100%] 2025-12-04T12:10:21.6195469Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda PASSED [0.6340s] [100%] 2025-12-04T12:10:21.6195471Z 2025-12-04T12:10:21.6195524Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6195672Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6195722Z Traceback (most recent call last): 2025-12-04T12:10:21.6195877Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6195920Z method(*args, **kwargs) 2025-12-04T12:10:21.6196071Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6196115Z method(*args, **kwargs) 2025-12-04T12:10:21.6196274Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6196314Z with policy(): 2025-12-04T12:10:21.6196466Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6196512Z raise RuntimeError(msg) 2025-12-04T12:10:21.6196909Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.6196923Z 2025-12-04T12:10:21.6196995Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6197259Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6197262Z 2025-12-04T12:10:21.6197348Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6197421Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6197467Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6197523Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6198010Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6198117Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6198154Z graph_break [] 2025-12-04T12:10:21.6198214Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6198286Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6198776Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6198827Z current_size = base.storage().size() 2025-12-04T12:10:21.6198870Z Autotune Choices Stats: 2025-12-04T12:10:21.6199236Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.6199285Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6199329Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6199427Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6199664Z triton_mm_0 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6199897Z triton_mm_1 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6199937Z _scaled_mm 0.0271 ms 22.3% 2025-12-04T12:10:21.6200065Z SingleProcess AUTOTUNE benchmarking takes 0.0207 seconds and 0.1343 seconds precompiling for 3 choices 2025-12-04T12:10:21.6200261Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda _ 
2025-12-04T12:10:21.6200309Z Traceback (most recent call last): 2025-12-04T12:10:21.6200463Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6200521Z method(*args, **kwargs) 2025-12-04T12:10:21.6200671Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6200713Z method(*args, **kwargs) 2025-12-04T12:10:21.6200862Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6200901Z with policy(): 2025-12-04T12:10:21.6201053Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6201095Z raise RuntimeError(msg) 2025-12-04T12:10:21.6201492Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1128267776. 
2025-12-04T12:10:21.6201497Z 2025-12-04T12:10:21.6201569Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6201831Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6201845Z 2025-12-04T12:10:21.6201930Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6202003Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6202048Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6202104Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6202597Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6202698Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6202737Z graph_break [] 2025-12-04T12:10:21.6202797Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6202868Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6203349Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6203396Z current_size = base.storage().size() 2025-12-04T12:10:21.6203439Z Autotune Choices Stats: 2025-12-04T12:10:21.6203804Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.6203856Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6203899Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6204006Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6204240Z triton_mm_0 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6204467Z triton_mm_1 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6204522Z _scaled_mm 0.0271 ms 22.3% 2025-12-04T12:10:21.6204649Z SingleProcess AUTOTUNE benchmarking takes 0.0207 seconds and 0.1343 seconds precompiling for 3 choices 2025-12-04T12:10:21.6204721Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6204764Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6204820Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6204921Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6205334Z inductor [('triton_bundler_save_kernel', 
16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6205373Z graph_break [] 2025-12-04T12:10:21.6205432Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6205518Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6205560Z Autotune Choices Stats: 2025-12-04T12:10:21.6206023Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "_scaled_mm", "best_time": 0.0069989999756217, "best_triton_pos": 1, "best_triton_time": 0.00827999971807003, "best_triton_kernel": "triton_mm_3", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1"} 2025-12-04T12:10:21.6206073Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6206117Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6206224Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6206271Z _scaled_mm 0.0070 ms 100.0% 2025-12-04T12:10:21.6206499Z triton_mm_3 0.0083 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6206725Z triton_mm_2 0.0092 ms 75.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6206852Z SingleProcess AUTOTUNE benchmarking takes 0.0243 seconds and 0.0811 seconds 
precompiling for 3 choices 2025-12-04T12:10:21.6207039Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e808b55a254c14f7.xml - 2025-12-04T12:10:21.6207109Z ================== 1 passed, 187 deselected, 2 rerun in 3.24s ================== 2025-12-04T12:10:21.6207147Z Got exit code 0 2025-12-04T12:10:21.6207232Z Test succeeded in new process, continuing with the rest of the tests 2025-12-04T12:10:21.6207377Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a4250ad6c3fb6fa9.xml 2025-12-04T12:10:21.6207434Z ============================= test session starts ============================== 2025-12-04T12:10:21.6207544Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6207587Z cachedir: .pytest_cache 2025-12-04T12:10:21.6207756Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6207802Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6207845Z configfile: pytest.ini 2025-12-04T12:10:21.6208007Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6208102Z collecting ... collected 188 items / 116 deselected / 72 selected 2025-12-04T12:10:21.6208156Z stepcurrent: skipping 116 already run items. 
2025-12-04T12:10:21.6208202Z Running 72 items in this shard 2025-12-04T12:10:21.6208205Z 2025-12-04T12:10:21.6208429Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9465s] [ 1%] 2025-12-04T12:10:21.6208649Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3922s] [ 1%] 2025-12-04T12:10:21.6208844Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda PASSED [0.5158s] [ 1%] 2025-12-04T12:10:21.6209062Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7092s] [ 2%] 2025-12-04T12:10:21.6209277Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7268s] [ 2%] 2025-12-04T12:10:21.6209484Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda FAILED [0.7111s] [ 2%] 2025-12-04T12:10:21.6209486Z 2025-12-04T12:10:21.6209539Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6209690Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6209736Z Traceback (most recent call last): 2025-12-04T12:10:21.6209891Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6209946Z method(*args, **kwargs) 2025-12-04T12:10:21.6210137Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6210180Z method(*args, **kwargs) 2025-12-04T12:10:21.6210330Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6210369Z with policy(): 2025-12-04T12:10:21.6210519Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6210563Z raise RuntimeError(msg) 2025-12-04T12:10:21.6210957Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.6210961Z 2025-12-04T12:10:21.6211035Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6211297Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6211302Z 2025-12-04T12:10:21.6211387Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6211462Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6211521Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6211579Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6212064Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6212175Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6212212Z graph_break [] 2025-12-04T12:10:21.6212271Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6212342Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6212826Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6212873Z current_size = base.storage().size() 2025-12-04T12:10:21.6212917Z Autotune Choices Stats: 2025-12-04T12:10:21.6213281Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00723899994045496, "best_triton_pos": 0} 2025-12-04T12:10:21.6213343Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6213387Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6213486Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6213720Z triton_mm_0 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6213959Z triton_mm_1 0.0084 ms 85.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6214005Z _scaled_mm 0.0284 ms 25.5% 2025-12-04T12:10:21.6214129Z SingleProcess AUTOTUNE benchmarking takes 0.0237 seconds and 0.0920 seconds precompiling for 3 choices 2025-12-04T12:10:21.6214279Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6214328Z Traceback (most recent call last): 2025-12-04T12:10:21.6214485Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6214525Z method(*args, **kwargs) 2025-12-04T12:10:21.6214675Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6214716Z method(*args, **kwargs) 2025-12-04T12:10:21.6214868Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6214906Z with policy(): 2025-12-04T12:10:21.6215060Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6215107Z raise RuntimeError(msg) 2025-12-04T12:10:21.6215510Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1128267776. 
2025-12-04T12:10:21.6215513Z 2025-12-04T12:10:21.6215589Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6215854Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6215868Z 2025-12-04T12:10:21.6215956Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6216029Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6216076Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6216134Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6216613Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6216713Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6216752Z graph_break [] 2025-12-04T12:10:21.6216812Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6216896Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6217381Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6217427Z current_size = base.storage().size() 2025-12-04T12:10:21.6217468Z Autotune Choices Stats: 2025-12-04T12:10:21.6217842Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00723899994045496, "best_triton_pos": 0} 2025-12-04T12:10:21.6217893Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6217939Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6218040Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6218271Z triton_mm_0 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6218493Z triton_mm_1 0.0084 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6218538Z _scaled_mm 0.0284 ms 25.5% 2025-12-04T12:10:21.6218665Z SingleProcess AUTOTUNE benchmarking takes 0.0237 seconds and 0.0920 seconds precompiling for 3 choices 2025-12-04T12:10:21.6218738Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6218784Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6218840Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6218939Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6219362Z inductor [('triton_bundler_save_kernel', 16), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6219399Z graph_break [] 2025-12-04T12:10:21.6219460Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6219533Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6219586Z Autotune Choices Stats: 2025-12-04T12:10:21.6220047Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "_scaled_mm", "best_time": 0.006560000125318766, "best_triton_pos": 1, "best_triton_time": 0.0069599999114871025, "best_triton_kernel": "triton_mm_2", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2"} 2025-12-04T12:10:21.6220199Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32) 2025-12-04T12:10:21.6220244Z strides: [32, 1], [1, 32], [1, 1], [1, 1] 2025-12-04T12:10:21.6220341Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6220386Z _scaled_mm 0.0066 ms 100.0% 2025-12-04T12:10:21.6220614Z triton_mm_2 0.0070 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6220838Z triton_mm_3 0.0074 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6220977Z SingleProcess AUTOTUNE benchmarking takes 0.0151 seconds and 0.0734 seconds precompiling 
for 3 choices 2025-12-04T12:10:21.6221128Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6221175Z Traceback (most recent call last): 2025-12-04T12:10:21.6221330Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6221372Z method(*args, **kwargs) 2025-12-04T12:10:21.6221538Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6221581Z method(*args, **kwargs) 2025-12-04T12:10:21.6221731Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6221769Z with policy(): 2025-12-04T12:10:21.6221922Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6221965Z raise RuntimeError(msg) 2025-12-04T12:10:21.6222360Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1080033280 and is now 1111490560. 
2025-12-04T12:10:21.6222363Z 2025-12-04T12:10:21.6222436Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6222698Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6222702Z 2025-12-04T12:10:21.6222788Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6222859Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6222904Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6222959Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6223078Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6223555Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6223606Z graph_break [] 2025-12-04T12:10:21.6223666Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6223737Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6223778Z Autotune Choices Stats: 2025-12-04T12:10:21.6224144Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007400000002235174, "best_triton_pos": 0} 
2025-12-04T12:10:21.6224199Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6224247Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6224369Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6224602Z triton_mm_7 0.0074 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6224841Z triton_mm_6 0.0075 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6224886Z _scaled_mm 0.0279 ms 26.5% 2025-12-04T12:10:21.6225012Z SingleProcess AUTOTUNE benchmarking takes 0.0257 seconds and 0.2090 seconds precompiling for 3 choices 2025-12-04T12:10:21.6225160Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6225208Z Traceback (most recent call last): 2025-12-04T12:10:21.6225378Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6225420Z method(*args, **kwargs) 2025-12-04T12:10:21.6225573Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6225612Z method(*args, **kwargs) 2025-12-04T12:10:21.6225762Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6225799Z with policy(): 2025-12-04T12:10:21.6225951Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6225993Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6226390Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1111490560 and is now 1121976320. 2025-12-04T12:10:21.6226394Z 2025-12-04T12:10:21.6226467Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6226729Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6226731Z 2025-12-04T12:10:21.6226827Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6226900Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6226952Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6227007Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6227106Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6227599Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6227639Z graph_break [] 2025-12-04T12:10:21.6227707Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6227781Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6227821Z Autotune Choices Stats: 
2025-12-04T12:10:21.6228183Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007400000002235174, "best_triton_pos": 0} 2025-12-04T12:10:21.6228240Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6228301Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6228421Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6228657Z triton_mm_7 0.0074 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6228888Z triton_mm_6 0.0075 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6228931Z _scaled_mm 0.0279 ms 26.5% 2025-12-04T12:10:21.6229067Z SingleProcess AUTOTUNE benchmarking takes 0.0257 seconds and 0.2090 seconds precompiling for 3 choices 2025-12-04T12:10:21.6229141Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6229188Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6229243Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6229344Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6229822Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6229861Z graph_break [] 2025-12-04T12:10:21.6229921Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6229997Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6230038Z Autotune Choices Stats: 2025-12-04T12:10:21.6230436Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0068789999932050705, "best_triton_pos": 0} 2025-12-04T12:10:21.6230491Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6230554Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6230674Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6230908Z triton_mm_9 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6231147Z triton_mm_8 0.0088 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6231193Z _scaled_mm 0.0254 ms 27.1% 2025-12-04T12:10:21.6231321Z SingleProcess AUTOTUNE benchmarking takes 0.0395 seconds and 0.2137 seconds precompiling for 3 choices 2025-12-04T12:10:21.6231373Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6231522Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6231570Z Traceback (most recent call last): 2025-12-04T12:10:21.6231726Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6231766Z method(*args, **kwargs) 2025-12-04T12:10:21.6231919Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6231975Z method(*args, **kwargs) 2025-12-04T12:10:21.6232125Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6232167Z with policy(): 2025-12-04T12:10:21.6232317Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6232360Z raise RuntimeError(msg) 2025-12-04T12:10:21.6232753Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1132462080. 
2025-12-04T12:10:21.6232755Z 2025-12-04T12:10:21.6232844Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6233106Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6233109Z 2025-12-04T12:10:21.6233196Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6233270Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6233317Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6233374Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6233472Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6233947Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6233985Z graph_break [] 2025-12-04T12:10:21.6234046Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6234120Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6234161Z Autotune Choices Stats: 2025-12-04T12:10:21.6234532Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007400000002235174, "best_triton_pos": 0} 
2025-12-04T12:10:21.6234601Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6234661Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6234780Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6235014Z triton_mm_7 0.0074 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6235243Z triton_mm_6 0.0075 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6235284Z _scaled_mm 0.0279 ms 26.5% 2025-12-04T12:10:21.6235410Z SingleProcess AUTOTUNE benchmarking takes 0.0257 seconds and 0.2090 seconds precompiling for 3 choices 2025-12-04T12:10:21.6235487Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6235532Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6235589Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6235688Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6236174Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6236211Z graph_break [] 2025-12-04T12:10:21.6236270Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6236342Z ----------------------------- Captured stderr call 
----------------------------- 2025-12-04T12:10:21.6236384Z Autotune Choices Stats: 2025-12-04T12:10:21.6236759Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0068789999932050705, "best_triton_pos": 0} 2025-12-04T12:10:21.6236814Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6236863Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6236981Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6237214Z triton_mm_9 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6237440Z triton_mm_8 0.0088 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6237484Z _scaled_mm 0.0254 ms 27.1% 2025-12-04T12:10:21.6237611Z SingleProcess AUTOTUNE benchmarking takes 0.0395 seconds and 0.2137 seconds precompiling for 3 choices 2025-12-04T12:10:21.6237685Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6237728Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6237784Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6237893Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6238369Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), 
('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6238417Z graph_break [] 2025-12-04T12:10:21.6238475Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6238548Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6238588Z Autotune Choices Stats: 2025-12-04T12:10:21.6238951Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_10", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.6239003Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6239052Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6239170Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6239408Z triton_mm_10 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6239645Z triton_mm_11 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6239687Z _scaled_mm 0.0264 ms 22.6% 2025-12-04T12:10:21.6239813Z SingleProcess AUTOTUNE benchmarking takes 0.0246 seconds and 0.2231 seconds precompiling for 3 choices 2025-12-04T12:10:21.6240003Z - generated xml file: 
/var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a4250ad6c3fb6fa9.xml - 2025-12-04T12:10:21.6240064Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6240738Z FAILED [0.7111s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1121976320 and is now 1132462080. 2025-12-04T12:10:21.6240742Z 2025-12-04T12:10:21.6240814Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6241076Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6241078Z 2025-12-04T12:10:21.6241164Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6241229Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6241307Z ============= 1 failed, 1 passed, 116 deselected, 4 rerun in 5.03s ============= 2025-12-04T12:10:21.6241350Z Got exit code 1 2025-12-04T12:10:21.6241394Z Retrying single test... 
2025-12-04T12:10:21.6241542Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-55e66602aab96975.xml 2025-12-04T12:10:21.6241600Z ============================= test session starts ============================== 2025-12-04T12:10:21.6241724Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6241765Z cachedir: .pytest_cache 2025-12-04T12:10:21.6241924Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6241971Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6242012Z configfile: pytest.ini 2025-12-04T12:10:21.6242189Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6242264Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6242524Z stepcurrent: skipping 117 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6242567Z Running 1 items in this shard 2025-12-04T12:10:21.6242569Z 2025-12-04T12:10:21.6242791Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9779s] [100%] 2025-12-04T12:10:21.6243011Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.5535s] [100%] 2025-12-04T12:10:21.6243205Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda FAILED [0.6571s] [100%] 2025-12-04T12:10:21.6243221Z 2025-12-04T12:10:21.6243273Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6243420Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6243466Z Traceback (most recent call last): 2025-12-04T12:10:21.6243624Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6243667Z method(*args, **kwargs) 2025-12-04T12:10:21.6243817Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6243857Z method(*args, **kwargs) 2025-12-04T12:10:21.6244018Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6244057Z with policy(): 2025-12-04T12:10:21.6244209Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6244254Z raise RuntimeError(msg) 2025-12-04T12:10:21.6244648Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.6244651Z 2025-12-04T12:10:21.6244723Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6244988Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6244991Z 2025-12-04T12:10:21.6245078Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6245153Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6245195Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6245251Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6245740Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6245839Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6245875Z graph_break [] 2025-12-04T12:10:21.6245946Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6246019Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6246501Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6246549Z current_size = base.storage().size() 2025-12-04T12:10:21.6246591Z Autotune Choices Stats: 2025-12-04T12:10:21.6246959Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006800000090152025, "best_triton_pos": 0} 2025-12-04T12:10:21.6247013Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6247069Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6247190Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6247429Z triton_mm_0 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6247658Z triton_mm_1 0.0071 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6247701Z _scaled_mm 0.0250 ms 27.2% 2025-12-04T12:10:21.6247839Z SingleProcess AUTOTUNE benchmarking takes 0.0192 seconds and 0.1105 seconds precompiling for 3 choices 2025-12-04T12:10:21.6247990Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6248042Z Traceback (most recent call last): 2025-12-04T12:10:21.6248197Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6248238Z method(*args, **kwargs) 2025-12-04T12:10:21.6248389Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6248429Z method(*args, **kwargs) 2025-12-04T12:10:21.6248580Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6248616Z with policy(): 2025-12-04T12:10:21.6248768Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6248812Z raise RuntimeError(msg) 2025-12-04T12:10:21.6249210Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1050673152. 
2025-12-04T12:10:21.6249213Z 2025-12-04T12:10:21.6249285Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6249564Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6249567Z 2025-12-04T12:10:21.6249652Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6249727Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6249785Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6249840Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6250370Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6250468Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6250505Z graph_break [] 2025-12-04T12:10:21.6250563Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6250635Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6251116Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6251206Z current_size = base.storage().size() 2025-12-04T12:10:21.6251246Z Autotune Choices Stats: 2025-12-04T12:10:21.6251614Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006800000090152025, "best_triton_pos": 0} 2025-12-04T12:10:21.6251667Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6251715Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6251864Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6252097Z triton_mm_0 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6252332Z triton_mm_1 0.0071 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6252439Z _scaled_mm 0.0250 ms 27.2% 2025-12-04T12:10:21.6252614Z SingleProcess AUTOTUNE benchmarking takes 0.0192 seconds and 0.1105 seconds precompiling for 3 choices 2025-12-04T12:10:21.6252698Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6252740Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6252796Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6252899Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6253383Z inductor 
[('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6253487Z graph_break [] 2025-12-04T12:10:21.6253611Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6253691Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6253732Z Autotune Choices Stats: 2025-12-04T12:10:21.6254097Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0062790000811219215, "best_triton_pos": 0} 2025-12-04T12:10:21.6254163Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6254210Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6254329Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6254568Z triton_mm_3 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6254865Z triton_mm_2 0.0077 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6254934Z _scaled_mm 0.0274 ms 22.9% 2025-12-04T12:10:21.6255061Z SingleProcess AUTOTUNE benchmarking takes 0.0279 
seconds and 0.0925 seconds precompiling for 3 choices 2025-12-04T12:10:21.6255128Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6255281Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6255330Z Traceback (most recent call last): 2025-12-04T12:10:21.6255485Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6255531Z method(*args, **kwargs) 2025-12-04T12:10:21.6255686Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6255744Z method(*args, **kwargs) 2025-12-04T12:10:21.6255905Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6255945Z with policy(): 2025-12-04T12:10:21.6256100Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6256142Z raise RuntimeError(msg) 2025-12-04T12:10:21.6256538Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 
2025-12-04T12:10:21.6256540Z 2025-12-04T12:10:21.6256618Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6256879Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6256884Z 2025-12-04T12:10:21.6256970Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6257044Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6257087Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6257143Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6257632Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6257730Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6257778Z graph_break [] 2025-12-04T12:10:21.6257839Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6257910Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6258392Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6258439Z current_size = base.storage().size() 2025-12-04T12:10:21.6258479Z Autotune Choices Stats: 2025-12-04T12:10:21.6258842Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006800000090152025, "best_triton_pos": 0} 2025-12-04T12:10:21.6258895Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6258954Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6259073Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6259306Z triton_mm_0 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6259535Z triton_mm_1 0.0071 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6259577Z _scaled_mm 0.0250 ms 27.2% 2025-12-04T12:10:21.6259718Z SingleProcess AUTOTUNE benchmarking takes 0.0192 seconds and 0.1105 seconds precompiling for 3 choices 2025-12-04T12:10:21.6259794Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6259839Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6259895Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6259993Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6260511Z inductor 
[('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6260548Z graph_break [] 2025-12-04T12:10:21.6260607Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6260681Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6260722Z Autotune Choices Stats: 2025-12-04T12:10:21.6261082Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0062790000811219215, "best_triton_pos": 0} 2025-12-04T12:10:21.6261135Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6261203Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6261322Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6261554Z triton_mm_3 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6261796Z triton_mm_2 0.0077 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6261843Z _scaled_mm 0.0274 ms 22.9% 2025-12-04T12:10:21.6261969Z SingleProcess AUTOTUNE benchmarking takes 0.0279 
seconds and 0.0925 seconds precompiling for 3 choices 2025-12-04T12:10:21.6262040Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6262088Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6262143Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6262243Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6262716Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6262768Z graph_break [] 2025-12-04T12:10:21.6262827Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6262899Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6262939Z Autotune Choices Stats: 2025-12-04T12:10:21.6263298Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.6263351Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6263412Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6263531Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6263762Z triton_mm_4 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6263806Z _scaled_mm 0.0064 ms 95.0% 2025-12-04T12:10:21.6264033Z triton_mm_5 0.0082 ms 74.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6264160Z SingleProcess AUTOTUNE benchmarking takes 0.0253 seconds and 0.2242 seconds precompiling for 3 choices 2025-12-04T12:10:21.6264347Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-55e66602aab96975.xml - 2025-12-04T12:10:21.6264409Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6265016Z FAILED [0.6571s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 2025-12-04T12:10:21.6265020Z 2025-12-04T12:10:21.6265092Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6265354Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6265379Z 2025-12-04T12:10:21.6265465Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6265527Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6265595Z ================== 1 failed, 187 deselected, 2 rerun in 3.21s ================== 2025-12-04T12:10:21.6265634Z Got exit code 1 2025-12-04T12:10:21.6265674Z Retrying single test... 2025-12-04T12:10:21.6265818Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-125c00fcfa91626f.xml 2025-12-04T12:10:21.6265874Z ============================= test session starts ============================== 2025-12-04T12:10:21.6265984Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6266025Z cachedir: .pytest_cache 2025-12-04T12:10:21.6266183Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6266231Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6266275Z configfile: pytest.ini 2025-12-04T12:10:21.6266447Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6266523Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6266783Z stepcurrent: skipping 117 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6266827Z Running 1 items in this shard 2025-12-04T12:10:21.6266829Z 2025-12-04T12:10:21.6267050Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9612s] [100%] 2025-12-04T12:10:21.6267278Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.6189s] [100%] 2025-12-04T12:10:21.6267475Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda FAILED [0.7328s] [100%] 2025-12-04T12:10:21.6267477Z 2025-12-04T12:10:21.6267528Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6267677Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6267726Z Traceback (most recent call last): 2025-12-04T12:10:21.6267883Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6267925Z method(*args, **kwargs) 2025-12-04T12:10:21.6268078Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6268121Z method(*args, **kwargs) 2025-12-04T12:10:21.6268273Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6268312Z with policy(): 2025-12-04T12:10:21.6268465Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6268507Z raise RuntimeError(msg) 2025-12-04T12:10:21.6268914Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.6268916Z 2025-12-04T12:10:21.6268990Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6269262Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6269265Z 2025-12-04T12:10:21.6269351Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6269425Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6269470Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6269526Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6270005Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6270145Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6270181Z graph_break [] 2025-12-04T12:10:21.6270259Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6270331Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6270812Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6270859Z current_size = base.storage().size() 2025-12-04T12:10:21.6270900Z Autotune Choices Stats: 2025-12-04T12:10:21.6271287Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007278999779373407, "best_triton_pos": 0} 2025-12-04T12:10:21.6271345Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6271392Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6271513Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6271746Z triton_mm_0 0.0073 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6271975Z triton_mm_1 0.0086 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6272017Z _scaled_mm 0.0247 ms 29.4% 2025-12-04T12:10:21.6272143Z SingleProcess AUTOTUNE benchmarking takes 0.0199 seconds and 0.1140 seconds precompiling for 3 choices 2025-12-04T12:10:21.6272292Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6272341Z Traceback (most recent call last): 2025-12-04T12:10:21.6272495Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6272551Z method(*args, **kwargs) 2025-12-04T12:10:21.6272704Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6272744Z method(*args, **kwargs) 2025-12-04T12:10:21.6272894Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6272944Z with policy(): 2025-12-04T12:10:21.6273096Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6273139Z raise RuntimeError(msg) 2025-12-04T12:10:21.6273537Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1050673152. 
2025-12-04T12:10:21.6273540Z 2025-12-04T12:10:21.6273613Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6273876Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6273879Z 2025-12-04T12:10:21.6273966Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6274037Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6274091Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6274147Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6274624Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6274721Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6274759Z graph_break [] 2025-12-04T12:10:21.6274817Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6274901Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6275513Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6275561Z current_size = base.storage().size() 2025-12-04T12:10:21.6275603Z Autotune Choices Stats: 2025-12-04T12:10:21.6275967Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007278999779373407, "best_triton_pos": 0} 2025-12-04T12:10:21.6276022Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6276071Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6276191Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6276426Z triton_mm_0 0.0073 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6276666Z triton_mm_1 0.0086 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6276711Z _scaled_mm 0.0247 ms 29.4% 2025-12-04T12:10:21.6276836Z SingleProcess AUTOTUNE benchmarking takes 0.0199 seconds and 0.1140 seconds precompiling for 3 choices 2025-12-04T12:10:21.6276931Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6276974Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6277032Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6277131Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6277611Z inductor 
[('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6277648Z graph_break [] 2025-12-04T12:10:21.6277709Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6277781Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6277824Z Autotune Choices Stats: 2025-12-04T12:10:21.6278183Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.6278247Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6278295Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6278416Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6278650Z triton_mm_3 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6278886Z triton_mm_2 0.0061 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6278933Z _scaled_mm 0.0215 ms 27.7% 2025-12-04T12:10:21.6279060Z SingleProcess AUTOTUNE benchmarking takes 0.0192 
seconds and 0.0782 seconds precompiling for 3 choices 2025-12-04T12:10:21.6279114Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6279261Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6279307Z Traceback (most recent call last): 2025-12-04T12:10:21.6279461Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6279505Z method(*args, **kwargs) 2025-12-04T12:10:21.6279658Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6279700Z method(*args, **kwargs) 2025-12-04T12:10:21.6279850Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6279888Z with policy(): 2025-12-04T12:10:21.6280040Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6280084Z raise RuntimeError(msg) 2025-12-04T12:10:21.6280533Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 
2025-12-04T12:10:21.6280537Z 2025-12-04T12:10:21.6280609Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6280886Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6280889Z 2025-12-04T12:10:21.6280975Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6281048Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6281090Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6281146Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6281627Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6281727Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6281775Z graph_break [] 2025-12-04T12:10:21.6281836Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6281907Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6282387Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6282435Z current_size = base.storage().size() 2025-12-04T12:10:21.6282475Z Autotune Choices Stats: 2025-12-04T12:10:21.6282856Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007278999779373407, "best_triton_pos": 0} 2025-12-04T12:10:21.6282910Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6282958Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6283078Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6283315Z triton_mm_0 0.0073 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6283546Z triton_mm_1 0.0086 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6283589Z _scaled_mm 0.0247 ms 29.4% 2025-12-04T12:10:21.6283718Z SingleProcess AUTOTUNE benchmarking takes 0.0199 seconds and 0.1140 seconds precompiling for 3 choices 2025-12-04T12:10:21.6283791Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6283835Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6283890Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6283990Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6284477Z inductor 
[('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6284528Z graph_break [] 2025-12-04T12:10:21.6284588Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6284663Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6284702Z Autotune Choices Stats: 2025-12-04T12:10:21.6285062Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.6285113Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6285162Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6285282Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6285516Z triton_mm_3 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6285755Z triton_mm_2 0.0061 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6285797Z _scaled_mm 0.0215 ms 27.7% 2025-12-04T12:10:21.6285925Z SingleProcess AUTOTUNE benchmarking takes 0.0192 
seconds and 0.0782 seconds precompiling for 3 choices 2025-12-04T12:10:21.6285998Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6286043Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6286099Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6286208Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6286682Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6286720Z graph_break [] 2025-12-04T12:10:21.6286778Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6286853Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6286894Z Autotune Choices Stats: 2025-12-04T12:10:21.6287256Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.6287310Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6287358Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6287479Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6287711Z triton_mm_5 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6287952Z triton_mm_4 0.0104 ms 60.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=False, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6287995Z _scaled_mm 0.0284 ms 22.0% 2025-12-04T12:10:21.6288122Z SingleProcess AUTOTUNE benchmarking takes 0.0272 seconds and 0.0950 seconds precompiling for 3 choices 2025-12-04T12:10:21.6288317Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-125c00fcfa91626f.xml - 2025-12-04T12:10:21.6288379Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6288974Z FAILED [0.7328s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 2025-12-04T12:10:21.6288977Z 2025-12-04T12:10:21.6289049Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6289314Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6289326Z 2025-12-04T12:10:21.6289411Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6289473Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6289540Z ================== 1 failed, 187 deselected, 2 rerun in 3.33s ================== 2025-12-04T12:10:21.6289580Z Got exit code 1 2025-12-04T12:10:21.6289790Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda 2025-12-04T12:10:21.6289921Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6290076Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-99216aeae7d7470e.xml 2025-12-04T12:10:21.6290188Z ============================= test session starts ============================== 2025-12-04T12:10:21.6290301Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6290341Z cachedir: .pytest_cache 2025-12-04T12:10:21.6290499Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6290546Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6290587Z configfile: pytest.ini 2025-12-04T12:10:21.6290752Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6290832Z collecting ... collected 188 items / 118 deselected / 70 selected 2025-12-04T12:10:21.6290885Z stepcurrent: skipping 118 already run items. 
2025-12-04T12:10:21.6290932Z Running 70 items in this shard 2025-12-04T12:10:21.6290935Z 2025-12-04T12:10:21.6291157Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9962s] [ 1%] 2025-12-04T12:10:21.6291376Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.6148s] [ 1%] 2025-12-04T12:10:21.6291588Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda FAILED [0.7124s] [ 1%] 2025-12-04T12:10:21.6291591Z 2025-12-04T12:10:21.6291642Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6291793Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6291841Z Traceback (most recent call last): 2025-12-04T12:10:21.6292011Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6292054Z method(*args, **kwargs) 2025-12-04T12:10:21.6292209Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6292251Z method(*args, **kwargs) 2025-12-04T12:10:21.6292402Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6292439Z with policy(): 2025-12-04T12:10:21.6292592Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6292634Z raise RuntimeError(msg) 2025-12-04T12:10:21.6293030Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.6293046Z 2025-12-04T12:10:21.6293121Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6293384Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6293386Z 2025-12-04T12:10:21.6293473Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6293545Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6293589Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6293645Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6294143Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6294242Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6294280Z graph_break [] 2025-12-04T12:10:21.6294339Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6294413Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6294898Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6294946Z current_size = base.storage().size() 2025-12-04T12:10:21.6294986Z Autotune Choices Stats: 2025-12-04T12:10:21.6295352Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.6295422Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6295470Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6295591Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6295820Z triton_mm_1 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6296061Z triton_mm_0 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6296104Z _scaled_mm 0.0074 ms 91.3% 2025-12-04T12:10:21.6296231Z SingleProcess AUTOTUNE benchmarking takes 0.0220 seconds and 0.1076 seconds precompiling for 3 choices 2025-12-04T12:10:21.6296378Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6296426Z Traceback (most recent call last): 2025-12-04T12:10:21.6296580Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6296624Z method(*args, **kwargs) 2025-12-04T12:10:21.6296777Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6296820Z method(*args, **kwargs) 2025-12-04T12:10:21.6296973Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6297020Z with policy(): 2025-12-04T12:10:21.6297171Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6297214Z raise RuntimeError(msg) 2025-12-04T12:10:21.6297617Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1050673152. 
2025-12-04T12:10:21.6297619Z 2025-12-04T12:10:21.6297691Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6297963Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6297966Z 2025-12-04T12:10:21.6298052Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6298124Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6298168Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6298226Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6298707Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6298806Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6298844Z graph_break [] 2025-12-04T12:10:21.6298905Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6298979Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6299467Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6299515Z current_size = base.storage().size() 2025-12-04T12:10:21.6299555Z Autotune Choices Stats: 2025-12-04T12:10:21.6299922Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.6299987Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6300034Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6300199Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6300430Z triton_mm_1 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6300657Z triton_mm_0 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6300700Z _scaled_mm 0.0074 ms 91.3% 2025-12-04T12:10:21.6300826Z SingleProcess AUTOTUNE benchmarking takes 0.0220 seconds and 0.1076 seconds precompiling for 3 choices 2025-12-04T12:10:21.6300915Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6300961Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6301016Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6301115Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6301605Z inductor 
[('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6301646Z graph_break [] 2025-12-04T12:10:21.6301705Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6301780Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6301820Z Autotune Choices Stats: 2025-12-04T12:10:21.6302183Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:21.6302239Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6302286Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6302408Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6302637Z triton_mm_2 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6302863Z triton_mm_3 0.0093 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6302907Z _scaled_mm 0.0249 ms 27.5% 2025-12-04T12:10:21.6303046Z SingleProcess AUTOTUNE benchmarking takes 0.0293 seconds 
and 0.0997 seconds precompiling for 3 choices 2025-12-04T12:10:21.6303100Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6303247Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6303294Z Traceback (most recent call last): 2025-12-04T12:10:21.6303463Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6303506Z method(*args, **kwargs) 2025-12-04T12:10:21.6303657Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6303702Z method(*args, **kwargs) 2025-12-04T12:10:21.6303851Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6303889Z with policy(): 2025-12-04T12:10:21.6304041Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6304083Z raise RuntimeError(msg) 2025-12-04T12:10:21.6304485Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 
2025-12-04T12:10:21.6304498Z 2025-12-04T12:10:21.6304571Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6304831Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6304833Z 2025-12-04T12:10:21.6304922Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6304995Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6305041Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6305099Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6305597Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6305697Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6305734Z graph_break [] 2025-12-04T12:10:21.6305794Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6305865Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6306346Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6306394Z current_size = base.storage().size() 2025-12-04T12:10:21.6306435Z Autotune Choices Stats: 2025-12-04T12:10:21.6306803Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.6306867Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6306917Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6307036Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6307265Z triton_mm_1 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6307497Z triton_mm_0 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6307540Z _scaled_mm 0.0074 ms 91.3% 2025-12-04T12:10:21.6307666Z SingleProcess AUTOTUNE benchmarking takes 0.0220 seconds and 0.1076 seconds precompiling for 3 choices 2025-12-04T12:10:21.6307738Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6307783Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6307839Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6307937Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6308414Z inductor 
[('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6308467Z graph_break [] 2025-12-04T12:10:21.6308525Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6308597Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6308637Z Autotune Choices Stats: 2025-12-04T12:10:21.6308998Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:21.6309061Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6309111Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6309229Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6309458Z triton_mm_2 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6309679Z triton_mm_3 0.0093 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6309723Z _scaled_mm 0.0249 ms 27.5% 2025-12-04T12:10:21.6309848Z SingleProcess AUTOTUNE benchmarking takes 0.0293 seconds 
and 0.0997 seconds precompiling for 3 choices 2025-12-04T12:10:21.6309922Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6309966Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6310022Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6310160Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6310647Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6310685Z graph_break [] 2025-12-04T12:10:21.6310743Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6310816Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6310868Z Autotune Choices Stats: 2025-12-04T12:10:21.6311227Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:21.6311280Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6311328Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6311446Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6311674Z triton_mm_5 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6311897Z triton_mm_4 0.0064 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6311954Z _scaled_mm 0.0272 ms 23.1% 2025-12-04T12:10:21.6312080Z SingleProcess AUTOTUNE benchmarking takes 0.0300 seconds and 0.1982 seconds precompiling for 3 choices 2025-12-04T12:10:21.6312266Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-99216aeae7d7470e.xml - 2025-12-04T12:10:21.6312326Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6312930Z FAILED [0.7124s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 2025-12-04T12:10:21.6312934Z 2025-12-04T12:10:21.6313009Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6313270Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6313272Z 2025-12-04T12:10:21.6313358Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6313421Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6313488Z ================== 1 failed, 118 deselected, 2 rerun in 3.34s ================== 2025-12-04T12:10:21.6313526Z Got exit code 1 2025-12-04T12:10:21.6313566Z Retrying single test... 2025-12-04T12:10:21.6313712Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ca26df26b65bcbc0.xml 2025-12-04T12:10:21.6313769Z ============================= test session starts ============================== 2025-12-04T12:10:21.6313881Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6313921Z cachedir: .pytest_cache 2025-12-04T12:10:21.6314079Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6314127Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6314170Z configfile: pytest.ini 2025-12-04T12:10:21.6314341Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6314417Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6314675Z stepcurrent: skipping 118 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6314730Z Running 1 items in this shard 2025-12-04T12:10:21.6314733Z 2025-12-04T12:10:21.6314951Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9197s] [100%] 2025-12-04T12:10:21.6315168Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.4874s] [100%] 2025-12-04T12:10:21.6315362Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda FAILED [0.5341s] [100%] 2025-12-04T12:10:21.6315364Z 2025-12-04T12:10:21.6315414Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6315561Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6315609Z Traceback (most recent call last): 2025-12-04T12:10:21.6315776Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6315819Z method(*args, **kwargs) 2025-12-04T12:10:21.6315972Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6316017Z method(*args, **kwargs) 2025-12-04T12:10:21.6316168Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6316204Z with policy(): 2025-12-04T12:10:21.6316357Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6316400Z raise RuntimeError(msg) 2025-12-04T12:10:21.6316802Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.6316805Z 2025-12-04T12:10:21.6316880Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6317140Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6317143Z 2025-12-04T12:10:21.6317227Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6317298Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6317345Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6317403Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6317881Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6317979Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6318026Z graph_break [] 2025-12-04T12:10:21.6318085Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6318159Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6318643Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6318701Z current_size = base.storage().size() 2025-12-04T12:10:21.6318742Z Autotune Choices Stats: 2025-12-04T12:10:21.6319105Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.6319159Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6319206Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6319329Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6319563Z triton_mm_1 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6319800Z triton_mm_0 0.0089 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6319843Z _scaled_mm 0.0262 ms 24.5% 2025-12-04T12:10:21.6319969Z SingleProcess AUTOTUNE benchmarking takes 0.0240 seconds and 0.1120 seconds precompiling for 3 choices 2025-12-04T12:10:21.6320150Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6320198Z Traceback (most recent call last): 2025-12-04T12:10:21.6320365Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6320410Z method(*args, **kwargs) 2025-12-04T12:10:21.6320560Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6320601Z method(*args, **kwargs) 2025-12-04T12:10:21.6320751Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6320788Z with policy(): 2025-12-04T12:10:21.6320940Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6320983Z raise RuntimeError(msg) 2025-12-04T12:10:21.6321377Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1050673152. 
2025-12-04T12:10:21.6321380Z 2025-12-04T12:10:21.6321452Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6323874Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6323877Z 2025-12-04T12:10:21.6323966Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6324064Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6324111Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6324170Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6324653Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6324781Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6324819Z graph_break [] 2025-12-04T12:10:21.6324879Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6324953Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6325438Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6325488Z current_size = base.storage().size() 2025-12-04T12:10:21.6325528Z Autotune Choices Stats: 2025-12-04T12:10:21.6325892Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.6325971Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6326018Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6326140Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6326371Z triton_mm_1 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6326608Z triton_mm_0 0.0089 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6326654Z _scaled_mm 0.0262 ms 24.5% 2025-12-04T12:10:21.6326782Z SingleProcess AUTOTUNE benchmarking takes 0.0240 seconds and 0.1120 seconds precompiling for 3 choices 2025-12-04T12:10:21.6326856Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6326900Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6326958Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6327057Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6327541Z inductor 
[('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6327582Z graph_break [] 2025-12-04T12:10:21.6327642Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6327715Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6327757Z Autotune Choices Stats: 2025-12-04T12:10:21.6328124Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.6328178Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6328225Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6328357Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6328585Z triton_mm_2 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6328810Z triton_mm_3 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6328853Z _scaled_mm 0.0253 ms 23.6% 2025-12-04T12:10:21.6328979Z SingleProcess AUTOTUNE benchmarking takes 0.0172 seconds 
and 0.0915 seconds precompiling for 3 choices 2025-12-04T12:10:21.6329032Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6329180Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6329229Z Traceback (most recent call last): 2025-12-04T12:10:21.6329396Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6329439Z method(*args, **kwargs) 2025-12-04T12:10:21.6329590Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6329633Z method(*args, **kwargs) 2025-12-04T12:10:21.6329784Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6329822Z with policy(): 2025-12-04T12:10:21.6329976Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6330020Z raise RuntimeError(msg) 2025-12-04T12:10:21.6330470Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 
2025-12-04T12:10:21.6330475Z 2025-12-04T12:10:21.6330549Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6330811Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6330813Z 2025-12-04T12:10:21.6330899Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6330974Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6331017Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6331079Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6331558Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6331658Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6331705Z graph_break [] 2025-12-04T12:10:21.6331766Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6331838Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6332321Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6332383Z current_size = base.storage().size() 2025-12-04T12:10:21.6332423Z Autotune Choices Stats: 2025-12-04T12:10:21.6332785Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.6332838Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6332886Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6333006Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6333237Z triton_mm_1 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6333474Z triton_mm_0 0.0089 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6333517Z _scaled_mm 0.0262 ms 24.5% 2025-12-04T12:10:21.6333644Z SingleProcess AUTOTUNE benchmarking takes 0.0240 seconds and 0.1120 seconds precompiling for 3 choices 2025-12-04T12:10:21.6333717Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6333763Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6333819Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6333929Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6334406Z inductor 
[('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6334444Z graph_break [] 2025-12-04T12:10:21.6334502Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6334577Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6334617Z Autotune Choices Stats: 2025-12-04T12:10:21.6334974Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.6335027Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6335076Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6335195Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6335433Z triton_mm_2 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6335656Z triton_mm_3 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6335699Z _scaled_mm 0.0253 ms 23.6% 2025-12-04T12:10:21.6335838Z SingleProcess AUTOTUNE benchmarking takes 0.0172 seconds 
and 0.0915 seconds precompiling for 3 choices 2025-12-04T12:10:21.6335910Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6335955Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6336010Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6336108Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6336584Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6336623Z graph_break [] 2025-12-04T12:10:21.6336681Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6336756Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6336796Z Autotune Choices Stats: 2025-12-04T12:10:21.6337164Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005840000230818987, "best_triton_pos": 0} 2025-12-04T12:10:21.6337217Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6337265Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6337385Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6337622Z triton_mm_4 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6337847Z triton_mm_5 0.0060 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6337891Z _scaled_mm 0.0264 ms 22.1% 2025-12-04T12:10:21.6338017Z SingleProcess AUTOTUNE benchmarking takes 0.0170 seconds and 0.0775 seconds precompiling for 3 choices 2025-12-04T12:10:21.6338207Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ca26df26b65bcbc0.xml - 2025-12-04T12:10:21.6338268Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6338863Z FAILED [0.5341s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 2025-12-04T12:10:21.6338869Z 2025-12-04T12:10:21.6338941Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6339219Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6339222Z 2025-12-04T12:10:21.6339308Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6339370Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6339437Z ================== 1 failed, 187 deselected, 2 rerun in 2.96s ================== 2025-12-04T12:10:21.6339487Z Got exit code 1 2025-12-04T12:10:21.6339527Z Retrying single test... 2025-12-04T12:10:21.6339672Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-37080ab364b1eb8b.xml 2025-12-04T12:10:21.6339730Z ============================= test session starts ============================== 2025-12-04T12:10:21.6339841Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6339881Z cachedir: .pytest_cache 2025-12-04T12:10:21.6340045Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6340128Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6340170Z configfile: pytest.ini 2025-12-04T12:10:21.6340334Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6340409Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6340668Z stepcurrent: skipping 118 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6340724Z Running 1 items in this shard 2025-12-04T12:10:21.6340726Z 2025-12-04T12:10:21.6340946Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8390s] [100%] 2025-12-04T12:10:21.6341162Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.4532s] [100%] 2025-12-04T12:10:21.6341355Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda FAILED [0.5314s] [100%] 2025-12-04T12:10:21.6341358Z 2025-12-04T12:10:21.6341421Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6341569Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6341620Z Traceback (most recent call last): 2025-12-04T12:10:21.6341776Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6341818Z method(*args, **kwargs) 2025-12-04T12:10:21.6341971Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6342014Z method(*args, **kwargs) 2025-12-04T12:10:21.6342163Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6342201Z with policy(): 2025-12-04T12:10:21.6342351Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6342395Z raise RuntimeError(msg) 2025-12-04T12:10:21.6342787Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.6342790Z 2025-12-04T12:10:21.6342863Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6343136Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6343139Z 2025-12-04T12:10:21.6343226Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6343299Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6343361Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6343416Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6343901Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6344002Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6344039Z graph_break [] 2025-12-04T12:10:21.6344099Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6344171Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6344653Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6344709Z current_size = base.storage().size() 2025-12-04T12:10:21.6344750Z Autotune Choices Stats: 2025-12-04T12:10:21.6345207Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.6345262Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6345311Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6345447Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6345679Z triton_mm_1 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6345905Z triton_mm_0 0.0075 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6345947Z _scaled_mm 0.0262 ms 23.2% 2025-12-04T12:10:21.6346074Z SingleProcess AUTOTUNE benchmarking takes 0.0174 seconds and 0.0911 seconds precompiling for 3 choices 2025-12-04T12:10:21.6346220Z _ 
TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6346269Z Traceback (most recent call last): 2025-12-04T12:10:21.6346427Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6346471Z method(*args, **kwargs) 2025-12-04T12:10:21.6346623Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6346666Z method(*args, **kwargs) 2025-12-04T12:10:21.6346817Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6346863Z with policy(): 2025-12-04T12:10:21.6347016Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6347059Z raise RuntimeError(msg) 2025-12-04T12:10:21.6347457Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1050673152. 
2025-12-04T12:10:21.6347470Z 2025-12-04T12:10:21.6347544Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6347804Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6347806Z 2025-12-04T12:10:21.6347893Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6347965Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6348010Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6348065Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6348543Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6348653Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6348689Z graph_break [] 2025-12-04T12:10:21.6348748Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6348822Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6349316Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6349364Z current_size = base.storage().size() 2025-12-04T12:10:21.6349408Z Autotune Choices Stats: 2025-12-04T12:10:21.6349771Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.6349825Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6349873Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6349993Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6350257Z triton_mm_1 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6350481Z triton_mm_0 0.0075 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6350526Z _scaled_mm 0.0262 ms 23.2% 2025-12-04T12:10:21.6350652Z SingleProcess AUTOTUNE benchmarking takes 0.0174 seconds and 0.0911 seconds precompiling for 3 choices 2025-12-04T12:10:21.6350737Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6350781Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6350841Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6350939Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6351425Z inductor 
[('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6351474Z graph_break [] 2025-12-04T12:10:21.6351534Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6351605Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6351647Z Autotune Choices Stats: 2025-12-04T12:10:21.6352001Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.6352057Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6352105Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6352236Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6352464Z triton_mm_2 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6352688Z triton_mm_3 0.0090 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6352731Z _scaled_mm 0.0236 ms 28.5% 2025-12-04T12:10:21.6352855Z SingleProcess AUTOTUNE benchmarking takes 0.0189 seconds 
and 0.0862 seconds precompiling for 3 choices 2025-12-04T12:10:21.6352927Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6353074Z _ TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6353123Z Traceback (most recent call last): 2025-12-04T12:10:21.6353278Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6353320Z method(*args, **kwargs) 2025-12-04T12:10:21.6353475Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6353516Z method(*args, **kwargs) 2025-12-04T12:10:21.6353665Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6353703Z with policy(): 2025-12-04T12:10:21.6353855Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6353900Z raise RuntimeError(msg) 2025-12-04T12:10:21.6354294Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 
2025-12-04T12:10:21.6354297Z 2025-12-04T12:10:21.6354369Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6354641Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6354643Z 2025-12-04T12:10:21.6354729Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6354812Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6354856Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6354912Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6355391Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6355490Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6355526Z graph_break [] 2025-12-04T12:10:21.6355585Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6355657Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6356141Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6356198Z current_size = base.storage().size() 2025-12-04T12:10:21.6356238Z Autotune Choices Stats: 2025-12-04T12:10:21.6356599Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.6356651Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6356700Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6356830Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6357064Z triton_mm_1 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6357287Z triton_mm_0 0.0075 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6357330Z _scaled_mm 0.0262 ms 23.2% 2025-12-04T12:10:21.6357456Z SingleProcess AUTOTUNE benchmarking takes 0.0174 seconds and 0.0911 seconds precompiling for 3 choices 2025-12-04T12:10:21.6357528Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6357570Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6357627Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6357726Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6358207Z inductor 
[('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6358256Z graph_break [] 2025-12-04T12:10:21.6358315Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6358388Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6358427Z Autotune Choices Stats: 2025-12-04T12:10:21.6358782Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.6358846Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6358894Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6359013Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6359241Z triton_mm_2 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6359465Z triton_mm_3 0.0090 ms 74.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6359509Z _scaled_mm 0.0236 ms 28.5% 2025-12-04T12:10:21.6359635Z SingleProcess AUTOTUNE benchmarking takes 0.0189 seconds 
and 0.0862 seconds precompiling for 3 choices 2025-12-04T12:10:21.6359718Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6359762Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6359818Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6359918Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6360455Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('generated_module_cache_miss', 2), ('select_algorithm_num_precompiles', 2), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6360494Z graph_break [] 2025-12-04T12:10:21.6360553Z aten_mm_info [('aten._scaled_mm.default_16_32_32', 1)] 2025-12-04T12:10:21.6360626Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6360667Z Autotune Choices Stats: 2025-12-04T12:10:21.6361021Z {"num_choices": 3, "num_triton_choices": 2, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006680000107735395, "best_triton_pos": 0} 2025-12-04T12:10:21.6361073Z AUTOTUNE scaled_mm(16x32, 32x32, 16x1, 1x32, 32) 2025-12-04T12:10:21.6361121Z strides: [32, 1], [1, 32], [1, 1], [1, 1], [1] 2025-12-04T12:10:21.6361240Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32, torch.bfloat16 2025-12-04T12:10:21.6361471Z triton_mm_5 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6361696Z triton_mm_4 0.0073 ms 91.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6361738Z _scaled_mm 0.0250 ms 26.7% 2025-12-04T12:10:21.6361875Z SingleProcess AUTOTUNE benchmarking takes 0.0187 seconds and 0.0860 seconds precompiling for 3 choices 2025-12-04T12:10:21.6362063Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-37080ab364b1eb8b.xml - 2025-12-04T12:10:21.6362122Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6362724Z FAILED [0.5314s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1082130432. 2025-12-04T12:10:21.6362739Z 2025-12-04T12:10:21.6362811Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6363072Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6363074Z 2025-12-04T12:10:21.6363160Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6363223Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6363293Z ================== 1 failed, 187 deselected, 2 rerun in 2.84s ================== 2025-12-04T12:10:21.6363331Z Got exit code 1 2025-12-04T12:10:21.6363554Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda 2025-12-04T12:10:21.6363680Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6363825Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7184cdb80a02c1ee.xml 2025-12-04T12:10:21.6363882Z ============================= test session starts ============================== 2025-12-04T12:10:21.6363993Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6364033Z cachedir: .pytest_cache 2025-12-04T12:10:21.6364202Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6364251Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6364292Z configfile: pytest.ini 2025-12-04T12:10:21.6364456Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6364533Z collecting ... collected 188 items / 119 deselected / 69 selected 2025-12-04T12:10:21.6364588Z stepcurrent: skipping 119 already run items. 
2025-12-04T12:10:21.6364634Z Running 69 items in this shard 2025-12-04T12:10:21.6364635Z 2025-12-04T12:10:21.6364871Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_tma_template_shape_1024,1024,512_use_fast_accum_False_cuda SKIPPED [0.0002s] (Need device-side TMA support in Triton) [ 1%] 2025-12-04T12:10:21.6365099Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_tma_template_shape_1024,1024,512_use_fast_accum_True_cuda SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 2%] 2025-12-04T12:10:21.6365317Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_tma_template_shape_16,32,32_use_fast_accum_False_cuda SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 4%] 2025-12-04T12:10:21.6365530Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_tma_template_shape_16,32,32_use_fast_accum_True_cuda SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 5%] 2025-12-04T12:10:21.6365657Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_scaled_mm_preserves_strides_cuda PASSED [2.1579s] [ 7%] 2025-12-04T12:10:21.6365859Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda PASSED [1.0765s] [ 8%] 2025-12-04T12:10:21.6366079Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.2632s] [ 10%] 2025-12-04T12:10:21.6366307Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.3326s] [ 10%] 2025-12-04T12:10:21.6366501Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.2016s] [ 10%] 2025-12-04T12:10:21.6366504Z 2025-12-04T12:10:21.6366554Z ==================================== RERUNS 
==================================== 2025-12-04T12:10:21.6366705Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6366752Z Traceback (most recent call last): 2025-12-04T12:10:21.6366909Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6366953Z method(*args, **kwargs) 2025-12-04T12:10:21.6367104Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6367148Z method(*args, **kwargs) 2025-12-04T12:10:21.6367300Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6367353Z with policy(): 2025-12-04T12:10:21.6367505Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6367549Z raise RuntimeError(msg) 2025-12-04T12:10:21.6367945Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1092616192 and is now 1285554176. 
2025-12-04T12:10:21.6367947Z 2025-12-04T12:10:21.6368022Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6368300Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6368304Z 2025-12-04T12:10:21.6368390Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6368463Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6368506Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6368564Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6368663Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6369119Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6369157Z graph_break [] 2025-12-04T12:10:21.6369224Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6369296Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6369341Z Autotune Choices Stats: 2025-12-04T12:10:21.6369833Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.013120000250637531, "best_triton_pos": 1, "best_triton_time": 0.013120000250637531, "best_triton_kernel": "triton_mm_54", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:21.6369885Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6369941Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6370041Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6370086Z _scaled_mm 0.0131 ms 100.0% 2025-12-04T12:10:21.6370365Z triton_mm_54 0.0131 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6370595Z triton_mm_32 0.0141 ms 92.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6370821Z triton_mm_33 0.0143 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6371049Z triton_mm_34 0.0150 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6371291Z triton_mm_53 0.0151 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6371516Z triton_mm_51 0.0169 ms 77.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6371739Z triton_mm_50 0.0171 ms 76.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6371977Z triton_mm_52 0.0171 ms 76.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6372209Z triton_mm_31 0.0175 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6372337Z SingleProcess AUTOTUNE benchmarking takes 0.2902 seconds and 0.4305 seconds precompiling for 39 choices 2025-12-04T12:10:21.6372487Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6372533Z Traceback (most recent call last): 2025-12-04T12:10:21.6372689Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6372731Z method(*args, **kwargs) 2025-12-04T12:10:21.6372884Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6372925Z method(*args, **kwargs) 2025-12-04T12:10:21.6373074Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6373112Z with policy(): 2025-12-04T12:10:21.6373263Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6373307Z raise RuntimeError(msg) 2025-12-04T12:10:21.6373712Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1285554176 and is now 1291845632. 2025-12-04T12:10:21.6373727Z 2025-12-04T12:10:21.6373801Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6374065Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6374069Z 2025-12-04T12:10:21.6374156Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6374229Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6374275Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6374331Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6374432Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6374887Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6374937Z graph_break [] 2025-12-04T12:10:21.6375002Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6375075Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6375115Z Autotune Choices Stats: 2025-12-04T12:10:21.6375582Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.013120000250637531, 
"best_triton_pos": 1, "best_triton_time": 0.013120000250637531, "best_triton_kernel": "triton_mm_54", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:21.6375645Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6375691Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6375790Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6375835Z _scaled_mm 0.0131 ms 100.0% 2025-12-04T12:10:21.6376072Z triton_mm_54 0.0131 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6376300Z triton_mm_32 0.0141 ms 92.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6376526Z triton_mm_33 0.0143 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6376755Z triton_mm_34 0.0150 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6376980Z triton_mm_53 0.0151 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6377216Z triton_mm_51 0.0169 ms 77.7% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6377439Z triton_mm_50 0.0171 ms 76.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6377674Z triton_mm_52 0.0171 ms 76.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6377903Z triton_mm_31 0.0175 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6378032Z SingleProcess AUTOTUNE benchmarking takes 0.2902 seconds and 0.4305 seconds precompiling for 39 choices 2025-12-04T12:10:21.6378105Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6378147Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6378203Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6378302Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6378792Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6378840Z graph_break [] 2025-12-04T12:10:21.6378905Z 
aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6378980Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6379022Z Autotune Choices Stats: 2025-12-04T12:10:21.6379406Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_92", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.012759000062942505, "best_triton_pos": 0} 2025-12-04T12:10:21.6379457Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6379502Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6379599Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6379835Z triton_mm_92 0.0128 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6380060Z triton_mm_70 0.0141 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6380323Z triton_mm_71 0.0146 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6380549Z triton_mm_72 0.0148 ms 86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6380787Z triton_mm_91 0.0158 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, 
BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6381018Z triton_mm_89 0.0167 ms 76.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6381241Z triton_mm_88 0.0170 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6381490Z triton_mm_90 0.0172 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6381718Z triton_mm_69 0.0176 ms 72.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6381941Z triton_mm_73 0.0183 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6382069Z SingleProcess AUTOTUNE benchmarking takes 0.2704 seconds and 0.3653 seconds precompiling for 39 choices 2025-12-04T12:10:21.6382121Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6382284Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6382332Z Traceback (most recent call last): 2025-12-04T12:10:21.6382487Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6382530Z method(*args, **kwargs) 2025-12-04T12:10:21.6382681Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6382721Z method(*args, **kwargs) 2025-12-04T12:10:21.6382871Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6382908Z with policy(): 2025-12-04T12:10:21.6383074Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6383117Z raise RuntimeError(msg) 2025-12-04T12:10:21.6383515Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1291845632 and is now 1455423488. 
2025-12-04T12:10:21.6383517Z 2025-12-04T12:10:21.6383591Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6383855Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6383857Z 2025-12-04T12:10:21.6383944Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6384018Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6384061Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6384117Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6384216Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6384677Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6384715Z graph_break [] 2025-12-04T12:10:21.6384778Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6384850Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6384901Z Autotune Choices Stats: 2025-12-04T12:10:21.6385369Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.013120000250637531, "best_triton_pos": 1, "best_triton_time": 0.013120000250637531, "best_triton_kernel": "triton_mm_54", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:21.6385419Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6385466Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6385566Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6385609Z _scaled_mm 0.0131 ms 100.0% 2025-12-04T12:10:21.6385846Z triton_mm_54 0.0131 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6386072Z triton_mm_32 0.0141 ms 92.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6386309Z triton_mm_33 0.0143 ms 91.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6386535Z triton_mm_34 0.0150 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6386769Z triton_mm_53 0.0151 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6386994Z triton_mm_51 0.0169 ms 77.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6387218Z triton_mm_50 0.0171 ms 76.8% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6387445Z triton_mm_52 0.0171 ms 76.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6387674Z triton_mm_31 0.0175 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6387804Z SingleProcess AUTOTUNE benchmarking takes 0.2902 seconds and 0.4305 seconds precompiling for 39 choices 2025-12-04T12:10:21.6387877Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6387923Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6387979Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6388077Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6388569Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6388615Z graph_break [] 2025-12-04T12:10:21.6388680Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6388752Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6388792Z Autotune Choices Stats: 2025-12-04T12:10:21.6389156Z {"num_choices": 39, 
"num_triton_choices": 38, "best_kernel": "triton_mm_92", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.012759000062942505, "best_triton_pos": 0} 2025-12-04T12:10:21.6389205Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6389249Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6389347Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6389583Z triton_mm_92 0.0128 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6389821Z triton_mm_70 0.0141 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6390046Z triton_mm_71 0.0146 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6390312Z triton_mm_72 0.0148 ms 86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6390551Z triton_mm_91 0.0158 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6390777Z triton_mm_89 0.0167 ms 76.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, 
BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6391007Z triton_mm_88 0.0170 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6391234Z triton_mm_90 0.0172 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6391465Z triton_mm_69 0.0176 ms 72.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6391695Z triton_mm_73 0.0183 ms 69.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6391821Z SingleProcess AUTOTUNE benchmarking takes 0.2704 seconds and 0.3653 seconds precompiling for 39 choices 2025-12-04T12:10:21.6391911Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6391959Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6392015Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6392112Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6392575Z inductor [('triton_bundler_save_kernel', 304), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('async_compile_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), 
('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6392615Z graph_break [] 2025-12-04T12:10:21.6392682Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6392757Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6392800Z Autotune Choices Stats: 2025-12-04T12:10:21.6393269Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "_scaled_mm", "best_time": 0.013238999992609024, "best_triton_pos": 1, "best_triton_time": 0.013319999910891056, "best_triton_kernel": "triton_mm_130", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4"} 2025-12-04T12:10:21.6393318Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6393375Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6393472Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6393518Z _scaled_mm 0.0132 ms 100.0% 2025-12-04T12:10:21.6393749Z triton_mm_130 0.0133 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6393981Z triton_mm_108 0.0140 ms 94.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6394221Z triton_mm_109 0.0147 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6394455Z triton_mm_110 
0.0149 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6394685Z triton_mm_129 0.0154 ms 86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6394909Z triton_mm_127 0.0168 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6395136Z triton_mm_126 0.0172 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6395369Z triton_mm_128 0.0172 ms 76.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6395615Z triton_mm_107 0.0180 ms 73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6395742Z SingleProcess AUTOTUNE benchmarking takes 0.2708 seconds and 0.3797 seconds precompiling for 39 choices 2025-12-04T12:10:21.6395929Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7184cdb80a02c1ee.xml - 2025-12-04T12:10:21.6396000Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6396601Z FAILED [1.2016s] 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1291845632 and is now 1455423488. 2025-12-04T12:10:21.6396608Z 2025-12-04T12:10:21.6396684Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6396949Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6396952Z 2025-12-04T12:10:21.6397041Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6397106Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6397198Z ======= 1 failed, 2 passed, 4 skipped, 119 deselected, 2 rerun in 7.06s ======== 2025-12-04T12:10:21.6397236Z Got exit code 1 2025-12-04T12:10:21.6397276Z Retrying single test... 
2025-12-04T12:10:21.6397420Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-79b6c1ab004c9229.xml 2025-12-04T12:10:21.6397479Z ============================= test session starts ============================== 2025-12-04T12:10:21.6397593Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6397633Z cachedir: .pytest_cache 2025-12-04T12:10:21.6397793Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6397851Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6397893Z configfile: pytest.ini 2025-12-04T12:10:21.6398056Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6398132Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6398394Z stepcurrent: skipping 125 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6398439Z Running 1 items in this shard 2025-12-04T12:10:21.6398441Z 2025-12-04T12:10:21.6398664Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [3.6004s] [100%] 2025-12-04T12:10:21.6398882Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5409s] [100%] 2025-12-04T12:10:21.6399077Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.4555s] [100%] 2025-12-04T12:10:21.6399080Z 2025-12-04T12:10:21.6399130Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6399278Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6399326Z Traceback (most recent call last): 2025-12-04T12:10:21.6399493Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6399537Z method(*args, **kwargs) 2025-12-04T12:10:21.6399689Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6399740Z method(*args, **kwargs) 2025-12-04T12:10:21.6399889Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6399929Z with policy(): 2025-12-04T12:10:21.6400080Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6400163Z raise RuntimeError(msg) 2025-12-04T12:10:21.6400557Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1115684864. 2025-12-04T12:10:21.6400559Z 2025-12-04T12:10:21.6400633Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6400898Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6400901Z 2025-12-04T12:10:21.6401002Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6401074Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6401120Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6401175Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6401663Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6401772Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6401810Z graph_break [] 2025-12-04T12:10:21.6401875Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6401947Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6402429Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6402476Z current_size = base.storage().size() 2025-12-04T12:10:21.6402519Z Autotune Choices Stats: 2025-12-04T12:10:21.6402891Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.013120000250637531, "best_triton_pos": 0} 2025-12-04T12:10:21.6402942Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6402989Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6403089Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6403341Z triton_mm_35 0.0131 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6403573Z triton_mm_13 0.0141 ms 92.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6403801Z triton_mm_14 0.0145 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6404041Z triton_mm_15 0.0151 ms 87.0% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6404269Z triton_mm_34 0.0152 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6404494Z triton_mm_33 0.0169 ms 77.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6404720Z triton_mm_31 0.0170 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6404956Z triton_mm_32 0.0171 ms 76.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6405183Z triton_mm_12 0.0175 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6405410Z triton_mm_16 0.0183 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6405549Z SingleProcess AUTOTUNE benchmarking takes 0.2283 seconds and 0.3937 seconds precompiling for 39 choices 2025-12-04T12:10:21.6405699Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 
2025-12-04T12:10:21.6405749Z Traceback (most recent call last): 2025-12-04T12:10:21.6405903Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6405947Z method(*args, **kwargs) 2025-12-04T12:10:21.6406098Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6406139Z method(*args, **kwargs) 2025-12-04T12:10:21.6406288Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6406326Z with policy(): 2025-12-04T12:10:21.6406478Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6406523Z raise RuntimeError(msg) 2025-12-04T12:10:21.6406916Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1115684864 and is now 1212153856. 
2025-12-04T12:10:21.6406919Z 2025-12-04T12:10:21.6406992Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6407265Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6407268Z 2025-12-04T12:10:21.6407355Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6407429Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6407483Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6407540Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6408024Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6408123Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6408159Z graph_break [] 2025-12-04T12:10:21.6408225Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6408296Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6408780Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6408837Z current_size = base.storage().size() 2025-12-04T12:10:21.6408878Z Autotune Choices Stats: 2025-12-04T12:10:21.6409246Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.013120000250637531, "best_triton_pos": 0} 2025-12-04T12:10:21.6409294Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6409341Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6409456Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6409692Z triton_mm_35 0.0131 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6409920Z triton_mm_13 0.0141 ms 92.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6410185Z triton_mm_14 0.0145 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6410412Z triton_mm_15 0.0151 ms 87.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6410638Z triton_mm_34 0.0152 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6410877Z triton_mm_33 0.0169 ms 77.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6411101Z triton_mm_31 0.0170 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6411326Z triton_mm_32 0.0171 ms 76.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6411566Z triton_mm_12 0.0175 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6411791Z triton_mm_16 0.0183 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6411919Z SingleProcess AUTOTUNE benchmarking takes 0.2283 seconds and 0.3937 seconds precompiling for 39 choices 2025-12-04T12:10:21.6411991Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6412035Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6412092Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6412191Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6412688Z inductor [('triton_bundler_save_kernel', 
312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6412725Z graph_break [] 2025-12-04T12:10:21.6412789Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6412864Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6412904Z Autotune Choices Stats: 2025-12-04T12:10:21.6413281Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_73", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.012880000285804272, "best_triton_pos": 0} 2025-12-04T12:10:21.6413333Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6413374Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6413473Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6413707Z triton_mm_73 0.0129 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6413934Z triton_mm_51 0.0142 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6414159Z triton_mm_52 0.0144 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6414387Z triton_mm_53 0.0149 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6414620Z triton_mm_72 0.0156 ms 82.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6414845Z triton_mm_70 0.0168 ms 76.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6415079Z triton_mm_71 0.0168 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6415302Z triton_mm_69 0.0170 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6415529Z triton_mm_50 0.0179 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6415751Z triton_mm_54 0.0181 ms 71.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6415880Z SingleProcess AUTOTUNE benchmarking takes 0.3062 seconds and 0.5203 seconds precompiling for 39 
choices 2025-12-04T12:10:21.6415934Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6416091Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6416139Z Traceback (most recent call last): 2025-12-04T12:10:21.6416293Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6416338Z method(*args, **kwargs) 2025-12-04T12:10:21.6416492Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6416536Z method(*args, **kwargs) 2025-12-04T12:10:21.6416685Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6416733Z with policy(): 2025-12-04T12:10:21.6416884Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6416930Z raise RuntimeError(msg) 2025-12-04T12:10:21.6417324Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1212153856 and is now 1308622848. 
2025-12-04T12:10:21.6417327Z 2025-12-04T12:10:21.6417400Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6417663Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6417667Z 2025-12-04T12:10:21.6417753Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6417827Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6417870Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6417927Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6418415Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6418514Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6418552Z graph_break [] 2025-12-04T12:10:21.6418616Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6418700Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6419184Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6419231Z current_size = base.storage().size() 2025-12-04T12:10:21.6419273Z Autotune Choices Stats: 2025-12-04T12:10:21.6419641Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.013120000250637531, "best_triton_pos": 0} 2025-12-04T12:10:21.6419690Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6419736Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6419846Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6420082Z triton_mm_35 0.0131 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6420343Z triton_mm_13 0.0141 ms 92.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6420581Z triton_mm_14 0.0145 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6420807Z triton_mm_15 0.0151 ms 87.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6421038Z triton_mm_34 0.0152 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6421266Z triton_mm_33 0.0169 ms 77.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6421491Z triton_mm_31 0.0170 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6421717Z triton_mm_32 0.0171 ms 76.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6421945Z triton_mm_12 0.0175 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6422180Z triton_mm_16 0.0183 ms 71.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6422310Z SingleProcess AUTOTUNE benchmarking takes 0.2283 seconds and 0.3937 seconds precompiling for 39 choices 2025-12-04T12:10:21.6422383Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6422441Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6422496Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6422595Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6423081Z inductor [('triton_bundler_save_kernel', 
312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6423118Z graph_break [] 2025-12-04T12:10:21.6423182Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6423256Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6423296Z Autotune Choices Stats: 2025-12-04T12:10:21.6423663Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_73", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.012880000285804272, "best_triton_pos": 0} 2025-12-04T12:10:21.6423733Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6423775Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6423875Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6424107Z triton_mm_73 0.0129 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6424344Z triton_mm_51 0.0142 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6424571Z triton_mm_52 0.0144 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6424796Z triton_mm_53 0.0149 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6425020Z triton_mm_72 0.0156 ms 82.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6425247Z triton_mm_70 0.0168 ms 76.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6425475Z triton_mm_71 0.0168 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6425707Z triton_mm_69 0.0170 ms 75.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6425939Z triton_mm_50 0.0179 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6426161Z triton_mm_54 0.0181 ms 71.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6426299Z SingleProcess AUTOTUNE benchmarking takes 0.3062 seconds and 0.5203 seconds precompiling for 39 
choices 2025-12-04T12:10:21.6426372Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6426415Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6426471Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6426571Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6427052Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6427089Z graph_break [] 2025-12-04T12:10:21.6427154Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6427237Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6427279Z Autotune Choices Stats: 2025-12-04T12:10:21.6427649Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_111", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.013120000250637531, "best_triton_pos": 0} 2025-12-04T12:10:21.6427699Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6427739Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6427838Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6428085Z triton_mm_111 0.0131 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:21.6428130Z _scaled_mm 0.0132 ms 99.1% 2025-12-04T12:10:21.6428356Z triton_mm_89 0.0141 ms 92.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6428580Z triton_mm_90 0.0148 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6428806Z triton_mm_91 0.0149 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6429032Z triton_mm_110 0.0153 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6429259Z triton_mm_108 0.0168 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6429495Z triton_mm_109 0.0169 ms 77.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6429723Z triton_mm_107 0.0170 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6429962Z triton_mm_88 0.0175 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, 
BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6430119Z SingleProcess AUTOTUNE benchmarking takes 0.3251 seconds and 0.3594 seconds precompiling for 39 choices 2025-12-04T12:10:21.6430309Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-79b6c1ab004c9229.xml - 2025-12-04T12:10:21.6430368Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6430966Z FAILED [1.4555s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1212153856 and is now 1308622848. 2025-12-04T12:10:21.6430985Z 2025-12-04T12:10:21.6431059Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6431321Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6431324Z 2025-12-04T12:10:21.6431412Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6431473Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6431541Z ================== 1 failed, 187 deselected, 2 rerun in 6.62s ================== 2025-12-04T12:10:21.6431579Z Got exit code 1 2025-12-04T12:10:21.6431633Z Retrying single test... 
2025-12-04T12:10:21.6431777Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0c5ea548bf389d37.xml 2025-12-04T12:10:21.6431838Z ============================= test session starts ============================== 2025-12-04T12:10:21.6431950Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6431993Z cachedir: .pytest_cache 2025-12-04T12:10:21.6432151Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6432200Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6432241Z configfile: pytest.ini 2025-12-04T12:10:21.6432405Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6432481Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6432742Z stepcurrent: skipping 125 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6432787Z Running 1 items in this shard 2025-12-04T12:10:21.6432789Z 2025-12-04T12:10:21.6433008Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [3.5482s] [100%] 2025-12-04T12:10:21.6433236Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.4883s] [100%] 2025-12-04T12:10:21.6433430Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.3271s] [100%] 2025-12-04T12:10:21.6433433Z 2025-12-04T12:10:21.6433485Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6433644Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6433693Z Traceback (most recent call last): 2025-12-04T12:10:21.6433850Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6433896Z method(*args, **kwargs) 2025-12-04T12:10:21.6434048Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6434092Z method(*args, **kwargs) 2025-12-04T12:10:21.6434243Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6434280Z with policy(): 2025-12-04T12:10:21.6434432Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 
2025-12-04T12:10:21.6434476Z raise RuntimeError(msg) 2025-12-04T12:10:21.6434875Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1115684864. 2025-12-04T12:10:21.6434888Z 2025-12-04T12:10:21.6434961Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6435226Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6435228Z 2025-12-04T12:10:21.6435314Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6435387Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6435439Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6435497Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6435979Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 5), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6436077Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6436115Z graph_break [] 2025-12-04T12:10:21.6436180Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6436252Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6436736Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6436784Z current_size = base.storage().size() 2025-12-04T12:10:21.6436824Z Autotune Choices Stats: 2025-12-04T12:10:21.6437203Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.012799999676644802, "best_triton_pos": 0} 2025-12-04T12:10:21.6437254Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6437297Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6437416Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6437650Z triton_mm_35 0.0128 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6437878Z triton_mm_13 0.0141 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6438103Z triton_mm_14 0.0142 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6438328Z triton_mm_15 0.0148 ms 86.5% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6438552Z triton_mm_34 0.0154 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6438791Z triton_mm_32 0.0170 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6439016Z triton_mm_31 0.0172 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6439250Z triton_mm_33 0.0174 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6439481Z triton_mm_12 0.0177 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6439705Z triton_mm_16 0.0181 ms 70.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6439834Z SingleProcess AUTOTUNE benchmarking takes 0.2296 seconds and 0.2160 seconds precompiling for 39 choices 2025-12-04T12:10:21.6439983Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 
2025-12-04T12:10:21.6440032Z Traceback (most recent call last): 2025-12-04T12:10:21.6440223Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6440266Z method(*args, **kwargs) 2025-12-04T12:10:21.6440416Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6440457Z method(*args, **kwargs) 2025-12-04T12:10:21.6440606Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6440644Z with policy(): 2025-12-04T12:10:21.6440813Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6440856Z raise RuntimeError(msg) 2025-12-04T12:10:21.6441254Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1115684864 and is now 1212153856. 
2025-12-04T12:10:21.6441272Z 2025-12-04T12:10:21.6441346Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6441611Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6441613Z 2025-12-04T12:10:21.6441700Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6441772Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6441819Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6441876Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6442359Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 5), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6442469Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6442506Z graph_break [] 2025-12-04T12:10:21.6442569Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6442642Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6443122Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6443186Z current_size = base.storage().size() 2025-12-04T12:10:21.6443226Z Autotune Choices Stats: 2025-12-04T12:10:21.6443605Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.012799999676644802, "best_triton_pos": 0} 2025-12-04T12:10:21.6443656Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6443702Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6443802Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6444036Z triton_mm_35 0.0128 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6444265Z triton_mm_13 0.0141 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6444490Z triton_mm_14 0.0142 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6444725Z triton_mm_15 0.0148 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6444951Z triton_mm_34 0.0154 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6445184Z triton_mm_32 0.0170 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6445407Z triton_mm_31 0.0172 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6445634Z triton_mm_33 0.0174 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6445863Z triton_mm_12 0.0177 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6446086Z triton_mm_16 0.0181 ms 70.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6446225Z SingleProcess AUTOTUNE benchmarking takes 0.2296 seconds and 0.2160 seconds precompiling for 39 choices 2025-12-04T12:10:21.6446298Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6446341Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6446399Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6446497Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6446990Z inductor [('triton_bundler_save_kernel', 
312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6447029Z graph_break [] 2025-12-04T12:10:21.6447094Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6447166Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6447206Z Autotune Choices Stats: 2025-12-04T12:10:21.6447567Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_73", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.012600000016391277, "best_triton_pos": 0} 2025-12-04T12:10:21.6447616Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6447660Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6447759Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6447993Z triton_mm_73 0.0126 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6448218Z triton_mm_51 0.0138 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6448289Z _scaled_mm 0.0145 ms 87.0% 2025-12-04T12:10:21.6448516Z triton_mm_52 0.0145 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, 
BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6448742Z triton_mm_53 0.0151 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6448976Z triton_mm_72 0.0153 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6449203Z triton_mm_70 0.0171 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6449427Z triton_mm_69 0.0172 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6449652Z triton_mm_71 0.0172 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6449895Z triton_mm_50 0.0178 ms 70.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6450022Z SingleProcess AUTOTUNE benchmarking takes 0.2936 seconds and 0.4680 seconds precompiling for 39 choices 2025-12-04T12:10:21.6450078Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6450269Z _ 
TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6450318Z Traceback (most recent call last): 2025-12-04T12:10:21.6450489Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6450533Z method(*args, **kwargs) 2025-12-04T12:10:21.6450685Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6450728Z method(*args, **kwargs) 2025-12-04T12:10:21.6450877Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6450914Z with policy(): 2025-12-04T12:10:21.6451066Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6451108Z raise RuntimeError(msg) 2025-12-04T12:10:21.6451506Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1212153856 and is now 1308622848. 
2025-12-04T12:10:21.6451509Z 2025-12-04T12:10:21.6451581Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6451846Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6451848Z 2025-12-04T12:10:21.6451933Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6452030Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6452077Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6452133Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6452622Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 5), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6452734Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6452771Z graph_break [] 2025-12-04T12:10:21.6452835Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6452909Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6453392Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6453443Z current_size = base.storage().size() 2025-12-04T12:10:21.6453483Z Autotune Choices Stats: 2025-12-04T12:10:21.6453850Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_35", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.012799999676644802, "best_triton_pos": 0} 2025-12-04T12:10:21.6453912Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6453955Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6454054Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6454287Z triton_mm_35 0.0128 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6454529Z triton_mm_13 0.0141 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6454756Z triton_mm_14 0.0142 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6454982Z triton_mm_15 0.0148 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6455209Z triton_mm_34 0.0154 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6455434Z triton_mm_32 0.0170 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6455661Z triton_mm_31 0.0172 ms 74.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6455897Z triton_mm_33 0.0174 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6456126Z triton_mm_12 0.0177 ms 72.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6456361Z triton_mm_16 0.0181 ms 70.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6456491Z SingleProcess AUTOTUNE benchmarking takes 0.2296 seconds and 0.2160 seconds precompiling for 39 choices 2025-12-04T12:10:21.6456564Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6456609Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6456665Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6456764Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6457247Z inductor [('triton_bundler_save_kernel', 
312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6457294Z graph_break [] 2025-12-04T12:10:21.6457358Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6457430Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6457471Z Autotune Choices Stats: 2025-12-04T12:10:21.6457835Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_73", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.012600000016391277, "best_triton_pos": 0} 2025-12-04T12:10:21.6457885Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6457926Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6458033Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6458268Z triton_mm_73 0.0126 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6458494Z triton_mm_51 0.0138 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6458539Z _scaled_mm 0.0145 ms 87.0% 2025-12-04T12:10:21.6458767Z triton_mm_52 0.0145 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, 
BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6458996Z triton_mm_53 0.0151 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6459222Z triton_mm_72 0.0153 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6459457Z triton_mm_70 0.0171 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6459683Z triton_mm_69 0.0172 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6459907Z triton_mm_71 0.0172 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6460192Z triton_mm_50 0.0178 ms 70.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6460319Z SingleProcess AUTOTUNE benchmarking takes 0.2936 seconds and 0.4680 seconds precompiling for 39 choices 2025-12-04T12:10:21.6460392Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6460435Z frames [('total', 1), ('ok', 1)] 
2025-12-04T12:10:21.6460495Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6460592Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6461078Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6461131Z graph_break [] 2025-12-04T12:10:21.6461194Z aten_mm_info [('aten._scaled_mm.default_1024_2048_1024', 1)] 2025-12-04T12:10:21.6461266Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6461307Z Autotune Choices Stats: 2025-12-04T12:10:21.6461675Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_111", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0130390003323555, "best_triton_pos": 0} 2025-12-04T12:10:21.6461736Z AUTOTUNE scaled_mm(1024x1024, 1024x2048, , ) 2025-12-04T12:10:21.6461779Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6461876Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6462111Z triton_mm_111 0.0130 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6462154Z _scaled_mm 0.0132 ms 99.1% 2025-12-04T12:10:21.6462381Z triton_mm_89 0.0138 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6462607Z triton_mm_91 0.0146 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6462832Z triton_mm_90 0.0147 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6463062Z triton_mm_110 0.0151 ms 86.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6463304Z triton_mm_108 0.0166 ms 78.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6463532Z triton_mm_109 0.0170 ms 76.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6463774Z triton_mm_107 0.0172 ms 75.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6464003Z triton_mm_88 0.0175 ms 74.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6464131Z SingleProcess 
AUTOTUNE benchmarking takes 0.2879 seconds and 0.3660 seconds precompiling for 39 choices 2025-12-04T12:10:21.6464319Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0c5ea548bf389d37.xml - 2025-12-04T12:10:21.6464381Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6464977Z FAILED [1.3271s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1212153856 and is now 1308622848. 2025-12-04T12:10:21.6464991Z 2025-12-04T12:10:21.6465063Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6465329Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6465331Z 2025-12-04T12:10:21.6465417Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6465497Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6465564Z ================== 1 failed, 187 deselected, 2 rerun in 6.38s ================== 2025-12-04T12:10:21.6465606Z Got exit code 1 2025-12-04T12:10:21.6465816Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6465941Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6466086Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b1ae99a0adb9151f.xml 2025-12-04T12:10:21.6466147Z ============================= test session starts ============================== 2025-12-04T12:10:21.6466262Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6466303Z cachedir: .pytest_cache 2025-12-04T12:10:21.6466461Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6466511Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6466552Z configfile: pytest.ini 2025-12-04T12:10:21.6466714Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6466792Z collecting ... collected 188 items / 126 deselected / 62 selected 2025-12-04T12:10:21.6466846Z stepcurrent: skipping 126 already run items. 
2025-12-04T12:10:21.6466889Z Running 62 items in this shard 2025-12-04T12:10:21.6466902Z 2025-12-04T12:10:21.6467119Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9385s] [ 1%] 2025-12-04T12:10:21.6467334Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3872s] [ 1%] 2025-12-04T12:10:21.6467532Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3241s] [ 1%] 2025-12-04T12:10:21.6467536Z 2025-12-04T12:10:21.6467587Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6467731Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6467780Z Traceback (most recent call last): 2025-12-04T12:10:21.6467938Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6467983Z method(*args, **kwargs) 2025-12-04T12:10:21.6468134Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6468179Z method(*args, **kwargs) 2025-12-04T12:10:21.6468331Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6468379Z with policy(): 2025-12-04T12:10:21.6468530Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6468574Z raise RuntimeError(msg) 2025-12-04T12:10:21.6468969Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.6468972Z 2025-12-04T12:10:21.6469046Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6469313Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6469317Z 2025-12-04T12:10:21.6469403Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6469478Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6469521Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6469578Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6469644Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6469743Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6469780Z graph_break [] 2025-12-04T12:10:21.6469843Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6469987Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6470034Z Traceback (most recent call last): 2025-12-04T12:10:21.6470230Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6470273Z method(*args, **kwargs) 2025-12-04T12:10:21.6470423Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6470465Z method(*args, **kwargs) 
2025-12-04T12:10:21.6470615Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6470651Z with policy(): 2025-12-04T12:10:21.6470816Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6470858Z raise RuntimeError(msg) 2025-12-04T12:10:21.6471245Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.6471260Z 2025-12-04T12:10:21.6471332Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6471590Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6471592Z 2025-12-04T12:10:21.6471680Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6471755Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6471798Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6471853Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6471918Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6472019Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6472055Z graph_break [] 2025-12-04T12:10:21.6472133Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6472207Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6472254Z frames [('total', 1), 
('ok', 1)] 2025-12-04T12:10:21.6472309Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6472404Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6472469Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6472506Z graph_break [] 2025-12-04T12:10:21.6472566Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6472617Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6472774Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6472822Z Traceback (most recent call last): 2025-12-04T12:10:21.6472976Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6473019Z method(*args, **kwargs) 2025-12-04T12:10:21.6473169Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6473208Z method(*args, **kwargs) 2025-12-04T12:10:21.6473358Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6473397Z with policy(): 2025-12-04T12:10:21.6473547Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6473589Z raise RuntimeError(msg) 2025-12-04T12:10:21.6473975Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.6473979Z 2025-12-04T12:10:21.6474052Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6474321Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6474323Z 2025-12-04T12:10:21.6474410Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6474482Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6474524Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6474579Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6474654Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6474750Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6474790Z graph_break [] 2025-12-04T12:10:21.6474850Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6474922Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6474965Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6475019Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6475115Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6475179Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6475215Z graph_break [] 2025-12-04T12:10:21.6475274Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6475347Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6475392Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6475447Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6475553Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6475615Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6475652Z graph_break [] 2025-12-04T12:10:21.6475709Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6475900Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b1ae99a0adb9151f.xml - 2025-12-04T12:10:21.6475959Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6476547Z FAILED [0.3241s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.6476551Z 2025-12-04T12:10:21.6476623Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6476880Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6476883Z 2025-12-04T12:10:21.6476969Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6477029Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6477098Z ================== 1 failed, 126 deselected, 2 rerun in 2.67s ================== 2025-12-04T12:10:21.6477139Z Got exit code 1 2025-12-04T12:10:21.6477181Z Retrying single test... 
2025-12-04T12:10:21.6477325Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-27738c0dec9cb9b2.xml 2025-12-04T12:10:21.6477383Z ============================= test session starts ============================== 2025-12-04T12:10:21.6477494Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6477536Z cachedir: .pytest_cache 2025-12-04T12:10:21.6477702Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6477751Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6477791Z configfile: pytest.ini 2025-12-04T12:10:21.6477954Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6478028Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6478293Z stepcurrent: skipping 126 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6478340Z Running 1 items in this shard 2025-12-04T12:10:21.6478342Z 2025-12-04T12:10:21.6478556Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9216s] [100%] 2025-12-04T12:10:21.6478767Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3536s] [100%] 2025-12-04T12:10:21.6478953Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3202s] [100%] 2025-12-04T12:10:21.6478956Z 2025-12-04T12:10:21.6479008Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6479151Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6479216Z Traceback (most recent call last): 2025-12-04T12:10:21.6479371Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6479414Z method(*args, **kwargs) 2025-12-04T12:10:21.6479565Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6479609Z method(*args, **kwargs) 2025-12-04T12:10:21.6479759Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6479796Z with policy(): 2025-12-04T12:10:21.6479948Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6480000Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6480422Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.6480425Z 2025-12-04T12:10:21.6480497Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6480760Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6480762Z 2025-12-04T12:10:21.6480847Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6480921Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6480967Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6481023Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6481089Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6481187Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6481225Z graph_break [] 2025-12-04T12:10:21.6481286Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6481430Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6481492Z Traceback (most recent call last): 2025-12-04T12:10:21.6481647Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6481688Z method(*args, **kwargs) 2025-12-04T12:10:21.6481839Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:21.6481894Z method(*args, **kwargs) 2025-12-04T12:10:21.6482044Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6482087Z with policy(): 2025-12-04T12:10:21.6482239Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6482281Z raise RuntimeError(msg) 2025-12-04T12:10:21.6482669Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.6482672Z 2025-12-04T12:10:21.6482743Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6483002Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6483016Z 2025-12-04T12:10:21.6483101Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6483175Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6483219Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6483274Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6483340Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6483517Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6483554Z graph_break [] 2025-12-04T12:10:21.6483613Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6483685Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.6483744Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6483802Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6483899Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6483962Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6483998Z graph_break [] 2025-12-04T12:10:21.6484057Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6484109Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6484254Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6484302Z Traceback (most recent call last): 2025-12-04T12:10:21.6484456Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6484497Z method(*args, **kwargs) 2025-12-04T12:10:21.6484649Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6484691Z method(*args, **kwargs) 2025-12-04T12:10:21.6484842Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6484878Z with policy(): 2025-12-04T12:10:21.6485033Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6485075Z raise RuntimeError(msg) 2025-12-04T12:10:21.6485473Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.6485487Z 2025-12-04T12:10:21.6485561Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6485815Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6485818Z 2025-12-04T12:10:21.6485904Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6485975Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6486020Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6486075Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6486140Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6486236Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6486273Z graph_break [] 2025-12-04T12:10:21.6486334Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6486410Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6486454Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6486521Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6486616Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6486679Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6486715Z graph_break [] 2025-12-04T12:10:21.6486778Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6486853Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6486898Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6486953Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6487048Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6487122Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6487161Z graph_break [] 2025-12-04T12:10:21.6487219Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6487411Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-27738c0dec9cb9b2.xml - 2025-12-04T12:10:21.6487471Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6488052Z FAILED [0.3202s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.6488055Z 2025-12-04T12:10:21.6488129Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6488386Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6488389Z 2025-12-04T12:10:21.6488476Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6488536Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6488612Z ================== 1 failed, 187 deselected, 2 rerun in 2.62s ================== 2025-12-04T12:10:21.6488651Z Got exit code 1 2025-12-04T12:10:21.6488692Z Retrying single test... 
2025-12-04T12:10:21.6488834Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1610f58c9df394d1.xml 2025-12-04T12:10:21.6488892Z ============================= test session starts ============================== 2025-12-04T12:10:21.6489013Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6489056Z cachedir: .pytest_cache 2025-12-04T12:10:21.6489214Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6489267Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6489308Z configfile: pytest.ini 2025-12-04T12:10:21.6489469Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6489544Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6489800Z stepcurrent: skipping 126 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6489843Z Running 1 items in this shard 2025-12-04T12:10:21.6489846Z 2025-12-04T12:10:21.6490064Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7214s] [100%] 2025-12-04T12:10:21.6490330Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2632s] [100%] 2025-12-04T12:10:21.6490518Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2300s] [100%] 2025-12-04T12:10:21.6490520Z 2025-12-04T12:10:21.6490572Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6490717Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6490766Z Traceback (most recent call last): 2025-12-04T12:10:21.6490935Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6490980Z method(*args, **kwargs) 2025-12-04T12:10:21.6491132Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6491176Z method(*args, **kwargs) 2025-12-04T12:10:21.6491325Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6491362Z with policy(): 2025-12-04T12:10:21.6491514Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6491556Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6491944Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.6491947Z 2025-12-04T12:10:21.6492019Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6492277Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6492279Z 2025-12-04T12:10:21.6492364Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6492452Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6492497Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6492553Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6492619Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6492723Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6492779Z graph_break [] 2025-12-04T12:10:21.6492839Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6492983Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6493033Z Traceback (most recent call last): 2025-12-04T12:10:21.6493186Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6493227Z method(*args, **kwargs) 2025-12-04T12:10:21.6493377Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:21.6493417Z method(*args, **kwargs) 2025-12-04T12:10:21.6493568Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6493605Z with policy(): 2025-12-04T12:10:21.6493760Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6493802Z raise RuntimeError(msg) 2025-12-04T12:10:21.6494199Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.6494201Z 2025-12-04T12:10:21.6494274Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6494532Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6494534Z 2025-12-04T12:10:21.6494620Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6494703Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6494751Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6494808Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6494874Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6494971Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6495008Z graph_break [] 2025-12-04T12:10:21.6495067Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6495141Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.6495185Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6495241Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6495336Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6495400Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6495436Z graph_break [] 2025-12-04T12:10:21.6495495Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6495549Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6495694Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6495741Z Traceback (most recent call last): 2025-12-04T12:10:21.6495895Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6495949Z method(*args, **kwargs) 2025-12-04T12:10:21.6496100Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6496141Z method(*args, **kwargs) 2025-12-04T12:10:21.6496291Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6496340Z with policy(): 2025-12-04T12:10:21.6496491Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6496538Z raise RuntimeError(msg) 2025-12-04T12:10:21.6496922Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.6496925Z 2025-12-04T12:10:21.6496998Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6497254Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6497257Z 2025-12-04T12:10:21.6497343Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6497417Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6497471Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6497526Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6497589Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6497687Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6497726Z graph_break [] 2025-12-04T12:10:21.6497785Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6497860Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6497901Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6497957Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6498067Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6498133Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6498170Z graph_break [] 2025-12-04T12:10:21.6498229Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6498301Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6498344Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6498397Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6498494Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6498556Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6498593Z graph_break [] 2025-12-04T12:10:21.6498650Z aten_mm_info [('aten._scaled_mm.default_1024_16_16', 1)] 2025-12-04T12:10:21.6498838Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1610f58c9df394d1.xml - 2025-12-04T12:10:21.6498898Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6499477Z FAILED [0.2300s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.6499489Z 2025-12-04T12:10:21.6499561Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6499819Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6499832Z 2025-12-04T12:10:21.6499920Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6499982Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6500051Z ================== 1 failed, 187 deselected, 2 rerun in 2.23s ================== 2025-12-04T12:10:21.6500130Z Got exit code 1 2025-12-04T12:10:21.6500335Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6500464Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6500604Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0dcad95635f5e330.xml 2025-12-04T12:10:21.6500662Z ============================= test session starts ============================== 2025-12-04T12:10:21.6500772Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6500814Z cachedir: .pytest_cache 2025-12-04T12:10:21.6500970Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6501033Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6501075Z configfile: pytest.ini 2025-12-04T12:10:21.6501237Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6501312Z collecting ... collected 188 items / 127 deselected / 61 selected 2025-12-04T12:10:21.6501368Z stepcurrent: skipping 127 already run items. 
2025-12-04T12:10:21.6501412Z Running 61 items in this shard 2025-12-04T12:10:21.6501414Z 2025-12-04T12:10:21.6501635Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9229s] [ 1%] 2025-12-04T12:10:21.6501863Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3726s] [ 1%] 2025-12-04T12:10:21.6502056Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3328s] [ 1%] 2025-12-04T12:10:21.6502058Z 2025-12-04T12:10:21.6502109Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6502255Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6502304Z Traceback (most recent call last): 2025-12-04T12:10:21.6502459Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6502502Z method(*args, **kwargs) 2025-12-04T12:10:21.6502652Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6502694Z method(*args, **kwargs) 2025-12-04T12:10:21.6502844Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6502883Z with policy(): 2025-12-04T12:10:21.6503034Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6503075Z raise RuntimeError(msg) 2025-12-04T12:10:21.6503477Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1113587712. 2025-12-04T12:10:21.6503479Z 2025-12-04T12:10:21.6503553Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6503826Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6503830Z 2025-12-04T12:10:21.6503915Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6503988Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6504031Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6504087Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6504153Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6504251Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6504286Z graph_break [] 2025-12-04T12:10:21.6504351Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6504498Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6504548Z Traceback (most recent call last): 2025-12-04T12:10:21.6504713Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6504754Z method(*args, **kwargs) 2025-12-04T12:10:21.6504903Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6504944Z method(*args, **kwargs) 
2025-12-04T12:10:21.6505093Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6506550Z with policy(): 2025-12-04T12:10:21.6506706Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6506751Z raise RuntimeError(msg) 2025-12-04T12:10:21.6507159Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1128267776. 2025-12-04T12:10:21.6507164Z 2025-12-04T12:10:21.6507238Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6507500Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6507503Z 2025-12-04T12:10:21.6507588Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6507664Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6507708Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6507771Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6507836Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6507933Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6507972Z graph_break [] 2025-12-04T12:10:21.6508035Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6508107Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6508150Z frames [('total', 
1), ('ok', 1)] 2025-12-04T12:10:21.6508226Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6508326Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6508388Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6508424Z graph_break [] 2025-12-04T12:10:21.6508484Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6508552Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6508698Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6508747Z Traceback (most recent call last): 2025-12-04T12:10:21.6508901Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6508942Z method(*args, **kwargs) 2025-12-04T12:10:21.6509092Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6509135Z method(*args, **kwargs) 2025-12-04T12:10:21.6509283Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6509322Z with policy(): 2025-12-04T12:10:21.6509473Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6509517Z raise RuntimeError(msg) 2025-12-04T12:10:21.6509904Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 
2025-12-04T12:10:21.6509916Z 2025-12-04T12:10:21.6509988Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6510290Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6510293Z 2025-12-04T12:10:21.6510378Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6510470Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6510515Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6510570Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6510635Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6510732Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6510768Z graph_break [] 2025-12-04T12:10:21.6510830Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6510902Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6510946Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6511000Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6511096Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6511158Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6511198Z graph_break [] 2025-12-04T12:10:21.6511260Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6511332Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6511377Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6511431Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6511526Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6511588Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6511626Z graph_break [] 2025-12-04T12:10:21.6511699Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6511889Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0dcad95635f5e330.xml - 2025-12-04T12:10:21.6511948Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6512540Z FAILED [0.3328s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 2025-12-04T12:10:21.6512555Z 2025-12-04T12:10:21.6512627Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6512887Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6512890Z 2025-12-04T12:10:21.6512975Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6513038Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6513106Z ================== 1 failed, 127 deselected, 2 rerun in 2.65s ================== 2025-12-04T12:10:21.6513156Z Got exit code 1 2025-12-04T12:10:21.6513197Z Retrying single test... 
2025-12-04T12:10:21.6513341Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a6340134f0248142.xml 2025-12-04T12:10:21.6513400Z ============================= test session starts ============================== 2025-12-04T12:10:21.6513512Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6513555Z cachedir: .pytest_cache 2025-12-04T12:10:21.6513711Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6513758Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6513799Z configfile: pytest.ini 2025-12-04T12:10:21.6513975Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6514051Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6514307Z stepcurrent: skipping 127 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6514351Z Running 1 items in this shard 2025-12-04T12:10:21.6514353Z 2025-12-04T12:10:21.6514572Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7344s] [100%] 2025-12-04T12:10:21.6514786Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2586s] [100%] 2025-12-04T12:10:21.6514978Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2315s] [100%] 2025-12-04T12:10:21.6514981Z 2025-12-04T12:10:21.6515032Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6515178Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6515225Z Traceback (most recent call last): 2025-12-04T12:10:21.6515380Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6515432Z method(*args, **kwargs) 2025-12-04T12:10:21.6515584Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6515625Z method(*args, **kwargs) 2025-12-04T12:10:21.6515774Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6515826Z with policy(): 2025-12-04T12:10:21.6515977Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6516021Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6516410Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1113587712. 2025-12-04T12:10:21.6516414Z 2025-12-04T12:10:21.6516486Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6516748Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6516751Z 2025-12-04T12:10:21.6516838Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6516911Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6516966Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6517022Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6517087Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6517184Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6517220Z graph_break [] 2025-12-04T12:10:21.6517283Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6517428Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6517475Z Traceback (most recent call last): 2025-12-04T12:10:21.6517635Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6517679Z method(*args, **kwargs) 2025-12-04T12:10:21.6517827Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6517869Z method(*args, **kwargs) 2025-12-04T12:10:21.6518017Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6518056Z with policy(): 2025-12-04T12:10:21.6518207Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6518248Z raise RuntimeError(msg) 2025-12-04T12:10:21.6518639Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1128267776. 2025-12-04T12:10:21.6518643Z 2025-12-04T12:10:21.6518715Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6518976Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6518979Z 2025-12-04T12:10:21.6519064Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6519148Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6519191Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6519246Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6519311Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6519408Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6519456Z graph_break [] 2025-12-04T12:10:21.6519518Z aten_mm_info 
[('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6519589Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6519633Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6519687Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6519783Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6519845Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6519882Z graph_break [] 2025-12-04T12:10:21.6519947Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6519999Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6520177Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6520223Z Traceback (most recent call last): 2025-12-04T12:10:21.6520378Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6520434Z method(*args, **kwargs) 2025-12-04T12:10:21.6520583Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6520623Z method(*args, **kwargs) 2025-12-04T12:10:21.6520773Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6520810Z with policy(): 2025-12-04T12:10:21.6520962Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6521003Z raise RuntimeError(msg) 2025-12-04T12:10:21.6521412Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 2025-12-04T12:10:21.6521417Z 2025-12-04T12:10:21.6521489Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6521747Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6521750Z 2025-12-04T12:10:21.6521837Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6521910Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6521951Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6522006Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6522069Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6522168Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6522204Z graph_break [] 2025-12-04T12:10:21.6522269Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6522343Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6522384Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6522440Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6522537Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6522612Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6522649Z graph_break [] 2025-12-04T12:10:21.6522710Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6522784Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6522827Z frames [('total', 1), 
('ok', 1)] 2025-12-04T12:10:21.6522898Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6522995Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6523058Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6523096Z graph_break [] 2025-12-04T12:10:21.6523156Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6523344Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a6340134f0248142.xml - 2025-12-04T12:10:21.6523404Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6523992Z FAILED [0.2315s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 2025-12-04T12:10:21.6524005Z 2025-12-04T12:10:21.6524078Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6524337Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6524339Z 2025-12-04T12:10:21.6524426Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6524487Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6524554Z ================== 1 failed, 187 deselected, 2 rerun in 2.24s ================== 2025-12-04T12:10:21.6524591Z Got exit code 1 2025-12-04T12:10:21.6524632Z Retrying single test... 2025-12-04T12:10:21.6524787Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2fc7b3a2df6d1d06.xml 2025-12-04T12:10:21.6524845Z ============================= test session starts ============================== 2025-12-04T12:10:21.6524957Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6524998Z cachedir: .pytest_cache 2025-12-04T12:10:21.6525155Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6525200Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6525242Z configfile: pytest.ini 2025-12-04T12:10:21.6525405Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6525478Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6525740Z stepcurrent: skipping 127 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6525783Z Running 1 items in this shard 2025-12-04T12:10:21.6525787Z 2025-12-04T12:10:21.6526004Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8644s] [100%] 2025-12-04T12:10:21.6526231Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3163s] [100%] 2025-12-04T12:10:21.6526422Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2804s] [100%] 2025-12-04T12:10:21.6526424Z 2025-12-04T12:10:21.6526476Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6526633Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6526679Z Traceback (most recent call last): 2025-12-04T12:10:21.6526837Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6526879Z method(*args, **kwargs) 2025-12-04T12:10:21.6527030Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6527070Z method(*args, **kwargs) 2025-12-04T12:10:21.6527220Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6527258Z with policy(): 2025-12-04T12:10:21.6527408Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6527449Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6527841Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1113587712. 2025-12-04T12:10:21.6527854Z 2025-12-04T12:10:21.6527928Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6528189Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6528191Z 2025-12-04T12:10:21.6528276Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6528349Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6528390Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6528457Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6528522Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6528621Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6528658Z graph_break [] 2025-12-04T12:10:21.6528720Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6528864Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6528910Z Traceback (most recent call last): 2025-12-04T12:10:21.6529062Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6529103Z method(*args, **kwargs) 2025-12-04T12:10:21.6529252Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6529292Z method(*args, **kwargs) 2025-12-04T12:10:21.6529441Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6529480Z with policy(): 2025-12-04T12:10:21.6529629Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6529671Z raise RuntimeError(msg) 2025-12-04T12:10:21.6530072Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1128267776. 2025-12-04T12:10:21.6530075Z 2025-12-04T12:10:21.6530190Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6530452Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6530468Z 2025-12-04T12:10:21.6530554Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6530629Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6530671Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6530727Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6530791Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6530890Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6530926Z graph_break [] 2025-12-04T12:10:21.6530989Z aten_mm_info 
[('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6531061Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6531103Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6531162Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6531258Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6531335Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6531372Z graph_break [] 2025-12-04T12:10:21.6531432Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6531485Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6531632Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6531678Z Traceback (most recent call last): 2025-12-04T12:10:21.6531834Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6531874Z method(*args, **kwargs) 2025-12-04T12:10:21.6532035Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6532076Z method(*args, **kwargs) 2025-12-04T12:10:21.6532225Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6532262Z with policy(): 2025-12-04T12:10:21.6532414Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6532454Z raise RuntimeError(msg) 2025-12-04T12:10:21.6532953Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! 
Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 2025-12-04T12:10:21.6532956Z 2025-12-04T12:10:21.6533028Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6533289Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6533293Z 2025-12-04T12:10:21.6533379Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6533451Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6533493Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6533547Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6533628Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6533726Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6533762Z graph_break [] 2025-12-04T12:10:21.6533823Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6533897Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6533947Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6534002Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6534098Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6534162Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6534197Z graph_break [] 2025-12-04T12:10:21.6534261Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6534333Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6534375Z frames [('total', 1), 
('ok', 1)] 2025-12-04T12:10:21.6534428Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6534524Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6534586Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6534626Z graph_break [] 2025-12-04T12:10:21.6534685Z aten_mm_info [('aten._scaled_mm.default_1024_2048_16', 1)] 2025-12-04T12:10:21.6534873Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2fc7b3a2df6d1d06.xml - 2025-12-04T12:10:21.6534948Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6535534Z FAILED [0.2804s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 2025-12-04T12:10:21.6535536Z 2025-12-04T12:10:21.6535607Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6535875Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6535878Z 2025-12-04T12:10:21.6535964Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6536025Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6536092Z ================== 1 failed, 187 deselected, 2 rerun in 2.48s ================== 2025-12-04T12:10:21.6536129Z Got exit code 1 2025-12-04T12:10:21.6536339Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6536467Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6536610Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-de4536db5592268d.xml 2025-12-04T12:10:21.6536668Z ============================= test session starts ============================== 2025-12-04T12:10:21.6536779Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6536820Z cachedir: .pytest_cache 2025-12-04T12:10:21.6536978Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6537023Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6537080Z configfile: pytest.ini 2025-12-04T12:10:21.6537241Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6537318Z collecting ... collected 188 items / 128 deselected / 60 selected 2025-12-04T12:10:21.6537372Z stepcurrent: skipping 128 already run items. 
2025-12-04T12:10:21.6537427Z Running 60 items in this shard 2025-12-04T12:10:21.6537429Z 2025-12-04T12:10:21.6537646Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.2639s] [ 1%] 2025-12-04T12:10:21.6537859Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8833s] [ 1%] 2025-12-04T12:10:21.6538052Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda FAILED [0.7544s] [ 1%] 2025-12-04T12:10:21.6538054Z 2025-12-04T12:10:21.6538104Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6538250Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6538296Z Traceback (most recent call last): 2025-12-04T12:10:21.6538455Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6538495Z method(*args, **kwargs) 2025-12-04T12:10:21.6538658Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6538697Z method(*args, **kwargs) 2025-12-04T12:10:21.6538847Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6538884Z with policy(): 2025-12-04T12:10:21.6539036Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6539078Z raise RuntimeError(msg) 2025-12-04T12:10:21.6539472Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.6539476Z 2025-12-04T12:10:21.6539549Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6539808Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6539810Z 2025-12-04T12:10:21.6539896Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6539969Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6540012Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6540066Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6540586Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6540687Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6540723Z graph_break [] 2025-12-04T12:10:21.6540785Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6540871Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6541358Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6541418Z current_size = base.storage().size() 2025-12-04T12:10:21.6541460Z Autotune Choices Stats: 2025-12-04T12:10:21.6541830Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.6541876Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6541919Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6542019Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6542256Z triton_mm_3 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6542487Z triton_mm_2 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6542725Z triton_mm_4 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6542947Z triton_mm_5 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=2 2025-12-04T12:10:21.6543182Z triton_mm_6 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6543403Z triton_mm_7 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6543627Z triton_mm_0 0.0062 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6543850Z triton_mm_1 0.0062 ms 94.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6543893Z _scaled_mm 0.0231 ms 25.4% 2025-12-04T12:10:21.6544023Z SingleProcess AUTOTUNE benchmarking takes 0.0523 seconds and 0.2147 seconds precompiling for 9 choices 2025-12-04T12:10:21.6544170Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6544216Z Traceback (most recent call last): 2025-12-04T12:10:21.6544373Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6544414Z method(*args, **kwargs) 2025-12-04T12:10:21.6544566Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6544607Z method(*args, **kwargs) 2025-12-04T12:10:21.6544767Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 
2025-12-04T12:10:21.6544805Z with policy(): 2025-12-04T12:10:21.6544956Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6544997Z raise RuntimeError(msg) 2025-12-04T12:10:21.6545402Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.6545406Z 2025-12-04T12:10:21.6545479Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6545739Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6545741Z 2025-12-04T12:10:21.6545827Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6545900Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6545943Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6546001Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6546481Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6546589Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6546626Z graph_break [] 2025-12-04T12:10:21.6546687Z aten_mm_info 
[('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6546760Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6547256Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6547306Z current_size = base.storage().size() 2025-12-04T12:10:21.6547348Z Autotune Choices Stats: 2025-12-04T12:10:21.6547714Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.6547759Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6547801Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6547901Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6548134Z triton_mm_3 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6548360Z triton_mm_2 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6548595Z triton_mm_4 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6548818Z triton_mm_5 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6549055Z triton_mm_6 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6549276Z triton_mm_7 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6549503Z triton_mm_0 0.0062 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6549728Z triton_mm_1 0.0062 ms 94.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6549774Z _scaled_mm 0.0231 ms 25.4% 2025-12-04T12:10:21.6549902Z SingleProcess AUTOTUNE benchmarking takes 0.0523 seconds and 0.2147 seconds precompiling for 9 choices 2025-12-04T12:10:21.6549986Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6550027Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6550084Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6550222Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6550701Z inductor [('triton_bundler_save_kernel', 72), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6550737Z graph_break [] 2025-12-04T12:10:21.6550812Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6550885Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6550926Z Autotune Choices Stats: 2025-12-04T12:10:21.6551289Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.005919000133872032, "best_triton_pos": 0} 2025-12-04T12:10:21.6551336Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6551377Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6551475Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6551706Z triton_mm_8 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6551934Z triton_mm_10 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6552159Z triton_mm_15 0.0062 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6552394Z triton_mm_12 0.0063 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6552617Z triton_mm_13 0.0063 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6552853Z triton_mm_14 0.0064 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6553079Z triton_mm_9 0.0067 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6553306Z triton_mm_11 0.0094 ms 62.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6553346Z _scaled_mm 0.0210 ms 28.1% 2025-12-04T12:10:21.6553475Z SingleProcess AUTOTUNE benchmarking takes 0.0579 seconds and 0.1265 seconds precompiling for 9 choices 2025-12-04T12:10:21.6553527Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6553688Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6553734Z Traceback (most recent call last): 2025-12-04T12:10:21.6553891Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.6553931Z method(*args, **kwargs) 2025-12-04T12:10:21.6554084Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6554124Z method(*args, **kwargs) 2025-12-04T12:10:21.6554274Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6554313Z with policy(): 2025-12-04T12:10:21.6554474Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6554516Z raise RuntimeError(msg) 2025-12-04T12:10:21.6554904Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.6554906Z 2025-12-04T12:10:21.6554981Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6555240Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6555242Z 2025-12-04T12:10:21.6555331Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6555405Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6555448Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6555506Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6555995Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6556094Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6556130Z graph_break [] 2025-12-04T12:10:21.6556191Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6556265Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6556758Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6556805Z current_size = base.storage().size() 2025-12-04T12:10:21.6556846Z Autotune Choices Stats: 2025-12-04T12:10:21.6557207Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.6557251Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6557293Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6557393Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6557635Z triton_mm_3 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6557863Z triton_mm_2 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6558092Z triton_mm_4 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6558324Z triton_mm_5 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6558550Z triton_mm_6 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6558770Z triton_mm_7 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6558995Z triton_mm_0 0.0062 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6559220Z triton_mm_1 0.0062 ms 94.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6559262Z _scaled_mm 0.0231 ms 25.4% 2025-12-04T12:10:21.6559390Z SingleProcess AUTOTUNE benchmarking takes 0.0523 seconds and 0.2147 seconds precompiling for 9 choices 2025-12-04T12:10:21.6559464Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6559506Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6559562Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6559673Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6560192Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6560244Z graph_break [] 2025-12-04T12:10:21.6560305Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 
1)] 2025-12-04T12:10:21.6560377Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6560417Z Autotune Choices Stats: 2025-12-04T12:10:21.6560778Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.005919000133872032, "best_triton_pos": 0} 2025-12-04T12:10:21.6560823Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6560864Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6560963Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6561194Z triton_mm_8 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6561434Z triton_mm_10 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6561661Z triton_mm_15 0.0062 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6561898Z triton_mm_12 0.0063 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6562121Z triton_mm_13 0.0063 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6562344Z triton_mm_14 0.0064 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6562568Z triton_mm_9 0.0067 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6562790Z triton_mm_11 0.0094 ms 62.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6562833Z _scaled_mm 0.0210 ms 28.1% 2025-12-04T12:10:21.6562960Z SingleProcess AUTOTUNE benchmarking takes 0.0579 seconds and 0.1265 seconds precompiling for 9 choices 2025-12-04T12:10:21.6563034Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6563075Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6563131Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6563229Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6563725Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6563772Z graph_break [] 2025-12-04T12:10:21.6563833Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6563908Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6563948Z Autotune Choices Stats: 2025-12-04T12:10:21.6564306Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_21", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:21.6564350Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6564392Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6564489Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6564720Z triton_mm_21 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6564962Z triton_mm_23 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6565187Z triton_mm_22 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6565413Z triton_mm_17 0.0063 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6565648Z triton_mm_20 0.0064 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6565876Z triton_mm_16 0.0067 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6565916Z _scaled_mm 0.0070 ms 84.1% 2025-12-04T12:10:21.6566142Z triton_mm_19 0.0075 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6566368Z triton_mm_18 0.0088 ms 67.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6566495Z SingleProcess AUTOTUNE benchmarking takes 0.0586 seconds and 0.2429 seconds precompiling for 9 choices 2025-12-04T12:10:21.6566682Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-de4536db5592268d.xml - 2025-12-04T12:10:21.6566742Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6567335Z FAILED [0.7544s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.6567338Z 2025-12-04T12:10:21.6567411Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6567681Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6567684Z 2025-12-04T12:10:21.6567771Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6567832Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6567902Z ================== 1 failed, 128 deselected, 2 rerun in 3.92s ================== 2025-12-04T12:10:21.6567939Z Got exit code 1 2025-12-04T12:10:21.6567980Z Retrying single test... 2025-12-04T12:10:21.6568123Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-785c14d917e93f88.xml 2025-12-04T12:10:21.6568180Z ============================= test session starts ============================== 2025-12-04T12:10:21.6568292Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6568335Z cachedir: .pytest_cache 2025-12-04T12:10:21.6568492Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6568547Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6568587Z configfile: pytest.ini 2025-12-04T12:10:21.6568751Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6568825Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6569079Z stepcurrent: skipping 128 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6569121Z Running 1 items in this shard 2025-12-04T12:10:21.6569123Z 2025-12-04T12:10:21.6569347Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.3794s] [100%] 2025-12-04T12:10:21.6569562Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8881s] [100%] 2025-12-04T12:10:21.6569751Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda FAILED [0.7897s] [100%] 2025-12-04T12:10:21.6569753Z 2025-12-04T12:10:21.6569805Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6569950Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6569996Z Traceback (most recent call last): 2025-12-04T12:10:21.6570202Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6570246Z method(*args, **kwargs) 2025-12-04T12:10:21.6570396Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6570439Z method(*args, **kwargs) 2025-12-04T12:10:21.6570588Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6570627Z with policy(): 2025-12-04T12:10:21.6570779Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6570836Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6571226Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.6571243Z 2025-12-04T12:10:21.6571316Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6571575Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6571579Z 2025-12-04T12:10:21.6571665Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6571738Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6571781Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6571837Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6572323Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6572436Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6572474Z graph_break [] 2025-12-04T12:10:21.6572535Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6572606Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6573089Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6573140Z current_size = base.storage().size() 2025-12-04T12:10:21.6573193Z Autotune Choices Stats: 2025-12-04T12:10:21.6573558Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.6573604Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6573648Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6573747Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6573984Z triton_mm_6 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6574214Z triton_mm_4 0.0070 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6574442Z triton_mm_3 0.0071 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6574678Z triton_mm_2 0.0072 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6574899Z triton_mm_5 0.0074 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6575121Z triton_mm_7 0.0082 ms 78.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6575363Z triton_mm_1 0.0089 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6575587Z triton_mm_0 0.0103 ms 62.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6575629Z _scaled_mm 0.0217 ms 29.7% 2025-12-04T12:10:21.6575756Z SingleProcess AUTOTUNE benchmarking takes 0.0560 seconds and 0.2747 seconds precompiling for 9 choices 2025-12-04T12:10:21.6575902Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6575951Z Traceback (most recent call last): 2025-12-04T12:10:21.6576108Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6576160Z method(*args, **kwargs) 2025-12-04T12:10:21.6576312Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6576352Z method(*args, **kwargs) 2025-12-04T12:10:21.6576502Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6576540Z with policy(): 2025-12-04T12:10:21.6576692Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6576733Z raise RuntimeError(msg) 2025-12-04T12:10:21.6577137Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.6577142Z 2025-12-04T12:10:21.6577216Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6577475Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6577477Z 2025-12-04T12:10:21.6577565Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6577637Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6577680Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6577736Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6578214Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6578313Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6578350Z graph_break [] 2025-12-04T12:10:21.6578410Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6578492Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6578974Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6579032Z current_size = base.storage().size() 2025-12-04T12:10:21.6579074Z Autotune Choices Stats: 2025-12-04T12:10:21.6579435Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.6579481Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6579521Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6579621Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6579854Z triton_mm_6 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6580082Z triton_mm_4 0.0070 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6580352Z triton_mm_3 0.0071 ms 90.4% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6580579Z triton_mm_2 0.0072 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6580815Z triton_mm_5 0.0074 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6581038Z triton_mm_7 0.0082 ms 78.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6581262Z triton_mm_1 0.0089 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6581487Z triton_mm_0 0.0103 ms 62.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6581528Z _scaled_mm 0.0217 ms 29.7% 2025-12-04T12:10:21.6581656Z SingleProcess AUTOTUNE benchmarking takes 0.0560 seconds and 0.2747 seconds precompiling for 9 choices 2025-12-04T12:10:21.6581730Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6581773Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6581829Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6581927Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6582416Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6582454Z graph_break [] 2025-12-04T12:10:21.6582514Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6582600Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6582640Z Autotune Choices Stats: 2025-12-04T12:10:21.6582998Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.6583043Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6583084Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6583183Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6583413Z triton_mm_14 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6583638Z triton_mm_13 0.0062 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6583879Z triton_mm_12 0.0066 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6584105Z triton_mm_9 0.0068 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6584327Z triton_mm_15 0.0070 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6584561Z triton_mm_11 0.0073 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6584788Z triton_mm_10 0.0084 ms 71.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6585013Z triton_mm_8 0.0100 ms 59.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6585054Z _scaled_mm 0.0222 ms 26.8% 2025-12-04T12:10:21.6585180Z SingleProcess AUTOTUNE benchmarking takes 0.0494 seconds and 0.1505 seconds precompiling for 9 choices 2025-12-04T12:10:21.6585233Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6585379Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6585426Z Traceback (most recent call last): 2025-12-04T12:10:21.6585580Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6585622Z method(*args, **kwargs) 2025-12-04T12:10:21.6585773Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6585826Z method(*args, **kwargs) 2025-12-04T12:10:21.6585976Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6586014Z with policy(): 2025-12-04T12:10:21.6586165Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6586220Z raise RuntimeError(msg) 2025-12-04T12:10:21.6586608Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.6586612Z 2025-12-04T12:10:21.6586684Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6586945Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6586947Z 2025-12-04T12:10:21.6587033Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6587107Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6587151Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6587211Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6587689Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6587798Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6587834Z graph_break [] 2025-12-04T12:10:21.6587895Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6587969Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6588460Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6588510Z current_size = base.storage().size() 2025-12-04T12:10:21.6588553Z Autotune Choices Stats: 2025-12-04T12:10:21.6588916Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.6588959Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6589000Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6589099Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6589331Z triton_mm_6 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6589559Z triton_mm_4 0.0070 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6589793Z triton_mm_3 0.0071 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6590019Z triton_mm_2 0.0072 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6590305Z triton_mm_5 0.0074 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6590527Z triton_mm_7 0.0082 ms 78.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6590749Z triton_mm_1 0.0089 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6590975Z triton_mm_0 0.0103 ms 62.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6591020Z _scaled_mm 0.0217 ms 29.7% 2025-12-04T12:10:21.6591146Z SingleProcess AUTOTUNE benchmarking takes 0.0560 seconds and 0.2747 seconds precompiling for 9 choices 2025-12-04T12:10:21.6591243Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6591286Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6591342Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6591440Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6591924Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6591974Z graph_break [] 2025-12-04T12:10:21.6592037Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 
1)] 2025-12-04T12:10:21.6592110Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6592153Z Autotune Choices Stats: 2025-12-04T12:10:21.6592510Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.6592558Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6592599Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6592696Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6592927Z triton_mm_14 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6593151Z triton_mm_13 0.0062 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6593399Z triton_mm_12 0.0066 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6593623Z triton_mm_9 0.0068 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6593846Z triton_mm_15 0.0070 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6594083Z triton_mm_11 0.0073 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6594307Z triton_mm_10 0.0084 ms 71.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6594532Z triton_mm_8 0.0100 ms 59.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6594573Z _scaled_mm 0.0222 ms 26.8% 2025-12-04T12:10:21.6594702Z SingleProcess AUTOTUNE benchmarking takes 0.0494 seconds and 0.1505 seconds precompiling for 9 choices 2025-12-04T12:10:21.6594774Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6594827Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6594882Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6594981Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6595460Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6595498Z graph_break [] 2025-12-04T12:10:21.6595569Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 
2025-12-04T12:10:21.6595642Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6595683Z Autotune Choices Stats: 2025-12-04T12:10:21.6596042Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:21.6596087Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6596127Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6596226Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6596459Z triton_mm_17 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6596685Z triton_mm_22 0.0069 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6596907Z triton_mm_21 0.0070 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6597142Z triton_mm_20 0.0082 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6597371Z triton_mm_18 0.0088 ms 71.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6597604Z triton_mm_23 0.0089 ms 70.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6597830Z triton_mm_19 0.0098 ms 63.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6598055Z triton_mm_16 0.0099 ms 63.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6598096Z _scaled_mm 0.0216 ms 29.1% 2025-12-04T12:10:21.6598223Z SingleProcess AUTOTUNE benchmarking takes 0.0665 seconds and 0.2343 seconds precompiling for 9 choices 2025-12-04T12:10:21.6598411Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-785c14d917e93f88.xml - 2025-12-04T12:10:21.6598482Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6599066Z FAILED [0.7897s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.6599068Z 2025-12-04T12:10:21.6599142Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6599411Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6599416Z 2025-12-04T12:10:21.6599502Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6599566Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6599633Z ================== 1 failed, 187 deselected, 2 rerun in 4.08s ================== 2025-12-04T12:10:21.6599671Z Got exit code 1 2025-12-04T12:10:21.6599711Z Retrying single test... 2025-12-04T12:10:21.6599855Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-93669e71916f7ef8.xml 2025-12-04T12:10:21.6599912Z ============================= test session starts ============================== 2025-12-04T12:10:21.6600024Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6600065Z cachedir: .pytest_cache 2025-12-04T12:10:21.6600264Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6600308Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6600350Z configfile: pytest.ini 2025-12-04T12:10:21.6600513Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6600588Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6600857Z stepcurrent: skipping 128 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6600901Z Running 1 items in this shard 2025-12-04T12:10:21.6600904Z 2025-12-04T12:10:21.6601118Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.1310s] [100%] 2025-12-04T12:10:21.6601347Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0125s] [100%] 2025-12-04T12:10:21.6601540Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda FAILED [0.7510s] [100%] 2025-12-04T12:10:21.6601542Z 2025-12-04T12:10:21.6601594Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6601745Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6601795Z Traceback (most recent call last): 2025-12-04T12:10:21.6601952Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6601995Z method(*args, **kwargs) 2025-12-04T12:10:21.6602151Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6602193Z method(*args, **kwargs) 2025-12-04T12:10:21.6602358Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6602396Z with policy(): 2025-12-04T12:10:21.6602548Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6602589Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6602979Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.6602982Z 2025-12-04T12:10:21.6603055Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6603329Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6603333Z 2025-12-04T12:10:21.6603420Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6603491Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6603534Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6603590Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6604076Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6604174Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6604212Z graph_break [] 2025-12-04T12:10:21.6604275Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6604350Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6604839Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6604887Z current_size = base.storage().size() 2025-12-04T12:10:21.6604928Z Autotune Choices Stats: 2025-12-04T12:10:21.6605294Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006039000116288662, "best_triton_pos": 0} 2025-12-04T12:10:21.6605358Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6605399Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6605499Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6605732Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6605961Z triton_mm_0 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6606190Z triton_mm_2 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6606424Z triton_mm_4 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6606648Z triton_mm_5 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6606877Z triton_mm_7 0.0067 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6607100Z triton_mm_6 0.0070 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6607321Z triton_mm_3 0.0071 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6607362Z _scaled_mm 0.0238 ms 25.3% 2025-12-04T12:10:21.6607489Z SingleProcess AUTOTUNE benchmarking takes 0.0387 seconds and 0.1623 seconds precompiling for 9 choices 2025-12-04T12:10:21.6607635Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6607682Z Traceback (most recent call last): 2025-12-04T12:10:21.6607838Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6607881Z method(*args, **kwargs) 2025-12-04T12:10:21.6608033Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6608074Z method(*args, **kwargs) 2025-12-04T12:10:21.6608225Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6608263Z with policy(): 2025-12-04T12:10:21.6608425Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6608467Z raise RuntimeError(msg) 2025-12-04T12:10:21.6608856Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.6608867Z 2025-12-04T12:10:21.6608941Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6609201Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6609205Z 2025-12-04T12:10:21.6609290Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6609364Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6609407Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6609464Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6609941Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6610052Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6610128Z graph_break [] 2025-12-04T12:10:21.6610190Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6610263Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6610747Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6610807Z current_size = base.storage().size() 2025-12-04T12:10:21.6610849Z Autotune Choices Stats: 2025-12-04T12:10:21.6611214Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006039000116288662, "best_triton_pos": 0} 2025-12-04T12:10:21.6611259Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6611300Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6611399Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6611632Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6611860Z triton_mm_0 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6612088Z triton_mm_2 0.0061 ms 99.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6612326Z triton_mm_4 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6612549Z triton_mm_5 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6612786Z triton_mm_7 0.0067 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6613007Z triton_mm_6 0.0070 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6613230Z triton_mm_3 0.0071 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6613270Z _scaled_mm 0.0238 ms 25.3% 2025-12-04T12:10:21.6613398Z SingleProcess AUTOTUNE benchmarking takes 0.0387 seconds and 0.1623 seconds precompiling for 9 choices 2025-12-04T12:10:21.6613471Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6613516Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6613571Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6613684Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6614160Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6614197Z graph_break [] 2025-12-04T12:10:21.6614258Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6614329Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6614370Z Autotune Choices Stats: 2025-12-04T12:10:21.6614741Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.6614791Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6614830Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6614930Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6615159Z triton_mm_11 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6615385Z triton_mm_13 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6615611Z triton_mm_12 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6615835Z triton_mm_9 0.0063 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6616069Z triton_mm_10 0.0063 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6616293Z triton_mm_14 0.0068 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6616533Z triton_mm_8 0.0083 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6616760Z triton_mm_15 0.0096 ms 61.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6616801Z _scaled_mm 0.0184 ms 32.4% 2025-12-04T12:10:21.6616930Z SingleProcess AUTOTUNE benchmarking takes 0.0471 seconds and 0.1316 seconds precompiling for 9 choices 2025-12-04T12:10:21.6616982Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6617128Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6617174Z Traceback (most recent call last): 2025-12-04T12:10:21.6617330Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6617380Z method(*args, **kwargs) 2025-12-04T12:10:21.6617532Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6617572Z method(*args, **kwargs) 2025-12-04T12:10:21.6617723Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6617761Z with policy(): 2025-12-04T12:10:21.6617914Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6617954Z raise RuntimeError(msg) 2025-12-04T12:10:21.6618354Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.6618358Z 2025-12-04T12:10:21.6618432Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6618691Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6618694Z 2025-12-04T12:10:21.6618781Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6618852Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6618899Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6618956Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6619436Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6619536Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6619573Z graph_break [] 2025-12-04T12:10:21.6619652Z aten_mm_info [('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6619726Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6620240Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6620301Z current_size = base.storage().size() 2025-12-04T12:10:21.6620342Z Autotune Choices Stats: 2025-12-04T12:10:21.6620707Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006039000116288662, "best_triton_pos": 0} 2025-12-04T12:10:21.6620751Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6620791Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6620890Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6621124Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6621366Z triton_mm_0 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6621590Z triton_mm_2 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6621815Z triton_mm_4 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6622047Z triton_mm_5 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6622269Z triton_mm_7 0.0067 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6622491Z triton_mm_6 0.0070 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6622711Z triton_mm_3 0.0071 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6622753Z _scaled_mm 0.0238 ms 25.3% 2025-12-04T12:10:21.6622882Z SingleProcess AUTOTUNE benchmarking takes 0.0387 seconds and 0.1623 seconds precompiling for 9 choices 2025-12-04T12:10:21.6622955Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6622998Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6623054Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6623155Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6623642Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6623680Z graph_break [] 2025-12-04T12:10:21.6623740Z aten_mm_info 
[('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6623824Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6623863Z Autotune Choices Stats: 2025-12-04T12:10:21.6624222Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.6624266Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6624308Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6624406Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6624638Z triton_mm_11 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6624867Z triton_mm_13 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6625107Z triton_mm_12 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6625333Z triton_mm_9 0.0063 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6625565Z triton_mm_10 0.0063 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6625790Z triton_mm_14 0.0068 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6626014Z triton_mm_8 0.0083 ms 72.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6626238Z triton_mm_15 0.0096 ms 61.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6626279Z _scaled_mm 0.0184 ms 32.4% 2025-12-04T12:10:21.6626405Z SingleProcess AUTOTUNE benchmarking takes 0.0471 seconds and 0.1316 seconds precompiling for 9 choices 2025-12-04T12:10:21.6626480Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6626521Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6626579Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6626676Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6627169Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6627206Z graph_break [] 2025-12-04T12:10:21.6627267Z aten_mm_info 
[('aten._scaled_mm.default_1024_16_32', 1)] 2025-12-04T12:10:21.6627339Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6627390Z Autotune Choices Stats: 2025-12-04T12:10:21.6627746Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_22", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.6627790Z AUTOTUNE scaled_mm(1024x32, 32x16, , ) 2025-12-04T12:10:21.6627831Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6627929Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6628158Z triton_mm_22 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6628382Z triton_mm_19 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6628618Z triton_mm_17 0.0064 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6628843Z triton_mm_18 0.0066 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6629068Z triton_mm_21 0.0066 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6629316Z triton_mm_20 0.0066 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6629542Z triton_mm_16 0.0067 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6629583Z _scaled_mm 0.0069 ms 85.5% 2025-12-04T12:10:21.6629806Z triton_mm_23 0.0074 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6629934Z SingleProcess AUTOTUNE benchmarking takes 0.0583 seconds and 0.2184 seconds precompiling for 9 choices 2025-12-04T12:10:21.6630176Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-93669e71916f7ef8.xml - 2025-12-04T12:10:21.6630237Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6630833Z FAILED [0.7510s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.6630836Z 2025-12-04T12:10:21.6630909Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6631169Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6631184Z 2025-12-04T12:10:21.6631271Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6631335Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6631403Z ================== 1 failed, 187 deselected, 2 rerun in 3.92s ================== 2025-12-04T12:10:21.6631441Z Got exit code 1 2025-12-04T12:10:21.6631646Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6631774Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6631919Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-31fe77a96a9b1820.xml 2025-12-04T12:10:21.6631975Z ============================= test session starts ============================== 2025-12-04T12:10:21.6632087Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6632128Z cachedir: .pytest_cache 2025-12-04T12:10:21.6632287Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6632347Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6632388Z configfile: pytest.ini 2025-12-04T12:10:21.6632551Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6632629Z collecting ... 
collected 188 items / 129 deselected / 59 selected 2025-12-04T12:10:21.6632684Z stepcurrent: skipping 129 already run items. 2025-12-04T12:10:21.6632729Z Running 59 items in this shard 2025-12-04T12:10:21.6632731Z 2025-12-04T12:10:21.6632952Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.8781s] [ 1%] 2025-12-04T12:10:21.6633187Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1342s] [ 1%] 2025-12-04T12:10:21.6633380Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.9036s] [ 1%] 2025-12-04T12:10:21.6633384Z 2025-12-04T12:10:21.6633437Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6633584Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6633630Z Traceback (most recent call last): 2025-12-04T12:10:21.6633788Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6633829Z method(*args, **kwargs) 2025-12-04T12:10:21.6633982Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6634024Z method(*args, **kwargs) 2025-12-04T12:10:21.6634177Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6634214Z with policy(): 2025-12-04T12:10:21.6634368Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6634409Z raise RuntimeError(msg) 
2025-12-04T12:10:21.6634809Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1077936128. 2025-12-04T12:10:21.6634812Z 2025-12-04T12:10:21.6634887Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6635159Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6635163Z 2025-12-04T12:10:21.6635250Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6635322Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6635365Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6635422Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6635915Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6636013Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6636063Z graph_break [] 2025-12-04T12:10:21.6636128Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6636201Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6636682Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6636729Z current_size = base.storage().size() 2025-12-04T12:10:21.6636770Z Autotune Choices Stats: 2025-12-04T12:10:21.6637145Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_10", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00675999978557229, "best_triton_pos": 0} 2025-12-04T12:10:21.6637198Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6637238Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6637339Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6637572Z triton_mm_10 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6637803Z triton_mm_17 0.0072 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6638030Z triton_mm_5 0.0072 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6638255Z triton_mm_18 0.0074 ms 91.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6638490Z triton_mm_2 0.0075 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6638716Z triton_mm_16 0.0076 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6638954Z triton_mm_4 0.0077 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6639177Z triton_mm_14 0.0077 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6639401Z triton_mm_11 0.0077 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6639626Z triton_mm_13 0.0078 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6639755Z SingleProcess AUTOTUNE benchmarking takes 0.0973 seconds and 0.5897 seconds precompiling for 21 choices 2025-12-04T12:10:21.6639916Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6639961Z 
Traceback (most recent call last): 2025-12-04T12:10:21.6640165Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6640205Z method(*args, **kwargs) 2025-12-04T12:10:21.6640359Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6640400Z method(*args, **kwargs) 2025-12-04T12:10:21.6640550Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6640589Z with policy(): 2025-12-04T12:10:21.6640758Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6640800Z raise RuntimeError(msg) 2025-12-04T12:10:21.6641190Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1077936128 and is now 1136656384. 
2025-12-04T12:10:21.6641193Z 2025-12-04T12:10:21.6641269Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6641531Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6641533Z 2025-12-04T12:10:21.6641621Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6641694Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6641738Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6641795Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6642297Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6642396Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6642433Z graph_break [] 2025-12-04T12:10:21.6642495Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6642583Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6643065Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6643113Z current_size = base.storage().size() 2025-12-04T12:10:21.6643154Z Autotune Choices Stats: 2025-12-04T12:10:21.6643519Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_10", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00675999978557229, "best_triton_pos": 0} 2025-12-04T12:10:21.6643570Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6643612Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6643712Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6643959Z triton_mm_10 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6644185Z triton_mm_17 0.0072 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6644415Z triton_mm_5 0.0072 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6644649Z triton_mm_18 0.0074 ms 91.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6644878Z triton_mm_2 0.0075 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6645101Z triton_mm_16 0.0076 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6645327Z triton_mm_4 0.0077 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6645551Z triton_mm_14 0.0077 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6645776Z triton_mm_11 0.0077 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6646006Z triton_mm_13 0.0078 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6646134Z SingleProcess AUTOTUNE benchmarking takes 0.0973 seconds and 0.5897 seconds precompiling for 21 choices 2025-12-04T12:10:21.6646209Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6646262Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6646319Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6646417Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6646904Z inductor [('triton_bundler_save_kernel', 168), 
('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6646940Z graph_break [] 2025-12-04T12:10:21.6647006Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6647078Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6647120Z Autotune Choices Stats: 2025-12-04T12:10:21.6647480Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006639999803155661, "best_triton_pos": 0} 2025-12-04T12:10:21.6647544Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6647586Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6647684Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6647916Z triton_mm_36 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6648150Z triton_mm_32 0.0067 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6648379Z triton_mm_35 0.0069 ms 96.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6648604Z triton_mm_25 0.0070 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6648827Z triton_mm_30 0.0072 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6649052Z triton_mm_33 0.0074 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6649276Z triton_mm_34 0.0074 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6649503Z triton_mm_26 0.0074 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6649734Z triton_mm_29 0.0074 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6649962Z triton_mm_27 0.0075 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6650133Z SingleProcess AUTOTUNE benchmarking takes 0.1388 seconds and 0.3710 seconds precompiling for 21 choices 
2025-12-04T12:10:21.6650187Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6650338Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6650385Z Traceback (most recent call last): 2025-12-04T12:10:21.6650546Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6650587Z method(*args, **kwargs) 2025-12-04T12:10:21.6650738Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6650778Z method(*args, **kwargs) 2025-12-04T12:10:21.6650930Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6650969Z with policy(): 2025-12-04T12:10:21.6651123Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6651181Z raise RuntimeError(msg) 2025-12-04T12:10:21.6651575Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 
2025-12-04T12:10:21.6651577Z 2025-12-04T12:10:21.6651651Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6651924Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6651927Z 2025-12-04T12:10:21.6652016Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6652089Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6652134Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6652189Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6652681Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6652778Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6652815Z graph_break [] 2025-12-04T12:10:21.6652879Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6652953Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6653436Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6653497Z current_size = base.storage().size() 2025-12-04T12:10:21.6653540Z Autotune Choices Stats: 2025-12-04T12:10:21.6653903Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_10", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00675999978557229, "best_triton_pos": 0} 2025-12-04T12:10:21.6653963Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6654004Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6654103Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6654333Z triton_mm_10 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6654559Z triton_mm_17 0.0072 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6654789Z triton_mm_5 0.0072 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6655012Z triton_mm_18 0.0074 ms 91.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6655246Z triton_mm_2 0.0075 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6655471Z triton_mm_16 0.0076 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6655707Z triton_mm_4 0.0077 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6655931Z triton_mm_14 0.0077 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6656156Z triton_mm_11 0.0077 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6656380Z triton_mm_13 0.0078 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6656508Z SingleProcess AUTOTUNE benchmarking takes 0.0973 seconds and 0.5897 seconds precompiling for 21 choices 2025-12-04T12:10:21.6656585Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6656626Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6656683Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6656782Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6657279Z inductor [('triton_bundler_save_kernel', 168), 
('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6657316Z graph_break [] 2025-12-04T12:10:21.6657380Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6657451Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6657507Z Autotune Choices Stats: 2025-12-04T12:10:21.6657869Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006639999803155661, "best_triton_pos": 0} 2025-12-04T12:10:21.6657916Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6657958Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6658056Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6658287Z triton_mm_36 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6658513Z triton_mm_32 0.0067 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6658751Z triton_mm_35 0.0069 ms 96.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6658980Z triton_mm_25 0.0070 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6659202Z triton_mm_30 0.0072 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6659435Z triton_mm_33 0.0074 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6659658Z triton_mm_34 0.0074 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6659884Z triton_mm_26 0.0074 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6660145Z triton_mm_29 0.0074 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6660374Z triton_mm_27 0.0075 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6660504Z SingleProcess AUTOTUNE benchmarking takes 0.1388 seconds and 0.3710 seconds precompiling for 21 choices 
2025-12-04T12:10:21.6660575Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6660619Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6660676Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6660791Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6661275Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6661331Z graph_break [] 2025-12-04T12:10:21.6661394Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6661468Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6661509Z Autotune Choices Stats: 2025-12-04T12:10:21.6661871Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_46", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006599999964237213, "best_triton_pos": 0} 2025-12-04T12:10:21.6661917Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6661957Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6662057Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6662290Z triton_mm_46 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.6662531Z triton_mm_45 0.0067 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6662755Z triton_mm_58 0.0067 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6662981Z triton_mm_52 0.0068 ms 97.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6663216Z triton_mm_53 0.0069 ms 95.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6663440Z triton_mm_48 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6663665Z triton_mm_47 0.0072 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6663887Z triton_mm_54 0.0072 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6664115Z triton_mm_51 0.0072 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6664338Z triton_mm_55 0.0073 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6664476Z SingleProcess AUTOTUNE benchmarking takes 0.1505 seconds and 0.2237 seconds precompiling for 21 choices 2025-12-04T12:10:21.6664666Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-31fe77a96a9b1820.xml - 2025-12-04T12:10:21.6664726Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6665324Z FAILED [0.9036s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 2025-12-04T12:10:21.6665338Z 2025-12-04T12:10:21.6665412Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6665675Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6665677Z 2025-12-04T12:10:21.6665765Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6665828Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6665897Z ================== 1 failed, 129 deselected, 2 rerun in 4.94s ================== 2025-12-04T12:10:21.6665934Z Got exit code 1 2025-12-04T12:10:21.6665985Z Retrying single test... 2025-12-04T12:10:21.6666128Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ed21fb057c6c24d9.xml 2025-12-04T12:10:21.6666185Z ============================= test session starts ============================== 2025-12-04T12:10:21.6666296Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6666339Z cachedir: .pytest_cache 2025-12-04T12:10:21.6666496Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6666541Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6666581Z configfile: pytest.ini 2025-12-04T12:10:21.6666754Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6666830Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6667091Z stepcurrent: skipping 129 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6667134Z Running 1 items in this shard 2025-12-04T12:10:21.6667136Z 2025-12-04T12:10:21.6667358Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.7364s] [100%] 2025-12-04T12:10:21.6667573Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.9809s] [100%] 2025-12-04T12:10:21.6667769Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.9510s] [100%] 2025-12-04T12:10:21.6667772Z 2025-12-04T12:10:21.6667824Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6667970Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6668017Z Traceback (most recent call last): 2025-12-04T12:10:21.6668175Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6668217Z method(*args, **kwargs) 2025-12-04T12:10:21.6668379Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6668423Z method(*args, **kwargs) 2025-12-04T12:10:21.6668573Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6668623Z with policy(): 2025-12-04T12:10:21.6668774Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6668821Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6669214Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1077936128. 2025-12-04T12:10:21.6669217Z 2025-12-04T12:10:21.6669291Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6669554Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6669556Z 2025-12-04T12:10:21.6669644Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6669717Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6669770Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6669827Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6670354Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6670454Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6670492Z graph_break [] 2025-12-04T12:10:21.6670556Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6670643Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6671125Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6671174Z current_size = base.storage().size() 2025-12-04T12:10:21.6671215Z Autotune Choices Stats: 2025-12-04T12:10:21.6671582Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.6671630Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6671671Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6671769Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6672004Z triton_mm_15 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6672244Z triton_mm_7 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6672471Z triton_mm_13 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6672710Z triton_mm_16 0.0072 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6672936Z triton_mm_11 0.0072 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6673161Z triton_mm_5 0.0073 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6673382Z triton_mm_18 0.0073 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6673609Z triton_mm_10 0.0073 ms 91.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6673846Z triton_mm_2 0.0074 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6674068Z triton_mm_8 0.0074 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6674199Z SingleProcess AUTOTUNE benchmarking takes 0.0906 seconds and 0.5370 seconds precompiling for 21 choices 2025-12-04T12:10:21.6674345Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6674404Z 
Traceback (most recent call last): 2025-12-04T12:10:21.6674560Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6674602Z method(*args, **kwargs) 2025-12-04T12:10:21.6674753Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6674794Z method(*args, **kwargs) 2025-12-04T12:10:21.6674946Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6674986Z with policy(): 2025-12-04T12:10:21.6675139Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6675182Z raise RuntimeError(msg) 2025-12-04T12:10:21.6675576Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1077936128 and is now 1136656384. 
2025-12-04T12:10:21.6675581Z 2025-12-04T12:10:21.6675653Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6675917Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6675919Z 2025-12-04T12:10:21.6676021Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6676096Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6676139Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6676195Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6676693Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6676793Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6676829Z graph_break [] 2025-12-04T12:10:21.6676893Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6676967Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6677448Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6677508Z current_size = base.storage().size() 2025-12-04T12:10:21.6677548Z Autotune Choices Stats: 2025-12-04T12:10:21.6677920Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.6677965Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6678007Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6678105Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6678349Z triton_mm_15 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6678579Z triton_mm_7 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6678804Z triton_mm_13 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6679029Z triton_mm_16 0.0072 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6679252Z triton_mm_11 0.0072 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6679479Z triton_mm_5 0.0073 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6679711Z triton_mm_18 0.0073 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6679937Z triton_mm_10 0.0073 ms 91.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6680203Z triton_mm_2 0.0074 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6680439Z triton_mm_8 0.0074 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6680569Z SingleProcess AUTOTUNE benchmarking takes 0.0906 seconds and 0.5370 seconds precompiling for 21 choices 2025-12-04T12:10:21.6680643Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6680685Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6680741Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6680839Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6681325Z inductor [('triton_bundler_save_kernel', 168), 
('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6681377Z graph_break [] 2025-12-04T12:10:21.6681440Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6681513Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6681554Z Autotune Choices Stats: 2025-12-04T12:10:21.6681917Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_25", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519000045955181, "best_triton_pos": 0} 2025-12-04T12:10:21.6681976Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6682017Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6682116Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6682349Z triton_mm_25 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6682575Z triton_mm_34 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6682798Z triton_mm_37 0.0072 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6683023Z triton_mm_38 0.0072 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6683246Z triton_mm_28 0.0073 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6683481Z triton_mm_27 0.0073 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6683708Z triton_mm_35 0.0073 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6683947Z triton_mm_32 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6684172Z triton_mm_33 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6684394Z triton_mm_29 0.0074 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6684521Z SingleProcess AUTOTUNE benchmarking takes 0.1266 seconds and 0.2924 seconds precompiling for 21 choices 
2025-12-04T12:10:21.6684576Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6684725Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6684772Z Traceback (most recent call last): 2025-12-04T12:10:21.6684939Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6684981Z method(*args, **kwargs) 2025-12-04T12:10:21.6685132Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6685172Z method(*args, **kwargs) 2025-12-04T12:10:21.6685322Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6685361Z with policy(): 2025-12-04T12:10:21.6685511Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6685553Z raise RuntimeError(msg) 2025-12-04T12:10:21.6685955Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 
2025-12-04T12:10:21.6685959Z 2025-12-04T12:10:21.6686033Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6686297Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6686299Z 2025-12-04T12:10:21.6686386Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6686459Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6686502Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6686560Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6687042Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6687141Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6687187Z graph_break [] 2025-12-04T12:10:21.6687249Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6687323Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6687804Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6687863Z current_size = base.storage().size() 2025-12-04T12:10:21.6687904Z Autotune Choices Stats: 2025-12-04T12:10:21.6688271Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.6688316Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6688358Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6688456Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6688692Z triton_mm_15 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6688929Z triton_mm_7 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6689153Z triton_mm_13 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6689378Z triton_mm_16 0.0072 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6689616Z triton_mm_11 0.0072 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6689843Z triton_mm_5 0.0073 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6690067Z triton_mm_18 0.0073 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6690328Z triton_mm_10 0.0073 ms 91.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6690557Z triton_mm_2 0.0074 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6690779Z triton_mm_8 0.0074 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6690922Z SingleProcess AUTOTUNE benchmarking takes 0.0906 seconds and 0.5370 seconds precompiling for 21 choices 2025-12-04T12:10:21.6690996Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6691039Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6691095Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6691194Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6691691Z inductor [('triton_bundler_save_kernel', 168), 
('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6691729Z graph_break [] 2025-12-04T12:10:21.6691792Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6691864Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6691905Z Autotune Choices Stats: 2025-12-04T12:10:21.6692265Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_25", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519000045955181, "best_triton_pos": 0} 2025-12-04T12:10:21.6692313Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6692367Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6692466Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6692698Z triton_mm_25 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6692924Z triton_mm_34 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6693159Z triton_mm_37 0.0072 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6693382Z triton_mm_38 0.0072 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6694926Z triton_mm_28 0.0073 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6695156Z triton_mm_27 0.0073 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6695383Z triton_mm_35 0.0073 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6695610Z triton_mm_32 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6695840Z triton_mm_33 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6696077Z triton_mm_29 0.0074 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6696204Z SingleProcess AUTOTUNE benchmarking takes 0.1266 seconds and 0.2924 seconds precompiling for 21 choices 
2025-12-04T12:10:21.6696292Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6696334Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6696394Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6696492Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6696975Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6697012Z graph_break [] 2025-12-04T12:10:21.6697076Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6697150Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6697194Z Autotune Choices Stats: 2025-12-04T12:10:21.6697557Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_48", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.6697614Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6697656Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6697755Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6697989Z triton_mm_48 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.6698226Z triton_mm_47 0.0065 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6698455Z triton_mm_55 0.0065 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6698679Z triton_mm_52 0.0066 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6698902Z triton_mm_58 0.0066 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6699126Z triton_mm_46 0.0067 ms 96.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6699348Z triton_mm_50 0.0068 ms 95.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6699581Z triton_mm_57 0.0074 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6699806Z triton_mm_49 0.0074 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6700033Z triton_mm_53 0.0074 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6700291Z SingleProcess AUTOTUNE benchmarking takes 0.1502 seconds and 0.2676 seconds precompiling for 21 choices 2025-12-04T12:10:21.6700481Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ed21fb057c6c24d9.xml - 2025-12-04T12:10:21.6700541Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6701137Z FAILED [0.9510s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 2025-12-04T12:10:21.6701141Z 2025-12-04T12:10:21.6701219Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6701500Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6701503Z 2025-12-04T12:10:21.6701590Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6701654Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6701722Z ================== 1 failed, 187 deselected, 2 rerun in 4.69s ================== 2025-12-04T12:10:21.6701760Z Got exit code 1 2025-12-04T12:10:21.6701799Z Retrying single test... 2025-12-04T12:10:21.6701957Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c4d418b9246bd6e0.xml 2025-12-04T12:10:21.6702015Z ============================= test session starts ============================== 2025-12-04T12:10:21.6702128Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6702170Z cachedir: .pytest_cache 2025-12-04T12:10:21.6702330Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6702376Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6702419Z configfile: pytest.ini 2025-12-04T12:10:21.6702584Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6702659Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6702916Z stepcurrent: skipping 129 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6702962Z Running 1 items in this shard 2025-12-04T12:10:21.6702964Z 2025-12-04T12:10:21.6703182Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.6956s] [100%] 2025-12-04T12:10:21.6703396Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0367s] [100%] 2025-12-04T12:10:21.6703603Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.8484s] [100%] 2025-12-04T12:10:21.6703606Z 2025-12-04T12:10:21.6703657Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6703805Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6703866Z Traceback (most recent call last): 2025-12-04T12:10:21.6704024Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6704068Z method(*args, **kwargs) 2025-12-04T12:10:21.6704221Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6704261Z method(*args, **kwargs) 2025-12-04T12:10:21.6704413Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6704450Z with policy(): 2025-12-04T12:10:21.6704601Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6704642Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6705034Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1077936128. 2025-12-04T12:10:21.6705055Z 2025-12-04T12:10:21.6705130Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6705391Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6705394Z 2025-12-04T12:10:21.6705481Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6705554Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6705597Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6705653Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6706151Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6706250Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6706288Z graph_break [] 2025-12-04T12:10:21.6706351Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6706425Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6706910Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6706959Z current_size = base.storage().size() 2025-12-04T12:10:21.6707000Z Autotune Choices Stats: 2025-12-04T12:10:21.6707376Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00675999978557229, "best_triton_pos": 0} 2025-12-04T12:10:21.6707423Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6707463Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6707563Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6707796Z triton_mm_6 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6708036Z triton_mm_11 0.0068 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6708261Z triton_mm_13 0.0072 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6708488Z triton_mm_16 0.0076 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6708714Z triton_mm_10 0.0079 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6708947Z triton_mm_9 0.0079 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6709173Z triton_mm_15 0.0080 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6709398Z triton_mm_12 0.0080 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6709631Z triton_mm_18 0.0080 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6709858Z triton_mm_5 0.0080 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6709986Z SingleProcess AUTOTUNE benchmarking takes 0.1005 seconds and 0.4247 seconds precompiling for 21 choices 2025-12-04T12:10:21.6710160Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6710206Z 
Traceback (most recent call last): 2025-12-04T12:10:21.6710361Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6710401Z method(*args, **kwargs) 2025-12-04T12:10:21.6710554Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6710594Z method(*args, **kwargs) 2025-12-04T12:10:21.6710749Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6710786Z with policy(): 2025-12-04T12:10:21.6710939Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6711061Z raise RuntimeError(msg) 2025-12-04T12:10:21.6711469Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1077936128 and is now 1136656384. 
2025-12-04T12:10:21.6711472Z 2025-12-04T12:10:21.6711548Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6711824Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6711829Z 2025-12-04T12:10:21.6711915Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6711989Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6712033Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6712090Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6712575Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6712673Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6712725Z graph_break [] 2025-12-04T12:10:21.6712787Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6712859Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6713410Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6713457Z current_size = base.storage().size() 2025-12-04T12:10:21.6713499Z Autotune Choices Stats: 2025-12-04T12:10:21.6713876Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00675999978557229, "best_triton_pos": 0} 2025-12-04T12:10:21.6713926Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6713966Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6714066Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6714296Z triton_mm_6 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6714522Z triton_mm_11 0.0068 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6714746Z triton_mm_13 0.0072 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6714971Z triton_mm_16 0.0076 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6715205Z triton_mm_10 0.0079 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6715428Z triton_mm_9 0.0079 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6715662Z triton_mm_15 0.0080 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6715887Z triton_mm_12 0.0080 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6716108Z triton_mm_18 0.0080 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6716332Z triton_mm_5 0.0080 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6716462Z SingleProcess AUTOTUNE benchmarking takes 0.1005 seconds and 0.4247 seconds precompiling for 21 choices 2025-12-04T12:10:21.6716548Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6716589Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6716646Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6716744Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6717232Z inductor [('triton_bundler_save_kernel', 168), 
('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6717279Z graph_break [] 2025-12-04T12:10:21.6717341Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6717414Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6717456Z Autotune Choices Stats: 2025-12-04T12:10:21.6717815Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_28", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006599999964237213, "best_triton_pos": 0} 2025-12-04T12:10:21.6717860Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6717901Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6717999Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6718231Z triton_mm_28 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6718457Z triton_mm_33 0.0067 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6718691Z triton_mm_26 0.0070 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6718914Z triton_mm_38 0.0076 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6719141Z triton_mm_25 0.0076 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6719381Z triton_mm_34 0.0078 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6719607Z triton_mm_27 0.0078 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6719830Z triton_mm_30 0.0078 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6720054Z triton_mm_31 0.0079 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6720333Z triton_mm_35 0.0080 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6720461Z SingleProcess AUTOTUNE benchmarking takes 0.1360 seconds and 0.2883 seconds precompiling for 21 choices 
2025-12-04T12:10:21.6720514Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6720662Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6720709Z Traceback (most recent call last): 2025-12-04T12:10:21.6720865Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6720922Z method(*args, **kwargs) 2025-12-04T12:10:21.6721075Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6721117Z method(*args, **kwargs) 2025-12-04T12:10:21.6721268Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6721305Z with policy(): 2025-12-04T12:10:21.6721459Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6721500Z raise RuntimeError(msg) 2025-12-04T12:10:21.6721896Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 
2025-12-04T12:10:21.6721900Z 2025-12-04T12:10:21.6721974Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6722235Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6722238Z 2025-12-04T12:10:21.6722325Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6722396Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6722451Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6722507Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6722994Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6723107Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6723144Z graph_break [] 2025-12-04T12:10:21.6723206Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6723279Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6723769Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6723816Z current_size = base.storage().size() 2025-12-04T12:10:21.6723860Z Autotune Choices Stats: 2025-12-04T12:10:21.6724224Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00675999978557229, "best_triton_pos": 0} 2025-12-04T12:10:21.6724283Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6724324Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6724424Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6724654Z triton_mm_6 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6724891Z triton_mm_11 0.0068 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6725117Z triton_mm_13 0.0072 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6725341Z triton_mm_16 0.0076 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6725564Z triton_mm_10 0.0079 ms 85.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6725788Z triton_mm_9 0.0079 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6726013Z triton_mm_15 0.0080 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6726249Z triton_mm_12 0.0080 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6726472Z triton_mm_18 0.0080 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6726697Z triton_mm_5 0.0080 ms 84.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6726835Z SingleProcess AUTOTUNE benchmarking takes 0.1005 seconds and 0.4247 seconds precompiling for 21 choices 2025-12-04T12:10:21.6726908Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6726950Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6727006Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6727104Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6727589Z inductor [('triton_bundler_save_kernel', 168), 
('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6727627Z graph_break [] 2025-12-04T12:10:21.6727699Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6727771Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6727812Z Autotune Choices Stats: 2025-12-04T12:10:21.6728172Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_28", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006599999964237213, "best_triton_pos": 0} 2025-12-04T12:10:21.6728218Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6728259Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6728356Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6728597Z triton_mm_28 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6728819Z triton_mm_33 0.0067 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6729044Z triton_mm_26 0.0070 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6729266Z triton_mm_38 0.0076 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6729492Z triton_mm_25 0.0076 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6729717Z triton_mm_34 0.0078 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6729950Z triton_mm_27 0.0078 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6730208Z triton_mm_30 0.0078 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6730444Z triton_mm_31 0.0079 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6730667Z triton_mm_35 0.0080 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6730796Z SingleProcess AUTOTUNE benchmarking takes 0.1360 seconds and 0.2883 seconds precompiling for 21 choices 
2025-12-04T12:10:21.6730867Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6730910Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6730966Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6731065Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6731546Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6731600Z graph_break [] 2025-12-04T12:10:21.6731663Z aten_mm_info [('aten._scaled_mm.default_1024_2048_32', 1)] 2025-12-04T12:10:21.6731736Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6731777Z Autotune Choices Stats: 2025-12-04T12:10:21.6732146Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_58", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006560000125318766, "best_triton_pos": 0} 2025-12-04T12:10:21.6732194Z AUTOTUNE scaled_mm(1024x32, 32x2048, , ) 2025-12-04T12:10:21.6732236Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6732334Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6732563Z triton_mm_58 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.6732789Z triton_mm_56 0.0066 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6733017Z triton_mm_45 0.0067 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6733240Z triton_mm_50 0.0067 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6733466Z triton_mm_51 0.0067 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6733710Z triton_mm_53 0.0071 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6733934Z triton_mm_57 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6734167Z triton_mm_54 0.0074 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6734392Z triton_mm_52 0.0076 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6734619Z triton_mm_47 0.0076 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6734744Z SingleProcess AUTOTUNE benchmarking takes 0.1550 seconds and 0.2216 seconds precompiling for 21 choices 2025-12-04T12:10:21.6734935Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c4d418b9246bd6e0.xml - 2025-12-04T12:10:21.6735006Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6735597Z FAILED [0.8484s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 2025-12-04T12:10:21.6735600Z 2025-12-04T12:10:21.6735674Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6735951Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6735954Z 2025-12-04T12:10:21.6736042Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6736104Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6736172Z ================== 1 failed, 187 deselected, 2 rerun in 4.60s ================== 2025-12-04T12:10:21.6736209Z Got exit code 1 2025-12-04T12:10:21.6736419Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6736545Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6736690Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-eb1aff5bb1e10fb1.xml 2025-12-04T12:10:21.6736748Z ============================= test session starts ============================== 2025-12-04T12:10:21.6736860Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6736902Z cachedir: .pytest_cache 2025-12-04T12:10:21.6737061Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6737106Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6737148Z configfile: pytest.ini 2025-12-04T12:10:21.6737321Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6737398Z collecting ... collected 188 items / 130 deselected / 58 selected 2025-12-04T12:10:21.6737452Z stepcurrent: skipping 130 already run items. 
2025-12-04T12:10:21.6737497Z Running 58 items in this shard 2025-12-04T12:10:21.6737499Z 2025-12-04T12:10:21.6737718Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0289s] [ 1%] 2025-12-04T12:10:21.6737942Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.5449s] [ 1%] 2025-12-04T12:10:21.6738132Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.7693s] [ 1%] 2025-12-04T12:10:21.6738134Z 2025-12-04T12:10:21.6738186Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6738329Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6738375Z Traceback (most recent call last): 2025-12-04T12:10:21.6738533Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6738576Z method(*args, **kwargs) 2025-12-04T12:10:21.6738729Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6738780Z method(*args, **kwargs) 2025-12-04T12:10:21.6738930Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6738967Z with policy(): 2025-12-04T12:10:21.6739119Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6739161Z raise RuntimeError(msg) 2025-12-04T12:10:21.6739548Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1025507328. 2025-12-04T12:10:21.6739561Z 2025-12-04T12:10:21.6739636Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6739895Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6739898Z 2025-12-04T12:10:21.6739985Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6740059Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6740143Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6740199Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6740682Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6740781Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6740818Z graph_break [] 2025-12-04T12:10:21.6740880Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6740952Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6741445Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6741493Z current_size = base.storage().size() 2025-12-04T12:10:21.6741548Z Autotune Choices Stats: 2025-12-04T12:10:21.6741914Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005799999926239252, "best_triton_pos": 0} 2025-12-04T12:10:21.6741962Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6742004Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6742104Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6742340Z triton_mm_3 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6742571Z triton_mm_1 0.0059 ms 98.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6742811Z triton_mm_2 0.0059 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6743037Z triton_mm_0 0.0074 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:21.6743080Z _scaled_mm 0.0210 ms 27.6% 2025-12-04T12:10:21.6743207Z SingleProcess AUTOTUNE benchmarking takes 0.0256 seconds and 0.1361 seconds precompiling for 5 choices 2025-12-04T12:10:21.6743351Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6743410Z Traceback (most recent call last): 2025-12-04T12:10:21.6743566Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6743607Z method(*args, **kwargs) 2025-12-04T12:10:21.6743760Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6743799Z method(*args, **kwargs) 2025-12-04T12:10:21.6743950Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6743988Z with policy(): 2025-12-04T12:10:21.6744140Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6744181Z raise RuntimeError(msg) 2025-12-04T12:10:21.6744575Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1025507328 and is now 1050673152. 
2025-12-04T12:10:21.6744579Z 2025-12-04T12:10:21.6744653Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6744913Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6744915Z 2025-12-04T12:10:21.6745011Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6745083Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6745126Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6745182Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6745664Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6745777Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6745813Z graph_break [] 2025-12-04T12:10:21.6745876Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6745949Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6746431Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6746478Z current_size = base.storage().size() 2025-12-04T12:10:21.6746530Z Autotune Choices Stats: 2025-12-04T12:10:21.6746897Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005799999926239252, "best_triton_pos": 0} 2025-12-04T12:10:21.6746945Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6746987Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6747086Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6747337Z triton_mm_3 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6747567Z triton_mm_1 0.0059 ms 98.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6747791Z triton_mm_2 0.0059 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6748013Z triton_mm_0 0.0074 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6748056Z _scaled_mm 0.0210 ms 27.6% 2025-12-04T12:10:21.6748184Z SingleProcess AUTOTUNE benchmarking takes 0.0256 
seconds and 0.1361 seconds precompiling for 5 choices 2025-12-04T12:10:21.6748258Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6748299Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6748357Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6748457Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6748951Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6748989Z graph_break [] 2025-12-04T12:10:21.6749049Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6749123Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6749173Z Autotune Choices Stats: 2025-12-04T12:10:21.6749535Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:21.6749581Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6749622Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6749722Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6749953Z triton_mm_7 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6750218Z triton_mm_5 0.0084 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6750453Z triton_mm_6 0.0095 ms 67.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6750676Z triton_mm_4 0.0110 ms 57.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6750716Z _scaled_mm 0.0228 ms 27.9% 2025-12-04T12:10:21.6750844Z SingleProcess AUTOTUNE benchmarking takes 0.0268 seconds and 0.1049 seconds precompiling for 5 choices 2025-12-04T12:10:21.6750896Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6751051Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6751099Z Traceback (most recent call last): 2025-12-04T12:10:21.6751255Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6751296Z method(*args, **kwargs) 2025-12-04T12:10:21.6751447Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6751488Z method(*args, **kwargs) 2025-12-04T12:10:21.6751640Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6751676Z with policy(): 2025-12-04T12:10:21.6751829Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6751871Z raise RuntimeError(msg) 2025-12-04T12:10:21.6752259Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.6752262Z 2025-12-04T12:10:21.6752336Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6752609Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6752611Z 2025-12-04T12:10:21.6752698Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6752771Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6752828Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6752883Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6753364Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6753464Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6753501Z graph_break [] 2025-12-04T12:10:21.6753560Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6753633Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6754115Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6754172Z current_size = base.storage().size() 2025-12-04T12:10:21.6754213Z Autotune Choices Stats: 2025-12-04T12:10:21.6754576Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005799999926239252, "best_triton_pos": 0} 2025-12-04T12:10:21.6754621Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6754662Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6754763Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6755007Z triton_mm_3 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6755237Z triton_mm_1 0.0059 ms 98.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6755464Z triton_mm_2 0.0059 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:21.6755686Z triton_mm_0 0.0074 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6755731Z _scaled_mm 0.0210 ms 27.6% 2025-12-04T12:10:21.6755857Z SingleProcess AUTOTUNE benchmarking takes 0.0256 seconds and 0.1361 seconds precompiling for 5 choices 2025-12-04T12:10:21.6755930Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6755972Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6756028Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6756126Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6756613Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6756660Z graph_break [] 2025-12-04T12:10:21.6756721Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6756793Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6756836Z Autotune Choices Stats: 2025-12-04T12:10:21.6757197Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:21.6757241Z AUTOTUNE scaled_mm(1x1024, 
1024x16, , ) 2025-12-04T12:10:21.6757282Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6757381Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6757613Z triton_mm_7 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6757849Z triton_mm_5 0.0084 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6758072Z triton_mm_6 0.0095 ms 67.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6758294Z triton_mm_4 0.0110 ms 57.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6758336Z _scaled_mm 0.0228 ms 27.9% 2025-12-04T12:10:21.6758475Z SingleProcess AUTOTUNE benchmarking takes 0.0268 seconds and 0.1049 seconds precompiling for 5 choices 2025-12-04T12:10:21.6758548Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6758591Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6758646Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6758745Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6759220Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('async_compile_cache_miss', 4), 
('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6759257Z graph_break [] 2025-12-04T12:10:21.6759316Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6759391Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6759430Z Autotune Choices Stats: 2025-12-04T12:10:21.6759791Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:21.6759836Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6759888Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6759986Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6760260Z triton_mm_11 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6760502Z triton_mm_9 0.0064 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6760727Z triton_mm_10 0.0071 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6760950Z triton_mm_8 0.0077 ms 77.1% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6760990Z _scaled_mm 0.0198 ms 29.9% 2025-12-04T12:10:21.6761117Z SingleProcess AUTOTUNE benchmarking takes 0.0303 seconds and 0.1944 seconds precompiling for 5 choices 2025-12-04T12:10:21.6761308Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-eb1aff5bb1e10fb1.xml - 2025-12-04T12:10:21.6761369Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6761972Z FAILED [0.7693s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.6761975Z 2025-12-04T12:10:21.6762047Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6762319Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6762322Z 2025-12-04T12:10:21.6762408Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6762473Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6762540Z ================== 1 failed, 130 deselected, 2 rerun in 3.36s ================== 2025-12-04T12:10:21.6762578Z Got exit code 1 2025-12-04T12:10:21.6762618Z Retrying single test... 
2025-12-04T12:10:21.6762762Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-672ab55f446aee9c.xml 2025-12-04T12:10:21.6762819Z ============================= test session starts ============================== 2025-12-04T12:10:21.6762931Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6762972Z cachedir: .pytest_cache 2025-12-04T12:10:21.6763132Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6763178Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6763219Z configfile: pytest.ini 2025-12-04T12:10:21.6763383Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6763458Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6763726Z stepcurrent: skipping 130 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6763769Z Running 1 items in this shard 2025-12-04T12:10:21.6763771Z 2025-12-04T12:10:21.6763985Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.2187s] [100%] 2025-12-04T12:10:21.6764208Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8313s] [100%] 2025-12-04T12:10:21.6764398Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.7481s] [100%] 2025-12-04T12:10:21.6764400Z 2025-12-04T12:10:21.6764451Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6764595Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6764641Z Traceback (most recent call last): 2025-12-04T12:10:21.6764799Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6764840Z method(*args, **kwargs) 2025-12-04T12:10:21.6764993Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6765035Z method(*args, **kwargs) 2025-12-04T12:10:21.6765184Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6765233Z with policy(): 2025-12-04T12:10:21.6765384Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6765428Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6765815Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1025507328. 2025-12-04T12:10:21.6765818Z 2025-12-04T12:10:21.6765891Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6766163Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6766166Z 2025-12-04T12:10:21.6766254Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6766326Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6766369Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6766426Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6766905Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6767006Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6767044Z graph_break [] 2025-12-04T12:10:21.6767104Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6767177Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6767670Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6767718Z current_size = base.storage().size() 2025-12-04T12:10:21.6767759Z Autotune Choices Stats: 2025-12-04T12:10:21.6768125Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:21.6768181Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6768223Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6768322Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6768557Z triton_mm_1 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6768784Z triton_mm_3 0.0080 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6769007Z triton_mm_0 0.0094 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6769238Z triton_mm_2 0.0095 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6769280Z _scaled_mm 0.0240 ms 28.7% 2025-12-04T12:10:21.6769408Z SingleProcess AUTOTUNE benchmarking takes 0.0330 seconds and 0.1721 seconds precompiling for 5 choices 2025-12-04T12:10:21.6769551Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6769597Z Traceback (most recent call last): 2025-12-04T12:10:21.6769761Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6769803Z method(*args, **kwargs) 2025-12-04T12:10:21.6769954Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6769996Z method(*args, **kwargs) 2025-12-04T12:10:21.6770174Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6770211Z with policy(): 2025-12-04T12:10:21.6770366Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6770409Z raise RuntimeError(msg) 2025-12-04T12:10:21.6770797Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1025507328 and is now 1050673152. 
2025-12-04T12:10:21.6770804Z 2025-12-04T12:10:21.6770878Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6771137Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6771140Z 2025-12-04T12:10:21.6771227Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6771318Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6771361Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6771419Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6771897Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6772009Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6772045Z graph_break [] 2025-12-04T12:10:21.6772106Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6772177Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6772660Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6772708Z current_size = base.storage().size() 2025-12-04T12:10:21.6772750Z Autotune Choices Stats: 2025-12-04T12:10:21.6773112Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:21.6773170Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6773212Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6773311Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6773543Z triton_mm_1 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6773783Z triton_mm_3 0.0080 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6774008Z triton_mm_0 0.0094 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6774230Z triton_mm_2 0.0095 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6774272Z _scaled_mm 0.0240 ms 28.7% 2025-12-04T12:10:21.6774399Z SingleProcess AUTOTUNE benchmarking takes 0.0330 seconds 
and 0.1721 seconds precompiling for 5 choices 2025-12-04T12:10:21.6774472Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6774517Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6774573Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6774674Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6775161Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6775199Z graph_break [] 2025-12-04T12:10:21.6775259Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6775331Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6775371Z Autotune Choices Stats: 2025-12-04T12:10:21.6775746Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005799999926239252, "best_triton_pos": 0} 2025-12-04T12:10:21.6775792Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6775833Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6775930Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6776161Z triton_mm_5 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6776384Z triton_mm_6 0.0059 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6776610Z triton_mm_7 0.0068 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6776846Z triton_mm_4 0.0076 ms 76.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6776887Z _scaled_mm 0.0203 ms 28.5% 2025-12-04T12:10:21.6777015Z SingleProcess AUTOTUNE benchmarking takes 0.0252 seconds and 0.1108 seconds precompiling for 5 choices 2025-12-04T12:10:21.6777069Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6777212Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6777269Z Traceback (most recent call last): 2025-12-04T12:10:21.6777426Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6777468Z method(*args, **kwargs) 2025-12-04T12:10:21.6777620Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6777659Z method(*args, **kwargs) 2025-12-04T12:10:21.6777810Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6777847Z with policy(): 2025-12-04T12:10:21.6777998Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6778039Z raise RuntimeError(msg) 2025-12-04T12:10:21.6778428Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.6778432Z 2025-12-04T12:10:21.6778505Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6778761Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6778763Z 2025-12-04T12:10:21.6778863Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6778936Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6778978Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6779034Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6779526Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6779625Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6779661Z graph_break [] 2025-12-04T12:10:21.6779722Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6779794Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6780315Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6780376Z current_size = base.storage().size() 2025-12-04T12:10:21.6780417Z Autotune Choices Stats: 2025-12-04T12:10:21.6780780Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:21.6780826Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6780866Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6780965Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6781212Z triton_mm_1 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6781439Z triton_mm_3 0.0080 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6781663Z triton_mm_0 0.0094 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:21.6781885Z triton_mm_2 0.0095 ms 72.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6781927Z _scaled_mm 0.0240 ms 28.7% 2025-12-04T12:10:21.6782054Z SingleProcess AUTOTUNE benchmarking takes 0.0330 seconds and 0.1721 seconds precompiling for 5 choices 2025-12-04T12:10:21.6782128Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6782171Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6782229Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6782326Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6782815Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6782855Z graph_break [] 2025-12-04T12:10:21.6782915Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6783001Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6783041Z Autotune Choices Stats: 2025-12-04T12:10:21.6783402Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005799999926239252, "best_triton_pos": 0} 2025-12-04T12:10:21.6783446Z AUTOTUNE scaled_mm(1x1024, 
1024x16, , ) 2025-12-04T12:10:21.6783489Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6783586Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6783816Z triton_mm_5 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6784040Z triton_mm_6 0.0059 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6784273Z triton_mm_7 0.0068 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6784495Z triton_mm_4 0.0076 ms 76.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6784535Z _scaled_mm 0.0203 ms 28.5% 2025-12-04T12:10:21.6784663Z SingleProcess AUTOTUNE benchmarking takes 0.0252 seconds and 0.1108 seconds precompiling for 5 choices 2025-12-04T12:10:21.6784745Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6784790Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6784845Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6784944Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6785419Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), 
('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6785460Z graph_break [] 2025-12-04T12:10:21.6785520Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6785594Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6785635Z Autotune Choices Stats: 2025-12-04T12:10:21.6785993Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_10", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.6786039Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6786080Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6786188Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6786417Z triton_mm_10 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6786646Z triton_mm_11 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6786695Z _scaled_mm 0.0067 ms 89.3% 2025-12-04T12:10:21.6786918Z triton_mm_8 0.0093 ms 64.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 
2025-12-04T12:10:21.6787144Z triton_mm_9 0.0098 ms 61.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6787271Z SingleProcess AUTOTUNE benchmarking takes 0.0439 seconds and 0.2228 seconds precompiling for 5 choices 2025-12-04T12:10:21.6787458Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-672ab55f446aee9c.xml - 2025-12-04T12:10:21.6787519Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6788100Z FAILED [0.7481s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.6788113Z 2025-12-04T12:10:21.6788186Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6788445Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6788447Z 2025-12-04T12:10:21.6788547Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6788609Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6788681Z ================== 1 failed, 187 deselected, 2 rerun in 3.82s ================== 2025-12-04T12:10:21.6788718Z Got exit code 1 2025-12-04T12:10:21.6788760Z Retrying single test... 
2025-12-04T12:10:21.6788903Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e19a76c714cb03be.xml 2025-12-04T12:10:21.6788960Z ============================= test session starts ============================== 2025-12-04T12:10:21.6789071Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6789112Z cachedir: .pytest_cache 2025-12-04T12:10:21.6789270Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6789319Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6789359Z configfile: pytest.ini 2025-12-04T12:10:21.6789522Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6789598Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6789853Z stepcurrent: skipping 130 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6789896Z Running 1 items in this shard 2025-12-04T12:10:21.6789913Z 2025-12-04T12:10:21.6790168Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.1447s] [100%] 2025-12-04T12:10:21.6790379Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7485s] [100%] 2025-12-04T12:10:21.6790581Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.7126s] [100%] 2025-12-04T12:10:21.6790585Z 2025-12-04T12:10:21.6790636Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6790777Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6790825Z Traceback (most recent call last): 2025-12-04T12:10:21.6790983Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6791025Z method(*args, **kwargs) 2025-12-04T12:10:21.6791176Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6791216Z method(*args, **kwargs) 2025-12-04T12:10:21.6791367Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6791405Z with policy(): 2025-12-04T12:10:21.6791572Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6791614Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6792005Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1025507328. 2025-12-04T12:10:21.6792008Z 2025-12-04T12:10:21.6792081Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6792351Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6792354Z 2025-12-04T12:10:21.6792441Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6792515Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6792559Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6792616Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6793095Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6793194Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6793232Z graph_break [] 2025-12-04T12:10:21.6793293Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6793365Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6793864Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6793912Z current_size = base.storage().size() 2025-12-04T12:10:21.6793952Z Autotune Choices Stats: 2025-12-04T12:10:21.6794320Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006680000107735395, "best_triton_pos": 0} 2025-12-04T12:10:21.6794376Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6794418Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6794518Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6794754Z triton_mm_3 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6794980Z triton_mm_2 0.0070 ms 95.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6795206Z triton_mm_1 0.0072 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6795430Z triton_mm_0 0.0075 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6795481Z _scaled_mm 0.0226 ms 29.5% 2025-12-04T12:10:21.6795609Z SingleProcess AUTOTUNE benchmarking takes 0.0268 seconds and 0.1733 seconds precompiling for 5 choices 2025-12-04T12:10:21.6795752Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6795798Z Traceback (most recent call last): 2025-12-04T12:10:21.6795953Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6795994Z method(*args, **kwargs) 2025-12-04T12:10:21.6796157Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6796198Z method(*args, **kwargs) 2025-12-04T12:10:21.6796350Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6796388Z with policy(): 2025-12-04T12:10:21.6796540Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6796584Z raise RuntimeError(msg) 2025-12-04T12:10:21.6796971Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1025507328 and is now 1050673152. 
2025-12-04T12:10:21.6796974Z 2025-12-04T12:10:21.6797048Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6797307Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6797310Z 2025-12-04T12:10:21.6797395Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6797468Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6797510Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6797578Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6798057Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6798168Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6798206Z graph_break [] 2025-12-04T12:10:21.6798268Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6798339Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6798821Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6798868Z current_size = base.storage().size() 2025-12-04T12:10:21.6798909Z Autotune Choices Stats: 2025-12-04T12:10:21.6799276Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006680000107735395, "best_triton_pos": 0} 2025-12-04T12:10:21.6799334Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6799376Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6799473Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6799707Z triton_mm_3 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6799940Z triton_mm_2 0.0070 ms 95.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6800203Z triton_mm_1 0.0072 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6800430Z triton_mm_0 0.0075 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6800470Z _scaled_mm 0.0226 ms 29.5% 2025-12-04T12:10:21.6800599Z SingleProcess AUTOTUNE benchmarking takes 0.0268 
seconds and 0.1733 seconds precompiling for 5 choices 2025-12-04T12:10:21.6800670Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6800713Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6800769Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6800869Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6801345Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6801384Z graph_break [] 2025-12-04T12:10:21.6801458Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6801534Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6801574Z Autotune Choices Stats: 2025-12-04T12:10:21.6801935Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005919000133872032, "best_triton_pos": 0} 2025-12-04T12:10:21.6801994Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6802036Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6802135Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6802363Z triton_mm_6 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6802592Z triton_mm_5 0.0064 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6802816Z triton_mm_7 0.0067 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6803052Z triton_mm_4 0.0074 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6803095Z _scaled_mm 0.0198 ms 29.9% 2025-12-04T12:10:21.6803221Z SingleProcess AUTOTUNE benchmarking takes 0.0281 seconds and 0.1654 seconds precompiling for 5 choices 2025-12-04T12:10:21.6803276Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6803417Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6803463Z Traceback (most recent call last): 2025-12-04T12:10:21.6803635Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6803679Z method(*args, **kwargs) 2025-12-04T12:10:21.6803831Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6803873Z method(*args, **kwargs) 2025-12-04T12:10:21.6804022Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6804060Z with policy(): 2025-12-04T12:10:21.6804212Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6804254Z raise RuntimeError(msg) 2025-12-04T12:10:21.6804642Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.6804645Z 2025-12-04T12:10:21.6804719Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6804978Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6804982Z 2025-12-04T12:10:21.6805068Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6805151Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6805193Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6805250Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6805728Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6805836Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6805872Z graph_break [] 2025-12-04T12:10:21.6805932Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6806004Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6806483Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6806531Z current_size = base.storage().size() 2025-12-04T12:10:21.6806572Z Autotune Choices Stats: 2025-12-04T12:10:21.6806938Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006680000107735395, "best_triton_pos": 0} 2025-12-04T12:10:21.6806994Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6807036Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6807134Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6807366Z triton_mm_3 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6807600Z triton_mm_2 0.0070 ms 95.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6807827Z triton_mm_1 0.0072 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:21.6808049Z triton_mm_0 0.0075 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6808091Z _scaled_mm 0.0226 ms 29.5% 2025-12-04T12:10:21.6808218Z SingleProcess AUTOTUNE benchmarking takes 0.0268 seconds and 0.1733 seconds precompiling for 5 choices 2025-12-04T12:10:21.6808290Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6808334Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6808390Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6808490Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6808981Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6809019Z graph_break [] 2025-12-04T12:10:21.6809079Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6809152Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6809193Z Autotune Choices Stats: 2025-12-04T12:10:21.6809560Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005919000133872032, "best_triton_pos": 0} 2025-12-04T12:10:21.6809605Z AUTOTUNE scaled_mm(1x1024, 
1024x16, , ) 2025-12-04T12:10:21.6809647Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6809745Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6809973Z triton_mm_6 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6810238Z triton_mm_5 0.0064 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6810462Z triton_mm_7 0.0067 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6810698Z triton_mm_4 0.0074 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6810740Z _scaled_mm 0.0198 ms 29.9% 2025-12-04T12:10:21.6810868Z SingleProcess AUTOTUNE benchmarking takes 0.0281 seconds and 0.1654 seconds precompiling for 5 choices 2025-12-04T12:10:21.6810939Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6810981Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6811051Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6811151Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6811627Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), 
('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6811664Z graph_break [] 2025-12-04T12:10:21.6811725Z aten_mm_info [('aten._scaled_mm.default_1_16_1024', 1)] 2025-12-04T12:10:21.6811797Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6811837Z Autotune Choices Stats: 2025-12-04T12:10:21.6812194Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:21.6812240Z AUTOTUNE scaled_mm(1x1024, 1024x16, , ) 2025-12-04T12:10:21.6812281Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6812452Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6812695Z triton_mm_9 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6812925Z triton_mm_11 0.0066 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6813163Z triton_mm_10 0.0079 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6813385Z triton_mm_8 0.0104 ms 60.6% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6813425Z _scaled_mm 0.0244 ms 25.7% 2025-12-04T12:10:21.6813552Z SingleProcess AUTOTUNE benchmarking takes 0.0364 seconds and 0.2026 seconds precompiling for 5 choices 2025-12-04T12:10:21.6813739Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e19a76c714cb03be.xml - 2025-12-04T12:10:21.6813799Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6814378Z FAILED [0.7126s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.6814394Z 2025-12-04T12:10:21.6814468Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6814727Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6814729Z 2025-12-04T12:10:21.6814817Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6814888Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6814958Z ================== 1 failed, 187 deselected, 2 rerun in 3.63s ================== 2025-12-04T12:10:21.6814995Z Got exit code 1 2025-12-04T12:10:21.6815203Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6815330Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6815473Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-392593f8ed872a9e.xml 2025-12-04T12:10:21.6815529Z ============================= test session starts ============================== 2025-12-04T12:10:21.6815640Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6815681Z cachedir: .pytest_cache 2025-12-04T12:10:21.6815841Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6815887Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6815929Z configfile: pytest.ini 2025-12-04T12:10:21.6816092Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6816169Z collecting ... collected 188 items / 131 deselected / 57 selected 2025-12-04T12:10:21.6816224Z stepcurrent: skipping 131 already run items. 
2025-12-04T12:10:21.6816268Z Running 57 items in this shard 2025-12-04T12:10:21.6816270Z 2025-12-04T12:10:21.6816500Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.6013s] [ 1%] 2025-12-04T12:10:21.6816715Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0529s] [ 1%] 2025-12-04T12:10:21.6816927Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda FAILED [0.8765s] [ 1%] 2025-12-04T12:10:21.6816930Z 2025-12-04T12:10:21.6816980Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6817127Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6817173Z Traceback (most recent call last): 2025-12-04T12:10:21.6817332Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6817373Z method(*args, **kwargs) 2025-12-04T12:10:21.6817525Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6817565Z method(*args, **kwargs) 2025-12-04T12:10:21.6817717Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6817755Z with policy(): 2025-12-04T12:10:21.6817924Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6817966Z raise RuntimeError(msg) 2025-12-04T12:10:21.6818359Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1056964608. 2025-12-04T12:10:21.6818362Z 2025-12-04T12:10:21.6818435Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6818704Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6818707Z 2025-12-04T12:10:21.6818794Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6818867Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6818911Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6818967Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6819459Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6819558Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6819597Z graph_break [] 2025-12-04T12:10:21.6819665Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6819738Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6820266Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6820326Z current_size = base.storage().size() 2025-12-04T12:10:21.6820372Z Autotune Choices Stats: 2025-12-04T12:10:21.6820740Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006678999867290258, "best_triton_pos": 0} 2025-12-04T12:10:21.6820802Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6820844Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6820949Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6821183Z triton_mm_17 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6821412Z triton_mm_12 0.0071 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6821645Z triton_mm_16 0.0073 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6821873Z triton_mm_7 0.0073 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:21.6822115Z triton_mm_9 0.0078 ms 85.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6822340Z triton_mm_10 0.0079 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6822582Z triton_mm_6 0.0080 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6822806Z triton_mm_14 0.0081 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6823027Z triton_mm_5 0.0088 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6823257Z triton_mm_18 0.0098 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6823385Z SingleProcess AUTOTUNE benchmarking takes 0.0891 seconds and 0.4188 seconds precompiling for 20 choices 2025-12-04T12:10:21.6823533Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6823580Z Traceback (most recent call last): 2025-12-04T12:10:21.6823736Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6823778Z method(*args, **kwargs) 2025-12-04T12:10:21.6823929Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6823970Z method(*args, **kwargs) 2025-12-04T12:10:21.6824128Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6824167Z with policy(): 2025-12-04T12:10:21.6824320Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6824372Z raise RuntimeError(msg) 2025-12-04T12:10:21.6824761Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1056964608 and is now 1113587712. 
2025-12-04T12:10:21.6824765Z 2025-12-04T12:10:21.6824839Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6825099Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6825102Z 2025-12-04T12:10:21.6825188Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6825261Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6825305Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6825362Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6825849Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6825962Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6825999Z graph_break [] 2025-12-04T12:10:21.6826063Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6826136Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6826627Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6826676Z current_size = base.storage().size() 2025-12-04T12:10:21.6826717Z Autotune Choices Stats: 2025-12-04T12:10:21.6827085Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006678999867290258, "best_triton_pos": 0} 2025-12-04T12:10:21.6827132Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6827180Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6827278Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6827515Z triton_mm_17 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6827741Z triton_mm_12 0.0071 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6827978Z triton_mm_16 0.0073 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6828207Z triton_mm_7 0.0073 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6828444Z triton_mm_9 0.0078 ms 85.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, 
BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6828668Z triton_mm_10 0.0079 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6828894Z triton_mm_6 0.0080 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6829121Z triton_mm_14 0.0081 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6829343Z triton_mm_5 0.0088 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6829582Z triton_mm_18 0.0098 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6829711Z SingleProcess AUTOTUNE benchmarking takes 0.0891 seconds and 0.4188 seconds precompiling for 20 choices 2025-12-04T12:10:21.6829784Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6829827Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6829882Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6829992Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6830514Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6830552Z graph_break [] 2025-12-04T12:10:21.6830613Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6830687Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6830727Z Autotune Choices Stats: 2025-12-04T12:10:21.6831086Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007079999893903732, "best_triton_pos": 0} 2025-12-04T12:10:21.6831133Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6831177Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6831275Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6831504Z triton_mm_31 0.0071 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6831746Z triton_mm_26 0.0072 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6831971Z triton_mm_36 0.0072 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6832220Z triton_mm_28 0.0074 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6832446Z triton_mm_25 0.0075 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6832669Z triton_mm_29 0.0078 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6832892Z triton_mm_33 0.0079 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6833114Z triton_mm_24 0.0084 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6833355Z triton_mm_35 0.0089 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6833581Z triton_mm_37 0.0094 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6833710Z SingleProcess AUTOTUNE benchmarking takes 0.1347 seconds and 0.3093 seconds precompiling for 20 choices 
2025-12-04T12:10:21.6833776Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6833926Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6833973Z Traceback (most recent call last): 2025-12-04T12:10:21.6834131Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6834172Z method(*args, **kwargs) 2025-12-04T12:10:21.6834324Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6834365Z method(*args, **kwargs) 2025-12-04T12:10:21.6834514Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6834552Z with policy(): 2025-12-04T12:10:21.6834705Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6834750Z raise RuntimeError(msg) 2025-12-04T12:10:21.6835147Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 
2025-12-04T12:10:21.6835149Z 2025-12-04T12:10:21.6835225Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6835495Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6835497Z 2025-12-04T12:10:21.6835584Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6835667Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6835712Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6835768Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6836257Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6836355Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6836391Z graph_break [] 2025-12-04T12:10:21.6836455Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6836530Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6837016Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6837072Z current_size = base.storage().size() 2025-12-04T12:10:21.6837114Z Autotune Choices Stats: 2025-12-04T12:10:21.6837489Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006678999867290258, "best_triton_pos": 0} 2025-12-04T12:10:21.6837538Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6837590Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6837688Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6837921Z triton_mm_17 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6838147Z triton_mm_12 0.0071 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6838375Z triton_mm_16 0.0073 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6838601Z triton_mm_7 0.0073 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6838834Z triton_mm_9 0.0078 ms 85.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, 
BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6839068Z triton_mm_10 0.0079 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6839298Z triton_mm_6 0.0080 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6839521Z triton_mm_14 0.0081 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6839751Z triton_mm_5 0.0088 ms 75.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6839981Z triton_mm_18 0.0098 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6840139Z SingleProcess AUTOTUNE benchmarking takes 0.0891 seconds and 0.4188 seconds precompiling for 20 choices 2025-12-04T12:10:21.6840213Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6840257Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6840314Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6840416Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6840916Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6840954Z graph_break [] 2025-12-04T12:10:21.6841015Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6841088Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6841128Z Autotune Choices Stats: 2025-12-04T12:10:21.6841498Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007079999893903732, "best_triton_pos": 0} 2025-12-04T12:10:21.6841547Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6841589Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6841686Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6841919Z triton_mm_31 0.0071 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6842146Z triton_mm_26 0.0072 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6842373Z triton_mm_36 0.0072 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6842602Z triton_mm_28 0.0074 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6842842Z triton_mm_25 0.0075 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6843066Z triton_mm_29 0.0078 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6843302Z triton_mm_33 0.0079 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6843526Z triton_mm_24 0.0084 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6843753Z triton_mm_35 0.0089 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6843979Z triton_mm_37 0.0094 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6844110Z SingleProcess AUTOTUNE benchmarking takes 0.1347 seconds and 0.3093 seconds precompiling for 20 choices 
2025-12-04T12:10:21.6844182Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6844236Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6844292Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6844391Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6844875Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6844912Z graph_break [] 2025-12-04T12:10:21.6844985Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6845057Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6845099Z Autotune Choices Stats: 2025-12-04T12:10:21.6845459Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_50", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006639999803155661, "best_triton_pos": 0} 2025-12-04T12:10:21.6845506Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6845547Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6845646Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6845875Z triton_mm_50 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.6846102Z triton_mm_55 0.0067 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6846331Z triton_mm_45 0.0067 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6846575Z triton_mm_44 0.0070 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6846804Z triton_mm_54 0.0071 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6847036Z triton_mm_52 0.0078 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6847263Z triton_mm_47 0.0078 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6847485Z triton_mm_48 0.0084 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6847708Z triton_mm_43 0.0088 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6847935Z triton_mm_56 0.0097 ms 68.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6848072Z SingleProcess AUTOTUNE benchmarking takes 0.1587 seconds and 0.2718 seconds precompiling for 20 choices 2025-12-04T12:10:21.6848261Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-392593f8ed872a9e.xml - 2025-12-04T12:10:21.6848320Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6848921Z FAILED [0.8765s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 2025-12-04T12:10:21.6848926Z 2025-12-04T12:10:21.6849000Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6849259Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6849262Z 2025-12-04T12:10:21.6849349Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6849410Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6849478Z ================== 1 failed, 131 deselected, 2 rerun in 4.55s ================== 2025-12-04T12:10:21.6849514Z Got exit code 1 2025-12-04T12:10:21.6849558Z Retrying single test... 2025-12-04T12:10:21.6849702Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-08d2f754fc8a4f15.xml 2025-12-04T12:10:21.6849760Z ============================= test session starts ============================== 2025-12-04T12:10:21.6849871Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6849913Z cachedir: .pytest_cache 2025-12-04T12:10:21.6850071Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6850169Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6850211Z configfile: pytest.ini 2025-12-04T12:10:21.6850375Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6850451Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6850720Z stepcurrent: skipping 131 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6850766Z Running 1 items in this shard 2025-12-04T12:10:21.6850768Z 2025-12-04T12:10:21.6850988Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.7083s] [100%] 2025-12-04T12:10:21.6851205Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8877s] [100%] 2025-12-04T12:10:21.6851399Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda FAILED [0.7881s] [100%] 2025-12-04T12:10:21.6851401Z 2025-12-04T12:10:21.6851455Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6851601Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6851661Z Traceback (most recent call last): 2025-12-04T12:10:21.6851817Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6851859Z method(*args, **kwargs) 2025-12-04T12:10:21.6852010Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6852051Z method(*args, **kwargs) 2025-12-04T12:10:21.6852203Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6852241Z with policy(): 2025-12-04T12:10:21.6852392Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6852446Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6852837Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1056964608. 2025-12-04T12:10:21.6852841Z 2025-12-04T12:10:21.6852914Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6853176Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6853178Z 2025-12-04T12:10:21.6853264Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6853337Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6853381Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6853437Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6853921Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6854030Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6854068Z graph_break [] 2025-12-04T12:10:21.6854130Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6854204Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6854686Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6854748Z current_size = base.storage().size() 2025-12-04T12:10:21.6854788Z Autotune Choices Stats: 2025-12-04T12:10:21.6855156Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:21.6855208Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6855249Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6855350Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6855588Z triton_mm_7 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6855830Z triton_mm_16 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6856056Z triton_mm_17 0.0071 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6856294Z triton_mm_6 0.0072 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6856520Z triton_mm_12 0.0073 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6856745Z triton_mm_14 0.0077 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6856975Z triton_mm_9 0.0078 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6857199Z triton_mm_10 0.0083 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6857426Z triton_mm_5 0.0086 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6857656Z triton_mm_18 0.0093 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6857794Z SingleProcess AUTOTUNE benchmarking takes 0.1054 seconds and 0.5918 seconds precompiling for 20 choices 2025-12-04T12:10:21.6857943Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6857989Z 
Traceback (most recent call last): 2025-12-04T12:10:21.6858145Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6858197Z method(*args, **kwargs) 2025-12-04T12:10:21.6858350Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6858390Z method(*args, **kwargs) 2025-12-04T12:10:21.6858540Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6858577Z with policy(): 2025-12-04T12:10:21.6858729Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6858769Z raise RuntimeError(msg) 2025-12-04T12:10:21.6859161Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1056964608 and is now 1113587712. 
2025-12-04T12:10:21.6859165Z 2025-12-04T12:10:21.6859238Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6859509Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6859512Z 2025-12-04T12:10:21.6859600Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6859673Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6859717Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6859774Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6860331Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6860432Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6860469Z graph_break [] 2025-12-04T12:10:21.6860531Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6860605Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6861084Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6861132Z current_size = base.storage().size() 2025-12-04T12:10:21.6861173Z Autotune Choices Stats: 2025-12-04T12:10:21.6861542Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:21.6861592Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6861633Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6861746Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6861980Z triton_mm_7 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6862224Z triton_mm_16 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6862451Z triton_mm_17 0.0071 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6862679Z triton_mm_6 0.0072 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6862902Z triton_mm_12 0.0073 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6863125Z triton_mm_14 0.0077 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6863364Z triton_mm_9 0.0078 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6863586Z triton_mm_10 0.0083 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6863808Z triton_mm_5 0.0086 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6864048Z triton_mm_18 0.0093 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6864176Z SingleProcess AUTOTUNE benchmarking takes 0.1054 seconds and 0.5918 seconds precompiling for 20 choices 2025-12-04T12:10:21.6864249Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6864291Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6864347Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6864448Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6864935Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6864973Z graph_break [] 2025-12-04T12:10:21.6865036Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6865108Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6865148Z Autotune Choices Stats: 2025-12-04T12:10:21.6865518Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_26", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006639999803155661, "best_triton_pos": 0} 2025-12-04T12:10:21.6865568Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6865610Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6865709Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6865954Z triton_mm_26 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6866179Z triton_mm_31 0.0068 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6866405Z triton_mm_35 0.0072 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6866634Z triton_mm_25 0.0073 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6866859Z triton_mm_36 0.0074 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6867097Z triton_mm_28 0.0077 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6867321Z triton_mm_29 0.0078 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6867556Z triton_mm_24 0.0088 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6867781Z triton_mm_33 0.0088 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6868014Z triton_mm_27 0.0105 ms 63.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6868143Z SingleProcess AUTOTUNE benchmarking takes 0.1216 seconds and 0.2579 seconds precompiling for 20 choices 
2025-12-04T12:10:21.6868196Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6868342Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6868390Z Traceback (most recent call last): 2025-12-04T12:10:21.6868547Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6868588Z method(*args, **kwargs) 2025-12-04T12:10:21.6868742Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6868782Z method(*args, **kwargs) 2025-12-04T12:10:21.6868932Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6868978Z with policy(): 2025-12-04T12:10:21.6869130Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6869171Z raise RuntimeError(msg) 2025-12-04T12:10:21.6869561Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 
2025-12-04T12:10:21.6869573Z 2025-12-04T12:10:21.6869647Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6869907Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6869909Z 2025-12-04T12:10:21.6869997Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6870072Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6870142Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6870198Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6870683Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6870797Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6870835Z graph_break [] 2025-12-04T12:10:21.6870896Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6870971Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6871463Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6871512Z current_size = base.storage().size() 2025-12-04T12:10:21.6871554Z Autotune Choices Stats: 2025-12-04T12:10:21.6871919Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:21.6871966Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6872007Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6872107Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6872342Z triton_mm_7 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6872572Z triton_mm_16 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6872799Z triton_mm_17 0.0071 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6873040Z triton_mm_6 0.0072 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6873265Z triton_mm_12 0.0073 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6873501Z triton_mm_14 0.0077 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6873732Z triton_mm_9 0.0078 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6873955Z triton_mm_10 0.0083 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6874178Z triton_mm_5 0.0086 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6874410Z triton_mm_18 0.0093 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6874555Z SingleProcess AUTOTUNE benchmarking takes 0.1054 seconds and 0.5918 seconds precompiling for 20 choices 2025-12-04T12:10:21.6874629Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6874672Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6874728Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6874826Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6875319Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6875358Z graph_break [] 2025-12-04T12:10:21.6875421Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6875493Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6875534Z Autotune Choices Stats: 2025-12-04T12:10:21.6875898Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_26", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006639999803155661, "best_triton_pos": 0} 2025-12-04T12:10:21.6875946Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6875990Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6876087Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6876326Z triton_mm_26 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6876561Z triton_mm_31 0.0068 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6876790Z triton_mm_35 0.0072 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6877016Z triton_mm_25 0.0073 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6877253Z triton_mm_36 0.0074 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6877481Z triton_mm_28 0.0077 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6877703Z triton_mm_29 0.0078 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6877927Z triton_mm_24 0.0088 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6878158Z triton_mm_33 0.0088 ms 75.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6878389Z triton_mm_27 0.0105 ms 63.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6878517Z SingleProcess AUTOTUNE benchmarking takes 0.1216 seconds and 0.2579 seconds precompiling for 20 choices 
2025-12-04T12:10:21.6878588Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6878630Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6878697Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6878796Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6879278Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6879316Z graph_break [] 2025-12-04T12:10:21.6879377Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6879449Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6879489Z Autotune Choices Stats: 2025-12-04T12:10:21.6879848Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_50", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.6879897Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6879940Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6880037Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6880320Z triton_mm_50 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.6880554Z triton_mm_44 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6880794Z triton_mm_54 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6881021Z triton_mm_55 0.0072 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6881247Z triton_mm_45 0.0073 ms 91.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6881475Z triton_mm_47 0.0074 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6881699Z triton_mm_48 0.0077 ms 87.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6881935Z triton_mm_52 0.0082 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6882161Z triton_mm_43 0.0084 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6882402Z triton_mm_49 0.0096 ms 69.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6882532Z SingleProcess AUTOTUNE benchmarking takes 0.1454 seconds and 0.2342 seconds precompiling for 20 choices 2025-12-04T12:10:21.6882721Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-08d2f754fc8a4f15.xml - 2025-12-04T12:10:21.6882781Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6883371Z FAILED [0.7881s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 2025-12-04T12:10:21.6883373Z 2025-12-04T12:10:21.6883447Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6883706Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6883709Z 2025-12-04T12:10:21.6883796Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6883858Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6883934Z ================== 1 failed, 187 deselected, 2 rerun in 4.40s ================== 2025-12-04T12:10:21.6883973Z Got exit code 1 2025-12-04T12:10:21.6884013Z Retrying single test... 2025-12-04T12:10:21.6884157Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-fb0ed5ea470020cd.xml 2025-12-04T12:10:21.6884213Z ============================= test session starts ============================== 2025-12-04T12:10:21.6884334Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6884375Z cachedir: .pytest_cache 2025-12-04T12:10:21.6884534Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6884582Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6884623Z configfile: pytest.ini 2025-12-04T12:10:21.6884786Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6884860Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6885116Z stepcurrent: skipping 131 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6885161Z Running 1 items in this shard 2025-12-04T12:10:21.6885164Z 2025-12-04T12:10:21.6885381Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.8819s] [100%] 2025-12-04T12:10:21.6885605Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.9770s] [100%] 2025-12-04T12:10:21.6885799Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.0014s] [100%] 2025-12-04T12:10:21.6885802Z 2025-12-04T12:10:21.6885856Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6886001Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6886048Z Traceback (most recent call last): 2025-12-04T12:10:21.6886213Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6886256Z method(*args, **kwargs) 2025-12-04T12:10:21.6886409Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6886449Z method(*args, **kwargs) 2025-12-04T12:10:21.6886599Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6886638Z with policy(): 2025-12-04T12:10:21.6886791Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6886833Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6887226Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1056964608. 2025-12-04T12:10:21.6887230Z 2025-12-04T12:10:21.6887304Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6887563Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6887566Z 2025-12-04T12:10:21.6887652Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6887734Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6887777Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6887833Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6888317Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6888435Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6888471Z graph_break [] 2025-12-04T12:10:21.6888533Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6888606Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6889092Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6889142Z current_size = base.storage().size() 2025-12-04T12:10:21.6889183Z Autotune Choices Stats: 2025-12-04T12:10:21.6889555Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.6889611Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6889654Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6889753Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6889986Z triton_mm_7 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6890263Z triton_mm_12 0.0068 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6890492Z triton_mm_6 0.0077 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6890720Z triton_mm_16 0.0077 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6890943Z triton_mm_14 0.0082 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6891173Z triton_mm_9 0.0086 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6891398Z triton_mm_10 0.0086 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6891635Z triton_mm_17 0.0094 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6891859Z triton_mm_11 0.0098 ms 66.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6892100Z triton_mm_18 0.0098 ms 65.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6892230Z SingleProcess AUTOTUNE benchmarking takes 0.0964 seconds and 0.5867 seconds precompiling for 20 choices 2025-12-04T12:10:21.6892376Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 
2025-12-04T12:10:21.6893702Z Traceback (most recent call last): 2025-12-04T12:10:21.6893862Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6893905Z method(*args, **kwargs) 2025-12-04T12:10:21.6894055Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6894098Z method(*args, **kwargs) 2025-12-04T12:10:21.6894249Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6894308Z with policy(): 2025-12-04T12:10:21.6894460Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6894501Z raise RuntimeError(msg) 2025-12-04T12:10:21.6894898Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1056964608 and is now 1113587712. 
2025-12-04T12:10:21.6894900Z 2025-12-04T12:10:21.6894976Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6895248Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6895252Z 2025-12-04T12:10:21.6895340Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6895416Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6895459Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6895517Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6896007Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6896107Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6896144Z graph_break [] 2025-12-04T12:10:21.6896206Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6896280Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6896771Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6896819Z current_size = base.storage().size() 2025-12-04T12:10:21.6896859Z Autotune Choices Stats: 2025-12-04T12:10:21.6897229Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.6897287Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6897329Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6897428Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6897663Z triton_mm_7 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6897887Z triton_mm_12 0.0068 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6898116Z triton_mm_6 0.0077 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6898355Z triton_mm_16 0.0077 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6898578Z triton_mm_14 0.0082 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6898806Z triton_mm_9 0.0086 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6899041Z triton_mm_10 0.0086 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6899269Z triton_mm_17 0.0094 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6899496Z triton_mm_11 0.0098 ms 66.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6899724Z triton_mm_18 0.0098 ms 65.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6899854Z SingleProcess AUTOTUNE benchmarking takes 0.0964 seconds and 0.5867 seconds precompiling for 20 choices 2025-12-04T12:10:21.6899927Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6899970Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6900026Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6900162Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6900656Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6900694Z graph_break [] 2025-12-04T12:10:21.6900756Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6900846Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6900885Z Autotune Choices Stats: 2025-12-04T12:10:21.6901252Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_26", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007160000037401915, "best_triton_pos": 0} 2025-12-04T12:10:21.6901300Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6901341Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6901441Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6901674Z triton_mm_26 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6901904Z triton_mm_31 0.0073 ms 98.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6902141Z triton_mm_35 0.0073 ms 98.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6902370Z triton_mm_36 0.0074 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6902609Z triton_mm_25 0.0076 ms 94.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6902833Z triton_mm_29 0.0079 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6902877Z _scaled_mm 0.0080 ms 89.5% 2025-12-04T12:10:21.6903100Z triton_mm_33 0.0080 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6903323Z triton_mm_24 0.0087 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6903548Z triton_mm_30 0.0094 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6903678Z SingleProcess AUTOTUNE benchmarking takes 0.1383 seconds and 0.3092 seconds precompiling for 20 choices 2025-12-04T12:10:21.6903732Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6903878Z _ 
TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6903925Z Traceback (most recent call last): 2025-12-04T12:10:21.6904096Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6904138Z method(*args, **kwargs) 2025-12-04T12:10:21.6904288Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6904329Z method(*args, **kwargs) 2025-12-04T12:10:21.6904479Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6904529Z with policy(): 2025-12-04T12:10:21.6904681Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6904724Z raise RuntimeError(msg) 2025-12-04T12:10:21.6905118Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 
2025-12-04T12:10:21.6905120Z 2025-12-04T12:10:21.6905194Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6905455Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6905459Z 2025-12-04T12:10:21.6905546Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6905628Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6905670Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6905727Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6906213Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6906312Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6906349Z graph_break [] 2025-12-04T12:10:21.6906421Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6906494Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6906980Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6907027Z current_size = base.storage().size() 2025-12-04T12:10:21.6907069Z Autotune Choices Stats: 2025-12-04T12:10:21.6907438Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.6907485Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6907528Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6907626Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6907859Z triton_mm_7 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6908093Z triton_mm_12 0.0068 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6908323Z triton_mm_6 0.0077 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6908559Z triton_mm_16 0.0077 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6908782Z triton_mm_14 0.0082 ms 78.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6909011Z triton_mm_9 0.0086 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6909234Z triton_mm_10 0.0086 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6909463Z triton_mm_17 0.0094 ms 68.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6909697Z triton_mm_11 0.0098 ms 66.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6909926Z triton_mm_18 0.0098 ms 65.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6910055Z SingleProcess AUTOTUNE benchmarking takes 0.0964 seconds and 0.5867 seconds precompiling for 20 choices 2025-12-04T12:10:21.6910184Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6910227Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6910286Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6910384Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6910864Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6910902Z graph_break [] 2025-12-04T12:10:21.6910962Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6911034Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6911076Z Autotune Choices Stats: 2025-12-04T12:10:21.6911440Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_26", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007160000037401915, "best_triton_pos": 0} 2025-12-04T12:10:21.6911488Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6911529Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6911640Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6911872Z triton_mm_26 0.0072 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6912096Z triton_mm_31 0.0073 ms 98.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6912334Z triton_mm_35 0.0073 ms 98.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6912561Z triton_mm_36 0.0074 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6912790Z triton_mm_25 0.0076 ms 94.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6913014Z triton_mm_29 0.0079 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6913069Z _scaled_mm 0.0080 ms 89.5% 2025-12-04T12:10:21.6913290Z triton_mm_33 0.0080 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6913516Z triton_mm_24 0.0087 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6913739Z triton_mm_30 0.0094 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6913876Z SingleProcess AUTOTUNE benchmarking takes 0.1383 seconds and 0.3092 seconds precompiling for 20 choices 2025-12-04T12:10:21.6913949Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6913993Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6914049Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6914147Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6914629Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6914665Z graph_break [] 2025-12-04T12:10:21.6914728Z aten_mm_info [('aten._scaled_mm.default_1_2048_1024', 1)] 2025-12-04T12:10:21.6914800Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6914842Z Autotune Choices Stats: 2025-12-04T12:10:21.6915205Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_55", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.6915261Z AUTOTUNE scaled_mm(1x1024, 1024x2048, , ) 2025-12-04T12:10:21.6915303Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.6915402Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6915634Z triton_mm_55 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6915869Z triton_mm_45 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6916096Z triton_mm_54 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6916318Z triton_mm_50 0.0073 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6916547Z triton_mm_44 0.0074 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6916773Z triton_mm_47 0.0074 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6917007Z triton_mm_48 0.0078 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.6917231Z triton_mm_43 0.0084 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6917472Z triton_mm_52 0.0084 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.6917704Z triton_mm_56 0.0090 ms 68.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.6917831Z SingleProcess AUTOTUNE benchmarking takes 0.1479 seconds and 0.2903 seconds precompiling for 20 choices 2025-12-04T12:10:21.6918023Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-fb0ed5ea470020cd.xml - 2025-12-04T12:10:21.6918083Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6918676Z FAILED [1.0014s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 2025-12-04T12:10:21.6918680Z 2025-12-04T12:10:21.6918754Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6919012Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6919024Z 2025-12-04T12:10:21.6919113Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6919174Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6919242Z ================== 1 failed, 187 deselected, 2 rerun in 4.88s ================== 2025-12-04T12:10:21.6919290Z Got exit code 1 2025-12-04T12:10:21.6919499Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6919625Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6919770Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-8fe02393fad7d0bc.xml 2025-12-04T12:10:21.6919827Z ============================= test session starts ============================== 2025-12-04T12:10:21.6919941Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6919983Z cachedir: .pytest_cache 2025-12-04T12:10:21.6920182Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6920229Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6920271Z configfile: pytest.ini 2025-12-04T12:10:21.6920436Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6920526Z collecting ... collected 188 items / 132 deselected / 56 selected 2025-12-04T12:10:21.6920581Z stepcurrent: skipping 132 already run items. 
2025-12-04T12:10:21.6920625Z Running 56 items in this shard 2025-12-04T12:10:21.6920627Z 2025-12-04T12:10:21.6920847Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9188s] [ 1%] 2025-12-04T12:10:21.6921057Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3612s] [ 1%] 2025-12-04T12:10:21.6921244Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3172s] [ 1%] 2025-12-04T12:10:21.6921259Z 2025-12-04T12:10:21.6921312Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6921454Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6921501Z Traceback (most recent call last): 2025-12-04T12:10:21.6921658Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6921700Z method(*args, **kwargs) 2025-12-04T12:10:21.6921855Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6921896Z method(*args, **kwargs) 2025-12-04T12:10:21.6922046Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6922084Z with policy(): 2025-12-04T12:10:21.6922236Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6922278Z raise RuntimeError(msg) 2025-12-04T12:10:21.6922665Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.6922667Z 2025-12-04T12:10:21.6922742Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6923011Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6923013Z 2025-12-04T12:10:21.6923101Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6923187Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6923231Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6923289Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6923355Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6923454Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6923490Z graph_break [] 2025-12-04T12:10:21.6923551Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6923692Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6923738Z Traceback (most recent call last): 2025-12-04T12:10:21.6923890Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6923930Z method(*args, **kwargs) 2025-12-04T12:10:21.6924084Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6924124Z method(*args, **kwargs) 2025-12-04T12:10:21.6924282Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6924322Z with policy(): 2025-12-04T12:10:21.6924472Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6924513Z raise RuntimeError(msg) 2025-12-04T12:10:21.6924895Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.6924897Z 2025-12-04T12:10:21.6924983Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6925240Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6925244Z 2025-12-04T12:10:21.6925330Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6925403Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6925445Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6925502Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6925567Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6925665Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6925701Z graph_break [] 2025-12-04T12:10:21.6925760Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6925835Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6925877Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6925933Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6926029Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6926093Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6926131Z graph_break [] 2025-12-04T12:10:21.6926189Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6926252Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6926395Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6926442Z Traceback (most recent call last): 2025-12-04T12:10:21.6926594Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6926646Z method(*args, **kwargs) 2025-12-04T12:10:21.6926795Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6926837Z method(*args, **kwargs) 2025-12-04T12:10:21.6926984Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6927021Z with policy(): 2025-12-04T12:10:21.6927171Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6927214Z raise RuntimeError(msg) 2025-12-04T12:10:21.6927595Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.6927600Z 2025-12-04T12:10:21.6927672Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6927927Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6927943Z 2025-12-04T12:10:21.6928029Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6928102Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6928146Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6928203Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6928266Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6928365Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6928402Z graph_break [] 2025-12-04T12:10:21.6928471Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6928544Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6928587Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6928641Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6928737Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6928800Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6928837Z graph_break [] 2025-12-04T12:10:21.6928895Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6928968Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6929009Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6929064Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6929159Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6929223Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6929259Z graph_break [] 2025-12-04T12:10:21.6929320Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6929509Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-8fe02393fad7d0bc.xml - 2025-12-04T12:10:21.6929569Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6930201Z FAILED [0.3172s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.6930218Z 2025-12-04T12:10:21.6930290Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6930544Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6930547Z 2025-12-04T12:10:21.6930633Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6930695Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6930764Z ================== 1 failed, 132 deselected, 2 rerun in 2.62s ================== 2025-12-04T12:10:21.6930802Z Got exit code 1 2025-12-04T12:10:21.6930842Z Retrying single test... 
2025-12-04T12:10:21.6930986Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-834bd4690df18cc2.xml 2025-12-04T12:10:21.6931042Z ============================= test session starts ============================== 2025-12-04T12:10:21.6931155Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6931218Z cachedir: .pytest_cache 2025-12-04T12:10:21.6931374Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6931420Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6931460Z configfile: pytest.ini 2025-12-04T12:10:21.6931624Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6931698Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6931951Z stepcurrent: skipping 132 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6931994Z Running 1 items in this shard 2025-12-04T12:10:21.6932009Z 2025-12-04T12:10:21.6932224Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9412s] [100%] 2025-12-04T12:10:21.6932432Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.4005s] [100%] 2025-12-04T12:10:21.6932617Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3307s] [100%] 2025-12-04T12:10:21.6932620Z 2025-12-04T12:10:21.6932672Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6932815Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6932862Z Traceback (most recent call last): 2025-12-04T12:10:21.6933018Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6933059Z method(*args, **kwargs) 2025-12-04T12:10:21.6933209Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6933249Z method(*args, **kwargs) 2025-12-04T12:10:21.6933397Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6933434Z with policy(): 2025-12-04T12:10:21.6933597Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6933639Z raise RuntimeError(msg) 
2025-12-04T12:10:21.6934022Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.6934034Z 2025-12-04T12:10:21.6934109Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6934363Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6934366Z 2025-12-04T12:10:21.6934452Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6934526Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6934568Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6934624Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6934688Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6934787Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6934824Z graph_break [] 2025-12-04T12:10:21.6934884Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6935039Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6935085Z Traceback (most recent call last): 2025-12-04T12:10:21.6935238Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6935278Z method(*args, **kwargs) 2025-12-04T12:10:21.6935428Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.6935468Z method(*args, **kwargs) 2025-12-04T12:10:21.6935616Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6935653Z with policy(): 2025-12-04T12:10:21.6935815Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6935857Z raise RuntimeError(msg) 2025-12-04T12:10:21.6936238Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.6936240Z 2025-12-04T12:10:21.6936314Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6936568Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6936570Z 2025-12-04T12:10:21.6936655Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6936730Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6936772Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6936829Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6936894Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6936991Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6937027Z graph_break [] 2025-12-04T12:10:21.6937085Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6937167Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:21.6937211Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6937266Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6937363Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6937436Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6937473Z graph_break [] 2025-12-04T12:10:21.6937530Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6937583Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6937725Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6937772Z Traceback (most recent call last): 2025-12-04T12:10:21.6937924Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6937966Z method(*args, **kwargs) 2025-12-04T12:10:21.6938117Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6938159Z method(*args, **kwargs) 2025-12-04T12:10:21.6938308Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6938346Z with policy(): 2025-12-04T12:10:21.6938500Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6938555Z raise RuntimeError(msg) 2025-12-04T12:10:21.6938935Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.6938939Z 2025-12-04T12:10:21.6939010Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6939265Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6939268Z 2025-12-04T12:10:21.6939363Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6939436Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6939479Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6939534Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6939599Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6939697Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6939735Z graph_break [] 2025-12-04T12:10:21.6939795Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6939868Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6939910Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6939963Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6940060Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6940160Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6940199Z graph_break [] 2025-12-04T12:10:21.6940258Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6940333Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6940374Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6940428Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6940537Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6940601Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6940638Z graph_break [] 2025-12-04T12:10:21.6940695Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6940884Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-834bd4690df18cc2.xml - 2025-12-04T12:10:21.6940958Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6941533Z FAILED [0.3307s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.6941536Z 2025-12-04T12:10:21.6941609Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6941862Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6941864Z 2025-12-04T12:10:21.6941949Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6942012Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6942093Z ================== 1 failed, 187 deselected, 2 rerun in 2.69s ================== 2025-12-04T12:10:21.6942130Z Got exit code 1 2025-12-04T12:10:21.6942170Z Retrying single test... 
2025-12-04T12:10:21.6942313Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-088f89f5a1eb43cd.xml 2025-12-04T12:10:21.6942371Z ============================= test session starts ============================== 2025-12-04T12:10:21.6942480Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6942521Z cachedir: .pytest_cache 2025-12-04T12:10:21.6942678Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6942739Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6942780Z configfile: pytest.ini 2025-12-04T12:10:21.6942943Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6943018Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6943269Z stepcurrent: skipping 132 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6943313Z Running 1 items in this shard 2025-12-04T12:10:21.6943316Z 2025-12-04T12:10:21.6943528Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8912s] [100%] 2025-12-04T12:10:21.6943737Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3468s] [100%] 2025-12-04T12:10:21.6943923Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3123s] [100%] 2025-12-04T12:10:21.6943926Z 2025-12-04T12:10:21.6943977Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6944116Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6944163Z Traceback (most recent call last): 2025-12-04T12:10:21.6944330Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6944371Z method(*args, **kwargs) 2025-12-04T12:10:21.6944521Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6944562Z method(*args, **kwargs) 2025-12-04T12:10:21.6944728Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6944767Z with policy(): 2025-12-04T12:10:21.6944919Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6944960Z raise RuntimeError(msg) 
2025-12-04T12:10:21.6945342Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.6945345Z 2025-12-04T12:10:21.6945417Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6945674Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6945679Z 2025-12-04T12:10:21.6945764Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6945848Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6945889Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6945946Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6946010Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6946112Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6946149Z graph_break [] 2025-12-04T12:10:21.6946209Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6946348Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6946394Z Traceback (most recent call last): 2025-12-04T12:10:21.6946557Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6946598Z method(*args, **kwargs) 2025-12-04T12:10:21.6946748Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.6946788Z method(*args, **kwargs) 2025-12-04T12:10:21.6946935Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6946973Z with policy(): 2025-12-04T12:10:21.6947124Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6947166Z raise RuntimeError(msg) 2025-12-04T12:10:21.6947547Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.6947551Z 2025-12-04T12:10:21.6947623Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6947878Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6947880Z 2025-12-04T12:10:21.6947964Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6948049Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6948091Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6948148Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6948212Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6948312Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6948358Z graph_break [] 2025-12-04T12:10:21.6948417Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6948491Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:21.6948533Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6948587Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6948683Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6948746Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6948783Z graph_break [] 2025-12-04T12:10:21.6948840Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6948892Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6949030Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6949078Z Traceback (most recent call last): 2025-12-04T12:10:21.6949230Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6949281Z method(*args, **kwargs) 2025-12-04T12:10:21.6949430Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6949470Z method(*args, **kwargs) 2025-12-04T12:10:21.6949619Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6949656Z with policy(): 2025-12-04T12:10:21.6949807Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6949849Z raise RuntimeError(msg) 2025-12-04T12:10:21.6950286Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.6950292Z 2025-12-04T12:10:21.6950364Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6950617Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6950619Z 2025-12-04T12:10:21.6950705Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6950778Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6950819Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6950875Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6950941Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6951038Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6951075Z graph_break [] 2025-12-04T12:10:21.6951134Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6951206Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6951247Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6951301Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6951409Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6951473Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6951510Z graph_break [] 2025-12-04T12:10:21.6951568Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6951641Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6951698Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6951753Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6951848Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6951911Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6951948Z graph_break [] 2025-12-04T12:10:21.6952004Z aten_mm_info [('aten._scaled_mm.default_1_16_16', 1)] 2025-12-04T12:10:21.6952195Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-088f89f5a1eb43cd.xml - 2025-12-04T12:10:21.6952256Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6952827Z FAILED [0.3123s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.6952844Z 2025-12-04T12:10:21.6952916Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6953169Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6953171Z 2025-12-04T12:10:21.6953257Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6953318Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6953386Z ================== 1 failed, 187 deselected, 2 rerun in 2.57s ================== 2025-12-04T12:10:21.6953423Z Got exit code 1 2025-12-04T12:10:21.6953635Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6953763Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6953906Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-da924bd87b263fbb.xml 2025-12-04T12:10:21.6954035Z ============================= test session starts ============================== 2025-12-04T12:10:21.6954146Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6954188Z cachedir: .pytest_cache 2025-12-04T12:10:21.6954345Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6954390Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6954431Z configfile: pytest.ini 2025-12-04T12:10:21.6954597Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6954675Z collecting ... collected 188 items / 133 deselected / 55 selected 2025-12-04T12:10:21.6954729Z stepcurrent: skipping 133 already run items. 
2025-12-04T12:10:21.6954774Z Running 55 items in this shard 2025-12-04T12:10:21.6954776Z 2025-12-04T12:10:21.6954992Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0623s] [ 1%] 2025-12-04T12:10:21.6955218Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3401s] [ 1%] 2025-12-04T12:10:21.6955407Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3288s] [ 1%] 2025-12-04T12:10:21.6955419Z 2025-12-04T12:10:21.6955471Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6955612Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6955659Z Traceback (most recent call last): 2025-12-04T12:10:21.6955815Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6955855Z method(*args, **kwargs) 2025-12-04T12:10:21.6956006Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6956045Z method(*args, **kwargs) 2025-12-04T12:10:21.6956194Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6956231Z with policy(): 2025-12-04T12:10:21.6956382Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6956424Z raise RuntimeError(msg) 2025-12-04T12:10:21.6956814Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.6956825Z 2025-12-04T12:10:21.6956900Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6957158Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6957160Z 2025-12-04T12:10:21.6957246Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6957330Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6957374Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6957428Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6957494Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6957591Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6957629Z graph_break [] 2025-12-04T12:10:21.6957690Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6957832Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6957877Z Traceback (most recent call last): 2025-12-04T12:10:21.6958031Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6958070Z method(*args, **kwargs) 2025-12-04T12:10:21.6958223Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6958263Z method(*args, **kwargs) 2025-12-04T12:10:21.6958413Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6958452Z with policy(): 2025-12-04T12:10:21.6958604Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6958644Z raise RuntimeError(msg) 2025-12-04T12:10:21.6959046Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.6959049Z 2025-12-04T12:10:21.6959123Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6959393Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6959396Z 2025-12-04T12:10:21.6959482Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6959554Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6959597Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6959654Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6959721Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6959819Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6959855Z graph_break [] 2025-12-04T12:10:21.6959914Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6959988Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6960030Z frames [('total', 1), ('ok', 1)] 
2025-12-04T12:10:21.6960085Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6960233Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6960297Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6960333Z graph_break [] 2025-12-04T12:10:21.6960393Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6960445Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6960587Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6960632Z Traceback (most recent call last): 2025-12-04T12:10:21.6960785Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6960838Z method(*args, **kwargs) 2025-12-04T12:10:21.6960991Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6961032Z method(*args, **kwargs) 2025-12-04T12:10:21.6961183Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6961222Z with policy(): 2025-12-04T12:10:21.6961372Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6961415Z raise RuntimeError(msg) 2025-12-04T12:10:21.6961799Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.6961802Z 2025-12-04T12:10:21.6961875Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6962131Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6962134Z 2025-12-04T12:10:21.6962220Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6962292Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6962347Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6962402Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6962467Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6962564Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6962601Z graph_break [] 2025-12-04T12:10:21.6962673Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6962746Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6962790Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6962843Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6962939Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6963003Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6963041Z graph_break [] 2025-12-04T12:10:21.6963101Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6963174Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6963215Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6963269Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6963363Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6963430Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6963466Z graph_break [] 2025-12-04T12:10:21.6963525Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6963723Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-da924bd87b263fbb.xml - 2025-12-04T12:10:21.6963784Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6964364Z FAILED [0.3288s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.6964368Z 2025-12-04T12:10:21.6964449Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6964707Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6964710Z 2025-12-04T12:10:21.6964794Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6964859Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6964926Z ================== 1 failed, 133 deselected, 2 rerun in 2.75s ================== 2025-12-04T12:10:21.6964964Z Got exit code 1 2025-12-04T12:10:21.6965005Z Retrying single test... 
2025-12-04T12:10:21.6965148Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a744535c17bb8640.xml 2025-12-04T12:10:21.6965204Z ============================= test session starts ============================== 2025-12-04T12:10:21.6965318Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6965359Z cachedir: .pytest_cache 2025-12-04T12:10:21.6965518Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6965562Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6965603Z configfile: pytest.ini 2025-12-04T12:10:21.6965764Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6965848Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6966102Z stepcurrent: skipping 133 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6966146Z Running 1 items in this shard 2025-12-04T12:10:21.6966158Z 2025-12-04T12:10:21.6966371Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9309s] [100%] 2025-12-04T12:10:21.6966584Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3632s] [100%] 2025-12-04T12:10:21.6966771Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3151s] [100%] 2025-12-04T12:10:21.6966774Z 2025-12-04T12:10:21.6966824Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6966965Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6967011Z Traceback (most recent call last): 2025-12-04T12:10:21.6967169Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6967210Z method(*args, **kwargs) 2025-12-04T12:10:21.6967374Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6967413Z method(*args, **kwargs) 2025-12-04T12:10:21.6967564Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6967600Z with policy(): 2025-12-04T12:10:21.6967752Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6967793Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6968189Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.6968192Z 2025-12-04T12:10:21.6968265Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6968522Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6968524Z 2025-12-04T12:10:21.6968610Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6968683Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6968725Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6968780Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6968844Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6968942Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6968980Z graph_break [] 2025-12-04T12:10:21.6969039Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6969181Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6969226Z Traceback (most recent call last): 2025-12-04T12:10:21.6969378Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6969419Z method(*args, **kwargs) 2025-12-04T12:10:21.6969581Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:21.6969620Z method(*args, **kwargs) 2025-12-04T12:10:21.6969771Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6969809Z with policy(): 2025-12-04T12:10:21.6969972Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6970123Z raise RuntimeError(msg) 2025-12-04T12:10:21.6970507Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.6970510Z 2025-12-04T12:10:21.6970583Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6970837Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6970840Z 2025-12-04T12:10:21.6970925Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6970999Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6971042Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6971113Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6971178Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6971274Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6971311Z graph_break [] 2025-12-04T12:10:21.6971369Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6971444Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.6971485Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6971541Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6971636Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6971728Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6971765Z graph_break [] 2025-12-04T12:10:21.6971825Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6971878Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6972020Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6972067Z Traceback (most recent call last): 2025-12-04T12:10:21.6972221Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6972262Z method(*args, **kwargs) 2025-12-04T12:10:21.6972411Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6972451Z method(*args, **kwargs) 2025-12-04T12:10:21.6972601Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6972639Z with policy(): 2025-12-04T12:10:21.6972790Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6972833Z raise RuntimeError(msg) 2025-12-04T12:10:21.6973231Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.6973233Z 2025-12-04T12:10:21.6973306Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6973560Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6973574Z 2025-12-04T12:10:21.6973663Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6973736Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6973778Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6973833Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6973896Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6973993Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6974029Z graph_break [] 2025-12-04T12:10:21.6974089Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6974162Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6974203Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6974256Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6974352Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6974415Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6974464Z graph_break [] 2025-12-04T12:10:21.6974521Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6974594Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6974635Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6974689Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6974784Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6974848Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6974884Z graph_break [] 2025-12-04T12:10:21.6974942Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6975140Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a744535c17bb8640.xml - 2025-12-04T12:10:21.6975203Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6975781Z FAILED [0.3151s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.6975786Z 2025-12-04T12:10:21.6975857Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6976115Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6976117Z 2025-12-04T12:10:21.6976202Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6976264Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.6976331Z ================== 1 failed, 187 deselected, 2 rerun in 2.63s ================== 2025-12-04T12:10:21.6976369Z Got exit code 1 2025-12-04T12:10:21.6976409Z Retrying single test... 
2025-12-04T12:10:21.6976550Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a31275549a78c72f.xml 2025-12-04T12:10:21.6976616Z ============================= test session starts ============================== 2025-12-04T12:10:21.6976728Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6976769Z cachedir: .pytest_cache 2025-12-04T12:10:21.6976927Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6976982Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6977023Z configfile: pytest.ini 2025-12-04T12:10:21.6977184Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6977260Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.6977514Z stepcurrent: skipping 133 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6977559Z Running 1 items in this shard 2025-12-04T12:10:21.6977562Z 2025-12-04T12:10:21.6977774Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7156s] [100%] 2025-12-04T12:10:21.6977984Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2772s] [100%] 2025-12-04T12:10:21.6978173Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2416s] [100%] 2025-12-04T12:10:21.6978188Z 2025-12-04T12:10:21.6978240Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6978381Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6978428Z Traceback (most recent call last): 2025-12-04T12:10:21.6978586Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6978626Z method(*args, **kwargs) 2025-12-04T12:10:21.6978779Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6978821Z method(*args, **kwargs) 2025-12-04T12:10:21.6978981Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6979019Z with policy(): 2025-12-04T12:10:21.6979172Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6979212Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.6979599Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.6979601Z 2025-12-04T12:10:21.6979674Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6979930Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6979933Z 2025-12-04T12:10:21.6980020Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6980145Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6980188Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6980243Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6980308Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6980419Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6980457Z graph_break [] 2025-12-04T12:10:21.6980515Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6980656Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6980716Z Traceback (most recent call last): 2025-12-04T12:10:21.6980870Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6980911Z method(*args, **kwargs) 2025-12-04T12:10:21.6981061Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:21.6981100Z method(*args, **kwargs) 2025-12-04T12:10:21.6981251Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6981289Z with policy(): 2025-12-04T12:10:21.6981441Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6981482Z raise RuntimeError(msg) 2025-12-04T12:10:21.6981866Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.6981885Z 2025-12-04T12:10:21.6981958Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6982212Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6982215Z 2025-12-04T12:10:21.6982302Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6982375Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6982417Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6982472Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6982537Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6982646Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6982684Z graph_break [] 2025-12-04T12:10:21.6982744Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6982817Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.6982859Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6982914Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6983009Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6983074Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6983111Z graph_break [] 2025-12-04T12:10:21.6983169Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6983222Z =================================== FAILURES =================================== 2025-12-04T12:10:21.6983364Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6983411Z Traceback (most recent call last): 2025-12-04T12:10:21.6983563Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6983605Z method(*args, **kwargs) 2025-12-04T12:10:21.6983754Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6983793Z method(*args, **kwargs) 2025-12-04T12:10:21.6983952Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6983991Z with policy(): 2025-12-04T12:10:21.6984141Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6984182Z raise RuntimeError(msg) 2025-12-04T12:10:21.6984564Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.6984580Z 2025-12-04T12:10:21.6984653Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6984910Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6984913Z 2025-12-04T12:10:21.6984999Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6985072Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6985113Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6985168Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6985233Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6985330Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6985385Z graph_break [] 2025-12-04T12:10:21.6985445Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6985517Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6985559Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6985613Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6985709Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6985772Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6985808Z graph_break [] 2025-12-04T12:10:21.6985865Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6985948Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6985991Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6986045Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6986140Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6986202Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.6986238Z graph_break [] 2025-12-04T12:10:21.6986295Z aten_mm_info [('aten._scaled_mm.default_1_2048_16', 1)] 2025-12-04T12:10:21.6986481Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a31275549a78c72f.xml - 2025-12-04T12:10:21.6986540Z =========================== short test summary info ============================ 2025-12-04T12:10:21.6987119Z FAILED [0.2416s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.6987124Z 2025-12-04T12:10:21.6987194Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6987450Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6987463Z 2025-12-04T12:10:21.6987549Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6987611Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.6987678Z ================== 1 failed, 187 deselected, 2 rerun in 2.25s ================== 2025-12-04T12:10:21.6987728Z Got exit code 1 2025-12-04T12:10:21.6987933Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.6988061Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.6988202Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-025b616b139e01b4.xml 2025-12-04T12:10:21.6988259Z ============================= test session starts ============================== 2025-12-04T12:10:21.6988369Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.6988411Z cachedir: .pytest_cache 2025-12-04T12:10:21.6988567Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.6988615Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.6988656Z configfile: pytest.ini 2025-12-04T12:10:21.6988817Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.6988905Z collecting ... collected 188 items / 134 deselected / 54 selected 2025-12-04T12:10:21.6988958Z stepcurrent: skipping 134 already run items. 
2025-12-04T12:10:21.6989002Z Running 54 items in this shard 2025-12-04T12:10:21.6989004Z 2025-12-04T12:10:21.6989219Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0491s] [ 1%] 2025-12-04T12:10:21.6989430Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.4999s] [ 1%] 2025-12-04T12:10:21.6989624Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda FAILED [0.5228s] [ 1%] 2025-12-04T12:10:21.6989627Z 2025-12-04T12:10:21.6989679Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.6989819Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6989866Z Traceback (most recent call last): 2025-12-04T12:10:21.6990021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6990062Z method(*args, **kwargs) 2025-12-04T12:10:21.6990251Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6990295Z method(*args, **kwargs) 2025-12-04T12:10:21.6990443Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6990481Z with policy(): 2025-12-04T12:10:21.6990634Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6990676Z raise RuntimeError(msg) 2025-12-04T12:10:21.6991061Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.6991064Z 2025-12-04T12:10:21.6991151Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6991410Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6991413Z 2025-12-04T12:10:21.6991497Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6991584Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6991626Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6991683Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6992169Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6992268Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6992305Z graph_break [] 2025-12-04T12:10:21.6992364Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.6992437Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6992922Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6992984Z current_size = base.storage().size() 2025-12-04T12:10:21.6993025Z Autotune Choices Stats: 2025-12-04T12:10:21.6993394Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009999999776482582, "best_triton_pos": 0} 2025-12-04T12:10:21.6993439Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.6993494Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6993595Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6993834Z triton_mm_0 0.0100 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6993877Z _scaled_mm 0.0248 ms 40.3% 2025-12-04T12:10:21.6994004Z SingleProcess AUTOTUNE benchmarking takes 0.0193 seconds and 0.0734 seconds precompiling for 2 choices 2025-12-04T12:10:21.6994148Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.6994192Z Traceback (most recent call last): 2025-12-04T12:10:21.6994347Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.6994387Z method(*args, **kwargs) 2025-12-04T12:10:21.6994540Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.6994579Z method(*args, **kwargs) 2025-12-04T12:10:21.6994729Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.6994766Z with policy(): 2025-12-04T12:10:21.6994918Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.6994957Z raise RuntimeError(msg) 2025-12-04T12:10:21.6995353Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1038090240. 2025-12-04T12:10:21.6995355Z 2025-12-04T12:10:21.6995439Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.6995695Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.6995698Z 2025-12-04T12:10:21.6995785Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.6995860Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6995904Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6995960Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6996444Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 
2025-12-04T12:10:21.6996542Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6996589Z graph_break [] 2025-12-04T12:10:21.6996648Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.6996722Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6997204Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.6997253Z current_size = base.storage().size() 2025-12-04T12:10:21.6997294Z Autotune Choices Stats: 2025-12-04T12:10:21.6997665Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009999999776482582, "best_triton_pos": 0} 2025-12-04T12:10:21.6997712Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.6997754Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6997855Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.6998090Z triton_mm_0 0.0100 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.6998132Z _scaled_mm 0.0248 ms 40.3% 2025-12-04T12:10:21.6998258Z SingleProcess AUTOTUNE benchmarking takes 0.0193 seconds and 0.0734 seconds precompiling for 2 choices 2025-12-04T12:10:21.6998333Z 
----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.6998374Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.6998431Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.6998528Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.6999014Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.6999052Z graph_break [] 2025-12-04T12:10:21.6999110Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.6999184Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.6999241Z Autotune Choices Stats: 2025-12-04T12:10:21.6999599Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:21.6999643Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.6999684Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.6999782Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7000014Z triton_mm_1 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7000054Z _scaled_mm 0.0212 ms 
32.2% 2025-12-04T12:10:21.7000218Z SingleProcess AUTOTUNE benchmarking takes 0.0124 seconds and 0.0527 seconds precompiling for 2 choices 2025-12-04T12:10:21.7000271Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7000428Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7000473Z Traceback (most recent call last): 2025-12-04T12:10:21.7000627Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7000670Z method(*args, **kwargs) 2025-12-04T12:10:21.7000821Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7000864Z method(*args, **kwargs) 2025-12-04T12:10:21.7001014Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7001065Z with policy(): 2025-12-04T12:10:21.7001217Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7001259Z raise RuntimeError(msg) 2025-12-04T12:10:21.7001642Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1134559232. 
2025-12-04T12:10:21.7001645Z 2025-12-04T12:10:21.7001719Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7001975Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7001977Z 2025-12-04T12:10:21.7002066Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7002138Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7002182Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7002237Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7002739Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7002837Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7002873Z graph_break [] 2025-12-04T12:10:21.7002933Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7003022Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7003503Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7003550Z current_size = base.storage().size() 2025-12-04T12:10:21.7003593Z Autotune Choices Stats: 2025-12-04T12:10:21.7003955Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009999999776482582, "best_triton_pos": 0} 2025-12-04T12:10:21.7004001Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7004042Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7004142Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7004385Z triton_mm_0 0.0100 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7004427Z _scaled_mm 0.0248 ms 40.3% 2025-12-04T12:10:21.7004556Z SingleProcess AUTOTUNE benchmarking takes 0.0193 seconds and 0.0734 seconds precompiling for 2 choices 2025-12-04T12:10:21.7004629Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7004672Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7004727Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7004825Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7005312Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7005350Z graph_break [] 2025-12-04T12:10:21.7005409Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7005483Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7005523Z Autotune Choices Stats: 2025-12-04T12:10:21.7005883Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006839999929070473, "best_triton_pos": 0} 2025-12-04T12:10:21.7005928Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7005968Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7006068Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7006296Z triton_mm_1 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7006347Z _scaled_mm 0.0212 ms 32.2% 2025-12-04T12:10:21.7006472Z SingleProcess AUTOTUNE benchmarking takes 0.0124 seconds and 0.0527 seconds precompiling for 2 choices 2025-12-04T12:10:21.7006546Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7006586Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7006642Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7006752Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7007195Z inductor [('triton_bundler_save_kernel', 8), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), 
('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('async_compile_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7007232Z graph_break [] 2025-12-04T12:10:21.7007293Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7007365Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7007405Z Autotune Choices Stats: 2025-12-04T12:10:21.7007869Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "_scaled_mm", "best_time": 0.006320000160485506, "best_triton_pos": 1, "best_triton_time": 0.007720000110566616, "best_triton_kernel": "triton_mm_2", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1"} 2025-12-04T12:10:21.7007924Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7007965Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7008062Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7008105Z _scaled_mm 0.0063 ms 100.0% 2025-12-04T12:10:21.7008332Z triton_mm_2 0.0077 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7008459Z SingleProcess AUTOTUNE benchmarking takes 0.0166 seconds and 0.1654 seconds precompiling for 2 choices 2025-12-04T12:10:21.7008655Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-025b616b139e01b4.xml - 2025-12-04T12:10:21.7008716Z =========================== short test summary info ============================ 
2025-12-04T12:10:21.7009294Z FAILED [0.5228s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1134559232. 2025-12-04T12:10:21.7009297Z 2025-12-04T12:10:21.7009370Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7009629Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7009632Z 2025-12-04T12:10:21.7009719Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7009783Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7009850Z ================== 1 failed, 134 deselected, 2 rerun in 3.09s ================== 2025-12-04T12:10:21.7009888Z Got exit code 1 2025-12-04T12:10:21.7009928Z Retrying single test... 
2025-12-04T12:10:21.7010080Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3b4948b5680fea2f.xml 2025-12-04T12:10:21.7010168Z ============================= test session starts ============================== 2025-12-04T12:10:21.7010281Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7010322Z cachedir: .pytest_cache 2025-12-04T12:10:21.7010482Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7010541Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7010582Z configfile: pytest.ini 2025-12-04T12:10:21.7010746Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7010823Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7011078Z stepcurrent: skipping 134 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7011121Z Running 1 items in this shard 2025-12-04T12:10:21.7011123Z 2025-12-04T12:10:21.7011336Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0663s] [100%] 2025-12-04T12:10:21.7011544Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.4894s] [100%] 2025-12-04T12:10:21.7011746Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda FAILED [0.6169s] [100%] 2025-12-04T12:10:21.7011748Z 2025-12-04T12:10:21.7011799Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7011940Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7011987Z Traceback (most recent call last): 2025-12-04T12:10:21.7012145Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7012187Z method(*args, **kwargs) 2025-12-04T12:10:21.7012354Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7012397Z method(*args, **kwargs) 2025-12-04T12:10:21.7012547Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7012585Z with policy(): 2025-12-04T12:10:21.7012738Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7012780Z raise RuntimeError(msg) 
2025-12-04T12:10:21.7013165Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.7013167Z 2025-12-04T12:10:21.7013240Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7013500Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7013503Z 2025-12-04T12:10:21.7013590Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7013662Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7013706Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7013761Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7014259Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7014369Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7014405Z graph_break [] 2025-12-04T12:10:21.7014465Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7014538Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7015023Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7015070Z current_size = base.storage().size() 2025-12-04T12:10:21.7015111Z Autotune Choices Stats: 2025-12-04T12:10:21.7015474Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006399000063538551, "best_triton_pos": 0} 2025-12-04T12:10:21.7015533Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7015573Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7015673Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7015906Z triton_mm_0 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7015948Z _scaled_mm 0.0069 ms 93.0% 2025-12-04T12:10:21.7016075Z SingleProcess AUTOTUNE benchmarking takes 0.0170 seconds and 0.0648 seconds precompiling for 2 choices 2025-12-04T12:10:21.7016225Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7016273Z Traceback (most recent call last): 2025-12-04T12:10:21.7016428Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7016472Z method(*args, **kwargs) 
2025-12-04T12:10:21.7016625Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7016665Z method(*args, **kwargs) 2025-12-04T12:10:21.7016815Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7016853Z with policy(): 2025-12-04T12:10:21.7017004Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7017045Z raise RuntimeError(msg) 2025-12-04T12:10:21.7017430Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1038090240. 2025-12-04T12:10:21.7017434Z 2025-12-04T12:10:21.7017509Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7017764Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7017767Z 2025-12-04T12:10:21.7017866Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7017941Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7017983Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7018039Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7018535Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7018635Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7018671Z graph_break [] 2025-12-04T12:10:21.7018732Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7018804Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7019286Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7019346Z current_size = base.storage().size() 2025-12-04T12:10:21.7019388Z Autotune Choices Stats: 2025-12-04T12:10:21.7019749Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006399000063538551, "best_triton_pos": 0} 2025-12-04T12:10:21.7019793Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7019834Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7019933Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7020214Z triton_mm_0 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7020257Z _scaled_mm 0.0069 ms 93.0% 2025-12-04T12:10:21.7020388Z SingleProcess 
AUTOTUNE benchmarking takes 0.0170 seconds and 0.0648 seconds precompiling for 2 choices 2025-12-04T12:10:21.7020460Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7020503Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7020558Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7020657Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7021132Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7021170Z graph_break [] 2025-12-04T12:10:21.7021231Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7021303Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7021344Z Autotune Choices Stats: 2025-12-04T12:10:21.7021712Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.7021758Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7021798Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7021896Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7022145Z triton_mm_1 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7022187Z _scaled_mm 0.0208 ms 32.3% 2025-12-04T12:10:21.7022315Z SingleProcess AUTOTUNE benchmarking takes 0.0153 seconds and 0.0553 seconds precompiling for 2 choices 2025-12-04T12:10:21.7022369Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7022512Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7022559Z Traceback (most recent call last): 2025-12-04T12:10:21.7022714Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7022755Z method(*args, **kwargs) 2025-12-04T12:10:21.7022907Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7022950Z method(*args, **kwargs) 2025-12-04T12:10:21.7023117Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7023154Z with policy(): 2025-12-04T12:10:21.7023308Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7023349Z raise RuntimeError(msg) 2025-12-04T12:10:21.7023735Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1056964608. 
2025-12-04T12:10:21.7023737Z 2025-12-04T12:10:21.7023810Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7024076Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7024080Z 2025-12-04T12:10:21.7024165Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7024239Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7024281Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7024337Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7024822Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7024924Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7024962Z graph_break [] 2025-12-04T12:10:21.7025020Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7025093Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7025584Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7025635Z current_size = base.storage().size() 2025-12-04T12:10:21.7025675Z Autotune Choices Stats: 2025-12-04T12:10:21.7026037Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006399000063538551, "best_triton_pos": 0} 2025-12-04T12:10:21.7026092Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7026134Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7026234Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7026466Z triton_mm_0 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7026508Z _scaled_mm 0.0069 ms 93.0% 2025-12-04T12:10:21.7026634Z SingleProcess AUTOTUNE benchmarking takes 0.0170 seconds and 0.0648 seconds precompiling for 2 choices 2025-12-04T12:10:21.7026708Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7026750Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7026806Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7026915Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7027392Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7027429Z graph_break [] 2025-12-04T12:10:21.7027489Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7027562Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7027625Z Autotune Choices Stats: 2025-12-04T12:10:21.7027980Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.7028026Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7028066Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7028165Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7028394Z triton_mm_1 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7028434Z _scaled_mm 0.0208 ms 32.3% 2025-12-04T12:10:21.7028562Z SingleProcess AUTOTUNE benchmarking takes 0.0153 seconds and 0.0553 seconds precompiling for 2 choices 2025-12-04T12:10:21.7028636Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7028679Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7028734Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7028832Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7029317Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), 
('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7029356Z graph_break [] 2025-12-04T12:10:21.7029415Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7029500Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7029542Z Autotune Choices Stats: 2025-12-04T12:10:21.7029902Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7029946Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7029987Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7030084Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7030347Z triton_mm_2 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7030389Z _scaled_mm 0.0242 ms 25.1% 2025-12-04T12:10:21.7030514Z SingleProcess AUTOTUNE benchmarking takes 0.0171 seconds and 0.0646 seconds precompiling for 2 choices 2025-12-04T12:10:21.7030717Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3b4948b5680fea2f.xml - 2025-12-04T12:10:21.7030776Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7031358Z FAILED [0.6169s] 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1056964608. 2025-12-04T12:10:21.7031360Z 2025-12-04T12:10:21.7031447Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7031703Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7031707Z 2025-12-04T12:10:21.7031793Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7031855Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7031925Z ================== 1 failed, 187 deselected, 2 rerun in 3.19s ================== 2025-12-04T12:10:21.7031963Z Got exit code 1 2025-12-04T12:10:21.7032005Z Retrying single test... 
2025-12-04T12:10:21.7032149Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-00fbdee80af304b0.xml 2025-12-04T12:10:21.7032206Z ============================= test session starts ============================== 2025-12-04T12:10:21.7032319Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7032363Z cachedir: .pytest_cache 2025-12-04T12:10:21.7032523Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7032569Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7032611Z configfile: pytest.ini 2025-12-04T12:10:21.7032774Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7032862Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7033117Z stepcurrent: skipping 134 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7033160Z Running 1 items in this shard 2025-12-04T12:10:21.7033176Z 2025-12-04T12:10:21.7033389Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.1803s] [100%] 2025-12-04T12:10:21.7033598Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.5233s] [100%] 2025-12-04T12:10:21.7033784Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda FAILED [0.6215s] [100%] 2025-12-04T12:10:21.7033786Z 2025-12-04T12:10:21.7033839Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7033981Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7034027Z Traceback (most recent call last): 2025-12-04T12:10:21.7034183Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7034226Z method(*args, **kwargs) 2025-12-04T12:10:21.7034376Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7034428Z method(*args, **kwargs) 2025-12-04T12:10:21.7034579Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7034620Z with policy(): 2025-12-04T12:10:21.7034772Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7034814Z raise RuntimeError(msg) 
2025-12-04T12:10:21.7035207Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.7035211Z 2025-12-04T12:10:21.7035284Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7035543Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7035545Z 2025-12-04T12:10:21.7035631Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7035705Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7035747Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7035803Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7036280Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7036381Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7036418Z graph_break [] 2025-12-04T12:10:21.7036477Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7036549Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7037038Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7037101Z current_size = base.storage().size() 2025-12-04T12:10:21.7037142Z Autotune Choices Stats: 2025-12-04T12:10:21.7037505Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009600000455975533, "best_triton_pos": 0} 2025-12-04T12:10:21.7037549Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7037590Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7037692Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7037923Z triton_mm_0 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7037963Z _scaled_mm 0.0212 ms 45.3% 2025-12-04T12:10:21.7038091Z SingleProcess AUTOTUNE benchmarking takes 0.0143 seconds and 0.0641 seconds precompiling for 2 choices 2025-12-04T12:10:21.7038232Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7038288Z Traceback (most recent call last): 2025-12-04T12:10:21.7038444Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7038484Z method(*args, **kwargs) 
2025-12-04T12:10:21.7038636Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7038676Z method(*args, **kwargs) 2025-12-04T12:10:21.7038827Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7038864Z with policy(): 2025-12-04T12:10:21.7039025Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7039067Z raise RuntimeError(msg) 2025-12-04T12:10:21.7039453Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1038090240. 2025-12-04T12:10:21.7039456Z 2025-12-04T12:10:21.7039529Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7039786Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7039788Z 2025-12-04T12:10:21.7039875Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7039950Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7039993Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7040049Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7040584Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7040682Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7040719Z graph_break [] 2025-12-04T12:10:21.7040777Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7040850Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7041341Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7041390Z current_size = base.storage().size() 2025-12-04T12:10:21.7041429Z Autotune Choices Stats: 2025-12-04T12:10:21.7041792Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009600000455975533, "best_triton_pos": 0} 2025-12-04T12:10:21.7041836Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7041876Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7041977Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7042210Z triton_mm_0 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7042270Z _scaled_mm 0.0212 ms 45.3% 2025-12-04T12:10:21.7042395Z SingleProcess 
AUTOTUNE benchmarking takes 0.0143 seconds and 0.0641 seconds precompiling for 2 choices 2025-12-04T12:10:21.7042469Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7042511Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7042567Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7042664Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7043153Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7043192Z graph_break [] 2025-12-04T12:10:21.7043252Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7043323Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7043364Z Autotune Choices Stats: 2025-12-04T12:10:21.7043721Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00675999978557229, "best_triton_pos": 0} 2025-12-04T12:10:21.7043766Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7043807Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7043904Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7044137Z triton_mm_1 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7044177Z _scaled_mm 0.0252 ms 26.8% 2025-12-04T12:10:21.7044313Z SingleProcess AUTOTUNE benchmarking takes 0.0153 seconds and 0.0583 seconds precompiling for 2 choices 2025-12-04T12:10:21.7044366Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7044507Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7044551Z Traceback (most recent call last): 2025-12-04T12:10:21.7044718Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7044758Z method(*args, **kwargs) 2025-12-04T12:10:21.7044911Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7044951Z method(*args, **kwargs) 2025-12-04T12:10:21.7045102Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7045139Z with policy(): 2025-12-04T12:10:21.7045292Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7045332Z raise RuntimeError(msg) 2025-12-04T12:10:21.7045718Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1056964608. 
2025-12-04T12:10:21.7045722Z 2025-12-04T12:10:21.7045815Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7046070Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7046072Z 2025-12-04T12:10:21.7046159Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7046232Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7046275Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7046330Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7046819Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7046920Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7046957Z graph_break [] 2025-12-04T12:10:21.7047017Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7047090Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7047569Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7047616Z current_size = base.storage().size() 2025-12-04T12:10:21.7047657Z Autotune Choices Stats: 2025-12-04T12:10:21.7048017Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.009600000455975533, "best_triton_pos": 0} 2025-12-04T12:10:21.7048063Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7048113Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7048214Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7048444Z triton_mm_0 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7048497Z _scaled_mm 0.0212 ms 45.3% 2025-12-04T12:10:21.7048625Z SingleProcess AUTOTUNE benchmarking takes 0.0143 seconds and 0.0641 seconds precompiling for 2 choices 2025-12-04T12:10:21.7048699Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7048740Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7048796Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7048894Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7049370Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7049409Z graph_break [] 2025-12-04T12:10:21.7049467Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7049540Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7049590Z Autotune Choices Stats: 2025-12-04T12:10:21.7049950Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00675999978557229, "best_triton_pos": 0} 2025-12-04T12:10:21.7049993Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7050034Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7050173Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7050416Z triton_mm_1 0.0068 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7050459Z _scaled_mm 0.0252 ms 26.8% 2025-12-04T12:10:21.7050585Z SingleProcess AUTOTUNE benchmarking takes 0.0153 seconds and 0.0583 seconds precompiling for 2 choices 2025-12-04T12:10:21.7050659Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7050701Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7050757Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7050855Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7051330Z inductor [('triton_bundler_save_kernel', 16), ('async_compile_cache_miss', 3), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7051367Z graph_break [] 2025-12-04T12:10:21.7051429Z aten_mm_info [('aten._scaled_mm.default_1_16_32', 1)] 2025-12-04T12:10:21.7051500Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7051541Z Autotune Choices Stats: 2025-12-04T12:10:21.7051909Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7051955Z AUTOTUNE scaled_mm(1x32, 32x16, , ) 2025-12-04T12:10:21.7051994Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7052092Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7052336Z triton_mm_2 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7052377Z _scaled_mm 0.0217 ms 28.0% 2025-12-04T12:10:21.7052504Z SingleProcess AUTOTUNE benchmarking takes 0.0143 seconds and 0.1603 seconds precompiling for 2 choices 2025-12-04T12:10:21.7052692Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-00fbdee80af304b0.xml - 2025-12-04T12:10:21.7052754Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7053328Z FAILED 
[0.6215s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1056964608. 2025-12-04T12:10:21.7053343Z 2025-12-04T12:10:21.7053416Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7055309Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7055313Z 2025-12-04T12:10:21.7055408Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7055472Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7055539Z ================== 1 failed, 187 deselected, 2 rerun in 3.35s ================== 2025-12-04T12:10:21.7055579Z Got exit code 1 2025-12-04T12:10:21.7055805Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7055936Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7056083Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a7e61c85f13ee3d9.xml 2025-12-04T12:10:21.7056141Z ============================= test session starts ============================== 2025-12-04T12:10:21.7056251Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7056293Z cachedir: .pytest_cache 2025-12-04T12:10:21.7056450Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7056497Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7056537Z configfile: pytest.ini 2025-12-04T12:10:21.7056706Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7056783Z collecting ... collected 188 items / 135 deselected / 53 selected 2025-12-04T12:10:21.7056843Z stepcurrent: skipping 135 already run items. 
2025-12-04T12:10:21.7056887Z Running 53 items in this shard 2025-12-04T12:10:21.7056889Z 2025-12-04T12:10:21.7057192Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.3965s] [ 1%] 2025-12-04T12:10:21.7057423Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8444s] [ 1%] 2025-12-04T12:10:21.7057611Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.8156s] [ 1%] 2025-12-04T12:10:21.7057613Z 2025-12-04T12:10:21.7057699Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7057841Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7057891Z Traceback (most recent call last): 2025-12-04T12:10:21.7058050Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7058092Z method(*args, **kwargs) 2025-12-04T12:10:21.7058245Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7058286Z method(*args, **kwargs) 2025-12-04T12:10:21.7058435Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7058473Z with policy(): 2025-12-04T12:10:21.7058626Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7058670Z raise RuntimeError(msg) 2025-12-04T12:10:21.7059059Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.7059073Z 2025-12-04T12:10:21.7059146Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7059408Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7059411Z 2025-12-04T12:10:21.7059497Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7059581Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7059626Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7059682Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7060205Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7060304Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7060340Z graph_break [] 2025-12-04T12:10:21.7060402Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7060475Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7060961Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7061011Z current_size = base.storage().size() 2025-12-04T12:10:21.7061052Z Autotune Choices Stats: 2025-12-04T12:10:21.7061439Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7061487Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7061528Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7061641Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7061876Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7062103Z triton_mm_0 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7062329Z triton_mm_3 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7062556Z triton_mm_7 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:21.7062598Z _scaled_mm 0.0066 ms 91.5% 2025-12-04T12:10:21.7062822Z triton_mm_5 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7063057Z triton_mm_6 0.0072 ms 82.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7063282Z triton_mm_2 0.0080 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7063521Z triton_mm_4 0.0088 ms 67.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7063649Z SingleProcess AUTOTUNE benchmarking takes 0.0458 seconds and 0.2136 seconds precompiling for 9 choices 2025-12-04T12:10:21.7063795Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7063840Z Traceback (most recent call last): 2025-12-04T12:10:21.7063996Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7064038Z method(*args, **kwargs) 2025-12-04T12:10:21.7064189Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7064229Z method(*args, **kwargs) 2025-12-04T12:10:21.7064379Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 
2025-12-04T12:10:21.7064418Z with policy(): 2025-12-04T12:10:21.7064571Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7064612Z raise RuntimeError(msg) 2025-12-04T12:10:21.7065004Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.7065017Z 2025-12-04T12:10:21.7065091Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7065350Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7065362Z 2025-12-04T12:10:21.7065450Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7065525Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7065569Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7065625Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7066108Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7067526Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7067564Z graph_break [] 2025-12-04T12:10:21.7067628Z aten_mm_info 
[('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7067702Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7068183Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7068247Z current_size = base.storage().size() 2025-12-04T12:10:21.7068289Z Autotune Choices Stats: 2025-12-04T12:10:21.7068664Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7068719Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7068761Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7068861Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7069094Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7069322Z triton_mm_0 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7069546Z triton_mm_3 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7069769Z triton_mm_7 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7069813Z _scaled_mm 0.0066 ms 91.5% 2025-12-04T12:10:21.7070034Z triton_mm_5 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7070320Z triton_mm_6 0.0072 ms 82.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7070545Z triton_mm_2 0.0080 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7070769Z triton_mm_4 0.0088 ms 67.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7070898Z SingleProcess AUTOTUNE benchmarking takes 0.0458 seconds and 0.2136 seconds precompiling for 9 choices 2025-12-04T12:10:21.7070974Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7071016Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7071075Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7071176Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7071705Z inductor [('triton_bundler_save_kernel', 72), 
('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7071756Z graph_break [] 2025-12-04T12:10:21.7071818Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7071890Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7071930Z Autotune Choices Stats: 2025-12-04T12:10:21.7072293Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7072342Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7072399Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7072499Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7072731Z triton_mm_11 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7072957Z triton_mm_9 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7073183Z triton_mm_12 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7073408Z triton_mm_14 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7073632Z triton_mm_10 0.0063 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7073864Z triton_mm_15 0.0064 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7073906Z _scaled_mm 0.0065 ms 93.3% 2025-12-04T12:10:21.7074127Z triton_mm_13 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7074352Z triton_mm_8 0.0073 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7074479Z SingleProcess AUTOTUNE benchmarking takes 0.0606 seconds and 0.2427 seconds precompiling for 9 choices 2025-12-04T12:10:21.7074532Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7074676Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7074722Z Traceback (most recent call last): 2025-12-04T12:10:21.7074880Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.7074940Z method(*args, **kwargs) 2025-12-04T12:10:21.7075092Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7075133Z method(*args, **kwargs) 2025-12-04T12:10:21.7075285Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7075334Z with policy(): 2025-12-04T12:10:21.7075485Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7075527Z raise RuntimeError(msg) 2025-12-04T12:10:21.7075918Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7075922Z 2025-12-04T12:10:21.7075996Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7076265Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7076268Z 2025-12-04T12:10:21.7076356Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7076428Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7076472Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7076527Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7077007Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7077107Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7077143Z graph_break [] 2025-12-04T12:10:21.7077204Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7077279Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7077773Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7077822Z current_size = base.storage().size() 2025-12-04T12:10:21.7077863Z Autotune Choices Stats: 2025-12-04T12:10:21.7078227Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7078274Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7078315Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7078414Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7078647Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7078870Z triton_mm_0 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7079108Z triton_mm_3 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7079341Z triton_mm_7 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7079384Z _scaled_mm 0.0066 ms 91.5% 2025-12-04T12:10:21.7079609Z triton_mm_5 0.0066 ms 90.9% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7079831Z triton_mm_6 0.0072 ms 82.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7080063Z triton_mm_2 0.0080 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7080321Z triton_mm_4 0.0088 ms 67.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7080449Z SingleProcess AUTOTUNE benchmarking takes 0.0458 seconds and 0.2136 seconds precompiling for 9 choices 2025-12-04T12:10:21.7080522Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7080564Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7080621Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7080721Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7081198Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7081237Z graph_break [] 2025-12-04T12:10:21.7081297Z aten_mm_info 
[('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7081384Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7081425Z Autotune Choices Stats: 2025-12-04T12:10:21.7081789Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7081835Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7081876Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7081975Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7082205Z triton_mm_11 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7082430Z triton_mm_9 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7082668Z triton_mm_12 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7082891Z triton_mm_14 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7083127Z triton_mm_10 0.0063 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7083348Z triton_mm_15 0.0064 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7083390Z _scaled_mm 0.0065 ms 93.3% 2025-12-04T12:10:21.7083630Z triton_mm_13 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7083857Z triton_mm_8 0.0073 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7083985Z SingleProcess AUTOTUNE benchmarking takes 0.0606 seconds and 0.2427 seconds precompiling for 9 choices 2025-12-04T12:10:21.7084058Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7084100Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7084157Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7084257Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7084732Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7084771Z graph_break [] 2025-12-04T12:10:21.7084830Z aten_mm_info 
[('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7084903Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7084943Z Autotune Choices Stats: 2025-12-04T12:10:21.7085313Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7085360Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7085403Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7085501Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7085735Z triton_mm_16 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7085960Z triton_mm_22 0.0063 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7086181Z triton_mm_18 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7086417Z triton_mm_23 0.0066 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7086651Z triton_mm_20 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7086874Z triton_mm_21 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7087098Z triton_mm_19 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7087333Z triton_mm_17 0.0068 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7087376Z _scaled_mm 0.0251 ms 23.9% 2025-12-04T12:10:21.7087502Z SingleProcess AUTOTUNE benchmarking takes 0.0705 seconds and 0.2538 seconds precompiling for 9 choices 2025-12-04T12:10:21.7087693Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a7e61c85f13ee3d9.xml - 2025-12-04T12:10:21.7087754Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7088337Z FAILED [0.8156s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7088341Z 2025-12-04T12:10:21.7088415Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7088673Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7088675Z 2025-12-04T12:10:21.7088773Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7088835Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7088903Z ================== 1 failed, 135 deselected, 2 rerun in 4.08s ================== 2025-12-04T12:10:21.7088941Z Got exit code 1 2025-12-04T12:10:21.7088981Z Retrying single test... 2025-12-04T12:10:21.7089124Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-88378b21121ffce3.xml 2025-12-04T12:10:21.7089181Z ============================= test session starts ============================== 2025-12-04T12:10:21.7089294Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7089335Z cachedir: .pytest_cache 2025-12-04T12:10:21.7089492Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7089538Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7089579Z configfile: pytest.ini 2025-12-04T12:10:21.7089744Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7089831Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7090135Z stepcurrent: skipping 135 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7090178Z Running 1 items in this shard 2025-12-04T12:10:21.7090193Z 2025-12-04T12:10:21.7090409Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.1521s] [100%] 2025-12-04T12:10:21.7090619Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.6195s] [100%] 2025-12-04T12:10:21.7090807Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.6148s] [100%] 2025-12-04T12:10:21.7090811Z 2025-12-04T12:10:21.7090862Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7091016Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7091062Z Traceback (most recent call last): 2025-12-04T12:10:21.7091219Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7091263Z method(*args, **kwargs) 2025-12-04T12:10:21.7091413Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7091454Z method(*args, **kwargs) 2025-12-04T12:10:21.7091604Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7091642Z with policy(): 2025-12-04T12:10:21.7091794Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7091837Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7092229Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.7092233Z 2025-12-04T12:10:21.7092306Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7092584Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7092586Z 2025-12-04T12:10:21.7092672Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7092747Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7092790Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7092847Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7093325Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7093424Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7093462Z graph_break [] 2025-12-04T12:10:21.7093522Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7093595Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7094092Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7094151Z current_size = base.storage().size() 2025-12-04T12:10:21.7094191Z Autotune Choices Stats: 2025-12-04T12:10:21.7094560Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7094605Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7094646Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7094746Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7094990Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7095217Z triton_mm_2 0.0060 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7095440Z triton_mm_7 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7095664Z triton_mm_4 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7095886Z triton_mm_6 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7096109Z triton_mm_0 0.0064 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7096341Z triton_mm_5 0.0072 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7096567Z triton_mm_3 0.0074 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7096610Z _scaled_mm 0.0222 ms 27.1% 2025-12-04T12:10:21.7096737Z SingleProcess AUTOTUNE benchmarking takes 0.0413 seconds and 0.2448 seconds precompiling for 9 choices 2025-12-04T12:10:21.7096880Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7096926Z Traceback (most recent call last): 2025-12-04T12:10:21.7097082Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7097122Z method(*args, **kwargs) 2025-12-04T12:10:21.7097275Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7097314Z method(*args, **kwargs) 2025-12-04T12:10:21.7097465Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7097514Z with policy(): 2025-12-04T12:10:21.7097667Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7097707Z raise RuntimeError(msg) 2025-12-04T12:10:21.7098094Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.7098116Z 2025-12-04T12:10:21.7098190Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7098450Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7098453Z 2025-12-04T12:10:21.7098540Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7098626Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7098670Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7098726Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7099207Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7099305Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7099342Z graph_break [] 2025-12-04T12:10:21.7099402Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7099476Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7099957Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7100004Z current_size = base.storage().size() 2025-12-04T12:10:21.7100045Z Autotune Choices Stats: 2025-12-04T12:10:21.7100453Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7100499Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7100540Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7100640Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7100872Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7101098Z triton_mm_2 0.0060 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7101321Z triton_mm_7 0.0060 ms 99.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7101561Z triton_mm_4 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7101782Z triton_mm_6 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7102017Z triton_mm_0 0.0064 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7102239Z triton_mm_5 0.0072 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7102475Z triton_mm_3 0.0074 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7102517Z _scaled_mm 0.0222 ms 27.1% 2025-12-04T12:10:21.7102645Z SingleProcess AUTOTUNE benchmarking takes 0.0413 seconds and 0.2448 seconds precompiling for 9 choices 2025-12-04T12:10:21.7102718Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7102761Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7102816Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7102917Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7103396Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7103434Z graph_break [] 2025-12-04T12:10:21.7103494Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7103568Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7103608Z Autotune Choices Stats: 2025-12-04T12:10:21.7103977Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.7104022Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7104064Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7104162Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7104394Z triton_mm_11 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7104621Z triton_mm_8 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7104845Z triton_mm_10 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7105073Z triton_mm_14 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7105317Z triton_mm_9 0.0061 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7105552Z triton_mm_13 0.0061 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7105812Z triton_mm_12 0.0066 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7106038Z triton_mm_15 0.0074 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7106092Z _scaled_mm 0.0217 ms 27.1% 2025-12-04T12:10:21.7106219Z SingleProcess AUTOTUNE benchmarking takes 0.0354 seconds and 0.0951 seconds precompiling for 9 choices 2025-12-04T12:10:21.7106274Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7106415Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7106464Z Traceback (most recent call last): 2025-12-04T12:10:21.7106624Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7106728Z method(*args, **kwargs) 2025-12-04T12:10:21.7106908Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7106972Z method(*args, **kwargs) 2025-12-04T12:10:21.7107163Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7107222Z with policy(): 2025-12-04T12:10:21.7107374Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7107416Z raise RuntimeError(msg) 2025-12-04T12:10:21.7107820Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7107823Z 2025-12-04T12:10:21.7107897Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7108157Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7108160Z 2025-12-04T12:10:21.7108248Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7108331Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7108376Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7108432Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7108916Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7109028Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7109064Z graph_break [] 2025-12-04T12:10:21.7109125Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7109198Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7109693Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7109742Z current_size = base.storage().size() 2025-12-04T12:10:21.7109784Z Autotune Choices Stats: 2025-12-04T12:10:21.7110191Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7110238Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7110278Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7110378Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7110615Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7110840Z triton_mm_2 0.0060 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7111069Z triton_mm_7 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7111292Z triton_mm_4 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7111520Z triton_mm_6 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7111766Z triton_mm_0 0.0064 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7111990Z triton_mm_5 0.0072 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7112220Z triton_mm_3 0.0074 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7112260Z _scaled_mm 0.0222 ms 27.1% 2025-12-04T12:10:21.7112397Z SingleProcess AUTOTUNE benchmarking takes 0.0413 seconds and 0.2448 seconds precompiling for 9 choices 2025-12-04T12:10:21.7112469Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7112512Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7112573Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7112672Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7113175Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7113226Z graph_break [] 2025-12-04T12:10:21.7113287Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 
2025-12-04T12:10:21.7113360Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7113404Z Autotune Choices Stats: 2025-12-04T12:10:21.7113767Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.7113813Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7113864Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7113966Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7114205Z triton_mm_11 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7114432Z triton_mm_8 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7114654Z triton_mm_10 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7114879Z triton_mm_14 0.0061 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7115104Z triton_mm_9 0.0061 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7115336Z triton_mm_13 0.0061 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7115564Z triton_mm_12 0.0066 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7115789Z triton_mm_15 0.0074 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7115831Z _scaled_mm 0.0217 ms 27.1% 2025-12-04T12:10:21.7115957Z SingleProcess AUTOTUNE benchmarking takes 0.0354 seconds and 0.0951 seconds precompiling for 9 choices 2025-12-04T12:10:21.7116029Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7116071Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7116126Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7116227Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7116703Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7116761Z graph_break [] 2025-12-04T12:10:21.7116820Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7116893Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7116933Z Autotune Choices Stats: 2025-12-04T12:10:21.7117291Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_20", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:21.7117336Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7117377Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7117487Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7117720Z triton_mm_20 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7117948Z triton_mm_17 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7118173Z triton_mm_19 0.0061 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7118398Z triton_mm_23 0.0062 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7118619Z triton_mm_18 0.0065 ms 91.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7118844Z triton_mm_16 0.0065 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7119078Z triton_mm_22 0.0066 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7119302Z triton_mm_21 0.0067 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7119342Z _scaled_mm 0.0217 ms 27.3% 2025-12-04T12:10:21.7119468Z SingleProcess AUTOTUNE benchmarking takes 0.0554 seconds and 0.1874 seconds precompiling for 9 choices 2025-12-04T12:10:21.7119660Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-88378b21121ffce3.xml - 2025-12-04T12:10:21.7119720Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7120353Z FAILED [0.6148s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7120371Z 2025-12-04T12:10:21.7120446Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7120703Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7120724Z 2025-12-04T12:10:21.7120812Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7120874Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7120943Z ================== 1 failed, 187 deselected, 2 rerun in 3.41s ================== 2025-12-04T12:10:21.7120980Z Got exit code 1 2025-12-04T12:10:21.7121021Z Retrying single test... 2025-12-04T12:10:21.7121168Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0bc9d671d4a59081.xml 2025-12-04T12:10:21.7121239Z ============================= test session starts ============================== 2025-12-04T12:10:21.7121352Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7121394Z cachedir: .pytest_cache 2025-12-04T12:10:21.7121553Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7121598Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7121639Z configfile: pytest.ini 2025-12-04T12:10:21.7121804Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7121878Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7122135Z stepcurrent: skipping 135 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7122180Z Running 1 items in this shard 2025-12-04T12:10:21.7122182Z 2025-12-04T12:10:21.7122394Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.3112s] [100%] 2025-12-04T12:10:21.7122606Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.9268s] [100%] 2025-12-04T12:10:21.7122807Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.7164s] [100%] 2025-12-04T12:10:21.7122809Z 2025-12-04T12:10:21.7122861Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7123004Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7123050Z Traceback (most recent call last): 2025-12-04T12:10:21.7123207Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7123250Z method(*args, **kwargs) 2025-12-04T12:10:21.7123401Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7123442Z method(*args, **kwargs) 2025-12-04T12:10:21.7123592Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7123630Z with policy(): 2025-12-04T12:10:21.7123783Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7123825Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7124227Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.7124240Z 2025-12-04T12:10:21.7124313Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7124572Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7124574Z 2025-12-04T12:10:21.7124660Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7124733Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7124775Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7124833Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7125320Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7125419Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7125456Z graph_break [] 2025-12-04T12:10:21.7125516Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7125590Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7126072Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7126121Z current_size = base.storage().size() 2025-12-04T12:10:21.7126162Z Autotune Choices Stats: 2025-12-04T12:10:21.7126530Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.7126573Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7126631Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7126731Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7126967Z triton_mm_5 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7127193Z triton_mm_6 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7127416Z triton_mm_7 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7127641Z triton_mm_3 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7127876Z triton_mm_4 0.0064 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7128101Z triton_mm_1 0.0068 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7128335Z triton_mm_2 0.0078 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7128560Z triton_mm_0 0.0100 ms 60.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7128602Z _scaled_mm 0.0231 ms 26.2% 2025-12-04T12:10:21.7128728Z SingleProcess AUTOTUNE benchmarking takes 0.0432 seconds and 0.2484 seconds precompiling for 9 choices 2025-12-04T12:10:21.7128879Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7128924Z Traceback (most recent call last): 2025-12-04T12:10:21.7129081Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7129121Z method(*args, **kwargs) 2025-12-04T12:10:21.7129275Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7129315Z method(*args, **kwargs) 2025-12-04T12:10:21.7129466Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7129505Z with policy(): 2025-12-04T12:10:21.7129659Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7129699Z raise RuntimeError(msg) 2025-12-04T12:10:21.7130088Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.7130130Z 2025-12-04T12:10:21.7130203Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7130480Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7130482Z 2025-12-04T12:10:21.7130570Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7130645Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7130690Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7130746Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7131226Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7131325Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7131363Z graph_break [] 2025-12-04T12:10:21.7131423Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7131496Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7132000Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7132060Z current_size = base.storage().size() 2025-12-04T12:10:21.7132101Z Autotune Choices Stats: 2025-12-04T12:10:21.7132463Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.7132508Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7132549Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7132649Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7132894Z triton_mm_5 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7133120Z triton_mm_6 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7133343Z triton_mm_7 0.0061 ms 98.7% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7133568Z triton_mm_3 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7133795Z triton_mm_4 0.0064 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7134019Z triton_mm_1 0.0068 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7134254Z triton_mm_2 0.0078 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7134478Z triton_mm_0 0.0100 ms 60.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7134519Z _scaled_mm 0.0231 ms 26.2% 2025-12-04T12:10:21.7134648Z SingleProcess AUTOTUNE benchmarking takes 0.0432 seconds and 0.2484 seconds precompiling for 9 choices 2025-12-04T12:10:21.7134722Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7134764Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7134820Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7134919Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7135396Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7135444Z graph_break [] 2025-12-04T12:10:21.7135505Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7135578Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7135628Z Autotune Choices Stats: 2025-12-04T12:10:21.7135985Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.7136031Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7136071Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7136170Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7136410Z triton_mm_8 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7136635Z triton_mm_15 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7136862Z triton_mm_11 0.0067 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7137090Z triton_mm_12 0.0070 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7137316Z triton_mm_13 0.0080 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7137538Z triton_mm_14 0.0082 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7137763Z triton_mm_9 0.0090 ms 69.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7137995Z triton_mm_10 0.0093 ms 67.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7138037Z _scaled_mm 0.0245 ms 25.4% 2025-12-04T12:10:21.7138163Z SingleProcess AUTOTUNE benchmarking takes 0.0596 seconds and 0.1782 seconds precompiling for 9 choices 2025-12-04T12:10:21.7138217Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7138359Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7138406Z Traceback (most recent call last): 2025-12-04T12:10:21.7138561Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7138603Z method(*args, **kwargs) 2025-12-04T12:10:21.7138755Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7138795Z method(*args, **kwargs) 2025-12-04T12:10:21.7138945Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7138994Z with policy(): 2025-12-04T12:10:21.7139148Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7139189Z raise RuntimeError(msg) 2025-12-04T12:10:21.7139587Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7139589Z 2025-12-04T12:10:21.7139662Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7139924Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7139927Z 2025-12-04T12:10:21.7140013Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7140153Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7140195Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7140253Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7140732Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7140830Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7140868Z graph_break [] 2025-12-04T12:10:21.7140928Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7141001Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7141482Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7141531Z current_size = base.storage().size() 2025-12-04T12:10:21.7141571Z Autotune Choices Stats: 2025-12-04T12:10:21.7141944Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.7141992Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7142034Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7142132Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7142364Z triton_mm_5 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7142591Z triton_mm_6 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7142814Z triton_mm_7 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7143053Z triton_mm_3 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7143291Z triton_mm_4 0.0064 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7143520Z triton_mm_1 0.0068 ms 88.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7143743Z triton_mm_2 0.0078 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7143976Z triton_mm_0 0.0100 ms 60.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7144018Z _scaled_mm 0.0231 ms 26.2% 2025-12-04T12:10:21.7144144Z SingleProcess AUTOTUNE benchmarking takes 0.0432 seconds and 0.2484 seconds precompiling for 9 choices 2025-12-04T12:10:21.7144218Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7144258Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7144314Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7144413Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7144889Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7144926Z graph_break [] 2025-12-04T12:10:21.7144988Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 
2025-12-04T12:10:21.7145060Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7145101Z Autotune Choices Stats: 2025-12-04T12:10:21.7145469Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.7145515Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7145558Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7145657Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7145888Z triton_mm_8 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7146112Z triton_mm_15 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7146337Z triton_mm_11 0.0067 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7146561Z triton_mm_12 0.0070 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7146794Z triton_mm_13 0.0080 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7147029Z triton_mm_14 0.0082 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7147255Z triton_mm_9 0.0090 ms 69.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7147491Z triton_mm_10 0.0093 ms 67.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7147531Z _scaled_mm 0.0245 ms 25.4% 2025-12-04T12:10:21.7147659Z SingleProcess AUTOTUNE benchmarking takes 0.0596 seconds and 0.1782 seconds precompiling for 9 choices 2025-12-04T12:10:21.7147732Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7147774Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7147829Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7147928Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7148404Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7148442Z graph_break [] 2025-12-04T12:10:21.7148502Z aten_mm_info [('aten._scaled_mm.default_1_2048_32', 1)] 2025-12-04T12:10:21.7148574Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7148616Z Autotune Choices Stats: 2025-12-04T12:10:21.7148987Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_19", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7149033Z AUTOTUNE scaled_mm(1x32, 32x2048, , ) 2025-12-04T12:10:21.7149073Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7149172Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7149405Z triton_mm_19 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7149633Z triton_mm_20 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7149857Z triton_mm_18 0.0064 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7150079Z triton_mm_23 0.0068 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7150351Z triton_mm_16 0.0068 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7150592Z triton_mm_17 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7150818Z triton_mm_21 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7151039Z triton_mm_22 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7151081Z _scaled_mm 0.0207 ms 29.4% 2025-12-04T12:10:21.7151219Z SingleProcess AUTOTUNE benchmarking takes 0.0606 seconds and 0.2286 seconds precompiling for 9 choices 2025-12-04T12:10:21.7151411Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0bc9d671d4a59081.xml - 2025-12-04T12:10:21.7151473Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7152057Z FAILED [0.7164s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7152061Z 2025-12-04T12:10:21.7152135Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7152397Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7152400Z 2025-12-04T12:10:21.7152487Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7152550Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7152617Z ================== 1 failed, 187 deselected, 2 rerun in 3.97s ================== 2025-12-04T12:10:21.7152654Z Got exit code 1 2025-12-04T12:10:21.7152871Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7153000Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7153148Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c65e81b746cb3fdd.xml 2025-12-04T12:10:21.7153206Z ============================= test session starts ============================== 2025-12-04T12:10:21.7153319Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7153360Z cachedir: .pytest_cache 2025-12-04T12:10:21.7153517Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7153563Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7153603Z configfile: pytest.ini 2025-12-04T12:10:21.7153769Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7153858Z collecting ... 
collected 188 items / 136 deselected / 52 selected 2025-12-04T12:10:21.7153913Z stepcurrent: skipping 136 already run items. 2025-12-04T12:10:21.7153957Z Running 52 items in this shard 2025-12-04T12:10:21.7153960Z 2025-12-04T12:10:21.7154184Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.8440s] [ 1%] 2025-12-04T12:10:21.7154418Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.9748s] [ 1%] 2025-12-04T12:10:21.7154607Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.9118s] [ 1%] 2025-12-04T12:10:21.7154610Z 2025-12-04T12:10:21.7154662Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7154807Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7154854Z Traceback (most recent call last): 2025-12-04T12:10:21.7155020Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7155064Z method(*args, **kwargs) 2025-12-04T12:10:21.7155216Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7155257Z method(*args, **kwargs) 2025-12-04T12:10:21.7155407Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7155445Z with policy(): 2025-12-04T12:10:21.7155599Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7155643Z raise RuntimeError(msg) 
2025-12-04T12:10:21.7156038Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1056964608. 2025-12-04T12:10:21.7156041Z 2025-12-04T12:10:21.7156115Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7156375Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7156378Z 2025-12-04T12:10:21.7156464Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7156548Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7156591Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7156650Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7157137Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7157239Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7157277Z graph_break [] 2025-12-04T12:10:21.7157341Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7157415Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7157898Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7157956Z current_size = base.storage().size() 2025-12-04T12:10:21.7157997Z Autotune Choices Stats: 2025-12-04T12:10:21.7158384Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:21.7158432Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7158475Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7158574Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7158818Z triton_mm_8 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7159048Z triton_mm_14 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7159275Z triton_mm_17 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7159499Z triton_mm_16 0.0067 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7159728Z triton_mm_18 0.0068 ms 94.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7159956Z triton_mm_9 0.0074 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7160216Z triton_mm_15 0.0076 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7160454Z triton_mm_13 0.0080 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7160679Z triton_mm_11 0.0081 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7160905Z triton_mm_6 0.0087 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7161037Z SingleProcess AUTOTUNE benchmarking takes 0.0937 seconds and 0.5234 seconds precompiling for 20 choices 2025-12-04T12:10:21.7161184Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7161231Z 
Traceback (most recent call last): 2025-12-04T12:10:21.7161387Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7161442Z method(*args, **kwargs) 2025-12-04T12:10:21.7161593Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7161634Z method(*args, **kwargs) 2025-12-04T12:10:21.7161784Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7161836Z with policy(): 2025-12-04T12:10:21.7161988Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7162029Z raise RuntimeError(msg) 2025-12-04T12:10:21.7162422Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1056964608 and is now 1113587712. 
2025-12-04T12:10:21.7162425Z 2025-12-04T12:10:21.7162498Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7162772Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7162774Z 2025-12-04T12:10:21.7162862Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7162936Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7162978Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7163035Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7163520Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7163620Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7163658Z graph_break [] 2025-12-04T12:10:21.7163720Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7163795Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7164285Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7164332Z current_size = base.storage().size() 2025-12-04T12:10:21.7164373Z Autotune Choices Stats: 2025-12-04T12:10:21.7164745Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:21.7164794Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7164836Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7164934Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7165168Z triton_mm_8 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7165395Z triton_mm_14 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7165634Z triton_mm_17 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7165867Z triton_mm_16 0.0067 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7166097Z triton_mm_18 0.0068 ms 94.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7166325Z triton_mm_9 0.0074 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7166561Z triton_mm_15 0.0076 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7166787Z triton_mm_13 0.0080 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7167010Z triton_mm_11 0.0081 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7167236Z triton_mm_6 0.0087 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7167367Z SingleProcess AUTOTUNE benchmarking takes 0.0937 seconds and 0.5234 seconds precompiling for 20 choices 2025-12-04T12:10:21.7167439Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7167483Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7167538Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7167637Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7168129Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7168167Z graph_break [] 2025-12-04T12:10:21.7168229Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7168302Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7168343Z Autotune Choices Stats: 2025-12-04T12:10:21.7168705Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_27", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.7168752Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7168795Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7168894Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7169144Z triton_mm_27 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7169368Z triton_mm_33 0.0065 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7169608Z triton_mm_37 0.0068 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7169832Z triton_mm_35 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7170068Z triton_mm_36 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7170330Z triton_mm_28 0.0071 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7170555Z triton_mm_30 0.0079 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7170781Z triton_mm_31 0.0080 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7171004Z triton_mm_34 0.0083 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7171226Z triton_mm_32 0.0084 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7171354Z SingleProcess AUTOTUNE benchmarking takes 0.1443 seconds and 0.3096 seconds precompiling for 20 choices 
2025-12-04T12:10:21.7171408Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7171566Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7171614Z Traceback (most recent call last): 2025-12-04T12:10:21.7171770Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7171814Z method(*args, **kwargs) 2025-12-04T12:10:21.7171965Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7172007Z method(*args, **kwargs) 2025-12-04T12:10:21.7172159Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7172196Z with policy(): 2025-12-04T12:10:21.7172348Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7172388Z raise RuntimeError(msg) 2025-12-04T12:10:21.7172783Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 
2025-12-04T12:10:21.7172799Z 2025-12-04T12:10:21.7172873Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7173134Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7173152Z 2025-12-04T12:10:21.7173238Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7173312Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7173353Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7173411Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7173906Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7174005Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7174043Z graph_break [] 2025-12-04T12:10:21.7174105Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7174179Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7174660Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7174708Z current_size = base.storage().size() 2025-12-04T12:10:21.7174748Z Autotune Choices Stats: 2025-12-04T12:10:21.7175114Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:21.7175163Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7175204Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7175303Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7175548Z triton_mm_8 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7175777Z triton_mm_14 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7176002Z triton_mm_17 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7176226Z triton_mm_16 0.0067 ms 95.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7176452Z triton_mm_18 0.0068 ms 94.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7176691Z triton_mm_9 0.0074 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7176915Z triton_mm_15 0.0076 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7177152Z triton_mm_13 0.0080 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7177375Z triton_mm_11 0.0081 ms 78.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7177614Z triton_mm_6 0.0087 ms 73.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7177744Z SingleProcess AUTOTUNE benchmarking takes 0.0937 seconds and 0.5234 seconds precompiling for 20 choices 2025-12-04T12:10:21.7177817Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7177859Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7177915Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7178013Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7178495Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7178533Z graph_break [] 2025-12-04T12:10:21.7178594Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7178667Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7178708Z Autotune Choices Stats: 2025-12-04T12:10:21.7179077Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_27", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.7179125Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7179165Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7179265Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7179496Z triton_mm_27 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7179721Z triton_mm_33 0.0065 ms 94.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7179950Z triton_mm_37 0.0068 ms 90.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7180207Z triton_mm_35 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7180448Z triton_mm_36 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7180690Z triton_mm_28 0.0071 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7180913Z triton_mm_30 0.0079 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7181137Z triton_mm_31 0.0080 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7181372Z triton_mm_34 0.0083 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7181596Z triton_mm_32 0.0084 ms 72.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7181723Z SingleProcess AUTOTUNE benchmarking takes 0.1443 seconds and 0.3096 seconds precompiling for 20 choices 
2025-12-04T12:10:21.7181798Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7181841Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7181897Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7181995Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7182476Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7182517Z graph_break [] 2025-12-04T12:10:21.7182577Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7182650Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7182706Z Autotune Choices Stats: 2025-12-04T12:10:21.7183070Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_46", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:21.7183117Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7183160Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7183258Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7183490Z triton_mm_46 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=4 2025-12-04T12:10:21.7183716Z triton_mm_55 0.0062 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7183758Z _scaled_mm 0.0063 ms 93.7% 2025-12-04T12:10:21.7183994Z triton_mm_52 0.0064 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7184214Z triton_mm_54 0.0064 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7184454Z triton_mm_56 0.0070 ms 84.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7184678Z triton_mm_53 0.0074 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7184919Z triton_mm_47 0.0075 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7185147Z triton_mm_50 0.0076 ms 78.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7185370Z triton_mm_49 0.0079 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7185500Z SingleProcess AUTOTUNE benchmarking takes 0.1273 seconds and 0.2624 seconds precompiling for 20 choices 2025-12-04T12:10:21.7185690Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c65e81b746cb3fdd.xml - 2025-12-04T12:10:21.7185753Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7186341Z FAILED [0.9118s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 2025-12-04T12:10:21.7186346Z 2025-12-04T12:10:21.7186420Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7186689Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7186692Z 2025-12-04T12:10:21.7186780Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7186842Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7186910Z ================== 1 failed, 136 deselected, 2 rerun in 4.75s ================== 2025-12-04T12:10:21.7186949Z Got exit code 1 2025-12-04T12:10:21.7186988Z Retrying single test... 
2025-12-04T12:10:21.7187131Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7092ed04f830ac2b.xml 2025-12-04T12:10:21.7187187Z ============================= test session starts ============================== 2025-12-04T12:10:21.7187302Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7187344Z cachedir: .pytest_cache 2025-12-04T12:10:21.7187502Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7187560Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7187601Z configfile: pytest.ini 2025-12-04T12:10:21.7187765Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7187842Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7188106Z stepcurrent: skipping 136 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7188152Z Running 1 items in this shard 2025-12-04T12:10:21.7188154Z 2025-12-04T12:10:21.7188372Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.6711s] [100%] 2025-12-04T12:10:21.7188586Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.9852s] [100%] 2025-12-04T12:10:21.7188789Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.9187s] [100%] 2025-12-04T12:10:21.7188792Z 2025-12-04T12:10:21.7188844Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7188991Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7189036Z Traceback (most recent call last): 2025-12-04T12:10:21.7189194Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7189236Z method(*args, **kwargs) 2025-12-04T12:10:21.7189388Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7189428Z method(*args, **kwargs) 2025-12-04T12:10:21.7189581Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7189619Z with policy(): 2025-12-04T12:10:21.7189772Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7189814Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7190240Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1056964608. 2025-12-04T12:10:21.7190257Z 2025-12-04T12:10:21.7190332Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7190591Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7190594Z 2025-12-04T12:10:21.7190680Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7190753Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7190797Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7190854Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7191341Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7191455Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7191491Z graph_break [] 2025-12-04T12:10:21.7191554Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7191626Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7192108Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7192171Z current_size = base.storage().size() 2025-12-04T12:10:21.7192214Z Autotune Choices Stats: 2025-12-04T12:10:21.7192594Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7192644Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7192685Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7192787Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7193021Z triton_mm_17 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7193249Z triton_mm_8 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7193474Z triton_mm_14 0.0064 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7193701Z triton_mm_18 0.0070 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7193928Z triton_mm_9 0.0070 ms 85.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7194164Z triton_mm_16 0.0074 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7194390Z triton_mm_12 0.0075 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7194613Z triton_mm_11 0.0079 ms 75.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7194836Z triton_mm_13 0.0081 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7195058Z triton_mm_15 0.0086 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7195198Z SingleProcess AUTOTUNE benchmarking takes 0.0885 seconds and 0.3863 seconds precompiling for 20 choices 2025-12-04T12:10:21.7195346Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7195392Z 
Traceback (most recent call last): 2025-12-04T12:10:21.7195548Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7195599Z method(*args, **kwargs) 2025-12-04T12:10:21.7195752Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7195792Z method(*args, **kwargs) 2025-12-04T12:10:21.7195944Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7195982Z with policy(): 2025-12-04T12:10:21.7196133Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7196175Z raise RuntimeError(msg) 2025-12-04T12:10:21.7196582Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1056964608 and is now 1113587712. 
2025-12-04T12:10:21.7196585Z 2025-12-04T12:10:21.7196658Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7196918Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7196920Z 2025-12-04T12:10:21.7197007Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7197079Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7197124Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7197180Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7197665Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7197765Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7197800Z graph_break [] 2025-12-04T12:10:21.7197871Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7197944Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7198428Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7198476Z current_size = base.storage().size() 2025-12-04T12:10:21.7198518Z Autotune Choices Stats: 2025-12-04T12:10:21.7198883Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7198930Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7198971Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7199081Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7199319Z triton_mm_17 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7199556Z triton_mm_8 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7199782Z triton_mm_14 0.0064 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7200010Z triton_mm_18 0.0070 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7200290Z triton_mm_9 0.0070 ms 85.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7200520Z triton_mm_16 0.0074 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7200746Z triton_mm_12 0.0075 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7200972Z triton_mm_11 0.0079 ms 75.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7201197Z triton_mm_13 0.0081 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7201422Z triton_mm_15 0.0086 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7201551Z SingleProcess AUTOTUNE benchmarking takes 0.0885 seconds and 0.3863 seconds precompiling for 20 choices 2025-12-04T12:10:21.7201637Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7201679Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7201739Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7201839Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7202321Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7202359Z graph_break [] 2025-12-04T12:10:21.7202420Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7202494Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7202534Z Autotune Choices Stats: 2025-12-04T12:10:21.7202898Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7202956Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7202998Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7203108Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7203341Z triton_mm_36 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7203567Z triton_mm_35 0.0066 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7203803Z triton_mm_27 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7204033Z triton_mm_37 0.0067 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7204261Z triton_mm_28 0.0074 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7204303Z _scaled_mm 0.0076 ms 78.9% 2025-12-04T12:10:21.7204527Z triton_mm_34 0.0078 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7204755Z triton_mm_30 0.0078 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7204982Z triton_mm_25 0.0085 ms 70.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7205206Z triton_mm_33 0.0091 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7205344Z SingleProcess AUTOTUNE benchmarking takes 0.1214 seconds and 0.2582 seconds precompiling for 20 choices 2025-12-04T12:10:21.7205397Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7205547Z _ 
TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7205595Z Traceback (most recent call last): 2025-12-04T12:10:21.7205752Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7205793Z method(*args, **kwargs) 2025-12-04T12:10:21.7205945Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7205985Z method(*args, **kwargs) 2025-12-04T12:10:21.7206136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7206174Z with policy(): 2025-12-04T12:10:21.7206329Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7206380Z raise RuntimeError(msg) 2025-12-04T12:10:21.7206776Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 
2025-12-04T12:10:21.7206788Z 2025-12-04T12:10:21.7206862Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7207121Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7207123Z 2025-12-04T12:10:21.7207211Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7207283Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7207327Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7207383Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7207875Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7207975Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7208011Z graph_break [] 2025-12-04T12:10:21.7208073Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7208146Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7208627Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7208674Z current_size = base.storage().size() 2025-12-04T12:10:21.7208716Z Autotune Choices Stats: 2025-12-04T12:10:21.7209083Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7209142Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7209185Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7209285Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7209520Z triton_mm_17 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7209748Z triton_mm_8 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7209972Z triton_mm_14 0.0064 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7210267Z triton_mm_18 0.0070 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7210510Z triton_mm_9 0.0070 ms 85.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7210732Z triton_mm_16 0.0074 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7210982Z triton_mm_12 0.0075 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7211206Z triton_mm_11 0.0079 ms 75.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7211441Z triton_mm_13 0.0081 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7211665Z triton_mm_15 0.0086 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7211793Z SingleProcess AUTOTUNE benchmarking takes 0.0885 seconds and 0.3863 seconds precompiling for 20 choices 2025-12-04T12:10:21.7211867Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7211909Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7211965Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7212063Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7212548Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7212588Z graph_break [] 2025-12-04T12:10:21.7212648Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7212722Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7212761Z Autotune Choices Stats: 2025-12-04T12:10:21.7213139Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7213188Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7213229Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7213327Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7213560Z triton_mm_36 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7213783Z triton_mm_35 0.0066 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7214009Z triton_mm_27 0.0066 ms 90.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7214250Z triton_mm_37 0.0067 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7214489Z triton_mm_28 0.0074 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7214531Z _scaled_mm 0.0076 ms 78.9% 2025-12-04T12:10:21.7214755Z triton_mm_34 0.0078 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7214993Z triton_mm_30 0.0078 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7215224Z triton_mm_25 0.0085 ms 70.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7215447Z triton_mm_33 0.0091 ms 66.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7215576Z SingleProcess AUTOTUNE benchmarking takes 0.1214 seconds and 0.2582 seconds precompiling for 20 choices 2025-12-04T12:10:21.7215648Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7215692Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7215749Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7215850Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7216331Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7216370Z graph_break [] 2025-12-04T12:10:21.7216430Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7216515Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7216555Z Autotune Choices Stats: 2025-12-04T12:10:21.7216915Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_54", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.7216963Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7217004Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7217103Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7217333Z triton_mm_54 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7217562Z triton_mm_46 0.0065 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7217802Z triton_mm_47 0.0068 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7218030Z triton_mm_55 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7218270Z triton_mm_56 0.0070 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7218493Z triton_mm_50 0.0076 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7218728Z triton_mm_53 0.0076 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7218955Z triton_mm_44 0.0082 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7219182Z triton_mm_49 0.0082 ms 75.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7219408Z triton_mm_51 0.0084 ms 73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7219537Z SingleProcess AUTOTUNE benchmarking takes 0.1393 seconds and 0.2757 seconds precompiling for 20 choices 2025-12-04T12:10:21.7219729Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7092ed04f830ac2b.xml - 2025-12-04T12:10:21.7219790Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7220431Z FAILED [0.9187s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 2025-12-04T12:10:21.7220435Z 2025-12-04T12:10:21.7220508Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7220768Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7220771Z 2025-12-04T12:10:21.7220859Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7220920Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7220989Z ================== 1 failed, 187 deselected, 2 rerun in 4.60s ================== 2025-12-04T12:10:21.7221026Z Got exit code 1 2025-12-04T12:10:21.7221067Z Retrying single test... 
2025-12-04T12:10:21.7221211Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6b498a2f0064e88e.xml 2025-12-04T12:10:21.7221268Z ============================= test session starts ============================== 2025-12-04T12:10:21.7221394Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7221437Z cachedir: .pytest_cache 2025-12-04T12:10:21.7221594Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7221657Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7221697Z configfile: pytest.ini 2025-12-04T12:10:21.7221861Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7221936Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7222192Z stepcurrent: skipping 136 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7222235Z Running 1 items in this shard 2025-12-04T12:10:21.7222238Z 2025-12-04T12:10:21.7222468Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.7588s] [100%] 2025-12-04T12:10:21.7222682Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0171s] [100%] 2025-12-04T12:10:21.7222875Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.9056s] [100%] 2025-12-04T12:10:21.7222877Z 2025-12-04T12:10:21.7222929Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7223076Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7223122Z Traceback (most recent call last): 2025-12-04T12:10:21.7223279Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7223321Z method(*args, **kwargs) 2025-12-04T12:10:21.7223475Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7223516Z method(*args, **kwargs) 2025-12-04T12:10:21.7223668Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7223706Z with policy(): 2025-12-04T12:10:21.7223857Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7223899Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7224301Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1056964608. 2025-12-04T12:10:21.7224306Z 2025-12-04T12:10:21.7224381Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7224640Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7224643Z 2025-12-04T12:10:21.7224729Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7224804Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7224848Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7224905Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7225389Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7225509Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7225557Z graph_break [] 2025-12-04T12:10:21.7225620Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7225692Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7226176Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7226225Z current_size = base.storage().size() 2025-12-04T12:10:21.7226265Z Autotune Choices Stats: 2025-12-04T12:10:21.7226642Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:21.7226689Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7226731Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7226830Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7227063Z triton_mm_14 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7227290Z triton_mm_16 0.0066 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7227521Z triton_mm_8 0.0066 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7227753Z triton_mm_18 0.0068 ms 95.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7227989Z triton_mm_17 0.0069 ms 94.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7228219Z triton_mm_9 0.0072 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7228442Z triton_mm_11 0.0080 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7228665Z triton_mm_12 0.0081 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7228891Z triton_mm_13 0.0081 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7229128Z triton_mm_6 0.0085 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7229257Z SingleProcess AUTOTUNE benchmarking takes 0.0897 seconds and 0.3973 seconds precompiling for 20 choices 2025-12-04T12:10:21.7229413Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7229459Z 
Traceback (most recent call last): 2025-12-04T12:10:21.7229614Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7229656Z method(*args, **kwargs) 2025-12-04T12:10:21.7229806Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7229848Z method(*args, **kwargs) 2025-12-04T12:10:21.7229998Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7230047Z with policy(): 2025-12-04T12:10:21.7230234Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7230276Z raise RuntimeError(msg) 2025-12-04T12:10:21.7230665Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1056964608 and is now 1113587712. 
2025-12-04T12:10:21.7230668Z 2025-12-04T12:10:21.7230742Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7231001Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7231004Z 2025-12-04T12:10:21.7231091Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7231163Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7231207Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7231265Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7231763Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7231862Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7231902Z graph_break [] 2025-12-04T12:10:21.7231963Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7232036Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7232521Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7232568Z current_size = base.storage().size() 2025-12-04T12:10:21.7232608Z Autotune Choices Stats: 2025-12-04T12:10:21.7232972Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:21.7233036Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7233080Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7233177Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7233422Z triton_mm_14 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7233648Z triton_mm_16 0.0066 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7233874Z triton_mm_8 0.0066 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7234117Z triton_mm_18 0.0068 ms 95.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7234348Z triton_mm_17 0.0069 ms 94.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7234578Z triton_mm_9 0.0072 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7234803Z triton_mm_11 0.0080 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7235027Z triton_mm_12 0.0081 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7235250Z triton_mm_13 0.0081 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7235487Z triton_mm_6 0.0085 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7235616Z SingleProcess AUTOTUNE benchmarking takes 0.0897 seconds and 0.3973 seconds precompiling for 20 choices 2025-12-04T12:10:21.7235689Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7235733Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7235789Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7235889Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7236377Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7236413Z graph_break [] 2025-12-04T12:10:21.7236477Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7236564Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7236605Z Autotune Choices Stats: 2025-12-04T12:10:21.7236967Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00595899997279048, "best_triton_pos": 0} 2025-12-04T12:10:21.7237024Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7237065Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7237164Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7237396Z triton_mm_36 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7237635Z triton_mm_33 0.0064 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7237864Z triton_mm_27 0.0066 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7237907Z _scaled_mm 0.0068 ms 88.2% 2025-12-04T12:10:21.7238132Z triton_mm_35 0.0068 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7238361Z triton_mm_28 0.0070 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7238591Z triton_mm_37 0.0072 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7238814Z triton_mm_32 0.0080 ms 74.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7239056Z triton_mm_30 0.0081 ms 73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7239284Z triton_mm_25 0.0085 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7239414Z SingleProcess AUTOTUNE benchmarking takes 0.1238 seconds and 0.2581 seconds precompiling for 20 choices 2025-12-04T12:10:21.7239468Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7239615Z _ 
TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7239663Z Traceback (most recent call last): 2025-12-04T12:10:21.7239818Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7239860Z method(*args, **kwargs) 2025-12-04T12:10:21.7240012Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7240053Z method(*args, **kwargs) 2025-12-04T12:10:21.7240236Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7240290Z with policy(): 2025-12-04T12:10:21.7240444Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7240486Z raise RuntimeError(msg) 2025-12-04T12:10:21.7240894Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 
2025-12-04T12:10:21.7240897Z 2025-12-04T12:10:21.7240970Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7241230Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7241233Z 2025-12-04T12:10:21.7241320Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7241404Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7241447Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7241504Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7241987Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7242087Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7242124Z graph_break [] 2025-12-04T12:10:21.7242186Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7242262Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7242744Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7242794Z current_size = base.storage().size() 2025-12-04T12:10:21.7242833Z Autotune Choices Stats: 2025-12-04T12:10:21.7243210Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_14", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006519999820739031, "best_triton_pos": 0} 2025-12-04T12:10:21.7243258Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7243303Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7243402Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7243638Z triton_mm_14 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7243863Z triton_mm_16 0.0066 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7244088Z triton_mm_8 0.0066 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7244329Z triton_mm_18 0.0068 ms 95.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7244566Z triton_mm_17 0.0069 ms 94.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7244795Z triton_mm_9 0.0072 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7245021Z triton_mm_11 0.0080 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7245255Z triton_mm_12 0.0081 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7245480Z triton_mm_13 0.0081 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7245708Z triton_mm_6 0.0085 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7245837Z SingleProcess AUTOTUNE benchmarking takes 0.0897 seconds and 0.3973 seconds precompiling for 20 choices 2025-12-04T12:10:21.7245910Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7245953Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7246010Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7246109Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7246591Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7246638Z graph_break [] 2025-12-04T12:10:21.7246701Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7246774Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7246816Z Autotune Choices Stats: 2025-12-04T12:10:21.7247180Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00595899997279048, "best_triton_pos": 0} 2025-12-04T12:10:21.7247228Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7247269Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7247367Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7247600Z triton_mm_36 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7247825Z triton_mm_33 0.0064 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7248062Z triton_mm_27 0.0066 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7248114Z _scaled_mm 0.0068 ms 88.2% 2025-12-04T12:10:21.7248337Z triton_mm_35 0.0068 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7248564Z triton_mm_28 0.0070 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7248803Z triton_mm_37 0.0072 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7249028Z triton_mm_32 0.0080 ms 74.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7249252Z triton_mm_30 0.0081 ms 73.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7249481Z triton_mm_25 0.0085 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7249610Z SingleProcess AUTOTUNE benchmarking takes 0.1238 seconds and 0.2581 seconds precompiling for 20 choices 2025-12-04T12:10:21.7249683Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7249724Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7249783Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7249882Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7250402Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7250439Z graph_break [] 2025-12-04T12:10:21.7250502Z aten_mm_info [('aten._scaled_mm.default_257_16_1024', 1)] 2025-12-04T12:10:21.7250575Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7250616Z Autotune Choices Stats: 2025-12-04T12:10:21.7250978Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_52", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.7251026Z AUTOTUNE scaled_mm(257x1024, 1024x16, , ) 2025-12-04T12:10:21.7251071Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7251169Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7251398Z triton_mm_52 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7251638Z triton_mm_54 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7251880Z triton_mm_55 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7252106Z triton_mm_46 0.0067 ms 97.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7252336Z triton_mm_56 0.0074 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7252578Z triton_mm_49 0.0076 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7252803Z triton_mm_53 0.0079 ms 81.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7253029Z triton_mm_51 0.0081 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7253256Z triton_mm_44 0.0083 ms 77.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7253484Z triton_mm_48 0.0092 ms 70.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7253613Z SingleProcess AUTOTUNE benchmarking takes 0.1484 seconds and 0.2438 seconds precompiling for 20 choices 2025-12-04T12:10:21.7253800Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6b498a2f0064e88e.xml - 2025-12-04T12:10:21.7253862Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7254462Z FAILED [0.9056s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 2025-12-04T12:10:21.7254466Z 2025-12-04T12:10:21.7254541Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7254801Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7254804Z 2025-12-04T12:10:21.7254890Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7254956Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7255028Z ================== 1 failed, 187 deselected, 2 rerun in 4.70s ================== 2025-12-04T12:10:21.7255079Z Got exit code 1 2025-12-04T12:10:21.7255287Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7255415Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7255569Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b75f227c5f664f4c.xml 2025-12-04T12:10:21.7255627Z ============================= test session starts ============================== 2025-12-04T12:10:21.7258173Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7258221Z cachedir: .pytest_cache 2025-12-04T12:10:21.7258385Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7258434Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7258476Z configfile: pytest.ini 2025-12-04T12:10:21.7258642Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7258753Z collecting ... collected 188 items / 137 deselected / 51 selected 2025-12-04T12:10:21.7258810Z stepcurrent: skipping 137 already run items. 
2025-12-04T12:10:21.7258857Z Running 51 items in this shard 2025-12-04T12:10:21.7258859Z 2025-12-04T12:10:21.7259085Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [3.1526s] [ 1%] 2025-12-04T12:10:21.7259302Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.3689s] [ 1%] 2025-12-04T12:10:21.7259495Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.1925s] [ 1%] 2025-12-04T12:10:21.7259499Z 2025-12-04T12:10:21.7259552Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7259700Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7259748Z Traceback (most recent call last): 2025-12-04T12:10:21.7259910Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7259952Z method(*args, **kwargs) 2025-12-04T12:10:21.7260167Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7260210Z method(*args, **kwargs) 2025-12-04T12:10:21.7260382Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7260421Z with policy(): 2025-12-04T12:10:21.7260573Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7260615Z raise RuntimeError(msg) 2025-12-04T12:10:21.7261009Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1115684864. 2025-12-04T12:10:21.7261013Z 2025-12-04T12:10:21.7261089Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7261354Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7261357Z 2025-12-04T12:10:21.7261443Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7261531Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7261574Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7261635Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7262124Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7262239Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7262276Z graph_break [] 2025-12-04T12:10:21.7262342Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7262415Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7262913Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7262965Z current_size = base.storage().size() 2025-12-04T12:10:21.7263007Z Autotune Choices Stats: 2025-12-04T12:10:21.7263378Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009560000151395798, "best_triton_pos": 0} 2025-12-04T12:10:21.7263427Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7263471Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7263571Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7263809Z triton_mm_34 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7264035Z triton_mm_29 0.0096 ms 99.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7264273Z triton_mm_30 0.0106 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7264501Z triton_mm_16 0.0110 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:21.7264723Z triton_mm_22 0.0112 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7264949Z triton_mm_33 0.0112 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7265171Z triton_mm_21 0.0113 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7265408Z triton_mm_23 0.0114 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7265634Z triton_mm_15 0.0120 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7265867Z triton_mm_14 0.0124 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7265997Z SingleProcess AUTOTUNE benchmarking takes 0.1769 seconds and 0.7842 seconds precompiling for 39 choices 2025-12-04T12:10:21.7266146Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7266195Z Traceback (most recent call last): 2025-12-04T12:10:21.7266361Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7266402Z method(*args, **kwargs) 2025-12-04T12:10:21.7266556Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7266597Z method(*args, **kwargs) 2025-12-04T12:10:21.7266746Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7266784Z with policy(): 2025-12-04T12:10:21.7266937Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7266978Z raise RuntimeError(msg) 2025-12-04T12:10:21.7267376Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1115684864 and is now 1212153856. 
2025-12-04T12:10:21.7267380Z 2025-12-04T12:10:21.7267453Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7267718Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7267720Z 2025-12-04T12:10:21.7267806Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7267880Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7267931Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7267990Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7268475Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7268576Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7268613Z graph_break [] 2025-12-04T12:10:21.7268677Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7268750Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7269235Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7269297Z current_size = base.storage().size() 2025-12-04T12:10:21.7269339Z Autotune Choices Stats: 2025-12-04T12:10:21.7269706Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009560000151395798, "best_triton_pos": 0} 2025-12-04T12:10:21.7269772Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7269815Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7269914Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7270189Z triton_mm_34 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7270430Z triton_mm_29 0.0096 ms 99.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7270656Z triton_mm_30 0.0106 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7270880Z triton_mm_16 0.0110 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7271103Z triton_mm_22 0.0112 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7271330Z triton_mm_33 0.0112 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7271554Z triton_mm_21 0.0113 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7271792Z triton_mm_23 0.0114 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7272017Z triton_mm_15 0.0120 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7272242Z triton_mm_14 0.0124 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7272373Z SingleProcess AUTOTUNE benchmarking takes 0.1769 seconds and 0.7842 seconds precompiling for 39 choices 2025-12-04T12:10:21.7272445Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7272488Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7272544Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7272646Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7273133Z inductor [('triton_bundler_save_kernel', 312), 
('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7273185Z graph_break [] 2025-12-04T12:10:21.7273261Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7273333Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7273374Z Autotune Choices Stats: 2025-12-04T12:10:21.7273732Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_67", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009119999594986439, "best_triton_pos": 0} 2025-12-04T12:10:21.7273781Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7273822Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7273936Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7274165Z triton_mm_67 0.0091 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7274394Z triton_mm_72 0.0097 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7274619Z triton_mm_71 0.0099 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7274844Z triton_mm_54 0.0106 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7275069Z triton_mm_59 0.0107 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7275291Z triton_mm_60 0.0109 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7275523Z triton_mm_68 0.0111 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7275749Z triton_mm_61 0.0115 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7275974Z triton_mm_53 0.0116 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7276200Z triton_mm_69 0.0118 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7276328Z SingleProcess AUTOTUNE benchmarking takes 0.2678 seconds and 0.5367 seconds precompiling for 39 choices 
2025-12-04T12:10:21.7276381Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7276539Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7276586Z Traceback (most recent call last): 2025-12-04T12:10:21.7276742Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7276794Z method(*args, **kwargs) 2025-12-04T12:10:21.7276945Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7276986Z method(*args, **kwargs) 2025-12-04T12:10:21.7277135Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7277172Z with policy(): 2025-12-04T12:10:21.7277324Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7277365Z raise RuntimeError(msg) 2025-12-04T12:10:21.7277769Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1212153856 and is now 1308622848. 
2025-12-04T12:10:21.7277773Z 2025-12-04T12:10:21.7277847Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7278111Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7278114Z 2025-12-04T12:10:21.7278199Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7278272Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7278314Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7278372Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7278853Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7278952Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7278989Z graph_break [] 2025-12-04T12:10:21.7279052Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7279136Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7279619Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7279667Z current_size = base.storage().size() 2025-12-04T12:10:21.7279708Z Autotune Choices Stats: 2025-12-04T12:10:21.7280075Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009560000151395798, "best_triton_pos": 0} 2025-12-04T12:10:21.7280161Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7280203Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7280302Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7280553Z triton_mm_34 0.0096 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7280779Z triton_mm_29 0.0096 ms 99.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7281016Z triton_mm_30 0.0106 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7281241Z triton_mm_16 0.0110 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7281477Z triton_mm_22 0.0112 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7281703Z triton_mm_33 0.0112 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7281927Z triton_mm_21 0.0113 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7282153Z triton_mm_23 0.0114 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7282378Z triton_mm_15 0.0120 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7282602Z triton_mm_14 0.0124 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7282731Z SingleProcess AUTOTUNE benchmarking takes 0.1769 seconds and 0.7842 seconds precompiling for 39 choices 2025-12-04T12:10:21.7282803Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7282859Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7282915Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7283014Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7283499Z inductor [('triton_bundler_save_kernel', 312), 
('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7283538Z graph_break [] 2025-12-04T12:10:21.7283601Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7283673Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7283713Z Autotune Choices Stats: 2025-12-04T12:10:21.7284073Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_67", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009119999594986439, "best_triton_pos": 0} 2025-12-04T12:10:21.7284137Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7284179Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7284277Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7284516Z triton_mm_67 0.0091 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7284743Z triton_mm_72 0.0097 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7284967Z triton_mm_71 0.0099 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7285200Z triton_mm_54 0.0106 ms 86.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7285430Z triton_mm_59 0.0107 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7285652Z triton_mm_60 0.0109 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7285875Z triton_mm_68 0.0111 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7286101Z triton_mm_61 0.0115 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7286326Z triton_mm_53 0.0116 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7286560Z triton_mm_69 0.0118 ms 77.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7286687Z SingleProcess AUTOTUNE benchmarking takes 0.2678 seconds and 0.5367 seconds precompiling for 39 choices 
2025-12-04T12:10:21.7286761Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7286802Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7286860Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7286958Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7287444Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7287481Z graph_break [] 2025-12-04T12:10:21.7287545Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7287619Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7287670Z Autotune Choices Stats: 2025-12-04T12:10:21.7288030Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_105", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009479999542236328, "best_triton_pos": 0} 2025-12-04T12:10:21.7288088Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7288130Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7288227Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7288460Z triton_mm_105 0.0095 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=2 2025-12-04T12:10:21.7288698Z triton_mm_110 0.0096 ms 99.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7288926Z triton_mm_109 0.0096 ms 98.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7288970Z _scaled_mm 0.0101 ms 93.7% 2025-12-04T12:10:21.7289195Z triton_mm_92 0.0103 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7289423Z triton_mm_106 0.0108 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7289648Z triton_mm_98 0.0109 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7289872Z triton_mm_97 0.0112 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7290134Z triton_mm_99 0.0116 ms 82.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7290375Z triton_mm_91 0.0117 ms 81.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7290505Z SingleProcess AUTOTUNE benchmarking takes 0.2676 seconds and 0.3657 seconds precompiling for 39 choices 2025-12-04T12:10:21.7290695Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b75f227c5f664f4c.xml - 2025-12-04T12:10:21.7290756Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7291351Z FAILED [1.1925s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1212153856 and is now 1308622848. 2025-12-04T12:10:21.7291354Z 2025-12-04T12:10:21.7291442Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7291710Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7291713Z 2025-12-04T12:10:21.7291800Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7291881Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7291949Z ================== 1 failed, 137 deselected, 2 rerun in 5.73s ================== 2025-12-04T12:10:21.7291987Z Got exit code 1 2025-12-04T12:10:21.7292026Z Retrying single test... 
2025-12-04T12:10:21.7292171Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-133ffb51fe566405.xml 2025-12-04T12:10:21.7292228Z ============================= test session starts ============================== 2025-12-04T12:10:21.7292342Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7292383Z cachedir: .pytest_cache 2025-12-04T12:10:21.7292555Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7292601Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7292642Z configfile: pytest.ini 2025-12-04T12:10:21.7292806Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7292883Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7293140Z stepcurrent: skipping 137 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7293185Z Running 1 items in this shard 2025-12-04T12:10:21.7293187Z 2025-12-04T12:10:21.7293406Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [3.4735s] [100%] 2025-12-04T12:10:21.7293624Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.4164s] [100%] 2025-12-04T12:10:21.7293820Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.2189s] [100%] 2025-12-04T12:10:21.7293823Z 2025-12-04T12:10:21.7293876Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7294034Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7294080Z Traceback (most recent call last): 2025-12-04T12:10:21.7294239Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7294281Z method(*args, **kwargs) 2025-12-04T12:10:21.7294434Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7294473Z method(*args, **kwargs) 2025-12-04T12:10:21.7294625Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7294661Z with policy(): 2025-12-04T12:10:21.7294812Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7294852Z 
raise RuntimeError(msg) 2025-12-04T12:10:21.7295246Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1115684864. 2025-12-04T12:10:21.7295260Z 2025-12-04T12:10:21.7295335Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7295599Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7295611Z 2025-12-04T12:10:21.7295699Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7295771Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7295814Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7295872Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7296368Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7296468Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7296506Z graph_break [] 2025-12-04T12:10:21.7296569Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7296641Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7297122Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7297169Z current_size = base.storage().size() 2025-12-04T12:10:21.7297210Z Autotune Choices Stats: 2025-12-04T12:10:21.7297577Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00863999966531992, "best_triton_pos": 0} 2025-12-04T12:10:21.7297627Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7297667Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7297767Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7298018Z triton_mm_29 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7298252Z triton_mm_34 0.0094 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7298476Z triton_mm_30 0.0101 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7298702Z triton_mm_33 0.0104 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7298926Z triton_mm_21 0.0107 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7299158Z triton_mm_22 0.0111 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7299383Z triton_mm_23 0.0114 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7299620Z triton_mm_15 0.0115 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7299847Z triton_mm_35 0.0118 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7300086Z triton_mm_31 0.0121 ms 71.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7300250Z SingleProcess AUTOTUNE benchmarking takes 0.1823 seconds and 0.9156 seconds precompiling for 39 choices 2025-12-04T12:10:21.7300399Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 
2025-12-04T12:10:21.7300445Z Traceback (most recent call last): 2025-12-04T12:10:21.7300599Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7300640Z method(*args, **kwargs) 2025-12-04T12:10:21.7300792Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7300833Z method(*args, **kwargs) 2025-12-04T12:10:21.7300983Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7301020Z with policy(): 2025-12-04T12:10:21.7301172Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7301214Z raise RuntimeError(msg) 2025-12-04T12:10:21.7301611Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1115684864 and is now 1212153856. 
2025-12-04T12:10:21.7301613Z 2025-12-04T12:10:21.7301699Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7301963Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7301966Z 2025-12-04T12:10:21.7302055Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7302127Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7302171Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7302227Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7302715Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7302812Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7302867Z graph_break [] 2025-12-04T12:10:21.7302929Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7303003Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7303485Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7303545Z current_size = base.storage().size() 2025-12-04T12:10:21.7303585Z Autotune Choices Stats: 2025-12-04T12:10:21.7303949Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00863999966531992, "best_triton_pos": 0} 2025-12-04T12:10:21.7304010Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7304052Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7304151Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7304382Z triton_mm_29 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7304613Z triton_mm_34 0.0094 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7304837Z triton_mm_30 0.0101 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7305062Z triton_mm_33 0.0104 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7305285Z triton_mm_21 0.0107 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7305520Z triton_mm_22 0.0111 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7305745Z triton_mm_23 0.0114 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7305971Z triton_mm_15 0.0115 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7306198Z triton_mm_35 0.0118 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7306427Z triton_mm_31 0.0121 ms 71.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7306555Z SingleProcess AUTOTUNE benchmarking takes 0.1823 seconds and 0.9156 seconds precompiling for 39 choices 2025-12-04T12:10:21.7306639Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7306682Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7306740Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7306838Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7307334Z inductor [('triton_bundler_save_kernel', 312), 
('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7307372Z graph_break [] 2025-12-04T12:10:21.7307435Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7307508Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7307548Z Autotune Choices Stats: 2025-12-04T12:10:21.7307917Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_67", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009239999577403069, "best_triton_pos": 0} 2025-12-04T12:10:21.7307966Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7308009Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7308107Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7308338Z triton_mm_67 0.0092 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7308568Z triton_mm_71 0.0099 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7308792Z triton_mm_54 0.0104 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7309015Z triton_mm_60 0.0108 ms 85.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7309246Z triton_mm_59 0.0110 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7309476Z triton_mm_72 0.0110 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7309699Z triton_mm_68 0.0111 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7309924Z triton_mm_53 0.0113 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7310188Z triton_mm_61 0.0115 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7310427Z triton_mm_69 0.0118 ms 78.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7310556Z SingleProcess AUTOTUNE benchmarking takes 0.2756 seconds and 0.5168 seconds precompiling for 39 choices 
2025-12-04T12:10:21.7310621Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7310769Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7310815Z Traceback (most recent call last): 2025-12-04T12:10:21.7310972Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7311013Z method(*args, **kwargs) 2025-12-04T12:10:21.7311166Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7311206Z method(*args, **kwargs) 2025-12-04T12:10:21.7311376Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7311414Z with policy(): 2025-12-04T12:10:21.7311567Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7311607Z raise RuntimeError(msg) 2025-12-04T12:10:21.7312001Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1212153856 and is now 1308622848. 
2025-12-04T12:10:21.7312003Z 2025-12-04T12:10:21.7312078Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7312341Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7312344Z 2025-12-04T12:10:21.7312430Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7312504Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7312546Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7312601Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7313106Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7313207Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7313244Z graph_break [] 2025-12-04T12:10:21.7313307Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7313379Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7313861Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7313907Z current_size = base.storage().size() 2025-12-04T12:10:21.7313948Z Autotune Choices Stats: 2025-12-04T12:10:21.7314311Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00863999966531992, "best_triton_pos": 0} 2025-12-04T12:10:21.7314382Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7314434Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7314534Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7314764Z triton_mm_29 0.0086 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7314995Z triton_mm_34 0.0094 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7315229Z triton_mm_30 0.0101 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7315455Z triton_mm_33 0.0104 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7315680Z triton_mm_21 0.0107 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7315902Z triton_mm_22 0.0111 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7316133Z triton_mm_23 0.0114 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7316359Z triton_mm_15 0.0115 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7316598Z triton_mm_35 0.0118 ms 73.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7316827Z triton_mm_31 0.0121 ms 71.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7316958Z SingleProcess AUTOTUNE benchmarking takes 0.1823 seconds and 0.9156 seconds precompiling for 39 choices 2025-12-04T12:10:21.7317034Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7317076Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7317134Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7317231Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7317715Z inductor [('triton_bundler_save_kernel', 312), 
('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7317762Z graph_break [] 2025-12-04T12:10:21.7317825Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7317898Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7317938Z Autotune Choices Stats: 2025-12-04T12:10:21.7318298Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_67", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009239999577403069, "best_triton_pos": 0} 2025-12-04T12:10:21.7318356Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7318398Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7318497Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7318725Z triton_mm_67 0.0092 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7318962Z triton_mm_71 0.0099 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7319188Z triton_mm_54 0.0104 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7319412Z triton_mm_60 0.0108 ms 85.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7319633Z triton_mm_59 0.0110 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7319859Z triton_mm_72 0.0110 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7320081Z triton_mm_68 0.0111 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7320344Z triton_mm_53 0.0113 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7320569Z triton_mm_61 0.0115 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7320796Z triton_mm_69 0.0118 ms 78.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7320926Z SingleProcess AUTOTUNE benchmarking takes 0.2756 seconds and 0.5168 seconds precompiling for 39 choices 
2025-12-04T12:10:21.7320997Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7321040Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7321096Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7321195Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7321675Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7321737Z graph_break [] 2025-12-04T12:10:21.7321799Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7321873Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7321914Z Autotune Choices Stats: 2025-12-04T12:10:21.7322279Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_110", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009440000168979168, "best_triton_pos": 0} 2025-12-04T12:10:21.7322328Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7322368Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7322480Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7322714Z triton_mm_110 0.0094 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=4 2025-12-04T12:10:21.7322943Z triton_mm_105 0.0096 ms 97.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7323168Z triton_mm_106 0.0105 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7323396Z triton_mm_109 0.0106 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7323620Z triton_mm_98 0.0108 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7323845Z triton_mm_92 0.0114 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7324079Z triton_mm_99 0.0114 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7324305Z triton_mm_91 0.0116 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7324527Z triton_mm_97 0.0117 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7324755Z triton_mm_107 0.0118 ms 80.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7324883Z SingleProcess AUTOTUNE benchmarking takes 0.2678 seconds and 0.3606 seconds precompiling for 39 choices 2025-12-04T12:10:21.7325072Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-133ffb51fe566405.xml - 2025-12-04T12:10:21.7325143Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7325737Z FAILED [1.2189s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1212153856 and is now 1308622848. 2025-12-04T12:10:21.7325756Z 2025-12-04T12:10:21.7325829Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7326093Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7326096Z 2025-12-04T12:10:21.7326183Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7326255Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7326325Z ================== 1 failed, 187 deselected, 2 rerun in 6.13s ================== 2025-12-04T12:10:21.7326363Z Got exit code 1 2025-12-04T12:10:21.7326404Z Retrying single test... 2025-12-04T12:10:21.7326547Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-166ea3810f1fa92d.xml 2025-12-04T12:10:21.7326604Z ============================= test session starts ============================== 2025-12-04T12:10:21.7326715Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7326756Z cachedir: .pytest_cache 2025-12-04T12:10:21.7326915Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7326962Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7327002Z configfile: pytest.ini 2025-12-04T12:10:21.7327168Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7327243Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7327501Z stepcurrent: skipping 137 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7327544Z Running 1 items in this shard 2025-12-04T12:10:21.7327546Z 2025-12-04T12:10:21.7327775Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [3.1435s] [100%] 2025-12-04T12:10:21.7327992Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.5459s] [100%] 2025-12-04T12:10:21.7328187Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.1489s] [100%] 2025-12-04T12:10:21.7328190Z 2025-12-04T12:10:21.7328241Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7328386Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7328433Z Traceback (most recent call last): 2025-12-04T12:10:21.7328591Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7328633Z method(*args, **kwargs) 2025-12-04T12:10:21.7328784Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7328836Z method(*args, **kwargs) 2025-12-04T12:10:21.7328986Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7329023Z with policy(): 2025-12-04T12:10:21.7329174Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7329228Z 
raise RuntimeError(msg) 2025-12-04T12:10:21.7329625Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1115684864. 2025-12-04T12:10:21.7329628Z 2025-12-04T12:10:21.7329700Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7329964Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7329977Z 2025-12-04T12:10:21.7330062Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7330172Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7330214Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7330271Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7330755Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7330854Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7330891Z graph_break [] 2025-12-04T12:10:21.7330955Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7331028Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7331509Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7331575Z current_size = base.storage().size() 2025-12-04T12:10:21.7331616Z Autotune Choices Stats: 2025-12-04T12:10:21.7331989Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009759999811649323, "best_triton_pos": 0} 2025-12-04T12:10:21.7332037Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7332081Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7332178Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7332414Z triton_mm_34 0.0098 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7332644Z triton_mm_33 0.0101 ms 96.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7332883Z triton_mm_30 0.0106 ms 92.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7333105Z triton_mm_16 0.0108 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7333338Z triton_mm_21 0.0108 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7333562Z triton_mm_22 0.0109 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7333797Z triton_mm_29 0.0109 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7334022Z triton_mm_31 0.0117 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7334247Z triton_mm_15 0.0117 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7334471Z triton_mm_23 0.0118 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7334600Z SingleProcess AUTOTUNE benchmarking takes 0.1747 seconds and 0.7692 seconds precompiling for 39 choices 2025-12-04T12:10:21.7334747Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7334795Z 
Traceback (most recent call last): 2025-12-04T12:10:21.7334951Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7334993Z method(*args, **kwargs) 2025-12-04T12:10:21.7335144Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7335184Z method(*args, **kwargs) 2025-12-04T12:10:21.7335342Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7335381Z with policy(): 2025-12-04T12:10:21.7335533Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7335574Z raise RuntimeError(msg) 2025-12-04T12:10:21.7335970Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1115684864 and is now 1212153856. 
2025-12-04T12:10:21.7335973Z 2025-12-04T12:10:21.7336186Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7336451Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7336453Z 2025-12-04T12:10:21.7336538Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7336626Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7336668Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7336727Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7337209Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7337317Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7337354Z graph_break [] 2025-12-04T12:10:21.7337417Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7337490Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7337982Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7338031Z current_size = base.storage().size() 2025-12-04T12:10:21.7338071Z Autotune Choices Stats: 2025-12-04T12:10:21.7338445Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009759999811649323, "best_triton_pos": 0} 2025-12-04T12:10:21.7338492Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7338535Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7338634Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7338871Z triton_mm_34 0.0098 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7339099Z triton_mm_33 0.0101 ms 96.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7339333Z triton_mm_30 0.0106 ms 92.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7339556Z triton_mm_16 0.0108 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7339780Z triton_mm_21 0.0108 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7340004Z triton_mm_22 0.0109 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7340269Z triton_mm_29 0.0109 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7340493Z triton_mm_31 0.0117 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7340747Z triton_mm_15 0.0117 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7340984Z triton_mm_23 0.0118 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7341114Z SingleProcess AUTOTUNE benchmarking takes 0.1747 seconds and 0.7692 seconds precompiling for 39 choices 2025-12-04T12:10:21.7341187Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7341232Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7341289Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7341389Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7341892Z inductor [('triton_bundler_save_kernel', 312), 
('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7341929Z graph_break [] 2025-12-04T12:10:21.7341993Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7342067Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7342110Z Autotune Choices Stats: 2025-12-04T12:10:21.7342473Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009359999559819698, "best_triton_pos": 0} 2025-12-04T12:10:21.7342522Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7342564Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7342663Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7342895Z triton_mm_72 0.0094 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7343134Z triton_mm_71 0.0098 ms 95.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7343358Z triton_mm_68 0.0100 ms 93.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7343580Z triton_mm_60 0.0105 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7343804Z triton_mm_67 0.0106 ms 88.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7344026Z triton_mm_59 0.0109 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7344263Z triton_mm_54 0.0110 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7344488Z triton_mm_53 0.0116 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7344722Z triton_mm_69 0.0117 ms 80.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7344952Z triton_mm_73 0.0118 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7345078Z SingleProcess AUTOTUNE benchmarking takes 0.2821 seconds and 0.5218 seconds precompiling for 39 choices 
2025-12-04T12:10:21.7345141Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7345289Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7345337Z Traceback (most recent call last): 2025-12-04T12:10:21.7345492Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7345535Z method(*args, **kwargs) 2025-12-04T12:10:21.7345687Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7345728Z method(*args, **kwargs) 2025-12-04T12:10:21.7345880Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7345917Z with policy(): 2025-12-04T12:10:21.7346070Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7346110Z raise RuntimeError(msg) 2025-12-04T12:10:21.7346506Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1212153856 and is now 1308622848. 
2025-12-04T12:10:21.7346510Z 2025-12-04T12:10:21.7346583Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7346857Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7346860Z 2025-12-04T12:10:21.7346948Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7347023Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7347064Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7347120Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7347603Z inductor [('triton_bundler_save_kernel', 312), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7347701Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7347737Z graph_break [] 2025-12-04T12:10:21.7347810Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7347884Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7348364Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7348422Z current_size = base.storage().size() 2025-12-04T12:10:21.7348462Z Autotune Choices Stats: 2025-12-04T12:10:21.7348835Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_34", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009759999811649323, "best_triton_pos": 0} 2025-12-04T12:10:21.7348883Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7348924Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7349033Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7349268Z triton_mm_34 0.0098 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7349497Z triton_mm_33 0.0101 ms 96.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7349721Z triton_mm_30 0.0106 ms 92.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7349947Z triton_mm_16 0.0108 ms 90.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7350212Z triton_mm_21 0.0108 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7350451Z triton_mm_22 0.0109 ms 89.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7350678Z triton_mm_29 0.0109 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7350903Z triton_mm_31 0.0117 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7351130Z triton_mm_15 0.0117 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7351356Z triton_mm_23 0.0118 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7351483Z SingleProcess AUTOTUNE benchmarking takes 0.1747 seconds and 0.7692 seconds precompiling for 39 choices 2025-12-04T12:10:21.7351555Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7351610Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7351666Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7351764Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7352246Z inductor [('triton_bundler_save_kernel', 312), 
('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7352295Z graph_break [] 2025-12-04T12:10:21.7352360Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7352431Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7352472Z Autotune Choices Stats: 2025-12-04T12:10:21.7352847Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_72", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.009359999559819698, "best_triton_pos": 0} 2025-12-04T12:10:21.7352897Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7352939Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7353037Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7353272Z triton_mm_72 0.0094 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7353499Z triton_mm_71 0.0098 ms 95.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7353722Z triton_mm_68 0.0100 ms 93.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7353944Z triton_mm_60 0.0105 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7354180Z triton_mm_67 0.0106 ms 88.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7354401Z triton_mm_59 0.0109 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7354626Z triton_mm_54 0.0110 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7354852Z triton_mm_53 0.0116 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7355076Z triton_mm_69 0.0117 ms 80.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7355303Z triton_mm_73 0.0118 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7355442Z SingleProcess AUTOTUNE benchmarking takes 0.2821 seconds and 0.5218 seconds precompiling for 39 choices 
2025-12-04T12:10:21.7355514Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7355566Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7355623Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7355721Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7356202Z inductor [('triton_bundler_save_kernel', 312), ('async_compile_cache_miss', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 39), ('generated_module_cache_miss', 38), ('select_algorithm_num_precompiles', 38), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7356240Z graph_break [] 2025-12-04T12:10:21.7356311Z aten_mm_info [('aten._scaled_mm.default_257_2048_1024', 1)] 2025-12-04T12:10:21.7356386Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7356427Z Autotune Choices Stats: 2025-12-04T12:10:21.7356791Z {"num_choices": 39, "num_triton_choices": 38, "best_kernel": "triton_mm_105", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.009758999571204185, "best_triton_pos": 0} 2025-12-04T12:10:21.7356838Z AUTOTUNE scaled_mm(257x1024, 1024x2048, , ) 2025-12-04T12:10:21.7356880Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7356977Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7357212Z triton_mm_105 0.0098 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=2 2025-12-04T12:10:21.7357440Z triton_mm_110 0.0100 ms 97.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7357667Z triton_mm_109 0.0104 ms 94.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7357901Z triton_mm_106 0.0105 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7358126Z triton_mm_97 0.0106 ms 92.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7358350Z triton_mm_98 0.0107 ms 91.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7358574Z triton_mm_92 0.0110 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7358803Z triton_mm_91 0.0116 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7359043Z triton_mm_111 0.0120 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7359267Z triton_mm_99 0.0120 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7359404Z SingleProcess AUTOTUNE benchmarking takes 0.2628 seconds and 0.3666 seconds precompiling for 39 choices 2025-12-04T12:10:21.7359591Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-166ea3810f1fa92d.xml - 2025-12-04T12:10:21.7359652Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7360296Z FAILED [1.1489s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1212153856 and is now 1308622848. 2025-12-04T12:10:21.7360302Z 2025-12-04T12:10:21.7360374Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7360637Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7360640Z 2025-12-04T12:10:21.7360725Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7360788Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7360857Z ================== 1 failed, 187 deselected, 2 rerun in 5.86s ================== 2025-12-04T12:10:21.7360894Z Got exit code 1 2025-12-04T12:10:21.7361103Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7361231Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7361370Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-85009b8141265a72.xml 2025-12-04T12:10:21.7361430Z ============================= test session starts ============================== 2025-12-04T12:10:21.7361556Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7361599Z cachedir: .pytest_cache 2025-12-04T12:10:21.7361757Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7361803Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7361846Z configfile: pytest.ini 2025-12-04T12:10:21.7362011Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7362096Z collecting ... collected 188 items / 138 deselected / 50 selected 2025-12-04T12:10:21.7362150Z stepcurrent: skipping 138 already run items. 
2025-12-04T12:10:21.7362195Z Running 50 items in this shard 2025-12-04T12:10:21.7362197Z 2025-12-04T12:10:21.7362414Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7012s] [ 2%] 2025-12-04T12:10:21.7362627Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2635s] [ 2%] 2025-12-04T12:10:21.7362825Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2250s] [ 2%] 2025-12-04T12:10:21.7362828Z 2025-12-04T12:10:21.7362881Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7363043Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7363093Z Traceback (most recent call last): 2025-12-04T12:10:21.7363250Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7363293Z method(*args, **kwargs) 2025-12-04T12:10:21.7363445Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7363488Z method(*args, **kwargs) 2025-12-04T12:10:21.7363642Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7363680Z with policy(): 2025-12-04T12:10:21.7363842Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7363885Z raise RuntimeError(msg) 2025-12-04T12:10:21.7364281Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.7364284Z 2025-12-04T12:10:21.7364356Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7364618Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7364621Z 2025-12-04T12:10:21.7364706Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7364785Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7364828Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7364886Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7364952Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7365055Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7365095Z graph_break [] 2025-12-04T12:10:21.7365157Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7365309Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7365355Z Traceback (most recent call last): 2025-12-04T12:10:21.7365509Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7365550Z method(*args, **kwargs) 2025-12-04T12:10:21.7365701Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7365741Z method(*args, **kwargs) 2025-12-04T12:10:21.7365892Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7365929Z with policy(): 2025-12-04T12:10:21.7366080Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7366121Z raise RuntimeError(msg) 2025-12-04T12:10:21.7366515Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.7366527Z 2025-12-04T12:10:21.7366601Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7366860Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7366872Z 2025-12-04T12:10:21.7366965Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7367036Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7367083Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7367141Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7367214Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7367315Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7367357Z graph_break [] 2025-12-04T12:10:21.7367420Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7367516Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7367558Z frames [('total', 1), ('ok', 1)] 
2025-12-04T12:10:21.7367621Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7367721Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7367785Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7367820Z graph_break [] 2025-12-04T12:10:21.7367885Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7367940Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7368083Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7368133Z Traceback (most recent call last): 2025-12-04T12:10:21.7368294Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7368337Z method(*args, **kwargs) 2025-12-04T12:10:21.7368488Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7368530Z method(*args, **kwargs) 2025-12-04T12:10:21.7368683Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7368719Z with policy(): 2025-12-04T12:10:21.7368886Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7368928Z raise RuntimeError(msg) 2025-12-04T12:10:21.7369312Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.7369315Z 2025-12-04T12:10:21.7369388Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7369651Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7369653Z 2025-12-04T12:10:21.7369739Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7369813Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7369856Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7369911Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7369975Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7370081Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7370159Z graph_break [] 2025-12-04T12:10:21.7370219Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7370292Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7370350Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7370405Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7370499Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7370563Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7370600Z graph_break [] 2025-12-04T12:10:21.7370660Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7370731Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7370775Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7370829Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7370936Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7370999Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7371036Z graph_break [] 2025-12-04T12:10:21.7371094Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7371282Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-85009b8141265a72.xml - 2025-12-04T12:10:21.7371342Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7371926Z FAILED [0.2250s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.7371930Z 2025-12-04T12:10:21.7372002Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7372259Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7372261Z 2025-12-04T12:10:21.7372347Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7372408Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7372490Z ================== 1 failed, 138 deselected, 2 rerun in 2.21s ================== 2025-12-04T12:10:21.7372527Z Got exit code 1 2025-12-04T12:10:21.7372568Z Retrying single test... 
2025-12-04T12:10:21.7372712Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e144dd498ad5417d.xml 2025-12-04T12:10:21.7372770Z ============================= test session starts ============================== 2025-12-04T12:10:21.7372880Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7372922Z cachedir: .pytest_cache 2025-12-04T12:10:21.7373079Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7373123Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7373164Z configfile: pytest.ini 2025-12-04T12:10:21.7373327Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7373403Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7373655Z stepcurrent: skipping 138 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7373718Z Running 1 items in this shard 2025-12-04T12:10:21.7373720Z 2025-12-04T12:10:21.7373933Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9024s] [100%] 2025-12-04T12:10:21.7374162Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3781s] [100%] 2025-12-04T12:10:21.7374352Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3294s] [100%] 2025-12-04T12:10:21.7374354Z 2025-12-04T12:10:21.7374406Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7374547Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7374594Z Traceback (most recent call last): 2025-12-04T12:10:21.7374761Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7374805Z method(*args, **kwargs) 2025-12-04T12:10:21.7374956Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7374995Z method(*args, **kwargs) 2025-12-04T12:10:21.7375145Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7375182Z with policy(): 2025-12-04T12:10:21.7375334Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7375374Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7375762Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.7375765Z 2025-12-04T12:10:21.7375836Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7376094Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7376095Z 2025-12-04T12:10:21.7376192Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7376267Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7376310Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7376366Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7376432Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7376530Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7376566Z graph_break [] 2025-12-04T12:10:21.7376625Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7376767Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7376812Z Traceback (most recent call last): 2025-12-04T12:10:21.7376965Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7377005Z method(*args, **kwargs) 2025-12-04T12:10:21.7377155Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:21.7377205Z method(*args, **kwargs) 2025-12-04T12:10:21.7377353Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7377390Z with policy(): 2025-12-04T12:10:21.7377541Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7377594Z raise RuntimeError(msg) 2025-12-04T12:10:21.7377977Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.7377980Z 2025-12-04T12:10:21.7378052Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7378310Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7378312Z 2025-12-04T12:10:21.7378409Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7378482Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7378525Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7378580Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7378645Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7378741Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7378778Z graph_break [] 2025-12-04T12:10:21.7378837Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7378911Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.7378952Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7379008Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7379102Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7379168Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7379204Z graph_break [] 2025-12-04T12:10:21.7379264Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7379316Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7379458Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7379503Z Traceback (most recent call last): 2025-12-04T12:10:21.7379666Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7379706Z method(*args, **kwargs) 2025-12-04T12:10:21.7379855Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7379895Z method(*args, **kwargs) 2025-12-04T12:10:21.7380046Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7380083Z with policy(): 2025-12-04T12:10:21.7380286Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7380326Z raise RuntimeError(msg) 2025-12-04T12:10:21.7380714Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.7380716Z 2025-12-04T12:10:21.7380788Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7381061Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7381063Z 2025-12-04T12:10:21.7381150Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7381243Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7381285Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7381341Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7381406Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7381502Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7381539Z graph_break [] 2025-12-04T12:10:21.7381597Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7381670Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7381711Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7381767Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7381874Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7381938Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7381975Z graph_break [] 2025-12-04T12:10:21.7382034Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7382108Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7382151Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7382205Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7382300Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7382363Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7382400Z graph_break [] 2025-12-04T12:10:21.7382457Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7382649Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e144dd498ad5417d.xml - 2025-12-04T12:10:21.7382710Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7383301Z FAILED [0.3294s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.7383303Z 2025-12-04T12:10:21.7383376Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7383634Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7383636Z 2025-12-04T12:10:21.7383724Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7383787Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7383854Z ================== 1 failed, 187 deselected, 2 rerun in 2.63s ================== 2025-12-04T12:10:21.7383890Z Got exit code 1 2025-12-04T12:10:21.7383931Z Retrying single test... 
2025-12-04T12:10:21.7384076Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7aedb16c2b037ef8.xml 2025-12-04T12:10:21.7384134Z ============================= test session starts ============================== 2025-12-04T12:10:21.7384244Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7384296Z cachedir: .pytest_cache 2025-12-04T12:10:21.7384453Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7384498Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7384541Z configfile: pytest.ini 2025-12-04T12:10:21.7384714Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7384788Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7385042Z stepcurrent: skipping 138 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7385086Z Running 1 items in this shard 2025-12-04T12:10:21.7385088Z 2025-12-04T12:10:21.7385299Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.7127s] [100%] 2025-12-04T12:10:21.7385521Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2803s] [100%] 2025-12-04T12:10:21.7385710Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda FAILED [0.2293s] [100%] 2025-12-04T12:10:21.7385714Z 2025-12-04T12:10:21.7385765Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7385906Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7385952Z Traceback (most recent call last): 2025-12-04T12:10:21.7386108Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7386149Z method(*args, **kwargs) 2025-12-04T12:10:21.7386300Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7386340Z method(*args, **kwargs) 2025-12-04T12:10:21.7386489Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7386527Z with policy(): 2025-12-04T12:10:21.7386678Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7386718Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7387114Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.7387118Z 2025-12-04T12:10:21.7387190Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7387447Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7387450Z 2025-12-04T12:10:21.7387536Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7387607Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7387649Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7387705Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7387770Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7387868Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7387906Z graph_break [] 2025-12-04T12:10:21.7387975Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7388117Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7388163Z Traceback (most recent call last): 2025-12-04T12:10:21.7388315Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7388365Z method(*args, **kwargs) 2025-12-04T12:10:21.7388514Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:21.7388553Z method(*args, **kwargs) 2025-12-04T12:10:21.7388703Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7388739Z with policy(): 2025-12-04T12:10:21.7388891Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7388932Z raise RuntimeError(msg) 2025-12-04T12:10:21.7389331Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.7389334Z 2025-12-04T12:10:21.7389405Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7389663Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7389665Z 2025-12-04T12:10:21.7389751Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7389823Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7389867Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7389922Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7389988Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7390084Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7390161Z graph_break [] 2025-12-04T12:10:21.7390220Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7390293Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.7390333Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7390388Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7390497Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7390561Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7390597Z graph_break [] 2025-12-04T12:10:21.7390655Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7390707Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7390852Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7390897Z Traceback (most recent call last): 2025-12-04T12:10:21.7391051Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7391090Z method(*args, **kwargs) 2025-12-04T12:10:21.7391242Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7391282Z method(*args, **kwargs) 2025-12-04T12:10:21.7391437Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7391473Z with policy(): 2025-12-04T12:10:21.7391638Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7391679Z raise RuntimeError(msg) 2025-12-04T12:10:21.7392064Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.7392079Z 2025-12-04T12:10:21.7392152Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7392407Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7392409Z 2025-12-04T12:10:21.7392494Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7392566Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7392609Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7392676Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7392741Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7392838Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7392874Z graph_break [] 2025-12-04T12:10:21.7392932Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7393005Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7393046Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7393103Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7393199Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7393262Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7393298Z graph_break [] 2025-12-04T12:10:21.7393356Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7393428Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7393471Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7393527Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7393622Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7393685Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7393720Z graph_break [] 2025-12-04T12:10:21.7393780Z aten_mm_info [('aten._scaled_mm.default_257_16_16', 1)] 2025-12-04T12:10:21.7393980Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7aedb16c2b037ef8.xml - 2025-12-04T12:10:21.7394041Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7394619Z FAILED [0.2293s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.7394622Z 2025-12-04T12:10:21.7394695Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7394952Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7394954Z 2025-12-04T12:10:21.7395039Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7395120Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7395188Z ================== 1 failed, 187 deselected, 2 rerun in 2.24s ================== 2025-12-04T12:10:21.7395225Z Got exit code 1 2025-12-04T12:10:21.7395429Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7395568Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7395710Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-24c7d96920fc6823.xml 2025-12-04T12:10:21.7395769Z ============================= test session starts ============================== 2025-12-04T12:10:21.7395878Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7395919Z cachedir: .pytest_cache 2025-12-04T12:10:21.7396075Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7396131Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7396173Z configfile: pytest.ini 2025-12-04T12:10:21.7396335Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7396411Z collecting ... collected 188 items / 139 deselected / 49 selected 2025-12-04T12:10:21.7396466Z stepcurrent: skipping 139 already run items. 
2025-12-04T12:10:21.7396510Z Running 49 items in this shard 2025-12-04T12:10:21.7396512Z 2025-12-04T12:10:21.7396733Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.6821s] [ 2%] 2025-12-04T12:10:21.7396950Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2729s] [ 2%] 2025-12-04T12:10:21.7397144Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.2226s] [ 2%] 2025-12-04T12:10:21.7397146Z 2025-12-04T12:10:21.7397198Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7397343Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7397389Z Traceback (most recent call last): 2025-12-04T12:10:21.7397544Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7397595Z method(*args, **kwargs) 2025-12-04T12:10:21.7397747Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7397791Z method(*args, **kwargs) 2025-12-04T12:10:21.7397943Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7397983Z with policy(): 2025-12-04T12:10:21.7398135Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7398177Z raise RuntimeError(msg) 2025-12-04T12:10:21.7398565Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1113587712. 2025-12-04T12:10:21.7398567Z 2025-12-04T12:10:21.7398641Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7398905Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7398917Z 2025-12-04T12:10:21.7399005Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7399078Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7399131Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7399188Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7399253Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7399351Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7399386Z graph_break [] 2025-12-04T12:10:21.7399448Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7399594Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7399641Z Traceback (most recent call last): 2025-12-04T12:10:21.7399793Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7399844Z method(*args, **kwargs) 2025-12-04T12:10:21.7399995Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7400038Z method(*args, **kwargs) 
2025-12-04T12:10:21.7400226Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7400263Z with policy(): 2025-12-04T12:10:21.7400414Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7400456Z raise RuntimeError(msg) 2025-12-04T12:10:21.7400842Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1128267776. 2025-12-04T12:10:21.7400847Z 2025-12-04T12:10:21.7400919Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7401179Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7401183Z 2025-12-04T12:10:21.7401270Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7401343Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7401400Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7401457Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7401521Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7401620Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7401658Z graph_break [] 2025-12-04T12:10:21.7401720Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7401794Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7401838Z frames [('total', 1), 
('ok', 1)] 2025-12-04T12:10:21.7401891Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7401988Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7402050Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7402089Z graph_break [] 2025-12-04T12:10:21.7402149Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7402201Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7402346Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7402407Z Traceback (most recent call last): 2025-12-04T12:10:21.7402559Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7402600Z method(*args, **kwargs) 2025-12-04T12:10:21.7402766Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7402808Z method(*args, **kwargs) 2025-12-04T12:10:21.7402956Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7402993Z with policy(): 2025-12-04T12:10:21.7403145Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7403186Z raise RuntimeError(msg) 2025-12-04T12:10:21.7403585Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 
2025-12-04T12:10:21.7403588Z 2025-12-04T12:10:21.7403663Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7403921Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7403924Z 2025-12-04T12:10:21.7404008Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7404083Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7404124Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7404181Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7404247Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7404346Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7404382Z graph_break [] 2025-12-04T12:10:21.7404442Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7404516Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7404557Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7404612Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7404707Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7404781Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7404817Z graph_break [] 2025-12-04T12:10:21.7404876Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7404949Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7404990Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7405044Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7405140Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7405203Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7405239Z graph_break [] 2025-12-04T12:10:21.7405296Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7405484Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-24c7d96920fc6823.xml - 2025-12-04T12:10:21.7405545Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7406130Z FAILED [0.2226s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 2025-12-04T12:10:21.7406142Z 2025-12-04T12:10:21.7406226Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7406485Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7406487Z 2025-12-04T12:10:21.7406574Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7406635Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7406703Z ================== 1 failed, 139 deselected, 2 rerun in 2.20s ================== 2025-12-04T12:10:21.7406741Z Got exit code 1 2025-12-04T12:10:21.7406782Z Retrying single test... 
2025-12-04T12:10:21.7406936Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3487eb5997ffd5e2.xml 2025-12-04T12:10:21.7406997Z ============================= test session starts ============================== 2025-12-04T12:10:21.7407109Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7407152Z cachedir: .pytest_cache 2025-12-04T12:10:21.7407307Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7407353Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7407393Z configfile: pytest.ini 2025-12-04T12:10:21.7407555Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7407629Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7407887Z stepcurrent: skipping 139 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7407930Z Running 1 items in this shard 2025-12-04T12:10:21.7407934Z 2025-12-04T12:10:21.7408149Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.6879s] [100%] 2025-12-04T12:10:21.7408364Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.2644s] [100%] 2025-12-04T12:10:21.7408570Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3787s] [100%] 2025-12-04T12:10:21.7408574Z 2025-12-04T12:10:21.7408626Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7408772Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7408818Z Traceback (most recent call last): 2025-12-04T12:10:21.7408973Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7409016Z method(*args, **kwargs) 2025-12-04T12:10:21.7409169Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7409209Z method(*args, **kwargs) 2025-12-04T12:10:21.7409361Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7409399Z with policy(): 2025-12-04T12:10:21.7409550Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7409602Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7409988Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1113587712. 2025-12-04T12:10:21.7410000Z 2025-12-04T12:10:21.7410073Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7410347Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7410349Z 2025-12-04T12:10:21.7410434Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7410507Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7410551Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7410608Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7410696Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7410796Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7410834Z graph_break [] 2025-12-04T12:10:21.7410894Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7411038Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7411083Z Traceback (most recent call last): 2025-12-04T12:10:21.7411234Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7411275Z method(*args, **kwargs) 2025-12-04T12:10:21.7411423Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", 
line 3329, in wrapper 2025-12-04T12:10:21.7411464Z method(*args, **kwargs) 2025-12-04T12:10:21.7411612Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7411650Z with policy(): 2025-12-04T12:10:21.7411801Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7411842Z raise RuntimeError(msg) 2025-12-04T12:10:21.7412243Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1128267776. 2025-12-04T12:10:21.7412246Z 2025-12-04T12:10:21.7412319Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7412578Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7412580Z 2025-12-04T12:10:21.7412665Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7412738Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7412779Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7412836Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7412899Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7413000Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7413036Z graph_break [] 2025-12-04T12:10:21.7413097Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7413182Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.7413225Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7413280Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7413376Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7413452Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7413490Z graph_break [] 2025-12-04T12:10:21.7413547Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7413601Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7413747Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7413793Z Traceback (most recent call last): 2025-12-04T12:10:21.7413945Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7413987Z method(*args, **kwargs) 2025-12-04T12:10:21.7414146Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7414188Z method(*args, **kwargs) 2025-12-04T12:10:21.7414339Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7414376Z with policy(): 2025-12-04T12:10:21.7414527Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7414568Z raise RuntimeError(msg) 2025-12-04T12:10:21.7414956Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 
2025-12-04T12:10:21.7414959Z 2025-12-04T12:10:21.7415031Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7415292Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7415295Z 2025-12-04T12:10:21.7415382Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7415455Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7415497Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7415553Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7415626Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7415723Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7415760Z graph_break [] 2025-12-04T12:10:21.7415820Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7415893Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7415936Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7415991Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7416088Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7416151Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7416187Z graph_break [] 2025-12-04T12:10:21.7416245Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7416317Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7416360Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7416413Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7416510Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7416585Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7416625Z graph_break [] 2025-12-04T12:10:21.7416685Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7416875Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3487eb5997ffd5e2.xml - 2025-12-04T12:10:21.7416944Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7417532Z FAILED [0.3787s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 2025-12-04T12:10:21.7417535Z 2025-12-04T12:10:21.7417607Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7417873Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7417876Z 2025-12-04T12:10:21.7417961Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7418022Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7418091Z ================== 1 failed, 187 deselected, 2 rerun in 2.35s ================== 2025-12-04T12:10:21.7418127Z Got exit code 1 2025-12-04T12:10:21.7418168Z Retrying single test... 
2025-12-04T12:10:21.7418311Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-423c40ebf306b625.xml 2025-12-04T12:10:21.7418368Z ============================= test session starts ============================== 2025-12-04T12:10:21.7418478Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7418520Z cachedir: .pytest_cache 2025-12-04T12:10:21.7418676Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7418725Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7418766Z configfile: pytest.ini 2025-12-04T12:10:21.7418929Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7419002Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7419267Z stepcurrent: skipping 139 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7419312Z Running 1 items in this shard 2025-12-04T12:10:21.7419315Z 2025-12-04T12:10:21.7419532Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9612s] [100%] 2025-12-04T12:10:21.7419746Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3669s] [100%] 2025-12-04T12:10:21.7419936Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3632s] [100%] 2025-12-04T12:10:21.7419938Z 2025-12-04T12:10:21.7419990Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7420183Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7420230Z Traceback (most recent call last): 2025-12-04T12:10:21.7420406Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7420448Z method(*args, **kwargs) 2025-12-04T12:10:21.7420598Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7420654Z method(*args, **kwargs) 2025-12-04T12:10:21.7420802Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7420839Z with policy(): 2025-12-04T12:10:21.7420989Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7421031Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7421419Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1113587712. 2025-12-04T12:10:21.7421423Z 2025-12-04T12:10:21.7421513Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7421773Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7421776Z 2025-12-04T12:10:21.7421861Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7421933Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7421974Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7422031Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7422096Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7422197Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7422232Z graph_break [] 2025-12-04T12:10:21.7422293Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7422438Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7422485Z Traceback (most recent call last): 2025-12-04T12:10:21.7422636Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7422676Z method(*args, **kwargs) 2025-12-04T12:10:21.7422823Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", 
line 3329, in wrapper 2025-12-04T12:10:21.7422875Z method(*args, **kwargs) 2025-12-04T12:10:21.7423027Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7423065Z with policy(): 2025-12-04T12:10:21.7423219Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7423260Z raise RuntimeError(msg) 2025-12-04T12:10:21.7423644Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1128267776. 2025-12-04T12:10:21.7423647Z 2025-12-04T12:10:21.7423719Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7423979Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7423982Z 2025-12-04T12:10:21.7424078Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7424151Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7424194Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7424250Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7424313Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7424423Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7424458Z graph_break [] 2025-12-04T12:10:21.7424522Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7424594Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.7424637Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7424692Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7424788Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7424852Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7424888Z graph_break [] 2025-12-04T12:10:21.7424956Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7425008Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7425155Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7425200Z Traceback (most recent call last): 2025-12-04T12:10:21.7425353Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7425393Z method(*args, **kwargs) 2025-12-04T12:10:21.7425544Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7425583Z method(*args, **kwargs) 2025-12-04T12:10:21.7425733Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7425769Z with policy(): 2025-12-04T12:10:21.7425922Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7425962Z raise RuntimeError(msg) 2025-12-04T12:10:21.7426349Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 
2025-12-04T12:10:21.7426351Z 2025-12-04T12:10:21.7426434Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7426693Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7426695Z 2025-12-04T12:10:21.7426781Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7426853Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7426895Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7426951Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7427015Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7427111Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7427147Z graph_break [] 2025-12-04T12:10:21.7427206Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7427279Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7427320Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7427375Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7427481Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7427545Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7427580Z graph_break [] 2025-12-04T12:10:21.7427638Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7427722Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7427764Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7427817Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7427915Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7427977Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7428014Z graph_break [] 2025-12-04T12:10:21.7428071Z aten_mm_info [('aten._scaled_mm.default_257_2048_16', 1)] 2025-12-04T12:10:21.7428259Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-423c40ebf306b625.xml - 2025-12-04T12:10:21.7428329Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7428913Z FAILED [0.3632s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1128267776 and is now 1142947840. 2025-12-04T12:10:21.7428916Z 2025-12-04T12:10:21.7428990Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7429249Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7429252Z 2025-12-04T12:10:21.7429337Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7429398Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7429466Z ================== 1 failed, 187 deselected, 2 rerun in 2.71s ================== 2025-12-04T12:10:21.7429503Z Got exit code 1 2025-12-04T12:10:21.7429711Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7429838Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7429992Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ac8bcc98978b1262.xml 2025-12-04T12:10:21.7430050Z ============================= test session starts ============================== 2025-12-04T12:10:21.7430207Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7430250Z cachedir: .pytest_cache 2025-12-04T12:10:21.7430408Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7430455Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7430495Z configfile: pytest.ini 2025-12-04T12:10:21.7430656Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7430731Z collecting ... collected 188 items / 140 deselected / 48 selected 2025-12-04T12:10:21.7432159Z stepcurrent: skipping 140 already run items. 
2025-12-04T12:10:21.7432209Z Running 48 items in this shard 2025-12-04T12:10:21.7432212Z 2025-12-04T12:10:21.7432433Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.3579s] [ 2%] 2025-12-04T12:10:21.7432674Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.7521s] [ 2%] 2025-12-04T12:10:21.7432862Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda FAILED [0.7393s] [ 2%] 2025-12-04T12:10:21.7432881Z 2025-12-04T12:10:21.7432933Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7433077Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7433122Z Traceback (most recent call last): 2025-12-04T12:10:21.7433280Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7433321Z method(*args, **kwargs) 2025-12-04T12:10:21.7433475Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7433531Z method(*args, **kwargs) 2025-12-04T12:10:21.7433681Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7433720Z with policy(): 2025-12-04T12:10:21.7433872Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7433915Z raise RuntimeError(msg) 2025-12-04T12:10:21.7434305Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.7434310Z 2025-12-04T12:10:21.7434384Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7434645Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7434648Z 2025-12-04T12:10:21.7434737Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7434810Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7434855Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7434912Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7435412Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7435516Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7435552Z graph_break [] 2025-12-04T12:10:21.7435615Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7435690Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7436178Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7436226Z current_size = base.storage().size() 2025-12-04T12:10:21.7436269Z Autotune Choices Stats: 2025-12-04T12:10:21.7436655Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.7436718Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7436759Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7436862Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7437104Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7437330Z triton_mm_7 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7437564Z triton_mm_5 0.0069 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7437790Z triton_mm_4 0.0071 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=4 2025-12-04T12:10:21.7438013Z triton_mm_6 0.0071 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7438239Z triton_mm_0 0.0072 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7438462Z triton_mm_3 0.0094 ms 63.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7438688Z triton_mm_2 0.0098 ms 60.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7438730Z _scaled_mm 0.0227 ms 26.3% 2025-12-04T12:10:21.7438861Z SingleProcess AUTOTUNE benchmarking takes 0.0450 seconds and 0.1823 seconds precompiling for 9 choices 2025-12-04T12:10:21.7439014Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7439060Z Traceback (most recent call last): 2025-12-04T12:10:21.7439219Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7439262Z method(*args, **kwargs) 2025-12-04T12:10:21.7439415Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7439458Z method(*args, **kwargs) 2025-12-04T12:10:21.7439608Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 
2025-12-04T12:10:21.7439645Z with policy(): 2025-12-04T12:10:21.7439798Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7439838Z raise RuntimeError(msg) 2025-12-04T12:10:21.7440279Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.7440294Z 2025-12-04T12:10:21.7440368Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7440630Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7440647Z 2025-12-04T12:10:21.7440733Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7440808Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7440850Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7440907Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7441400Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7441502Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7441540Z graph_break [] 2025-12-04T12:10:21.7441601Z aten_mm_info 
[('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7441674Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7442157Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7442205Z current_size = base.storage().size() 2025-12-04T12:10:21.7442247Z Autotune Choices Stats: 2025-12-04T12:10:21.7442613Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.7442660Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7442701Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7442803Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7443050Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7443279Z triton_mm_7 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7443500Z triton_mm_5 0.0069 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7443724Z triton_mm_4 0.0071 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7443947Z triton_mm_6 0.0071 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7444182Z triton_mm_0 0.0072 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7444403Z triton_mm_3 0.0094 ms 63.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7444639Z triton_mm_2 0.0098 ms 60.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7444682Z _scaled_mm 0.0227 ms 26.3% 2025-12-04T12:10:21.7444812Z SingleProcess AUTOTUNE benchmarking takes 0.0450 seconds and 0.1823 seconds precompiling for 9 choices 2025-12-04T12:10:21.7444887Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7444927Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7444995Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7445093Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7445573Z inductor [('triton_bundler_save_kernel', 72), 
('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7445610Z graph_break [] 2025-12-04T12:10:21.7445671Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7445744Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7445785Z Autotune Choices Stats: 2025-12-04T12:10:21.7446148Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.7446194Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7446234Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7446333Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7446573Z triton_mm_11 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7446800Z triton_mm_9 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7447029Z triton_mm_10 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7447254Z triton_mm_8 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7447477Z triton_mm_15 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7447700Z triton_mm_14 0.0069 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7447936Z triton_mm_12 0.0072 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7448175Z triton_mm_13 0.0094 ms 65.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7448215Z _scaled_mm 0.0251 ms 24.5% 2025-12-04T12:10:21.7448345Z SingleProcess AUTOTUNE benchmarking takes 0.0586 seconds and 0.1873 seconds precompiling for 9 choices 2025-12-04T12:10:21.7448398Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7448542Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7448587Z Traceback (most recent call last): 2025-12-04T12:10:21.7448755Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.7448797Z method(*args, **kwargs) 2025-12-04T12:10:21.7448952Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7448992Z method(*args, **kwargs) 2025-12-04T12:10:21.7449141Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7449179Z with policy(): 2025-12-04T12:10:21.7449331Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7449371Z raise RuntimeError(msg) 2025-12-04T12:10:21.7449759Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7449762Z 2025-12-04T12:10:21.7449835Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7450126Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7450128Z 2025-12-04T12:10:21.7450218Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7450306Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7450349Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7450406Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7450886Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7450986Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7451022Z graph_break [] 2025-12-04T12:10:21.7451083Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7451156Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7451638Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7451710Z current_size = base.storage().size() 2025-12-04T12:10:21.7451753Z Autotune Choices Stats: 2025-12-04T12:10:21.7452128Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.7452173Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7452214Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7452314Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7452563Z triton_mm_1 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7452789Z triton_mm_7 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7453013Z triton_mm_5 0.0069 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7453239Z triton_mm_4 0.0071 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7453462Z triton_mm_6 0.0071 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7453685Z triton_mm_0 0.0072 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7453905Z triton_mm_3 0.0094 ms 63.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7454139Z triton_mm_2 0.0098 ms 60.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7454181Z _scaled_mm 0.0227 ms 26.3% 2025-12-04T12:10:21.7454309Z SingleProcess AUTOTUNE benchmarking takes 0.0450 seconds and 0.1823 seconds precompiling for 9 choices 2025-12-04T12:10:21.7454382Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7454425Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7454481Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7454581Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7455058Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7455095Z graph_break [] 2025-12-04T12:10:21.7455169Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 
1)] 2025-12-04T12:10:21.7455241Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7455283Z Autotune Choices Stats: 2025-12-04T12:10:21.7455641Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.7455696Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7455736Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7455835Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7456064Z triton_mm_11 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7456300Z triton_mm_9 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7456528Z triton_mm_10 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7456755Z triton_mm_8 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7456979Z triton_mm_15 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7457206Z triton_mm_14 0.0069 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7457433Z triton_mm_12 0.0072 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7457665Z triton_mm_13 0.0094 ms 65.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7457707Z _scaled_mm 0.0251 ms 24.5% 2025-12-04T12:10:21.7457832Z SingleProcess AUTOTUNE benchmarking takes 0.0586 seconds and 0.1873 seconds precompiling for 9 choices 2025-12-04T12:10:21.7457907Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7457950Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7458007Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7458108Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7458585Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7458621Z graph_break [] 2025-12-04T12:10:21.7458682Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7458755Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7458805Z Autotune Choices Stats: 2025-12-04T12:10:21.7459168Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_18", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:21.7459222Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7459263Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7459360Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7459593Z triton_mm_18 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7459830Z triton_mm_20 0.0063 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7460055Z triton_mm_17 0.0063 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7460316Z triton_mm_23 0.0067 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7460539Z triton_mm_21 0.0069 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7460764Z triton_mm_19 0.0080 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7460990Z triton_mm_16 0.0092 ms 64.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7461215Z triton_mm_22 0.0100 ms 59.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7461256Z _scaled_mm 0.0207 ms 28.6% 2025-12-04T12:10:21.7461397Z SingleProcess AUTOTUNE benchmarking takes 0.0595 seconds and 0.1917 seconds precompiling for 9 choices 2025-12-04T12:10:21.7461588Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ac8bcc98978b1262.xml - 2025-12-04T12:10:21.7461649Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7462235Z FAILED [0.7393s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7462239Z 2025-12-04T12:10:21.7462312Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7462571Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7462586Z 2025-12-04T12:10:21.7462676Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7462740Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7462809Z ================== 1 failed, 140 deselected, 2 rerun in 3.87s ================== 2025-12-04T12:10:21.7462859Z Got exit code 1 2025-12-04T12:10:21.7462899Z Retrying single test... 2025-12-04T12:10:21.7463042Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1c3a860092535db4.xml 2025-12-04T12:10:21.7463099Z ============================= test session starts ============================== 2025-12-04T12:10:21.7463212Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7463253Z cachedir: .pytest_cache 2025-12-04T12:10:21.7463412Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7463458Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7463499Z configfile: pytest.ini 2025-12-04T12:10:21.7463676Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7463752Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7464007Z stepcurrent: skipping 140 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7464050Z Running 1 items in this shard 2025-12-04T12:10:21.7464053Z 2025-12-04T12:10:21.7464267Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.3402s] [100%] 2025-12-04T12:10:21.7464479Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8107s] [100%] 2025-12-04T12:10:21.7464669Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda FAILED [0.7475s] [100%] 2025-12-04T12:10:21.7464671Z 2025-12-04T12:10:21.7464724Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7464866Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7464912Z Traceback (most recent call last): 2025-12-04T12:10:21.7465069Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7465127Z method(*args, **kwargs) 2025-12-04T12:10:21.7465278Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7465321Z method(*args, **kwargs) 2025-12-04T12:10:21.7465472Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7465509Z with policy(): 2025-12-04T12:10:21.7465662Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7465705Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7466096Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.7466098Z 2025-12-04T12:10:21.7466171Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7466429Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7466443Z 2025-12-04T12:10:21.7466530Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7466604Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7466656Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7466713Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7467191Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7467291Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7467328Z graph_break [] 2025-12-04T12:10:21.7467398Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7467472Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7467955Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7468003Z current_size = base.storage().size() 2025-12-04T12:10:21.7468044Z Autotune Choices Stats: 2025-12-04T12:10:21.7468413Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005998999811708927, "best_triton_pos": 0} 2025-12-04T12:10:21.7468458Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7468499Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7468600Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7468835Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7469071Z triton_mm_6 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7469293Z triton_mm_7 0.0063 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7469519Z triton_mm_1 0.0066 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7469741Z triton_mm_5 0.0068 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7469967Z triton_mm_2 0.0069 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7470231Z triton_mm_0 0.0071 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7470466Z triton_mm_3 0.0084 ms 71.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7470521Z _scaled_mm 0.0238 ms 25.2% 2025-12-04T12:10:21.7470648Z SingleProcess AUTOTUNE benchmarking takes 0.0449 seconds and 0.1816 seconds precompiling for 9 choices 2025-12-04T12:10:21.7470791Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7470836Z Traceback (most recent call last): 2025-12-04T12:10:21.7470993Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7471034Z method(*args, **kwargs) 2025-12-04T12:10:21.7471185Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7471238Z method(*args, **kwargs) 2025-12-04T12:10:21.7471388Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7471427Z with policy(): 2025-12-04T12:10:21.7471579Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7471620Z raise RuntimeError(msg) 2025-12-04T12:10:21.7472007Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.7472010Z 2025-12-04T12:10:21.7472085Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7472344Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7472347Z 2025-12-04T12:10:21.7472434Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7472506Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7472549Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7472604Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7473093Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7473193Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7473231Z graph_break [] 2025-12-04T12:10:21.7473292Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7473365Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7473846Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7473893Z current_size = base.storage().size() 2025-12-04T12:10:21.7473945Z Autotune Choices Stats: 2025-12-04T12:10:21.7474307Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005998999811708927, "best_triton_pos": 0} 2025-12-04T12:10:21.7474366Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7474406Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7474506Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7474740Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7474963Z triton_mm_6 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7475196Z triton_mm_7 0.0063 ms 94.9% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7475421Z triton_mm_1 0.0066 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7475643Z triton_mm_5 0.0068 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7475864Z triton_mm_2 0.0069 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7476088Z triton_mm_0 0.0071 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7476311Z triton_mm_3 0.0084 ms 71.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7476351Z _scaled_mm 0.0238 ms 25.2% 2025-12-04T12:10:21.7476492Z SingleProcess AUTOTUNE benchmarking takes 0.0449 seconds and 0.1816 seconds precompiling for 9 choices 2025-12-04T12:10:21.7476565Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7476607Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7476664Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7476763Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7477239Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 6), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7477277Z graph_break [] 2025-12-04T12:10:21.7477337Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7477411Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7477450Z Autotune Choices Stats: 2025-12-04T12:10:21.7477808Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.7477862Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7477914Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7478012Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7478242Z triton_mm_11 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7478473Z triton_mm_14 0.0062 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7478716Z triton_mm_15 0.0084 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7478942Z triton_mm_12 0.0088 ms 66.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7479165Z triton_mm_13 0.0088 ms 66.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7479390Z triton_mm_10 0.0091 ms 64.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7479615Z triton_mm_9 0.0092 ms 63.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7479837Z triton_mm_8 0.0096 ms 61.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7479879Z _scaled_mm 0.0248 ms 23.7% 2025-12-04T12:10:21.7480004Z SingleProcess AUTOTUNE benchmarking takes 0.0533 seconds and 0.1724 seconds precompiling for 9 choices 2025-12-04T12:10:21.7480068Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7480248Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7480294Z Traceback (most recent call last): 2025-12-04T12:10:21.7480454Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7480497Z method(*args, **kwargs) 2025-12-04T12:10:21.7480649Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7480691Z method(*args, **kwargs) 2025-12-04T12:10:21.7480841Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7480879Z with policy(): 2025-12-04T12:10:21.7481030Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7481072Z raise RuntimeError(msg) 2025-12-04T12:10:21.7481460Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7481477Z 2025-12-04T12:10:21.7481550Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7481810Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7481829Z 2025-12-04T12:10:21.7481917Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7481993Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7482036Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7482092Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7482582Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7482684Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7482721Z graph_break [] 2025-12-04T12:10:21.7482780Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7482856Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7483336Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7483386Z current_size = base.storage().size() 2025-12-04T12:10:21.7483427Z Autotune Choices Stats: 2025-12-04T12:10:21.7483790Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005998999811708927, "best_triton_pos": 0} 2025-12-04T12:10:21.7483837Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7483879Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7483978Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7484224Z triton_mm_4 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7484452Z triton_mm_6 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7484674Z triton_mm_7 0.0063 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7484900Z triton_mm_1 0.0066 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7485121Z triton_mm_5 0.0068 ms 88.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7485358Z triton_mm_2 0.0069 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7485582Z triton_mm_0 0.0071 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7485817Z triton_mm_3 0.0084 ms 71.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7485858Z _scaled_mm 0.0238 ms 25.2% 2025-12-04T12:10:21.7485985Z SingleProcess AUTOTUNE benchmarking takes 0.0449 seconds and 0.1816 seconds precompiling for 9 choices 2025-12-04T12:10:21.7486060Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7486101Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7486168Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7486266Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7486744Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 6), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7486781Z graph_break [] 2025-12-04T12:10:21.7486842Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 
2025-12-04T12:10:21.7486914Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7486956Z Autotune Choices Stats: 2025-12-04T12:10:21.7487317Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.7487362Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7487406Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7487508Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7487749Z triton_mm_11 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7487973Z triton_mm_14 0.0062 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7488198Z triton_mm_15 0.0084 ms 70.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7488423Z triton_mm_12 0.0088 ms 66.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7488647Z triton_mm_13 0.0088 ms 66.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7488872Z triton_mm_10 0.0091 ms 64.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7489110Z triton_mm_9 0.0092 ms 63.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7489347Z triton_mm_8 0.0096 ms 61.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7489387Z _scaled_mm 0.0248 ms 23.7% 2025-12-04T12:10:21.7489520Z SingleProcess AUTOTUNE benchmarking takes 0.0533 seconds and 0.1724 seconds precompiling for 9 choices 2025-12-04T12:10:21.7489595Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7489638Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7489695Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7489807Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7490325Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7490365Z graph_break [] 2025-12-04T12:10:21.7490426Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7490499Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7490540Z Autotune Choices Stats: 2025-12-04T12:10:21.7490904Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_20", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006318999920040369, "best_triton_pos": 0} 2025-12-04T12:10:21.7490953Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7490997Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7491098Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7491343Z triton_mm_20 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7491574Z triton_mm_18 0.0067 ms 94.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7491802Z triton_mm_23 0.0069 ms 91.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7492026Z triton_mm_22 0.0071 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7492251Z triton_mm_16 0.0075 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7492472Z triton_mm_21 0.0098 ms 64.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7492713Z triton_mm_17 0.0098 ms 64.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7492956Z triton_mm_19 0.0101 ms 62.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7492999Z _scaled_mm 0.0246 ms 25.7% 2025-12-04T12:10:21.7493127Z SingleProcess AUTOTUNE benchmarking takes 0.0642 seconds and 0.1915 seconds precompiling for 9 choices 2025-12-04T12:10:21.7493314Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1c3a860092535db4.xml - 2025-12-04T12:10:21.7493375Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7493973Z FAILED [0.7475s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7493976Z 2025-12-04T12:10:21.7494050Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7494308Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7494312Z 2025-12-04T12:10:21.7494399Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7494461Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7494529Z ================== 1 failed, 187 deselected, 2 rerun in 3.92s ================== 2025-12-04T12:10:21.7494568Z Got exit code 1 2025-12-04T12:10:21.7494607Z Retrying single test... 2025-12-04T12:10:21.7494754Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-4ab75f29e5632c79.xml 2025-12-04T12:10:21.7494810Z ============================= test session starts ============================== 2025-12-04T12:10:21.7494922Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7494961Z cachedir: .pytest_cache 2025-12-04T12:10:21.7495131Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7495178Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7495224Z configfile: pytest.ini 2025-12-04T12:10:21.7495387Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7495463Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7495715Z stepcurrent: skipping 140 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7495761Z Running 1 items in this shard 2025-12-04T12:10:21.7495763Z 2025-12-04T12:10:21.7495975Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.2772s] [100%] 2025-12-04T12:10:21.7496188Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8226s] [100%] 2025-12-04T12:10:21.7496389Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda FAILED [0.7163s] [100%] 2025-12-04T12:10:21.7496391Z 2025-12-04T12:10:21.7496444Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7496586Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7496648Z Traceback (most recent call last): 2025-12-04T12:10:21.7496806Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7496848Z method(*args, **kwargs) 2025-12-04T12:10:21.7497001Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7497043Z method(*args, **kwargs) 2025-12-04T12:10:21.7497194Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7497233Z with policy(): 2025-12-04T12:10:21.7497395Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7497438Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7497826Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.7497828Z 2025-12-04T12:10:21.7497902Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7498159Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7498162Z 2025-12-04T12:10:21.7498249Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7498322Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7498367Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7498422Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7498917Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7499016Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7499056Z graph_break [] 2025-12-04T12:10:21.7499117Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7499191Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7499676Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7499723Z current_size = base.storage().size() 2025-12-04T12:10:21.7499766Z Autotune Choices Stats: 2025-12-04T12:10:21.7500172Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005998999811708927, "best_triton_pos": 0} 2025-12-04T12:10:21.7500234Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7500276Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7500377Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7500609Z triton_mm_3 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7500852Z triton_mm_0 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7501077Z triton_mm_6 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7501316Z triton_mm_2 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7501540Z triton_mm_5 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7501763Z triton_mm_1 0.0064 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7501987Z triton_mm_4 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7502209Z triton_mm_7 0.0084 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7502252Z _scaled_mm 0.0210 ms 28.6% 2025-12-04T12:10:21.7502382Z SingleProcess AUTOTUNE benchmarking takes 0.0436 seconds and 0.1886 seconds precompiling for 9 choices 2025-12-04T12:10:21.7502524Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7502571Z Traceback (most recent call last): 2025-12-04T12:10:21.7502749Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7502791Z method(*args, **kwargs) 2025-12-04T12:10:21.7502943Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7502985Z method(*args, **kwargs) 2025-12-04T12:10:21.7503135Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7503173Z with policy(): 2025-12-04T12:10:21.7503325Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7503368Z raise RuntimeError(msg) 2025-12-04T12:10:21.7503759Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.7503761Z 2025-12-04T12:10:21.7503835Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7504104Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7504108Z 2025-12-04T12:10:21.7504195Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7504279Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7504324Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7504380Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7504859Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7504958Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7504995Z graph_break [] 2025-12-04T12:10:21.7505065Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7505138Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7505620Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7505667Z current_size = base.storage().size() 2025-12-04T12:10:21.7505709Z Autotune Choices Stats: 2025-12-04T12:10:21.7506078Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005998999811708927, "best_triton_pos": 0} 2025-12-04T12:10:21.7506124Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7506165Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7506265Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7506494Z triton_mm_3 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7506731Z triton_mm_0 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7506956Z triton_mm_6 0.0060 ms 99.3% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7507179Z triton_mm_2 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7507400Z triton_mm_5 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7507623Z triton_mm_1 0.0064 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7507866Z triton_mm_4 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7508089Z triton_mm_7 0.0084 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7508142Z _scaled_mm 0.0210 ms 28.6% 2025-12-04T12:10:21.7508271Z SingleProcess AUTOTUNE benchmarking takes 0.0436 seconds and 0.1886 seconds precompiling for 9 choices 2025-12-04T12:10:21.7508343Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7508388Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7508444Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7508543Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7509034Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7509072Z graph_break [] 2025-12-04T12:10:21.7509134Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7509206Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7509247Z Autotune Choices Stats: 2025-12-04T12:10:21.7509605Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005878999829292297, "best_triton_pos": 0} 2025-12-04T12:10:21.7509655Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7509697Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7509796Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7510026Z triton_mm_11 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7510306Z triton_mm_13 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7510536Z triton_mm_9 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7510762Z triton_mm_12 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7510985Z triton_mm_15 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7511208Z triton_mm_14 0.0062 ms 94.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7511433Z triton_mm_10 0.0063 ms 93.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7511673Z triton_mm_8 0.0066 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7511727Z _scaled_mm 0.0207 ms 28.4% 2025-12-04T12:10:21.7511855Z SingleProcess AUTOTUNE benchmarking takes 0.0405 seconds and 0.0943 seconds precompiling for 9 choices 2025-12-04T12:10:21.7511907Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7512053Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7512099Z Traceback (most recent call last): 2025-12-04T12:10:21.7512263Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7512305Z method(*args, **kwargs) 2025-12-04T12:10:21.7512475Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7512516Z method(*args, **kwargs) 2025-12-04T12:10:21.7512669Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7512707Z with policy(): 2025-12-04T12:10:21.7512859Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7512900Z raise RuntimeError(msg) 2025-12-04T12:10:21.7513287Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7513291Z 2025-12-04T12:10:21.7513366Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7513625Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7513628Z 2025-12-04T12:10:21.7513714Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7513786Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7513828Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7513883Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7514372Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7514473Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7514510Z graph_break [] 2025-12-04T12:10:21.7514571Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7514643Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7515128Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7515174Z current_size = base.storage().size() 2025-12-04T12:10:21.7515229Z Autotune Choices Stats: 2025-12-04T12:10:21.7515590Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005998999811708927, "best_triton_pos": 0} 2025-12-04T12:10:21.7515646Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7515687Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7515787Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7516018Z triton_mm_3 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7516244Z triton_mm_0 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7516478Z triton_mm_6 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7516703Z triton_mm_2 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7516925Z triton_mm_5 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7517149Z triton_mm_1 0.0064 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7517374Z triton_mm_4 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7517595Z triton_mm_7 0.0084 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7517636Z _scaled_mm 0.0210 ms 28.6% 2025-12-04T12:10:21.7517773Z SingleProcess AUTOTUNE benchmarking takes 0.0436 seconds and 0.1886 seconds precompiling for 9 choices 2025-12-04T12:10:21.7517847Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7517890Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7517946Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7518046Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7518522Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7518559Z graph_break [] 2025-12-04T12:10:21.7518620Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 
2025-12-04T12:10:21.7518693Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7518734Z Autotune Choices Stats: 2025-12-04T12:10:21.7519093Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005878999829292297, "best_triton_pos": 0} 2025-12-04T12:10:21.7519148Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7519201Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7519299Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7519531Z triton_mm_11 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7519759Z triton_mm_13 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7519995Z triton_mm_9 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7520254Z triton_mm_12 0.0059 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7520477Z triton_mm_15 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7520701Z triton_mm_14 0.0062 ms 94.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7520928Z triton_mm_10 0.0063 ms 93.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7521152Z triton_mm_8 0.0066 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7521194Z _scaled_mm 0.0207 ms 28.4% 2025-12-04T12:10:21.7521321Z SingleProcess AUTOTUNE benchmarking takes 0.0405 seconds and 0.0943 seconds precompiling for 9 choices 2025-12-04T12:10:21.7521415Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7521457Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7521513Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7521612Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7522089Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7522125Z graph_break [] 2025-12-04T12:10:21.7522186Z aten_mm_info [('aten._scaled_mm.default_257_16_32', 1)] 2025-12-04T12:10:21.7522258Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7522299Z Autotune Choices Stats: 2025-12-04T12:10:21.7522654Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_21", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.005799999926239252, "best_triton_pos": 0} 2025-12-04T12:10:21.7522714Z AUTOTUNE scaled_mm(257x32, 32x16, , ) 2025-12-04T12:10:21.7522754Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7522852Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7523094Z triton_mm_21 0.0058 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7523321Z triton_mm_17 0.0059 ms 98.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7523546Z triton_mm_18 0.0059 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7523783Z triton_mm_22 0.0060 ms 96.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7524010Z triton_mm_20 0.0064 ms 90.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7524237Z triton_mm_19 0.0067 ms 86.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7524459Z triton_mm_23 0.0073 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7524685Z triton_mm_16 0.0089 ms 65.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.7524727Z _scaled_mm 0.0246 ms 23.6% 2025-12-04T12:10:21.7524854Z SingleProcess AUTOTUNE benchmarking takes 0.0611 seconds and 0.1859 seconds precompiling for 9 choices 2025-12-04T12:10:21.7525041Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-4ab75f29e5632c79.xml - 2025-12-04T12:10:21.7525111Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7525692Z FAILED [0.7163s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.7525696Z 2025-12-04T12:10:21.7525768Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7526033Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7526035Z 2025-12-04T12:10:21.7526121Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7526183Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7526261Z ================== 1 failed, 187 deselected, 2 rerun in 3.84s ================== 2025-12-04T12:10:21.7526301Z Got exit code 1 2025-12-04T12:10:21.7526508Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7526634Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7526788Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-8842b1aa6c8bb713.xml 2025-12-04T12:10:21.7526844Z ============================= test session starts ============================== 2025-12-04T12:10:21.7526956Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7526997Z cachedir: .pytest_cache 2025-12-04T12:10:21.7527155Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7527203Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7527244Z configfile: pytest.ini 2025-12-04T12:10:21.7527419Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7527497Z collecting ... 
collected 188 items / 141 deselected / 47 selected 2025-12-04T12:10:21.7527551Z stepcurrent: skipping 141 already run items. 2025-12-04T12:10:21.7527596Z Running 47 items in this shard 2025-12-04T12:10:21.7527598Z 2025-12-04T12:10:21.7527818Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [23.6966s] [ 2%] 2025-12-04T12:10:21.7528034Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1714s] [ 2%] 2025-12-04T12:10:21.7528225Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda FAILED [1.0719s] [ 2%] 2025-12-04T12:10:21.7528229Z 2025-12-04T12:10:21.7528281Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7528429Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7528477Z Traceback (most recent call last): 2025-12-04T12:10:21.7528636Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7528680Z method(*args, **kwargs) 2025-12-04T12:10:21.7528845Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7528885Z method(*args, **kwargs) 2025-12-04T12:10:21.7529035Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7529072Z with policy(): 2025-12-04T12:10:21.7529226Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7529267Z raise RuntimeError(msg) 
2025-12-04T12:10:21.7529658Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1077936128. 2025-12-04T12:10:21.7529662Z 2025-12-04T12:10:21.7529734Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7529997Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7530008Z 2025-12-04T12:10:21.7530127Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7530201Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7530245Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7530302Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7530809Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7530908Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7530946Z graph_break [] 2025-12-04T12:10:21.7531009Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7531082Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7531576Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7531626Z current_size = base.storage().size() 2025-12-04T12:10:21.7531668Z Autotune Choices Stats: 2025-12-04T12:10:21.7532038Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7532086Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7532128Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7532230Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7532464Z triton_mm_5 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7532694Z triton_mm_16 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7532929Z triton_mm_8 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7533158Z triton_mm_10 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7533383Z triton_mm_13 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7533606Z triton_mm_18 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7533828Z triton_mm_14 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7534067Z triton_mm_11 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7534291Z triton_mm_15 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7534525Z triton_mm_17 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7534653Z SingleProcess AUTOTUNE benchmarking takes 0.0807 seconds and 0.4379 seconds precompiling for 21 choices 2025-12-04T12:10:21.7534805Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7534853Z 
Traceback (most recent call last): 2025-12-04T12:10:21.7535024Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7535065Z method(*args, **kwargs) 2025-12-04T12:10:21.7535220Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7535260Z method(*args, **kwargs) 2025-12-04T12:10:21.7535413Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7535450Z with policy(): 2025-12-04T12:10:21.7535604Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7535647Z raise RuntimeError(msg) 2025-12-04T12:10:21.7536040Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1077936128 and is now 1136656384. 
2025-12-04T12:10:21.7536043Z 2025-12-04T12:10:21.7536117Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7536378Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7536380Z 2025-12-04T12:10:21.7536467Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7536548Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7536593Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7536648Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7537135Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7537233Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7537270Z graph_break [] 2025-12-04T12:10:21.7537332Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7537410Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7537896Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7537954Z current_size = base.storage().size() 2025-12-04T12:10:21.7537995Z Autotune Choices Stats: 2025-12-04T12:10:21.7538371Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7538418Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7538460Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7538559Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7538790Z triton_mm_5 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7539030Z triton_mm_16 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7539255Z triton_mm_8 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7539477Z triton_mm_10 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7539701Z triton_mm_13 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7539926Z triton_mm_18 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7540185Z triton_mm_14 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7540431Z triton_mm_11 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7540656Z triton_mm_15 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7540881Z triton_mm_17 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7541009Z SingleProcess AUTOTUNE benchmarking takes 0.0807 seconds and 0.4379 seconds precompiling for 21 choices 2025-12-04T12:10:21.7541082Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7541125Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7541183Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7541282Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7541765Z inductor [('triton_bundler_save_kernel', 168), 
('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7541829Z graph_break [] 2025-12-04T12:10:21.7541891Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7541964Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7542005Z Autotune Choices Stats: 2025-12-04T12:10:21.7542366Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.7542411Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7542454Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7542565Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7542796Z triton_mm_33 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7543019Z triton_mm_38 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7543245Z triton_mm_30 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7543472Z triton_mm_36 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7543695Z triton_mm_31 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7543921Z triton_mm_37 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7544153Z triton_mm_29 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7544382Z triton_mm_25 0.0070 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7544606Z triton_mm_27 0.0070 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7544832Z triton_mm_22 0.0071 ms 87.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7544962Z SingleProcess AUTOTUNE benchmarking takes 0.1335 seconds and 0.4272 seconds precompiling for 21 choices 
2025-12-04T12:10:21.7545014Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7545171Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7545220Z Traceback (most recent call last): 2025-12-04T12:10:21.7545376Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7545430Z method(*args, **kwargs) 2025-12-04T12:10:21.7545581Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7545621Z method(*args, **kwargs) 2025-12-04T12:10:21.7545771Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7545808Z with policy(): 2025-12-04T12:10:21.7545959Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7546001Z raise RuntimeError(msg) 2025-12-04T12:10:21.7546408Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 
2025-12-04T12:10:21.7546412Z 2025-12-04T12:10:21.7546486Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7546747Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7546749Z 2025-12-04T12:10:21.7546836Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7546908Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7546952Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7547008Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7547493Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7547591Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7547628Z graph_break [] 2025-12-04T12:10:21.7547689Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7547777Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7548262Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7548309Z current_size = base.storage().size() 2025-12-04T12:10:21.7548350Z Autotune Choices Stats: 2025-12-04T12:10:21.7548721Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7548768Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7548809Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7548908Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7549149Z triton_mm_5 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7549380Z triton_mm_16 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7549620Z triton_mm_8 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7549843Z triton_mm_10 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7550075Z triton_mm_13 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7550331Z triton_mm_18 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7550555Z triton_mm_14 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7550783Z triton_mm_11 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7551008Z triton_mm_15 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7551231Z triton_mm_17 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7551358Z SingleProcess AUTOTUNE benchmarking takes 0.0807 seconds and 0.4379 seconds precompiling for 21 choices 2025-12-04T12:10:21.7551431Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7551489Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7551546Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7551645Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7552130Z inductor [('triton_bundler_save_kernel', 168), 
('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7552170Z graph_break [] 2025-12-04T12:10:21.7552231Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7552304Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7552343Z Autotune Choices Stats: 2025-12-04T12:10:21.7552702Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_33", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.7552765Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7552808Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7552905Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7553150Z triton_mm_33 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7553373Z triton_mm_38 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7553595Z triton_mm_30 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7553833Z triton_mm_36 0.0066 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7554057Z triton_mm_31 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7554280Z triton_mm_37 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7554501Z triton_mm_29 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7554727Z triton_mm_25 0.0070 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7554954Z triton_mm_27 0.0070 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7555190Z triton_mm_22 0.0071 ms 87.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7555318Z SingleProcess AUTOTUNE benchmarking takes 0.1335 seconds and 0.4272 seconds precompiling for 21 choices 
2025-12-04T12:10:21.7555391Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7555434Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7555489Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7555588Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7556069Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7556107Z graph_break [] 2025-12-04T12:10:21.7556168Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7556252Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7556291Z Autotune Choices Stats: 2025-12-04T12:10:21.7556654Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_56", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.7556711Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7556752Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7556850Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7557083Z triton_mm_56 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.7557319Z triton_mm_58 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7557544Z triton_mm_47 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7557768Z triton_mm_53 0.0065 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7557993Z triton_mm_55 0.0066 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7558217Z triton_mm_46 0.0067 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7558440Z triton_mm_48 0.0068 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7558662Z triton_mm_49 0.0070 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7558897Z triton_mm_52 0.0073 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7559126Z triton_mm_44 0.0075 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7559255Z SingleProcess AUTOTUNE benchmarking takes 0.1536 seconds and 0.3899 seconds precompiling for 21 choices 2025-12-04T12:10:21.7559447Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-8842b1aa6c8bb713.xml - 2025-12-04T12:10:21.7559507Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7560127Z FAILED [1.0719s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 2025-12-04T12:10:21.7560144Z 2025-12-04T12:10:21.7560217Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7560477Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7560492Z 2025-12-04T12:10:21.7560579Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7560641Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7560714Z ================= 1 failed, 141 deselected, 2 rerun in 25.96s ================== 2025-12-04T12:10:21.7560752Z Got exit code 1 2025-12-04T12:10:21.7560793Z Retrying single test... 2025-12-04T12:10:21.7560934Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1b8472a77b12e9a5.xml 2025-12-04T12:10:21.7560993Z ============================= test session starts ============================== 2025-12-04T12:10:21.7561117Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7561159Z cachedir: .pytest_cache 2025-12-04T12:10:21.7561319Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7561367Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7561407Z configfile: pytest.ini 2025-12-04T12:10:21.7561573Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7561648Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7561905Z stepcurrent: skipping 141 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7561951Z Running 1 items in this shard 2025-12-04T12:10:21.7561953Z 2025-12-04T12:10:21.7562174Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [29.7655s] [100%] 2025-12-04T12:10:21.7562391Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1044s] [100%] 2025-12-04T12:10:21.7562586Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.8061s] [100%] 2025-12-04T12:10:21.7562606Z 2025-12-04T12:10:21.7562660Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7562805Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7562855Z Traceback (most recent call last): 2025-12-04T12:10:21.7563015Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7563057Z method(*args, **kwargs) 2025-12-04T12:10:21.7563209Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7563253Z method(*args, **kwargs) 2025-12-04T12:10:21.7563404Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7563441Z with policy(): 2025-12-04T12:10:21.7563594Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7563637Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7564028Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1077936128. 2025-12-04T12:10:21.7564050Z 2025-12-04T12:10:21.7564136Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7564397Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7564399Z 2025-12-04T12:10:21.7564485Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7564559Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7564602Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7564658Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7565153Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7565253Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7565289Z graph_break [] 2025-12-04T12:10:21.7565352Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7565425Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7565910Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7565958Z current_size = base.storage().size() 2025-12-04T12:10:21.7565998Z Autotune Choices Stats: 2025-12-04T12:10:21.7566370Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7566416Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7566469Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7566569Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7566808Z triton_mm_13 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7567037Z triton_mm_15 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7567262Z triton_mm_7 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7567487Z triton_mm_17 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7567708Z triton_mm_10 0.0065 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7567945Z triton_mm_5 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7568180Z triton_mm_6 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7568409Z triton_mm_14 0.0068 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7568636Z triton_mm_2 0.0072 ms 84.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7568869Z triton_mm_3 0.0076 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7568999Z SingleProcess AUTOTUNE benchmarking takes 0.0969 seconds and 0.4617 seconds precompiling for 21 choices 2025-12-04T12:10:21.7569144Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7569194Z 
Traceback (most recent call last): 2025-12-04T12:10:21.7569350Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7569391Z method(*args, **kwargs) 2025-12-04T12:10:21.7569544Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7569585Z method(*args, **kwargs) 2025-12-04T12:10:21.7569735Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7569775Z with policy(): 2025-12-04T12:10:21.7569928Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7569970Z raise RuntimeError(msg) 2025-12-04T12:10:21.7570410Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1077936128 and is now 1136656384. 
2025-12-04T12:10:21.7570412Z 2025-12-04T12:10:21.7570486Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7570745Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7570747Z 2025-12-04T12:10:21.7570833Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7570908Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7570951Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7571009Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7571494Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7571608Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7571646Z graph_break [] 2025-12-04T12:10:21.7571707Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7571797Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7572279Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7572326Z current_size = base.storage().size() 2025-12-04T12:10:21.7572366Z Autotune Choices Stats: 2025-12-04T12:10:21.7572742Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7572788Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7572830Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7572928Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7573164Z triton_mm_13 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7573393Z triton_mm_15 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7573618Z triton_mm_7 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7573842Z triton_mm_17 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7574075Z triton_mm_10 0.0065 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7574300Z triton_mm_5 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7574523Z triton_mm_6 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7574746Z triton_mm_14 0.0068 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7574972Z triton_mm_2 0.0072 ms 84.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7575195Z triton_mm_3 0.0076 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7575338Z SingleProcess AUTOTUNE benchmarking takes 0.0969 seconds and 0.4617 seconds precompiling for 21 choices 2025-12-04T12:10:21.7575409Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7575466Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7575522Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7575620Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7576101Z inductor [('triton_bundler_save_kernel', 168), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('async_compile_cache_miss', 14), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7576139Z graph_break [] 2025-12-04T12:10:21.7576201Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7576283Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7576324Z Autotune Choices Stats: 2025-12-04T12:10:21.7576684Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_32", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:21.7576730Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7576773Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7576872Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7577104Z triton_mm_32 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7577339Z triton_mm_27 0.0064 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7577567Z triton_mm_38 0.0066 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7577815Z triton_mm_30 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7578038Z triton_mm_37 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7578261Z triton_mm_34 0.0067 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7578488Z triton_mm_31 0.0068 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7578711Z triton_mm_28 0.0069 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7578936Z triton_mm_35 0.0072 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7579179Z triton_mm_24 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7579319Z SingleProcess AUTOTUNE benchmarking takes 0.1191 seconds and 0.2648 seconds precompiling for 21 choices 
2025-12-04T12:10:21.7579373Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7579520Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7579567Z Traceback (most recent call last): 2025-12-04T12:10:21.7579723Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7579769Z method(*args, **kwargs) 2025-12-04T12:10:21.7579930Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7579972Z method(*args, **kwargs) 2025-12-04T12:10:21.7580155Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7580194Z with policy(): 2025-12-04T12:10:21.7580345Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7580388Z raise RuntimeError(msg) 2025-12-04T12:10:21.7580779Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 
2025-12-04T12:10:21.7580782Z 2025-12-04T12:10:21.7580855Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7581115Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7581118Z 2025-12-04T12:10:21.7581203Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7581276Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7581319Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7581377Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7581878Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7581978Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7582017Z graph_break [] 2025-12-04T12:10:21.7582079Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7582153Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7582634Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7582697Z current_size = base.storage().size() 2025-12-04T12:10:21.7582737Z Autotune Choices Stats: 2025-12-04T12:10:21.7583102Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_13", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7583159Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7583200Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7583299Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7583530Z triton_mm_13 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7583759Z triton_mm_15 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7583999Z triton_mm_7 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7584226Z triton_mm_17 0.0063 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7584450Z triton_mm_10 0.0065 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7584675Z triton_mm_5 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7584898Z triton_mm_6 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7585121Z triton_mm_14 0.0068 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7585355Z triton_mm_2 0.0072 ms 84.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=256, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7585581Z triton_mm_3 0.0076 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7585712Z SingleProcess AUTOTUNE benchmarking takes 0.0969 seconds and 0.4617 seconds precompiling for 21 choices 2025-12-04T12:10:21.7585785Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7585828Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7585884Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7585982Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7586469Z inductor [('triton_bundler_save_kernel', 168), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('async_compile_cache_miss', 14), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7586516Z graph_break [] 2025-12-04T12:10:21.7586579Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7586653Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7586705Z Autotune Choices Stats: 2025-12-04T12:10:21.7587065Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_32", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:21.7587110Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7587152Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7587250Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7587491Z triton_mm_32 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7587720Z triton_mm_27 0.0064 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7587944Z triton_mm_38 0.0066 ms 93.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7588165Z triton_mm_30 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7588394Z triton_mm_37 0.0067 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7588616Z triton_mm_34 0.0067 ms 92.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7588842Z triton_mm_31 0.0068 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7589075Z triton_mm_28 0.0069 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7589301Z triton_mm_35 0.0072 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7589529Z triton_mm_24 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7589657Z SingleProcess AUTOTUNE benchmarking takes 0.1191 seconds and 0.2648 seconds precompiling for 21 choices 
2025-12-04T12:10:21.7589729Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7589772Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7589828Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7589925Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7590469Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7590519Z graph_break [] 2025-12-04T12:10:21.7590581Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7590654Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7590693Z Autotune Choices Stats: 2025-12-04T12:10:21.7591057Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_52", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.7591124Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7591166Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7591264Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7591495Z triton_mm_52 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.7591719Z triton_mm_50 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7591944Z triton_mm_57 0.0064 ms 95.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7592170Z triton_mm_51 0.0070 ms 87.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7592392Z triton_mm_53 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7592631Z triton_mm_49 0.0071 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7592854Z triton_mm_58 0.0071 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7593080Z triton_mm_54 0.0072 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7593308Z triton_mm_47 0.0072 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7593536Z triton_mm_55 0.0073 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7593663Z SingleProcess AUTOTUNE benchmarking takes 0.1554 seconds and 0.2308 seconds precompiling for 21 choices 2025-12-04T12:10:21.7593865Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-1b8472a77b12e9a5.xml - 2025-12-04T12:10:21.7593927Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7594513Z FAILED [0.8061s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 2025-12-04T12:10:21.7594529Z 2025-12-04T12:10:21.7594603Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7594864Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7594867Z 2025-12-04T12:10:21.7594963Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7595026Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7595095Z ================= 1 failed, 187 deselected, 2 rerun in 31.70s ================== 2025-12-04T12:10:21.7595133Z Got exit code 1 2025-12-04T12:10:21.7595173Z Retrying single test... 2025-12-04T12:10:21.7595317Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0bcf0c73b7944047.xml 2025-12-04T12:10:21.7595373Z ============================= test session starts ============================== 2025-12-04T12:10:21.7595485Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7595525Z cachedir: .pytest_cache 2025-12-04T12:10:21.7595683Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7595729Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7595771Z configfile: pytest.ini 2025-12-04T12:10:21.7595933Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7596009Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7596263Z stepcurrent: skipping 141 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7596318Z Running 1 items in this shard 2025-12-04T12:10:21.7596321Z 2025-12-04T12:10:21.7596539Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [33.4154s] [100%] 2025-12-04T12:10:21.7596754Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1928s] [100%] 2025-12-04T12:10:21.7596947Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda FAILED [1.2949s] [100%] 2025-12-04T12:10:21.7596950Z 2025-12-04T12:10:21.7597000Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7597147Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7597194Z Traceback (most recent call last): 2025-12-04T12:10:21.7597352Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7597395Z method(*args, **kwargs) 2025-12-04T12:10:21.7597562Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7597601Z method(*args, **kwargs) 2025-12-04T12:10:21.7597753Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7597803Z with policy(): 2025-12-04T12:10:21.7597955Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7597997Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7598386Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1077936128. 2025-12-04T12:10:21.7598388Z 2025-12-04T12:10:21.7598464Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7598731Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7598734Z 2025-12-04T12:10:21.7598822Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7598896Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7598941Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7598997Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7599483Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7599584Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7599622Z graph_break [] 2025-12-04T12:10:21.7599685Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7599759Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7600291Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7600339Z current_size = base.storage().size() 2025-12-04T12:10:21.7600382Z Autotune Choices Stats: 2025-12-04T12:10:21.7600750Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.7600796Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7600838Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7600938Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7601171Z triton_mm_12 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7601399Z triton_mm_16 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7601644Z triton_mm_8 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7601881Z triton_mm_14 0.0069 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7602107Z triton_mm_11 0.0070 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7602331Z triton_mm_7 0.0070 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7602565Z triton_mm_13 0.0070 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7602791Z triton_mm_3 0.0073 ms 84.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7603015Z triton_mm_15 0.0073 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7603237Z triton_mm_18 0.0076 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7603365Z SingleProcess AUTOTUNE benchmarking takes 0.0989 seconds and 0.4522 seconds precompiling for 21 choices 2025-12-04T12:10:21.7603512Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7603560Z 
Traceback (most recent call last): 2025-12-04T12:10:21.7603715Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7603755Z method(*args, **kwargs) 2025-12-04T12:10:21.7603910Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7603961Z method(*args, **kwargs) 2025-12-04T12:10:21.7604114Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7604152Z with policy(): 2025-12-04T12:10:21.7604305Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7604349Z raise RuntimeError(msg) 2025-12-04T12:10:21.7604742Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1077936128 and is now 1136656384. 
2025-12-04T12:10:21.7604745Z 2025-12-04T12:10:21.7604819Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7605079Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7605081Z 2025-12-04T12:10:21.7605177Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7605250Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7605294Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7605349Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7605858Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7605957Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7605993Z graph_break [] 2025-12-04T12:10:21.7606055Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7606129Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7606626Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7606675Z current_size = base.storage().size() 2025-12-04T12:10:21.7606717Z Autotune Choices Stats: 2025-12-04T12:10:21.7607083Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.7607130Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7607171Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7607274Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7607508Z triton_mm_12 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7607736Z triton_mm_16 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7607972Z triton_mm_8 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7608198Z triton_mm_14 0.0069 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7608423Z triton_mm_11 0.0070 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7608652Z triton_mm_7 0.0070 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7608877Z triton_mm_13 0.0070 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7609113Z triton_mm_3 0.0073 ms 84.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7609337Z triton_mm_15 0.0073 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7609573Z triton_mm_18 0.0076 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7609701Z SingleProcess AUTOTUNE benchmarking takes 0.0989 seconds and 0.4522 seconds precompiling for 21 choices 2025-12-04T12:10:21.7609774Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7609818Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7609875Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7609982Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7610505Z inductor [('triton_bundler_save_kernel', 168), 
('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7610544Z graph_break [] 2025-12-04T12:10:21.7610605Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7610681Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7610721Z Autotune Choices Stats: 2025-12-04T12:10:21.7611088Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_32", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.7611133Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7611176Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7611273Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7611519Z triton_mm_32 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7611745Z triton_mm_36 0.0063 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7611971Z triton_mm_35 0.0064 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7612194Z triton_mm_34 0.0064 ms 95.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7612417Z triton_mm_38 0.0064 ms 95.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7612640Z triton_mm_33 0.0065 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7612880Z triton_mm_31 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7613119Z triton_mm_25 0.0070 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7613344Z triton_mm_30 0.0070 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7613567Z triton_mm_37 0.0070 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7613709Z SingleProcess AUTOTUNE benchmarking takes 0.1336 seconds and 0.4272 seconds precompiling for 21 choices 
2025-12-04T12:10:21.7613761Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7613909Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7613955Z Traceback (most recent call last): 2025-12-04T12:10:21.7614110Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7614152Z method(*args, **kwargs) 2025-12-04T12:10:21.7614305Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7614346Z method(*args, **kwargs) 2025-12-04T12:10:21.7614498Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7614537Z with policy(): 2025-12-04T12:10:21.7614691Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7614738Z raise RuntimeError(msg) 2025-12-04T12:10:21.7615130Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 
2025-12-04T12:10:21.7615133Z 2025-12-04T12:10:21.7615218Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7615479Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7615482Z 2025-12-04T12:10:21.7615571Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7615645Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7615690Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7615746Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7616235Z inductor [('triton_bundler_save_kernel', 168), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7616333Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7616380Z graph_break [] 2025-12-04T12:10:21.7616442Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7616516Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7616998Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7617054Z current_size = base.storage().size() 2025-12-04T12:10:21.7617096Z Autotune Choices Stats: 2025-12-04T12:10:21.7617465Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.7617511Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7617565Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7617666Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7617900Z triton_mm_12 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7618129Z triton_mm_16 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7618353Z triton_mm_8 0.0063 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7618578Z triton_mm_14 0.0069 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7618804Z triton_mm_11 0.0070 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7619038Z triton_mm_7 0.0070 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7619261Z triton_mm_13 0.0070 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7619486Z triton_mm_3 0.0073 ms 84.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7619712Z triton_mm_15 0.0073 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7619936Z triton_mm_18 0.0076 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7620063Z SingleProcess AUTOTUNE benchmarking takes 0.0989 seconds and 0.4522 seconds precompiling for 21 choices 2025-12-04T12:10:21.7620190Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7620234Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7620292Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7620389Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7620887Z inductor [('triton_bundler_save_kernel', 168), 
('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7620927Z graph_break [] 2025-12-04T12:10:21.7620987Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7621061Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7621101Z Autotune Choices Stats: 2025-12-04T12:10:21.7621478Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_32", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.7621524Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7621566Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7621665Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7621901Z triton_mm_32 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7622126Z triton_mm_36 0.0063 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7622351Z triton_mm_35 0.0064 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7622573Z triton_mm_34 0.0064 ms 95.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7622808Z triton_mm_38 0.0064 ms 95.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7623033Z triton_mm_33 0.0065 ms 93.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7623259Z triton_mm_31 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7623485Z triton_mm_25 0.0070 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7623711Z triton_mm_30 0.0070 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7623935Z triton_mm_37 0.0070 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7624081Z SingleProcess AUTOTUNE benchmarking takes 0.1336 seconds and 0.4272 seconds precompiling for 21 choices 
2025-12-04T12:10:21.7624165Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7624209Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7624265Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7624363Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7624841Z inductor [('triton_bundler_save_kernel', 168), ('async_compile_cache_miss', 22), ('benchmarking.InductorBenchmarker.benchmark_gpu', 21), ('generated_module_cache_miss', 20), ('select_algorithm_num_precompiles', 20), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7624880Z graph_break [] 2025-12-04T12:10:21.7624954Z aten_mm_info [('aten._scaled_mm.default_257_2048_32', 1)] 2025-12-04T12:10:21.7625027Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7625069Z Autotune Choices Stats: 2025-12-04T12:10:21.7625426Z {"num_choices": 21, "num_triton_choices": 20, "best_kernel": "triton_mm_48", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006320000160485506, "best_triton_pos": 0} 2025-12-04T12:10:21.7625471Z AUTOTUNE scaled_mm(257x32, 32x2048, , ) 2025-12-04T12:10:21.7625511Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7625609Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7625842Z triton_mm_48 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.7626069Z triton_mm_49 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7626296Z triton_mm_52 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7626527Z triton_mm_51 0.0065 ms 97.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7626754Z triton_mm_46 0.0067 ms 94.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7626976Z triton_mm_55 0.0067 ms 94.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7627199Z triton_mm_54 0.0069 ms 91.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7627420Z triton_mm_58 0.0070 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7627657Z triton_mm_47 0.0071 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=128, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7627883Z triton_mm_50 0.0071 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7628022Z SingleProcess AUTOTUNE benchmarking takes 0.1504 seconds and 0.3700 seconds precompiling for 21 choices 2025-12-04T12:10:21.7628212Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0bcf0c73b7944047.xml - 2025-12-04T12:10:21.7628273Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7628869Z FAILED [1.2949s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1136656384 and is now 1195376640. 2025-12-04T12:10:21.7628873Z 2025-12-04T12:10:21.7628947Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7629206Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7629210Z 2025-12-04T12:10:21.7629298Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7629359Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7629428Z ================= 1 failed, 187 deselected, 2 rerun in 35.92s ================== 2025-12-04T12:10:21.7629465Z Got exit code 1 2025-12-04T12:10:21.7629674Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7629799Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7629945Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-4139a9adbef9b55b.xml 2025-12-04T12:10:21.7630001Z ============================= test session starts ============================== 2025-12-04T12:10:21.7630168Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7630208Z cachedir: .pytest_cache 2025-12-04T12:10:21.7630367Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7630414Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7630456Z configfile: pytest.ini 2025-12-04T12:10:21.7630618Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7630696Z collecting ... collected 188 items / 142 deselected / 46 selected 2025-12-04T12:10:21.7630751Z stepcurrent: skipping 142 already run items. 
2025-12-04T12:10:21.7630795Z Running 46 items in this shard 2025-12-04T12:10:21.7630797Z 2025-12-04T12:10:21.7631016Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [37.4180s] [ 2%] 2025-12-04T12:10:21.7631231Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8373s] [ 2%] 2025-12-04T12:10:21.7631440Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda FAILED [1.0263s] [ 2%] 2025-12-04T12:10:21.7631442Z 2025-12-04T12:10:21.7631493Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7631655Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7631703Z Traceback (most recent call last): 2025-12-04T12:10:21.7631861Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7631904Z method(*args, **kwargs) 2025-12-04T12:10:21.7632059Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7632099Z method(*args, **kwargs) 2025-12-04T12:10:21.7632252Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7632289Z with policy(): 2025-12-04T12:10:21.7632454Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7632499Z raise RuntimeError(msg) 2025-12-04T12:10:21.7632893Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1038090240. 2025-12-04T12:10:21.7632895Z 2025-12-04T12:10:21.7632972Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7633232Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7633234Z 2025-12-04T12:10:21.7633322Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7633395Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7633440Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7633497Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7633995Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7635364Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7635405Z graph_break [] 2025-12-04T12:10:21.7635471Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7635547Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7636032Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7636083Z current_size = base.storage().size() 2025-12-04T12:10:21.7636125Z Autotune Choices Stats: 2025-12-04T12:10:21.7636501Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.7636566Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7636612Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7636714Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7636960Z triton_mm_9 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7637186Z triton_mm_2 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7637409Z triton_mm_8 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7637645Z triton_mm_3 0.0068 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:21.7637867Z triton_mm_6 0.0072 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7638087Z triton_mm_5 0.0076 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7638309Z triton_mm_4 0.0086 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7638533Z triton_mm_7 0.0093 ms 64.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7638756Z triton_mm_1 0.0112 ms 53.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7638993Z triton_mm_0 0.0119 ms 50.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7639121Z SingleProcess AUTOTUNE benchmarking takes 0.0564 seconds and 0.2361 seconds precompiling for 11 choices 2025-12-04T12:10:21.7639269Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7639317Z Traceback (most recent call last): 2025-12-04T12:10:21.7639474Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7639517Z method(*args, **kwargs) 2025-12-04T12:10:21.7639670Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7639713Z method(*args, **kwargs) 2025-12-04T12:10:21.7639863Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7639901Z with policy(): 2025-12-04T12:10:21.7640054Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7640145Z raise RuntimeError(msg) 2025-12-04T12:10:21.7640552Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1075838976. 
2025-12-04T12:10:21.7640567Z 2025-12-04T12:10:21.7640642Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7640904Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7640906Z 2025-12-04T12:10:21.7640995Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7641069Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7641113Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7641172Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7641669Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7641770Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7641807Z graph_break [] 2025-12-04T12:10:21.7641870Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7641943Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7642426Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7642473Z current_size = base.storage().size() 2025-12-04T12:10:21.7642516Z Autotune Choices Stats: 2025-12-04T12:10:21.7642881Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.7642941Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7642987Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7643085Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7643322Z triton_mm_9 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7643549Z triton_mm_2 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7643772Z triton_mm_8 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7643995Z triton_mm_3 0.0068 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7644229Z triton_mm_6 0.0072 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7644450Z triton_mm_5 0.0076 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7644681Z triton_mm_4 0.0086 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7644902Z triton_mm_7 0.0093 ms 64.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7645131Z triton_mm_1 0.0112 ms 53.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7645357Z triton_mm_0 0.0119 ms 50.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7645486Z SingleProcess AUTOTUNE benchmarking takes 0.0564 seconds and 0.2361 seconds precompiling for 11 choices 2025-12-04T12:10:21.7645559Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7645607Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7645664Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7645763Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7646247Z inductor [('triton_bundler_save_kernel', 88), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7646285Z graph_break [] 2025-12-04T12:10:21.7646346Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7646419Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7646459Z Autotune Choices Stats: 2025-12-04T12:10:21.7646829Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:21.7646876Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7646921Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7647019Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7647248Z triton_mm_16 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7647483Z triton_mm_12 0.0066 ms 96.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7647707Z triton_mm_13 0.0067 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7647941Z triton_mm_17 0.0074 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7648173Z triton_mm_15 0.0076 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7648397Z triton_mm_11 0.0078 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7648622Z triton_mm_19 0.0080 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7648859Z triton_mm_18 0.0083 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7649084Z triton_mm_14 0.0098 ms 64.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7649312Z triton_mm_10 0.0114 ms 56.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7649439Z SingleProcess AUTOTUNE benchmarking takes 0.0525 seconds and 0.1374 seconds precompiling for 11 choices 
2025-12-04T12:10:21.7649492Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7649638Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7649686Z Traceback (most recent call last): 2025-12-04T12:10:21.7649844Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7649889Z method(*args, **kwargs) 2025-12-04T12:10:21.7650041Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7650083Z method(*args, **kwargs) 2025-12-04T12:10:21.7650273Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7650313Z with policy(): 2025-12-04T12:10:21.7650465Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7650510Z raise RuntimeError(msg) 2025-12-04T12:10:21.7650902Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1113587712. 
2025-12-04T12:10:21.7650905Z 2025-12-04T12:10:21.7650979Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7651239Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7651242Z 2025-12-04T12:10:21.7651331Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7651402Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7651460Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7651516Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7652001Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7652118Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7652155Z graph_break [] 2025-12-04T12:10:21.7652219Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7652290Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7652783Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7652831Z current_size = base.storage().size() 2025-12-04T12:10:21.7652873Z Autotune Choices Stats: 2025-12-04T12:10:21.7653238Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.7653284Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7653328Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7653429Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7653663Z triton_mm_9 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7653889Z triton_mm_2 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7654121Z triton_mm_8 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7654345Z triton_mm_3 0.0068 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7654569Z triton_mm_6 0.0072 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7654791Z triton_mm_5 0.0076 ms 79.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7655011Z triton_mm_4 0.0086 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7655232Z triton_mm_7 0.0093 ms 64.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7655462Z triton_mm_1 0.0112 ms 53.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7655695Z triton_mm_0 0.0119 ms 50.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7655821Z SingleProcess AUTOTUNE benchmarking takes 0.0564 seconds and 0.2361 seconds precompiling for 11 choices 2025-12-04T12:10:21.7655895Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7655941Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7655997Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7656096Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7656587Z inductor [('triton_bundler_save_kernel', 88), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7656627Z graph_break [] 2025-12-04T12:10:21.7656688Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7656760Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7656800Z Autotune Choices Stats: 2025-12-04T12:10:21.7657161Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0063599999994039536, "best_triton_pos": 0} 2025-12-04T12:10:21.7657207Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7657253Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7657351Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7657582Z triton_mm_16 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7657821Z triton_mm_12 0.0066 ms 96.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7658042Z triton_mm_13 0.0067 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7658267Z triton_mm_17 0.0074 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7658491Z triton_mm_15 0.0076 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7658713Z triton_mm_11 0.0078 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7658939Z triton_mm_19 0.0080 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7659171Z triton_mm_18 0.0083 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7659406Z triton_mm_14 0.0098 ms 64.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7659627Z triton_mm_10 0.0114 ms 56.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7659754Z SingleProcess AUTOTUNE benchmarking takes 0.0525 seconds and 0.1374 seconds precompiling for 11 choices 
2025-12-04T12:10:21.7659826Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7659880Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7659937Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7660037Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7660550Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7660586Z graph_break [] 2025-12-04T12:10:21.7660647Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7660720Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7660763Z Autotune Choices Stats: 2025-12-04T12:10:21.7661121Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_26", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006680000107735395, "best_triton_pos": 0} 2025-12-04T12:10:21.7661168Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7661211Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7661308Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7661552Z triton_mm_26 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 
2025-12-04T12:10:21.7661780Z triton_mm_24 0.0068 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7662002Z triton_mm_28 0.0069 ms 97.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7662226Z triton_mm_23 0.0071 ms 94.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7662455Z triton_mm_22 0.0074 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7662693Z triton_mm_27 0.0076 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7662917Z triton_mm_25 0.0076 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7663161Z triton_mm_29 0.0078 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7663383Z triton_mm_21 0.0084 ms 79.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7663618Z triton_mm_20 0.0118 ms 56.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7663746Z SingleProcess AUTOTUNE benchmarking takes 0.0523 seconds and 0.1263 seconds precompiling for 11 choices 2025-12-04T12:10:21.7663935Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-4139a9adbef9b55b.xml - 2025-12-04T12:10:21.7663995Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7664585Z FAILED [1.0263s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1113587712. 2025-12-04T12:10:21.7664588Z 2025-12-04T12:10:21.7664664Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7664923Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7664926Z 2025-12-04T12:10:21.7665013Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7665075Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7665152Z ================= 1 failed, 142 deselected, 2 rerun in 39.30s ================== 2025-12-04T12:10:21.7665192Z Got exit code 1 2025-12-04T12:10:21.7665232Z Retrying single test... 2025-12-04T12:10:21.7665375Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-74c92ecca6364584.xml 2025-12-04T12:10:21.7665435Z ============================= test session starts ============================== 2025-12-04T12:10:21.7665546Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7665589Z cachedir: .pytest_cache 2025-12-04T12:10:21.7665747Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7665795Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7665834Z configfile: pytest.ini 2025-12-04T12:10:21.7666000Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7666074Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7666331Z stepcurrent: skipping 142 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7666389Z Running 1 items in this shard 2025-12-04T12:10:21.7666392Z 2025-12-04T12:10:21.7666609Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.4243s] [100%] 2025-12-04T12:10:21.7666831Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1356s] [100%] 2025-12-04T12:10:21.7667022Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.9010s] [100%] 2025-12-04T12:10:21.7667025Z 2025-12-04T12:10:21.7667077Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7667220Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7667269Z Traceback (most recent call last): 2025-12-04T12:10:21.7667437Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7667483Z method(*args, **kwargs) 2025-12-04T12:10:21.7667636Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7667678Z method(*args, **kwargs) 2025-12-04T12:10:21.7667830Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7667867Z with policy(): 2025-12-04T12:10:21.7668021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7668064Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7668458Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1038090240. 2025-12-04T12:10:21.7668461Z 2025-12-04T12:10:21.7668534Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7668793Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7668796Z 2025-12-04T12:10:21.7668882Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7668966Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7669009Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7669066Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7669555Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7669653Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7669691Z graph_break [] 2025-12-04T12:10:21.7669753Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7669827Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7670350Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7670413Z current_size = base.storage().size() 2025-12-04T12:10:21.7670454Z Autotune Choices Stats: 2025-12-04T12:10:21.7670834Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.7670878Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7670923Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7671022Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7671271Z triton_mm_9 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7671500Z triton_mm_2 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7671723Z triton_mm_8 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7671947Z triton_mm_6 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7672168Z triton_mm_3 0.0067 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7672389Z triton_mm_1 0.0077 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7672611Z triton_mm_7 0.0077 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7672842Z triton_mm_5 0.0082 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7673063Z triton_mm_4 0.0106 ms 56.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7673283Z triton_mm_0 0.0112 ms 53.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7673412Z SingleProcess AUTOTUNE benchmarking takes 0.0450 seconds and 0.2406 seconds precompiling for 11 choices 2025-12-04T12:10:21.7673556Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7673604Z Traceback 
(most recent call last): 2025-12-04T12:10:21.7673761Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7673803Z method(*args, **kwargs) 2025-12-04T12:10:21.7673970Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7674010Z method(*args, **kwargs) 2025-12-04T12:10:21.7674164Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7674212Z with policy(): 2025-12-04T12:10:21.7674364Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7674407Z raise RuntimeError(msg) 2025-12-04T12:10:21.7674800Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1075838976. 
2025-12-04T12:10:21.7674803Z 2025-12-04T12:10:21.7674876Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7675143Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7675146Z 2025-12-04T12:10:21.7675233Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7675307Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7675352Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7675408Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7675890Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7675990Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7676027Z graph_break [] 2025-12-04T12:10:21.7676088Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7676163Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7676656Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7676704Z current_size = base.storage().size() 2025-12-04T12:10:21.7676745Z Autotune Choices Stats: 2025-12-04T12:10:21.7677112Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.7677158Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7677200Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7677298Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7677532Z triton_mm_9 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7677759Z triton_mm_2 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7677999Z triton_mm_8 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7678232Z triton_mm_6 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7678455Z triton_mm_3 0.0067 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7678676Z triton_mm_1 0.0077 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7678906Z triton_mm_7 0.0077 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7679127Z triton_mm_5 0.0082 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7679349Z triton_mm_4 0.0106 ms 56.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7679569Z triton_mm_0 0.0112 ms 53.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7679699Z SingleProcess AUTOTUNE benchmarking takes 0.0450 seconds and 0.2406 seconds precompiling for 11 choices 2025-12-04T12:10:21.7679773Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7679816Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7679874Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7679972Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7680505Z inductor [('triton_bundler_save_kernel', 88), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('async_compile_cache_miss', 5), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7680543Z graph_break [] 2025-12-04T12:10:21.7680604Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7680677Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7680718Z Autotune Choices Stats: 2025-12-04T12:10:21.7681082Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_19", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.7681129Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7681171Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7681269Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7681503Z triton_mm_19 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7681744Z triton_mm_16 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7681984Z triton_mm_12 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7682208Z triton_mm_18 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7682432Z triton_mm_13 0.0068 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7682666Z triton_mm_14 0.0069 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7682892Z triton_mm_17 0.0074 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7683115Z triton_mm_15 0.0076 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7683337Z triton_mm_11 0.0079 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7683561Z triton_mm_10 0.0113 ms 52.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7683688Z SingleProcess AUTOTUNE benchmarking takes 0.0535 seconds and 0.4313 seconds precompiling for 11 choices 
2025-12-04T12:10:21.7683742Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7683893Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7683942Z Traceback (most recent call last): 2025-12-04T12:10:21.7684097Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7684140Z method(*args, **kwargs) 2025-12-04T12:10:21.7684292Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7684332Z method(*args, **kwargs) 2025-12-04T12:10:21.7684482Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7684520Z with policy(): 2025-12-04T12:10:21.7684671Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7684714Z raise RuntimeError(msg) 2025-12-04T12:10:21.7685105Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1113587712. 
2025-12-04T12:10:21.7685119Z 2025-12-04T12:10:21.7685192Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7685452Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7685469Z 2025-12-04T12:10:21.7685555Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7685627Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7685671Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7685727Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7686210Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7686319Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7686356Z graph_break [] 2025-12-04T12:10:21.7686418Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7686490Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7686971Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7687019Z current_size = base.storage().size() 2025-12-04T12:10:21.7687061Z Autotune Choices Stats: 2025-12-04T12:10:21.7687430Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.7687476Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7687517Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7687616Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7687858Z triton_mm_9 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7688087Z triton_mm_2 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7688310Z triton_mm_8 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7688534Z triton_mm_6 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7688756Z triton_mm_3 0.0067 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7688976Z triton_mm_1 0.0077 ms 77.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7689209Z triton_mm_7 0.0077 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7689444Z triton_mm_5 0.0082 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7689666Z triton_mm_4 0.0106 ms 56.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7689886Z triton_mm_0 0.0112 ms 53.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7690028Z SingleProcess AUTOTUNE benchmarking takes 0.0450 seconds and 0.2406 seconds precompiling for 11 choices 2025-12-04T12:10:21.7690141Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7690185Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7690241Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7690340Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7690819Z inductor [('triton_bundler_save_kernel', 88), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('async_compile_cache_miss', 5), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7690856Z graph_break [] 2025-12-04T12:10:21.7690918Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7690991Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7691033Z Autotune Choices Stats: 2025-12-04T12:10:21.7691394Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_19", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.7691464Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7691505Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7691603Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7691842Z triton_mm_19 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7692066Z triton_mm_16 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7692293Z triton_mm_12 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7692517Z triton_mm_18 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7692756Z triton_mm_13 0.0068 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7692978Z triton_mm_14 0.0069 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7693213Z triton_mm_17 0.0074 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7693436Z triton_mm_15 0.0076 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7693674Z triton_mm_11 0.0079 ms 75.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7693898Z triton_mm_10 0.0113 ms 52.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7694026Z SingleProcess AUTOTUNE benchmarking takes 0.0535 seconds and 0.4313 seconds precompiling for 11 choices 
2025-12-04T12:10:21.7694099Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7694142Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7694199Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7694296Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7694778Z inductor [('triton_bundler_save_kernel', 88), ('async_compile_cache_miss', 12), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7694816Z graph_break [] 2025-12-04T12:10:21.7694876Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7694948Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7694988Z Autotune Choices Stats: 2025-12-04T12:10:21.7695359Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_29", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00595899997279048, "best_triton_pos": 0} 2025-12-04T12:10:21.7695405Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7695447Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7695545Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7695780Z triton_mm_29 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.7696004Z triton_mm_28 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7696226Z triton_mm_26 0.0061 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7696466Z triton_mm_22 0.0062 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7696699Z triton_mm_24 0.0074 ms 81.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7696922Z triton_mm_23 0.0074 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7697143Z triton_mm_27 0.0074 ms 80.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7697189Z _scaled_mm 0.0075 ms 79.2% 2025-12-04T12:10:21.7697420Z triton_mm_21 0.0076 ms 78.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7697645Z triton_mm_25 0.0080 ms 74.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, 
USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7697773Z SingleProcess AUTOTUNE benchmarking takes 0.0613 seconds and 0.3577 seconds precompiling for 11 choices 2025-12-04T12:10:21.7697963Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-74c92ecca6364584.xml - 2025-12-04T12:10:21.7698024Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7698610Z FAILED [0.9010s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1113587712. 2025-12-04T12:10:21.7698615Z 2025-12-04T12:10:21.7698687Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7698957Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7698959Z 2025-12-04T12:10:21.7699045Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7699108Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7699179Z ================== 1 failed, 187 deselected, 2 rerun in 4.48s ================== 2025-12-04T12:10:21.7699218Z Got exit code 1 2025-12-04T12:10:21.7699258Z Retrying single test... 
2025-12-04T12:10:21.7699403Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-aa22ab62148368dd.xml 2025-12-04T12:10:21.7699459Z ============================= test session starts ============================== 2025-12-04T12:10:21.7699571Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7699611Z cachedir: .pytest_cache 2025-12-04T12:10:21.7699772Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7699818Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7699874Z configfile: pytest.ini 2025-12-04T12:10:21.7700037Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7700235Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7700492Z stepcurrent: skipping 142 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7700554Z Running 1 items in this shard 2025-12-04T12:10:21.7700557Z 2025-12-04T12:10:21.7700775Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [46.9015s] [100%] 2025-12-04T12:10:21.7700987Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0900s] [100%] 2025-12-04T12:10:21.7701190Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.9652s] [100%] 2025-12-04T12:10:21.7701193Z 2025-12-04T12:10:21.7701244Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7701390Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7701437Z Traceback (most recent call last): 2025-12-04T12:10:21.7701595Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7701638Z method(*args, **kwargs) 2025-12-04T12:10:21.7701792Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7701832Z method(*args, **kwargs) 2025-12-04T12:10:21.7701984Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7702021Z with policy(): 2025-12-04T12:10:21.7702174Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7702218Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7702607Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1038090240. 2025-12-04T12:10:21.7702609Z 2025-12-04T12:10:21.7702697Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7702958Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7702961Z 2025-12-04T12:10:21.7703049Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7703124Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7703168Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7703225Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7703709Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7703808Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7703857Z graph_break [] 2025-12-04T12:10:21.7703918Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7703991Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7704473Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7704531Z current_size = base.storage().size() 2025-12-04T12:10:21.7704572Z Autotune Choices Stats: 2025-12-04T12:10:21.7704940Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.7704987Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7705045Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7705145Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7705384Z triton_mm_2 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7705612Z triton_mm_8 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7705835Z triton_mm_6 0.0071 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7706058Z triton_mm_7 0.0074 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7706280Z triton_mm_4 0.0079 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7706513Z triton_mm_9 0.0080 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7706735Z triton_mm_3 0.0085 ms 78.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7706958Z triton_mm_5 0.0092 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7707180Z triton_mm_1 0.0098 ms 68.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7707404Z triton_mm_0 0.0118 ms 56.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7707531Z SingleProcess AUTOTUNE benchmarking takes 0.0533 seconds and 0.2231 seconds precompiling for 11 choices 2025-12-04T12:10:21.7707687Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7707733Z Traceback 
(most recent call last): 2025-12-04T12:10:21.7707890Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7707944Z method(*args, **kwargs) 2025-12-04T12:10:21.7708094Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7708137Z method(*args, **kwargs) 2025-12-04T12:10:21.7708287Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7708325Z with policy(): 2025-12-04T12:10:21.7708478Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7708521Z raise RuntimeError(msg) 2025-12-04T12:10:21.7708919Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1075838976. 
2025-12-04T12:10:21.7708923Z 2025-12-04T12:10:21.7708997Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7709256Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7709258Z 2025-12-04T12:10:21.7709345Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7709417Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7709463Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7709519Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7710010Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7710155Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7710192Z graph_break [] 2025-12-04T12:10:21.7710254Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7710340Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7710825Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7710871Z current_size = base.storage().size() 2025-12-04T12:10:21.7710914Z Autotune Choices Stats: 2025-12-04T12:10:21.7711279Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.7711326Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7711369Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7711467Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7711715Z triton_mm_2 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7711944Z triton_mm_8 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7712181Z triton_mm_6 0.0071 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7712402Z triton_mm_7 0.0074 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7712636Z triton_mm_4 0.0079 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7712861Z triton_mm_9 0.0080 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7713081Z triton_mm_3 0.0085 ms 78.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7713302Z triton_mm_5 0.0092 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7713522Z triton_mm_1 0.0098 ms 68.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7713742Z triton_mm_0 0.0118 ms 56.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7713870Z SingleProcess AUTOTUNE benchmarking takes 0.0533 seconds and 0.2231 seconds precompiling for 11 choices 2025-12-04T12:10:21.7713944Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7713987Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7714054Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7714154Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7714634Z inductor [('triton_bundler_save_kernel', 88), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('async_compile_cache_miss', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7714673Z graph_break [] 2025-12-04T12:10:21.7714734Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7714806Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7714847Z Autotune Choices Stats: 2025-12-04T12:10:21.7715209Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_19", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.7715266Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7715309Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7715407Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7715650Z triton_mm_19 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7715878Z triton_mm_12 0.0063 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7716102Z triton_mm_18 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7716340Z triton_mm_14 0.0070 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7716563Z triton_mm_13 0.0073 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7716786Z triton_mm_15 0.0078 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7717010Z triton_mm_11 0.0079 ms 79.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7717234Z triton_mm_16 0.0094 ms 66.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7717460Z triton_mm_17 0.0107 ms 58.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7717691Z triton_mm_10 0.0118 ms 52.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7717819Z SingleProcess AUTOTUNE benchmarking takes 0.0573 seconds and 0.3653 seconds precompiling for 11 choices 
2025-12-04T12:10:21.7717872Z =================================== FAILURES ===================================
2025-12-04T12:10:21.7718021Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda _
2025-12-04T12:10:21.7718067Z Traceback (most recent call last):
2025-12-04T12:10:21.7718224Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper
2025-12-04T12:10:21.7718268Z method(*args, **kwargs)
2025-12-04T12:10:21.7718420Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper
2025-12-04T12:10:21.7718462Z method(*args, **kwargs)
2025-12-04T12:10:21.7718612Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper
2025-12-04T12:10:21.7718651Z with policy():
2025-12-04T12:10:21.7718802Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__
2025-12-04T12:10:21.7718857Z raise RuntimeError(msg)
2025-12-04T12:10:21.7719246Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1113587712.
2025-12-04T12:10:21.7719266Z 2025-12-04T12:10:21.7719339Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7719597Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7719600Z 2025-12-04T12:10:21.7719688Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7719759Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7719804Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7719859Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7720395Z inductor [('triton_bundler_save_kernel', 88), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7720497Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7720534Z graph_break [] 2025-12-04T12:10:21.7720596Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7720669Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7721156Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7721203Z current_size = base.storage().size() 2025-12-04T12:10:21.7721243Z Autotune Choices Stats: 2025-12-04T12:10:21.7721623Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.7721669Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7721711Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7721811Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7722045Z triton_mm_2 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7722270Z triton_mm_8 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7722493Z triton_mm_6 0.0071 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7722714Z triton_mm_7 0.0074 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7722954Z triton_mm_4 0.0079 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7723192Z triton_mm_9 0.0080 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7723412Z triton_mm_3 0.0085 ms 78.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7723635Z triton_mm_5 0.0092 ms 72.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7723864Z triton_mm_1 0.0098 ms 68.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7724088Z triton_mm_0 0.0118 ms 56.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7724216Z SingleProcess AUTOTUNE benchmarking takes 0.0533 seconds and 0.2231 seconds precompiling for 11 choices 2025-12-04T12:10:21.7724289Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7724333Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7724390Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7724491Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7724973Z inductor [('triton_bundler_save_kernel', 88), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('async_compile_cache_miss', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7725011Z graph_break [] 2025-12-04T12:10:21.7725071Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7725143Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7725192Z Autotune Choices Stats: 2025-12-04T12:10:21.7725555Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_19", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.7725601Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7725645Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7725745Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7725976Z triton_mm_19 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7726206Z triton_mm_12 0.0063 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7726429Z triton_mm_18 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7726664Z triton_mm_14 0.0070 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7726896Z triton_mm_13 0.0073 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7727122Z triton_mm_15 0.0078 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7727345Z triton_mm_11 0.0079 ms 79.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7727580Z triton_mm_16 0.0094 ms 66.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7727806Z triton_mm_17 0.0107 ms 58.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7728028Z triton_mm_10 0.0118 ms 52.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7728156Z SingleProcess AUTOTUNE benchmarking takes 0.0573 seconds and 0.3653 seconds precompiling for 11 choices 
2025-12-04T12:10:21.7728227Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7728272Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7728327Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7728427Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7728919Z inductor [('triton_bundler_save_kernel', 88), ('async_compile_cache_miss', 12), ('benchmarking.InductorBenchmarker.benchmark_gpu', 11), ('generated_module_cache_miss', 10), ('select_algorithm_num_precompiles', 10), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7728957Z graph_break [] 2025-12-04T12:10:21.7729017Z aten_mm_info [('aten._scaled_mm.default_33_16_1024', 1)] 2025-12-04T12:10:21.7729091Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7729133Z Autotune Choices Stats: 2025-12-04T12:10:21.7729493Z {"num_choices": 11, "num_triton_choices": 10, "best_kernel": "triton_mm_26", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.7729540Z AUTOTUNE scaled_mm(33x1024, 1024x16, , ) 2025-12-04T12:10:21.7729580Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7729679Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7729909Z triton_mm_26 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 
2025-12-04T12:10:21.7729964Z _scaled_mm 0.0068 ms 95.9% 2025-12-04T12:10:21.7730216Z triton_mm_24 0.0069 ms 93.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7730456Z triton_mm_28 0.0072 ms 89.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7730680Z triton_mm_27 0.0074 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7730907Z triton_mm_22 0.0075 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7731143Z triton_mm_25 0.0076 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7731372Z triton_mm_29 0.0077 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7731596Z triton_mm_21 0.0081 ms 80.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7731819Z triton_mm_23 0.0082 ms 79.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7731946Z SingleProcess AUTOTUNE benchmarking takes 0.0718 seconds and 0.4025 seconds precompiling for 11 choices 2025-12-04T12:10:21.7732134Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-aa22ab62148368dd.xml - 2025-12-04T12:10:21.7732194Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7732792Z FAILED [0.9652s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1075838976 and is now 1113587712. 2025-12-04T12:10:21.7732796Z 2025-12-04T12:10:21.7732869Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7733129Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7733132Z 2025-12-04T12:10:21.7733220Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7733281Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7733351Z ================= 1 failed, 187 deselected, 2 rerun in 48.98s ==================
2025-12-04T12:10:21.7733389Z Got exit code 1
2025-12-04T12:10:21.7733596Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda
2025-12-04T12:10:21.7733721Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set
2025-12-04T12:10:21.7733883Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-8d72081be42bc079.xml
2025-12-04T12:10:21.7733941Z ============================= test session starts ==============================
2025-12-04T12:10:21.7734055Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python
2025-12-04T12:10:21.7734107Z cachedir: .pytest_cache
2025-12-04T12:10:21.7734264Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow]
2025-12-04T12:10:21.7734310Z rootdir: /var/lib/jenkins/pytorch
2025-12-04T12:10:21.7734351Z configfile: pytest.ini
2025-12-04T12:10:21.7734515Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0
2025-12-04T12:10:21.7734592Z collecting ... collected 188 items / 143 deselected / 45 selected
2025-12-04T12:10:21.7734647Z stepcurrent: skipping 143 already run items.
2025-12-04T12:10:21.7734691Z Running 45 items in this shard 2025-12-04T12:10:21.7734693Z 2025-12-04T12:10:21.7734925Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [3.2104s] [ 2%] 2025-12-04T12:10:21.7735143Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9359s] [ 2%] 2025-12-04T12:10:21.7735336Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.6063s] [ 2%] 2025-12-04T12:10:21.7735338Z 2025-12-04T12:10:21.7735390Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7735539Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7735585Z Traceback (most recent call last): 2025-12-04T12:10:21.7735747Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7735790Z method(*args, **kwargs) 2025-12-04T12:10:21.7735945Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7735985Z method(*args, **kwargs) 2025-12-04T12:10:21.7736136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7736173Z with policy(): 2025-12-04T12:10:21.7736336Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7736380Z raise RuntimeError(msg) 2025-12-04T12:10:21.7736771Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1088421888. 2025-12-04T12:10:21.7736775Z 2025-12-04T12:10:21.7736849Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7737111Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7737114Z 2025-12-04T12:10:21.7737200Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7737272Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7737318Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7737373Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7737868Z inductor [('triton_bundler_save_kernel', 280), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7737980Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7738016Z graph_break [] 2025-12-04T12:10:21.7738081Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7738153Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7738645Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7738703Z current_size = base.storage().size() 2025-12-04T12:10:21.7738747Z Autotune Choices Stats: 2025-12-04T12:10:21.7739115Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.7739165Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7739207Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7739306Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7739542Z triton_mm_31 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7739768Z triton_mm_19 0.0070 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7739998Z triton_mm_32 0.0071 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7740270Z triton_mm_27 0.0076 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:21.7740498Z triton_mm_8 0.0081 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7740723Z triton_mm_20 0.0082 ms 78.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7740947Z triton_mm_13 0.0085 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7741173Z triton_mm_14 0.0088 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7741394Z triton_mm_23 0.0088 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7741632Z triton_mm_11 0.0094 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7741773Z SingleProcess AUTOTUNE benchmarking takes 0.1508 seconds and 0.7221 seconds precompiling for 35 choices 2025-12-04T12:10:21.7741924Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7741971Z Traceback (most recent call last): 2025-12-04T12:10:21.7742127Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7742169Z method(*args, **kwargs) 2025-12-04T12:10:21.7742321Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7742363Z method(*args, **kwargs) 2025-12-04T12:10:21.7742525Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7742563Z with policy(): 2025-12-04T12:10:21.7742716Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7742760Z raise RuntimeError(msg) 2025-12-04T12:10:21.7743157Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1088421888 and is now 1176502272. 
2025-12-04T12:10:21.7743160Z 2025-12-04T12:10:21.7743233Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7743494Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7743496Z 2025-12-04T12:10:21.7743583Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7743656Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7743705Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7743762Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7744260Z inductor [('triton_bundler_save_kernel', 280), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7744361Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7744398Z graph_break [] 2025-12-04T12:10:21.7744462Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7744536Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7745019Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7745065Z current_size = base.storage().size() 2025-12-04T12:10:21.7745107Z Autotune Choices Stats: 2025-12-04T12:10:21.7745476Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.7748243Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7748288Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7748386Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7748621Z triton_mm_31 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7748845Z triton_mm_19 0.0070 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7749087Z triton_mm_32 0.0071 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7749310Z triton_mm_27 0.0076 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7749535Z triton_mm_8 0.0081 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, 
BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7749761Z triton_mm_20 0.0082 ms 78.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7749988Z triton_mm_13 0.0085 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7750265Z triton_mm_14 0.0088 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7750511Z triton_mm_23 0.0088 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7750734Z triton_mm_11 0.0094 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7750864Z SingleProcess AUTOTUNE benchmarking takes 0.1508 seconds and 0.7221 seconds precompiling for 35 choices 2025-12-04T12:10:21.7750936Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7750985Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7751043Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7751141Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7751625Z inductor [('triton_bundler_save_kernel', 280), 
('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7751679Z graph_break [] 2025-12-04T12:10:21.7751742Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7751821Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7751863Z Autotune Choices Stats: 2025-12-04T12:10:21.7752225Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_65", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007400000002235174, "best_triton_pos": 0} 2025-12-04T12:10:21.7752286Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7752331Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7752429Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7752662Z triton_mm_65 0.0074 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7752719Z _scaled_mm 0.0075 ms 98.9% 2025-12-04T12:10:21.7752947Z triton_mm_66 0.0076 ms 97.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7753172Z triton_mm_53 0.0076 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7753396Z triton_mm_61 0.0076 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7753624Z triton_mm_42 0.0083 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7753846Z triton_mm_47 0.0086 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7754068Z triton_mm_54 0.0088 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7754300Z triton_mm_49 0.0090 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7754525Z triton_mm_57 0.0092 ms 80.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7754654Z SingleProcess AUTOTUNE benchmarking takes 0.2479 seconds and 0.5137 seconds precompiling for 35 choices 2025-12-04T12:10:21.7754708Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7754856Z _ 
TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7754903Z Traceback (most recent call last): 2025-12-04T12:10:21.7755060Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7755102Z method(*args, **kwargs) 2025-12-04T12:10:21.7755256Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7755307Z method(*args, **kwargs) 2025-12-04T12:10:21.7755459Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7755501Z with policy(): 2025-12-04T12:10:21.7755653Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7755710Z raise RuntimeError(msg) 2025-12-04T12:10:21.7756104Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1176502272 and is now 1264582656. 
2025-12-04T12:10:21.7756106Z 2025-12-04T12:10:21.7756180Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7756441Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7756460Z 2025-12-04T12:10:21.7756548Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7756621Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7756667Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7756723Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7757210Z inductor [('triton_bundler_save_kernel', 280), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7757309Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7757345Z graph_break [] 2025-12-04T12:10:21.7757409Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7757481Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7757964Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7758024Z current_size = base.storage().size() 2025-12-04T12:10:21.7758065Z Autotune Choices Stats: 2025-12-04T12:10:21.7758429Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.7758478Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7758522Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7758621Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7758855Z triton_mm_31 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7759082Z triton_mm_19 0.0070 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7759320Z triton_mm_32 0.0071 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7759541Z triton_mm_27 0.0076 ms 85.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7759776Z triton_mm_8 0.0081 ms 79.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, 
BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7759999Z triton_mm_20 0.0082 ms 78.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7760269Z triton_mm_13 0.0085 ms 76.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7760493Z triton_mm_14 0.0088 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7760715Z triton_mm_23 0.0088 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7760938Z triton_mm_11 0.0094 ms 69.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7761066Z SingleProcess AUTOTUNE benchmarking takes 0.1508 seconds and 0.7221 seconds precompiling for 35 choices 2025-12-04T12:10:21.7761141Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7761187Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7761243Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7761344Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7761836Z inductor [('triton_bundler_save_kernel', 280), 
('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7761876Z graph_break [] 2025-12-04T12:10:21.7761940Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7762012Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7762055Z Autotune Choices Stats: 2025-12-04T12:10:21.7762418Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_65", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007400000002235174, "best_triton_pos": 0} 2025-12-04T12:10:21.7762467Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7762511Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7762609Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7762843Z triton_mm_65 0.0074 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7762899Z _scaled_mm 0.0075 ms 98.9% 2025-12-04T12:10:21.7763127Z triton_mm_66 0.0076 ms 97.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7763364Z triton_mm_53 0.0076 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7763588Z triton_mm_61 0.0076 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7763814Z triton_mm_42 0.0083 ms 89.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7764053Z triton_mm_47 0.0086 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7764276Z triton_mm_54 0.0088 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7764499Z triton_mm_49 0.0090 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7764719Z triton_mm_57 0.0092 ms 80.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7764848Z SingleProcess AUTOTUNE benchmarking takes 0.2479 seconds and 0.5137 seconds precompiling for 35 choices 2025-12-04T12:10:21.7764919Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7764963Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7765020Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7765119Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7765611Z inductor [('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7765652Z graph_break [] 2025-12-04T12:10:21.7765714Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7765787Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7765830Z Autotune Choices Stats: 2025-12-04T12:10:21.7766189Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_99", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:21.7766238Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7766281Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7766383Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7766625Z triton_mm_99 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7766848Z triton_mm_87 0.0075 ms 93.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7767086Z triton_mm_100 0.0076 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7767310Z triton_mm_81 0.0082 ms 86.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7767353Z _scaled_mm 0.0082 ms 85.9% 2025-12-04T12:10:21.7767585Z triton_mm_95 0.0082 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7767812Z triton_mm_76 0.0085 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7768036Z triton_mm_88 0.0088 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7768259Z triton_mm_83 0.0090 ms 77.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7768481Z triton_mm_96 0.0092 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7768609Z SingleProcess AUTOTUNE benchmarking takes 0.2217 seconds 
and 0.1959 seconds precompiling for 35 choices 2025-12-04T12:10:21.7768799Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-8d72081be42bc079.xml - 2025-12-04T12:10:21.7768858Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7769460Z FAILED [1.6063s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1176502272 and is now 1264582656. 2025-12-04T12:10:21.7769464Z 2025-12-04T12:10:21.7769538Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7769802Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7769804Z 2025-12-04T12:10:21.7769890Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7769952Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7770021Z ================== 1 failed, 143 deselected, 2 rerun in 6.77s ================== 2025-12-04T12:10:21.7770063Z Got exit code 1 2025-12-04T12:10:21.7770152Z Retrying single test... 
2025-12-04T12:10:21.7770296Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-557a2b5e72b7e8b6.xml 2025-12-04T12:10:21.7770356Z ============================= test session starts ============================== 2025-12-04T12:10:21.7770467Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7770526Z cachedir: .pytest_cache 2025-12-04T12:10:21.7770683Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7770731Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7770773Z configfile: pytest.ini 2025-12-04T12:10:21.7770937Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7771011Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7771268Z stepcurrent: skipping 143 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7771323Z Running 1 items in this shard 2025-12-04T12:10:21.7771325Z 2025-12-04T12:10:21.7771545Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [3.2096s] [100%] 2025-12-04T12:10:21.7771764Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9722s] [100%] 2025-12-04T12:10:21.7771958Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.0939s] [100%] 2025-12-04T12:10:21.7771960Z 2025-12-04T12:10:21.7772014Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7772161Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7772211Z Traceback (most recent call last): 2025-12-04T12:10:21.7772367Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7772413Z method(*args, **kwargs) 2025-12-04T12:10:21.7772565Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7772607Z method(*args, **kwargs) 2025-12-04T12:10:21.7772757Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7772807Z with policy(): 2025-12-04T12:10:21.7772959Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7773003Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7773393Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1088421888. 2025-12-04T12:10:21.7773398Z 2025-12-04T12:10:21.7773470Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7773732Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7773733Z 2025-12-04T12:10:21.7773820Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7773894Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7773951Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7774008Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7774491Z inductor [('triton_bundler_save_kernel', 280), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7774604Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7774640Z graph_break [] 2025-12-04T12:10:21.7774703Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7774777Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7775271Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7775320Z current_size = base.storage().size() 2025-12-04T12:10:21.7775361Z Autotune Choices Stats: 2025-12-04T12:10:21.7775728Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_32", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:21.7775776Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7775819Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7775917Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7776154Z triton_mm_32 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7776380Z triton_mm_19 0.0071 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7776606Z triton_mm_31 0.0072 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7776842Z triton_mm_8 0.0080 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7777069Z triton_mm_13 0.0084 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7777292Z triton_mm_20 0.0084 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7777514Z triton_mm_15 0.0086 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7777738Z triton_mm_14 0.0087 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7777975Z triton_mm_23 0.0088 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7778197Z triton_mm_28 0.0088 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7778342Z SingleProcess AUTOTUNE benchmarking takes 0.1544 seconds and 0.6954 seconds precompiling for 35 choices 2025-12-04T12:10:21.7778488Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7778538Z 
Traceback (most recent call last): 2025-12-04T12:10:21.7778692Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7778738Z method(*args, **kwargs) 2025-12-04T12:10:21.7778899Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7778941Z method(*args, **kwargs) 2025-12-04T12:10:21.7779091Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7779130Z with policy(): 2025-12-04T12:10:21.7779281Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7779325Z raise RuntimeError(msg) 2025-12-04T12:10:21.7779717Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1088421888 and is now 1176502272. 
2025-12-04T12:10:21.7779720Z 2025-12-04T12:10:21.7779792Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7780054Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7780057Z 2025-12-04T12:10:21.7780176Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7780249Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7780293Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7780350Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7780847Z inductor [('triton_bundler_save_kernel', 280), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7780949Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7780986Z graph_break [] 2025-12-04T12:10:21.7781049Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7781123Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7781607Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7781653Z current_size = base.storage().size() 2025-12-04T12:10:21.7781707Z Autotune Choices Stats: 2025-12-04T12:10:21.7782074Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_32", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:21.7782135Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7782179Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7782278Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7782512Z triton_mm_32 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7782738Z triton_mm_19 0.0071 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7782978Z triton_mm_31 0.0072 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7783206Z triton_mm_8 0.0080 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7783430Z triton_mm_13 0.0084 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7783656Z triton_mm_20 0.0084 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7783880Z triton_mm_15 0.0086 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7784101Z triton_mm_14 0.0087 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7784334Z triton_mm_23 0.0088 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7784559Z triton_mm_28 0.0088 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7784688Z SingleProcess AUTOTUNE benchmarking takes 0.1544 seconds and 0.6954 seconds precompiling for 35 choices 2025-12-04T12:10:21.7784761Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7784807Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7784863Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7784963Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7785450Z inductor [('triton_bundler_save_kernel', 280), 
('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7785496Z graph_break [] 2025-12-04T12:10:21.7785561Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7785635Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7785686Z Autotune Choices Stats: 2025-12-04T12:10:21.7786049Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_66", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007000000216066837, "best_triton_pos": 0} 2025-12-04T12:10:21.7786098Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7786141Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7786239Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7786480Z triton_mm_66 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7786708Z triton_mm_65 0.0076 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7786931Z triton_mm_53 0.0080 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7787154Z triton_mm_61 0.0080 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7787198Z _scaled_mm 0.0082 ms 85.8% 2025-12-04T12:10:21.7787425Z triton_mm_42 0.0085 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7787648Z triton_mm_54 0.0088 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7787881Z triton_mm_49 0.0088 ms 79.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7788104Z triton_mm_48 0.0094 ms 74.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7788328Z triton_mm_62 0.0095 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7788456Z SingleProcess AUTOTUNE benchmarking takes 0.2705 seconds and 0.4807 seconds precompiling for 35 choices 2025-12-04T12:10:21.7788511Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7788657Z _ 
TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7788704Z Traceback (most recent call last): 2025-12-04T12:10:21.7788859Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7788904Z method(*args, **kwargs) 2025-12-04T12:10:21.7789066Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7789109Z method(*args, **kwargs) 2025-12-04T12:10:21.7789258Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7789307Z with policy(): 2025-12-04T12:10:21.7789458Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7789501Z raise RuntimeError(msg) 2025-12-04T12:10:21.7789894Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1176502272 and is now 1264582656. 
2025-12-04T12:10:21.7789898Z 2025-12-04T12:10:21.7789971Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7790298Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7790301Z 2025-12-04T12:10:21.7790388Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7790462Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7790506Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7790562Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7791043Z inductor [('triton_bundler_save_kernel', 280), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7791143Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7791181Z graph_break [] 2025-12-04T12:10:21.7791243Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7791316Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7791811Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7791858Z current_size = base.storage().size() 2025-12-04T12:10:21.7791898Z Autotune Choices Stats: 2025-12-04T12:10:21.7792268Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_32", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:21.7792317Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7792363Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7792460Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7792696Z triton_mm_32 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7792921Z triton_mm_19 0.0071 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7793172Z triton_mm_31 0.0072 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7793410Z triton_mm_8 0.0080 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7793632Z triton_mm_13 0.0084 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7793856Z triton_mm_20 0.0084 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7794086Z triton_mm_15 0.0086 ms 81.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7794309Z triton_mm_14 0.0087 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7794534Z triton_mm_23 0.0088 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7794761Z triton_mm_28 0.0088 ms 80.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7794893Z SingleProcess AUTOTUNE benchmarking takes 0.1544 seconds and 0.6954 seconds precompiling for 35 choices 2025-12-04T12:10:21.7794965Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7795012Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7795068Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7795171Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7795661Z inductor [('triton_bundler_save_kernel', 280), 
('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7795702Z graph_break [] 2025-12-04T12:10:21.7795767Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7795839Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7795882Z Autotune Choices Stats: 2025-12-04T12:10:21.7796241Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_66", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007000000216066837, "best_triton_pos": 0} 2025-12-04T12:10:21.7796289Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7796334Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7796433Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7796678Z triton_mm_66 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7796913Z triton_mm_65 0.0076 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7797149Z triton_mm_53 0.0080 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7797379Z triton_mm_61 0.0080 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7797425Z _scaled_mm 0.0082 ms 85.8% 2025-12-04T12:10:21.7797662Z triton_mm_42 0.0085 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7797887Z triton_mm_54 0.0088 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7798110Z triton_mm_49 0.0088 ms 79.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7798334Z triton_mm_48 0.0094 ms 74.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7798558Z triton_mm_62 0.0095 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7798684Z SingleProcess AUTOTUNE benchmarking takes 0.2705 seconds and 0.4807 seconds precompiling for 35 choices 2025-12-04T12:10:21.7798759Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7798802Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7798859Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7798958Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7799450Z inductor [('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7799490Z graph_break [] 2025-12-04T12:10:21.7799553Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7799625Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7799668Z Autotune Choices Stats: 2025-12-04T12:10:21.7800033Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_100", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:21.7800084Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7800170Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7800283Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7800518Z triton_mm_100 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7800759Z triton_mm_99 0.0071 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7800984Z triton_mm_95 0.0075 ms 93.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7801208Z triton_mm_87 0.0076 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7801444Z triton_mm_88 0.0078 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7801671Z triton_mm_76 0.0080 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7801714Z _scaled_mm 0.0081 ms 87.1% 2025-12-04T12:10:21.7801937Z triton_mm_81 0.0083 ms 85.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7802158Z triton_mm_83 0.0086 ms 81.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7802380Z triton_mm_82 0.0087 ms 80.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7802507Z SingleProcess AUTOTUNE benchmarking takes 0.2224 seconds and 
0.1959 seconds precompiling for 35 choices 2025-12-04T12:10:21.7802697Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-557a2b5e72b7e8b6.xml - 2025-12-04T12:10:21.7802768Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7803365Z FAILED [1.0939s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1176502272 and is now 1264582656. 2025-12-04T12:10:21.7803369Z 2025-12-04T12:10:21.7803443Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7803705Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7803708Z 2025-12-04T12:10:21.7803798Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7803860Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7803928Z ================== 1 failed, 187 deselected, 2 rerun in 6.30s ================== 2025-12-04T12:10:21.7803975Z Got exit code 1 2025-12-04T12:10:21.7804015Z Retrying single test... 
2025-12-04T12:10:21.7804158Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-9503605d52f27902.xml 2025-12-04T12:10:21.7804217Z ============================= test session starts ============================== 2025-12-04T12:10:21.7804340Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7804382Z cachedir: .pytest_cache 2025-12-04T12:10:21.7804541Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7804588Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7804630Z configfile: pytest.ini 2025-12-04T12:10:21.7804793Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7804869Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7805134Z stepcurrent: skipping 143 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7805182Z Running 1 items in this shard 2025-12-04T12:10:21.7805185Z 2025-12-04T12:10:21.7805402Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [3.2639s] [100%] 2025-12-04T12:10:21.7805618Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.3467s] [100%] 2025-12-04T12:10:21.7805810Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda FAILED [0.9762s] [100%] 2025-12-04T12:10:21.7805813Z 2025-12-04T12:10:21.7805864Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7806011Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7806059Z Traceback (most recent call last): 2025-12-04T12:10:21.7806217Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7806259Z method(*args, **kwargs) 2025-12-04T12:10:21.7806410Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7806453Z method(*args, **kwargs) 2025-12-04T12:10:21.7806627Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7806665Z with policy(): 2025-12-04T12:10:21.7806816Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7806860Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7807251Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1088421888. 2025-12-04T12:10:21.7807254Z 2025-12-04T12:10:21.7807328Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7807590Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7807592Z 2025-12-04T12:10:21.7807679Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7807763Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7807812Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7807870Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7808362Z inductor [('triton_bundler_save_kernel', 280), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7808471Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7808510Z graph_break [] 2025-12-04T12:10:21.7808572Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7808648Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7809138Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7809191Z current_size = base.storage().size() 2025-12-04T12:10:21.7809236Z Autotune Choices Stats: 2025-12-04T12:10:21.7809605Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:21.7809653Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7809699Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7809798Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7810033Z triton_mm_31 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7810292Z triton_mm_19 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7810537Z triton_mm_32 0.0075 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7810764Z triton_mm_27 0.0076 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7810992Z triton_mm_8 0.0083 ms 74.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7811214Z triton_mm_20 0.0084 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7811436Z triton_mm_15 0.0086 ms 72.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7811656Z triton_mm_14 0.0088 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7811895Z triton_mm_11 0.0093 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7812129Z triton_mm_23 0.0094 ms 66.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7812257Z SingleProcess AUTOTUNE benchmarking takes 0.1521 seconds and 0.7300 seconds precompiling for 35 choices 2025-12-04T12:10:21.7812407Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7812454Z 
Traceback (most recent call last): 2025-12-04T12:10:21.7812612Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7812667Z method(*args, **kwargs) 2025-12-04T12:10:21.7812822Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7812865Z method(*args, **kwargs) 2025-12-04T12:10:21.7813018Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7813055Z with policy(): 2025-12-04T12:10:21.7813207Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7813252Z raise RuntimeError(msg) 2025-12-04T12:10:21.7813645Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1088421888 and is now 1176502272. 
2025-12-04T12:10:21.7813648Z 2025-12-04T12:10:21.7813721Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7813983Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7813986Z 2025-12-04T12:10:21.7814073Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7814145Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7814204Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7814261Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7814745Z inductor [('triton_bundler_save_kernel', 280), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7814843Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7814882Z graph_break [] 2025-12-04T12:10:21.7814945Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7815021Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7815502Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7815559Z current_size = base.storage().size() 2025-12-04T12:10:21.7815604Z Autotune Choices Stats: 2025-12-04T12:10:21.7815969Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:21.7816027Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7816074Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7816175Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7816408Z triton_mm_31 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7816643Z triton_mm_19 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7816868Z triton_mm_32 0.0075 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7817093Z triton_mm_27 0.0076 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7817319Z triton_mm_8 0.0083 ms 74.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, 
BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7817542Z triton_mm_20 0.0084 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7817768Z triton_mm_15 0.0086 ms 72.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7818001Z triton_mm_14 0.0088 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7818228Z triton_mm_11 0.0093 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7818454Z triton_mm_23 0.0094 ms 66.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7818582Z SingleProcess AUTOTUNE benchmarking takes 0.1521 seconds and 0.7300 seconds precompiling for 35 choices 2025-12-04T12:10:21.7818656Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7818704Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7818760Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7818861Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7819353Z inductor [('triton_bundler_save_kernel', 280), 
('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7819399Z graph_break [] 2025-12-04T12:10:21.7819461Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7819545Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7819587Z Autotune Choices Stats: 2025-12-04T12:10:21.7819950Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_66", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0069599999114871025, "best_triton_pos": 0} 2025-12-04T12:10:21.7819998Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7820044Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7820174Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7820426Z triton_mm_66 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7820651Z triton_mm_53 0.0073 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7820876Z triton_mm_61 0.0078 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7821102Z triton_mm_42 0.0080 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7821328Z triton_mm_54 0.0081 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7821561Z triton_mm_65 0.0085 ms 81.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7821606Z _scaled_mm 0.0086 ms 81.3% 2025-12-04T12:10:21.7821841Z triton_mm_49 0.0086 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7822064Z triton_mm_62 0.0086 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7822289Z triton_mm_48 0.0092 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7822418Z SingleProcess AUTOTUNE benchmarking takes 0.2359 seconds and 0.4649 seconds precompiling for 35 choices 2025-12-04T12:10:21.7822472Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7822618Z _ 
TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7822666Z Traceback (most recent call last): 2025-12-04T12:10:21.7822821Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7822877Z method(*args, **kwargs) 2025-12-04T12:10:21.7823029Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7823070Z method(*args, **kwargs) 2025-12-04T12:10:21.7823234Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7823271Z with policy(): 2025-12-04T12:10:21.7823422Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7823465Z raise RuntimeError(msg) 2025-12-04T12:10:21.7823863Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1176502272 and is now 1264582656. 
2025-12-04T12:10:21.7823866Z 2025-12-04T12:10:21.7823948Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7824210Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7824213Z 2025-12-04T12:10:21.7824301Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7824373Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7824420Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7824476Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7824960Z inductor [('triton_bundler_save_kernel', 280), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('async_compile_cache_miss', 2), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7825059Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7825098Z graph_break [] 2025-12-04T12:10:21.7825159Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7825233Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7825724Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7825774Z current_size = base.storage().size() 2025-12-04T12:10:21.7825815Z Autotune Choices Stats: 2025-12-04T12:10:21.7826186Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_31", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:21.7826236Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7826278Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7826378Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7826612Z triton_mm_31 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7826848Z triton_mm_19 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7827074Z triton_mm_32 0.0075 ms 82.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7827307Z triton_mm_27 0.0076 ms 81.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7827534Z triton_mm_8 0.0083 ms 74.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, 
BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7827766Z triton_mm_20 0.0084 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7827989Z triton_mm_15 0.0086 ms 72.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7828215Z triton_mm_14 0.0088 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7828438Z triton_mm_11 0.0093 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7828662Z triton_mm_23 0.0094 ms 66.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7828788Z SingleProcess AUTOTUNE benchmarking takes 0.1521 seconds and 0.7300 seconds precompiling for 35 choices 2025-12-04T12:10:21.7828862Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7828905Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7828962Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7829060Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7829554Z inductor [('triton_bundler_save_kernel', 280), 
('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7829593Z graph_break [] 2025-12-04T12:10:21.7829656Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7829727Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7829770Z Autotune Choices Stats: 2025-12-04T12:10:21.7830176Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_66", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0069599999114871025, "best_triton_pos": 0} 2025-12-04T12:10:21.7830225Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7830269Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7830380Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7830618Z triton_mm_66 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7830854Z triton_mm_53 0.0073 ms 95.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7831079Z triton_mm_61 0.0078 ms 89.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7831308Z triton_mm_42 0.0080 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7831552Z triton_mm_54 0.0081 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7831780Z triton_mm_65 0.0085 ms 81.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7831824Z _scaled_mm 0.0086 ms 81.3% 2025-12-04T12:10:21.7832047Z triton_mm_49 0.0086 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7832268Z triton_mm_62 0.0086 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7832496Z triton_mm_48 0.0092 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7832623Z SingleProcess AUTOTUNE benchmarking takes 0.2359 seconds and 0.4649 seconds precompiling for 35 choices 2025-12-04T12:10:21.7832695Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7832741Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7832805Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7832922Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7833411Z inductor [('triton_bundler_save_kernel', 280), ('async_compile_cache_miss', 36), ('benchmarking.InductorBenchmarker.benchmark_gpu', 35), ('generated_module_cache_miss', 34), ('select_algorithm_num_precompiles', 34), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7833450Z graph_break [] 2025-12-04T12:10:21.7833513Z aten_mm_info [('aten._scaled_mm.default_33_2048_1024', 1)] 2025-12-04T12:10:21.7833589Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7833629Z Autotune Choices Stats: 2025-12-04T12:10:21.7833993Z {"num_choices": 35, "num_triton_choices": 34, "best_kernel": "triton_mm_87", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.007040000054985285, "best_triton_pos": 0} 2025-12-04T12:10:21.7834049Z AUTOTUNE scaled_mm(33x1024, 1024x2048, , ) 2025-12-04T12:10:21.7834094Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.7834194Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7834428Z triton_mm_87 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7834685Z triton_mm_100 0.0072 ms 97.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7834912Z triton_mm_95 0.0076 ms 93.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7835174Z triton_mm_99 0.0078 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7835404Z triton_mm_76 0.0080 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7835631Z triton_mm_83 0.0082 ms 86.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7835855Z triton_mm_88 0.0084 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7836083Z triton_mm_81 0.0085 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7836304Z triton_mm_91 0.0088 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7836527Z triton_mm_82 0.0090 ms 78.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=64, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7836668Z SingleProcess AUTOTUNE benchmarking takes 0.2112 seconds and 0.1919 seconds precompiling for 35 choices 2025-12-04T12:10:21.7836861Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-9503605d52f27902.xml - 2025-12-04T12:10:21.7836923Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7837517Z FAILED [0.9762s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1176502272 and is now 1264582656. 2025-12-04T12:10:21.7837520Z 2025-12-04T12:10:21.7837592Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7837856Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7837869Z 2025-12-04T12:10:21.7837956Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7838018Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7838084Z ================== 1 failed, 187 deselected, 2 rerun in 5.61s ================== 2025-12-04T12:10:21.7838140Z Got exit code 1 2025-12-04T12:10:21.7838347Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7838473Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7838616Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b8a919c93b056f41.xml 2025-12-04T12:10:21.7838673Z ============================= test session starts ============================== 2025-12-04T12:10:21.7838787Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7838829Z cachedir: .pytest_cache 2025-12-04T12:10:21.7838997Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7839047Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7839091Z configfile: pytest.ini 2025-12-04T12:10:21.7839254Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7839332Z collecting ... collected 188 items / 144 deselected / 44 selected 2025-12-04T12:10:21.7839386Z stepcurrent: skipping 144 already run items. 
2025-12-04T12:10:21.7839437Z Running 44 items in this shard 2025-12-04T12:10:21.7839440Z 2025-12-04T12:10:21.7839654Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [39.3513s] [ 2%] 2025-12-04T12:10:21.7839868Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3687s] [ 2%] 2025-12-04T12:10:21.7840053Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3222s] [ 2%] 2025-12-04T12:10:21.7840057Z 2025-12-04T12:10:21.7840146Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7840288Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7840339Z Traceback (most recent call last): 2025-12-04T12:10:21.7840516Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7840562Z method(*args, **kwargs) 2025-12-04T12:10:21.7840715Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7840759Z method(*args, **kwargs) 2025-12-04T12:10:21.7840911Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7840948Z with policy(): 2025-12-04T12:10:21.7841101Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7841147Z raise RuntimeError(msg) 2025-12-04T12:10:21.7841532Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.7841535Z 2025-12-04T12:10:21.7841607Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7846279Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7846285Z 2025-12-04T12:10:21.7846385Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7846502Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7846551Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7846609Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7846677Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7846780Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7846818Z graph_break [] 2025-12-04T12:10:21.7846879Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7847025Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7847072Z Traceback (most recent call last): 2025-12-04T12:10:21.7847247Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7847290Z method(*args, **kwargs) 2025-12-04T12:10:21.7847440Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7847482Z method(*args, **kwargs) 2025-12-04T12:10:21.7847630Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7847667Z with policy(): 2025-12-04T12:10:21.7847818Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7847858Z raise RuntimeError(msg) 2025-12-04T12:10:21.7848247Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.7848251Z 2025-12-04T12:10:21.7848325Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7848592Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7848595Z 2025-12-04T12:10:21.7848696Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7848771Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7848814Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7848871Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7848938Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7849037Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7849074Z graph_break [] 2025-12-04T12:10:21.7849133Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7849209Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7849251Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7849306Z 
stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7849402Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7849469Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7849506Z graph_break [] 2025-12-04T12:10:21.7849564Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7849632Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7849775Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7849821Z Traceback (most recent call last): 2025-12-04T12:10:21.7849974Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7850024Z method(*args, **kwargs) 2025-12-04T12:10:21.7850224Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7850263Z method(*args, **kwargs) 2025-12-04T12:10:21.7850414Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7850452Z with policy(): 2025-12-04T12:10:21.7850604Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7850646Z raise RuntimeError(msg) 2025-12-04T12:10:21.7851050Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.7851053Z 2025-12-04T12:10:21.7851127Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7851383Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7851386Z 2025-12-04T12:10:21.7851473Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7851545Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7851588Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7851643Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7851711Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7851807Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7851846Z graph_break [] 2025-12-04T12:10:21.7851904Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7851979Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7852020Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7852076Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7852186Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7852250Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7852286Z graph_break [] 2025-12-04T12:10:21.7852346Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7852418Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7852462Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7852518Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7852618Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7852681Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7852718Z graph_break [] 2025-12-04T12:10:21.7852775Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7852968Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b8a919c93b056f41.xml - 2025-12-04T12:10:21.7853028Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7853605Z FAILED [0.3222s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.7853644Z 2025-12-04T12:10:21.7853717Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7853971Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7853974Z 2025-12-04T12:10:21.7854060Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7854122Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7854193Z ================= 1 failed, 144 deselected, 2 rerun in 40.07s ================== 2025-12-04T12:10:21.7854229Z Got exit code 1 2025-12-04T12:10:21.7854284Z Retrying single test... 
2025-12-04T12:10:21.7854428Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-45bb837555643f48.xml 2025-12-04T12:10:21.7854487Z ============================= test session starts ============================== 2025-12-04T12:10:21.7854602Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7854642Z cachedir: .pytest_cache 2025-12-04T12:10:21.7854804Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7854850Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7854891Z configfile: pytest.ini 2025-12-04T12:10:21.7855056Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7855132Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7855386Z stepcurrent: skipping 144 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7855432Z Running 1 items in this shard 2025-12-04T12:10:21.7855434Z 2025-12-04T12:10:21.7855650Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [154.6896s] [100%] 2025-12-04T12:10:21.7855873Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [4.7884s] [100%] 2025-12-04T12:10:21.7856060Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda FAILED [4.7164s] [100%] 2025-12-04T12:10:21.7856063Z 2025-12-04T12:10:21.7856116Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7856258Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7856305Z Traceback (most recent call last): 2025-12-04T12:10:21.7856462Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7856503Z method(*args, **kwargs) 2025-12-04T12:10:21.7856656Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7856695Z method(*args, **kwargs) 2025-12-04T12:10:21.7856846Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7856882Z with policy(): 2025-12-04T12:10:21.7857047Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7857088Z raise RuntimeError(msg) 
2025-12-04T12:10:21.7857476Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.7857493Z 2025-12-04T12:10:21.7857566Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7857823Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7857825Z 2025-12-04T12:10:21.7857915Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7857988Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7858030Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7858094Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7858161Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7858260Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7858296Z graph_break [] 2025-12-04T12:10:21.7858355Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7858495Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7858539Z Traceback (most recent call last): 2025-12-04T12:10:21.7858692Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7858732Z method(*args, **kwargs) 2025-12-04T12:10:21.7858883Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.7858923Z method(*args, **kwargs) 2025-12-04T12:10:21.7859071Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7859109Z with policy(): 2025-12-04T12:10:21.7859261Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7859301Z raise RuntimeError(msg) 2025-12-04T12:10:21.7859698Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.7859702Z 2025-12-04T12:10:21.7859774Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7860031Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7860033Z 2025-12-04T12:10:21.7860157Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7860229Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7860272Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7860327Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7860392Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7860489Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7860526Z graph_break [] 2025-12-04T12:10:21.7860585Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7860674Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:21.7860715Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7860771Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7860865Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7860944Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7860980Z graph_break [] 2025-12-04T12:10:21.7861038Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7861091Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7861232Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7861277Z Traceback (most recent call last): 2025-12-04T12:10:21.7861432Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7861472Z method(*args, **kwargs) 2025-12-04T12:10:21.7861636Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7861675Z method(*args, **kwargs) 2025-12-04T12:10:21.7861826Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7861862Z with policy(): 2025-12-04T12:10:21.7862013Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7862053Z raise RuntimeError(msg) 2025-12-04T12:10:21.7862441Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.7862445Z 2025-12-04T12:10:21.7862517Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7862773Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7862776Z 2025-12-04T12:10:21.7862862Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7862933Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7862978Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7863032Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7863114Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7863210Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7863248Z graph_break [] 2025-12-04T12:10:21.7863307Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7863380Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7863422Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7863476Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7863572Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7863637Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7863673Z graph_break [] 2025-12-04T12:10:21.7863731Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7863803Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7863845Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7863899Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7863994Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7864066Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7864104Z graph_break [] 2025-12-04T12:10:21.7864161Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7864350Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-45bb837555643f48.xml - 2025-12-04T12:10:21.7864422Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7864994Z FAILED [4.7164s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.7864998Z 2025-12-04T12:10:21.7865070Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7865335Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7865339Z 2025-12-04T12:10:21.7865425Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7865486Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7865559Z ============ 1 failed, 187 deselected, 2 rerun in 164.21s (0:02:44) ============ 2025-12-04T12:10:21.7865598Z Got exit code 1 2025-12-04T12:10:21.7865639Z Retrying single test... 
2025-12-04T12:10:21.7865784Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3d957c0767c098b9.xml 2025-12-04T12:10:21.7865841Z ============================= test session starts ============================== 2025-12-04T12:10:21.7865953Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7865994Z cachedir: .pytest_cache 2025-12-04T12:10:21.7866152Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7866198Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7866240Z configfile: pytest.ini 2025-12-04T12:10:21.7866402Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7866477Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7866744Z stepcurrent: skipping 144 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7866790Z Running 1 items in this shard 2025-12-04T12:10:21.7866792Z 2025-12-04T12:10:21.7867005Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [207.9819s] [100%] 2025-12-04T12:10:21.7867215Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3825s] [100%] 2025-12-04T12:10:21.7867401Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3506s] [100%] 2025-12-04T12:10:21.7867405Z 2025-12-04T12:10:21.7867456Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7867598Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7867654Z Traceback (most recent call last): 2025-12-04T12:10:21.7867811Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7867853Z method(*args, **kwargs) 2025-12-04T12:10:21.7868004Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7868055Z method(*args, **kwargs) 2025-12-04T12:10:21.7868207Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7868243Z with policy(): 2025-12-04T12:10:21.7868394Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7868435Z raise RuntimeError(msg) 
2025-12-04T12:10:21.7868820Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.7868833Z 2025-12-04T12:10:21.7868906Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7869162Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7869165Z 2025-12-04T12:10:21.7869251Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7869323Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7869366Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7869421Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7869487Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7869585Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7869622Z graph_break [] 2025-12-04T12:10:21.7869681Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7869822Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7869867Z Traceback (most recent call last): 2025-12-04T12:10:21.7870023Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7870061Z method(*args, **kwargs) 2025-12-04T12:10:21.7870258Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.7870311Z method(*args, **kwargs) 2025-12-04T12:10:21.7870461Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7870499Z with policy(): 2025-12-04T12:10:21.7870650Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7870691Z raise RuntimeError(msg) 2025-12-04T12:10:21.7871074Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.7871077Z 2025-12-04T12:10:21.7871149Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7871404Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7871406Z 2025-12-04T12:10:21.7871506Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7871578Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7871621Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7871676Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7871756Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7871852Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7871889Z graph_break [] 2025-12-04T12:10:21.7871947Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7872020Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:21.7872062Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7872120Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7872218Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7872282Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7872318Z graph_break [] 2025-12-04T12:10:21.7872388Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7872441Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7872583Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7872628Z Traceback (most recent call last): 2025-12-04T12:10:21.7872781Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7872820Z method(*args, **kwargs) 2025-12-04T12:10:21.7872973Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7873014Z method(*args, **kwargs) 2025-12-04T12:10:21.7873162Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7873203Z with policy(): 2025-12-04T12:10:21.7873357Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7873399Z raise RuntimeError(msg) 2025-12-04T12:10:21.7873780Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.7873783Z 2025-12-04T12:10:21.7873864Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7874119Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7874122Z 2025-12-04T12:10:21.7874212Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7874285Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7874328Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7874384Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7874454Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7874551Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7874589Z graph_break [] 2025-12-04T12:10:21.7874649Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7874725Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7874768Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7874828Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7874935Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7875005Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7875042Z graph_break [] 2025-12-04T12:10:21.7875103Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7875186Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7875232Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7875290Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7875386Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7875449Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7875486Z graph_break [] 2025-12-04T12:10:21.7875543Z aten_mm_info [('aten._scaled_mm.default_33_16_16', 1)] 2025-12-04T12:10:21.7875731Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3d957c0767c098b9.xml - 2025-12-04T12:10:21.7875794Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7876379Z FAILED [0.3506s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.7876383Z 2025-12-04T12:10:21.7876456Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7876713Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7876717Z 2025-12-04T12:10:21.7876802Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7876865Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7876937Z ============ 1 failed, 187 deselected, 2 rerun in 208.74s (0:03:28) ============ 2025-12-04T12:10:21.7876975Z Got exit code 1 2025-12-04T12:10:21.7877179Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7877307Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7877460Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a7eca5cc05228d86.xml 2025-12-04T12:10:21.7877519Z ============================= test session starts ============================== 2025-12-04T12:10:21.7877629Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7877671Z cachedir: .pytest_cache 2025-12-04T12:10:21.7877827Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7877872Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7877913Z configfile: pytest.ini 2025-12-04T12:10:21.7878076Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7878153Z collecting ... collected 188 items / 145 deselected / 43 selected 2025-12-04T12:10:21.7878209Z stepcurrent: skipping 145 already run items. 
2025-12-04T12:10:21.7878253Z Running 43 items in this shard 2025-12-04T12:10:21.7878255Z 2025-12-04T12:10:21.7878473Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8332s] [ 2%] 2025-12-04T12:10:21.7878699Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3291s] [ 2%] 2025-12-04T12:10:21.7878890Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3073s] [ 2%] 2025-12-04T12:10:21.7878903Z 2025-12-04T12:10:21.7878956Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7879099Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7879144Z Traceback (most recent call last): 2025-12-04T12:10:21.7879302Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7879343Z method(*args, **kwargs) 2025-12-04T12:10:21.7879495Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7879535Z method(*args, **kwargs) 2025-12-04T12:10:21.7879699Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7879738Z with policy(): 2025-12-04T12:10:21.7879889Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7879932Z raise RuntimeError(msg) 2025-12-04T12:10:21.7880352Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.7880356Z 2025-12-04T12:10:21.7880428Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7880687Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7880690Z 2025-12-04T12:10:21.7880781Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7880856Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7880900Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7880959Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7881025Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7881141Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7881177Z graph_break [] 2025-12-04T12:10:21.7881239Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7881383Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7881429Z Traceback (most recent call last): 2025-12-04T12:10:21.7881587Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7881631Z method(*args, **kwargs) 2025-12-04T12:10:21.7881780Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7881819Z method(*args, **kwargs) 
2025-12-04T12:10:21.7881969Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7882007Z with policy(): 2025-12-04T12:10:21.7882160Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7882215Z raise RuntimeError(msg) 2025-12-04T12:10:21.7882602Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.7882619Z 2025-12-04T12:10:21.7882691Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7882948Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7882950Z 2025-12-04T12:10:21.7883037Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7883110Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7883152Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7883209Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7883273Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7883386Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7883423Z graph_break [] 2025-12-04T12:10:21.7883485Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7883557Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7883601Z frames [('total', 1), 
('ok', 1)] 2025-12-04T12:10:21.7883655Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7883751Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7883815Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7883852Z graph_break [] 2025-12-04T12:10:21.7883910Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7883964Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7884108Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7884154Z Traceback (most recent call last): 2025-12-04T12:10:21.7884307Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7884349Z method(*args, **kwargs) 2025-12-04T12:10:21.7884499Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7884538Z method(*args, **kwargs) 2025-12-04T12:10:21.7884696Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7884733Z with policy(): 2025-12-04T12:10:21.7884885Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7884926Z raise RuntimeError(msg) 2025-12-04T12:10:21.7885314Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.7885317Z 2025-12-04T12:10:21.7885389Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7885651Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7885653Z 2025-12-04T12:10:21.7885738Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7885811Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7885862Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7885921Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7885985Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7886082Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7886129Z graph_break [] 2025-12-04T12:10:21.7886189Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7886261Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7886302Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7886357Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7886453Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7886516Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7886552Z graph_break [] 2025-12-04T12:10:21.7886611Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7886693Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7886738Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7886792Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7886890Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7886953Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7886992Z graph_break [] 2025-12-04T12:10:21.7887049Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7887239Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a7eca5cc05228d86.xml - 2025-12-04T12:10:21.7887298Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7887880Z FAILED [0.3073s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.7887883Z 2025-12-04T12:10:21.7887955Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7888223Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7888225Z 2025-12-04T12:10:21.7888310Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7888371Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7888441Z ================== 1 failed, 145 deselected, 2 rerun in 2.49s ================== 2025-12-04T12:10:21.7888478Z Got exit code 1 2025-12-04T12:10:21.7888519Z Retrying single test... 
2025-12-04T12:10:21.7888663Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-9c2f3b9fbfb6c136.xml 2025-12-04T12:10:21.7888722Z ============================= test session starts ============================== 2025-12-04T12:10:21.7888832Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7888873Z cachedir: .pytest_cache 2025-12-04T12:10:21.7889030Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7889076Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7889116Z configfile: pytest.ini 2025-12-04T12:10:21.7889278Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7889364Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7889621Z stepcurrent: skipping 145 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7889677Z Running 1 items in this shard 2025-12-04T12:10:21.7889679Z 2025-12-04T12:10:21.7889895Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [238.6589s] [100%] 2025-12-04T12:10:21.7890146Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [5.2476s] [100%] 2025-12-04T12:10:21.7890336Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda FAILED [5.1384s] [100%] 2025-12-04T12:10:21.7890339Z 2025-12-04T12:10:21.7890406Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7890549Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7890596Z Traceback (most recent call last): 2025-12-04T12:10:21.7890751Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7890792Z method(*args, **kwargs) 2025-12-04T12:10:21.7890942Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7890983Z method(*args, **kwargs) 2025-12-04T12:10:21.7891133Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7891171Z with policy(): 2025-12-04T12:10:21.7891323Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7891365Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7891749Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.7891753Z 2025-12-04T12:10:21.7891826Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7892099Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7892102Z 2025-12-04T12:10:21.7892187Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7892259Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7892302Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7892358Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7892422Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7892519Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7892555Z graph_break [] 2025-12-04T12:10:21.7892617Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7892761Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7892807Z Traceback (most recent call last): 2025-12-04T12:10:21.7892958Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7893011Z method(*args, **kwargs) 2025-12-04T12:10:21.7893162Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:21.7893201Z method(*args, **kwargs) 2025-12-04T12:10:21.7893349Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7893409Z with policy(): 2025-12-04T12:10:21.7893559Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7893600Z raise RuntimeError(msg) 2025-12-04T12:10:21.7893988Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.7893991Z 2025-12-04T12:10:21.7894064Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7894331Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7894335Z 2025-12-04T12:10:21.7894420Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7894493Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7894534Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7894589Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7894654Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7894752Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7894788Z graph_break [] 2025-12-04T12:10:21.7894848Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7894920Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.7894962Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7895016Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7895113Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7895176Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7895212Z graph_break [] 2025-12-04T12:10:21.7895270Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7895323Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7895476Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7895522Z Traceback (most recent call last): 2025-12-04T12:10:21.7895676Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7895715Z method(*args, **kwargs) 2025-12-04T12:10:21.7895866Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7895906Z method(*args, **kwargs) 2025-12-04T12:10:21.7896056Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7896092Z with policy(): 2025-12-04T12:10:21.7896249Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7896289Z raise RuntimeError(msg) 2025-12-04T12:10:21.7896673Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.7896686Z 2025-12-04T12:10:21.7896758Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7897016Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7897027Z 2025-12-04T12:10:21.7897113Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7897185Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7897227Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7897282Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7897347Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7897444Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7897481Z graph_break [] 2025-12-04T12:10:21.7897549Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7897623Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7897664Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7897720Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7897814Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7897878Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7897914Z graph_break [] 2025-12-04T12:10:21.7897972Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7898045Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7898088Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7898144Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7898241Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7898304Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7898341Z graph_break [] 2025-12-04T12:10:21.7898398Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7898589Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-9c2f3b9fbfb6c136.xml - 2025-12-04T12:10:21.7898647Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7899233Z FAILED [5.1384s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.7899236Z 2025-12-04T12:10:21.7899312Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7899570Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7899573Z 2025-12-04T12:10:21.7899659Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7899720Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7899793Z ============ 1 failed, 187 deselected, 2 rerun in 249.06s (0:04:09) ============ 2025-12-04T12:10:21.7899830Z Got exit code 1 2025-12-04T12:10:21.7899871Z Retrying single test... 
2025-12-04T12:10:21.7900013Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e94a831888b98cbc.xml 2025-12-04T12:10:21.7900080Z ============================= test session starts ============================== 2025-12-04T12:10:21.7900224Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7900265Z cachedir: .pytest_cache 2025-12-04T12:10:21.7900440Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7900486Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7900526Z configfile: pytest.ini 2025-12-04T12:10:21.7900689Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7900765Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7901020Z stepcurrent: skipping 145 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7901064Z Running 1 items in this shard 2025-12-04T12:10:21.7901067Z 2025-12-04T12:10:21.7901295Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8217s] [100%] 2025-12-04T12:10:21.7901510Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3323s] [100%] 2025-12-04T12:10:21.7901700Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3123s] [100%] 2025-12-04T12:10:21.7901703Z 2025-12-04T12:10:21.7901755Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7901897Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7901945Z Traceback (most recent call last): 2025-12-04T12:10:21.7902101Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7902144Z method(*args, **kwargs) 2025-12-04T12:10:21.7902296Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7902336Z method(*args, **kwargs) 2025-12-04T12:10:21.7902485Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7902522Z with policy(): 2025-12-04T12:10:21.7902687Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7902731Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.7903118Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.7903121Z 2025-12-04T12:10:21.7903193Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7903451Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7903453Z 2025-12-04T12:10:21.7903537Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7903610Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7903652Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7903709Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7903788Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7903888Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7903923Z graph_break [] 2025-12-04T12:10:21.7903984Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7904137Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7904184Z Traceback (most recent call last): 2025-12-04T12:10:21.7904336Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7904377Z method(*args, **kwargs) 2025-12-04T12:10:21.7904527Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:21.7904566Z method(*args, **kwargs) 2025-12-04T12:10:21.7904716Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7904753Z with policy(): 2025-12-04T12:10:21.7904916Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7904957Z raise RuntimeError(msg) 2025-12-04T12:10:21.7905342Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.7905345Z 2025-12-04T12:10:21.7905416Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7905674Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7905677Z 2025-12-04T12:10:21.7905761Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7905835Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7905877Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7905934Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7905998Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7906095Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7906130Z graph_break [] 2025-12-04T12:10:21.7906191Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7906271Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.7906314Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7906368Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7906465Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7906530Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7906565Z graph_break [] 2025-12-04T12:10:21.7906624Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7906676Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7906822Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7906866Z Traceback (most recent call last): 2025-12-04T12:10:21.7907021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7907062Z method(*args, **kwargs) 2025-12-04T12:10:21.7907212Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7907268Z method(*args, **kwargs) 2025-12-04T12:10:21.7907419Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7907455Z with policy(): 2025-12-04T12:10:21.7907606Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7907657Z raise RuntimeError(msg) 2025-12-04T12:10:21.7908043Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.7908045Z 2025-12-04T12:10:21.7908117Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7908374Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7908387Z 2025-12-04T12:10:21.7908474Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7908546Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7908589Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7908643Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7908708Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7908803Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7908840Z graph_break [] 2025-12-04T12:10:21.7908899Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7908973Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7909015Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7909070Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7909167Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7909231Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7909267Z graph_break [] 2025-12-04T12:10:21.7909326Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7909397Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7909439Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7909494Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7909601Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7909664Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.7909701Z graph_break [] 2025-12-04T12:10:21.7909759Z aten_mm_info [('aten._scaled_mm.default_33_2048_16', 1)] 2025-12-04T12:10:21.7909948Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-e94a831888b98cbc.xml - 2025-12-04T12:10:21.7910006Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7910624Z FAILED [0.3123s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.7910627Z 2025-12-04T12:10:21.7910699Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7910954Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7910971Z 2025-12-04T12:10:21.7911057Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7911119Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7911200Z ================== 1 failed, 187 deselected, 2 rerun in 2.49s ================== 2025-12-04T12:10:21.7911237Z Got exit code 1 2025-12-04T12:10:21.7911443Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7911570Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7911712Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c95d1a653773ad80.xml 2025-12-04T12:10:21.7911770Z ============================= test session starts ============================== 2025-12-04T12:10:21.7911892Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7911933Z cachedir: .pytest_cache 2025-12-04T12:10:21.7912089Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7912138Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7912178Z configfile: pytest.ini 2025-12-04T12:10:21.7912340Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7912416Z collecting ... collected 188 items / 146 deselected / 42 selected 2025-12-04T12:10:21.7912470Z stepcurrent: skipping 146 already run items. 
2025-12-04T12:10:21.7912513Z Running 42 items in this shard 2025-12-04T12:10:21.7912515Z 2025-12-04T12:10:21.7912731Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [186.8767s] [ 2%] 2025-12-04T12:10:21.7912942Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.6136s] [ 2%] 2025-12-04T12:10:21.7913130Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda FAILED [0.6590s] [ 2%] 2025-12-04T12:10:21.7913132Z 2025-12-04T12:10:21.7913182Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7913338Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7913384Z Traceback (most recent call last): 2025-12-04T12:10:21.7913542Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7913583Z method(*args, **kwargs) 2025-12-04T12:10:21.7913738Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7913777Z method(*args, **kwargs) 2025-12-04T12:10:21.7913927Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7913965Z with policy(): 2025-12-04T12:10:21.7914116Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7914157Z raise RuntimeError(msg) 2025-12-04T12:10:21.7914538Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1025507328. 2025-12-04T12:10:21.7914550Z 2025-12-04T12:10:21.7914624Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7914880Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7914892Z 2025-12-04T12:10:21.7914979Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7915051Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7915094Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7915150Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7915649Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7915750Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7915787Z graph_break [] 2025-12-04T12:10:21.7915848Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7915920Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7916407Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7916457Z current_size = base.storage().size() 2025-12-04T12:10:21.7916498Z Autotune Choices Stats: 2025-12-04T12:10:21.7916868Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:21.7916914Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7916955Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7917058Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7917304Z triton_mm_1 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7917530Z triton_mm_0 0.0070 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7917755Z triton_mm_2 0.0077 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7917978Z triton_mm_3 0.0093 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.7918020Z _scaled_mm 0.0245 ms 28.1% 2025-12-04T12:10:21.7918149Z SingleProcess AUTOTUNE benchmarking takes 0.0273 seconds and 0.1177 seconds precompiling for 5 choices 2025-12-04T12:10:21.7918292Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7918347Z Traceback (most recent call last): 2025-12-04T12:10:21.7918504Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7918543Z method(*args, **kwargs) 2025-12-04T12:10:21.7918694Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7918746Z method(*args, **kwargs) 2025-12-04T12:10:21.7918896Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7918933Z with policy(): 2025-12-04T12:10:21.7919086Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7919127Z raise RuntimeError(msg) 2025-12-04T12:10:21.7919521Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1025507328 and is now 1050673152. 
2025-12-04T12:10:21.7919524Z 2025-12-04T12:10:21.7919599Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7919858Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7919860Z 2025-12-04T12:10:21.7919948Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7920022Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7920066Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7920155Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7920639Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7920739Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7920775Z graph_break [] 2025-12-04T12:10:21.7920839Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7920912Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7921415Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7921464Z current_size = base.storage().size() 2025-12-04T12:10:21.7921507Z Autotune Choices Stats: 2025-12-04T12:10:21.7921869Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:21.7921916Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7921956Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7922058Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7922292Z triton_mm_1 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7922532Z triton_mm_0 0.0070 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7922766Z triton_mm_2 0.0077 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7922987Z triton_mm_3 0.0093 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7923029Z _scaled_mm 0.0245 ms 28.1% 2025-12-04T12:10:21.7923157Z SingleProcess AUTOTUNE benchmarking takes 0.0273 seconds and 
0.1177 seconds precompiling for 5 choices 2025-12-04T12:10:21.7923230Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7923283Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7923341Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7923439Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7923920Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7923957Z graph_break [] 2025-12-04T12:10:21.7924017Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7924090Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7924131Z Autotune Choices Stats: 2025-12-04T12:10:21.7924492Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006318999920040369, "best_triton_pos": 0} 2025-12-04T12:10:21.7924537Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7924577Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7924676Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7924914Z triton_mm_7 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:21.7925138Z triton_mm_5 0.0065 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7925360Z triton_mm_6 0.0065 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7925582Z triton_mm_4 0.0090 ms 70.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7925622Z _scaled_mm 0.0202 ms 31.3% 2025-12-04T12:10:21.7925751Z SingleProcess AUTOTUNE benchmarking takes 0.0275 seconds and 0.0632 seconds precompiling for 5 choices 2025-12-04T12:10:21.7925805Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7925961Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7926006Z Traceback (most recent call last): 2025-12-04T12:10:21.7926161Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7926217Z method(*args, **kwargs) 2025-12-04T12:10:21.7926369Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7926411Z method(*args, **kwargs) 2025-12-04T12:10:21.7926562Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7926599Z with policy(): 2025-12-04T12:10:21.7926751Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7926793Z raise RuntimeError(msg) 2025-12-04T12:10:21.7927189Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.7927192Z 2025-12-04T12:10:21.7927267Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7927526Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7927528Z 2025-12-04T12:10:21.7927615Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7927687Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7927731Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7927786Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7928266Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7928365Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7928400Z graph_break [] 2025-12-04T12:10:21.7928461Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7928543Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7929026Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7929073Z current_size = base.storage().size() 2025-12-04T12:10:21.7929116Z Autotune Choices Stats: 2025-12-04T12:10:21.7929476Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.00687999976798892, "best_triton_pos": 0} 2025-12-04T12:10:21.7929521Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7929561Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7929660Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7929901Z triton_mm_1 0.0069 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7930157Z triton_mm_0 0.0070 ms 98.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7930396Z triton_mm_2 0.0077 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 
2025-12-04T12:10:21.7930619Z triton_mm_3 0.0093 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7930661Z _scaled_mm 0.0245 ms 28.1% 2025-12-04T12:10:21.7930801Z SingleProcess AUTOTUNE benchmarking takes 0.0273 seconds and 0.1177 seconds precompiling for 5 choices 2025-12-04T12:10:21.7930875Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7930916Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7930974Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7931072Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7931550Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7931588Z graph_break [] 2025-12-04T12:10:21.7931647Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7931721Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7931760Z Autotune Choices Stats: 2025-12-04T12:10:21.7932117Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006318999920040369, "best_triton_pos": 0} 2025-12-04T12:10:21.7932161Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 
2025-12-04T12:10:21.7932201Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7932313Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7932543Z triton_mm_7 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7932771Z triton_mm_5 0.0065 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7932994Z triton_mm_6 0.0065 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7933218Z triton_mm_4 0.0090 ms 70.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7933258Z _scaled_mm 0.0202 ms 31.3% 2025-12-04T12:10:21.7933385Z SingleProcess AUTOTUNE benchmarking takes 0.0275 seconds and 0.0632 seconds precompiling for 5 choices 2025-12-04T12:10:21.7933469Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7933511Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7933566Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7933665Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7934156Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7934195Z graph_break [] 2025-12-04T12:10:21.7934254Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7934328Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7934367Z Autotune Choices Stats: 2025-12-04T12:10:21.7934749Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_10", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.7934796Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7934835Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7934934Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7935167Z triton_mm_10 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7935390Z triton_mm_8 0.0065 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7935611Z triton_mm_9 0.0066 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7935835Z triton_mm_11 0.0070 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7935888Z _scaled_mm 0.0206 ms 31.5% 2025-12-04T12:10:21.7936015Z SingleProcess AUTOTUNE benchmarking takes 0.0356 seconds and 0.1819 seconds precompiling for 5 choices 2025-12-04T12:10:21.7936202Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c95d1a653773ad80.xml - 2025-12-04T12:10:21.7936265Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7936843Z FAILED [0.6590s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.7936847Z 2025-12-04T12:10:21.7936920Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7937180Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7937192Z 2025-12-04T12:10:21.7937277Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7937341Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7937413Z ============ 1 failed, 146 deselected, 2 rerun in 188.17s (0:03:08) ============ 2025-12-04T12:10:21.7937462Z Got exit code 1 2025-12-04T12:10:21.7937501Z Retrying single test... 
2025-12-04T12:10:21.7937646Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ad631ddcd1d6cc5a.xml 2025-12-04T12:10:21.7937702Z ============================= test session starts ============================== 2025-12-04T12:10:21.7937816Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7937857Z cachedir: .pytest_cache 2025-12-04T12:10:21.7938015Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7938061Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7938100Z configfile: pytest.ini 2025-12-04T12:10:21.7938274Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7938350Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7938603Z stepcurrent: skipping 146 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7938646Z Running 1 items in this shard 2025-12-04T12:10:21.7938648Z 2025-12-04T12:10:21.7938863Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.0946s] [100%] 2025-12-04T12:10:21.7939072Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.5720s] [100%] 2025-12-04T12:10:21.7939262Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda FAILED [0.6211s] [100%] 2025-12-04T12:10:21.7939265Z 2025-12-04T12:10:21.7939318Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7939459Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7939505Z Traceback (most recent call last): 2025-12-04T12:10:21.7939673Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7939716Z method(*args, **kwargs) 2025-12-04T12:10:21.7939866Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7939908Z method(*args, **kwargs) 2025-12-04T12:10:21.7940058Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7940128Z with policy(): 2025-12-04T12:10:21.7940279Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7940321Z raise RuntimeError(msg) 
2025-12-04T12:10:21.7940705Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1025507328. 2025-12-04T12:10:21.7940707Z 2025-12-04T12:10:21.7940781Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7941037Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7941056Z 2025-12-04T12:10:21.7941141Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7941216Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7941271Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7941328Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7941808Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7941908Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7941946Z graph_break [] 2025-12-04T12:10:21.7942019Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7942091Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7942575Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7942622Z current_size = base.storage().size() 2025-12-04T12:10:21.7942664Z Autotune Choices Stats: 2025-12-04T12:10:21.7943028Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006560000125318766, "best_triton_pos": 0} 2025-12-04T12:10:21.7943073Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7943113Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7943214Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7943448Z triton_mm_3 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7943690Z triton_mm_1 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7943915Z triton_mm_0 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7944135Z triton_mm_2 0.0094 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7944181Z _scaled_mm 0.0231 ms 28.4% 2025-12-04T12:10:21.7944310Z SingleProcess AUTOTUNE benchmarking takes 0.0265 seconds and 0.1192 seconds precompiling for 5 choices 2025-12-04T12:10:21.7944451Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7944498Z Traceback (most recent call last): 2025-12-04T12:10:21.7944654Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7944704Z method(*args, **kwargs) 2025-12-04T12:10:21.7944856Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7944898Z method(*args, **kwargs) 2025-12-04T12:10:21.7945047Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7945098Z with policy(): 2025-12-04T12:10:21.7945250Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7945291Z raise RuntimeError(msg) 2025-12-04T12:10:21.7945678Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1025507328 and is now 1050673152. 
2025-12-04T12:10:21.7945681Z 2025-12-04T12:10:21.7945754Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7946022Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7946027Z 2025-12-04T12:10:21.7946113Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7946185Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7946227Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7946283Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7946763Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7946863Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7946899Z graph_break [] 2025-12-04T12:10:21.7946960Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7947031Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7947524Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7947571Z current_size = base.storage().size() 2025-12-04T12:10:21.7947613Z Autotune Choices Stats: 2025-12-04T12:10:21.7947979Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006560000125318766, "best_triton_pos": 0} 2025-12-04T12:10:21.7948023Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7948063Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7948162Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7948394Z triton_mm_3 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7948615Z triton_mm_1 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7948849Z triton_mm_0 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7949086Z triton_mm_2 0.0094 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7949129Z _scaled_mm 0.0231 ms 28.4% 2025-12-04T12:10:21.7949258Z SingleProcess AUTOTUNE benchmarking takes 0.0265 seconds and 
0.1192 seconds precompiling for 5 choices 2025-12-04T12:10:21.7949331Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7949373Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7949430Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7949529Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7950018Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7950057Z graph_break [] 2025-12-04T12:10:21.7950152Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7950228Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7950269Z Autotune Choices Stats: 2025-12-04T12:10:21.7950627Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.7950671Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7950712Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7950810Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7951038Z triton_mm_4 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=4 2025-12-04T12:10:21.7951273Z triton_mm_6 0.0060 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7951494Z triton_mm_7 0.0060 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7951716Z triton_mm_5 0.0071 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7951757Z _scaled_mm 0.0224 ms 26.3% 2025-12-04T12:10:21.7951885Z SingleProcess AUTOTUNE benchmarking takes 0.0272 seconds and 0.0863 seconds precompiling for 5 choices 2025-12-04T12:10:21.7951937Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7952082Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7952127Z Traceback (most recent call last): 2025-12-04T12:10:21.7952296Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7952339Z method(*args, **kwargs) 2025-12-04T12:10:21.7952491Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7952544Z method(*args, **kwargs) 2025-12-04T12:10:21.7952693Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7952731Z with policy(): 2025-12-04T12:10:21.7952881Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7952923Z raise RuntimeError(msg) 2025-12-04T12:10:21.7953307Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.7953310Z 2025-12-04T12:10:21.7953396Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7953654Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7953658Z 2025-12-04T12:10:21.7953744Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7953815Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7953859Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7953915Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7954395Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7954494Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7954531Z graph_break [] 2025-12-04T12:10:21.7954591Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7954662Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7955152Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7955200Z current_size = base.storage().size() 2025-12-04T12:10:21.7955242Z Autotune Choices Stats: 2025-12-04T12:10:21.7955602Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006560000125318766, "best_triton_pos": 0} 2025-12-04T12:10:21.7955647Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7955686Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7955786Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7956016Z triton_mm_3 0.0066 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7956249Z triton_mm_1 0.0070 ms 93.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7956473Z triton_mm_0 0.0074 ms 88.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.7956706Z triton_mm_2 0.0094 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7956748Z _scaled_mm 0.0231 ms 28.4% 2025-12-04T12:10:21.7956874Z SingleProcess AUTOTUNE benchmarking takes 0.0265 seconds and 0.1192 seconds precompiling for 5 choices 2025-12-04T12:10:21.7956949Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7956990Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7957057Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7957158Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7957636Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7957675Z graph_break [] 2025-12-04T12:10:21.7957734Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7957807Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7957847Z Autotune Choices Stats: 2025-12-04T12:10:21.7958204Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.7958248Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 
2025-12-04T12:10:21.7958288Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7958388Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7958628Z triton_mm_4 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7958854Z triton_mm_6 0.0060 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7959079Z triton_mm_7 0.0060 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7959301Z triton_mm_5 0.0071 ms 83.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7959341Z _scaled_mm 0.0224 ms 26.3% 2025-12-04T12:10:21.7959468Z SingleProcess AUTOTUNE benchmarking takes 0.0272 seconds and 0.0863 seconds precompiling for 5 choices 2025-12-04T12:10:21.7959539Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7959591Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7959646Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7959745Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7960252Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7960302Z graph_break [] 2025-12-04T12:10:21.7960361Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7960437Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7960477Z Autotune Choices Stats: 2025-12-04T12:10:21.7960853Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_8", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.7960899Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7960939Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7961039Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7961265Z triton_mm_8 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7961492Z triton_mm_11 0.0064 ms 92.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7961715Z triton_mm_9 0.0066 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7961938Z triton_mm_10 0.0084 ms 70.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7961980Z _scaled_mm 0.0205 ms 29.1% 2025-12-04T12:10:21.7962106Z SingleProcess AUTOTUNE benchmarking takes 0.0330 seconds and 0.1760 seconds precompiling for 5 choices 2025-12-04T12:10:21.7962307Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ad631ddcd1d6cc5a.xml - 2025-12-04T12:10:21.7962371Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7962955Z FAILED [0.6211s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.7962958Z 2025-12-04T12:10:21.7963031Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7963290Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7963292Z 2025-12-04T12:10:21.7963380Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7963460Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.7963528Z ================== 1 failed, 187 deselected, 2 rerun in 3.31s ================== 2025-12-04T12:10:21.7963565Z Got exit code 1 2025-12-04T12:10:21.7963606Z Retrying single test... 
2025-12-04T12:10:21.7963749Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-5c1e239a1ec6bccf.xml 2025-12-04T12:10:21.7963816Z ============================= test session starts ============================== 2025-12-04T12:10:21.7963929Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7963971Z cachedir: .pytest_cache 2025-12-04T12:10:21.7964129Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7964174Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7964214Z configfile: pytest.ini 2025-12-04T12:10:21.7964378Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7964464Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.7964717Z stepcurrent: skipping 146 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7964762Z Running 1 items in this shard 2025-12-04T12:10:21.7964764Z 2025-12-04T12:10:21.7964980Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.9656s] [100%] 2025-12-04T12:10:21.7965190Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1530s] [100%] 2025-12-04T12:10:21.7965378Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda FAILED [0.9947s] [100%] 2025-12-04T12:10:21.7965381Z 2025-12-04T12:10:21.7965434Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7965575Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7965622Z Traceback (most recent call last): 2025-12-04T12:10:21.7965778Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7965820Z method(*args, **kwargs) 2025-12-04T12:10:21.7965981Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7966023Z method(*args, **kwargs) 2025-12-04T12:10:21.7966174Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7966216Z with policy(): 2025-12-04T12:10:21.7966368Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7966410Z raise RuntimeError(msg) 
2025-12-04T12:10:21.7966794Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1025507328. 2025-12-04T12:10:21.7966798Z 2025-12-04T12:10:21.7966871Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7967134Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7967145Z 2025-12-04T12:10:21.7967231Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7967304Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7967346Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7967402Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7967891Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7967991Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7968027Z graph_break [] 2025-12-04T12:10:21.7968087Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7968160Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7968651Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7968699Z current_size = base.storage().size() 2025-12-04T12:10:21.7968739Z Autotune Choices Stats: 2025-12-04T12:10:21.7969106Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.7969150Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7969191Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7969291Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7969524Z triton_mm_0 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7969749Z triton_mm_3 0.0067 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7969981Z triton_mm_1 0.0070 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7970249Z triton_mm_2 0.0072 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7970289Z _scaled_mm 0.0244 ms 25.1% 2025-12-04T12:10:21.7970416Z SingleProcess AUTOTUNE benchmarking takes 0.0243 seconds and 0.1078 seconds precompiling for 5 choices 2025-12-04T12:10:21.7970559Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7970606Z Traceback (most recent call last): 2025-12-04T12:10:21.7970761Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7970805Z method(*args, **kwargs) 2025-12-04T12:10:21.7970956Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7971011Z method(*args, **kwargs) 2025-12-04T12:10:21.7971160Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7971199Z with policy(): 2025-12-04T12:10:21.7971350Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7971405Z raise RuntimeError(msg) 2025-12-04T12:10:21.7971792Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1025507328 and is now 1050673152. 
2025-12-04T12:10:21.7971796Z 2025-12-04T12:10:21.7971869Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7972127Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7972130Z 2025-12-04T12:10:21.7972227Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7972301Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7972344Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7972401Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7972879Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7972978Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7973015Z graph_break [] 2025-12-04T12:10:21.7973075Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7973148Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7973629Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7973677Z current_size = base.storage().size() 2025-12-04T12:10:21.7973717Z Autotune Choices Stats: 2025-12-04T12:10:21.7974093Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.7974138Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7974179Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7974278Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7974509Z triton_mm_0 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7974731Z triton_mm_3 0.0067 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7974954Z triton_mm_1 0.0070 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7975188Z triton_mm_2 0.0072 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7975242Z _scaled_mm 0.0244 ms 25.1% 2025-12-04T12:10:21.7975369Z SingleProcess AUTOTUNE benchmarking takes 0.0243 seconds and 
0.1078 seconds precompiling for 5 choices 2025-12-04T12:10:21.7975441Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7975483Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7975539Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7975641Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7976132Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7976172Z graph_break [] 2025-12-04T12:10:21.7976232Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7976306Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7976345Z Autotune Choices Stats: 2025-12-04T12:10:21.7976705Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.7976752Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7976792Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7976893Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7977121Z triton_mm_7 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=2 2025-12-04T12:10:21.7977346Z triton_mm_5 0.0063 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7977582Z triton_mm_6 0.0064 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7977809Z triton_mm_4 0.0068 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7977850Z _scaled_mm 0.0226 ms 27.7% 2025-12-04T12:10:21.7977978Z SingleProcess AUTOTUNE benchmarking takes 0.0253 seconds and 0.0911 seconds precompiling for 5 choices 2025-12-04T12:10:21.7978031Z =================================== FAILURES =================================== 2025-12-04T12:10:21.7978172Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7978217Z Traceback (most recent call last): 2025-12-04T12:10:21.7978374Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7978415Z method(*args, **kwargs) 2025-12-04T12:10:21.7978578Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7978618Z method(*args, **kwargs) 2025-12-04T12:10:21.7978769Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7978820Z with policy(): 2025-12-04T12:10:21.7978971Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7979013Z raise RuntimeError(msg) 2025-12-04T12:10:21.7979398Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.7979400Z 2025-12-04T12:10:21.7979474Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7979740Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7979744Z 2025-12-04T12:10:21.7979830Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7979905Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7979947Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7980003Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7980526Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7980626Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7980662Z graph_break [] 2025-12-04T12:10:21.7980722Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7980795Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7981289Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7981336Z current_size = base.storage().size() 2025-12-04T12:10:21.7981378Z Autotune Choices Stats: 2025-12-04T12:10:21.7981740Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.7981784Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7981825Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7981924Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7982157Z triton_mm_0 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7982382Z triton_mm_3 0.0067 ms 91.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7982619Z triton_mm_1 0.0070 ms 87.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 
2025-12-04T12:10:21.7982841Z triton_mm_2 0.0072 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7982897Z _scaled_mm 0.0244 ms 25.1% 2025-12-04T12:10:21.7983025Z SingleProcess AUTOTUNE benchmarking takes 0.0243 seconds and 0.1078 seconds precompiling for 5 choices 2025-12-04T12:10:21.7983099Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7983140Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7983196Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7983295Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7983783Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7983823Z graph_break [] 2025-12-04T12:10:21.7983883Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7983956Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7983997Z Autotune Choices Stats: 2025-12-04T12:10:21.7984359Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.7984403Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 
2025-12-04T12:10:21.7984444Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7984543Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7984772Z triton_mm_7 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7985005Z triton_mm_5 0.0063 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7985226Z triton_mm_6 0.0064 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7985451Z triton_mm_4 0.0068 ms 91.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7985491Z _scaled_mm 0.0226 ms 27.7% 2025-12-04T12:10:21.7985618Z SingleProcess AUTOTUNE benchmarking takes 0.0253 seconds and 0.0911 seconds precompiling for 5 choices 2025-12-04T12:10:21.7985689Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7985730Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7985786Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7985886Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7986378Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7986424Z graph_break [] 2025-12-04T12:10:21.7986486Z aten_mm_info [('aten._scaled_mm.default_33_16_32', 1)] 2025-12-04T12:10:21.7986558Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7986599Z Autotune Choices Stats: 2025-12-04T12:10:21.7986957Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7987002Z AUTOTUNE scaled_mm(33x32, 32x16, , ) 2025-12-04T12:10:21.7987041Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7987151Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7987381Z triton_mm_11 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7987609Z triton_mm_10 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.7987833Z triton_mm_9 0.0065 ms 92.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7988055Z triton_mm_8 0.0070 ms 86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, 
BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7988096Z _scaled_mm 0.0190 ms 31.5% 2025-12-04T12:10:21.7988223Z SingleProcess AUTOTUNE benchmarking takes 0.0321 seconds and 0.3532 seconds precompiling for 5 choices 2025-12-04T12:10:21.7988413Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-5c1e239a1ec6bccf.xml - 2025-12-04T12:10:21.7988472Z =========================== short test summary info ============================ 2025-12-04T12:10:21.7989064Z FAILED [0.9947s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.7989068Z 2025-12-04T12:10:21.7989141Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7989399Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7989401Z 2025-12-04T12:10:21.7989489Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7989551Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.7989620Z ================== 1 failed, 187 deselected, 2 rerun in 4.13s ================== 2025-12-04T12:10:21.7989657Z Got exit code 1 2025-12-04T12:10:21.7989874Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.7990001Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.7990182Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b7332a80964b2c56.xml 2025-12-04T12:10:21.7990252Z ============================= test session starts ============================== 2025-12-04T12:10:21.7990363Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.7990405Z cachedir: .pytest_cache 2025-12-04T12:10:21.7990566Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.7990611Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.7990652Z configfile: pytest.ini 2025-12-04T12:10:21.7990816Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.7990912Z collecting ... collected 188 items / 147 deselected / 41 selected 2025-12-04T12:10:21.7990967Z stepcurrent: skipping 147 already run items. 
2025-12-04T12:10:21.7991012Z Running 41 items in this shard 2025-12-04T12:10:21.7991014Z 2025-12-04T12:10:21.7991235Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.4372s] [ 2%] 2025-12-04T12:10:21.7991448Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.8679s] [ 2%] 2025-12-04T12:10:21.7991640Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda FAILED [0.8113s] [ 2%] 2025-12-04T12:10:21.7991643Z 2025-12-04T12:10:21.7991693Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.7991840Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7991884Z Traceback (most recent call last): 2025-12-04T12:10:21.7992047Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7992090Z method(*args, **kwargs) 2025-12-04T12:10:21.7992243Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7992283Z method(*args, **kwargs) 2025-12-04T12:10:21.7992448Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7992486Z with policy(): 2025-12-04T12:10:21.7992638Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7992681Z raise RuntimeError(msg) 2025-12-04T12:10:21.7993072Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1050673152. 2025-12-04T12:10:21.7993076Z 2025-12-04T12:10:21.7993150Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7993410Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7993413Z 2025-12-04T12:10:21.7993500Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7993585Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7993629Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7993685Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.7994171Z inductor [('triton_bundler_save_kernel', 136), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.7994281Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.7994318Z graph_break [] 2025-12-04T12:10:21.7994382Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.7994453Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.7994947Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.7994995Z current_size = base.storage().size() 2025-12-04T12:10:21.7995037Z Autotune Choices Stats: 2025-12-04T12:10:21.7995405Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.7995451Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.7995492Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.7995590Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.7995821Z triton_mm_7 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7996053Z triton_mm_10 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7996289Z triton_mm_11 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7996513Z triton_mm_4 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=4 2025-12-04T12:10:21.7996737Z triton_mm_5 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7996961Z triton_mm_13 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7997187Z triton_mm_2 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7997407Z triton_mm_8 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7997643Z triton_mm_14 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.7997875Z triton_mm_6 0.0062 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.7998004Z SingleProcess AUTOTUNE benchmarking takes 0.0673 seconds and 0.3214 seconds precompiling for 17 choices 2025-12-04T12:10:21.7998151Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.7998196Z Traceback (most recent call last): 2025-12-04T12:10:21.7998353Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7998407Z method(*args, **kwargs) 2025-12-04T12:10:21.7998560Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.7998601Z method(*args, **kwargs) 2025-12-04T12:10:21.7998751Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.7998788Z with policy(): 2025-12-04T12:10:21.7998940Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.7998982Z raise RuntimeError(msg) 2025-12-04T12:10:21.7999374Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1101004800. 
2025-12-04T12:10:21.7999377Z 2025-12-04T12:10:21.7999452Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.7999710Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.7999713Z 2025-12-04T12:10:21.7999799Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.7999872Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.7999914Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.7999982Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8000509Z inductor [('triton_bundler_save_kernel', 136), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8000609Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8000646Z graph_break [] 2025-12-04T12:10:21.8000710Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8000782Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8001266Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8001326Z current_size = base.storage().size() 2025-12-04T12:10:21.8001367Z Autotune Choices Stats: 2025-12-04T12:10:21.8001733Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8001792Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8001833Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8001932Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8002162Z triton_mm_7 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8002404Z triton_mm_10 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8002630Z triton_mm_11 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8002855Z triton_mm_4 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8003077Z triton_mm_5 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8003303Z triton_mm_13 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8003525Z triton_mm_2 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8003749Z triton_mm_8 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8003983Z triton_mm_14 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8004209Z triton_mm_6 0.0062 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8004336Z SingleProcess AUTOTUNE benchmarking takes 0.0673 seconds and 0.3214 seconds precompiling for 17 choices 2025-12-04T12:10:21.8004412Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8004455Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8004511Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8004610Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8005095Z inductor [('triton_bundler_save_kernel', 136), 
('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8005142Z graph_break [] 2025-12-04T12:10:21.8005202Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8005290Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8005331Z Autotune Choices Stats: 2025-12-04T12:10:21.8005694Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_30", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00595899997279048, "best_triton_pos": 0} 2025-12-04T12:10:21.8005738Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8005782Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8005880Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8006121Z triton_mm_30 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8006346Z triton_mm_22 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8006568Z triton_mm_23 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8006791Z triton_mm_24 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8007017Z triton_mm_29 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8007240Z triton_mm_21 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8007475Z triton_mm_27 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8007702Z triton_mm_16 0.0061 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8007927Z triton_mm_18 0.0062 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8008153Z triton_mm_20 0.0062 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8009231Z SingleProcess AUTOTUNE benchmarking takes 0.0936 seconds and 0.2337 seconds precompiling for 17 choices 
2025-12-04T12:10:21.8009287Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8009433Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8009492Z Traceback (most recent call last): 2025-12-04T12:10:21.8009650Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8009691Z method(*args, **kwargs) 2025-12-04T12:10:21.8009844Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8011549Z method(*args, **kwargs) 2025-12-04T12:10:21.8011711Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8011750Z with policy(): 2025-12-04T12:10:21.8011914Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8011955Z raise RuntimeError(msg) 2025-12-04T12:10:21.8012350Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1101004800 and is now 1151336448. 
2025-12-04T12:10:21.8012354Z 2025-12-04T12:10:21.8012430Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8012694Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8012696Z 2025-12-04T12:10:21.8012785Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8012860Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8012904Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8012961Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8013449Z inductor [('triton_bundler_save_kernel', 136), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8013549Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8013586Z graph_break [] 2025-12-04T12:10:21.8013647Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8013722Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8014237Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8014286Z current_size = base.storage().size() 2025-12-04T12:10:21.8014328Z Autotune Choices Stats: 2025-12-04T12:10:21.8014697Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_7", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8014742Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8014823Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8014925Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8015157Z triton_mm_7 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8015402Z triton_mm_10 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8015640Z triton_mm_11 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8015866Z triton_mm_4 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8016087Z triton_mm_5 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8016312Z triton_mm_13 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8016538Z triton_mm_2 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8016761Z triton_mm_8 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8016985Z triton_mm_14 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8017209Z triton_mm_6 0.0062 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8017338Z SingleProcess AUTOTUNE benchmarking takes 0.0673 seconds and 0.3214 seconds precompiling for 17 choices 2025-12-04T12:10:21.8017413Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8017456Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8017512Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8017621Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8018108Z inductor [('triton_bundler_save_kernel', 136), 
('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8018146Z graph_break [] 2025-12-04T12:10:21.8018208Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8018280Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8018320Z Autotune Choices Stats: 2025-12-04T12:10:21.8018705Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_30", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.00595899997279048, "best_triton_pos": 0} 2025-12-04T12:10:21.8018760Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8018801Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8018900Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8019131Z triton_mm_30 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8019369Z triton_mm_22 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8019594Z triton_mm_23 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8019818Z triton_mm_24 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8020042Z triton_mm_29 0.0060 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8020302Z triton_mm_21 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8020524Z triton_mm_27 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8020752Z triton_mm_16 0.0061 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8020973Z triton_mm_18 0.0062 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8021198Z triton_mm_20 0.0062 ms 96.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8021343Z SingleProcess AUTOTUNE benchmarking takes 0.0936 seconds and 0.2337 seconds precompiling for 17 choices 
2025-12-04T12:10:21.8021414Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8021457Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8021513Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8021611Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8022089Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8022127Z graph_break [] 2025-12-04T12:10:21.8022209Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8022284Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8022325Z Autotune Choices Stats: 2025-12-04T12:10:21.8022687Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_47", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006320000160485506, "best_triton_pos": 0} 2025-12-04T12:10:21.8022757Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8022798Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8022897Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8023127Z triton_mm_47 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.8023354Z triton_mm_38 0.0064 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8023579Z triton_mm_41 0.0069 ms 91.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8023802Z triton_mm_40 0.0070 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8024026Z triton_mm_44 0.0070 ms 90.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8024249Z triton_mm_42 0.0070 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8024473Z triton_mm_45 0.0070 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8024698Z triton_mm_34 0.0070 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8024933Z triton_mm_39 0.0070 ms 89.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8025156Z triton_mm_33 0.0072 ms 87.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8025284Z SingleProcess AUTOTUNE benchmarking takes 0.1343 seconds and 0.2288 seconds precompiling for 17 choices 2025-12-04T12:10:21.8025474Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-b7332a80964b2c56.xml - 2025-12-04T12:10:21.8025535Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8026136Z FAILED [0.8113s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1101004800 and is now 1151336448. 2025-12-04T12:10:21.8026148Z 2025-12-04T12:10:21.8026221Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8026483Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8026485Z 2025-12-04T12:10:21.8026584Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8026647Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.8026716Z ================== 1 failed, 147 deselected, 2 rerun in 4.14s ================== 2025-12-04T12:10:21.8026753Z Got exit code 1 2025-12-04T12:10:21.8026793Z Retrying single test... 2025-12-04T12:10:21.8026937Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-51c236f60d527fad.xml 2025-12-04T12:10:21.8026997Z ============================= test session starts ============================== 2025-12-04T12:10:21.8027111Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8027154Z cachedir: .pytest_cache 2025-12-04T12:10:21.8027312Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8027359Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8027399Z configfile: pytest.ini 2025-12-04T12:10:21.8027565Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8027641Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8027897Z stepcurrent: skipping 147 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8027941Z Running 1 items in this shard 2025-12-04T12:10:21.8027944Z 2025-12-04T12:10:21.8028162Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [32.4809s] [100%] 2025-12-04T12:10:21.8028373Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0095s] [100%] 2025-12-04T12:10:21.8028565Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda FAILED [1.0139s] [100%] 2025-12-04T12:10:21.8028567Z 2025-12-04T12:10:21.8028619Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8028775Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8028824Z Traceback (most recent call last): 2025-12-04T12:10:21.8028983Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8029026Z method(*args, **kwargs) 2025-12-04T12:10:21.8029178Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8029219Z method(*args, **kwargs) 2025-12-04T12:10:21.8029370Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8029408Z with policy(): 2025-12-04T12:10:21.8029559Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8029601Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.8030000Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1050673152. 2025-12-04T12:10:21.8030014Z 2025-12-04T12:10:21.8030087Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8030385Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8030401Z 2025-12-04T12:10:21.8030489Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8030563Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8030605Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8030664Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8031149Z inductor [('triton_bundler_save_kernel', 136), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8031254Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8031291Z graph_break [] 2025-12-04T12:10:21.8031353Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8031426Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8031911Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8031960Z current_size = base.storage().size() 2025-12-04T12:10:21.8032000Z Autotune Choices Stats: 2025-12-04T12:10:21.8032369Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8032414Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8032455Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8032554Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8032802Z triton_mm_12 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8033030Z triton_mm_3 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8033256Z triton_mm_13 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8033486Z triton_mm_6 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8033724Z triton_mm_9 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8033959Z triton_mm_11 0.0064 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8034181Z triton_mm_4 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8034414Z triton_mm_2 0.0068 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8034638Z triton_mm_14 0.0072 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8034860Z triton_mm_5 0.0078 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8034989Z SingleProcess AUTOTUNE benchmarking takes 0.0809 seconds and 0.3670 seconds precompiling for 17 choices 2025-12-04T12:10:21.8035136Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8035182Z Traceback (most 
recent call last): 2025-12-04T12:10:21.8035337Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8035379Z method(*args, **kwargs) 2025-12-04T12:10:21.8035534Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8035577Z method(*args, **kwargs) 2025-12-04T12:10:21.8035728Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8035767Z with policy(): 2025-12-04T12:10:21.8035918Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8035960Z raise RuntimeError(msg) 2025-12-04T12:10:21.8036349Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1101004800. 
2025-12-04T12:10:21.8036355Z 2025-12-04T12:10:21.8036446Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8036706Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8036709Z 2025-12-04T12:10:21.8036796Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8036870Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8036914Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8036970Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8037468Z inductor [('triton_bundler_save_kernel', 136), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8037567Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8037614Z graph_break [] 2025-12-04T12:10:21.8037675Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8037748Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8038232Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8038289Z current_size = base.storage().size() 2025-12-04T12:10:21.8038330Z Autotune Choices Stats: 2025-12-04T12:10:21.8038695Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8038740Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8038781Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8038880Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8039111Z triton_mm_12 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8039336Z triton_mm_3 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8039563Z triton_mm_13 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8039787Z triton_mm_6 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8040011Z triton_mm_9 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8040289Z triton_mm_11 0.0064 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8040511Z triton_mm_4 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8040732Z triton_mm_2 0.0068 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8040956Z triton_mm_14 0.0072 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8041191Z triton_mm_5 0.0078 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8041321Z SingleProcess AUTOTUNE benchmarking takes 0.0809 seconds and 0.3670 seconds precompiling for 17 choices 2025-12-04T12:10:21.8041411Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8041453Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8041509Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8041609Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8042108Z inductor [('triton_bundler_save_kernel', 136), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8042146Z graph_break [] 2025-12-04T12:10:21.8042208Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8042281Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8042322Z Autotune Choices Stats: 2025-12-04T12:10:21.8042681Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_25", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8042727Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8042767Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8042866Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8043098Z triton_mm_25 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8043325Z triton_mm_20 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8043546Z triton_mm_23 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8043771Z triton_mm_16 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8044012Z triton_mm_30 0.0063 ms 95.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8044236Z triton_mm_24 0.0063 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8044458Z triton_mm_28 0.0063 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8044681Z triton_mm_17 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8044917Z triton_mm_26 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8045150Z triton_mm_31 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8045276Z SingleProcess AUTOTUNE benchmarking takes 0.0823 seconds and 0.1119 seconds precompiling for 17 choices 
2025-12-04T12:10:21.8045340Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8045483Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8045531Z Traceback (most recent call last): 2025-12-04T12:10:21.8045689Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8045730Z method(*args, **kwargs) 2025-12-04T12:10:21.8045882Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8045925Z method(*args, **kwargs) 2025-12-04T12:10:21.8046076Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8046114Z with policy(): 2025-12-04T12:10:21.8046265Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8046308Z raise RuntimeError(msg) 2025-12-04T12:10:21.8046696Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1101004800 and is now 1151336448. 
2025-12-04T12:10:21.8046701Z 2025-12-04T12:10:21.8046773Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8047033Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8047035Z 2025-12-04T12:10:21.8047121Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8047195Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8047237Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8047295Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8047790Z inductor [('triton_bundler_save_kernel', 136), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8047889Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8047926Z graph_break [] 2025-12-04T12:10:21.8047987Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8048060Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8048545Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8048602Z current_size = base.storage().size() 2025-12-04T12:10:21.8048642Z Autotune Choices Stats: 2025-12-04T12:10:21.8049006Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8049063Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8049105Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8049224Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8049455Z triton_mm_12 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8049681Z triton_mm_3 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8049906Z triton_mm_13 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8050171Z triton_mm_6 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8050397Z triton_mm_9 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8050624Z triton_mm_11 0.0064 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8050847Z triton_mm_4 0.0065 ms 92.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8051070Z triton_mm_2 0.0068 ms 88.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8051295Z triton_mm_14 0.0072 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8051533Z triton_mm_5 0.0078 ms 76.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8051663Z SingleProcess AUTOTUNE benchmarking takes 0.0809 seconds and 0.3670 seconds precompiling for 17 choices 2025-12-04T12:10:21.8051734Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8051776Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8051832Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8051933Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8052436Z inductor [('triton_bundler_save_kernel', 136), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8052474Z graph_break [] 2025-12-04T12:10:21.8052549Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8052621Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8052662Z Autotune Choices Stats: 2025-12-04T12:10:21.8053021Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_25", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8053080Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8053120Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8053220Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8053451Z triton_mm_25 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8053676Z triton_mm_20 0.0060 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8053898Z triton_mm_23 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8054122Z triton_mm_16 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8054348Z triton_mm_30 0.0063 ms 95.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8054570Z triton_mm_24 0.0063 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8054792Z triton_mm_28 0.0063 ms 95.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8055026Z triton_mm_17 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8055250Z triton_mm_26 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8055472Z triton_mm_31 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8055599Z SingleProcess AUTOTUNE benchmarking takes 0.0823 seconds and 0.1119 seconds precompiling for 17 choices 
2025-12-04T12:10:21.8055672Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8055713Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8055770Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8055879Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8056369Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8056417Z graph_break [] 2025-12-04T12:10:21.8056478Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8056563Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8056603Z Autotune Choices Stats: 2025-12-04T12:10:21.8056966Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_45", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.8057011Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8057053Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8057149Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8057378Z triton_mm_45 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.8057602Z triton_mm_47 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8057828Z triton_mm_46 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8058050Z triton_mm_40 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8058276Z triton_mm_33 0.0063 ms 95.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8058500Z triton_mm_38 0.0065 ms 93.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8058732Z triton_mm_43 0.0067 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8058954Z triton_mm_34 0.0067 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8059175Z triton_mm_44 0.0068 ms 88.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8059399Z triton_mm_42 0.0069 ms 87.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8059538Z SingleProcess AUTOTUNE benchmarking takes 0.1328 seconds and 0.3366 seconds precompiling for 17 choices 2025-12-04T12:10:21.8059725Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-51c236f60d527fad.xml - 2025-12-04T12:10:21.8059798Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8060419Z FAILED [1.0139s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1101004800 and is now 1151336448. 2025-12-04T12:10:21.8060440Z 2025-12-04T12:10:21.8060514Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8060776Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8060779Z 2025-12-04T12:10:21.8060866Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8060931Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.8061000Z ================= 1 failed, 187 deselected, 2 rerun in 34.53s ================== 2025-12-04T12:10:21.8061037Z Got exit code 1 2025-12-04T12:10:21.8061077Z Retrying single test... 2025-12-04T12:10:21.8061223Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ab2dec73e4020814.xml 2025-12-04T12:10:21.8061279Z ============================= test session starts ============================== 2025-12-04T12:10:21.8061391Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8061432Z cachedir: .pytest_cache 2025-12-04T12:10:21.8061591Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8061636Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8061678Z configfile: pytest.ini 2025-12-04T12:10:21.8061841Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8061916Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8062170Z stepcurrent: skipping 147 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8062215Z Running 1 items in this shard 2025-12-04T12:10:21.8062217Z 2025-12-04T12:10:21.8062446Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.5633s] [100%] 2025-12-04T12:10:21.8062661Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.4508s] [100%] 2025-12-04T12:10:21.8062852Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda FAILED [1.3141s] [100%] 2025-12-04T12:10:21.8062854Z 2025-12-04T12:10:21.8062904Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8063050Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8063095Z Traceback (most recent call last): 2025-12-04T12:10:21.8063254Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8063295Z method(*args, **kwargs) 2025-12-04T12:10:21.8063463Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8063504Z method(*args, **kwargs) 2025-12-04T12:10:21.8063675Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8063712Z with policy(): 2025-12-04T12:10:21.8063865Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8063906Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.8064308Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1050673152. 2025-12-04T12:10:21.8064310Z 2025-12-04T12:10:21.8064386Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8064643Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8064646Z 2025-12-04T12:10:21.8064733Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8064807Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8064852Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8064910Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8065399Z inductor [('triton_bundler_save_kernel', 136), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8065606Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8065646Z graph_break [] 2025-12-04T12:10:21.8065707Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8065780Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8066262Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8066310Z current_size = base.storage().size() 2025-12-04T12:10:21.8066351Z Autotune Choices Stats: 2025-12-04T12:10:21.8066728Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.8066777Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8066817Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8066916Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8067148Z triton_mm_12 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8067388Z triton_mm_2 0.0069 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8067612Z triton_mm_8 0.0070 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8067845Z triton_mm_15 0.0071 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8068081Z triton_mm_9 0.0072 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8068305Z triton_mm_13 0.0072 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8068528Z triton_mm_5 0.0074 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8068753Z triton_mm_14 0.0075 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8068975Z triton_mm_0 0.0076 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8069204Z triton_mm_10 0.0077 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8069332Z SingleProcess AUTOTUNE benchmarking takes 0.0902 seconds and 0.3437 seconds precompiling for 17 choices 2025-12-04T12:10:21.8069479Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8069525Z Traceback (most 
recent call last): 2025-12-04T12:10:21.8069683Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8069724Z method(*args, **kwargs) 2025-12-04T12:10:21.8069877Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8069916Z method(*args, **kwargs) 2025-12-04T12:10:21.8070068Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8070139Z with policy(): 2025-12-04T12:10:21.8070292Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8070334Z raise RuntimeError(msg) 2025-12-04T12:10:21.8070721Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1101004800. 
2025-12-04T12:10:21.8070724Z 2025-12-04T12:10:21.8070798Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8071055Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8071057Z 2025-12-04T12:10:21.8071161Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8071234Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8071278Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8071348Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8071835Z inductor [('triton_bundler_save_kernel', 136), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8071945Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8071982Z graph_break [] 2025-12-04T12:10:21.8072044Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8072118Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8072598Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8072645Z current_size = base.storage().size() 2025-12-04T12:10:21.8072689Z Autotune Choices Stats: 2025-12-04T12:10:21.8073051Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.8073098Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8073138Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8073239Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8073469Z triton_mm_12 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8073696Z triton_mm_2 0.0069 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8073918Z triton_mm_8 0.0070 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8074152Z triton_mm_15 0.0071 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8074377Z triton_mm_9 0.0072 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8074601Z triton_mm_13 0.0072 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8074823Z triton_mm_5 0.0074 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8075066Z triton_mm_14 0.0075 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8075299Z triton_mm_0 0.0076 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8075525Z triton_mm_10 0.0077 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8075665Z SingleProcess AUTOTUNE benchmarking takes 0.0902 seconds and 0.3437 seconds precompiling for 17 choices 2025-12-04T12:10:21.8075740Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8075783Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8075840Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8075937Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8076420Z inductor [('triton_bundler_save_kernel', 136), 
('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8076457Z graph_break [] 2025-12-04T12:10:21.8076518Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8076592Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8076631Z Autotune Choices Stats: 2025-12-04T12:10:21.8076997Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_26", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060789999552071095, "best_triton_pos": 0} 2025-12-04T12:10:21.8077041Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8077082Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8077179Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8077409Z triton_mm_26 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8077647Z triton_mm_22 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8077873Z triton_mm_29 0.0065 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8078098Z triton_mm_24 0.0065 ms 93.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8078323Z triton_mm_30 0.0068 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8078564Z triton_mm_21 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8078786Z triton_mm_28 0.0070 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8079019Z triton_mm_31 0.0070 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8079251Z triton_mm_27 0.0071 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8079475Z triton_mm_18 0.0072 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8079603Z SingleProcess AUTOTUNE benchmarking takes 0.1174 seconds and 0.3212 seconds precompiling for 17 choices 
2025-12-04T12:10:21.8079656Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8079801Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8079846Z Traceback (most recent call last): 2025-12-04T12:10:21.8080002Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8080043Z method(*args, **kwargs) 2025-12-04T12:10:21.8080231Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8080270Z method(*args, **kwargs) 2025-12-04T12:10:21.8080422Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8080458Z with policy(): 2025-12-04T12:10:21.8080610Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8080651Z raise RuntimeError(msg) 2025-12-04T12:10:21.8081039Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1101004800 and is now 1151336448. 
2025-12-04T12:10:21.8081043Z 2025-12-04T12:10:21.8081116Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8081375Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8081392Z 2025-12-04T12:10:21.8081480Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8081550Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8081594Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8081650Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8082139Z inductor [('triton_bundler_save_kernel', 136), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8082238Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8082274Z graph_break [] 2025-12-04T12:10:21.8082349Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8082421Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8082916Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8082977Z current_size = base.storage().size() 2025-12-04T12:10:21.8083018Z Autotune Choices Stats: 2025-12-04T12:10:21.8083382Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_12", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.8083427Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8083467Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8083567Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8083798Z triton_mm_12 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8084026Z triton_mm_2 0.0069 ms 90.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8084250Z triton_mm_8 0.0070 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8084472Z triton_mm_15 0.0071 ms 88.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8084696Z triton_mm_9 0.0072 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8084921Z triton_mm_13 0.0072 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8085151Z triton_mm_5 0.0074 ms 84.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8085377Z triton_mm_14 0.0075 ms 83.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8085601Z triton_mm_0 0.0076 ms 82.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8085827Z triton_mm_10 0.0077 ms 80.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8085956Z SingleProcess AUTOTUNE benchmarking takes 0.0902 seconds and 0.3437 seconds precompiling for 17 choices 2025-12-04T12:10:21.8086040Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8086082Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8086139Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8086246Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8086728Z inductor [('triton_bundler_save_kernel', 136), 
('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8086778Z graph_break [] 2025-12-04T12:10:21.8086837Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8086910Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8086951Z Autotune Choices Stats: 2025-12-04T12:10:21.8087311Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_26", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060789999552071095, "best_triton_pos": 0} 2025-12-04T12:10:21.8087356Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8087397Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8087495Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8087726Z triton_mm_26 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8087952Z triton_mm_22 0.0062 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8088179Z triton_mm_29 0.0065 ms 93.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8088403Z triton_mm_24 0.0065 ms 93.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8088628Z triton_mm_30 0.0068 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8088859Z triton_mm_21 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8089081Z triton_mm_28 0.0070 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8089419Z triton_mm_31 0.0070 ms 86.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8089643Z triton_mm_27 0.0071 ms 85.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8089877Z triton_mm_18 0.0072 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8090005Z SingleProcess AUTOTUNE benchmarking takes 0.1174 seconds and 0.3212 seconds precompiling for 17 choices 
2025-12-04T12:10:21.8090087Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8090169Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8090225Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8090343Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8090825Z inductor [('triton_bundler_save_kernel', 136), ('async_compile_cache_miss', 18), ('benchmarking.InductorBenchmarker.benchmark_gpu', 17), ('generated_module_cache_miss', 16), ('select_algorithm_num_precompiles', 16), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8090862Z graph_break [] 2025-12-04T12:10:21.8090922Z aten_mm_info [('aten._scaled_mm.default_33_2048_32', 1)] 2025-12-04T12:10:21.8090996Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8091036Z Autotune Choices Stats: 2025-12-04T12:10:21.8091396Z {"num_choices": 17, "num_triton_choices": 16, "best_kernel": "triton_mm_45", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8091442Z AUTOTUNE scaled_mm(33x32, 32x2048, , ) 2025-12-04T12:10:21.8091482Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8091580Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8091811Z triton_mm_45 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 
2025-12-04T12:10:21.8092038Z triton_mm_37 0.0068 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8092264Z triton_mm_43 0.0069 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8092488Z triton_mm_34 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8092730Z triton_mm_39 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8092952Z triton_mm_44 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=32, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8093176Z triton_mm_38 0.0071 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8093410Z triton_mm_47 0.0072 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=64, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8093637Z triton_mm_41 0.0072 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8093877Z triton_mm_42 0.0072 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8094005Z SingleProcess AUTOTUNE benchmarking takes 0.1368 seconds and 0.3271 seconds precompiling for 17 choices 2025-12-04T12:10:21.8094207Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-ab2dec73e4020814.xml - 2025-12-04T12:10:21.8094266Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8094856Z FAILED [1.3141s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1101004800 and is now 1151336448. 2025-12-04T12:10:21.8094859Z 2025-12-04T12:10:21.8094932Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8095191Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8095195Z 2025-12-04T12:10:21.8095282Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8095342Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.8095412Z ================== 1 failed, 187 deselected, 2 rerun in 5.35s ================== 2025-12-04T12:10:21.8095450Z Got exit code 1 2025-12-04T12:10:21.8095656Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8095783Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.8095927Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c0c04ba5bc37a69f.xml 2025-12-04T12:10:21.8095985Z ============================= test session starts ============================== 2025-12-04T12:10:21.8096100Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8096140Z cachedir: .pytest_cache 2025-12-04T12:10:21.8096312Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8096357Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8096400Z configfile: pytest.ini 2025-12-04T12:10:21.8096562Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8096641Z collecting ... collected 188 items / 148 deselected / 40 selected 2025-12-04T12:10:21.8096697Z stepcurrent: skipping 148 already run items. 
2025-12-04T12:10:21.8096742Z Running 40 items in this shard 2025-12-04T12:10:21.8096744Z 2025-12-04T12:10:21.8096968Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [33.2763s] [ 2%] 2025-12-04T12:10:21.8097180Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.6931s] [ 2%] 2025-12-04T12:10:21.8097380Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.6758s] [ 2%] 2025-12-04T12:10:21.8097383Z 2025-12-04T12:10:21.8097444Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8097585Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8097631Z Traceback (most recent call last): 2025-12-04T12:10:21.8097789Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8097840Z method(*args, **kwargs) 2025-12-04T12:10:21.8097991Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8098031Z method(*args, **kwargs) 2025-12-04T12:10:21.8098186Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8098223Z with policy(): 2025-12-04T12:10:21.8098374Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8098416Z raise RuntimeError(msg) 2025-12-04T12:10:21.8098804Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1025507328. 2025-12-04T12:10:21.8098807Z 2025-12-04T12:10:21.8098883Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8099146Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8099149Z 2025-12-04T12:10:21.8099236Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8099309Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8099353Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8099409Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8099889Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8099989Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8100024Z graph_break [] 2025-12-04T12:10:21.8100134Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8100208Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8100690Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8100738Z current_size = base.storage().size() 2025-12-04T12:10:21.8100778Z Autotune Choices Stats: 2025-12-04T12:10:21.8101162Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.8101209Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8101251Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8101363Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8101598Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8101839Z triton_mm_2 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8102067Z triton_mm_3 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8102286Z triton_mm_0 0.0076 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:21.8102332Z _scaled_mm 0.0232 ms 26.9% 2025-12-04T12:10:21.8102459Z SingleProcess AUTOTUNE benchmarking takes 0.0258 seconds and 0.1609 seconds precompiling for 5 choices 2025-12-04T12:10:21.8102604Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8102650Z Traceback (most recent call last): 2025-12-04T12:10:21.8102805Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8102846Z method(*args, **kwargs) 2025-12-04T12:10:21.8103003Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8103042Z method(*args, **kwargs) 2025-12-04T12:10:21.8103194Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8103232Z with policy(): 2025-12-04T12:10:21.8103383Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8103425Z raise RuntimeError(msg) 2025-12-04T12:10:21.8103811Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1025507328 and is now 1050673152. 
2025-12-04T12:10:21.8103815Z 2025-12-04T12:10:21.8103890Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8104159Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8104162Z 2025-12-04T12:10:21.8104249Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8104321Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8104364Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8104420Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8104900Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8105008Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8105044Z graph_break [] 2025-12-04T12:10:21.8105107Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8105190Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8105672Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8105736Z current_size = base.storage().size() 2025-12-04T12:10:21.8105777Z Autotune Choices Stats: 2025-12-04T12:10:21.8106143Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.8106190Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8106233Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8106333Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8106563Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8106786Z triton_mm_2 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8107012Z triton_mm_3 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8107235Z triton_mm_0 0.0076 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8107277Z _scaled_mm 0.0232 ms 26.9% 2025-12-04T12:10:21.8107405Z SingleProcess AUTOTUNE benchmarking takes 0.0258 
seconds and 0.1609 seconds precompiling for 5 choices 2025-12-04T12:10:21.8107478Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8107519Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8107575Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8107684Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8108161Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8108199Z graph_break [] 2025-12-04T12:10:21.8108259Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8108333Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8108372Z Autotune Choices Stats: 2025-12-04T12:10:21.8108742Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.8108789Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8108840Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8108937Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8109166Z triton_mm_5 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8109400Z triton_mm_6 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8109625Z triton_mm_7 0.0066 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8109847Z triton_mm_4 0.0076 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8109888Z _scaled_mm 0.0224 ms 27.0% 2025-12-04T12:10:21.8110015Z SingleProcess AUTOTUNE benchmarking takes 0.0248 seconds and 0.1266 seconds precompiling for 5 choices 2025-12-04T12:10:21.8110069Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8110244Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8110290Z Traceback (most recent call last): 2025-12-04T12:10:21.8110447Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8110489Z method(*args, **kwargs) 2025-12-04T12:10:21.8110641Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8110682Z method(*args, **kwargs) 2025-12-04T12:10:21.8110833Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8110869Z with policy(): 2025-12-04T12:10:21.8111021Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8111062Z raise RuntimeError(msg) 2025-12-04T12:10:21.8111453Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.8111470Z 2025-12-04T12:10:21.8111544Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8111802Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8111805Z 2025-12-04T12:10:21.8111891Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8111962Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8112006Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8112062Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8112554Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8112651Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8112701Z graph_break [] 2025-12-04T12:10:21.8112762Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8112834Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8113314Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8113375Z current_size = base.storage().size() 2025-12-04T12:10:21.8113419Z Autotune Choices Stats: 2025-12-04T12:10:21.8113785Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006240000016987324, "best_triton_pos": 0} 2025-12-04T12:10:21.8113832Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8113873Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8113973Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8114204Z triton_mm_1 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8114429Z triton_mm_2 0.0064 ms 96.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8114652Z triton_mm_3 0.0065 ms 95.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:21.8114873Z triton_mm_0 0.0076 ms 82.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8114917Z _scaled_mm 0.0232 ms 26.9% 2025-12-04T12:10:21.8115043Z SingleProcess AUTOTUNE benchmarking takes 0.0258 seconds and 0.1609 seconds precompiling for 5 choices 2025-12-04T12:10:21.8115116Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8115159Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8115226Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8115325Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8115805Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8115842Z graph_break [] 2025-12-04T12:10:21.8115902Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8115974Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8116017Z Autotune Choices Stats: 2025-12-04T12:10:21.8116385Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006039999891072512, "best_triton_pos": 0} 2025-12-04T12:10:21.8116440Z AUTOTUNE scaled_mm(3x1024, 
1024x16, , ) 2025-12-04T12:10:21.8116481Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8116580Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8116809Z triton_mm_5 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8117042Z triton_mm_6 0.0061 ms 99.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8117265Z triton_mm_7 0.0066 ms 91.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8117486Z triton_mm_4 0.0076 ms 79.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8117528Z _scaled_mm 0.0224 ms 27.0% 2025-12-04T12:10:21.8117654Z SingleProcess AUTOTUNE benchmarking takes 0.0248 seconds and 0.1266 seconds precompiling for 5 choices 2025-12-04T12:10:21.8117727Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8117769Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8117825Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8117925Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8118402Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), 
('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8118441Z graph_break [] 2025-12-04T12:10:21.8118501Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8118574Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8118613Z Autotune Choices Stats: 2025-12-04T12:10:21.8118986Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_11", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005960000213235617, "best_triton_pos": 0} 2025-12-04T12:10:21.8119032Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8119073Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8119170Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8119404Z triton_mm_11 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8119630Z triton_mm_10 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8119864Z triton_mm_8 0.0076 ms 78.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8120121Z triton_mm_9 0.0084 ms 71.0% ACC_TYPE='tl.float32', 
ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8120182Z _scaled_mm 0.0242 ms 24.6% 2025-12-04T12:10:21.8120308Z SingleProcess AUTOTUNE benchmarking takes 0.0257 seconds and 0.1438 seconds precompiling for 5 choices 2025-12-04T12:10:21.8120510Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-c0c04ba5bc37a69f.xml - 2025-12-04T12:10:21.8120571Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8121157Z FAILED [0.6758s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.8121161Z 2025-12-04T12:10:21.8121234Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8121493Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8121496Z 2025-12-04T12:10:21.8121582Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8121644Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8121714Z ================= 1 failed, 148 deselected, 2 rerun in 34.67s ================== 2025-12-04T12:10:21.8121752Z Got exit code 1 2025-12-04T12:10:21.8121790Z Retrying single test... 
2025-12-04T12:10:21.8121933Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0bb6683db971732a.xml 2025-12-04T12:10:21.8121990Z ============================= test session starts ============================== 2025-12-04T12:10:21.8122103Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8122143Z cachedir: .pytest_cache 2025-12-04T12:10:21.8122306Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8122351Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8122394Z configfile: pytest.ini 2025-12-04T12:10:21.8122560Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8122650Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8122905Z stepcurrent: skipping 148 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8122948Z Running 1 items in this shard 2025-12-04T12:10:21.8122950Z 2025-12-04T12:10:21.8123165Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.2405s] [100%] 2025-12-04T12:10:21.8123377Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.6696s] [100%] 2025-12-04T12:10:21.8123578Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.5886s] [100%] 2025-12-04T12:10:21.8123582Z 2025-12-04T12:10:21.8123633Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8123775Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8123833Z Traceback (most recent call last): 2025-12-04T12:10:21.8123991Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8124034Z method(*args, **kwargs) 2025-12-04T12:10:21.8124197Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8124237Z method(*args, **kwargs) 2025-12-04T12:10:21.8124388Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8124425Z with policy(): 2025-12-04T12:10:21.8124578Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8124620Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.8125005Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1025507328. 2025-12-04T12:10:21.8125008Z 2025-12-04T12:10:21.8125082Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8125340Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8125342Z 2025-12-04T12:10:21.8125428Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8125502Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8125545Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8125600Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8126080Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8126180Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8126215Z graph_break [] 2025-12-04T12:10:21.8126276Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8126348Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8126845Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8126893Z current_size = base.storage().size() 2025-12-04T12:10:21.8126934Z Autotune Choices Stats: 2025-12-04T12:10:21.8127294Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006399999838322401, "best_triton_pos": 0} 2025-12-04T12:10:21.8127340Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8127392Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8127492Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8127723Z triton_mm_2 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8127960Z triton_mm_1 0.0067 ms 95.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8128196Z triton_mm_0 0.0076 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8128422Z triton_mm_3 0.0090 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8128464Z _scaled_mm 0.0245 ms 26.1% 2025-12-04T12:10:21.8128592Z SingleProcess AUTOTUNE benchmarking takes 0.0259 seconds and 0.1476 seconds precompiling for 5 choices 2025-12-04T12:10:21.8128735Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8128780Z Traceback (most recent call last): 2025-12-04T12:10:21.8128936Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8128979Z method(*args, **kwargs) 2025-12-04T12:10:21.8129130Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8129171Z method(*args, **kwargs) 2025-12-04T12:10:21.8129322Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8129360Z with policy(): 2025-12-04T12:10:21.8129510Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8129553Z raise RuntimeError(msg) 2025-12-04T12:10:21.8129939Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1025507328 and is now 1050673152. 
2025-12-04T12:10:21.8129943Z 2025-12-04T12:10:21.8130017Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8130332Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8130334Z 2025-12-04T12:10:21.8130421Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8130494Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8130537Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8130594Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8131077Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8131178Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8131228Z graph_break [] 2025-12-04T12:10:21.8131291Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8131362Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8131859Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8131918Z current_size = base.storage().size() 2025-12-04T12:10:21.8131959Z Autotune Choices Stats: 2025-12-04T12:10:21.8132320Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006399999838322401, "best_triton_pos": 0} 2025-12-04T12:10:21.8132365Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8132407Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8132505Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8132734Z triton_mm_2 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8132960Z triton_mm_1 0.0067 ms 95.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8133186Z triton_mm_0 0.0076 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8133408Z triton_mm_3 0.0090 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8133450Z _scaled_mm 0.0245 ms 26.1% 2025-12-04T12:10:21.8133577Z SingleProcess AUTOTUNE benchmarking takes 0.0259 seconds 
and 0.1476 seconds precompiling for 5 choices 2025-12-04T12:10:21.8133649Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8133691Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8133748Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8133846Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8134333Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8134371Z graph_break [] 2025-12-04T12:10:21.8134431Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8134505Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8134546Z Autotune Choices Stats: 2025-12-04T12:10:21.8134900Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.8134962Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8135004Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8135102Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8135340Z triton_mm_6 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8135565Z triton_mm_5 0.0066 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8135798Z triton_mm_7 0.0069 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8136023Z triton_mm_4 0.0076 ms 77.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8136064Z _scaled_mm 0.0210 ms 28.1% 2025-12-04T12:10:21.8136191Z SingleProcess AUTOTUNE benchmarking takes 0.0263 seconds and 0.1074 seconds precompiling for 5 choices 2025-12-04T12:10:21.8136243Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8136387Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8136433Z Traceback (most recent call last): 2025-12-04T12:10:21.8136587Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8136627Z method(*args, **kwargs) 2025-12-04T12:10:21.8136780Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8136820Z method(*args, **kwargs) 2025-12-04T12:10:21.8136970Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8137008Z with policy(): 2025-12-04T12:10:21.8137161Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8137203Z raise RuntimeError(msg) 2025-12-04T12:10:21.8137591Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1153433600. 2025-12-04T12:10:21.8137594Z 2025-12-04T12:10:21.8137669Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8137936Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8137939Z 2025-12-04T12:10:21.8138026Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8138097Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8138140Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8138195Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8138687Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8138787Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8138825Z graph_break [] 2025-12-04T12:10:21.8138897Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8138968Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8139450Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8139506Z current_size = base.storage().size() 2025-12-04T12:10:21.8139547Z Autotune Choices Stats: 2025-12-04T12:10:21.8139907Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006399999838322401, "best_triton_pos": 0} 2025-12-04T12:10:21.8139955Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8139995Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8140142Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8140370Z triton_mm_2 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8140599Z triton_mm_1 0.0067 ms 95.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8140822Z triton_mm_0 0.0076 ms 84.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 
2025-12-04T12:10:21.8141044Z triton_mm_3 0.0090 ms 71.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8141086Z _scaled_mm 0.0245 ms 26.1% 2025-12-04T12:10:21.8141215Z SingleProcess AUTOTUNE benchmarking takes 0.0259 seconds and 0.1476 seconds precompiling for 5 choices 2025-12-04T12:10:21.8141289Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8141330Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8141390Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8141505Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8141984Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8142022Z graph_break [] 2025-12-04T12:10:21.8142081Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8142156Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8142196Z Autotune Choices Stats: 2025-12-04T12:10:21.8142563Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_6", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.005880000069737434, "best_triton_pos": 0} 2025-12-04T12:10:21.8142608Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 
2025-12-04T12:10:21.8142662Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8142758Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8142987Z triton_mm_6 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8143225Z triton_mm_5 0.0066 ms 89.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8143452Z triton_mm_7 0.0069 ms 85.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8143680Z triton_mm_4 0.0076 ms 77.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8143721Z _scaled_mm 0.0210 ms 28.1% 2025-12-04T12:10:21.8143848Z SingleProcess AUTOTUNE benchmarking takes 0.0263 seconds and 0.1074 seconds precompiling for 5 choices 2025-12-04T12:10:21.8143922Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8143963Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8144019Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8144118Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8144564Z inductor [('triton_bundler_save_kernel', 32), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('async_compile_cache_miss', 4), ('select_algorithm_num_precompiles', 
4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8144602Z graph_break [] 2025-12-04T12:10:21.8144661Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8144735Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8144776Z Autotune Choices Stats: 2025-12-04T12:10:21.8145257Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "_scaled_mm", "best_time": 0.006159000098705292, "best_triton_pos": 1, "best_triton_time": 0.006399999838322401, "best_triton_kernel": "triton_mm_9", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1"} 2025-12-04T12:10:21.8145303Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8145343Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8145441Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8145483Z _scaled_mm 0.0062 ms 100.0% 2025-12-04T12:10:21.8145714Z triton_mm_9 0.0064 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8145939Z triton_mm_10 0.0068 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8146171Z triton_mm_8 0.0076 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 
2025-12-04T12:10:21.8146398Z triton_mm_11 0.0093 ms 66.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8146536Z SingleProcess AUTOTUNE benchmarking takes 0.0323 seconds and 0.2200 seconds precompiling for 5 choices 2025-12-04T12:10:21.8146723Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-0bb6683db971732a.xml - 2025-12-04T12:10:21.8146793Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8147375Z FAILED [0.5886s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1153433600. 2025-12-04T12:10:21.8147379Z 2025-12-04T12:10:21.8147452Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8147711Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8147714Z 2025-12-04T12:10:21.8147801Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8147862Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8147929Z ================== 1 failed, 187 deselected, 2 rerun in 3.52s ================== 2025-12-04T12:10:21.8147967Z Got exit code 1 2025-12-04T12:10:21.8148009Z Retrying single test... 
2025-12-04T12:10:21.8148152Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3ddab0696dc61691.xml 2025-12-04T12:10:21.8148209Z ============================= test session starts ============================== 2025-12-04T12:10:21.8148319Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8148360Z cachedir: .pytest_cache 2025-12-04T12:10:21.8148517Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8148564Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8148605Z configfile: pytest.ini 2025-12-04T12:10:21.8148768Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8148844Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8149114Z stepcurrent: skipping 148 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8149158Z Running 1 items in this shard 2025-12-04T12:10:21.8149160Z 2025-12-04T12:10:21.8149374Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.2671s] [100%] 2025-12-04T12:10:21.8149583Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.6670s] [100%] 2025-12-04T12:10:21.8149773Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda FAILED [0.7618s] [100%] 2025-12-04T12:10:21.8149775Z 2025-12-04T12:10:21.8149838Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8149982Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8150039Z Traceback (most recent call last): 2025-12-04T12:10:21.8150227Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8150269Z method(*args, **kwargs) 2025-12-04T12:10:21.8150420Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8150474Z method(*args, **kwargs) 2025-12-04T12:10:21.8150624Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8150663Z with policy(): 2025-12-04T12:10:21.8150814Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8150857Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.8151241Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1025507328. 2025-12-04T12:10:21.8151244Z 2025-12-04T12:10:21.8151317Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8151576Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8151579Z 2025-12-04T12:10:21.8151664Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8151739Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8151782Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8151839Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8152322Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8152423Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8152458Z graph_break [] 2025-12-04T12:10:21.8152523Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8152595Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8153089Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8153138Z current_size = base.storage().size() 2025-12-04T12:10:21.8153178Z Autotune Choices Stats: 2025-12-04T12:10:21.8153542Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8153587Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8153628Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8153740Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8153972Z triton_mm_2 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8154211Z triton_mm_3 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8154450Z triton_mm_1 0.0075 ms 81.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8154673Z triton_mm_0 0.0076 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8154714Z _scaled_mm 0.0204 ms 30.1% 2025-12-04T12:10:21.8154842Z SingleProcess AUTOTUNE benchmarking takes 0.0268 seconds and 0.1474 seconds precompiling for 5 choices 2025-12-04T12:10:21.8154984Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8155030Z Traceback (most recent call last): 2025-12-04T12:10:21.8155185Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8155227Z method(*args, **kwargs) 2025-12-04T12:10:21.8155377Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8155419Z method(*args, **kwargs) 2025-12-04T12:10:21.8155569Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8155610Z with policy(): 2025-12-04T12:10:21.8155761Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8155802Z raise RuntimeError(msg) 2025-12-04T12:10:21.8156186Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1025507328 and is now 1050673152. 
2025-12-04T12:10:21.8156191Z 2025-12-04T12:10:21.8156263Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8156521Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8156524Z 2025-12-04T12:10:21.8156621Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8156695Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8156738Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8156795Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8157276Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8157377Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8157413Z graph_break [] 2025-12-04T12:10:21.8157486Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8157559Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8158041Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8158099Z current_size = base.storage().size() 2025-12-04T12:10:21.8158150Z Autotune Choices Stats: 2025-12-04T12:10:21.8158513Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8158558Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8158600Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8158698Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8158932Z triton_mm_2 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8159157Z triton_mm_3 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8159386Z triton_mm_1 0.0075 ms 81.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8159608Z triton_mm_0 0.0076 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8159649Z _scaled_mm 0.0204 ms 30.1% 2025-12-04T12:10:21.8159776Z SingleProcess AUTOTUNE benchmarking takes 0.0268 seconds 
and 0.1474 seconds precompiling for 5 choices 2025-12-04T12:10:21.8159848Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8159891Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8159948Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8160045Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8160569Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8160607Z graph_break [] 2025-12-04T12:10:21.8160667Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8160741Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8160780Z Autotune Choices Stats: 2025-12-04T12:10:21.8161141Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.8161191Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8161245Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8161347Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8161577Z triton_mm_5 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8161814Z triton_mm_6 0.0065 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8162055Z triton_mm_4 0.0076 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8162282Z triton_mm_7 0.0076 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8162324Z _scaled_mm 0.0210 ms 30.9% 2025-12-04T12:10:21.8162453Z SingleProcess AUTOTUNE benchmarking takes 0.0276 seconds and 0.1423 seconds precompiling for 5 choices 2025-12-04T12:10:21.8162504Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8162646Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8162697Z Traceback (most recent call last): 2025-12-04T12:10:21.8162853Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8162894Z method(*args, **kwargs) 2025-12-04T12:10:21.8163046Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8163090Z method(*args, **kwargs) 2025-12-04T12:10:21.8163240Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8163278Z with policy(): 2025-12-04T12:10:21.8163429Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8163471Z raise RuntimeError(msg) 2025-12-04T12:10:21.8163855Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.8163858Z 2025-12-04T12:10:21.8163935Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8164203Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8164207Z 2025-12-04T12:10:21.8164293Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8164367Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8164409Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8164466Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8164941Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8165051Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8165088Z graph_break [] 2025-12-04T12:10:21.8165148Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8165236Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8165715Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8165773Z current_size = base.storage().size() 2025-12-04T12:10:21.8165815Z Autotune Choices Stats: 2025-12-04T12:10:21.8166177Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8166223Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8166265Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8166362Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8166592Z triton_mm_2 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8166818Z triton_mm_3 0.0066 ms 92.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8167047Z triton_mm_1 0.0075 ms 81.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:21.8167269Z triton_mm_0 0.0076 ms 80.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8167312Z _scaled_mm 0.0204 ms 30.1% 2025-12-04T12:10:21.8167439Z SingleProcess AUTOTUNE benchmarking takes 0.0268 seconds and 0.1474 seconds precompiling for 5 choices 2025-12-04T12:10:21.8167513Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8167555Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8167611Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8167710Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8168199Z inductor [('triton_bundler_save_kernel', 40), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), ('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8168238Z graph_break [] 2025-12-04T12:10:21.8168297Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8168370Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8168412Z Autotune Choices Stats: 2025-12-04T12:10:21.8168785Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_5", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.8168831Z AUTOTUNE scaled_mm(3x1024, 
1024x16, , ) 2025-12-04T12:10:21.8168959Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8169073Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8169308Z triton_mm_5 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8169546Z triton_mm_6 0.0065 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8169767Z triton_mm_4 0.0076 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8169992Z triton_mm_7 0.0076 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8170033Z _scaled_mm 0.0210 ms 30.9% 2025-12-04T12:10:21.8170207Z SingleProcess AUTOTUNE benchmarking takes 0.0276 seconds and 0.1423 seconds precompiling for 5 choices 2025-12-04T12:10:21.8170278Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8170321Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8170376Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8170475Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8170951Z inductor [('triton_bundler_save_kernel', 40), ('async_compile_cache_miss', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 5), ('generated_module_cache_miss', 4), 
('select_algorithm_num_precompiles', 4), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8170988Z graph_break [] 2025-12-04T12:10:21.8171048Z aten_mm_info [('aten._scaled_mm.default_3_16_1024', 1)] 2025-12-04T12:10:21.8171120Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8171160Z Autotune Choices Stats: 2025-12-04T12:10:21.8171521Z {"num_choices": 5, "num_triton_choices": 4, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8171566Z AUTOTUNE scaled_mm(3x1024, 1024x16, , ) 2025-12-04T12:10:21.8171624Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8171723Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8171956Z triton_mm_9 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8172185Z triton_mm_11 0.0078 ms 78.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8172410Z triton_mm_10 0.0095 ms 64.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8172646Z triton_mm_8 0.0098 ms 62.7% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8172699Z _scaled_mm 0.0217 ms 28.2% 2025-12-04T12:10:21.8172825Z SingleProcess AUTOTUNE benchmarking takes 0.0348 seconds and 0.2232 seconds precompiling for 5 choices 2025-12-04T12:10:21.8173011Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-3ddab0696dc61691.xml - 2025-12-04T12:10:21.8173070Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8173669Z FAILED [0.7618s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1050673152 and is now 1075838976. 2025-12-04T12:10:21.8173672Z 2025-12-04T12:10:21.8173746Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8174005Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8174007Z 2025-12-04T12:10:21.8174094Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8174157Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.8174227Z ================== 1 failed, 187 deselected, 2 rerun in 3.72s ================== 2025-12-04T12:10:21.8174264Z Got exit code 1 2025-12-04T12:10:21.8174472Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8174598Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.8174743Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-4b6f9dcfde219408.xml 2025-12-04T12:10:21.8174798Z ============================= test session starts ============================== 2025-12-04T12:10:21.8174910Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8174953Z cachedir: .pytest_cache 2025-12-04T12:10:21.8175111Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8175156Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8175197Z configfile: pytest.ini 2025-12-04T12:10:21.8175368Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8175445Z collecting ... collected 188 items / 149 deselected / 39 selected 2025-12-04T12:10:21.8175499Z stepcurrent: skipping 149 already run items. 
2025-12-04T12:10:21.8175544Z Running 39 items in this shard 2025-12-04T12:10:21.8175546Z 2025-12-04T12:10:21.8175770Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [45.4685s] [ 2%] 2025-12-04T12:10:21.8175986Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1501s] [ 2%] 2025-12-04T12:10:21.8176179Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.7176s] [ 2%] 2025-12-04T12:10:21.8176182Z 2025-12-04T12:10:21.8176244Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8176391Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8176437Z Traceback (most recent call last): 2025-12-04T12:10:21.8176616Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8176658Z method(*args, **kwargs) 2025-12-04T12:10:21.8176811Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8176862Z method(*args, **kwargs) 2025-12-04T12:10:21.8177014Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8177051Z with policy(): 2025-12-04T12:10:21.8177204Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8177245Z raise RuntimeError(msg) 2025-12-04T12:10:21.8177635Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1056964608. 2025-12-04T12:10:21.8177639Z 2025-12-04T12:10:21.8177712Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8177975Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8177978Z 2025-12-04T12:10:21.8178065Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8178137Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8178181Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8178237Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8178723Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8178823Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8178859Z graph_break [] 2025-12-04T12:10:21.8178925Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8178997Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8179486Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8179536Z current_size = base.storage().size() 2025-12-04T12:10:21.8179578Z Autotune Choices Stats: 2025-12-04T12:10:21.8179946Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.8179996Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8180038Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8180193Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8180431Z triton_mm_16 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8180674Z triton_mm_17 0.0075 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8180917Z triton_mm_7 0.0077 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8181146Z triton_mm_6 0.0079 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, 
num_stages=2, num_warps=8 2025-12-04T12:10:21.8181370Z triton_mm_12 0.0080 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8181594Z triton_mm_14 0.0084 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8181823Z triton_mm_9 0.0092 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8182052Z triton_mm_18 0.0098 ms 68.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8182279Z triton_mm_11 0.0102 ms 65.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8182503Z triton_mm_13 0.0109 ms 61.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8182633Z SingleProcess AUTOTUNE benchmarking takes 0.0831 seconds and 0.4006 seconds precompiling for 20 choices 2025-12-04T12:10:21.8182780Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8182828Z Traceback (most recent call last): 2025-12-04T12:10:21.8182997Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8183038Z method(*args, **kwargs) 2025-12-04T12:10:21.8183191Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8183232Z method(*args, **kwargs) 2025-12-04T12:10:21.8183381Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8183419Z with policy(): 2025-12-04T12:10:21.8183569Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8183613Z raise RuntimeError(msg) 2025-12-04T12:10:21.8184014Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1056964608 and is now 1113587712. 
2025-12-04T12:10:21.8184017Z 2025-12-04T12:10:21.8184090Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8184361Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8184364Z 2025-12-04T12:10:21.8184451Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8184535Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8184579Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8184635Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8185121Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8185221Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8185257Z graph_break [] 2025-12-04T12:10:21.8185319Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8185391Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8185873Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8185921Z current_size = base.storage().size() 2025-12-04T12:10:21.8185964Z Autotune Choices Stats: 2025-12-04T12:10:21.8186330Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.8186380Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8186422Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8186521Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8186758Z triton_mm_16 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8186995Z triton_mm_17 0.0075 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8187223Z triton_mm_7 0.0077 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8187450Z triton_mm_6 0.0079 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8187674Z triton_mm_12 0.0080 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8187909Z triton_mm_14 0.0084 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8188147Z triton_mm_9 0.0092 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8188376Z triton_mm_18 0.0098 ms 68.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8188611Z triton_mm_11 0.0102 ms 65.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8188838Z triton_mm_13 0.0109 ms 61.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8188966Z SingleProcess AUTOTUNE benchmarking takes 0.0831 seconds and 0.4006 seconds precompiling for 20 choices 2025-12-04T12:10:21.8189039Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8189082Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8189138Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8189237Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8189722Z inductor [('triton_bundler_save_kernel', 160), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8189759Z graph_break [] 2025-12-04T12:10:21.8189820Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8189894Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8189934Z Autotune Choices Stats: 2025-12-04T12:10:21.8190334Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8190382Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8190423Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8190534Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8190769Z triton_mm_36 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8190999Z triton_mm_35 0.0069 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8191220Z triton_mm_31 0.0070 ms 86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8191466Z triton_mm_25 0.0072 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8191692Z triton_mm_28 0.0076 ms 78.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8191931Z triton_mm_33 0.0080 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8192171Z triton_mm_37 0.0092 ms 65.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8192396Z triton_mm_26 0.0103 ms 58.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8192623Z triton_mm_27 0.0112 ms 53.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8192850Z triton_mm_30 0.0113 ms 53.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8192980Z SingleProcess AUTOTUNE benchmarking takes 0.0922 seconds and 0.1609 seconds precompiling for 20 choices 
2025-12-04T12:10:21.8193032Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8193178Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8193225Z Traceback (most recent call last): 2025-12-04T12:10:21.8193381Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8193423Z method(*args, **kwargs) 2025-12-04T12:10:21.8193574Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8193614Z method(*args, **kwargs) 2025-12-04T12:10:21.8193762Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8193800Z with policy(): 2025-12-04T12:10:21.8193952Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8193993Z raise RuntimeError(msg) 2025-12-04T12:10:21.8194396Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 
2025-12-04T12:10:21.8194399Z 2025-12-04T12:10:21.8194473Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8194733Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8194735Z 2025-12-04T12:10:21.8194822Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8194897Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8194940Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8194995Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8195492Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8195602Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8195638Z graph_break [] 2025-12-04T12:10:21.8195700Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8195785Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8196270Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8196317Z current_size = base.storage().size() 2025-12-04T12:10:21.8196360Z Autotune Choices Stats: 2025-12-04T12:10:21.8196725Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.8196773Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8196815Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8196914Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8197154Z triton_mm_16 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8197380Z triton_mm_17 0.0075 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8197607Z triton_mm_7 0.0077 ms 87.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8197834Z triton_mm_6 0.0079 ms 84.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8198069Z triton_mm_12 0.0080 ms 84.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8198293Z triton_mm_14 0.0084 ms 79.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8198520Z triton_mm_9 0.0092 ms 73.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8198750Z triton_mm_18 0.0098 ms 68.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8198986Z triton_mm_11 0.0102 ms 65.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8199213Z triton_mm_13 0.0109 ms 61.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8199354Z SingleProcess AUTOTUNE benchmarking takes 0.0831 seconds and 0.4006 seconds precompiling for 20 choices 2025-12-04T12:10:21.8199426Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8199478Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8199534Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8199633Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8200153Z inductor [('triton_bundler_save_kernel', 160), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8200192Z graph_break [] 2025-12-04T12:10:21.8200252Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8200325Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8200365Z Autotune Choices Stats: 2025-12-04T12:10:21.8200729Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8200777Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8200820Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8200918Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8201154Z triton_mm_36 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8201386Z triton_mm_35 0.0069 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8201608Z triton_mm_31 0.0070 ms 86.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8201860Z triton_mm_25 0.0072 ms 83.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8202088Z triton_mm_28 0.0076 ms 78.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8202313Z triton_mm_33 0.0080 ms 74.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8202545Z triton_mm_37 0.0092 ms 65.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8202784Z triton_mm_26 0.0103 ms 58.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8203024Z triton_mm_27 0.0112 ms 53.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8203251Z triton_mm_30 0.0113 ms 53.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8203394Z SingleProcess AUTOTUNE benchmarking takes 0.0922 seconds and 0.1609 seconds precompiling for 20 choices 
2025-12-04T12:10:21.8203466Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8203509Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8203566Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8203665Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8204148Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8204186Z graph_break [] 2025-12-04T12:10:21.8204248Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8204319Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8204360Z Autotune Choices Stats: 2025-12-04T12:10:21.8204723Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_54", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.8204772Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8204812Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8204910Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8205142Z triton_mm_54 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:21.8205391Z triton_mm_45 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8205614Z triton_mm_50 0.0067 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8205838Z triton_mm_55 0.0070 ms 88.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8206068Z triton_mm_47 0.0073 ms 84.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8206306Z triton_mm_44 0.0074 ms 83.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8206530Z triton_mm_52 0.0081 ms 76.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8206764Z triton_mm_48 0.0083 ms 74.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8206996Z triton_mm_43 0.0088 ms 70.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8207225Z triton_mm_56 0.0094 ms 65.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8207352Z SingleProcess AUTOTUNE benchmarking takes 0.1318 seconds and 0.3623 seconds precompiling for 20 choices 2025-12-04T12:10:21.8207541Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-4b6f9dcfde219408.xml - 2025-12-04T12:10:21.8207601Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8208189Z FAILED [1.7176s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 2025-12-04T12:10:21.8208193Z 2025-12-04T12:10:21.8208268Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8208527Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8208530Z 2025-12-04T12:10:21.8208616Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8208676Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.8208747Z ================= 1 failed, 149 deselected, 2 rerun in 48.36s ================== 2025-12-04T12:10:21.8208784Z Got exit code 1 2025-12-04T12:10:21.8208825Z Retrying single test... 2025-12-04T12:10:21.8208967Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a1d05159a2fa9c53.xml 2025-12-04T12:10:21.8209024Z ============================= test session starts ============================== 2025-12-04T12:10:21.8209144Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8210618Z cachedir: .pytest_cache 2025-12-04T12:10:21.8210787Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8210835Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8210876Z configfile: pytest.ini 2025-12-04T12:10:21.8211044Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8211124Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8211382Z stepcurrent: skipping 149 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8211429Z Running 1 items in this shard 2025-12-04T12:10:21.8211454Z 2025-12-04T12:10:21.8211673Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [2.6367s] [100%] 2025-12-04T12:10:21.8211904Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0847s] [100%] 2025-12-04T12:10:21.8212095Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.7427s] [100%] 2025-12-04T12:10:21.8212113Z 2025-12-04T12:10:21.8212167Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8212311Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8212358Z Traceback (most recent call last): 2025-12-04T12:10:21.8212521Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8212564Z method(*args, **kwargs) 2025-12-04T12:10:21.8212716Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8212761Z method(*args, **kwargs) 2025-12-04T12:10:21.8212910Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8212947Z with policy(): 2025-12-04T12:10:21.8213106Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8213148Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.8213548Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1056964608. 2025-12-04T12:10:21.8213551Z 2025-12-04T12:10:21.8213625Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8213887Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8213889Z 2025-12-04T12:10:21.8213976Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8214052Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8214095Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8214153Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8214656Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8214755Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8214792Z graph_break [] 2025-12-04T12:10:21.8214854Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8214928Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8215423Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8215476Z current_size = base.storage().size() 2025-12-04T12:10:21.8215516Z Autotune Choices Stats: 2025-12-04T12:10:21.8215885Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8215943Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8215997Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8216097Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8216333Z triton_mm_17 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8216563Z triton_mm_16 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8216790Z triton_mm_7 0.0066 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8217014Z triton_mm_12 0.0066 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8217241Z triton_mm_6 0.0074 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8217470Z triton_mm_9 0.0075 ms 81.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8217695Z triton_mm_14 0.0080 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8217917Z triton_mm_10 0.0082 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8218141Z triton_mm_5 0.0083 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8218380Z triton_mm_18 0.0092 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8218512Z SingleProcess AUTOTUNE benchmarking takes 0.0920 seconds and 0.3953 seconds precompiling for 20 choices 2025-12-04T12:10:21.8218659Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8218706Z 
Traceback (most recent call last): 2025-12-04T12:10:21.8218863Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8218905Z method(*args, **kwargs) 2025-12-04T12:10:21.8219056Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8219107Z method(*args, **kwargs) 2025-12-04T12:10:21.8219259Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8219296Z with policy(): 2025-12-04T12:10:21.8219459Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8219499Z raise RuntimeError(msg) 2025-12-04T12:10:21.8219896Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1056964608 and is now 1113587712. 
2025-12-04T12:10:21.8219916Z 2025-12-04T12:10:21.8219989Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8220288Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8220290Z 2025-12-04T12:10:21.8220379Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8220453Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8220497Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8220553Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8221039Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8221139Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8221176Z graph_break [] 2025-12-04T12:10:21.8221236Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8221310Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8221793Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8221843Z current_size = base.storage().size() 2025-12-04T12:10:21.8221883Z Autotune Choices Stats: 2025-12-04T12:10:21.8222265Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8222315Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8222356Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8222455Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8222687Z triton_mm_17 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8222917Z triton_mm_16 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8223156Z triton_mm_7 0.0066 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8223393Z triton_mm_12 0.0066 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8223621Z triton_mm_6 0.0074 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, 
BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8223858Z triton_mm_9 0.0075 ms 81.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8224088Z triton_mm_14 0.0080 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8224311Z triton_mm_10 0.0082 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8224534Z triton_mm_5 0.0083 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8224761Z triton_mm_18 0.0092 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8224891Z SingleProcess AUTOTUNE benchmarking takes 0.0920 seconds and 0.3953 seconds precompiling for 20 choices 2025-12-04T12:10:21.8224964Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8225006Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8225065Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8225163Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8225646Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8225684Z graph_break [] 2025-12-04T12:10:21.8225746Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8225828Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8225869Z Autotune Choices Stats: 2025-12-04T12:10:21.8226231Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8226280Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8226325Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8226423Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8226666Z triton_mm_36 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8226893Z triton_mm_26 0.0066 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8227128Z triton_mm_31 0.0068 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8227349Z triton_mm_33 0.0077 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8227592Z triton_mm_28 0.0077 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8227816Z triton_mm_29 0.0083 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8228047Z triton_mm_37 0.0093 ms 65.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8228272Z triton_mm_35 0.0096 ms 64.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8228498Z triton_mm_30 0.0098 ms 62.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8228726Z triton_mm_25 0.0102 ms 59.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8228854Z SingleProcess AUTOTUNE benchmarking takes 0.1202 seconds and 0.3704 seconds precompiling for 20 choices 
2025-12-04T12:10:21.8228907Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8229053Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8229100Z Traceback (most recent call last): 2025-12-04T12:10:21.8229256Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8229296Z method(*args, **kwargs) 2025-12-04T12:10:21.8229460Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8229500Z method(*args, **kwargs) 2025-12-04T12:10:21.8229650Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8229688Z with policy(): 2025-12-04T12:10:21.8229842Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8229882Z raise RuntimeError(msg) 2025-12-04T12:10:21.8230313Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 
2025-12-04T12:10:21.8230315Z 2025-12-04T12:10:21.8230400Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8230664Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8230683Z 2025-12-04T12:10:21.8230770Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8230841Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8230885Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8230954Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8231439Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8231536Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8231573Z graph_break [] 2025-12-04T12:10:21.8231634Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8231707Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8232185Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8232233Z current_size = base.storage().size() 2025-12-04T12:10:21.8232273Z Autotune Choices Stats: 2025-12-04T12:10:21.8232643Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8232692Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8232732Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8232833Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8233067Z triton_mm_17 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8233314Z triton_mm_16 0.0066 ms 93.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8233539Z triton_mm_7 0.0066 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8233783Z triton_mm_12 0.0066 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8234012Z triton_mm_6 0.0074 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, 
BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8234263Z triton_mm_9 0.0075 ms 81.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8234490Z triton_mm_14 0.0080 ms 76.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8234729Z triton_mm_10 0.0082 ms 75.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8234964Z triton_mm_5 0.0083 ms 73.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8235193Z triton_mm_18 0.0092 ms 66.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8235337Z SingleProcess AUTOTUNE benchmarking takes 0.0920 seconds and 0.3953 seconds precompiling for 20 choices 2025-12-04T12:10:21.8235411Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8235453Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8235509Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8235607Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8236093Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8236131Z graph_break [] 2025-12-04T12:10:21.8236195Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8236281Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8236323Z Autotune Choices Stats: 2025-12-04T12:10:21.8236718Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8236767Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8236809Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8236907Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8237150Z triton_mm_36 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8237378Z triton_mm_26 0.0066 ms 92.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8237618Z triton_mm_31 0.0068 ms 90.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8237841Z triton_mm_33 0.0077 ms 79.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8238078Z triton_mm_28 0.0077 ms 79.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8238303Z triton_mm_29 0.0083 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8238543Z triton_mm_37 0.0093 ms 65.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8238781Z triton_mm_35 0.0096 ms 64.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8239006Z triton_mm_30 0.0098 ms 62.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8239234Z triton_mm_25 0.0102 ms 59.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8239364Z SingleProcess AUTOTUNE benchmarking takes 0.1202 seconds and 0.3704 seconds precompiling for 20 choices 
2025-12-04T12:10:21.8239437Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8239479Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8239535Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8239633Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8240151Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8240191Z graph_break [] 2025-12-04T12:10:21.8240253Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8240327Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8240369Z Autotune Choices Stats: 2025-12-04T12:10:21.8240734Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_45", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006719999946653843, "best_triton_pos": 0} 2025-12-04T12:10:21.8240796Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8240839Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8240936Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8241170Z triton_mm_45 0.0067 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=4 2025-12-04T12:10:21.8241393Z triton_mm_50 0.0068 ms 98.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8241631Z triton_mm_54 0.0072 ms 92.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8241859Z triton_mm_55 0.0074 ms 90.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8242099Z triton_mm_44 0.0078 ms 86.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8242144Z _scaled_mm 0.0079 ms 85.3% 2025-12-04T12:10:21.8242382Z triton_mm_47 0.0080 ms 84.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8242610Z triton_mm_52 0.0081 ms 82.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8242833Z triton_mm_48 0.0086 ms 77.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8243055Z triton_mm_43 0.0088 ms 76.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8243183Z SingleProcess AUTOTUNE benchmarking takes 0.1506 seconds and 0.3928 seconds precompiling for 20 choices 2025-12-04T12:10:21.8243371Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a1d05159a2fa9c53.xml - 2025-12-04T12:10:21.8243432Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8244025Z FAILED [1.7427s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 2025-12-04T12:10:21.8244028Z 2025-12-04T12:10:21.8244101Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8244362Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8244364Z 2025-12-04T12:10:21.8244450Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8244522Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8244591Z ================== 1 failed, 187 deselected, 2 rerun in 5.48s ================== 2025-12-04T12:10:21.8244629Z Got exit code 1 2025-12-04T12:10:21.8244670Z Retrying single test... 
2025-12-04T12:10:21.8244816Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6b05c84b46cb1f19.xml 2025-12-04T12:10:21.8244872Z ============================= test session starts ============================== 2025-12-04T12:10:21.8244985Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8245027Z cachedir: .pytest_cache 2025-12-04T12:10:21.8245185Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8245231Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8245272Z configfile: pytest.ini 2025-12-04T12:10:21.8245449Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8245524Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8245790Z stepcurrent: skipping 149 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8245833Z Running 1 items in this shard 2025-12-04T12:10:21.8245835Z 2025-12-04T12:10:21.8246053Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [34.6420s] [100%] 2025-12-04T12:10:21.8246280Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.0969s] [100%] 2025-12-04T12:10:21.8246472Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda FAILED [1.0337s] [100%] 2025-12-04T12:10:21.8246475Z 2025-12-04T12:10:21.8246525Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8246672Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8246718Z Traceback (most recent call last): 2025-12-04T12:10:21.8246876Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8246919Z method(*args, **kwargs) 2025-12-04T12:10:21.8247072Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8247113Z method(*args, **kwargs) 2025-12-04T12:10:21.8247264Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8247303Z with policy(): 2025-12-04T12:10:21.8247454Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8247498Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.8247887Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1056964608. 2025-12-04T12:10:21.8247890Z 2025-12-04T12:10:21.8247963Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8248223Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8248226Z 2025-12-04T12:10:21.8248323Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8248395Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8248440Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8248496Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8248980Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8249080Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8249116Z graph_break [] 2025-12-04T12:10:21.8249197Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8249271Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8249754Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8249810Z current_size = base.storage().size() 2025-12-04T12:10:21.8249862Z Autotune Choices Stats: 2025-12-04T12:10:21.8250270Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060789999552071095, "best_triton_pos": 0} 2025-12-04T12:10:21.8250319Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8250361Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8250460Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8250697Z triton_mm_16 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8250925Z triton_mm_7 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8251155Z triton_mm_6 0.0071 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8251383Z triton_mm_9 0.0073 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8251609Z triton_mm_12 0.0076 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8251832Z triton_mm_14 0.0081 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8252069Z triton_mm_10 0.0082 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8252300Z triton_mm_18 0.0094 ms 64.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8252526Z triton_mm_17 0.0096 ms 63.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8252749Z triton_mm_2 0.0108 ms 56.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8252878Z SingleProcess AUTOTUNE benchmarking takes 0.0925 seconds and 0.4062 seconds precompiling for 20 choices 2025-12-04T12:10:21.8253036Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8253084Z 
Traceback (most recent call last): 2025-12-04T12:10:21.8253239Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8253294Z method(*args, **kwargs) 2025-12-04T12:10:21.8253446Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8253486Z method(*args, **kwargs) 2025-12-04T12:10:21.8253636Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8253686Z with policy(): 2025-12-04T12:10:21.8253837Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8253878Z raise RuntimeError(msg) 2025-12-04T12:10:21.8254270Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1056964608 and is now 1113587712. 
2025-12-04T12:10:21.8254273Z 2025-12-04T12:10:21.8254347Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8254605Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8254609Z 2025-12-04T12:10:21.8254695Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8254768Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8254809Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8254867Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8255351Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8255450Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8255487Z graph_break [] 2025-12-04T12:10:21.8255548Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8255621Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8256114Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8256161Z current_size = base.storage().size() 2025-12-04T12:10:21.8256203Z Autotune Choices Stats: 2025-12-04T12:10:21.8256570Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060789999552071095, "best_triton_pos": 0} 2025-12-04T12:10:21.8256617Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8256658Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8256757Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8257038Z triton_mm_16 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8257273Z triton_mm_7 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8257501Z triton_mm_6 0.0071 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8257744Z triton_mm_9 0.0073 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8257971Z triton_mm_12 0.0076 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8258194Z triton_mm_14 0.0081 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8258416Z triton_mm_10 0.0082 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8258645Z triton_mm_18 0.0094 ms 64.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8258871Z triton_mm_17 0.0096 ms 63.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8259094Z triton_mm_2 0.0108 ms 56.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8259223Z SingleProcess AUTOTUNE benchmarking takes 0.0925 seconds and 0.4062 seconds precompiling for 20 choices 2025-12-04T12:10:21.8259297Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8259340Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8259410Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8259510Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8260008Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8260047Z graph_break [] 2025-12-04T12:10:21.8260149Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8260223Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8260264Z Autotune Choices Stats: 2025-12-04T12:10:21.8260640Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.8260687Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8260728Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8260826Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8261070Z triton_mm_36 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8261296Z triton_mm_35 0.0066 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8261542Z triton_mm_26 0.0066 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8261765Z triton_mm_31 0.0067 ms 96.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8261995Z triton_mm_28 0.0076 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8262218Z triton_mm_33 0.0078 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8262447Z triton_mm_25 0.0080 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8262669Z triton_mm_24 0.0082 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8262894Z triton_mm_29 0.0082 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8263121Z triton_mm_37 0.0092 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8263250Z SingleProcess AUTOTUNE benchmarking takes 0.1252 seconds and 0.3497 seconds precompiling for 20 choices 
2025-12-04T12:10:21.8263324Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8263470Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8263517Z Traceback (most recent call last): 2025-12-04T12:10:21.8263672Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8263713Z method(*args, **kwargs) 2025-12-04T12:10:21.8263866Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8263909Z method(*args, **kwargs) 2025-12-04T12:10:21.8264059Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8264098Z with policy(): 2025-12-04T12:10:21.8264260Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8264303Z raise RuntimeError(msg) 2025-12-04T12:10:21.8264692Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 
2025-12-04T12:10:21.8264705Z 2025-12-04T12:10:21.8264779Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8265038Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8265051Z 2025-12-04T12:10:21.8265137Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8265211Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8265255Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8265311Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8265796Z inductor [('triton_bundler_save_kernel', 160), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8265895Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8265932Z graph_break [] 2025-12-04T12:10:21.8265994Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8266067Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8266554Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8266602Z current_size = base.storage().size() 2025-12-04T12:10:21.8266642Z Autotune Choices Stats: 2025-12-04T12:10:21.8267010Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_16", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060789999552071095, "best_triton_pos": 0} 2025-12-04T12:10:21.8267056Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8267098Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8267209Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8267445Z triton_mm_16 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8267673Z triton_mm_7 0.0066 ms 92.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8267902Z triton_mm_6 0.0071 ms 85.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8268140Z triton_mm_9 0.0073 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8268368Z triton_mm_12 0.0076 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, 
BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8268601Z triton_mm_14 0.0081 ms 75.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8268834Z triton_mm_10 0.0082 ms 73.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8269062Z triton_mm_18 0.0094 ms 64.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8269286Z triton_mm_17 0.0096 ms 63.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8269510Z triton_mm_2 0.0108 ms 56.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8269639Z SingleProcess AUTOTUNE benchmarking takes 0.0925 seconds and 0.4062 seconds precompiling for 20 choices 2025-12-04T12:10:21.8269711Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8269754Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8269810Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8269909Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8270435Z inductor [('triton_bundler_save_kernel', 160), 
('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8270474Z graph_break [] 2025-12-04T12:10:21.8270535Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8270609Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8270650Z Autotune Choices Stats: 2025-12-04T12:10:21.8271024Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_36", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006479999981820583, "best_triton_pos": 0} 2025-12-04T12:10:21.8271071Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8271112Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8271211Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8271442Z triton_mm_36 0.0065 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8271669Z triton_mm_35 0.0066 ms 98.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8271905Z triton_mm_26 0.0066 ms 97.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8272130Z triton_mm_31 0.0067 ms 96.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8272372Z triton_mm_28 0.0076 ms 85.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8272609Z triton_mm_33 0.0078 ms 82.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8272839Z triton_mm_25 0.0080 ms 80.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8273062Z triton_mm_24 0.0082 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8273286Z triton_mm_29 0.0082 ms 79.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8273515Z triton_mm_37 0.0092 ms 70.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8273645Z SingleProcess AUTOTUNE benchmarking takes 0.1252 seconds and 0.3497 seconds precompiling for 20 choices 
2025-12-04T12:10:21.8273718Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8273759Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8273817Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8273915Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8274401Z inductor [('triton_bundler_save_kernel', 160), ('async_compile_cache_miss', 21), ('benchmarking.InductorBenchmarker.benchmark_gpu', 20), ('generated_module_cache_miss', 19), ('select_algorithm_num_precompiles', 19), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8274437Z graph_break [] 2025-12-04T12:10:21.8274499Z aten_mm_info [('aten._scaled_mm.default_3_2048_1024', 1)] 2025-12-04T12:10:21.8274583Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8274624Z Autotune Choices Stats: 2025-12-04T12:10:21.8274985Z {"num_choices": 20, "num_triton_choices": 19, "best_kernel": "triton_mm_54", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.8275033Z AUTOTUNE scaled_mm(3x1024, 1024x2048, , ) 2025-12-04T12:10:21.8275075Z strides: [1024, 1], [1, 1024], [], [] 2025-12-04T12:10:21.8275172Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8275415Z triton_mm_54 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=1 2025-12-04T12:10:21.8275641Z triton_mm_45 0.0067 ms 91.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8275876Z triton_mm_50 0.0073 ms 83.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8276102Z triton_mm_47 0.0076 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8276346Z triton_mm_48 0.0079 ms 77.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8276574Z triton_mm_55 0.0084 ms 72.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=256, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8276800Z triton_mm_43 0.0085 ms 71.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8277027Z triton_mm_56 0.0094 ms 64.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8277252Z triton_mm_49 0.0101 ms 60.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8277481Z triton_mm_44 0.0104 ms 58.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=128, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8277608Z SingleProcess AUTOTUNE benchmarking takes 0.1420 seconds and 0.3541 seconds precompiling for 20 choices 2025-12-04T12:10:21.8277796Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6b05c84b46cb1f19.xml - 2025-12-04T12:10:21.8277858Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8278463Z FAILED [1.0337s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1113587712 and is now 1170210816. 2025-12-04T12:10:21.8278466Z 2025-12-04T12:10:21.8278541Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8278801Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8278803Z 2025-12-04T12:10:21.8278891Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8278952Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.8279021Z ================= 1 failed, 187 deselected, 2 rerun in 36.79s ================== 2025-12-04T12:10:21.8279058Z Got exit code 1 2025-12-04T12:10:21.8279275Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8279401Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.8279554Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-57623d80bb0084f4.xml 2025-12-04T12:10:21.8279611Z ============================= test session starts ============================== 2025-12-04T12:10:21.8279723Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8279776Z cachedir: .pytest_cache 2025-12-04T12:10:21.8279933Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8279980Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8280021Z configfile: pytest.ini 2025-12-04T12:10:21.8280222Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8280298Z collecting ... collected 188 items / 150 deselected / 38 selected 2025-12-04T12:10:21.8280354Z stepcurrent: skipping 150 already run items. 
2025-12-04T12:10:21.8280399Z Running 38 items in this shard 2025-12-04T12:10:21.8280401Z 2025-12-04T12:10:21.8280618Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8965s] [ 2%] 2025-12-04T12:10:21.8280831Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3576s] [ 2%] 2025-12-04T12:10:21.8281019Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3167s] [ 2%] 2025-12-04T12:10:21.8281021Z 2025-12-04T12:10:21.8281074Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8281215Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8281262Z Traceback (most recent call last): 2025-12-04T12:10:21.8281419Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8281461Z method(*args, **kwargs) 2025-12-04T12:10:21.8281612Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8281654Z method(*args, **kwargs) 2025-12-04T12:10:21.8281805Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8281842Z with policy(): 2025-12-04T12:10:21.8281993Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8282056Z raise RuntimeError(msg) 2025-12-04T12:10:21.8282440Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.8282443Z 2025-12-04T12:10:21.8282518Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8282775Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8282779Z 2025-12-04T12:10:21.8282865Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8282950Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8282993Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8283051Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8283118Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8283232Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8283268Z graph_break [] 2025-12-04T12:10:21.8283329Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8283483Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8283544Z Traceback (most recent call last): 2025-12-04T12:10:21.8283696Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8283736Z method(*args, **kwargs) 2025-12-04T12:10:21.8283886Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8283926Z method(*args, **kwargs) 2025-12-04T12:10:21.8284073Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8284112Z with policy(): 2025-12-04T12:10:21.8284262Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8284303Z raise RuntimeError(msg) 2025-12-04T12:10:21.8284685Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.8284689Z 2025-12-04T12:10:21.8284763Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8285021Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8285024Z 2025-12-04T12:10:21.8285109Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8285182Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8285224Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8285279Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8285345Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8285443Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8285479Z graph_break [] 2025-12-04T12:10:21.8285539Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8285612Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8285673Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8285728Z stats 
[('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8285823Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8285887Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8285924Z graph_break [] 2025-12-04T12:10:21.8285981Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8286033Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8286173Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8286219Z Traceback (most recent call last): 2025-12-04T12:10:21.8286370Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8286411Z method(*args, **kwargs) 2025-12-04T12:10:21.8286573Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8286614Z method(*args, **kwargs) 2025-12-04T12:10:21.8286773Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8286813Z with policy(): 2025-12-04T12:10:21.8286964Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8287006Z raise RuntimeError(msg) 2025-12-04T12:10:21.8287402Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.8287405Z 2025-12-04T12:10:21.8287478Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8287735Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8287738Z 2025-12-04T12:10:21.8287824Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8287896Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8287939Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8287996Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8288061Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8288160Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8288196Z graph_break [] 2025-12-04T12:10:21.8288254Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8288327Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8288372Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8288426Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8288525Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8288589Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8288625Z graph_break [] 2025-12-04T12:10:21.8288683Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8288757Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8288800Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8288855Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8288951Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8289015Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8289062Z graph_break [] 2025-12-04T12:10:21.8289120Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8289314Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-57623d80bb0084f4.xml - 2025-12-04T12:10:21.8289374Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8289948Z FAILED [0.3167s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.8289951Z 2025-12-04T12:10:21.8290040Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8290344Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8290372Z 2025-12-04T12:10:21.8290458Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8290519Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8290587Z ================== 1 failed, 150 deselected, 2 rerun in 2.59s ================== 2025-12-04T12:10:21.8290637Z Got exit code 1 2025-12-04T12:10:21.8290679Z Retrying single test... 
2025-12-04T12:10:21.8290823Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-aa6d2951fd87c5a8.xml 2025-12-04T12:10:21.8290880Z ============================= test session starts ============================== 2025-12-04T12:10:21.8290991Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8291031Z cachedir: .pytest_cache 2025-12-04T12:10:21.8291187Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8291235Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8291275Z configfile: pytest.ini 2025-12-04T12:10:21.8291438Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8291513Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8291767Z stepcurrent: skipping 150 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8291811Z Running 1 items in this shard 2025-12-04T12:10:21.8291813Z 2025-12-04T12:10:21.8292026Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.8822s] [100%] 2025-12-04T12:10:21.8292234Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3482s] [100%] 2025-12-04T12:10:21.8292420Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3079s] [100%] 2025-12-04T12:10:21.8292422Z 2025-12-04T12:10:21.8292474Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8292613Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8292659Z Traceback (most recent call last): 2025-12-04T12:10:21.8292815Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8292869Z method(*args, **kwargs) 2025-12-04T12:10:21.8293021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8293063Z method(*args, **kwargs) 2025-12-04T12:10:21.8293212Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8293250Z with policy(): 2025-12-04T12:10:21.8293401Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8293442Z raise RuntimeError(msg) 
2025-12-04T12:10:21.8293840Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.8293844Z 2025-12-04T12:10:21.8293919Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8294175Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8294187Z 2025-12-04T12:10:21.8294272Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8294345Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8294396Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8294453Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8294518Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8294616Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8294654Z graph_break [] 2025-12-04T12:10:21.8294714Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8294855Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8294901Z Traceback (most recent call last): 2025-12-04T12:10:21.8295052Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8295092Z method(*args, **kwargs) 2025-12-04T12:10:21.8295243Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.8295287Z method(*args, **kwargs) 2025-12-04T12:10:21.8295435Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8295474Z with policy(): 2025-12-04T12:10:21.8295626Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8295667Z raise RuntimeError(msg) 2025-12-04T12:10:21.8296052Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.8296057Z 2025-12-04T12:10:21.8296129Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8296385Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8296388Z 2025-12-04T12:10:21.8296472Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8296548Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8296599Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8296656Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8296720Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8296819Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8296854Z graph_break [] 2025-12-04T12:10:21.8296914Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8296986Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:21.8297029Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8297083Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8297179Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8297242Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8297292Z graph_break [] 2025-12-04T12:10:21.8297351Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8297406Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8297558Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8297604Z Traceback (most recent call last): 2025-12-04T12:10:21.8297755Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8297797Z method(*args, **kwargs) 2025-12-04T12:10:21.8297957Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8298001Z method(*args, **kwargs) 2025-12-04T12:10:21.8298152Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8298190Z with policy(): 2025-12-04T12:10:21.8298343Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8298383Z raise RuntimeError(msg) 2025-12-04T12:10:21.8298765Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.8298769Z 2025-12-04T12:10:21.8298840Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8299094Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8299096Z 2025-12-04T12:10:21.8299181Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8299255Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8299296Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8299352Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8299416Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8299512Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8299548Z graph_break [] 2025-12-04T12:10:21.8299606Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8299679Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8299721Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8299775Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8299871Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8299943Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8299980Z graph_break [] 2025-12-04T12:10:21.8300039Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8300156Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8300198Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8300252Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8300348Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8300411Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8300449Z graph_break [] 2025-12-04T12:10:21.8300505Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8300694Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-aa6d2951fd87c5a8.xml - 2025-12-04T12:10:21.8300770Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8301342Z FAILED [0.3079s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.8301357Z 2025-12-04T12:10:21.8301443Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8301699Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8301701Z 2025-12-04T12:10:21.8301787Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8301850Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8301918Z ================== 1 failed, 187 deselected, 2 rerun in 2.56s ================== 2025-12-04T12:10:21.8301955Z Got exit code 1 2025-12-04T12:10:21.8301996Z Retrying single test... 
2025-12-04T12:10:21.8302139Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a19a634e653b2e3a.xml 2025-12-04T12:10:21.8302196Z ============================= test session starts ============================== 2025-12-04T12:10:21.8302308Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8302348Z cachedir: .pytest_cache 2025-12-04T12:10:21.8302506Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8302552Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8302593Z configfile: pytest.ini 2025-12-04T12:10:21.8302758Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8302832Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8303085Z stepcurrent: skipping 150 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8303129Z Running 1 items in this shard 2025-12-04T12:10:21.8303131Z 2025-12-04T12:10:21.8303359Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [39.3003s] [100%] 2025-12-04T12:10:21.8303571Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3686s] [100%] 2025-12-04T12:10:21.8303771Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda FAILED [0.3347s] [100%] 2025-12-04T12:10:21.8303773Z 2025-12-04T12:10:21.8303826Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8303966Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8304011Z Traceback (most recent call last): 2025-12-04T12:10:21.8304165Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8304209Z method(*args, **kwargs) 2025-12-04T12:10:21.8304360Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8304400Z method(*args, **kwargs) 2025-12-04T12:10:21.8304577Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8304615Z with policy(): 2025-12-04T12:10:21.8304769Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8304820Z raise RuntimeError(msg) 
2025-12-04T12:10:21.8305202Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.8305220Z 2025-12-04T12:10:21.8305295Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8305550Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8305552Z 2025-12-04T12:10:21.8305638Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8305711Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8305754Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8305812Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8305877Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8305976Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8306013Z graph_break [] 2025-12-04T12:10:21.8306071Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8306210Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8306255Z Traceback (most recent call last): 2025-12-04T12:10:21.8306408Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8306449Z method(*args, **kwargs) 2025-12-04T12:10:21.8306599Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.8306640Z method(*args, **kwargs) 2025-12-04T12:10:21.8306788Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8306826Z with policy(): 2025-12-04T12:10:21.8306977Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8307020Z raise RuntimeError(msg) 2025-12-04T12:10:21.8307415Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.8307417Z 2025-12-04T12:10:21.8307491Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8307747Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8307750Z 2025-12-04T12:10:21.8307835Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8307909Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8307952Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8308009Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8308073Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8308172Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8308219Z graph_break [] 2025-12-04T12:10:21.8308279Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8308351Z ----------------------------- Captured stdout call ----------------------------- 
2025-12-04T12:10:21.8308403Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8308457Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8308553Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8308616Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8308667Z graph_break [] 2025-12-04T12:10:21.8308724Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8308776Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8308916Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8308963Z Traceback (most recent call last): 2025-12-04T12:10:21.8309117Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8309158Z method(*args, **kwargs) 2025-12-04T12:10:21.8309312Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8309351Z method(*args, **kwargs) 2025-12-04T12:10:21.8309500Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8309538Z with policy(): 2025-12-04T12:10:21.8309689Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8309730Z raise RuntimeError(msg) 2025-12-04T12:10:21.8310153Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.8310155Z 2025-12-04T12:10:21.8310229Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8310483Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8310485Z 2025-12-04T12:10:21.8310569Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8310643Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8310685Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8310742Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8310805Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8310923Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8310961Z graph_break [] 2025-12-04T12:10:21.8311018Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8311094Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8311135Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8311190Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8311286Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8311351Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8311387Z graph_break [] 2025-12-04T12:10:21.8311446Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8311517Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8311559Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8311626Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8311723Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8311785Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8311842Z graph_break [] 2025-12-04T12:10:21.8311898Z aten_mm_info [('aten._scaled_mm.default_3_16_16', 1)] 2025-12-04T12:10:21.8312087Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-a19a634e653b2e3a.xml - 2025-12-04T12:10:21.8312146Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8312734Z FAILED [0.3347s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.8312737Z 2025-12-04T12:10:21.8312808Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8313063Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8313064Z 2025-12-04T12:10:21.8313150Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8313212Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.8313281Z ================= 1 failed, 187 deselected, 2 rerun in 40.02s ================== 2025-12-04T12:10:21.8313318Z Got exit code 1 2025-12-04T12:10:21.8313524Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8313650Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.8313792Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2369fab22243df0e.xml 2025-12-04T12:10:21.8313849Z ============================= test session starts ============================== 2025-12-04T12:10:21.8313959Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8314000Z cachedir: .pytest_cache 2025-12-04T12:10:21.8314159Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8314204Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8314245Z configfile: pytest.ini 2025-12-04T12:10:21.8314416Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8314493Z collecting ... collected 188 items / 151 deselected / 37 selected 2025-12-04T12:10:21.8314547Z stepcurrent: skipping 151 already run items. 
2025-12-04T12:10:21.8314593Z Running 37 items in this shard 2025-12-04T12:10:21.8314595Z 2025-12-04T12:10:21.8314814Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [33.1314s] [ 2%] 2025-12-04T12:10:21.8315026Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3696s] [ 2%] 2025-12-04T12:10:21.8315216Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3465s] [ 2%] 2025-12-04T12:10:21.8315218Z 2025-12-04T12:10:21.8315280Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8315424Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8315469Z Traceback (most recent call last): 2025-12-04T12:10:21.8315638Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8315678Z method(*args, **kwargs) 2025-12-04T12:10:21.8315830Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8315884Z method(*args, **kwargs) 2025-12-04T12:10:21.8316033Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8316070Z with policy(): 2025-12-04T12:10:21.8316221Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8316263Z raise RuntimeError(msg) 2025-12-04T12:10:21.8316649Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.8316653Z 2025-12-04T12:10:21.8316726Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8316986Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8316992Z 2025-12-04T12:10:21.8317078Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8317150Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8317194Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8317251Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8317316Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8317414Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8317453Z graph_break [] 2025-12-04T12:10:21.8317513Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8317654Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8317700Z Traceback (most recent call last): 2025-12-04T12:10:21.8317854Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8317893Z method(*args, **kwargs) 2025-12-04T12:10:21.8318043Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8318100Z method(*args, **kwargs) 2025-12-04T12:10:21.8318250Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8318287Z with policy(): 2025-12-04T12:10:21.8318439Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8318480Z raise RuntimeError(msg) 2025-12-04T12:10:21.8318863Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.8318866Z 2025-12-04T12:10:21.8318938Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8319209Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8319211Z 2025-12-04T12:10:21.8319297Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8319381Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8319423Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8319492Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8319558Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8319669Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8319706Z graph_break [] 2025-12-04T12:10:21.8319765Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8319839Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8319881Z frames [('total', 1), ('ok', 1)] 
2025-12-04T12:10:21.8319938Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8320033Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8320139Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8320191Z graph_break [] 2025-12-04T12:10:21.8320250Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8320303Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8320449Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8320495Z Traceback (most recent call last): 2025-12-04T12:10:21.8320647Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8320688Z method(*args, **kwargs) 2025-12-04T12:10:21.8320839Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8320880Z method(*args, **kwargs) 2025-12-04T12:10:21.8321028Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8321066Z with policy(): 2025-12-04T12:10:21.8321217Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8321259Z raise RuntimeError(msg) 2025-12-04T12:10:21.8321642Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.8321644Z 2025-12-04T12:10:21.8321717Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8321996Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8322000Z 2025-12-04T12:10:21.8322085Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8322159Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8322200Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8322256Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8322322Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8322421Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8322457Z graph_break [] 2025-12-04T12:10:21.8322516Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8322602Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8322646Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8322700Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8322811Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8322874Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8322910Z graph_break [] 2025-12-04T12:10:21.8322968Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8323058Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8323099Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8323153Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8323246Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8323311Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8323348Z graph_break [] 2025-12-04T12:10:21.8323407Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8323594Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-2369fab22243df0e.xml - 2025-12-04T12:10:21.8323657Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8324238Z FAILED [0.3465s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.8324243Z 2025-12-04T12:10:21.8324317Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8324579Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8324582Z 2025-12-04T12:10:21.8324666Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8324728Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8324798Z ================= 1 failed, 151 deselected, 2 rerun in 33.87s ================== 2025-12-04T12:10:21.8324836Z Got exit code 1 2025-12-04T12:10:21.8324876Z Retrying single test... 
2025-12-04T12:10:21.8325022Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d76e312ae9ed3a39.xml 2025-12-04T12:10:21.8325078Z ============================= test session starts ============================== 2025-12-04T12:10:21.8325201Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8325244Z cachedir: .pytest_cache 2025-12-04T12:10:21.8325401Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8325446Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8325487Z configfile: pytest.ini 2025-12-04T12:10:21.8325648Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8325723Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8325976Z stepcurrent: skipping 151 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8326020Z Running 1 items in this shard 2025-12-04T12:10:21.8326022Z 2025-12-04T12:10:21.8326249Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [10.2086s] [100%] 2025-12-04T12:10:21.8326463Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3655s] [100%] 2025-12-04T12:10:21.8326664Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3231s] [100%] 2025-12-04T12:10:21.8326666Z 2025-12-04T12:10:21.8326727Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8326869Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8326913Z Traceback (most recent call last): 2025-12-04T12:10:21.8327072Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8327113Z method(*args, **kwargs) 2025-12-04T12:10:21.8327266Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8327306Z method(*args, **kwargs) 2025-12-04T12:10:21.8327457Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8327493Z with policy(): 2025-12-04T12:10:21.8327645Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8327687Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.8328073Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.8328076Z 2025-12-04T12:10:21.8328150Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8328406Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8328409Z 2025-12-04T12:10:21.8328496Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8328571Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8328616Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8328671Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8328737Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8328833Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8328871Z graph_break [] 2025-12-04T12:10:21.8328942Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8329085Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8329133Z Traceback (most recent call last): 2025-12-04T12:10:21.8329285Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8329324Z method(*args, **kwargs) 2025-12-04T12:10:21.8329475Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:21.8329517Z method(*args, **kwargs) 2025-12-04T12:10:21.8329665Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8329702Z with policy(): 2025-12-04T12:10:21.8329861Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8329903Z raise RuntimeError(msg) 2025-12-04T12:10:21.8330337Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.8330353Z 2025-12-04T12:10:21.8330426Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8330701Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8330703Z 2025-12-04T12:10:21.8330788Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8330864Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8330906Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8330961Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8331026Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8331123Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8331160Z graph_break [] 2025-12-04T12:10:21.8331218Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8331291Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.8331333Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8331387Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8331482Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8331545Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8331584Z graph_break [] 2025-12-04T12:10:21.8331642Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8331694Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8331836Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8331882Z Traceback (most recent call last): 2025-12-04T12:10:21.8332033Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8332074Z method(*args, **kwargs) 2025-12-04T12:10:21.8332225Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8332265Z method(*args, **kwargs) 2025-12-04T12:10:21.8332413Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8332451Z with policy(): 2025-12-04T12:10:21.8332615Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8332656Z raise RuntimeError(msg) 2025-12-04T12:10:21.8333042Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.8333045Z 2025-12-04T12:10:21.8333118Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8333373Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8333375Z 2025-12-04T12:10:21.8333475Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8333563Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8333604Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8333670Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8333735Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8333831Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8333868Z graph_break [] 2025-12-04T12:10:21.8333927Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8334014Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8334059Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8334112Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8334208Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8334272Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8334308Z graph_break [] 2025-12-04T12:10:21.8334365Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8334439Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8334479Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8334533Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8334627Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8334692Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8334729Z graph_break [] 2025-12-04T12:10:21.8334786Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8334977Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-d76e312ae9ed3a39.xml - 2025-12-04T12:10:21.8335040Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8335618Z FAILED [0.3231s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.8335622Z 2025-12-04T12:10:21.8335694Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8335951Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8335953Z 2025-12-04T12:10:21.8336038Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8336110Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8336178Z ================= 1 failed, 187 deselected, 2 rerun in 10.92s ================== 2025-12-04T12:10:21.8336217Z Got exit code 1 2025-12-04T12:10:21.8336256Z Retrying single test... 
2025-12-04T12:10:21.8336399Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-19dda7a42061b70c.xml 2025-12-04T12:10:21.8336454Z ============================= test session starts ============================== 2025-12-04T12:10:21.8336566Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8336606Z cachedir: .pytest_cache 2025-12-04T12:10:21.8336780Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8336825Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8336876Z configfile: pytest.ini 2025-12-04T12:10:21.8337039Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8337126Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8337383Z stepcurrent: skipping 151 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8337427Z Running 1 items in this shard 2025-12-04T12:10:21.8337439Z 2025-12-04T12:10:21.8337654Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [37.7123s] [100%] 2025-12-04T12:10:21.8337867Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [0.3460s] [100%] 2025-12-04T12:10:21.8338056Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda FAILED [0.3200s] [100%] 2025-12-04T12:10:21.8338059Z 2025-12-04T12:10:21.8338110Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8338252Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8338297Z Traceback (most recent call last): 2025-12-04T12:10:21.8338453Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8338494Z method(*args, **kwargs) 2025-12-04T12:10:21.8338645Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8338685Z method(*args, **kwargs) 2025-12-04T12:10:21.8338836Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8338873Z with policy(): 2025-12-04T12:10:21.8339023Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8339066Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.8339451Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1094713344. 2025-12-04T12:10:21.8339454Z 2025-12-04T12:10:21.8339528Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8339788Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8339800Z 2025-12-04T12:10:21.8339887Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8339959Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8340003Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8340058Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8340167Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8340264Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8340302Z graph_break [] 2025-12-04T12:10:21.8340360Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8340501Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8340549Z Traceback (most recent call last): 2025-12-04T12:10:21.8340720Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8340761Z method(*args, **kwargs) 2025-12-04T12:10:21.8340910Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 
3329, in wrapper 2025-12-04T12:10:21.8340963Z method(*args, **kwargs) 2025-12-04T12:10:21.8341112Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8341150Z with policy(): 2025-12-04T12:10:21.8341316Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8341358Z raise RuntimeError(msg) 2025-12-04T12:10:21.8341746Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1094713344 and is now 1109393408. 2025-12-04T12:10:21.8341749Z 2025-12-04T12:10:21.8341823Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8342080Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8342084Z 2025-12-04T12:10:21.8342168Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8342242Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8342283Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8342340Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8342404Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8342502Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8342539Z graph_break [] 2025-12-04T12:10:21.8342598Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8342670Z ----------------------------- Captured stdout call 
----------------------------- 2025-12-04T12:10:21.8342715Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8342768Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8342863Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8342926Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8342965Z graph_break [] 2025-12-04T12:10:21.8343024Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8343078Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8343221Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8343284Z Traceback (most recent call last): 2025-12-04T12:10:21.8343437Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8343478Z method(*args, **kwargs) 2025-12-04T12:10:21.8343628Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8343667Z method(*args, **kwargs) 2025-12-04T12:10:21.8343820Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8343858Z with policy(): 2025-12-04T12:10:21.8344010Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8344053Z raise RuntimeError(msg) 2025-12-04T12:10:21.8344446Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 
2025-12-04T12:10:21.8344468Z 2025-12-04T12:10:21.8344542Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8344799Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8344811Z 2025-12-04T12:10:21.8344897Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8344970Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8345011Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8345067Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8345130Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8345228Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8345264Z graph_break [] 2025-12-04T12:10:21.8345325Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8345396Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8345439Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8345493Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8345592Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8345656Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8345693Z graph_break [] 2025-12-04T12:10:21.8345751Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8345829Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8345870Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8345926Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8346021Z aot_autograd [('total', 1), 
('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8346086Z inductor [('fxgraph_cache_miss', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8346121Z graph_break [] 2025-12-04T12:10:21.8346180Z aten_mm_info [('aten._scaled_mm.default_3_2048_16', 1)] 2025-12-04T12:10:21.8346368Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-19dda7a42061b70c.xml - 2025-12-04T12:10:21.8346431Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8347016Z FAILED [0.3200s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1109393408 and is now 1124073472. 2025-12-04T12:10:21.8347020Z 2025-12-04T12:10:21.8347092Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8347348Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8347351Z 2025-12-04T12:10:21.8347435Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8347498Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.8347581Z ================= 1 failed, 187 deselected, 2 rerun in 38.40s ================== 2025-12-04T12:10:21.8347620Z Got exit code 1 2025-12-04T12:10:21.8347841Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8347969Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.8348126Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f54f8121a627b22b.xml 2025-12-04T12:10:21.8348182Z ============================= test session starts ============================== 2025-12-04T12:10:21.8348295Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8348348Z cachedir: .pytest_cache 2025-12-04T12:10:21.8348505Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8348551Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8348593Z configfile: pytest.ini 2025-12-04T12:10:21.8348754Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8348831Z collecting ... collected 188 items / 152 deselected / 36 selected 2025-12-04T12:10:21.8348886Z stepcurrent: skipping 152 already run items. 
2025-12-04T12:10:21.8348933Z Running 36 items in this shard 2025-12-04T12:10:21.8348935Z 2025-12-04T12:10:21.8349148Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [44.2857s] [ 2%] 2025-12-04T12:10:21.8349360Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.2185s] [ 2%] 2025-12-04T12:10:21.8349545Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda FAILED [1.1482s] [ 2%] 2025-12-04T12:10:21.8349547Z 2025-12-04T12:10:21.8349600Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8349746Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8349792Z Traceback (most recent call last): 2025-12-04T12:10:21.8349947Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8349987Z method(*args, **kwargs) 2025-12-04T12:10:21.8350177Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8350218Z method(*args, **kwargs) 2025-12-04T12:10:21.8350369Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8350406Z with policy(): 2025-12-04T12:10:21.8350578Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8350619Z raise RuntimeError(msg) 2025-12-04T12:10:21.8350999Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.8351002Z 2025-12-04T12:10:21.8351075Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8351333Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8351335Z 2025-12-04T12:10:21.8351419Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8351505Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8351550Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8351605Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8352087Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8352217Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8352254Z graph_break [] 2025-12-04T12:10:21.8352313Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8352386Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8352875Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8352925Z current_size = base.storage().size() 2025-12-04T12:10:21.8352966Z Autotune Choices Stats: 2025-12-04T12:10:21.8353335Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.8353383Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8353424Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8353542Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8353775Z triton_mm_0 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8353818Z _scaled_mm 0.0232 ms 26.6% 2025-12-04T12:10:21.8353946Z SingleProcess AUTOTUNE benchmarking takes 0.0123 seconds and 0.0628 seconds precompiling for 2 choices 2025-12-04T12:10:21.8354088Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8354135Z Traceback (most recent call last): 2025-12-04T12:10:21.8354290Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8354330Z method(*args, **kwargs) 2025-12-04T12:10:21.8354492Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.8354532Z method(*args, **kwargs) 2025-12-04T12:10:21.8354683Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8354723Z with policy(): 2025-12-04T12:10:21.8354877Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8354917Z raise RuntimeError(msg) 2025-12-04T12:10:21.8355304Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1038090240. 2025-12-04T12:10:21.8355307Z 2025-12-04T12:10:21.8355380Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8355646Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8355660Z 2025-12-04T12:10:21.8355749Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8355821Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8355865Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8355920Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8356415Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 
2025-12-04T12:10:21.8356514Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8356555Z graph_break [] 2025-12-04T12:10:21.8356614Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8356688Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8357172Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8357220Z current_size = base.storage().size() 2025-12-04T12:10:21.8357262Z Autotune Choices Stats: 2025-12-04T12:10:21.8357630Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.8357676Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8357717Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8357817Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8358048Z triton_mm_0 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8358091Z _scaled_mm 0.0232 ms 26.6% 2025-12-04T12:10:21.8358218Z SingleProcess AUTOTUNE benchmarking takes 0.0123 seconds and 0.0628 seconds precompiling for 2 choices 2025-12-04T12:10:21.8358292Z 
----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8358344Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8358401Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8358499Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8358993Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8359032Z graph_break [] 2025-12-04T12:10:21.8359090Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8359164Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8359221Z Autotune Choices Stats: 2025-12-04T12:10:21.8359583Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.008960000239312649, "best_triton_pos": 0} 2025-12-04T12:10:21.8359639Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8359681Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8359779Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8360019Z triton_mm_1 0.0090 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8360060Z _scaled_mm 0.0197 ms 
45.4% 2025-12-04T12:10:21.8360228Z SingleProcess AUTOTUNE benchmarking takes 0.0118 seconds and 0.0484 seconds precompiling for 2 choices 2025-12-04T12:10:21.8360281Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8360422Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8360469Z Traceback (most recent call last): 2025-12-04T12:10:21.8360624Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8360665Z method(*args, **kwargs) 2025-12-04T12:10:21.8360816Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8360856Z method(*args, **kwargs) 2025-12-04T12:10:21.8361005Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8361042Z with policy(): 2025-12-04T12:10:21.8361196Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8361237Z raise RuntimeError(msg) 2025-12-04T12:10:21.8361624Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1056964608. 
2025-12-04T12:10:21.8361627Z 2025-12-04T12:10:21.8361703Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8361958Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8361962Z 2025-12-04T12:10:21.8362046Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8362141Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8362184Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8362240Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8362720Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8362819Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8362855Z graph_break [] 2025-12-04T12:10:21.8362915Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8363006Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8363486Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8363548Z current_size = base.storage().size() 2025-12-04T12:10:21.8363591Z Autotune Choices Stats: 2025-12-04T12:10:21.8363959Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.0061599998734891415, "best_triton_pos": 0} 2025-12-04T12:10:21.8364018Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8364059Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8364159Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8364391Z triton_mm_0 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8364432Z _scaled_mm 0.0232 ms 26.6% 2025-12-04T12:10:21.8364559Z SingleProcess AUTOTUNE benchmarking takes 0.0123 seconds and 0.0628 seconds precompiling for 2 choices 2025-12-04T12:10:21.8364633Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8364676Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8364731Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8364831Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8365309Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8365347Z graph_break [] 2025-12-04T12:10:21.8365405Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8365477Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8365521Z Autotune Choices Stats: 2025-12-04T12:10:21.8365879Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.008960000239312649, "best_triton_pos": 0} 2025-12-04T12:10:21.8365935Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8365976Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8366075Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8366306Z triton_mm_1 0.0090 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8366348Z _scaled_mm 0.0197 ms 45.4% 2025-12-04T12:10:21.8366476Z SingleProcess AUTOTUNE benchmarking takes 0.0118 seconds and 0.0484 seconds precompiling for 2 choices 2025-12-04T12:10:21.8366549Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8366590Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8366646Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8366755Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8367231Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), 
('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8367281Z graph_break [] 2025-12-04T12:10:21.8367340Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8367425Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8367464Z Autotune Choices Stats: 2025-12-04T12:10:21.8367825Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00824000034481287, "best_triton_pos": 0} 2025-12-04T12:10:21.8367869Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8367909Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8368007Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8368236Z triton_mm_2 0.0082 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8368277Z _scaled_mm 0.0210 ms 39.3% 2025-12-04T12:10:21.8368404Z SingleProcess AUTOTUNE benchmarking takes 0.0117 seconds and 0.0520 seconds precompiling for 2 choices 2025-12-04T12:10:21.8368589Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-f54f8121a627b22b.xml - 2025-12-04T12:10:21.8368654Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8369229Z FAILED [1.1482s] 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1056964608. 2025-12-04T12:10:21.8369233Z 2025-12-04T12:10:21.8369306Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8369562Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8369564Z 2025-12-04T12:10:21.8369660Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8369722Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8369790Z ================= 1 failed, 152 deselected, 2 rerun in 46.67s ================== 2025-12-04T12:10:21.8369828Z Got exit code 1 2025-12-04T12:10:21.8369868Z Retrying single test... 
2025-12-04T12:10:21.8370012Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-bff1deb47a35137e.xml 2025-12-04T12:10:21.8370067Z ============================= test session starts ============================== 2025-12-04T12:10:21.8370391Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8370431Z cachedir: .pytest_cache 2025-12-04T12:10:21.8370589Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8370633Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8370700Z configfile: pytest.ini 2025-12-04T12:10:21.8370866Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8370957Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8371209Z stepcurrent: skipping 152 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8371252Z Running 1 items in this shard 2025-12-04T12:10:21.8371269Z 2025-12-04T12:10:21.8371482Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [46.2644s] [100%] 2025-12-04T12:10:21.8371692Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.1522s] [100%] 2025-12-04T12:10:21.8371878Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda FAILED [1.4915s] [100%] 2025-12-04T12:10:21.8371881Z 2025-12-04T12:10:21.8371932Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8372074Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8372120Z Traceback (most recent call last): 2025-12-04T12:10:21.8372275Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8372318Z method(*args, **kwargs) 2025-12-04T12:10:21.8372469Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8372510Z method(*args, **kwargs) 2025-12-04T12:10:21.8372662Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8372700Z with policy(): 2025-12-04T12:10:21.8372851Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8372894Z raise RuntimeError(msg) 
2025-12-04T12:10:21.8373275Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.8373278Z 2025-12-04T12:10:21.8373353Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8373634Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8373636Z 2025-12-04T12:10:21.8373724Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8373797Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8373843Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8373899Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8374378Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8374478Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8374527Z graph_break [] 2025-12-04T12:10:21.8374588Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8374662Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8375156Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8375215Z current_size = base.storage().size() 2025-12-04T12:10:21.8375256Z Autotune Choices Stats: 2025-12-04T12:10:21.8375621Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:21.8375668Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8375709Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8375808Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8376038Z triton_mm_0 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8376080Z _scaled_mm 0.0215 ms 29.2% 2025-12-04T12:10:21.8376207Z SingleProcess AUTOTUNE benchmarking takes 0.0134 seconds and 0.0639 seconds precompiling for 2 choices 2025-12-04T12:10:21.8376348Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8376394Z Traceback (most recent call last): 2025-12-04T12:10:21.8376549Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8376591Z method(*args, **kwargs) 
2025-12-04T12:10:21.8376741Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8376782Z method(*args, **kwargs) 2025-12-04T12:10:21.8376932Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8376969Z with policy(): 2025-12-04T12:10:21.8377123Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8377164Z raise RuntimeError(msg) 2025-12-04T12:10:21.8377559Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1038090240. 2025-12-04T12:10:21.8377563Z 2025-12-04T12:10:21.8377635Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8377892Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8377895Z 2025-12-04T12:10:21.8377980Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8379585Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8379632Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8379691Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8380240Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8380356Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8380393Z graph_break [] 2025-12-04T12:10:21.8380454Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8380526Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8381029Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8381079Z current_size = base.storage().size() 2025-12-04T12:10:21.8381119Z Autotune Choices Stats: 2025-12-04T12:10:21.8381479Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:21.8381523Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8381566Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8381665Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8381900Z triton_mm_0 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8381946Z _scaled_mm 0.0215 ms 29.2% 2025-12-04T12:10:21.8382077Z SingleProcess 
AUTOTUNE benchmarking takes 0.0134 seconds and 0.0639 seconds precompiling for 2 choices 2025-12-04T12:10:21.8382149Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8382191Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8382247Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8382361Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8382839Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8382878Z graph_break [] 2025-12-04T12:10:21.8382959Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8383033Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8383077Z Autotune Choices Stats: 2025-12-04T12:10:21.8383434Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007000000216066837, "best_triton_pos": 0} 2025-12-04T12:10:21.8383482Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8383521Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8383620Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8383859Z triton_mm_1 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8383903Z _scaled_mm 0.0238 ms 29.4% 2025-12-04T12:10:21.8384028Z SingleProcess AUTOTUNE benchmarking takes 0.0120 seconds and 0.0530 seconds precompiling for 2 choices 2025-12-04T12:10:21.8384092Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8384233Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8384281Z Traceback (most recent call last): 2025-12-04T12:10:21.8384449Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8384491Z method(*args, **kwargs) 2025-12-04T12:10:21.8384642Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8384682Z method(*args, **kwargs) 2025-12-04T12:10:21.8384833Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8384870Z with policy(): 2025-12-04T12:10:21.8385022Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8385063Z raise RuntimeError(msg) 2025-12-04T12:10:21.8385449Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1134559232. 
2025-12-04T12:10:21.8385452Z 2025-12-04T12:10:21.8385526Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8385783Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8385785Z 2025-12-04T12:10:21.8385872Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8385947Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8385988Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8386059Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8386540Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8386641Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8386691Z graph_break [] 2025-12-04T12:10:21.8386750Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8386823Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8387303Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8387352Z current_size = base.storage().size() 2025-12-04T12:10:21.8387392Z Autotune Choices Stats: 2025-12-04T12:10:21.8387767Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006279999855905771, "best_triton_pos": 0} 2025-12-04T12:10:21.8387812Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8387863Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8387962Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8388191Z triton_mm_0 0.0063 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8388250Z _scaled_mm 0.0215 ms 29.2% 2025-12-04T12:10:21.8388378Z SingleProcess AUTOTUNE benchmarking takes 0.0134 seconds and 0.0639 seconds precompiling for 2 choices 2025-12-04T12:10:21.8388452Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8388494Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8388551Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8388649Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8389129Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8389166Z graph_break [] 2025-12-04T12:10:21.8389225Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8389297Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8389337Z Autotune Choices Stats: 2025-12-04T12:10:21.8389698Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.007000000216066837, "best_triton_pos": 0} 2025-12-04T12:10:21.8389742Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8389782Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8389879Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8390147Z triton_mm_1 0.0070 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8390187Z _scaled_mm 0.0238 ms 29.4% 2025-12-04T12:10:21.8390313Z SingleProcess AUTOTUNE benchmarking takes 0.0120 seconds and 0.0530 seconds precompiling for 2 choices 2025-12-04T12:10:21.8390405Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8390447Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8390516Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8390617Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8391059Z inductor [('triton_bundler_save_kernel', 8), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), 
('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('async_compile_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('extern_calls', 1)] 2025-12-04T12:10:21.8391098Z graph_break [] 2025-12-04T12:10:21.8391157Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8391230Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8391269Z Autotune Choices Stats: 2025-12-04T12:10:21.8391747Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "_scaled_mm", "best_time": 0.006240000016987324, "best_triton_pos": 1, "best_triton_time": 0.010200000368058681, "best_triton_kernel": "triton_mm_2", "best_triton_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1"} 2025-12-04T12:10:21.8391806Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8391846Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8391964Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8392006Z _scaled_mm 0.0062 ms 100.0% 2025-12-04T12:10:21.8392236Z triton_mm_2 0.0102 ms 61.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8392363Z SingleProcess AUTOTUNE benchmarking takes 0.0155 seconds and 0.4322 seconds precompiling for 2 choices 2025-12-04T12:10:21.8392553Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-bff1deb47a35137e.xml - 2025-12-04T12:10:21.8392614Z =========================== short test summary info ============================ 
2025-12-04T12:10:21.8393193Z FAILED [1.4915s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1134559232. 2025-12-04T12:10:21.8393197Z 2025-12-04T12:10:21.8393270Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8393527Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8393530Z 2025-12-04T12:10:21.8393618Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8393678Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8393749Z ================= 1 failed, 187 deselected, 2 rerun in 48.93s ================== 2025-12-04T12:10:21.8393789Z Got exit code 1 2025-12-04T12:10:21.8393829Z Retrying single test... 
2025-12-04T12:10:21.8393974Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7ec28eba9f0ae67d.xml 2025-12-04T12:10:21.8394032Z ============================= test session starts ============================== 2025-12-04T12:10:21.8394156Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8394197Z cachedir: .pytest_cache 2025-12-04T12:10:21.8394356Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8394402Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8394442Z configfile: pytest.ini 2025-12-04T12:10:21.8394607Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8394681Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8394935Z stepcurrent: skipping 152 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8394979Z Running 1 items in this shard 2025-12-04T12:10:21.8394981Z 2025-12-04T12:10:21.8395201Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [46.4232s] [100%] 2025-12-04T12:10:21.8395425Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.2407s] [100%] 2025-12-04T12:10:21.8395620Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda FAILED [1.1006s] [100%] 2025-12-04T12:10:21.8395623Z 2025-12-04T12:10:21.8395674Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8395829Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8395875Z Traceback (most recent call last): 2025-12-04T12:10:21.8396032Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8396074Z method(*args, **kwargs) 2025-12-04T12:10:21.8396226Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8396268Z method(*args, **kwargs) 2025-12-04T12:10:21.8396419Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8396456Z with policy(): 2025-12-04T12:10:21.8396607Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8396649Z raise RuntimeError(msg) 
2025-12-04T12:10:21.8397033Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1019215872. 2025-12-04T12:10:21.8397036Z 2025-12-04T12:10:21.8397109Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8397366Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8397369Z 2025-12-04T12:10:21.8397455Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8397529Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8397572Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8397629Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8398119Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8398218Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8398256Z graph_break [] 2025-12-04T12:10:21.8398316Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8398389Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8398869Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8398919Z current_size = base.storage().size() 2025-12-04T12:10:21.8398968Z Autotune Choices Stats: 2025-12-04T12:10:21.8399334Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.008799999952316284, "best_triton_pos": 0} 2025-12-04T12:10:21.8399389Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8399430Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8399528Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8399772Z triton_mm_0 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8399816Z _scaled_mm 0.0223 ms 39.5% 2025-12-04T12:10:21.8399944Z SingleProcess AUTOTUNE benchmarking takes 0.0134 seconds and 0.0614 seconds precompiling for 2 choices 2025-12-04T12:10:21.8400086Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8400181Z Traceback (most recent call last): 2025-12-04T12:10:21.8400337Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8400377Z method(*args, **kwargs) 
2025-12-04T12:10:21.8400531Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8400572Z method(*args, **kwargs) 2025-12-04T12:10:21.8400723Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8400761Z with policy(): 2025-12-04T12:10:21.8400916Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8400957Z raise RuntimeError(msg) 2025-12-04T12:10:21.8401341Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1019215872 and is now 1038090240. 2025-12-04T12:10:21.8401344Z 2025-12-04T12:10:21.8401418Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8401673Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8401677Z 2025-12-04T12:10:21.8401762Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8401836Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8401901Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8401957Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8402436Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8402535Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8402572Z graph_break [] 2025-12-04T12:10:21.8402630Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8402704Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8403200Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8403279Z current_size = base.storage().size() 2025-12-04T12:10:21.8403321Z Autotune Choices Stats: 2025-12-04T12:10:21.8403684Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.008799999952316284, "best_triton_pos": 0} 2025-12-04T12:10:21.8403743Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8403784Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8403885Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8404115Z triton_mm_0 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8404158Z _scaled_mm 0.0223 ms 39.5% 2025-12-04T12:10:21.8404283Z SingleProcess 
AUTOTUNE benchmarking takes 0.0134 seconds and 0.0614 seconds precompiling for 2 choices 2025-12-04T12:10:21.8404356Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8404399Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8404458Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8404556Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8405035Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8405074Z graph_break [] 2025-12-04T12:10:21.8405133Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8405207Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8405246Z Autotune Choices Stats: 2025-12-04T12:10:21.8405603Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00711899995803833, "best_triton_pos": 0} 2025-12-04T12:10:21.8405646Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8405700Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8405798Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8406027Z triton_mm_1 0.0071 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8406068Z _scaled_mm 0.0241 ms 29.6% 2025-12-04T12:10:21.8406195Z SingleProcess AUTOTUNE benchmarking takes 0.0120 seconds and 0.0523 seconds precompiling for 2 choices 2025-12-04T12:10:21.8406248Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8406390Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8406434Z Traceback (most recent call last): 2025-12-04T12:10:21.8406602Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8406644Z method(*args, **kwargs) 2025-12-04T12:10:21.8406796Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8406848Z method(*args, **kwargs) 2025-12-04T12:10:21.8406998Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8407036Z with policy(): 2025-12-04T12:10:21.8407188Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8407241Z raise RuntimeError(msg) 2025-12-04T12:10:21.8407634Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1056964608. 
2025-12-04T12:10:21.8407637Z 2025-12-04T12:10:21.8407710Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8407966Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8407969Z 2025-12-04T12:10:21.8408056Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8408131Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8408176Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8408233Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8408714Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8408813Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8408849Z graph_break [] 2025-12-04T12:10:21.8408909Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8408981Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8409462Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8409509Z current_size = base.storage().size() 2025-12-04T12:10:21.8409566Z Autotune Choices Stats: 2025-12-04T12:10:21.8409930Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.008799999952316284, "best_triton_pos": 0} 2025-12-04T12:10:21.8409976Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8410019Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8410163Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8410394Z triton_mm_0 0.0088 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8410434Z _scaled_mm 0.0223 ms 39.5% 2025-12-04T12:10:21.8410583Z SingleProcess AUTOTUNE benchmarking takes 0.0134 seconds and 0.0614 seconds precompiling for 2 choices 2025-12-04T12:10:21.8410657Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8410712Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8410767Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8410865Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8411340Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), 
('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8411400Z graph_break [] 2025-12-04T12:10:21.8411460Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8411535Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8411574Z Autotune Choices Stats: 2025-12-04T12:10:21.8411935Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_1", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.00711899995803833, "best_triton_pos": 0} 2025-12-04T12:10:21.8411980Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8412020Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8412119Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8412350Z triton_mm_1 0.0071 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8412392Z _scaled_mm 0.0241 ms 29.6% 2025-12-04T12:10:21.8412520Z SingleProcess AUTOTUNE benchmarking takes 0.0120 seconds and 0.0523 seconds precompiling for 2 choices 2025-12-04T12:10:21.8412594Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8412636Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8412691Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8412788Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8413267Z inductor [('triton_bundler_save_kernel', 16), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), 
('fxgraph_cache_miss', 1), ('generated_module_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_num_precompiles', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8413320Z graph_break [] 2025-12-04T12:10:21.8413380Z aten_mm_info [('aten._scaled_mm.default_3_16_32', 1)] 2025-12-04T12:10:21.8413451Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8413493Z Autotune Choices Stats: 2025-12-04T12:10:21.8413854Z {"num_choices": 2, "num_triton_choices": 1, "best_kernel": "triton_mm_2", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.010040000081062317, "best_triton_pos": 0} 2025-12-04T12:10:21.8413898Z AUTOTUNE scaled_mm(3x32, 32x16, , ) 2025-12-04T12:10:21.8413939Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8414035Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8414275Z triton_mm_2 0.0100 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8414315Z _scaled_mm 0.0239 ms 42.0% 2025-12-04T12:10:21.8414453Z SingleProcess AUTOTUNE benchmarking takes 0.0119 seconds and 0.0476 seconds precompiling for 2 choices 2025-12-04T12:10:21.8414641Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-7ec28eba9f0ae67d.xml - 2025-12-04T12:10:21.8414701Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8415288Z FAILED [1.1006s] 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1038090240 and is now 1056964608. 2025-12-04T12:10:21.8415291Z 2025-12-04T12:10:21.8415364Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8415622Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8415624Z 2025-12-04T12:10:21.8415709Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8415771Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 
2025-12-04T12:10:21.8415840Z ================= 1 failed, 187 deselected, 2 rerun in 48.78s ================== 2025-12-04T12:10:21.8415878Z Got exit code 1 2025-12-04T12:10:21.8416085Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda 2025-12-04T12:10:21.8416212Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.8416357Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-faffce0c2b45da00.xml 2025-12-04T12:10:21.8416416Z ============================= test session starts ============================== 2025-12-04T12:10:21.8416527Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8416568Z cachedir: .pytest_cache 2025-12-04T12:10:21.8416726Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8416773Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8416813Z configfile: pytest.ini 2025-12-04T12:10:21.8416977Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8417063Z collecting ... collected 188 items / 153 deselected / 35 selected 2025-12-04T12:10:21.8417118Z stepcurrent: skipping 153 already run items. 
2025-12-04T12:10:21.8417164Z Running 35 items in this shard 2025-12-04T12:10:21.8417167Z 2025-12-04T12:10:21.8417384Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [46.9327s] [ 2%] 2025-12-04T12:10:21.8417596Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.4174s] [ 2%] 2025-12-04T12:10:21.8417786Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda FAILED [1.3604s] [ 2%] 2025-12-04T12:10:21.8417788Z 2025-12-04T12:10:21.8417857Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8418000Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8418048Z Traceback (most recent call last): 2025-12-04T12:10:21.8418217Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8418259Z method(*args, **kwargs) 2025-12-04T12:10:21.8418411Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8418464Z method(*args, **kwargs) 2025-12-04T12:10:21.8418612Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8418650Z with policy(): 2025-12-04T12:10:21.8418801Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8418845Z raise RuntimeError(msg) 2025-12-04T12:10:21.8419233Z RuntimeError: CUDA driver API confirmed a leak in 
__main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.8419236Z 2025-12-04T12:10:21.8419309Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8419567Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8419571Z 2025-12-04T12:10:21.8419656Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8419729Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8419772Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8419831Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8420354Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8420456Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8420493Z graph_break [] 2025-12-04T12:10:21.8420555Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8420628Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8421128Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. 
It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8421180Z current_size = base.storage().size() 2025-12-04T12:10:21.8421220Z Autotune Choices Stats: 2025-12-04T12:10:21.8421588Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.8421636Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8421677Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8421776Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8422023Z triton_mm_0 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8422262Z triton_mm_5 0.0068 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8422485Z triton_mm_2 0.0068 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8422724Z triton_mm_1 0.0072 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, 
num_warps=8 2025-12-04T12:10:21.8422945Z triton_mm_7 0.0073 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8423170Z triton_mm_3 0.0077 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8423393Z triton_mm_6 0.0078 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8423619Z triton_mm_4 0.0087 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8423662Z _scaled_mm 0.0211 ms 30.6% 2025-12-04T12:10:21.8423789Z SingleProcess AUTOTUNE benchmarking takes 0.0430 seconds and 0.1814 seconds precompiling for 9 choices 2025-12-04T12:10:21.8423934Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8423979Z Traceback (most recent call last): 2025-12-04T12:10:21.8424134Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8424175Z method(*args, **kwargs) 2025-12-04T12:10:21.8424329Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8424369Z method(*args, **kwargs) 2025-12-04T12:10:21.8424520Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 
2025-12-04T12:10:21.8424557Z with policy(): 2025-12-04T12:10:21.8424722Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8424764Z raise RuntimeError(msg) 2025-12-04T12:10:21.8425154Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.8425157Z 2025-12-04T12:10:21.8425230Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8425488Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8425490Z 2025-12-04T12:10:21.8425587Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8425660Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8425703Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8425770Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8426250Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8426361Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8426398Z graph_break [] 2025-12-04T12:10:21.8426459Z aten_mm_info 
[('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8426535Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8427022Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8427069Z current_size = base.storage().size() 2025-12-04T12:10:21.8427110Z Autotune Choices Stats: 2025-12-04T12:10:21.8427472Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.8427519Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8427560Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8427659Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8427891Z triton_mm_0 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8428116Z triton_mm_5 0.0068 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8428337Z triton_mm_2 0.0068 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8428570Z triton_mm_1 0.0072 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8428792Z triton_mm_7 0.0073 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8429016Z triton_mm_3 0.0077 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8429238Z triton_mm_6 0.0078 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8429471Z triton_mm_4 0.0087 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8429522Z _scaled_mm 0.0211 ms 30.6% 2025-12-04T12:10:21.8429649Z SingleProcess AUTOTUNE benchmarking takes 0.0430 seconds and 0.1814 seconds precompiling for 9 choices 2025-12-04T12:10:21.8429721Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8429764Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8429820Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8429929Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8430440Z inductor [('triton_bundler_save_kernel', 72), 
('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8430478Z graph_break [] 2025-12-04T12:10:21.8430539Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8430612Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8430651Z Autotune Choices Stats: 2025-12-04T12:10:21.8431011Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.00595899997279048, "best_triton_pos": 0} 2025-12-04T12:10:21.8431057Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8431097Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8431197Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8431425Z triton_mm_9 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8431652Z triton_mm_11 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8431874Z triton_mm_13 0.0069 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8432128Z triton_mm_12 0.0071 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8432349Z triton_mm_10 0.0072 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8432571Z triton_mm_14 0.0072 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8432794Z triton_mm_15 0.0073 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8433041Z triton_mm_8 0.0093 ms 63.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8433083Z _scaled_mm 0.0215 ms 27.7% 2025-12-04T12:10:21.8433208Z SingleProcess AUTOTUNE benchmarking takes 0.0412 seconds and 0.1046 seconds precompiling for 9 choices 2025-12-04T12:10:21.8433284Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8433425Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8433472Z Traceback (most recent call last): 2025-12-04T12:10:21.8433648Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 
2025-12-04T12:10:21.8433691Z method(*args, **kwargs) 2025-12-04T12:10:21.8433842Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8433883Z method(*args, **kwargs) 2025-12-04T12:10:21.8434035Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8434075Z with policy(): 2025-12-04T12:10:21.8434227Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8434269Z raise RuntimeError(msg) 2025-12-04T12:10:21.8434658Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.8434661Z 2025-12-04T12:10:21.8434734Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8434995Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8434998Z 2025-12-04T12:10:21.8435085Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8435159Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8435201Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8435257Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8435736Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8435834Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8435883Z graph_break [] 2025-12-04T12:10:21.8435944Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8436016Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8436498Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8436546Z current_size = base.storage().size() 2025-12-04T12:10:21.8436587Z Autotune Choices Stats: 2025-12-04T12:10:21.8436963Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_0", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006440000142902136, "best_triton_pos": 0} 2025-12-04T12:10:21.8437009Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8437061Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8437161Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8437393Z triton_mm_0 0.0064 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8437630Z triton_mm_5 0.0068 ms 95.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8437852Z triton_mm_2 0.0068 ms 94.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8438075Z triton_mm_1 0.0072 ms 89.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8438295Z triton_mm_7 0.0073 ms 88.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8438520Z triton_mm_3 0.0077 ms 83.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8438743Z triton_mm_6 0.0078 ms 83.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8438963Z triton_mm_4 0.0087 ms 73.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8439005Z _scaled_mm 0.0211 ms 30.6% 2025-12-04T12:10:21.8439131Z SingleProcess AUTOTUNE benchmarking takes 0.0430 seconds and 0.1814 seconds precompiling for 9 choices 2025-12-04T12:10:21.8439206Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8439246Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8439303Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8439401Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8439892Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8439929Z graph_break [] 2025-12-04T12:10:21.8439990Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 
2025-12-04T12:10:21.8440061Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8440141Z Autotune Choices Stats: 2025-12-04T12:10:21.8440498Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.00595899997279048, "best_triton_pos": 0} 2025-12-04T12:10:21.8440567Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8440608Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8440706Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8440950Z triton_mm_9 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8441175Z triton_mm_11 0.0061 ms 98.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8441420Z triton_mm_13 0.0069 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8441647Z triton_mm_12 0.0071 ms 84.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8441870Z triton_mm_10 0.0072 ms 83.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8442093Z triton_mm_14 0.0072 ms 82.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8442315Z triton_mm_15 0.0073 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8442542Z triton_mm_8 0.0093 ms 63.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8442583Z _scaled_mm 0.0215 ms 27.7% 2025-12-04T12:10:21.8442710Z SingleProcess AUTOTUNE benchmarking takes 0.0412 seconds and 0.1046 seconds precompiling for 9 choices 2025-12-04T12:10:21.8442782Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8442824Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8442881Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8442980Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8443478Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8443515Z graph_break [] 2025-12-04T12:10:21.8443577Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8443649Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8443690Z Autotune Choices Stats: 2025-12-04T12:10:21.8444047Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_22", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2", "best_time": 0.006200000178068876, "best_triton_pos": 0} 2025-12-04T12:10:21.8444094Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8444144Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8444245Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8444474Z triton_mm_22 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8444707Z triton_mm_21 0.0069 ms 90.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8444951Z triton_mm_20 0.0070 ms 89.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8445173Z triton_mm_23 0.0071 ms 87.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8445399Z triton_mm_16 0.0071 ms 87.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8445624Z triton_mm_17 0.0072 ms 86.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8445849Z triton_mm_19 0.0072 ms 85.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8446071Z triton_mm_18 0.0074 ms 83.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8446111Z _scaled_mm 0.0204 ms 30.3% 2025-12-04T12:10:21.8446239Z SingleProcess AUTOTUNE benchmarking takes 0.0568 seconds and 0.3163 seconds precompiling for 9 choices 2025-12-04T12:10:21.8446428Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-faffce0c2b45da00.xml - 2025-12-04T12:10:21.8446488Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8447080Z FAILED [1.3604s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.8447083Z 2025-12-04T12:10:21.8447158Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8447418Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8447420Z 2025-12-04T12:10:21.8447506Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8447569Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8447638Z ================= 1 failed, 153 deselected, 2 rerun in 49.73s ================== 2025-12-04T12:10:21.8447676Z Got exit code 1 2025-12-04T12:10:21.8447715Z Retrying single test... 2025-12-04T12:10:21.8447870Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-721574c55326e156.xml 2025-12-04T12:10:21.8447927Z ============================= test session starts ============================== 2025-12-04T12:10:21.8448040Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8448091Z cachedir: .pytest_cache 2025-12-04T12:10:21.8448249Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8448294Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8448335Z configfile: pytest.ini 2025-12-04T12:10:21.8448508Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8448583Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8448839Z stepcurrent: skipping 153 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8448883Z Running 1 items in this shard 2025-12-04T12:10:21.8448886Z 2025-12-04T12:10:21.8449101Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [44.8985s] [100%] 2025-12-04T12:10:21.8449313Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.4128s] [100%] 2025-12-04T12:10:21.8449503Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda FAILED [1.4128s] [100%] 2025-12-04T12:10:21.8449506Z 2025-12-04T12:10:21.8449557Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8449700Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8449746Z Traceback (most recent call last): 2025-12-04T12:10:21.8449905Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8449948Z method(*args, **kwargs) 2025-12-04T12:10:21.8450141Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8450182Z method(*args, **kwargs) 2025-12-04T12:10:21.8450332Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8450369Z with policy(): 2025-12-04T12:10:21.8450522Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8450563Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.8450972Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.8450975Z 2025-12-04T12:10:21.8451049Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8451307Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8451310Z 2025-12-04T12:10:21.8451397Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8451469Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8451512Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8451567Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8452067Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8452180Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8452217Z graph_break [] 2025-12-04T12:10:21.8452277Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8452362Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8452846Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8452894Z current_size = base.storage().size() 2025-12-04T12:10:21.8452939Z Autotune Choices Stats: 2025-12-04T12:10:21.8453304Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8453351Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8453392Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8453494Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8453729Z triton_mm_4 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8453955Z triton_mm_5 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8454182Z triton_mm_3 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8454408Z triton_mm_6 0.0064 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8454641Z triton_mm_7 0.0069 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8454864Z triton_mm_1 0.0072 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8455086Z triton_mm_0 0.0075 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8455310Z triton_mm_2 0.0080 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8455349Z _scaled_mm 0.0227 ms 26.9% 2025-12-04T12:10:21.8455491Z SingleProcess AUTOTUNE benchmarking takes 0.0388 seconds and 0.1873 seconds precompiling for 9 choices 2025-12-04T12:10:21.8455635Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8455681Z Traceback (most recent call last): 2025-12-04T12:10:21.8455849Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8455890Z method(*args, **kwargs) 2025-12-04T12:10:21.8456041Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8456093Z method(*args, **kwargs) 2025-12-04T12:10:21.8456242Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8456279Z with policy(): 2025-12-04T12:10:21.8456434Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8456477Z raise RuntimeError(msg) 2025-12-04T12:10:21.8456867Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.8456871Z 2025-12-04T12:10:21.8456943Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8457200Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8457203Z 2025-12-04T12:10:21.8457289Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8457363Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8457407Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8457464Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8457941Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8458041Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8458077Z graph_break [] 2025-12-04T12:10:21.8458138Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8458209Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8458704Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8458753Z current_size = base.storage().size() 2025-12-04T12:10:21.8458794Z Autotune Choices Stats: 2025-12-04T12:10:21.8459158Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8459204Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8459244Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8459352Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8459585Z triton_mm_4 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8459824Z triton_mm_5 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8460048Z triton_mm_3 0.0062 ms 98.1% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8460340Z triton_mm_6 0.0064 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8460559Z triton_mm_7 0.0069 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8460785Z triton_mm_1 0.0072 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8461008Z triton_mm_0 0.0075 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8461232Z triton_mm_2 0.0080 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8461273Z _scaled_mm 0.0227 ms 26.9% 2025-12-04T12:10:21.8461400Z SingleProcess AUTOTUNE benchmarking takes 0.0388 seconds and 0.1873 seconds precompiling for 9 choices 2025-12-04T12:10:21.8461473Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8461515Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8461573Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8461672Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8462152Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8462210Z graph_break [] 2025-12-04T12:10:21.8462271Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8462342Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8462383Z Autotune Choices Stats: 2025-12-04T12:10:21.8462742Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8462790Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8462830Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8462929Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8463169Z triton_mm_9 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8463394Z triton_mm_10 0.0061 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8463636Z triton_mm_15 0.0064 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8463876Z triton_mm_13 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8464103Z triton_mm_12 0.0067 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8464326Z triton_mm_8 0.0068 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8464550Z triton_mm_14 0.0068 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8464778Z triton_mm_11 0.0069 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8464817Z _scaled_mm 0.0246 ms 24.4% 2025-12-04T12:10:21.8464947Z SingleProcess AUTOTUNE benchmarking takes 0.0365 seconds and 0.0932 seconds precompiling for 9 choices 2025-12-04T12:10:21.8464998Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8465142Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8465186Z Traceback (most recent call last): 2025-12-04T12:10:21.8465344Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8465385Z method(*args, **kwargs) 2025-12-04T12:10:21.8465536Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8465576Z method(*args, **kwargs) 2025-12-04T12:10:21.8465726Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8465763Z with policy(): 2025-12-04T12:10:21.8465925Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8465965Z raise RuntimeError(msg) 2025-12-04T12:10:21.8466355Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.8466358Z 2025-12-04T12:10:21.8466432Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8466688Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8466690Z 2025-12-04T12:10:21.8466786Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8466859Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8466904Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8466970Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8467447Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8467554Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8467592Z graph_break [] 2025-12-04T12:10:21.8467653Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8467726Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8468206Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8468253Z current_size = base.storage().size() 2025-12-04T12:10:21.8468294Z Autotune Choices Stats: 2025-12-04T12:10:21.8468655Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_4", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.006120000034570694, "best_triton_pos": 0} 2025-12-04T12:10:21.8468701Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8468741Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8468841Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8469075Z triton_mm_4 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8469301Z triton_mm_5 0.0062 ms 99.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8469524Z triton_mm_3 0.0062 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8469756Z triton_mm_6 0.0064 ms 96.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8469983Z triton_mm_7 0.0069 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8470238Z triton_mm_1 0.0072 ms 84.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8470461Z triton_mm_0 0.0075 ms 81.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8470705Z triton_mm_2 0.0080 ms 76.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8470758Z _scaled_mm 0.0227 ms 26.9% 2025-12-04T12:10:21.8470885Z SingleProcess AUTOTUNE benchmarking takes 0.0388 seconds and 0.1873 seconds precompiling for 9 choices 2025-12-04T12:10:21.8470956Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8470999Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8471067Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8471167Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8471644Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8471682Z graph_break [] 2025-12-04T12:10:21.8471742Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 
2025-12-04T12:10:21.8471815Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8471855Z Autotune Choices Stats: 2025-12-04T12:10:21.8472215Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_9", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8472262Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8472302Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8472402Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8472630Z triton_mm_9 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8472856Z triton_mm_10 0.0061 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8473080Z triton_mm_15 0.0064 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8473336Z triton_mm_13 0.0066 ms 90.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8473560Z triton_mm_12 0.0067 ms 89.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, 
matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8473784Z triton_mm_8 0.0068 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8474009Z triton_mm_14 0.0068 ms 87.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8474253Z triton_mm_11 0.0069 ms 86.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8474295Z _scaled_mm 0.0246 ms 24.4% 2025-12-04T12:10:21.8474421Z SingleProcess AUTOTUNE benchmarking takes 0.0365 seconds and 0.0932 seconds precompiling for 9 choices 2025-12-04T12:10:21.8474509Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8474551Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8474608Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8474706Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8475199Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8475237Z graph_break [] 2025-12-04T12:10:21.8475297Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8475370Z 
----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8475409Z Autotune Choices Stats: 2025-12-04T12:10:21.8475768Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_17", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8", "best_time": 0.005919999908655882, "best_triton_pos": 0} 2025-12-04T12:10:21.8475813Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8475855Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8475952Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8476188Z triton_mm_17 0.0059 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8476413Z triton_mm_18 0.0062 ms 96.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8476638Z triton_mm_22 0.0062 ms 94.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8476860Z triton_mm_23 0.0063 ms 94.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8477094Z triton_mm_20 0.0071 ms 83.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, 
waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8477322Z triton_mm_16 0.0073 ms 81.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8477545Z triton_mm_19 0.0074 ms 80.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8477767Z triton_mm_21 0.0084 ms 70.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8477820Z _scaled_mm 0.0212 ms 28.0% 2025-12-04T12:10:21.8477947Z SingleProcess AUTOTUNE benchmarking takes 0.0571 seconds and 0.3496 seconds precompiling for 9 choices 2025-12-04T12:10:21.8478134Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-721574c55326e156.xml - 2025-12-04T12:10:21.8478206Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8478787Z FAILED [1.4128s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.8478799Z 2025-12-04T12:10:21.8478874Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8479134Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8479137Z 2025-12-04T12:10:21.8479224Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8479285Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8479354Z ================= 1 failed, 187 deselected, 2 rerun in 47.74s ================== 2025-12-04T12:10:21.8479392Z Got exit code 1 2025-12-04T12:10:21.8479432Z Retrying single test... 2025-12-04T12:10:21.8479575Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6e0c1d9ead29bd58.xml 2025-12-04T12:10:21.8479632Z ============================= test session starts ============================== 2025-12-04T12:10:21.8479744Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8479785Z cachedir: .pytest_cache 2025-12-04T12:10:21.8479943Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8479989Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8480029Z configfile: pytest.ini 2025-12-04T12:10:21.8480226Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8480302Z collecting ... collected 188 items / 187 deselected / 1 selected 2025-12-04T12:10:21.8480556Z stepcurrent: skipping 153 already run items. 
Running only test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8480598Z Running 1 items in this shard 2025-12-04T12:10:21.8480600Z 2025-12-04T12:10:21.8480838Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [24.9233s] [100%] 2025-12-04T12:10:21.8481052Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda ('RERUN', {'yellow': True}) [1.3837s] [100%] 2025-12-04T12:10:21.8481241Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda FAILED [1.5259s] [100%] 2025-12-04T12:10:21.8481244Z 2025-12-04T12:10:21.8481295Z ==================================== RERUNS ==================================== 2025-12-04T12:10:21.8481436Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8481482Z Traceback (most recent call last): 2025-12-04T12:10:21.8481659Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8481701Z method(*args, **kwargs) 2025-12-04T12:10:21.8481853Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8481907Z method(*args, **kwargs) 2025-12-04T12:10:21.8482057Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8482095Z with policy(): 2025-12-04T12:10:21.8482248Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8482305Z raise 
RuntimeError(msg) 2025-12-04T12:10:21.8482693Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 807403520 and is now 1033895936. 2025-12-04T12:10:21.8482695Z 2025-12-04T12:10:21.8482770Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8483029Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8483031Z 2025-12-04T12:10:21.8483116Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8483190Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8483232Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8483288Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8483766Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8483866Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8483902Z graph_break [] 2025-12-04T12:10:21.8483963Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8484035Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8484518Z 
/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8484576Z current_size = base.storage().size() 2025-12-04T12:10:21.8484616Z Autotune Choices Stats: 2025-12-04T12:10:21.8484982Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.8485028Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8485070Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8485170Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8485403Z triton_mm_3 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8485639Z triton_mm_1 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8485874Z triton_mm_2 0.0068 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8486098Z triton_mm_7 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, 
BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8486334Z triton_mm_4 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8486556Z triton_mm_6 0.0074 ms 82.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8486777Z triton_mm_5 0.0074 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8486999Z triton_mm_0 0.0102 ms 59.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8487042Z _scaled_mm 0.0229 ms 26.5% 2025-12-04T12:10:21.8487169Z SingleProcess AUTOTUNE benchmarking takes 0.0459 seconds and 0.2046 seconds precompiling for 9 choices 2025-12-04T12:10:21.8487314Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8487382Z Traceback (most recent call last): 2025-12-04T12:10:21.8487538Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8487580Z method(*args, **kwargs) 2025-12-04T12:10:21.8487732Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8487771Z method(*args, **kwargs) 2025-12-04T12:10:21.8487922Z 
File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8487961Z with policy(): 2025-12-04T12:10:21.8488113Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8488154Z raise RuntimeError(msg) 2025-12-04T12:10:21.8488561Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1033895936 and is now 1067450368. 2025-12-04T12:10:21.8488564Z 2025-12-04T12:10:21.8488637Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8488894Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8488897Z 2025-12-04T12:10:21.8488983Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8489056Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8489099Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8489155Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8489646Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8489754Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8489791Z graph_break [] 2025-12-04T12:10:21.8489863Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8489936Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8490451Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8490499Z current_size = base.storage().size() 2025-12-04T12:10:21.8490540Z Autotune Choices Stats: 2025-12-04T12:10:21.8490903Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.8490948Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8490989Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8491087Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8491319Z triton_mm_3 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8491542Z triton_mm_1 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8491765Z triton_mm_2 0.0068 ms 88.9% 
ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8491989Z triton_mm_7 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8492238Z triton_mm_4 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8492460Z triton_mm_6 0.0074 ms 82.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8492683Z triton_mm_5 0.0074 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8492906Z triton_mm_0 0.0102 ms 59.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8492947Z _scaled_mm 0.0229 ms 26.5% 2025-12-04T12:10:21.8493087Z SingleProcess AUTOTUNE benchmarking takes 0.0459 seconds and 0.2046 seconds precompiling for 9 choices 2025-12-04T12:10:21.8493160Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8493217Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8493274Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8493372Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), 
('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8493849Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8493900Z graph_break [] 2025-12-04T12:10:21.8493960Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8494034Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8494074Z Autotune Choices Stats: 2025-12-04T12:10:21.8494436Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8494481Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8494523Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8494620Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8494850Z triton_mm_15 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8495077Z triton_mm_13 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8495304Z triton_mm_12 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, 
BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8495536Z triton_mm_10 0.0074 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8496087Z triton_mm_14 0.0078 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8496609Z triton_mm_9 0.0088 ms 68.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8497112Z triton_mm_11 0.0092 ms 65.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8497618Z triton_mm_8 0.0099 ms 60.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8497927Z _scaled_mm 0.0225 ms 26.6% 2025-12-04T12:10:21.8498128Z SingleProcess AUTOTUNE benchmarking takes 0.0436 seconds and 0.0726 seconds precompiling for 9 choices 2025-12-04T12:10:21.8498369Z =================================== FAILURES =================================== 2025-12-04T12:10:21.8498605Z _ TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda _ 2025-12-04T12:10:21.8498843Z Traceback (most recent call last): 2025-12-04T12:10:21.8499080Z File 
"/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8499311Z method(*args, **kwargs) 2025-12-04T12:10:21.8499530Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T12:10:21.8499772Z method(*args, **kwargs) 2025-12-04T12:10:21.8499988Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T12:10:21.8500262Z with policy(): 2025-12-04T12:10:21.8500472Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T12:10:21.8500720Z raise RuntimeError(msg) 2025-12-04T12:10:21.8501187Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.8501637Z 2025-12-04T12:10:21.8501711Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8502083Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8502377Z 2025-12-04T12:10:21.8502465Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8502663Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8502818Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8502946Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8503521Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8504144Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8504331Z graph_break [] 2025-12-04T12:10:21.8504446Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8504614Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8505230Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/select_algorithm.py:3433: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly. 
To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage() 2025-12-04T12:10:21.8505797Z current_size = base.storage().size() 2025-12-04T12:10:21.8505918Z Autotune Choices Stats: 2025-12-04T12:10:21.8506348Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_3", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4", "best_time": 0.0060800001956522465, "best_triton_pos": 0} 2025-12-04T12:10:21.8506807Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8506956Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8507145Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8507512Z triton_mm_3 0.0061 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8508020Z triton_mm_1 0.0062 ms 97.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8508547Z triton_mm_2 0.0068 ms 88.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8509034Z triton_mm_7 0.0069 ms 88.4% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8509634Z triton_mm_4 0.0070 ms 86.9% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, 
EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8510179Z triton_mm_6 0.0074 ms 82.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8510660Z triton_mm_5 0.0074 ms 82.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8511141Z triton_mm_0 0.0102 ms 59.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8511441Z _scaled_mm 0.0229 ms 26.5% 2025-12-04T12:10:21.8511639Z SingleProcess AUTOTUNE benchmarking takes 0.0459 seconds and 0.2046 seconds precompiling for 9 choices 2025-12-04T12:10:21.8511881Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8512033Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8512161Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8512354Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8512996Z inductor [('triton_bundler_save_kernel', 72), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8513545Z graph_break [] 2025-12-04T12:10:21.8513660Z aten_mm_info 
[('aten._scaled_mm.default_3_2048_32', 1)] 2025-12-04T12:10:21.8513828Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8513978Z Autotune Choices Stats: 2025-12-04T12:10:21.8514399Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_15", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006000000052154064, "best_triton_pos": 0} 2025-12-04T12:10:21.8514836Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8514956Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8515125Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8515503Z triton_mm_15 0.0060 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8516004Z triton_mm_13 0.0061 ms 98.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8516498Z triton_mm_12 0.0062 ms 96.8% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8517010Z triton_mm_10 0.0074 ms 81.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8517494Z triton_mm_14 0.0078 ms 77.3% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, 
GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8517984Z triton_mm_9 0.0088 ms 68.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8518466Z triton_mm_11 0.0092 ms 65.2% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8518950Z triton_mm_8 0.0099 ms 60.5% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8519251Z _scaled_mm 0.0225 ms 26.6% 2025-12-04T12:10:21.8519449Z SingleProcess AUTOTUNE benchmarking takes 0.0436 seconds and 0.0726 seconds precompiling for 9 choices 2025-12-04T12:10:21.8519685Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:21.8519836Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:21.8519964Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T12:10:21.8520198Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:21.8520811Z inductor [('triton_bundler_save_kernel', 72), ('async_compile_cache_miss', 10), ('benchmarking.InductorBenchmarker.benchmark_gpu', 9), ('generated_module_cache_miss', 8), ('select_algorithm_num_precompiles', 8), ('fxgraph_cache_miss', 1), ('select_algorithm_precompile', 1), ('select_algorithm_autotune', 1), ('benchmarking.InductorBenchmarker.benchmark', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T12:10:21.8521358Z graph_break [] 2025-12-04T12:10:21.8521494Z aten_mm_info [('aten._scaled_mm.default_3_2048_32', 
1)] 2025-12-04T12:10:21.8521663Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:21.8521812Z Autotune Choices Stats: 2025-12-04T12:10:21.8522236Z {"num_choices": 9, "num_triton_choices": 8, "best_kernel": "triton_mm_23", "best_kernel_desc": "ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1", "best_time": 0.006159000098705292, "best_triton_pos": 0} 2025-12-04T12:10:21.8522673Z AUTOTUNE scaled_mm(3x32, 32x2048, , ) 2025-12-04T12:10:21.8522790Z strides: [32, 1], [1, 32], [], [] 2025-12-04T12:10:21.8522959Z dtypes: torch.float8_e4m3fnuz, torch.float8_e4m3fnuz, torch.float32, torch.float32 2025-12-04T12:10:21.8523335Z triton_mm_23 0.0062 ms 100.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=16, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=1 2025-12-04T12:10:21.8523822Z triton_mm_19 0.0063 ms 98.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8524322Z triton_mm_21 0.0064 ms 95.6% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=32, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8524831Z triton_mm_17 0.0067 ms 91.7% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8525329Z triton_mm_20 0.0069 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=128, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, 
kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8525814Z triton_mm_18 0.0069 ms 89.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=4 2025-12-04T12:10:21.8526300Z triton_mm_16 0.0072 ms 86.0% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=256, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=8 2025-12-04T12:10:21.8526788Z triton_mm_22 0.0072 ms 85.1% ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=32, BLOCK_M=16, BLOCK_N=64, EVEN_K=True, GROUP_M=8, USE_FAST_ACCUM=True, kpack=2, matrix_instr_nonkdim=16, waves_per_eu=0, num_stages=2, num_warps=2 2025-12-04T12:10:21.8527091Z _scaled_mm 0.0245 ms 25.2% 2025-12-04T12:10:21.8527287Z SingleProcess AUTOTUNE benchmarking takes 0.0538 seconds and 0.3781 seconds precompiling for 9 choices 2025-12-04T12:10:21.8527641Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-6e0c1d9ead29bd58.xml - 2025-12-04T12:10:21.8527925Z =========================== short test summary info ============================ 2025-12-04T12:10:21.8528605Z FAILED [1.5259s] inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda! Caching allocator allocated memory was 0 and is now reported as 1024 on device 0. CUDA driver allocated memory was 1067450368 and is now 1101004800. 
2025-12-04T12:10:21.8529217Z 2025-12-04T12:10:21.8529290Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:21.8529672Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_fp8.py TestFP8LoweringCUDA.test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8529967Z 2025-12-04T12:10:21.8530056Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:21.8530281Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:21.8530448Z ================= 1 failed, 187 deselected, 2 rerun in 27.85s ================== 2025-12-04T12:10:21.8530590Z Got exit code 1 2025-12-04T12:10:21.8530853Z FAILED CONSISTENTLY: test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda 2025-12-04T12:10:21.8531222Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:10:21.8531525Z Test results will be stored in test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-fe7b686c9d4c129a.xml 2025-12-04T12:10:21.8531778Z ============================= test session starts ============================== 2025-12-04T12:10:21.8531984Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:21.8532182Z cachedir: .pytest_cache 2025-12-04T12:10:21.8532404Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:21.8532641Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:10:21.8532755Z configfile: pytest.ini 2025-12-04T12:10:21.8532982Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:10:21.8533272Z collecting ... 
collected 188 items / 154 deselected / 34 selected 2025-12-04T12:10:21.8533439Z stepcurrent: skipping 154 already run items. 2025-12-04T12:10:21.8533572Z Running 34 items in this shard 2025-12-04T12:10:21.8533642Z 2025-12-04T12:10:21.8533948Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0365s] (XPU does not support use_fast_accum=True for now) [ 2%] 2025-12-04T12:10:21.8534579Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0332s] (XPU does not support use_fast_accum=True for now) [ 5%] 2025-12-04T12:10:21.8535201Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0321s] (XPU does not support use_fast_accum=True for now) [ 8%] 2025-12-04T12:10:21.8535830Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0027s] (XPU does not support use_fast_accum=True for now) [ 11%] 2025-12-04T12:10:21.8536445Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0022s] (XPU does not support use_fast_accum=True for now) [ 14%] 2025-12-04T12:10:21.8537053Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0021s] (XPU does not support use_fast_accum=True for now) [ 17%] 2025-12-04T12:10:21.8537656Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0022s] (XPU does not support use_fast_accum=True for now) [ 20%] 2025-12-04T12:10:21.8538273Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0020s] (XPU does not support use_fast_accum=True for now) [ 23%] 2025-12-04T12:10:21.8538875Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0020s] (XPU does not support use_fast_accum=True for now) [ 26%] 2025-12-04T12:10:21.8539481Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0021s] (XPU does not support use_fast_accum=True for now) [ 29%] 2025-12-04T12:10:21.8540083Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0021s] (XPU does not support use_fast_accum=True for now) [ 32%] 2025-12-04T12:10:21.8540733Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_bfloat16_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_bfloat16 SKIPPED [0.0020s] (XPU does not support use_fast_accum=True for now) [ 35%] 2025-12-04T12:10:21.8541343Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_float32 SKIPPED [0.0020s] (XPU does not support use_fast_accum=True for now) [ 38%] 2025-12-04T12:10:21.8541981Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_float32 SKIPPED [0.0021s] (XPU does not support use_fast_accum=True for now) [ 41%] 2025-12-04T12:10:21.8542618Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_float32 SKIPPED [0.0012s] (bias is not supported when output dtype is float32) [ 44%] 2025-12-04T12:10:21.8543240Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_float32 SKIPPED [0.0011s] (bias is not supported when output dtype is float32) [ 47%] 2025-12-04T12:10:21.8543850Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_float32 SKIPPED [0.0021s] (XPU does not support use_fast_accum=True for now) [ 50%] 2025-12-04T12:10:21.8544450Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_float32 SKIPPED [0.0020s] (XPU does not support use_fast_accum=True for now) [ 52%] 2025-12-04T12:10:21.8545050Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_float32 SKIPPED [0.0012s] (bias is not supported when output dtype is float32) [ 55%] 2025-12-04T12:10:21.8545655Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_float32 SKIPPED [0.0012s] (bias is not supported when output dtype is float32) [ 58%] 2025-12-04T12:10:21.8546255Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda_float32 SKIPPED [0.0021s] (XPU does not support use_fast_accum=True for now) [ 61%] 2025-12-04T12:10:21.8546852Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda_float32 SKIPPED [0.0020s] (XPU does not support use_fast_accum=True for now) [ 64%] 2025-12-04T12:10:21.8547451Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda_float32 SKIPPED [0.0012s] (bias is not supported when output dtype is float32) [ 67%] 2025-12-04T12:10:21.8548066Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_float32_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda_float32 SKIPPED [0.0012s] (bias is not supported when output dtype is float32) [ 70%] 2025-12-04T12:10:21.8548642Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_bfloat16_shape_1024,1024,512_use_fast_accum_False_cuda_bfloat16 SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 73%] 2025-12-04T12:10:21.8549182Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_bfloat16_shape_1024,1024,512_use_fast_accum_True_cuda_bfloat16 SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 76%] 2025-12-04T12:10:21.8549713Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_bfloat16_shape_16,32,32_use_fast_accum_False_cuda_bfloat16 SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 79%] 2025-12-04T12:10:21.8550301Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_bfloat16_shape_16,32,32_use_fast_accum_True_cuda_bfloat16 SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 82%] 
2025-12-04T12:10:21.8550840Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_float32_shape_1024,1024,512_use_fast_accum_False_cuda_float32 SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 85%] 2025-12-04T12:10:21.8551378Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_float32_shape_1024,1024,512_use_fast_accum_True_cuda_float32 SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 88%] 2025-12-04T12:10:21.8551927Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_float32_shape_16,32,32_use_fast_accum_False_cuda_float32 SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 91%] 2025-12-04T12:10:21.8552444Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_tma_template_float32_shape_16,32,32_use_fast_accum_True_cuda_float32 SKIPPED [0.0001s] (Need device-side TMA support in Triton) [ 94%] 2025-12-04T12:10:21.8552997Z inductor/test_fp8.py::TestFP8LoweringCUDA::test_unacceptable_input_dims_cuda E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] failed while attempting to run meta for aten._scaled_mm.default 2025-12-04T12:10:21.8553457Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.8553901Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T12:10:21.8554325Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] r = func(*args, **kwargs) 2025-12-04T12:10:21.8554713Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T12:10:21.8555104Z E1204 12:10:18.370000 
1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] return self._op(*args, **kwargs) 2025-12-04T12:10:21.8555530Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 6528, in meta_scaled_mm 2025-12-04T12:10:21.8555953Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] return _check_scaled_mm_sizes( 2025-12-04T12:10:21.8556385Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 6384, in _check_scaled_mm_sizes 2025-12-04T12:10:21.8556798Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] torch._check( 2025-12-04T12:10:21.8557199Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T12:10:21.8557653Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T12:10:21.8558111Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T12:10:21.8558513Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] raise error_type(message_evaluated) 2025-12-04T12:10:21.8558900Z E1204 12:10:18.370000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] RuntimeError: Expected self.size(1) to be divisible by 16, but got self.size(1)=15 2025-12-04T12:10:21.8559179Z PASSED [0.5782s] [ 97%] 2025-12-04T12:10:21.8559547Z 
inductor/test_fp8.py::TestFP8LoweringCUDA::test_unacceptable_scale_dims_rowwise_scaling_cuda E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] failed while attempting to run meta for aten._scaled_mm.default 2025-12-04T12:10:21.8560051Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] Traceback (most recent call last): 2025-12-04T12:10:21.8560531Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T12:10:21.8560977Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] r = func(*args, **kwargs) 2025-12-04T12:10:21.8561358Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T12:10:21.8561745Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] return self._op(*args, **kwargs) 2025-12-04T12:10:21.8562167Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 6528, in meta_scaled_mm 2025-12-04T12:10:21.8562586Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] return _check_scaled_mm_sizes( 2025-12-04T12:10:21.8563020Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 6498, in _check_scaled_mm_sizes 2025-12-04T12:10:21.8563428Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] torch._check( 2025-12-04T12:10:21.8563801Z E1204 12:10:18.547000 1138511 
site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T12:10:21.8564250Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T12:10:21.8564706Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T12:10:21.8565112Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] raise error_type(message_evaluated) 2025-12-04T12:10:21.8565901Z E1204 12:10:18.547000 1138511 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] RuntimeError: Invalid scaling configuration. For tensorwise scaling, both scales should be scalar. For rowwise scaling, scale_a should be (233, 1), scale_b should be (1, 128). For (BlockWise1x128, BlockWise128x128), scale_a should be (233, 1), scale_b should be (1, 1). For (BlockWise1x128, BlockWise1x128), scale_a should be (233, 1), scale_b should be (1, 128). 
Got scale_a.size()=(1, 128) and scale_b.size()=(233, 1) 2025-12-04T12:10:21.8566552Z PASSED [0.1673s] [100%] 2025-12-04T12:10:21.8566617Z 2025-12-04T12:10:21.8566808Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_fp8/inductor.test_fp8-fe7b686c9d4c129a.xml - 2025-12-04T12:10:21.8567105Z ================ 2 passed, 32 skipped, 154 deselected in 0.92s ================= 2025-12-04T12:10:21.8567684Z The following tests failed and then succeeded when run in a new process['test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda'] 2025-12-04T12:10:21.8580296Z The following tests failed consistently: ['test/inductor/test_fp8.py::TestFP8TypesCUDA::test_eager_fallback_float16_cuda_float16', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda', 
'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda', 
'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_1024,1024,512_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_False_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_False_use_fast_accum_True_persistent_matmul_False_cuda', 
'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,16,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_False_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_rowwise_scaling_shape_16,32,32_has_bias_True_use_fast_accum_True_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_1024_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_16_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1024_K_32_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_1024_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_16_N_2048_persistent_matmul_False_cuda', 
'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_1_K_32_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_1024_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_16_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_257_K_32_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_1024_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_16_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_33_K_32_N_2048_persistent_matmul_False_cuda', 
'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_1024_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_16_N_2048_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_16_persistent_matmul_False_cuda', 'test/inductor/test_fp8.py::TestFP8LoweringCUDA::test_tensorwise_scaling_acceptable_input_dims_M_3_K_32_N_2048_persistent_matmul_False_cuda'] 2025-12-04T12:10:21.8592406Z 2025-12-04T12:10:21.8592550Z FINISHED PRINTING LOG FILE of inductor/test_fp8 1/1 (test/test-reports/inductor.test_fp8_1.1_79611dc7575145fb_.log) 2025-12-04T12:10:21.8592725Z 2025-12-04T12:10:21.8592828Z Finished inductor/test_fp8 1/1 ... [2025-12-04 12:10:19.544389][2199682.005364201], took 82.14min 2025-12-04T12:10:21.8593207Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:10:21.8593564Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:10:21.8593785Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T12:10:21.8593970Z Uploading artifacts took 0.00 seconds 2025-12-04T12:10:21.8594254Z inductor/test_fp8 1/1 failed! 2025-12-04T12:10:21.8594428Z Running inductor/test_flex_flash 1/1 ... 
[2025-12-04 12:10:19.550730][2199682.011719295] 2025-12-04T12:10:21.8594619Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:10:21.8595010Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_flex_flash.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:10:19.550938] 2025-12-04T12:10:22.8214634Z 2025-12-04T12:10:22.8215125Z inductor/test_flex_flash 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_flex_flash_1.1_607ec3eba3893f4b_.log 2025-12-04T12:10:22.8226811Z Running 58 items in this shard: test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_kernel_called_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_kernel_called_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_mask_mod_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_mask_mod_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_score_mod_capture_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_score_mod_capture_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_score_mod_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_score_mod_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_basic_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_basic_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_block_mask_with_score_mod_cuda_bfloat16, 
test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_block_mask_with_score_mod_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_kernel_called_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_kernel_called_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_mask_mod_with_dual_buffers_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_mask_mod_with_dual_buffers_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_mask_mod_with_view_buffer_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_mask_mod_with_view_buffer_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_score_mod_with_many_buffer_indexing_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_score_mod_with_many_buffer_indexing_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_127_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_127_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_255_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_255_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_383_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_383_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_511_cuda_bfloat16, 
test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_511_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_alibi_learned_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_alibi_learned_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_batch_bias_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_batch_bias_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_batch_head_bias_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_batch_head_bias_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_block_mask_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_block_mask_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_doc_mask_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_doc_mask_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_dual_buffer_bias_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_dual_buffer_bias_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_head_scale_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_head_scale_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_mask_mod_buffer_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_mask_mod_buffer_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_pos_bias_table_cuda_bfloat16, 
test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_pos_bias_table_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_and_mask_buffers_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_and_mask_buffers_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_causal_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_causal_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_rel_bias_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_rel_bias_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_times_two_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_times_two_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_view_buffer_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_view_buffer_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_impl_error_with_requires_grad_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_impl_error_with_requires_grad_cuda_float16 2025-12-04T12:10:22.8236229Z 2025-12-04T12:10:22.8236360Z Finished inductor/test_flex_flash 1/1 ... [2025-12-04 12:10:22.821289][2199685.282272925], took 0.05min 2025-12-04T12:10:22.8236774Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:10:22.8277190Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:10:22.8280937Z Running dynamo/test_model_output 1/1 ... 
[2025-12-04 12:10:22.827820][2199685.288809836] 2025-12-04T12:10:22.8281435Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:10:22.8282347Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_model_output.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:10:22.828015] 2025-12-04T12:10:25.3462746Z 2025-12-04T12:10:25.3463265Z dynamo/test_model_output 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_model_output_1.1_f726078f11415e18_.log 2025-12-04T12:10:25.3465576Z Running 18 items in this shard: test/dynamo/test_model_output.py::TestHFPretrained::test_pretrained, test/dynamo/test_model_output.py::TestHFPretrained::test_pretrained_non_const_attr, test/dynamo/test_model_output.py::TestModelOutput::test_mo_assign, test/dynamo/test_model_output.py::TestModelOutput::test_mo_create, test/dynamo/test_model_output.py::TestModelOutput::test_mo_from_outside, test/dynamo/test_model_output.py::TestModelOutput::test_mo_getattr, test/dynamo/test_model_output.py::TestModelOutput::test_mo_getattr_missing, test/dynamo/test_model_output.py::TestModelOutput::test_mo_getitem, test/dynamo/test_model_output.py::TestModelOutput::test_mo_index, test/dynamo/test_model_output.py::TestModelOutput::test_mo_init, test/dynamo/test_model_output.py::TestModelOutput::test_mo_init2, test/dynamo/test_model_output.py::TestModelOutput::test_mo_init_with_disable, test/dynamo/test_model_output.py::TestModelOutput::test_mo_newkey, test/dynamo/test_model_output.py::TestModelOutput::test_mo_reconstruct_bytecode, test/dynamo/test_model_output.py::TestModelOutput::test_mo_tuple, test/dynamo/test_model_output.py::TestModelOutput::test_none, test/dynamo/test_model_output.py::TestModelOutput::test_reconstruction, test/dynamo/test_model_output.py::TestModelOutputBertCUDA::test_HF_bert_model_output_cuda 
2025-12-04T12:10:25.3467439Z 2025-12-04T12:10:25.3467563Z Finished dynamo/test_model_output 1/1 ... [2025-12-04 12:10:25.346006][2199687.806992703], took 0.04min 2025-12-04T12:10:25.3472100Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:10:25.3522620Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:10:25.3524588Z Running inductor/test_metrics 1/1 ... [2025-12-04 12:10:25.352308][2199687.813296947] 2025-12-04T12:10:25.3525035Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:10:25.3526826Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_metrics.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:10:25.352495] 2025-12-04T12:10:38.0370657Z 2025-12-04T12:10:38.0371703Z inductor/test_metrics 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_metrics_1.1_9f87b764ab8118f2_.log 2025-12-04T12:10:38.0373715Z Running 6 items in this shard: test/inductor/test_metrics.py::TestMetrics::test_atomic_add, test/inductor/test_metrics.py::TestMetrics::test_count_args, test/inductor/test_metrics.py::TestMetrics::test_count_pattern, test/inductor/test_metrics.py::TestMetrics::test_kernel_args_num_gb, test/inductor/test_metrics.py::TestMetrics::test_parse_proper_kernel_fn_code, test/inductor/test_metrics.py::TestMetrics::test_parse_reduction_hint 2025-12-04T12:10:38.0375196Z 2025-12-04T12:10:38.0375476Z Finished inductor/test_metrics 1/1 ... 
[2025-12-04 12:10:38.036771][2199700.497756668], took 0.21min 2025-12-04T12:10:38.0385236Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:10:38.0436708Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:10:38.0438743Z Running export/test_unflatten_training_ir 1/1 ... [2025-12-04 12:10:38.043725][2199700.504714583] 2025-12-04T12:10:38.0439126Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:10:38.0440378Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_unflatten_training_ir.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:10:38.043915] 2025-12-04T12:10:50.9289172Z 2025-12-04T12:10:50.9289831Z export/test_unflatten_training_ir 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_unflatten_training_ir_1.1_65e19b6c42de1613_.log 2025-12-04T12:10:50.9297082Z Running 29 items in this shard: test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_assert_tensor_metadata_stack_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_attr_as_submod_input_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_dedup_sym_size_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_double_nested_submodule_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_duplicate_placeholder_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_fx_trace_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_nested_leaf_non_strict_training_ir, 
test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_placeholder_and_get_attr_ordering_after_unflattened_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_simple_alias_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_buffer_mutation_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_constant_obj_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_constant_tensor_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_container_type_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_eager_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_empty_branch_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_nested_access_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_nested_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_none_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_param_list_dict_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_preserve_signature_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_preserve_with_unused_input_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_requires_grad_param_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_root_module_type_training_ir, 
test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_shared_submodule_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_skipped_call_module_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_submodule_ordering_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_with_inplace_compile_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_wrong_input_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflattened_module_nodes_has_meta_val_training_ir 2025-12-04T12:10:50.9302284Z 2025-12-04T12:10:50.9302421Z Finished export/test_unflatten_training_ir 1/1 ... [2025-12-04 12:10:50.928741][2199713.389728248], took 0.21min 2025-12-04T12:10:50.9302844Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:10:50.9349655Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:10:50.9373249Z Running inductor/test_triton_kernels 1/1 ... [2025-12-04 12:10:50.935056][2199713.396045321] 2025-12-04T12:10:50.9374217Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:10:50.9375001Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_triton_kernels.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:10:50.935247] 2025-12-04T12:12:24.6605902Z 2025-12-04T12:12:24.6607056Z inductor/test_triton_kernels 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_triton_kernels_1.1_754fd6bade9a512a_.log 2025-12-04T12:12:24.6666913Z Running 366 items in this shard: test/inductor/test_triton_kernels.py::KernelTests::test_constexpr_dynamic_shapes_wrapped_False_autotune_False, test/inductor/test_triton_kernels.py::KernelTests::test_constexpr_dynamic_shapes_wrapped_False_autotune_True, test/inductor/test_triton_kernels.py::KernelTests::test_constexpr_dynamic_shapes_wrapped_True_autotune_False, test/inductor/test_triton_kernels.py::KernelTests::test_constexpr_dynamic_shapes_wrapped_True_autotune_True, test/inductor/test_triton_kernels.py::KernelTests::test_i64_input, test/inductor/test_triton_kernels.py::KernelTests::test_kernel_inline_asm_quotes_double, test/inductor/test_triton_kernels.py::KernelTests::test_kernel_inline_asm_quotes_single, test/inductor/test_triton_kernels.py::KernelTests::test_kernel_with_docstring_quotes_double, test/inductor/test_triton_kernels.py::KernelTests::test_kernel_with_docstring_quotes_single, test/inductor/test_triton_kernels.py::KernelTests::test_layout_constraint_needs_fixed_stride_order, test/inductor/test_triton_kernels.py::KernelTests::test_no_nan_kernels, test/inductor/test_triton_kernels.py::KernelTests::test_on_device_tma_dynamic_False_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_on_device_tma_dynamic_False_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_on_device_tma_dynamic_True_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_on_device_tma_dynamic_True_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_capture_and_functionalize_dynamic_False_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_capture_and_functionalize_dynamic_False_tma_version_old, 
test/inductor/test_triton_kernels.py::KernelTests::test_tma_capture_and_functionalize_dynamic_True_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_capture_and_functionalize_dynamic_True_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_False_backend_aot_eager_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_False_backend_aot_eager_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_False_backend_eager_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_False_backend_eager_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_False_backend_inductor_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_False_backend_inductor_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_True_backend_aot_eager_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_True_backend_aot_eager_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_True_backend_eager_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_True_backend_eager_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_True_backend_inductor_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_1d_dynamic_True_backend_inductor_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_2d_dynamic_False_backend_aot_eager_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_2d_dynamic_False_backend_aot_eager_tma_version_old, 
test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_2d_dynamic_False_backend_eager_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_2d_dynamic_False_backend_eager_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_2d_dynamic_True_backend_aot_eager_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_2d_dynamic_True_backend_aot_eager_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_2d_dynamic_True_backend_eager_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_2d_dynamic_True_backend_eager_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_dedup_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_descriptor_dedup_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_graph_breaks_after_data_ptr_False_after_create_desc_False_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_graph_breaks_after_data_ptr_False_after_create_desc_False_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_graph_breaks_after_data_ptr_False_after_create_desc_True_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_graph_breaks_after_data_ptr_False_after_create_desc_True_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_graph_breaks_after_data_ptr_True_after_create_desc_False_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_graph_breaks_after_data_ptr_True_after_create_desc_False_tma_version_old, test/inductor/test_triton_kernels.py::KernelTests::test_tma_graph_breaks_after_data_ptr_True_after_create_desc_True_tma_version_new, test/inductor/test_triton_kernels.py::KernelTests::test_tma_graph_breaks_after_data_ptr_True_after_create_desc_True_tma_version_old, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_attrs_dict_equal_1_None_format, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_aot_eager_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_aot_eager_grid_type_1_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_aot_eager_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_aot_eager_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_aot_eager_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_aot_eager_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_eager_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_eager_grid_type_1_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_eager_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_eager_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_eager_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_eager_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_inductor_grid_type_1_tdlp_0, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_inductor_grid_type_1_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_inductor_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_inductor_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_inductor_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_False_backend_inductor_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_aot_eager_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_aot_eager_grid_type_1_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_aot_eager_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_aot_eager_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_aot_eager_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_aot_eager_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_eager_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_eager_grid_type_1_tdlp_1, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_eager_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_eager_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_eager_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_eager_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_inductor_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_inductor_grid_type_1_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_inductor_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_inductor_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_inductor_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_False_dynamic_True_backend_inductor_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_aot_eager_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_aot_eager_grid_type_1_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_aot_eager_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_aot_eager_grid_type_2_tdlp_1, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_aot_eager_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_aot_eager_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_eager_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_eager_grid_type_1_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_eager_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_eager_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_eager_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_eager_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_inductor_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_inductor_grid_type_1_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_inductor_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_inductor_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_inductor_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_False_backend_inductor_grid_type_3_tdlp_1, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_aot_eager_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_aot_eager_grid_type_1_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_aot_eager_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_aot_eager_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_aot_eager_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_aot_eager_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_eager_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_eager_grid_type_1_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_eager_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_eager_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_eager_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_eager_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_inductor_grid_type_1_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_inductor_grid_type_1_tdlp_1, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_inductor_grid_type_2_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_inductor_grid_type_2_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_inductor_grid_type_3_tdlp_0, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_2d_autotune_grad_True_dynamic_True_backend_inductor_grid_type_3_tdlp_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_False_backend_aot_eager_grid_type_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_False_backend_aot_eager_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_False_backend_aot_eager_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_False_backend_eager_grid_type_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_False_backend_eager_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_False_backend_eager_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_False_backend_inductor_grid_type_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_False_backend_inductor_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_False_backend_inductor_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_True_backend_aot_eager_grid_type_1, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_True_backend_aot_eager_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_True_backend_aot_eager_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_True_backend_eager_grid_type_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_True_backend_eager_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_True_backend_eager_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_True_backend_inductor_grid_type_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_True_backend_inductor_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_False_dynamic_True_backend_inductor_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_False_backend_aot_eager_grid_type_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_False_backend_aot_eager_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_False_backend_aot_eager_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_False_backend_eager_grid_type_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_False_backend_eager_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_False_backend_eager_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_False_backend_inductor_grid_type_1, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_False_backend_inductor_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_False_backend_inductor_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_True_backend_aot_eager_grid_type_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_True_backend_aot_eager_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_True_backend_aot_eager_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_True_backend_eager_grid_type_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_True_backend_eager_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_True_backend_eager_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_True_backend_inductor_grid_type_1, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_True_backend_inductor_grid_type_2, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_grad_True_dynamic_True_backend_inductor_grid_type_3, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_with_unsupported_args_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_with_unsupported_args_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_autotune_with_unsupported_args_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_caching, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_caching_duplicate, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_constants, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_dependancies, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_different_shapes_size_16_dynamic_False, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_different_shapes_size_16_dynamic_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_different_shapes_size_4_dynamic_False, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_different_shapes_size_4_dynamic_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_dtype_view_cfg_cpp_wrapper, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_dtype_view_cfg_normal, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_empty_autotune_config_dict_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_empty_autotune_config_dict_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_empty_autotune_config_dict_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_emulate_precision_mm_kernels_do_not_change, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_emulate_precision_unaffected, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_equal_to_1_arg_dump_launch_params_0_dynamic_False, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_equal_to_1_arg_dump_launch_params_0_dynamic_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_equal_to_1_arg_dump_launch_params_1_dynamic_False, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_equal_to_1_arg_dump_launch_params_1_dynamic_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_equal_to_1_float_arg_dynamic_False, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_equal_to_1_float_arg_dynamic_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_fallback, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_float64_constant_float16, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_float64_constant_float32, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_float64_constant_float64, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_functionalize, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_global_constexpr, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_higher_order_func, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_inner_triton_function_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_inner_triton_function_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_inner_triton_function_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_inputs_buffer_reuse, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_matmul_tracking, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_multi_kernel_grad_False, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_multi_kernel_grad_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_multiple_outputs_dynamic_False_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_multiple_outputs_dynamic_False_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_multiple_outputs_dynamic_False_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_multiple_outputs_dynamic_True_backend_aot_eager, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_multiple_outputs_dynamic_True_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_multiple_outputs_dynamic_True_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_mutation_not_mark_dirty, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_mutation_type, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_False_dynamic_False_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_False_dynamic_False_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_False_dynamic_False_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_False_dynamic_True_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_False_dynamic_True_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_False_dynamic_True_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_True_dynamic_False_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_True_dynamic_False_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_True_dynamic_False_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_True_dynamic_True_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_True_dynamic_True_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_native_grad_True_dynamic_True_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_no_clones_grad_False_dynamic_False, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_no_clones_grad_False_dynamic_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_no_clones_grad_True_dynamic_False, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_no_clones_grad_True_dynamic_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_none_args, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_num_ctas_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_num_ctas_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_num_ctas_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_out_of_order, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_reinplace_inplaceable_pass, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_restore_value_backend_aot_eager_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_restore_value_backend_aot_eager_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_restore_value_backend_eager_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_restore_value_backend_eager_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_restore_value_backend_inductor_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_restore_value_backend_inductor_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_slice_and_view_input, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_kwargs_with_autotune_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_kwargs_with_autotune_backend_eager, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_kwargs_with_autotune_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_kwargs_without_autotune_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_kwargs_without_autotune_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_kwargs_without_autotune_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_params_autotune_False_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_params_autotune_False_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_params_autotune_False_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_params_autotune_True_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_params_autotune_True_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_special_params_autotune_True_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_strided_input, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_strided_input_nonzero_offset, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_to_cpu, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_tracing_dynamic_False, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_tracing_dynamic_True, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_triton_dtype_dynamic_False_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_triton_dtype_dynamic_False_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_triton_dtype_dynamic_False_backend_inductor, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_triton_dtype_dynamic_True_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_triton_dtype_dynamic_True_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_triton_dtype_dynamic_True_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_unbacked_shape_tensor_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_unbacked_shape_tensor_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_unbacked_shape_tensor_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_various_args, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_constexpr_function, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_grad_option_grad_fn0_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_grad_option_grad_fn0_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_grad_option_grad_fn0_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_grad_option_grad_fn1_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_grad_option_grad_fn1_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_grad_option_grad_fn1_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_imported_symbol, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_imported_symbol_with_custom_name, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_kernel_param, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_views_dynamic_False_backend_aot_eager, 
test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_views_dynamic_False_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_views_dynamic_False_backend_inductor, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_views_dynamic_True_backend_aot_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_views_dynamic_True_backend_eager, test/inductor/test_triton_kernels.py::KernelTests::test_triton_kernel_with_views_dynamic_True_backend_inductor, test/inductor/test_triton_kernels.py::MutationTests::test_add_for_loop, test/inductor/test_triton_kernels.py::MutationTests::test_add_for_loop2, test/inductor/test_triton_kernels.py::MutationTests::test_add_kernel_on_device_tma_new_api, test/inductor/test_triton_kernels.py::MutationTests::test_add_kernel_on_device_tma_old_api, test/inductor/test_triton_kernels.py::MutationTests::test_add_nested_for_loop, test/inductor/test_triton_kernels.py::MutationTests::test_add_nested_for_loop_multi_return, test/inductor/test_triton_kernels.py::MutationTests::test_argmax, test/inductor/test_triton_kernels.py::MutationTests::test_branch_with_multiple_yield_args, test/inductor/test_triton_kernels.py::MutationTests::test_cumsum, test/inductor/test_triton_kernels.py::MutationTests::test_fn_call_multi_return, test/inductor/test_triton_kernels.py::MutationTests::test_fn_call_one_return, test/inductor/test_triton_kernels.py::MutationTests::test_for_loop_arg, test/inductor/test_triton_kernels.py::MutationTests::test_for_loop_arg_2, test/inductor/test_triton_kernels.py::MutationTests::test_get_tma_stores, test/inductor/test_triton_kernels.py::MutationTests::test_labels, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_add_4_times_kernel, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_add_kernel, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_add_kernel_2d_autotuned, 
test/inductor/test_triton_kernels.py::MutationTests::test_mutations_add_kernel_with_block_ptr, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_add_kernel_with_import, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_atomic_add_kernel, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_cond_op_kernel, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_indirection_kernel, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_indirection_kernel1, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_inline_asm_kernel_is_pure_false, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_inline_asm_kernel_is_pure_true, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_kernel_with_block_ptr_2d, test/inductor/test_triton_kernels.py::MutationTests::test_mutations_mul2_inplace_kernel, test/inductor/test_triton_kernels.py::MutationTests::test_nested_cond_op_kernel, test/inductor/test_triton_kernels.py::MutationTests::test_out_of_order_kernel, test/inductor/test_triton_kernels.py::MutationTests::test_out_of_order_kernel_call, test/inductor/test_triton_kernels.py::MutationTests::test_reduce_sum, test/inductor/test_triton_kernels.py::MutationTests::test_triton_kernel_inference_mode, test/inductor/test_triton_kernels.py::MutationTests::test_while_loop, test/inductor/test_triton_kernels.py::CustomOpTests::test_add_kernel_autotuned_False_dynamic_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_add_kernel_autotuned_False_dynamic_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_add_kernel_autotuned_True_dynamic_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_add_kernel_autotuned_True_dynamic_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_autotune_no_pre_or_post_hook_user_defined, test/inductor/test_triton_kernels.py::CustomOpTests::test_autotune_unbacked, 
test/inductor/test_triton_kernels.py::CustomOpTests::test_capture_triton_meta, test/inductor/test_triton_kernels.py::CustomOpTests::test_capture_triton_special_kwargs_dynamic_False_autotune_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_capture_triton_special_kwargs_dynamic_False_autotune_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_capture_triton_special_kwargs_dynamic_True_autotune_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_capture_triton_special_kwargs_dynamic_True_autotune_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_preserves_strides_variant_custom_op, test/inductor/test_triton_kernels.py::CustomOpTests::test_preserves_strides_variant_mutable_custom_op, test/inductor/test_triton_kernels.py::CustomOpTests::test_preserves_strides_variant_triton_kernel, test/inductor/test_triton_kernels.py::CustomOpTests::test_subclass, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_dynamic_grid_no_recompile, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_False_backend_aot_eager_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_False_backend_aot_eager_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_False_backend_eager_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_False_backend_eager_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_False_backend_inductor_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_False_backend_inductor_autotune_at_compile_time_True, 
test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_True_backend_aot_eager_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_True_backend_aot_eager_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_True_backend_eager_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_True_backend_eager_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_True_backend_inductor_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_heuristic_non_strict_True_backend_inductor_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_False_backend_aot_eager_with_perf_model_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_False_backend_aot_eager_with_perf_model_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_False_backend_eager_with_perf_model_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_False_backend_eager_with_perf_model_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_False_backend_inductor_with_perf_model_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_False_backend_inductor_with_perf_model_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_True_backend_aot_eager_with_perf_model_False, 
test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_True_backend_aot_eager_with_perf_model_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_True_backend_eager_with_perf_model_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_True_backend_eager_with_perf_model_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_True_backend_inductor_with_perf_model_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_non_strict_True_backend_inductor_with_perf_model_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_recompile_backend_aot_eager_with_perf_model_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_recompile_backend_aot_eager_with_perf_model_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_recompile_backend_eager_with_perf_model_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_recompile_backend_eager_with_perf_model_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_recompile_backend_inductor_with_perf_model_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_prune_configs_by_recompile_backend_inductor_with_perf_model_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_reset_to_zero_backend_aot_eager_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_reset_to_zero_backend_aot_eager_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_reset_to_zero_backend_eager_autotune_at_compile_time_False, 
test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_reset_to_zero_backend_eager_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_reset_to_zero_backend_inductor_autotune_at_compile_time_False, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_kernel_reset_to_zero_backend_inductor_autotune_at_compile_time_True, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_single_autotune_backend_aot_eager, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_single_autotune_backend_eager, test/inductor/test_triton_kernels.py::CustomOpTests::test_triton_single_autotune_backend_inductor, test/inductor/test_triton_kernels.py::CustomOpTests::test_wrap_triton_disabled_in_triton_op 2025-12-04T12:12:24.6722303Z 2025-12-04T12:12:24.6722436Z Finished inductor/test_triton_kernels 1/1 ... [2025-12-04 12:12:24.660556][2199807.121543373], took 1.56min 2025-12-04T12:12:24.6722862Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:12:24.6723216Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:12:24.6723433Z Running dynamo/test_modules 1/1 ... [2025-12-04 12:12:24.666754][2199807.127743598] 2025-12-04T12:12:24.6723612Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:12:24.6723991Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_modules.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:12:24.666941] 2025-12-04T12:12:45.0415440Z 2025-12-04T12:12:45.0416084Z dynamo/test_modules 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_modules_1.1_fd28dd1eb1a1fec6_.log 2025-12-04T12:12:45.0429658Z Running 135 items in this shard: test/dynamo/test_modules.py::NNModuleTests::test_access_by_keys, test/dynamo/test_modules.py::NNModuleTests::test_basicmodule1, test/dynamo/test_modules.py::NNModuleTests::test_basicmodule2, test/dynamo/test_modules.py::NNModuleTests::test_call_fn_with_non_const_inputs_safe, test/dynamo/test_modules.py::NNModuleTests::test_cfgmod, test/dynamo/test_modules.py::NNModuleTests::test_children, test/dynamo/test_modules.py::NNModuleTests::test_constloop, test/dynamo/test_modules.py::NNModuleTests::test_conv_call_forward_directly, test/dynamo/test_modules.py::NNModuleTests::test_conv_call_super_forward_directly, test/dynamo/test_modules.py::NNModuleTests::test_conv_transpose_call_forward_directly, test/dynamo/test_modules.py::NNModuleTests::test_conv_transpose_call_super_forward_directly, test/dynamo/test_modules.py::NNModuleTests::test_densenet, test/dynamo/test_modules.py::NNModuleTests::test_enumvalues, test/dynamo/test_modules.py::NNModuleTests::test_fnmember, test/dynamo/test_modules.py::NNModuleTests::test_fnmembercmp1, test/dynamo/test_modules.py::NNModuleTests::test_fnmembercmp2, test/dynamo/test_modules.py::NNModuleTests::test_forward_directly, test/dynamo/test_modules.py::NNModuleTests::test_generation_tag, test/dynamo/test_modules.py::NNModuleTests::test_hasattr, test/dynamo/test_modules.py::NNModuleTests::test_inject_module_parameters, test/dynamo/test_modules.py::NNModuleTests::test_intarg, test/dynamo/test_modules.py::NNModuleTests::test_iseval1, test/dynamo/test_modules.py::NNModuleTests::test_iseval2, test/dynamo/test_modules.py::NNModuleTests::test_isnonelayer, test/dynamo/test_modules.py::NNModuleTests::test_istraining1, 
test/dynamo/test_modules.py::NNModuleTests::test_istraining2, test/dynamo/test_modules.py::NNModuleTests::test_layerlist, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module1, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module2, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module4, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module5, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module6, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module7, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module_bad_params, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module_bad_params_call_function, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module_kwargs, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module_no_cls_to_become, test/dynamo/test_modules.py::NNModuleTests::test_lazy_module_speculation_log_divergence, test/dynamo/test_modules.py::NNModuleTests::test_module_attribute_precedence, test/dynamo/test_modules.py::NNModuleTests::test_module_call_module_with_static_forward, test/dynamo/test_modules.py::NNModuleTests::test_module_class_method, test/dynamo/test_modules.py::NNModuleTests::test_module_comparison, test/dynamo/test_modules.py::NNModuleTests::test_module_forward_has_graph_break, test/dynamo/test_modules.py::NNModuleTests::test_module_guard_name_is_valid, test/dynamo/test_modules.py::NNModuleTests::test_module_name_string, test/dynamo/test_modules.py::NNModuleTests::test_module_property, test/dynamo/test_modules.py::NNModuleTests::test_module_static_method, test/dynamo/test_modules.py::NNModuleTests::test_moduledict, test/dynamo/test_modules.py::NNModuleTests::test_moduledict_custom, test/dynamo/test_modules.py::NNModuleTests::test_modulelist, test/dynamo/test_modules.py::NNModuleTests::test_modulelist_custom, test/dynamo/test_modules.py::NNModuleTests::test_modulelist_nested, test/dynamo/test_modules.py::NNModuleTests::test_modulemethod1, 
test/dynamo/test_modules.py::NNModuleTests::test_modulemethod2, test/dynamo/test_modules.py::NNModuleTests::test_named_children, test/dynamo/test_modules.py::NNModuleTests::test_nn_module_setattr, test/dynamo/test_modules.py::NNModuleTests::test_nn_module_unspec_int_attr, test/dynamo/test_modules.py::NNModuleTests::test_nn_moduledict_contains, test/dynamo/test_modules.py::NNModuleTests::test_parameterdict, test/dynamo/test_modules.py::NNModuleTests::test_parameterdict_custom, test/dynamo/test_modules.py::NNModuleTests::test_parameters1, test/dynamo/test_modules.py::NNModuleTests::test_parameters2, test/dynamo/test_modules.py::NNModuleTests::test_parameters3, test/dynamo/test_modules.py::NNModuleTests::test_parameters4, test/dynamo/test_modules.py::NNModuleTests::test_parameters5, test/dynamo/test_modules.py::NNModuleTests::test_self_mutating1, test/dynamo/test_modules.py::NNModuleTests::test_seq, test/dynamo/test_modules.py::NNModuleTests::test_sequential_with_duplicated_module, test/dynamo/test_modules.py::NNModuleTests::test_sequential_with_duplicated_module2, test/dynamo/test_modules.py::NNModuleTests::test_simple_torch_function, test/dynamo/test_modules.py::NNModuleTests::test_stringmember, test/dynamo/test_modules.py::NNModuleTests::test_submodules1, test/dynamo/test_modules.py::NNModuleTests::test_submodules2, test/dynamo/test_modules.py::NNModuleTests::test_super1, test/dynamo/test_modules.py::NNModuleTests::test_super2, test/dynamo/test_modules.py::NNModuleTests::test_super_class_method, test/dynamo/test_modules.py::NNModuleTests::test_tensorlist, test/dynamo/test_modules.py::NNModuleTests::test_torch_function_with_closure, test/dynamo/test_modules.py::NNModuleTests::test_torch_mangled_class_name, test/dynamo/test_modules.py::NNModuleTests::test_unsupportedmethod, test/dynamo/test_modules.py::NNModuleTests::test_unsupportedmodule, test/dynamo/test_modules.py::NNModuleTests::test_viamodulecall, 
test/dynamo/test_modules.py::OptimizedModuleTest::test_assign_does_not_exist, test/dynamo/test_modules.py::OptimizedModuleTest::test_attr, test/dynamo/test_modules.py::OptimizedModuleTest::test_attr_precedence, test/dynamo/test_modules.py::OptimizedModuleTest::test_backward_hooks, test/dynamo/test_modules.py::OptimizedModuleTest::test_branch_on_nn_module_custom_bool, test/dynamo/test_modules.py::OptimizedModuleTest::test_branch_on_nn_module_custom_len, test/dynamo/test_modules.py::OptimizedModuleTest::test_buffer_order, test/dynamo/test_modules.py::OptimizedModuleTest::test_composition, test/dynamo/test_modules.py::OptimizedModuleTest::test_composition_with_opt_mod, test/dynamo/test_modules.py::OptimizedModuleTest::test_delattr_on_compiled_module, test/dynamo/test_modules.py::OptimizedModuleTest::test_dir, test/dynamo/test_modules.py::OptimizedModuleTest::test_dunder_call_explicitly, test/dynamo/test_modules.py::OptimizedModuleTest::test_globals_change_in_other_file, test/dynamo/test_modules.py::OptimizedModuleTest::test_guard_on_torch_nn_modules, test/dynamo/test_modules.py::OptimizedModuleTest::test_hooks_allowed_modules, test/dynamo/test_modules.py::OptimizedModuleTest::test_hooks_allowed_modules_compiles, test/dynamo/test_modules.py::OptimizedModuleTest::test_hooks_allowed_modules_compiles_self_contained, test/dynamo/test_modules.py::OptimizedModuleTest::test_hooks_inner, test/dynamo/test_modules.py::OptimizedModuleTest::test_hooks_outer, test/dynamo/test_modules.py::OptimizedModuleTest::test_hooks_skip_guards, test/dynamo/test_modules.py::OptimizedModuleTest::test_inline_inbuilt_nn_modules, test/dynamo/test_modules.py::OptimizedModuleTest::test_mark_static_nn_module_tensor, test/dynamo/test_modules.py::OptimizedModuleTest::test_mark_static_previously_seen_tensor, test/dynamo/test_modules.py::OptimizedModuleTest::test_mark_static_with_freezing, test/dynamo/test_modules.py::OptimizedModuleTest::test_module_dict_iter_keys, 
test/dynamo/test_modules.py::OptimizedModuleTest::test_module_dict_iter_name, test/dynamo/test_modules.py::OptimizedModuleTest::test_module_dict_iter_values, test/dynamo/test_modules.py::OptimizedModuleTest::test_module_order, test/dynamo/test_modules.py::OptimizedModuleTest::test_module_patch, test/dynamo/test_modules.py::OptimizedModuleTest::test_module_setattr, test/dynamo/test_modules.py::OptimizedModuleTest::test_monkeypatching_forward, test/dynamo/test_modules.py::OptimizedModuleTest::test_nn_module, test/dynamo/test_modules.py::OptimizedModuleTest::test_no_op_assignment, test/dynamo/test_modules.py::OptimizedModuleTest::test_no_recompile_on_nn_guarded_modules, test/dynamo/test_modules.py::OptimizedModuleTest::test_overridden_call, test/dynamo/test_modules.py::OptimizedModuleTest::test_param_order, test/dynamo/test_modules.py::OptimizedModuleTest::test_param_requires_grad, test/dynamo/test_modules.py::OptimizedModuleTest::test_patch_module, test/dynamo/test_modules.py::OptimizedModuleTest::test_recompile_limit_on_freed_module, test/dynamo/test_modules.py::OptimizedModuleTest::test_recompile_limit_on_guarded_nn_modules, test/dynamo/test_modules.py::OptimizedModuleTest::test_recursion, test/dynamo/test_modules.py::OptimizedModuleTest::test_save_and_load_all_backends, test/dynamo/test_modules.py::OptimizedModuleTest::test_save_and_load_inductor, test/dynamo/test_modules.py::OptimizedModuleTest::test_setattr_on_compiled_module, test/dynamo/test_modules.py::OptimizedModuleTest::test_specialized_module___iter__, test/dynamo/test_modules.py::OptimizedModuleTest::test_to, test/dynamo/test_modules.py::OptimizedModuleTest::test_trace_delattr, test/dynamo/test_modules.py::OptimizedModuleTest::test_udo_instance_method_as_hook, test/dynamo/test_modules.py::OptimizedModuleTest::test_unhashable_nn_submodule, test/dynamo/test_modules.py::OptimizedModuleTest::test_unspec_non_inlinable_module, test/dynamo/test_modules.py::OptimizedModuleTest::test_unspecialized_seq, 
test/dynamo/test_modules.py::OptimizedModuleTest::test_user_defined_nn_module_dynamic, test/dynamo/test_modules.py::NNModuleTestsDeviceCUDA::test_lazy_module3_cuda 2025-12-04T12:12:45.0442649Z 2025-12-04T12:12:45.0442769Z Finished dynamo/test_modules 1/1 ... [2025-12-04 12:12:45.041366][2199827.502353067], took 0.34min 2025-12-04T12:12:45.0443185Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:12:45.0477806Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:12:45.0479825Z Running inductor/test_cudacodecache 1/1 ... [2025-12-04 12:12:45.047839][2199827.508828038] 2025-12-04T12:12:45.0480028Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:12:45.0481336Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cudacodecache.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:12:45.048027] 2025-12-04T12:12:50.4318598Z 2025-12-04T12:12:50.4319531Z inductor/test_cudacodecache 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cudacodecache_1.1_fa2ffb8b5f126100_.log 2025-12-04T12:12:50.4320376Z 2025-12-04T12:12:50.4320650Z Finished inductor/test_cudacodecache 1/1 ... [2025-12-04 12:12:50.431501][2199832.8924871], took 0.09min 2025-12-04T12:12:50.4330583Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:12:50.4382106Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:12:50.4384053Z Running dynamo/test_fx_graph_runnable 1/1 ... 
[2025-12-04 12:12:50.438242][2199832.899231657] 2025-12-04T12:12:50.4384414Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:12:50.4385599Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_fx_graph_runnable.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:12:50.438434] 2025-12-04T12:14:25.5667468Z 2025-12-04T12:14:25.5668164Z dynamo/test_fx_graph_runnable 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_fx_graph_runnable_1.1_f0a268008cf4859d_.log 2025-12-04T12:14:25.5671433Z Running 17 items in this shard: test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_all_gather_collective, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_all_reduce_collective, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_basic_tensor_add, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_broadcast_add_dynamic, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_broadcast_collective, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_dtensor_compile_redistribute, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_dynamic_expression, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_dynamic_shapes_run, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_metrics_context, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_reduce_scatter_collective, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_scalar_multiply, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_toy_model_basic, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_toy_model_batch_processing, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_toy_model_dynamic_batch, 
test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_two_inputs_matmul, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_user_defined_triton_kernel, test/dynamo/test_fx_graph_runnable.py::FxGraphRunnableTest::test_user_defined_triton_kernel_autotune 2025-12-04T12:14:25.5673724Z 2025-12-04T12:14:25.5673854Z Finished dynamo/test_fx_graph_runnable 1/1 ... [2025-12-04 12:14:25.566528][2199928.027514923], took 1.59min 2025-12-04T12:14:25.5677076Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:14:25.5726352Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:14:25.5728432Z Running inductor/test_codegen_triton 1/1 ... [2025-12-04 12:14:25.572768][2199928.033757627] 2025-12-04T12:14:25.5728643Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:14:25.5730509Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_codegen_triton.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:14:25.572962] 2025-12-04T12:14:31.4972821Z 2025-12-04T12:14:31.4973958Z inductor/test_codegen_triton 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_codegen_triton_1.1_3f30f40ff35e4c9a_.log 2025-12-04T12:14:31.4975160Z Running 1 items in this shard: test/inductor/test_codegen_triton.py::TestCodegenTriton::test_config_of_sizearg 2025-12-04T12:14:31.4975701Z 2025-12-04T12:14:31.4976041Z Finished inductor/test_codegen_triton 1/1 ... 
[2025-12-04 12:14:31.496880][2199933.957864502], took 0.10min 2025-12-04T12:14:31.4987575Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:14:31.5040427Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:14:31.5040982Z Running dynamo/test_frame_init 1/1 ... [2025-12-04 12:14:31.503919][2199933.964908764] 2025-12-04T12:14:31.5041331Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:14:31.5042318Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_frame_init.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:14:31.504111] 2025-12-04T12:14:33.7223527Z 2025-12-04T12:14:33.7224425Z dynamo/test_frame_init 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_frame_init_1.1_dc5dd38a729343ee_.log 2025-12-04T12:14:33.7225239Z Running 1 items in this shard: test/dynamo/test_frame_init.py::FrameInitTests::test_frame_init 2025-12-04T12:14:33.7225556Z 2025-12-04T12:14:33.7225796Z Finished dynamo/test_frame_init 1/1 ... [2025-12-04 12:14:33.722079][2199936.183063431], took 0.04min 2025-12-04T12:14:33.7239735Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:14:33.7292637Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:14:33.7293017Z Running inductor/test_device_assert 1/1 ... 
[2025-12-04 12:14:33.729176][2199936.190165803] 2025-12-04T12:14:33.7293225Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:14:33.7295037Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_device_assert.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:14:33.729366] 2025-12-04T12:14:42.0069608Z 2025-12-04T12:14:42.0071447Z inductor/test_device_assert 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_device_assert_1.1_84b045ca398414fa_.log 2025-12-04T12:14:42.0074359Z Running 8 items in this shard: test/inductor/test_device_assert.py::TestTorchDeviceAssertTrigger::test_assert_fusion, test/inductor/test_device_assert.py::TestTorchDeviceAssertTrigger::test_assert_should_not_throw_backend_aot_eager, test/inductor/test_device_assert.py::TestTorchDeviceAssertTrigger::test_assert_should_not_throw_backend_eager, test/inductor/test_device_assert.py::TestTorchDeviceAssertTrigger::test_assert_should_not_throw_backend_inductor, test/inductor/test_device_assert.py::TestTorchDeviceAssertTrigger::test_assert_should_throw_backend_aot_eager, test/inductor/test_device_assert.py::TestTorchDeviceAssertTrigger::test_assert_should_throw_backend_eager, test/inductor/test_device_assert.py::TestTorchDeviceAssertTrigger::test_assert_should_throw_backend_inductor, test/inductor/test_device_assert.py::TestTorchDeviceAssertTrigger::test_run_assert_triton 2025-12-04T12:14:42.0076951Z 2025-12-04T12:14:42.0077201Z Finished inductor/test_device_assert 1/1 ... 
[2025-12-04 12:14:42.006633][2199944.46761917], took 0.14min 2025-12-04T12:14:42.0083200Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:14:42.0135923Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:14:42.0136614Z Running dynamo/test_skip_non_tensor 1/1 ... [2025-12-04 12:14:42.013550][2199944.474538825] 2025-12-04T12:14:42.0136896Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:14:42.0139247Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_skip_non_tensor.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:14:42.013737] 2025-12-04T12:14:44.6821302Z 2025-12-04T12:14:44.6822414Z dynamo/test_skip_non_tensor 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_skip_non_tensor_1.1_c9525db897c05681_.log 2025-12-04T12:14:44.6825015Z Running 8 items in this shard: test/dynamo/test_skip_non_tensor.py::SkipNonTensorTests::test_add_skip, test/dynamo/test_skip_non_tensor.py::SkipNonTensorTests::test_add_tensor1, test/dynamo/test_skip_non_tensor.py::SkipNonTensorTests::test_add_tensor2, test/dynamo/test_skip_non_tensor.py::SkipNonTensorTests::test_add_tensor_dict, test/dynamo/test_skip_non_tensor.py::SkipNonTensorTests::test_add_tensor_list, test/dynamo/test_skip_non_tensor.py::SkipNonTensorTests::test_custom_list, test/dynamo/test_skip_non_tensor.py::SkipNonTensorTests::test_do_not_skip_side_effects, test/dynamo/test_skip_non_tensor.py::SkipNonTensorTests::test_recursive_list 2025-12-04T12:14:44.6827076Z 2025-12-04T12:14:44.6827359Z Finished dynamo/test_skip_non_tensor 1/1 ... 
[2025-12-04 12:14:44.681729][2199947.142715317], took 0.04min 2025-12-04T12:14:44.6833953Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:14:44.6884623Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:14:44.6886521Z Running dynamo/test_skip_guard_eval_unsafe 1/1 ... [2025-12-04 12:14:44.688506][2199947.149494624] 2025-12-04T12:14:44.6886859Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:14:44.6888129Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_skip_guard_eval_unsafe.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:14:44.688696] 2025-12-04T12:14:52.1137303Z 2025-12-04T12:14:52.1138025Z dynamo/test_skip_guard_eval_unsafe 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_skip_guard_eval_unsafe_1.1_28b3a472cb459084_.log 2025-12-04T12:14:52.1139621Z Running 5 items in this shard: test/dynamo/test_skip_guard_eval_unsafe.py::RunDiffGuardTests::test_bool_recompile, test/dynamo/test_skip_guard_eval_unsafe.py::RunDiffGuardTests::test_cache_line_pickup, test/dynamo/test_skip_guard_eval_unsafe.py::RunDiffGuardTests::test_fail_on_tensor_shape_change, test/dynamo/test_skip_guard_eval_unsafe.py::RunDiffGuardTests::test_post_recompile, test/dynamo/test_skip_guard_eval_unsafe.py::RunDiffGuardTests::test_tensor_recompile 2025-12-04T12:14:52.1140536Z 2025-12-04T12:14:52.1140674Z Finished dynamo/test_skip_guard_eval_unsafe 1/1 ... 
[2025-12-04 12:14:52.113471][2199954.574457593], took 0.12min 2025-12-04T12:14:52.1146477Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:14:52.1197280Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:14:52.1199101Z Running inductor/test_decompose_mem_bound_mm 1/1 ... [2025-12-04 12:14:52.119802][2199954.580791666] 2025-12-04T12:14:52.1199329Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:14:52.1201027Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_decompose_mem_bound_mm.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:14:52.120002] 2025-12-04T12:18:41.8566621Z 2025-12-04T12:18:41.8567899Z inductor/test_decompose_mem_bound_mm 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_decompose_mem_bound_mm_1.1_0bb47488199a4fd2_.log 2025-12-04T12:18:41.8582179Z Running 37 items in this shard: test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_check_device, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_bmm_b_10240_m_2_k_2_n_2_should_decompose_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_bmm_b_10240_m_2_k_32_n_32_should_decompose_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_bmm_b_2000_m_2_k_2_n_2_should_decompose_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_bmm_cpu_b_1_m_2_k_2_n_2_should_decompose_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_bmm_cpu_b_2_m_2_k_2_n_2_should_decompose_False, 
test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_20480_k_32_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_20480_k_32_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_20480_k_5_n_2_should_decompose_True_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_20480_k_5_n_2_should_decompose_True_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_2048_k_2_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_m_2048_k_2_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_20480_k_32_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_20480_k_32_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_20480_k_5_n_2_should_decompose_True_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_20480_k_5_n_2_should_decompose_True_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_2048_k_2_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_linear_mixed_precision_m_2048_k_2_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_cpu_m_1_k_64_n_16_should_decompose_True, 
test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_cpu_m_1_k_64_n_32_should_decompose_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_cpu_m_2_k_64_n_16_should_decompose_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_20480_k_32_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_20480_k_32_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_20480_k_5_n_2_should_decompose_True_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_20480_k_5_n_2_should_decompose_True_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_2048_k_2_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_m_2048_k_2_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_20480_k_32_n_2_should_decompose_False_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_20480_k_32_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_20480_k_5_n_2_should_decompose_True_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_20480_k_5_n_2_should_decompose_True_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_2048_k_2_n_2_should_decompose_False_has_bias_False, 
test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_decompose_mm_mixed_precision_m_2048_k_2_n_2_should_decompose_False_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_dynamic_shape_decompose_addmm, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_dynamic_shape_m_20480_k_5_n_2_should_decompose_True_has_bias_False, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_dynamic_shape_m_20480_k_5_n_2_should_decompose_True_has_bias_True, test/inductor/test_decompose_mem_bound_mm.py::TestDecomposeMemMM::test_realize_input 2025-12-04T12:18:41.8591133Z 2025-12-04T12:18:41.8591295Z Finished inductor/test_decompose_mem_bound_mm 1/1 ... [2025-12-04 12:18:41.856325][2200184.317309123], took 3.83min 2025-12-04T12:18:41.8591805Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:18:41.8627022Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:18:41.8628901Z Running inductor/test_op_dtype_prop 1/1 ... [2025-12-04 12:18:41.862777][2200184.323766714] 2025-12-04T12:18:41.8629121Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:18:41.8631128Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_op_dtype_prop.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:18:41.862996] 2025-12-04T12:24:24.0336097Z 2025-12-04T12:24:24.0336884Z inductor/test_op_dtype_prop 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_op_dtype_prop_1.1_3415656bf60727c1_.log 2025-12-04T12:24:24.0420389Z Running 571 items in this shard: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_any_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_assoc_scan_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_binary_math_mixed_precision_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_codegen_upcast_to_fp32_upcast_to_fp32_False_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_codegen_upcast_to_fp32_upcast_to_fp32_True_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_constant_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_downcast_div_mod_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_True_float16_cuda, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asin_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asin_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asin_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asin_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan2_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan2_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan2_load_upcast_to_fp32_True_bfloat16_cuda, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan2_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_float16_cuda, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cos_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cos_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cos_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cos_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erf_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erf_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erf_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erf_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp2_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp2_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp2_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp2_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_float16_cuda, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_hypot_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_hypot_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_hypot_load_upcast_to_fp32_True_bfloat16_cuda, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_hypot_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_float16_cuda, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_False_bfloat16_cuda, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_float16_cuda, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sqrt_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sqrt_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sqrt_load_upcast_to_fp32_True_bfloat16_cuda, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sqrt_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_trunc_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_trunc_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_trunc_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_trunc_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_float64, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_angle_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_angle_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_angle_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_angle_cuda_int32, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_angle_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asinh_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asinh_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asinh_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asinh_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asinh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan_cuda_int64, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atanh_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atanh_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atanh_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atanh_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atanh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_and_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_and_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_and_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_left_shift_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_left_shift_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_not_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_not_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_not_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_or_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_or_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_or_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_right_shift_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_right_shift_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_xor_cuda_bool, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_xor_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_xor_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ceil_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ceil_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ceil_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ceil_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_min_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_min_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_min_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_min_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_min_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clone_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clone_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clone_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clone_cuda_int32, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clone_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cos_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cos_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cos_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cos_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cos_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_digamma_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_digamma_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_digamma_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_digamma_cuda_int32, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_digamma_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_no_rounding_mode_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_no_rounding_mode_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_no_rounding_mode_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_no_rounding_mode_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_no_rounding_mode_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_trunc_rounding_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_trunc_rounding_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_trunc_rounding_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_trunc_rounding_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_eq_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_eq_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_eq_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_eq_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_eq_cuda_int64, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erf_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erf_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erf_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erf_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erf_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfc_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfc_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfc_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfc_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfc_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp2_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp2_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp2_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp2_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp2_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_bool, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_expm1_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_expm1_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_expm1_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_expm1_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_expm1_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_frexp_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_frexp_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gcd_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gcd_cuda_int64, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_hypot_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_hypot_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_i0_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_i0_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_i0_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_i0_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_i0_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_igamma_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_igamma_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_igammac_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_igammac_cuda_float64, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_le_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_le_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_le_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_le_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_le_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_bool, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_bool, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_xor_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_xor_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_xor_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_xor_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_xor_cuda_int64, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lt_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lt_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lt_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lt_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lt_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_max_binary_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_max_binary_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_max_binary_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_max_binary_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_max_binary_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_maximum_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_maximum_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_maximum_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_maximum_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_maximum_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_int64, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_mul_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_mul_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_mul_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_mul_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_mul_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_neg_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_neg_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_neg_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_neg_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_nextafter_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_nextafter_cuda_float64, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_2_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_2_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_2_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_2_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_2_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_float32, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_4_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_4_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_4_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_4_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_4_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_pow_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_pow_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_pow_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_pow_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_remainder_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_remainder_cuda_float64, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_remainder_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_remainder_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_0_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_0_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_3_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_3_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_neg_3_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_neg_3_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_rsqrt_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_rsqrt_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_rsqrt_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_rsqrt_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_rsqrt_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_float32, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sign_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sign_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sign_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sign_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sign_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_signbit_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_signbit_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_signbit_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_signbit_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_signbit_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sin_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sin_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sin_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sin_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sin_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sinh_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sinh_cuda_float32, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sinh_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sinh_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sinh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sub_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sub_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sub_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sub_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tan_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tan_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tan_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tan_cuda_int32, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tan_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_trunc_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_trunc_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_trunc_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_trunc_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_where_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_where_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_where_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_where_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_where_cuda_int64, 
test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_support_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_False_bfloat16_cuda_bfloat16, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_False_float16_cuda_float16, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_float16_cuda_float16 2025-12-04T12:24:24.0497106Z 2025-12-04T12:24:24.0497240Z Finished inductor/test_op_dtype_prop 1/1 ... [2025-12-04 12:24:24.033860][2200526.494845246], took 5.70min 2025-12-04T12:24:24.0497642Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:24:24.0498008Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:24:24.0498237Z Running inductor/test_control_flow 4/4 ... [2025-12-04 12:24:24.039959][2200526.500948471] 2025-12-04T12:24:24.0498431Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:24:24.0498892Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_control_flow.py', '--shard-id=4', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:24:24.040188] 2025-12-04T12:35:23.5978197Z 2025-12-04T12:35:23.5979608Z inductor/test_control_flow 4/4 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_control_flow_4.4_fb4cb0ee30ab1d03_.log 2025-12-04T12:35:23.6064453Z Running 183 items in this shard: test/inductor/test_control_flow.py::CondTests::test_cond_advanced_dynamic_shapes_device_cpu, test/inductor/test_control_flow.py::CondTests::test_cond_decompose_ops_in_subgraph_recursive_device_cpu, test/inductor/test_control_flow.py::CondTests::test_cond_decompose_ops_in_subgraph_recursive_device_cuda, test/inductor/test_control_flow.py::CondTests::test_cond_nested_control_flow_device_cpu_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_select_with_input_idx_device_cpu_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_simple_control_flow_device_cuda_dynamic_False, test/inductor/test_control_flow.py::CondTests::test_cond_simple_with_int_closure_device_cuda, test/inductor/test_control_flow.py::CondTests::test_cond_subgraphs_with_parameters_device_cuda_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_unbacked_symint_closure_device_cpu_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_unbacked_symint_outer_to_inner_device_cpu, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_models_with_mixed_device_device_cuda, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_nested_control_flow_device_cpu_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_simple_control_flow_device_cpu_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_simple_control_flow_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_simple_control_flow_device_cuda_dynamic_True_autograd_True, 
test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_in_out_device_cpu_dynamic_True_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_in_out_device_cuda_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_in_out_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_in_out_mismatch_dynamic_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_in_out_mismatch_dynamic_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_ops_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_ops_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_outer_buffers_device_cpu_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_outer_buffers_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_outer_code_device_cpu_dynamic_True_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_outer_code_device_cuda_dynamic_True_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_parameters_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_parameters_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_pytree_inputs_device_cpu_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_pytree_inputs_device_cpu_dynamic_False_autograd_True, 
test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_pytree_inputs_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_sym_expr_cond_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_unbacked_symint_closure_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_zero_loop_device_cuda_dynamic_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_False_reverse_False_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_False_reverse_False_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_False_reverse_True_dim_1_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_False_reverse_True_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_False_reverse_True_dim_3_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_False_reverse_True_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_False_dim_0_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_False_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_True_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_True_dim_1_scan_length_1_autograd_False, 
test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_True_dim_1_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_True_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_True_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_True_dim_3_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_False_dim_0_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_False_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_True_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_True_dim_1_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_False_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_False_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_False_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_True_dim_0_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_True_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_True_dim_1_scan_length_1_autograd_True, 
test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_True_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_chunked_ce_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_chunked_ce_device_cpu_dynamic_True_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_chunked_ce_device_cuda_dynamic_False_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_chunked_ce_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_False_reverse_False_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_False_reverse_True_dim_0_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_False_reverse_True_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_False_reverse_True_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_False_reverse_True_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_False_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_False_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_False_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_True_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_True_dim_0_scan_length_1_autograd_True, 
test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_True_dim_0_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_True_dim_0_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_False_reverse_False_dim_0_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_False_reverse_False_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_False_reverse_True_dim_3_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_False_reverse_True_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_False_dim_1_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_False_dim_3_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_True_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_True_dim_1_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_True_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_True_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_0_pred_True_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_1_pred_False_scan_length_1_autograd_True, 
test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_1_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_3_pred_False_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_3_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_3_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_3_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_True_dim_1_pred_False_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_True_dim_1_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_True_dim_3_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_True_dim_3_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_False_dim_0_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_False_dim_1_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_False_dim_1_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_False_dim_3_pred_False_scan_length_5_autograd_True, 
test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_False_dim_3_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_0_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_3_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_0_pred_False_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_0_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_0_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_1_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_1_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_1_pred_True_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_3_pred_False_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_3_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_3_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_0_pred_True_scan_length_1_autograd_False, 
test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_0_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_1_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_3_pred_False_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_3_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_3_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_0_pred_False_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_0_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_0_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_1_pred_False_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_1_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_1_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_1_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_3_pred_False_scan_length_1_autograd_False, 
test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_3_pred_False_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_3_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_0_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_1_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_1_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_1_pred_True_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_3_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_3_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_3_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_3_pred_True_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_False_reverse_False_dim_0_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_False_reverse_False_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_False_reverse_False_dim_1_scan_length_5_autograd_False, 
test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_False_reverse_False_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_False_reverse_True_dim_0_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_True_reverse_False_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_True_reverse_True_dim_1_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_True_reverse_True_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_True_reverse_True_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_False_dim_0_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_False_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_True_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_True_dim_0_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_True_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_True_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_True_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_False_dim_1_scan_length_5_autograd_False, 
test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_False_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_True_dim_0_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_True_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_True_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_False_reverse_False_dim_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_False_reverse_True_dim_2_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_True_reverse_False_dim_0_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_True_reverse_True_dim_0_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_True_reverse_True_dim_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_False_reverse_False_dim_2_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_False_reverse_True_dim_0_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_False_reverse_True_dim_0_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_True_reverse_False_dim_0_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_True_reverse_False_dim_1_autograd_True, 
test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_True_reverse_False_dim_2_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_True_reverse_True_dim_2_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_with_clamp_device_cuda_dynamic_False_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_with_clamp_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_with_clamp_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_nested_with_cond_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::MapTests::test_map_nested_with_cond_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_pytree_in_out_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_pytree_in_out_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::MapTests::test_map_pytree_in_out_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_pytree_in_out_device_cuda_dynamic_True_autograd_True, test/inductor/test_control_flow.py::MapTests::test_map_simple_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::MapTests::test_map_simple_linear_with_view_device_cpu_dynamic_False_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_simple_linear_with_view_device_cuda_dynamic_False_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_simple_linear_with_view_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::MapTests::test_map_simple_linear_with_view_device_cuda_dynamic_True_autograd_False 2025-12-04T12:35:23.6147985Z 2025-12-04T12:35:23.6148222Z Finished inductor/test_control_flow 4/4 ... 
[2025-12-04 12:35:23.614714][2201186.07569124], took 10.99min 2025-12-04T12:35:23.6160469Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:35:23.6210884Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:35:23.6211357Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T12:35:23.6211722Z Uploading artifacts took 0.00 seconds 2025-12-04T12:35:23.6213516Z Running dynamo/test_structured_trace 1/1 ... [2025-12-04 12:35:23.621190][2201186.082175807] 2025-12-04T12:35:23.6213923Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:35:23.6215808Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_structured_trace.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:35:23.621440] 2025-12-04T12:35:47.2188591Z 2025-12-04T12:35:47.2189311Z dynamo/test_structured_trace 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_structured_trace_1.1_895ab9a3af010c17_.log 2025-12-04T12:35:47.2194610Z Running 29 items in this shard: test/dynamo/test_structured_trace.py::StructuredTraceTest::test_chromium_event, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_codecache, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_collective_schedule_empty, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_collective_schedule_real, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_compile_id_serialization_deserialization, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_compiled_autograd_attribution, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_compiled_autograd_chromium, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_compiled_autograd_id, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_cudagraphs, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_ddp_graphs, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_dump_file, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_dynamo_error, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_example_fn, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_example_training_fn, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_graph_breaks, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_graph_execution_order, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_graph_sizes_dynamic, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_guards_recompiles, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_inductor_error, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_make_fx_fail_partial, 
test/dynamo/test_structured_trace.py::StructuredTraceTest::test_recompile_user_contexts, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_recompile_user_contexts_iteration, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_recompiles, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_runtime_estimates_mixed, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_runtime_estimates_simple, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_schedule, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_tensor_metadata_logging, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_tensor_metadata_logging_dynamic_shapes, test/dynamo/test_structured_trace.py::StructuredTraceTest::test_tensor_metadata_logging_multiple_ops 2025-12-04T12:35:47.2199102Z 2025-12-04T12:35:47.2199259Z Finished dynamo/test_structured_trace 1/1 ... [2025-12-04 12:35:47.218565][2201209.679552139], took 0.39min 2025-12-04T12:35:47.2199850Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:35:47.2248397Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:35:47.2250914Z Running export/test_hop 1/1 ... [2025-12-04 12:35:47.224933][2201209.685921706] 2025-12-04T12:35:47.2251110Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:35:47.2252257Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_hop.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:35:47.225137] 2025-12-04T12:35:58.7027043Z 2025-12-04T12:35:58.7028275Z export/test_hop 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_hop_1.1_ad49ed20dd8ae3f3_.log 2025-12-04T12:35:58.7041222Z Running 44 items in this shard: test/export/test_hop.py::TestHOPCUDA::test_aot_export_auto_functionalize_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_cond_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_flex_attention_backward_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_flex_attention_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_invoke_quant_packed_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_invoke_quant_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_invoke_subgraph_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_local_map_hop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_scan_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_while_loop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_while_loop_stack_output_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_auto_functionalize_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_cond_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_flex_attention_backward_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_flex_attention_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_invoke_quant_packed_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_invoke_quant_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_invoke_subgraph_simple_cuda_float32, 
test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_local_map_hop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_scan_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_while_loop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_while_loop_stack_output_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_auto_functionalize_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_cond_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_flex_attention_backward_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_flex_attention_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_invoke_quant_packed_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_invoke_quant_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_invoke_subgraph_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_local_map_hop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_scan_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_while_loop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_while_loop_stack_output_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_auto_functionalize_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_cond_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_flex_attention_backward_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_flex_attention_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_invoke_quant_packed_simple_cuda_float32, 
test/export/test_hop.py::TestHOPCUDA::test_serialize_export_invoke_quant_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_invoke_subgraph_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_local_map_hop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_scan_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_while_loop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_while_loop_stack_output_simple_cuda_float32 2025-12-04T12:35:58.7049533Z 2025-12-04T12:35:58.7049680Z Finished export/test_hop 1/1 ... [2025-12-04 12:35:58.702586][2201221.163571676], took 0.19min 2025-12-04T12:35:58.7050294Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:35:58.7088598Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:35:58.7091647Z Running export/test_experimental 1/1 ... [2025-12-04 12:35:58.708960][2201221.169946963] 2025-12-04T12:35:58.7092123Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:35:58.7092721Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_experimental.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:35:58.709169] 2025-12-04T12:36:02.8293132Z 2025-12-04T12:36:02.8294192Z export/test_experimental 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_experimental_1.1_d1580682bbdfad3d_.log 2025-12-04T12:36:02.8302037Z Running 22 items in this shard: test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture, test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture_closure, test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture_ctx_return, test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture_custom_pytree_type, test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture_default_args, test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture_dict_keys_getitem, test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture_full_tracing_context, test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture_fx_graph_annotate_overlap_pass, test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture_side_effects, test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture_with_call_override, test/export/test_experimental.py::TestExperiment::test_dynamo_graph_capture_with_tensor_constant, test/export/test_experimental.py::TestExperiment::test_export_add_in_out_info, test/export/test_experimental.py::TestExperiment::test_export_leaf, test/export/test_experimental.py::TestExperiment::test_joint_basic, test/export/test_experimental.py::TestExperiment::test_joint_buffer_input_mutations, test/export/test_experimental.py::TestExperiment::test_joint_cifar10_backwards, test/export/test_experimental.py::TestExperiment::test_joint_dynamic, test/export/test_experimental.py::TestExperiment::test_joint_loss_index, test/export/test_experimental.py::TestExperiment::test_side_effect, test/export/test_experimental.py::TestExperiment::test_sticky_export, 
test/export/test_experimental.py::TestExperiment::test_sticky_export_dynamic, test/export/test_experimental.py::TestExperiment::test_sticky_export_nested_inp 2025-12-04T12:36:02.8306600Z 2025-12-04T12:36:02.8306807Z Finished export/test_experimental 1/1 ... [2025-12-04 12:36:02.828994][2201225.289979714], took 0.07min 2025-12-04T12:36:02.8307487Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:36:02.8356680Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:36:02.8357382Z Running export/test_export 1/1 ... [2025-12-04 12:36:02.835581][2201225.296567798] 2025-12-04T12:36:02.8357612Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:36:02.8359912Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_export.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:36:02.835797] 2025-12-04T12:36:52.8412802Z 2025-12-04T12:36:52.8419748Z export/test_export 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_export_1.1_a6a38eae1d4d93c8_.log 2025-12-04T12:36:52.8485401Z Running 470 items in this shard: test/export/test_export.py::TestDynamismExpression::test_export_assume_static_by_default, test/export/test_export.py::TestDynamismExpression::test_export_constraints_error, test/export/test_export.py::TestDynamismExpression::test_export_constraints_error_not_in_range, test/export/test_export.py::TestDynamismExpression::test_export_inline_constraints, test/export/test_export.py::TestDynamismExpression::test_export_slice_maxsize, test/export/test_export.py::TestDynamismExpression::test_export_slice_unbacked_dim1, test/export/test_export.py::TestDynamismExpression::test_export_strict_narrow_unbacked_expr, test/export/test_export.py::TestDynamismExpression::test_no_grad_param_inplace, test/export/test_export.py::TestDynamismExpression::test_reshape_view_backed_size_oblivious, test/export/test_export.py::TestExport::test__scaled_dot_product_flash_attention, test/export/test_export.py::TestExport::test_additional_inputs_constants, test/export/test_export.py::TestExport::test_allow_explicit_guards_as_runtime_asserts, test/export/test_export.py::TestExport::test_annotate_on_assert, test/export/test_export.py::TestExport::test_args_type_checked, test/export/test_export.py::TestExport::test_aten_lift_fresh_copy, test/export/test_export.py::TestExport::test_attention, test/export/test_export.py::TestExport::test_attr_assignment_extra, test/export/test_export.py::TestExport::test_automatic_constrain_size, test/export/test_export.py::TestExport::test_automatic_dynamic_shapes_constant_relation, test/export/test_export.py::TestExport::test_automatic_dynamic_shapes_linear_relation, test/export/test_export.py::TestExport::test_automatic_dynamic_shapes_simple_equality, 
test/export/test_export.py::TestExport::test_baddbmm, test/export/test_export.py::TestExport::test_basic, test/export/test_export.py::TestExport::test_basic_non_strict_fake_tensor, test/export/test_export.py::TestExport::test_basic_non_strict_real_tensor, test/export/test_export.py::TestExport::test_bincount, test/export/test_export.py::TestExport::test_buffer_util, test/export/test_export.py::TestExport::test_capture_subclass_constructor, test/export/test_export.py::TestExport::test_capture_subclass_constructor_torch_ir, test/export/test_export.py::TestExport::test_capture_subclass_wrong, test/export/test_export.py::TestExport::test_ccode_python_mod, test/export/test_export.py::TestExport::test_cdist_forward_compute_mode_zero_export, test/export/test_export.py::TestExport::test_check_specialized_int, test/export/test_export.py::TestExport::test_checks_to_constrain_range, test/export/test_export.py::TestExport::test_cleanup_dynamic_markers, test/export/test_export.py::TestExport::test_colin_unbacked_backed_vr_sub, test/export/test_export.py::TestExport::test_colon_parameter, test/export/test_export.py::TestExport::test_compiling_state, test/export/test_export.py::TestExport::test_cond_access_identical_symint_closure, test/export/test_export.py::TestExport::test_cond_branches_return_constant_int, test/export/test_export.py::TestExport::test_cond_branches_return_same_int, test/export/test_export.py::TestExport::test_cond_buffers, test/export/test_export.py::TestExport::test_cond_contains_unbacked_no_escape, test/export/test_export.py::TestExport::test_cond_int_closure, test/export/test_export.py::TestExport::test_cond_unflatten, test/export/test_export.py::TestExport::test_cond_with_module_stack_export_with, test/export/test_export.py::TestExport::test_cond_with_module_stack_export_with_unflatten, test/export/test_export.py::TestExport::test_constant_aliasing, test/export/test_export.py::TestExport::test_constant_input_naming, 
test/export/test_export.py::TestExport::test_constant_no_user_inp, test/export/test_export.py::TestExport::test_constant_output, test/export/test_export.py::TestExport::test_constant_output_dup, test/export/test_export.py::TestExport::test_constant_requires_grad_const, test/export/test_export.py::TestExport::test_constant_return, test/export/test_export.py::TestExport::test_constant_tensor_mutation, test/export/test_export.py::TestExport::test_constant_tensor_with_non_functional, test/export/test_export.py::TestExport::test_constant_tensor_with_non_functional_nested, test/export/test_export.py::TestExport::test_constrain_decomp, test/export/test_export.py::TestExport::test_constrain_size_in_eager, test/export/test_export.py::TestExport::test_constrain_size_with_constrain_value, test/export/test_export.py::TestExport::test_constrain_size_with_various_cases, test/export/test_export.py::TestExport::test_conv_dynamic, test/export/test_export.py::TestExport::test_crop_like, test/export/test_export.py::TestExport::test_cse_for_symint, test/export/test_export.py::TestExport::test_custom_op_auto_functionalize, test/export/test_export.py::TestExport::test_custom_op_auto_functionalize_pre_dispatch, test/export/test_export.py::TestExport::test_custom_op_auto_warn_pre_dispatch, test/export/test_export.py::TestExport::test_custom_op_preserve, test/export/test_export.py::TestExport::test_custom_pytree, test/export/test_export.py::TestExport::test_custom_tag_metadata_re_export, test/export/test_export.py::TestExport::test_decomp_batch_norm_functional_predispatch, test/export/test_export.py::TestExport::test_decomp_item_in_prim_after_decomposition, test/export/test_export.py::TestExport::test_decomp_item_in_prim_before_decomposition, test/export/test_export.py::TestExport::test_default_decomposition_core_cia_ops, test/export/test_export.py::TestExport::test_derived_dim_1_2, test/export/test_export.py::TestExport::test_derived_dim_basic, 
test/export/test_export.py::TestExport::test_derived_dim_integer, test/export/test_export.py::TestExport::test_derived_dim_nested, test/export/test_export.py::TestExport::test_derived_dim_out_of_order, test/export/test_export.py::TestExport::test_derived_dim_out_of_order_repeat_derived, test/export/test_export.py::TestExport::test_derived_dim_out_of_order_simplified, test/export/test_export.py::TestExport::test_derived_dim_out_of_order_simplified_repeat_non_derived, test/export/test_export.py::TestExport::test_derived_dim_repeat_derived, test/export/test_export.py::TestExport::test_detect_leak_nonstrict, test/export/test_export.py::TestExport::test_detect_leak_nonstrict_with_stacktrace, test/export/test_export.py::TestExport::test_detect_leak_strict, test/export/test_export.py::TestExport::test_device_to_dynamic, test/export/test_export.py::TestExport::test_device_to_gpu, test/export/test_export.py::TestExport::test_device_to_mutation, test/export/test_export.py::TestExport::test_device_to_mutation_float, test/export/test_export.py::TestExport::test_device_to_static, test/export/test_export.py::TestExport::test_dim_1_2, test/export/test_export.py::TestExport::test_dim_auto_and_dim, test/export/test_export.py::TestExport::test_dim_dynamic, test/export/test_export.py::TestExport::test_dim_dynamic_divisibility, test/export/test_export.py::TestExport::test_dim_dynamic_specialization, test/export/test_export.py::TestExport::test_dim_hint_range_violations, test/export/test_export.py::TestExport::test_dim_hint_ranges, test/export/test_export.py::TestExport::test_disable_forced_specializations_errors, test/export/test_export.py::TestExport::test_disable_forced_specializations_ok, test/export/test_export.py::TestExport::test_distributed_all_gather, test/export/test_export.py::TestExport::test_distributed_all_gather_into_tensor, test/export/test_export.py::TestExport::test_distributed_all_reduce, test/export/test_export.py::TestExport::test_distributed_all_to_all_single, 
test/export/test_export.py::TestExport::test_distributed_reduce_scatter_tensor, test/export/test_export.py::TestExport::test_dont_duck_size_for_auto_dynamic, test/export/test_export.py::TestExport::test_double_lifted_constants, test/export/test_export.py::TestExport::test_draft_export_checks_aliasing, test/export/test_export.py::TestExport::test_draft_export_checks_mutation, test/export/test_export.py::TestExport::test_draft_export_checks_mutation_list, test/export/test_export.py::TestExport::test_draft_export_checks_mutation_with_nan, test/export/test_export.py::TestExport::test_draft_export_fake_kernel_inference_errors, test/export/test_export.py::TestExport::test_draft_export_infers_fake_kernel, test/export/test_export.py::TestExport::test_duplicate_modules_with_non_persistent_buffers, test/export/test_export.py::TestExport::test_dynamic_lr_shift, test/export/test_export.py::TestExport::test_dynamic_shapes_bounds, test/export/test_export.py::TestExport::test_dynamic_shapes_builder_basic, test/export/test_export.py::TestExport::test_dynamic_shapes_builder_kwargs, test/export/test_export.py::TestExport::test_dynamic_shapes_builder_pytree, test/export/test_export.py::TestExport::test_dynamic_shapes_dataclass, test/export/test_export.py::TestExport::test_dynamic_shapes_inferred_basic, test/export/test_export.py::TestExport::test_dynamic_shapes_serdes_generic, test/export/test_export.py::TestExport::test_dynamic_shapes_serdes_user_errors, test/export/test_export.py::TestExport::test_dynamic_shapes_serdes_various, test/export/test_export.py::TestExport::test_dynamic_shapes_spec_with_pytree, test/export/test_export.py::TestExport::test_dynamic_shapes_wrapped_with_shape_guards, test/export/test_export.py::TestExport::test_dynamic_sym_round, test/export/test_export.py::TestExport::test_ends_of_bounds_oblivious, test/export/test_export.py::TestExport::test_enum_str, test/export/test_export.py::TestExport::test_error_does_not_reference_eager_fallback, 
test/export/test_export.py::TestExport::test_error_when_passing_mutating_primitive_op, test/export/test_export.py::TestExport::test_exception, test/export/test_export.py::TestExport::test_expand_copy_export_handles_implicit_true, test/export/test_export.py::TestExport::test_export_api_with_dynamic_shapes, test/export/test_export.py::TestExport::test_export_as_backend, test/export/test_export.py::TestExport::test_export_associative_scan_lifted_buffers, test/export/test_export.py::TestExport::test_export_associative_scan_symbol_dim, test/export/test_export.py::TestExport::test_export_associative_scan_symbol_scandim, test/export/test_export.py::TestExport::test_export_aten_to_unflatten, test/export/test_export.py::TestExport::test_export_aten_to_unflatten_subclass, test/export/test_export.py::TestExport::test_export_aten_to_unflatten_subclass_pre_dispatch, test/export/test_export.py::TestExport::test_export_cond_preserve_torch_fn_for_subgraphs, test/export/test_export.py::TestExport::test_export_cond_symbool_pred, test/export/test_export.py::TestExport::test_export_cond_warns_constant_pred, test/export/test_export.py::TestExport::test_export_custom_decomp_table_basic_pop, test/export/test_export.py::TestExport::test_export_custom_decomp_table_container_methods, test/export/test_export.py::TestExport::test_export_custom_op_lib, test/export/test_export.py::TestExport::test_export_custom_triton_kernel, test/export/test_export.py::TestExport::test_export_custom_triton_kernel_mutable, test/export/test_export.py::TestExport::test_export_cyclic_reference_leak, test/export/test_export.py::TestExport::test_export_decomp_torture_case_1, test/export/test_export.py::TestExport::test_export_decomp_torture_case_2, test/export/test_export.py::TestExport::test_export_decomps_dynamic, test/export/test_export.py::TestExport::test_export_decomps_simple, test/export/test_export.py::TestExport::test_export_dynamo_config, 
test/export/test_export.py::TestExport::test_export_for_training_run_decomp, test/export/test_export.py::TestExport::test_export_for_training_with_container_type, test/export/test_export.py::TestExport::test_export_for_training_with_dynamic_shapes, test/export/test_export.py::TestExport::test_export_for_training_with_mutation, test/export/test_export.py::TestExport::test_export_for_training_with_state_dict_hooks, test/export/test_export.py::TestExport::test_export_func_with_default_kwargs, test/export/test_export.py::TestExport::test_export_func_with_keyword_only_args, test/export/test_export.py::TestExport::test_export_func_with_kwargs, test/export/test_export.py::TestExport::test_export_func_with_pytree_kwargs, test/export/test_export.py::TestExport::test_export_func_with_var_keyword_args, test/export/test_export.py::TestExport::test_export_func_with_var_keyword_pytree_args, test/export/test_export.py::TestExport::test_export_func_with_var_postional_args, test/export/test_export.py::TestExport::test_export_function_schema, test/export/test_export.py::TestExport::test_export_graph_with_no_inputs, test/export/test_export.py::TestExport::test_export_input_mutation_bug, test/export/test_export.py::TestExport::test_export_input_mutation_dynamic_shape, test/export/test_export.py::TestExport::test_export_input_mutation_static_shape, test/export/test_export.py::TestExport::test_export_leak_compile, test/export/test_export.py::TestExport::test_export_linear_preserve_dynamic_shape, test/export/test_export.py::TestExport::test_export_max_nonstrict, test/export/test_export.py::TestExport::test_export_max_onnx_reported, test/export/test_export.py::TestExport::test_export_method, test/export/test_export.py::TestExport::test_export_mod_constraints, test/export/test_export.py::TestExport::test_export_module, test/export/test_export.py::TestExport::test_export_preserve_linear_at_aot_level, test/export/test_export.py::TestExport::test_export_preserve_linear_but_not_custom_op, 
test/export/test_export.py::TestExport::test_export_rnn_variants_with_warning, test/export/test_export.py::TestExport::test_export_scan_pytree_output, test/export/test_export.py::TestExport::test_export_script_module, test/export/test_export.py::TestExport::test_export_statically_known_true, test/export/test_export.py::TestExport::test_export_then_compile_tensor_ctor, test/export/test_export.py::TestExport::test_export_with_autocast, test/export/test_export.py::TestExport::test_export_with_fake_tensor_inputs, test/export/test_export.py::TestExport::test_export_with_fake_tensor_inputs_on_cuda_devices, test/export/test_export.py::TestExport::test_export_with_inline_constraints, test/export/test_export.py::TestExport::test_export_with_inline_constraints_complex, test/export/test_export.py::TestExport::test_export_with_set_grad_enabled, test/export/test_export.py::TestExport::test_export_with_wrong_inputs, test/export/test_export.py::TestExport::test_external_call_non_strict_real_tensor, test/export/test_export.py::TestExport::test_fake_inputs, test/export/test_export.py::TestExport::test_fake_weights, test/export/test_export.py::TestExport::test_filter_traceback_frames, test/export/test_export.py::TestExport::test_flex_attention_export, test/export/test_export.py::TestExport::test_float_conversion, test/export/test_export.py::TestExport::test_float_conversion_from_int, test/export/test_export.py::TestExport::test_fqn, test/export/test_export.py::TestExport::test_from_node_metadata_export, test/export/test_export.py::TestExport::test_full_on_scalar_tensor, test/export/test_export.py::TestExport::test_function_holding_tensor, test/export/test_export.py::TestExport::test_hints_wrapper, test/export/test_export.py::TestExport::test_hoo_inline_users_issue, test/export/test_export.py::TestExport::test_if_functional, test/export/test_export.py::TestExport::test_if_post_autograd_op_preserved, test/export/test_export.py::TestExport::test_inductor_backend_inside_nonstrict, 
test/export/test_export.py::TestExport::test_inline_script_class_method, test/export/test_export.py::TestExport::test_inline_script_class_method_recursive, test/export/test_export.py::TestExport::test_inline_script_function, test/export/test_export.py::TestExport::test_inline_script_method, test/export/test_export.py::TestExport::test_int_shape_specialization, test/export/test_export.py::TestExport::test_intermediate_shape_comp, test/export/test_export.py::TestExport::test_invalid_pytree_dynamo_graph_capture, test/export/test_export.py::TestExport::test_is_exporting, test/export/test_export.py::TestExport::test_is_nonzero, test/export/test_export.py::TestExport::test_isnonzero, test/export/test_export.py::TestExport::test_issue_113041, test/export/test_export.py::TestExport::test_issue_157289, test/export/test_export.py::TestExport::test_issue_161902, test/export/test_export.py::TestExport::test_istft_op, test/export/test_export.py::TestExport::test_keep_composite_ops_invalid, test/export/test_export.py::TestExport::test_keep_composite_ops_linear_convd, test/export/test_export.py::TestExport::test_keep_composite_ops_linear_convd_for_training_ir, test/export/test_export.py::TestExport::test_kwarg_dynamic_shapes_diff_order, test/export/test_export.py::TestExport::test_kwargs_reorder, test/export/test_export.py::TestExport::test_layer_norm_unbacked_normalized_shape, test/export/test_export.py::TestExport::test_layer_sharing, test/export/test_export.py::TestExport::test_lazy_module_kwargs, test/export/test_export.py::TestExport::test_lifted_constants, test/export/test_export.py::TestExport::test_linear_conv, test/export/test_export.py::TestExport::test_malformed_fqn_from_source_name, test/export/test_export.py::TestExport::test_map, test/export/test_export.py::TestExport::test_map_buffers, test/export/test_export.py::TestExport::test_mask_nonzero_static, test/export/test_export.py::TestExport::test_masked_select_dynamic, 
test/export/test_export.py::TestExport::test_math_pow, test/export/test_export.py::TestExport::test_mismatched_dynamic_shapes, test/export/test_export.py::TestExport::test_mixed_input, test/export/test_export.py::TestExport::test_module, test/export/test_export.py::TestExport::test_module_dict_key, test/export/test_export.py::TestExport::test_module_input, test/export/test_export.py::TestExport::test_module_input_subclasses_parameterization_nested, test/export/test_export.py::TestExport::test_module_list_slice, test/export/test_export.py::TestExport::test_module_with_dict_container_inp_out, test/export/test_export.py::TestExport::test_modules_access_for_deleted_submodule, test/export/test_export.py::TestExport::test_more_multidimensional_slicing, test/export/test_export.py::TestExport::test_multidimensional_slicing, test/export/test_export.py::TestExport::test_multinomial_dynamic, test/export/test_export.py::TestExport::test_multiple_definitions_same_name_dim, test/export/test_export.py::TestExport::test_namedtuple_input_export, test/export/test_export.py::TestExport::test_native_multi_attention_head, test/export/test_export.py::TestExport::test_nested_dynamic_shapes_spec, test/export/test_export.py::TestExport::test_nested_module, test/export/test_export.py::TestExport::test_nested_module_fake_tensor_leak, test/export/test_export.py::TestExport::test_nested_module_with_constant_buffer, test/export/test_export.py::TestExport::test_nested_module_with_init_buffer, test/export/test_export.py::TestExport::test_nested_module_with_parameter, test/export/test_export.py::TestExport::test_nn_module_stack, test/export/test_export.py::TestExport::test_nn_module_stack_shared_submodule, test/export/test_export.py::TestExport::test_no_check_is_size_error, test/export/test_export.py::TestExport::test_no_suggested_fixes_for_data_dependent_errors, test/export/test_export.py::TestExport::test_no_tensor_computation, 
test/export/test_export.py::TestExport::test_no_tensor_computation_2, test/export/test_export.py::TestExport::test_no_tensor_computation_3, test/export/test_export.py::TestExport::test_no_tensor_computation_4, test/export/test_export.py::TestExport::test_non_arg_name_dynamic_shapes_api, test/export/test_export.py::TestExport::test_non_arg_name_dynamic_shapes_api_with_container_type, test/export/test_export.py::TestExport::test_non_arg_name_dynamic_shapes_api_with_kwarg, test/export/test_export.py::TestExport::test_non_persistent_buffer, test/export/test_export.py::TestExport::test_non_strict_dynamic_shapes, test/export/test_export.py::TestExport::test_non_strict_dynamic_shapes_suggested_fixes, test/export/test_export.py::TestExport::test_none_buffers, test/export/test_export.py::TestExport::test_nonstrict_retrace_preserves_metadata, test/export/test_export.py::TestExport::test_nonzero_2, test/export/test_export.py::TestExport::test_nonzero_dynamic, test/export/test_export.py::TestExport::test_not_registered_parameter, test/export/test_export.py::TestExport::test_operator_aten_tensor_mode_variant, test/export/test_export.py::TestExport::test_output_node_name, test/export/test_export.py::TestExport::test_pad_sequence, test/export/test_export.py::TestExport::test_param_util, test/export/test_export.py::TestExport::test_partial_patched_forward, test/export/test_export.py::TestExport::test_placeholder_naming_collisions, test/export/test_export.py::TestExport::test_placeholder_naming_collisions_hoo_subgraphs, test/export/test_export.py::TestExport::test_placeholder_naming_order, test/export/test_export.py::TestExport::test_placeholder_naming_order_variadic, test/export/test_export.py::TestExport::test_placeholder_update_preserving, test/export/test_export.py::TestExport::test_predispatch_cond, test/export/test_export.py::TestExport::test_predispatch_grad_wrappers, test/export/test_export.py::TestExport::test_preserve_annotation, 
test/export/test_export.py::TestExport::test_preserve_module_call_signature_unflatten_specialization, test/export/test_export.py::TestExport::test_preserve_requires_grad_placeholders, test/export/test_export.py::TestExport::test_preserve_shape_dynamism_for_unused_inputs, test/export/test_export.py::TestExport::test_profiling_code, test/export/test_export.py::TestExport::test_python_asserts_with_sym_int, test/export/test_export.py::TestExport::test_pytree_register_data_class, test/export/test_export.py::TestExport::test_pytree_register_nested_data_class, test/export/test_export.py::TestExport::test_raise_user_error_when_guard_on_data_dependent_operation, test/export/test_export.py::TestExport::test_range_constraints_with_replacement, test/export/test_export.py::TestExport::test_real_tensor_alias_dtype_mismatch, test/export/test_export.py::TestExport::test_real_tensor_bool_cast, test/export/test_export.py::TestExport::test_real_tensor_errors_on_aliasing_custom_op, test/export/test_export.py::TestExport::test_real_tensor_for_max_op, test/export/test_export.py::TestExport::test_real_tensor_size_mismatch, test/export/test_export.py::TestExport::test_redundant_assert_max_upper_bound, test/export/test_export.py::TestExport::test_redundant_asserts, test/export/test_export.py::TestExport::test_refine_dynamic_shapes_from_suggested_fixes, test/export/test_export.py::TestExport::test_register_constant, test/export/test_export.py::TestExport::test_repeat_interleave, test/export/test_export.py::TestExport::test_replace_unbacked_with_very_large_upperbound, test/export/test_export.py::TestExport::test_replaced_unbacked_bindings, test/export/test_export.py::TestExport::test_reshape_view_helper, test/export/test_export.py::TestExport::test_retracable_ep, test/export/test_export.py::TestExport::test_retrace_pre_autograd, test/export/test_export.py::TestExport::test_run_decomposition_supports_user_input_mutation, 
test/export/test_export.py::TestExport::test_run_decompositions_keep_metadata, test/export/test_export.py::TestExport::test_run_decompositions_keep_tensor_constant_metadata, test/export/test_export.py::TestExport::test_runtime_assert_for_prim, test/export/test_export.py::TestExport::test_runtime_assert_for_prm_str, test/export/test_export.py::TestExport::test_runtime_assert_with_size, test/export/test_export.py::TestExport::test_sdpa_gqa, test/export/test_export.py::TestExport::test_sequential_slicing, test/export/test_export.py::TestExport::test_set_example_inputs, test/export/test_export.py::TestExport::test_set_grad_as_side_effect, test/export/test_export.py::TestExport::test_set_grad_empty, test/export/test_export.py::TestExport::test_set_grad_unflatten, test/export/test_export.py::TestExport::test_setgrad_lifted_tensor, test/export/test_export.py::TestExport::test_shared_submodule_nn_module_stack, test/export/test_export.py::TestExport::test_simple_export_for_training, test/export/test_export.py::TestExport::test_simple_unbacked_view, test/export/test_export.py::TestExport::test_size_input, test/export/test_export.py::TestExport::test_slice_nn_module_stack, test/export/test_export.py::TestExport::test_solver_unsupported_sympy_function, test/export/test_export.py::TestExport::test_specialize_derived_dim_roots, test/export/test_export.py::TestExport::test_split_const_gm_with_lifted_constants, test/export/test_export.py::TestExport::test_stack_trace, test/export/test_export.py::TestExport::test_stack_trace_make_fx, test/export/test_export.py::TestExport::test_state_primitives, test/export/test_export.py::TestExport::test_state_shape_attribute_assignment, test/export/test_export.py::TestExport::test_state_tensors, test/export/test_export.py::TestExport::test_static_dim_constraints, test/export/test_export.py::TestExport::test_subclass_context, test/export/test_export.py::TestExport::test_subclass_nested_attr_access, 
test/export/test_export.py::TestExport::test_subclass_nested_attr_access_complicated_metadata, test/export/test_export.py::TestExport::test_subclass_nested_attr_access_const_metadata, test/export/test_export.py::TestExport::test_subclass_nested_attr_access_const_metadata_not_top_level, test/export/test_export.py::TestExport::test_subclass_nested_attr_access_submodule, test/export/test_export.py::TestExport::test_subclasses_parameterization, test/export/test_export.py::TestExport::test_subclasses_parameterization_nested, test/export/test_export.py::TestExport::test_suggest_torch_checks_with_non_negative_check, test/export/test_export.py::TestExport::test_suggest_torch_checks_with_regular_check, test/export/test_export.py::TestExport::test_suggested_fixes_for_data_dependent_errors_basic, test/export/test_export.py::TestExport::test_suggested_fixes_for_data_dependent_errors_puzzlers, test/export/test_export.py::TestExport::test_suggested_fixes_new_roots, test/export/test_export.py::TestExport::test_sym_float_operators, test/export/test_export.py::TestExport::test_sym_or_sym_and, test/export/test_export.py::TestExport::test_sym_sqrt, test/export/test_export.py::TestExport::test_symbool_item, test/export/test_export.py::TestExport::test_symfloat_item, test/export/test_export.py::TestExport::test_symint_input_additional_inputs, test/export/test_export.py::TestExport::test_symint_input_basic, test/export/test_export.py::TestExport::test_symint_input_ranges, test/export/test_export.py::TestExport::test_symint_input_shapes_collection, test/export/test_export.py::TestExport::test_symint_input_specialization, test/export/test_export.py::TestExport::test_symint_item, test/export/test_export.py::TestExport::test_symint_output, test/export/test_export.py::TestExport::test_symint_tensor_return, test/export/test_export.py::TestExport::test_tag_ac_export, test/export/test_export.py::TestExport::test_tensor_attribute_zero_args, 
test/export/test_export.py::TestExport::test_tensor_constant_aten_to, test/export/test_export.py::TestExport::test_tensor_constant_with_wrapped_method, test/export/test_export.py::TestExport::test_to_module_with_mutated_buffer, test/export/test_export.py::TestExport::test_to_module_with_mutated_buffer_multiple, test/export/test_export.py::TestExport::test_to_module_with_mutated_buffer_multiple_update_sub_later, test/export/test_export.py::TestExport::test_tolist, test/export/test_export.py::TestExport::test_torch_check_eq_commutativity, test/export/test_export.py::TestExport::test_torch_fn, test/export/test_export.py::TestExport::test_trace_under_fake, test/export/test_export.py::TestExport::test_train_eval_on_exported_preautograd_module, test/export/test_export.py::TestExport::test_tril_dynamic_diagonal, test/export/test_export.py::TestExport::test_triu_dynamic_diagonal, test/export/test_export.py::TestExport::test_unbacked_3d_matmul, test/export/test_export.py::TestExport::test_unbacked_bincount, test/export/test_export.py::TestExport::test_unbacked_bindings_for_divisible_u_symint, test/export/test_export.py::TestExport::test_unbacked_deferred_runtime_retrace, test/export/test_export.py::TestExport::test_unbacked_expand, test/export/test_export.py::TestExport::test_unbacked_infer_size, test/export/test_export.py::TestExport::test_unbacked_kth_value, test/export/test_export.py::TestExport::test_unbacked_linear_layer_norm_input, test/export/test_export.py::TestExport::test_unbacked_noncontig_lin, test/export/test_export.py::TestExport::test_unbacked_pad, test/export/test_export.py::TestExport::test_unbacked_scalar_constructor, test/export/test_export.py::TestExport::test_unbacked_slice_forward, test/export/test_export.py::TestExport::test_unbacked_slice_simple, test/export/test_export.py::TestExport::test_unbacked_stack, test/export/test_export.py::TestExport::test_unbacked_to_cond, test/export/test_export.py::TestExport::test_unbacked_to_cond_passthrough, 
test/export/test_export.py::TestExport::test_unbacked_unsqueeze, test/export/test_export.py::TestExport::test_unflatten_asserts, test/export/test_export.py::TestExport::test_unflatten_buffer_update_child2parent_swap, test/export/test_export.py::TestExport::test_unflatten_closure, test/export/test_export.py::TestExport::test_unflatten_isinstance, test/export/test_export.py::TestExport::test_unflatten_multiple_graphs_dispatch, test/export/test_export.py::TestExport::test_unflatten_multiple_graphs_preserve_signature_no_error, test/export/test_export.py::TestExport::test_unflatten_multiple_graphs_shared_submodule, test/export/test_export.py::TestExport::test_unflatten_multiple_graphs_state, test/export/test_export.py::TestExport::test_unflatten_no_unroll, test/export/test_export.py::TestExport::test_unflatten_placeholder_update_child2parent_swap, test/export/test_export.py::TestExport::test_unflatten_placeholder_update_grandchild2cousin_swap, test/export/test_export.py::TestExport::test_unflatten_random_dag_5, test/export/test_export.py::TestExport::test_unflatten_random_dag_6, test/export/test_export.py::TestExport::test_unflatten_random_dag_buf_8, test/export/test_export.py::TestExport::test_unflatten_random_dag_const_preserving_3, test/export/test_export.py::TestExport::test_unflatten_random_dag_const_preserving_3_1, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_4, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_6, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_9, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_preserving_10, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_preserving_4, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_preserving_4_1, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_preserving_5, 
test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_preserving_7, test/export/test_export.py::TestExport::test_unflatten_random_dag_preserving_4, test/export/test_export.py::TestExport::test_unused_aliases, test/export/test_export.py::TestExport::test_unused_constant, test/export/test_export.py::TestExport::test_uplift_common_custom_meta, test/export/test_export.py::TestExport::test_uplift_common_custom_meta_with_multiple_calls, test/export/test_export.py::TestExport::test_use_embedding_twice, test/export/test_export.py::TestExport::test_user_input_and_buffer_mutation, test/export/test_export.py::TestExport::test_vmap, test/export/test_export.py::TestExport::test_vmap_custom_autograd_function, test/export/test_export.py::TestExport::test_vmap_to_assert, test/export/test_export.py::TestExport::test_where_decomp, test/export/test_export.py::TestExport::test_while_loop_assert_separation, test/export/test_export.py::TestExport::test_while_loop_index_assertions, test/export/test_export.py::TestExport::test_while_loop_simple, test/export/test_export.py::TestExport::test_while_loop_tensor_constant_idx, test/export/test_export.py::TestExport::test_wrapper_module, test/export/test_export.py::TestOneOffModelExportResult::test_assert_tensor_metadata_device_index, test/export/test_export.py::TestOneOffModelExportResult::test_constant_fqn, test/export/test_export.py::TestOneOffModelExportResult::test_constant_name, test/export/test_export.py::TestOneOffModelExportResult::test_duplicated_getitem, test/export/test_export.py::TestOneOffModelExportResult::test_export_with_dict_input_nested_in_args, test/export/test_export.py::TestOneOffModelExportResult::test_hf_logging_logger, test/export/test_export.py::TestOneOffModelExportResult::test_input_output_no_stacktrace, test/export/test_export.py::TestOneOffModelExportResult::test_int_list_output, test/export/test_export.py::TestOneOffModelExportResult::test_logging_logger, 
test/export/test_export.py::TestOneOffModelExportResult::test_nested_retrace, test/export/test_export.py::TestOneOffModelExportResult::test_none_input_output, test/export/test_export.py::TestOneOffModelExportResult::test_primitive_constant_output, test/export/test_export.py::TestOneOffModelExportResult::test_print, test/export/test_export.py::TestOneOffModelExportResult::test_print_graph_signature, test/export/test_export.py::TestOneOffModelExportResult::test_scaled_dot_product_attention_cpu, test/export/test_export.py::TestOneOffModelExportResult::test_scaled_dot_product_attention_cuda, test/export/test_export.py::TestOneOffModelExportResult::test_strict_export_with_shared_parameters, test/export/test_export.py::TestOneOffModelExportResult::test_torchrec_jagged_tensor, test/export/test_export.py::TestOneOffModelExportResult::test_unbacked_sdpa, test/export/test_export.py::TestOneOffModelExportResult::test_warning, test/export/test_export.py::TestExportCustomClass::test_export_script_module, test/export/test_export.py::TestExportCustomClass::test_export_unbacked_lt, test/export/test_export.py::TestExportCustomClass::test_int_lift_constant, test/export/test_export.py::TestExportCustomClass::test_is_fx_tracing, test/export/test_export.py::TestExportCustomClass::test_item, test/export/test_export.py::TestExportCustomClass::test_lift_custom_obj, test/export/test_export.py::TestExportCustomClass::test_preserve_cia_op, test/export/test_export.py::TestExportCustomClass::test_preserve_non_cia_op, test/export/test_export.py::TestExportCustomClass::test_unbacked_contiguous, test/export/test_export.py::TestExportCustomClass::test_unbacked_select_index 2025-12-04T12:36:52.8531207Z 2025-12-04T12:36:52.8531344Z Finished export/test_export 1/1 ... 
[2025-12-04 12:36:52.843713][2201275.30468242], took 0.83min 2025-12-04T12:36:52.8531769Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:36:52.8532142Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:36:52.8532368Z Running dynamo/test_comptime 1/1 ... [2025-12-04 12:36:52.850257][2201275.311242574] 2025-12-04T12:36:52.8532546Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:36:52.8532933Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_comptime.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:36:52.850577] 2025-12-04T12:37:00.5276287Z 2025-12-04T12:37:00.5277089Z dynamo/test_comptime 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_comptime_1.1_5c300150ebc8d3d0_.log 2025-12-04T12:37:00.5279314Z Running 12 items in this shard: test/dynamo/test_comptime.py::ComptimeTests::test_get_local, test/dynamo/test_comptime.py::ComptimeTests::test_get_local_closure_variable, test/dynamo/test_comptime.py::ComptimeTests::test_graph_break, test/dynamo/test_comptime.py::ComptimeTests::test_print_bt, test/dynamo/test_comptime.py::ComptimeTests::test_print_direct, test/dynamo/test_comptime.py::ComptimeTests::test_print_disas, test/dynamo/test_comptime.py::ComptimeTests::test_print_graph, test/dynamo/test_comptime.py::ComptimeTests::test_print_guards, test/dynamo/test_comptime.py::ComptimeTests::test_print_locals, test/dynamo/test_comptime.py::ComptimeTests::test_print_single, test/dynamo/test_comptime.py::ComptimeTests::test_print_value_stack, test/dynamo/test_comptime.py::ComptimeTests::test_sleep 2025-12-04T12:37:00.5281267Z 2025-12-04T12:37:00.5281458Z Finished dynamo/test_comptime 1/1 ... 
[2025-12-04 12:37:00.527300][2201282.988284407], took 0.13min 2025-12-04T12:37:00.5289094Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:37:00.5339897Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:37:00.5342829Z Running test_mkl_verbose 1/1 ... [2025-12-04 12:37:00.533948][2201282.994937149] 2025-12-04T12:37:00.5343064Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:37:00.5343570Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_mkl_verbose.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:37:00.534142] 2025-12-04T12:37:04.7582017Z 2025-12-04T12:37:04.7582990Z test_mkl_verbose 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_mkl_verbose_1.1_f5fabbb742dec65e_.log 2025-12-04T12:37:04.7584428Z Running 2 items in this shard: test/test_mkl_verbose.py::TestMKLVerbose::test_verbose_off, test/test_mkl_verbose.py::TestMKLVerbose::test_verbose_on 2025-12-04T12:37:04.7584994Z 2025-12-04T12:37:04.7585266Z Finished test_mkl_verbose 1/1 ... [2025-12-04 12:37:04.757843][2201287.218829134], took 0.07min 2025-12-04T12:37:04.7597109Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:37:04.7649356Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:37:04.7649888Z Running test_comparison_utils 1/1 ... 
[2025-12-04 12:37:04.764797][2201287.225786313] 2025-12-04T12:37:04.7650416Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:37:04.7651630Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_comparison_utils.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:37:04.764990] 2025-12-04T12:37:06.9826551Z 2025-12-04T12:37:06.9827246Z test_comparison_utils 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_comparison_utils_1.1_163013ce35eec01c_.log 2025-12-04T12:37:06.9828395Z Running 7 items in this shard: test/test_comparison_utils.py::TestComparisonUtils::test_all_equal_no_assert, test/test_comparison_utils.py::TestComparisonUtils::test_all_equal_no_assert_nones, test/test_comparison_utils.py::TestComparisonUtils::test_assert_device, test/test_comparison_utils.py::TestComparisonUtils::test_assert_dtype, test/test_comparison_utils.py::TestComparisonUtils::test_assert_layout, test/test_comparison_utils.py::TestComparisonUtils::test_assert_sizes, test/test_comparison_utils.py::TestComparisonUtils::test_assert_strides 2025-12-04T12:37:06.9829253Z 2025-12-04T12:37:06.9829383Z Finished test_comparison_utils 1/1 ... [2025-12-04 12:37:06.982329][2201289.443315729], took 0.04min 2025-12-04T12:37:06.9838711Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:37:06.9891298Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:37:06.9892156Z Running functorch/test_ac_logging 1/1 ... 
[2025-12-04 12:37:06.989072][2201289.45006163] 2025-12-04T12:37:06.9892351Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:37:06.9893924Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'functorch/test_ac_logging.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:37:06.989266] 2025-12-04T12:37:09.4074135Z 2025-12-04T12:37:09.4075010Z functorch/test_ac_logging 1/1 was successful, full logs can be found in artifacts with path test/test-reports/functorch.test_ac_logging_1.1_a7bbe9795434e3a8_.log 2025-12-04T12:37:09.4076590Z Running 4 items in this shard: test/functorch/test_ac_logging.py::TestAcLogging::test_create_activation_checkpointing_logging_structure_payload, test/functorch/test_ac_logging.py::TestAcLogging::test_create_joint_graph_edges, test/functorch/test_ac_logging.py::TestAcLogging::test_create_joint_graph_node_information, test/functorch/test_ac_logging.py::TestAcLogging::test_create_structured_trace_for_min_cut_info 2025-12-04T12:37:09.4077698Z 2025-12-04T12:37:09.4077943Z Finished functorch/test_ac_logging 1/1 ... [2025-12-04 12:37:09.407058][2201291.868043267], took 0.04min 2025-12-04T12:37:09.4086928Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:37:09.4138757Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:37:09.4140072Z Running test_mkldnn_verbose 1/1 ... 
[2025-12-04 12:37:09.413905][2201291.874893327] 2025-12-04T12:37:09.4140403Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:37:09.4142896Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_mkldnn_verbose.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:37:09.414125] 2025-12-04T12:37:13.3391063Z 2025-12-04T12:37:13.3392162Z test_mkldnn_verbose 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_mkldnn_verbose_1.1_440b8ef3a5228a82_.log 2025-12-04T12:37:13.3393460Z Running 2 items in this shard: test/test_mkldnn_verbose.py::TestMKLDNNVerbose::test_verbose_off, test/test_mkldnn_verbose.py::TestMKLDNNVerbose::test_verbose_on 2025-12-04T12:37:13.3394164Z 2025-12-04T12:37:13.3394483Z Finished test_mkldnn_verbose 1/1 ... [2025-12-04 12:37:13.338801][2201295.799788655], took 0.07min 2025-12-04T12:37:13.3402135Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:37:13.3453618Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:37:13.3460548Z Running test_cpp_api_parity 1/1 ... [2025-12-04 12:37:13.345318][2201295.806307899] 2025-12-04T12:37:13.3461077Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:37:13.3461484Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_cpp_api_parity.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:37:13.345513] 2025-12-04T12:38:57.1944151Z 2025-12-04T12:38:57.1944883Z test_cpp_api_parity 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_cpp_api_parity_1.1_2ccc8ecf4ca9cf21_.log 2025-12-04T12:38:57.2012191Z Running 488 items in this shard: test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCELoss_no_batch_dim_mean, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCELoss_no_batch_dim_mean_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCELoss_no_batch_dim_none, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCELoss_no_batch_dim_none_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCELoss_no_batch_dim_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCELoss_no_batch_dim_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCEWithLogitsLoss_no_batch_dim_mean, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCEWithLogitsLoss_no_batch_dim_mean_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCEWithLogitsLoss_no_batch_dim_none, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCEWithLogitsLoss_no_batch_dim_none_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCEWithLogitsLoss_no_batch_dim_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_BCEWithLogitsLoss_no_batch_dim_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_circular_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_circular_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_dilated, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_dilated_cuda, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_groups, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_groups_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad1, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad1_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad1size1, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad1size1_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad2size1, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad2size1_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad_same, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad_same2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad_same2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad_same_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad_same_dilated, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad_same_dilated_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad_valid, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_pad_valid_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_reflect_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_reflect_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_replicate_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_replicate_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_stride, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_stride_cuda, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_zero_batch, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_zero_batch_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_zeros_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv1d_zeros_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_circular_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_circular_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_depthwise, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_depthwise_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_depthwise_dilated, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_depthwise_dilated_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_depthwise_padded, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_depthwise_padded_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_depthwise_strided, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_depthwise_strided_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_depthwise_with_multiplier, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_depthwise_with_multiplier_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_dilated, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_dilated_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_groups, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_groups_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_groups_thnn, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_groups_thnn_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_no_bias, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_no_bias_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_pad_same, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_pad_same_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_pad_same_dilated, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_pad_same_dilated_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_pad_valid, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_pad_valid_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_padding, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_padding_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_reflect_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_reflect_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_replicate_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_replicate_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_strided, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_strided_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_zero_batch, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_zero_batch_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_zeros_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv2d_zeros_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_1x1x1_no_bias, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_1x1x1_no_bias_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_circular_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_circular_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_dilated, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_dilated_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_dilated_strided, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_dilated_strided_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_groups, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_groups_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_no_bias, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_no_bias_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_pad_same, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_pad_same_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_pad_same_dilated, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_pad_same_dilated_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_pad_valid, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_pad_valid_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_replicate_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_replicate_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_stride, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_stride_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_stride_padding, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_stride_padding_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_zero_batch, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_zero_batch_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_zeros_stride2_pad2, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Conv3d_zeros_stride2_pad2_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose1d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose1d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose1d_dilated, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose1d_dilated_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose1d_groups, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose1d_groups_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose1d_no_bias, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose1d_no_bias_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose2d_dilated, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose2d_dilated_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose2d_groups, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose2d_groups_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose2d_no_bias, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose2d_no_bias_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose3d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose3d_cuda, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose3d_dilated, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ConvTranspose3d_dilated_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_CosineEmbeddingLoss_no_batch_dim_mean, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_CosineEmbeddingLoss_no_batch_dim_mean_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_CosineEmbeddingLoss_no_batch_dim_none, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_CosineEmbeddingLoss_no_batch_dim_none_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_CosineEmbeddingLoss_no_batch_dim_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_CosineEmbeddingLoss_no_batch_dim_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_CrossMapLRN2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_CrossMapLRN2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Embedding, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_discontiguous, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_discontiguous_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_max, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_max_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_max_padding_idx, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_max_padding_idx_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_mean, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_mean_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_mean_padding_idx, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_mean_padding_idx_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_sparse, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_sparse_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_sum_padding_idx, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_EmbeddingBag_sum_padding_idx_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Embedding_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Embedding_discontiguous, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Embedding_discontiguous_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Embedding_sparse, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Embedding_sparse_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Flatten, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Flatten_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Flatten_no_batch_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Flatten_no_batch_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Fold, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Fold_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Fold_int_input, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Fold_int_input_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Fold_no_batch_dim_input, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Fold_no_batch_dim_input_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Fold_no_batch_dim_int_input, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Fold_no_batch_dim_int_input_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_HingeEmbeddingLoss_no_batch_dim_mean, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_HingeEmbeddingLoss_no_batch_dim_mean_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_HingeEmbeddingLoss_no_batch_dim_none, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_HingeEmbeddingLoss_no_batch_dim_none_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_HingeEmbeddingLoss_no_batch_dim_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_HingeEmbeddingLoss_no_batch_dim_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_LayerNorm_3d_no_affine_large_feature, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_LayerNorm_3d_no_affine_large_feature_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Linear, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Linear_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Linear_no_batch_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Linear_no_batch_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Linear_no_bias, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Linear_no_bias_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MarginRankingLoss_no_batch_dim_mean, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MarginRankingLoss_no_batch_dim_mean_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MarginRankingLoss_no_batch_dim_none, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MarginRankingLoss_no_batch_dim_none_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MarginRankingLoss_no_batch_dim_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MarginRankingLoss_no_batch_dim_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelMarginLoss_no_batch_dim_mean, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelMarginLoss_no_batch_dim_mean_cuda, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelMarginLoss_no_batch_dim_none, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelMarginLoss_no_batch_dim_none_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelMarginLoss_no_batch_dim_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelMarginLoss_no_batch_dim_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelSoftMarginLoss_no_batch_dim_mean, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelSoftMarginLoss_no_batch_dim_none, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelSoftMarginLoss_no_batch_dim_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_NLLLoss_no_batch_dim_mean, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_NLLLoss_no_batch_dim_mean_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_NLLLoss_no_batch_dim_none, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_NLLLoss_no_batch_dim_none_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_NLLLoss_no_batch_dim_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_NLLLoss_no_batch_dim_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PairwiseDistance, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PairwiseDistance_broadcast_lhs, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PairwiseDistance_broadcast_lhs_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PairwiseDistance_broadcast_rhs, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PairwiseDistance_broadcast_rhs_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PairwiseDistance_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PairwiseDistance_no_batch_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PairwiseDistance_no_batch_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PairwiseDistance_with_non_default_args, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PairwiseDistance_with_non_default_args_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PixelShuffle, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PixelShuffle_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PixelUnshuffle, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_PixelUnshuffle_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_RReLU, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_RReLU_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_RReLU_with_up_down, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_RReLU_with_up_down_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_RReLU_with_up_down_scalar, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_RReLU_with_up_down_scalar_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ReplicationPad3d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ReplicationPad3d_complex, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ReplicationPad3d_complex_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ReplicationPad3d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ReplicationPad3d_no_batch_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_ReplicationPad3d_no_batch_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_SampleModule_has_parity, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_SampleModule_has_parity_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_SampleModule_no_parity, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_SampleModule_no_parity_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_SoftMarginLoss_no_batch_dim_mean, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_SoftMarginLoss_no_batch_dim_mean_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_SoftMarginLoss_no_batch_dim_none, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_SoftMarginLoss_no_batch_dim_none_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_SoftMarginLoss_no_batch_dim_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_SoftMarginLoss_no_batch_dim_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TransformerDecoderLayer_gelu_activation, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TransformerDecoderLayer_gelu_activation_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TransformerDecoderLayer_relu_activation, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TransformerDecoderLayer_relu_activation_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TransformerEncoderLayer_gelu_activation, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TransformerEncoderLayer_gelu_activation_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TransformerEncoderLayer_relu_activation, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TransformerEncoderLayer_relu_activation_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Transformer_multilayer_coder, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Transformer_multilayer_coder_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TripletMarginLoss_no_batch_dim_mean, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TripletMarginLoss_no_batch_dim_mean_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TripletMarginLoss_no_batch_dim_none, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TripletMarginLoss_no_batch_dim_none_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TripletMarginLoss_no_batch_dim_sum, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_TripletMarginLoss_no_batch_dim_sum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Unflatten_no_batch_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Unflatten_no_batch_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Unfold, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Unfold_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Unfold_int_input, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_Unfold_int_input_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCELoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCELoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCELoss_no_reduce_scalar, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCELoss_no_reduce_scalar_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCELoss_weights_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCELoss_weights_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCELoss_weights_no_reduce_scalar, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCELoss_weights_no_reduce_scalar_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCEWithLogitsLoss_legacy_enum, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCEWithLogitsLoss_legacy_enum_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCEWithLogitsLoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCEWithLogitsLoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCEWithLogitsLoss_no_reduce_scalar, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_BCEWithLogitsLoss_no_reduce_scalar_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_HingeEmbeddingLoss_margin_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_HingeEmbeddingLoss_margin_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_HingeEmbeddingLoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_HingeEmbeddingLoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_HuberLoss_delta, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_HuberLoss_delta_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_no_reduce_log_target, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_no_reduce_log_target_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_no_reduce_scalar, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_no_reduce_scalar_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_no_reduce_scalar_log_target, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_no_reduce_scalar_log_target_cuda, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_with_log_target_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_with_log_target_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_with_target_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_KLDivLoss_with_target_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_L1Loss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_L1Loss_no_reduce_complex, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_L1Loss_no_reduce_complex_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_L1Loss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_L1Loss_no_reduce_scalar, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_L1Loss_no_reduce_scalar_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MSELoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MSELoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MSELoss_no_reduce_scalar, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MSELoss_no_reduce_scalar_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelMarginLoss_0d_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelMarginLoss_0d_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelMarginLoss_1d_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelMarginLoss_1d_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelMarginLoss_index_neg, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelMarginLoss_index_neg_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelMarginLoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelMarginLoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelSoftMarginLoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelSoftMarginLoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelSoftMarginLoss_weights_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiLabelSoftMarginLoss_weights_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiMarginLoss_1d_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiMarginLoss_1d_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiMarginLoss_margin_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiMarginLoss_margin_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiMarginLoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiMarginLoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiMarginLoss_p_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiMarginLoss_p_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiMarginLoss_weights_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_MultiMarginLoss_weights_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss2d_no_reduce, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss2d_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss2d_no_reduce_ignore_index, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss2d_no_reduce_ignore_index_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss2d_no_reduce_weights, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss2d_no_reduce_weights_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLossNd_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLossNd_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLossNd_no_reduce_ignore_index, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLossNd_no_reduce_ignore_index_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLossNd_no_reduce_weights, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLossNd_no_reduce_weights_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss_no_reduce_ignore_index, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss_no_reduce_ignore_index_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss_no_reduce_weights, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss_no_reduce_weights_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss_no_reduce_weights_ignore_index, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss_no_reduce_weights_ignore_index_cuda, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss_no_reduce_weights_ignore_index_neg, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_NLLLoss_no_reduce_weights_ignore_index_neg_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_PoissonNLLLoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_PoissonNLLLoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_SmoothL1Loss_beta, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_SmoothL1Loss_beta_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_SmoothL1Loss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_SmoothL1Loss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_SmoothL1Loss_no_reduce_scalar, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_SmoothL1Loss_no_reduce_scalar_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_SmoothL1Loss_zero_beta, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_SmoothL1Loss_zero_beta_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_SoftMarginLoss_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_SoftMarginLoss_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_2d_zero_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_2d_zero_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_scale_2d, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_scale_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_scale_tuple_shared_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_scale_tuple_shared_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_scale_tuple_skewed_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_scale_tuple_skewed_2d_align_corners, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_scale_tuple_skewed_2d_align_corners_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_scale_tuple_skewed_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_tuple_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_tuple_2d_align_corners, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_tuple_2d_align_corners_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bicubic_tuple_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_2d_zero_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_2d_zero_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_scale_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_scale_2d_cuda, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_scale_tuple_shared_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_scale_tuple_shared_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_scale_tuple_skewed_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_scale_tuple_skewed_2d_align_corners, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_scale_tuple_skewed_2d_align_corners_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_scale_tuple_skewed_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_tuple_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_tuple_2d_align_corners, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_tuple_2d_align_corners_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_bilinear_tuple_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_1d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_1d_align_corners, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_1d_align_corners_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_1d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_1d_zero_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_1d_zero_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_scale_1d, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_scale_1d_align_corners, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_scale_1d_align_corners_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_scale_1d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_tuple_1d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_linear_tuple_1d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_1d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_1d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_1d_zero_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_1d_zero_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_2d_launch_configs, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_2d_launch_configs_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_2d_zero_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_2d_zero_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_3d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_3d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_3d_zero_dim, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_3d_zero_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_scale_1d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_scale_1d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_scale_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_scale_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_scale_3d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_scale_3d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_tuple_1d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_tuple_1d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_tuple_2d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_tuple_2d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_tuple_3d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_nearest_tuple_3d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_3d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_3d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_3d_zero_dim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_3d_zero_dim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_scale_3d, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_scale_3d_align_corners, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_scale_3d_align_corners_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_scale_3d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_tuple_3d, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_tuple_3d_align_corners, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_tuple_3d_align_corners_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_interpolate_trilinear_tuple_3d_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_dim0, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_dim0_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_dim3, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_dim3_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_lastdim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_lastdim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_scalar, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_scalar_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_spatial, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_spatial_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_spatial_special, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_log_softmax_spatial_special_cuda, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_multimarginloss_1d_input_0d_target_no_reduce, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_multimarginloss_1d_input_0d_target_no_reduce_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_sample_functional_has_parity, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_sample_functional_has_parity_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_sample_functional_no_parity, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_sample_functional_no_parity_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_functional_dim0, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_functional_dim0_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_functional_dim3, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_functional_dim3_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_functional_scalar, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_functional_scalar_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_lastdim, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_lastdim_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_lastdim_dtype, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_lastdim_dtype_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_spatial, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_spatial_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_spatial_dtype, 
test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_spatial_dtype_cuda, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_spatial_special, test/test_cpp_api_parity.py::TestCppApiParity::test_torch_nn_functional_softmax_spatial_special_cuda
2025-12-04T12:38:57.2071679Z
2025-12-04T12:38:57.2071800Z Finished test_cpp_api_parity 1/1 ... [2025-12-04 12:38:57.194589][2201399.655573995], took 1.73min
2025-12-04T12:38:57.2072188Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T12:38:57.2072542Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T12:38:57.2072816Z Running nn/attention/test_open_registry 1/1 ... [2025-12-04 12:38:57.201393][2201399.662381823]
2025-12-04T12:38:57.2073016Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T12:38:57.2073424Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'nn/attention/test_open_registry.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:38:57.201590]
2025-12-04T12:38:59.3690746Z
2025-12-04T12:38:59.3691684Z nn/attention/test_open_registry 1/1 was successful, full logs can be found in artifacts with path test/test-reports/nn.attention.test_open_registry_1.1_f18e4ccba5eb8a46_.log
2025-12-04T12:38:59.3692832Z Running 2 items in this shard: test/nn/attention/test_open_registry.py::TestFlashAttentionRegistry::test_activate_unknown_impl_errors, test/nn/attention/test_open_registry.py::TestFlashAttentionRegistry::test_register_and_activate_impl
2025-12-04T12:38:59.3693497Z
2025-12-04T12:38:59.3694020Z Finished nn/attention/test_open_registry 1/1 ... [2025-12-04 12:38:59.368829][2201401.829815265], took 0.04min
2025-12-04T12:38:59.3709044Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T12:38:59.3758587Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T12:38:59.3761259Z Running test_as_strided 1/1 ... [2025-12-04 12:38:59.375917][2201401.836906399]
2025-12-04T12:38:59.3761542Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T12:38:59.3762354Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_as_strided.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:38:59.376107]
2025-12-04T12:39:01.7943066Z
2025-12-04T12:39:01.7943987Z test_as_strided 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_as_strided_1.1_2fbf19ea0f9f7e66_.log
2025-12-04T12:39:01.7944928Z Running 2 items in this shard: test/test_as_strided.py::TestAsStrided::test_size_10_exhaustive, test/test_as_strided.py::TestAsStrided::test_subset_property
2025-12-04T12:39:01.7945417Z
2025-12-04T12:39:01.7945629Z Finished test_as_strided 1/1 ... [2025-12-04 12:39:01.793985][2201404.254970712], took 0.04min
2025-12-04T12:39:01.7961513Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T12:39:01.8011657Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T12:39:01.8013820Z Running test_proxy_tensor 1/1 ... [2025-12-04 12:39:01.801173][2201404.262161985]
2025-12-04T12:39:01.8014145Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T12:39:01.8014958Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_proxy_tensor.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:39:01.801368]
2025-12-04T12:39:40.8337480Z
2025-12-04T12:39:40.8341936Z test_proxy_tensor 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_proxy_tensor_1.1_79c4a4e56a13d2fb_.log
2025-12-04T12:39:40.8369738Z Running 176 items in this shard: test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_T244632748, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_allclose, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_amp_cache, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_constant_blowup, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_constant_proxy_tensor_mut, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_constant_random, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_constant_unbind, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_decomp_of_capture, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_decomposition_interpreter, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_empty_like_doesnt_burn_in_defaults, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_inplace_metadata, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_isolated_graphmodule, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_model_double_param, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_model_fwd_bwd, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_model_fwd_bwd_wgtupdate, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_overloads, 
test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_reentrant_dispatch, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_simple, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_mode_tracing_factory_function, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_partial_decomp, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pickle_issue89626, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pr_86917, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pre_dispatch_functionalization, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pre_dispatch_functionalization_view_op, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pre_dispatch_linear, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pre_dispatch_mode_stack, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pre_dispatch_no_grad, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_proxy_tensor, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_proxy_tensor_mode_with_decomp_table_preserves_proxy, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_resnet18_backward_trace, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_scalar_device, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_strides, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_tensor_constants, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_trace_subclasses, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_val_metadata_mutation, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_varargs, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_T244632748, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_allclose, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_amp_cache, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_constant_blowup, 
test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_constant_proxy_tensor_mut, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_constant_random, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_constant_unbind, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_decomp_of_capture, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_decomposition_interpreter, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_empty_like_doesnt_burn_in_defaults, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_inplace_metadata, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_isolated_graphmodule, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_model_double_param, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_model_fwd_bwd, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_model_fwd_bwd_wgtupdate, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_overloads, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_reentrant_dispatch, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_simple, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_mode_tracing_factory_function, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_partial_decomp, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pickle_issue89626, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pr_86917, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pre_dispatch_functionalization, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pre_dispatch_functionalization_view_op, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pre_dispatch_linear, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pre_dispatch_mode_stack, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pre_dispatch_no_grad, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_proxy_tensor, 
test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_proxy_tensor_mode_with_decomp_table_preserves_proxy, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_resnet18_backward_trace, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_scalar_device, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_strides, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_tensor_constants, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_trace_subclasses, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_val_metadata_mutation, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_varargs, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_T244632748, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_allclose, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_amp_cache, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_constant_blowup, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_constant_proxy_tensor_mut, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_constant_random, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_constant_unbind, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_decomp_of_capture, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_decomposition_interpreter, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_empty_like_doesnt_burn_in_defaults, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_inplace_metadata, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_isolated_graphmodule, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_model_double_param, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_model_fwd_bwd, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_model_fwd_bwd_wgtupdate, 
test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_overloads, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_reentrant_dispatch, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_simple, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_mode_tracing_factory_function, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_partial_decomp, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pickle_issue89626, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pr_86917, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pre_dispatch_functionalization, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pre_dispatch_functionalization_view_op, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pre_dispatch_linear, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pre_dispatch_mode_stack, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pre_dispatch_no_grad, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_proxy_tensor, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_proxy_tensor_mode_with_decomp_table_preserves_proxy, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_resnet18_backward_trace, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_scalar_device, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_strides, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_tensor_constants, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_trace_subclasses, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_val_metadata_mutation, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_varargs, test/test_proxy_tensor.py::TestRealProxyTensor::test_error_on_data_dependent_ops, test/test_proxy_tensor.py::TestFakeProxyTensor::test_alias, 
test/test_proxy_tensor.py::TestFakeProxyTensor::test_fake_tensor_mode, test/test_proxy_tensor.py::TestFakeProxyTensor::test_free_fake, test/test_proxy_tensor.py::TestFakeProxyTensor::test_fused_adam, test/test_proxy_tensor.py::TestFakeProxyTensor::test_issue82547, test/test_proxy_tensor.py::TestFakeProxyTensor::test_meta, test/test_proxy_tensor.py::TestFakeProxyTensor::test_use_fake_and_tensor, test/test_proxy_tensor.py::TestSymbolicTracing::test_adv_index_batch, test/test_proxy_tensor.py::TestSymbolicTracing::test_arange_unbacked_output_size, test/test_proxy_tensor.py::TestSymbolicTracing::test_binary_broadcast, test/test_proxy_tensor.py::TestSymbolicTracing::test_boolean_index, test/test_proxy_tensor.py::TestSymbolicTracing::test_broadcast_shapes, test/test_proxy_tensor.py::TestSymbolicTracing::test_cat, test/test_proxy_tensor.py::TestSymbolicTracing::test_constant_specialization, test/test_proxy_tensor.py::TestSymbolicTracing::test_cpu_scalar_cuda, test/test_proxy_tensor.py::TestSymbolicTracing::test_cumsum_unbacked, test/test_proxy_tensor.py::TestSymbolicTracing::test_debug_interpreter, test/test_proxy_tensor.py::TestSymbolicTracing::test_deduped_shape, test/test_proxy_tensor.py::TestSymbolicTracing::test_dynamic_pointwise_scalar, test/test_proxy_tensor.py::TestSymbolicTracing::test_elementwise_meta_with_sym_numbers, test/test_proxy_tensor.py::TestSymbolicTracing::test_expand, test/test_proxy_tensor.py::TestSymbolicTracing::test_fake_tensor_as_size, test/test_proxy_tensor.py::TestSymbolicTracing::test_guard_lowerbound_range_refinement, test/test_proxy_tensor.py::TestSymbolicTracing::test_guard_lowerbound_range_refinement_multivariate, test/test_proxy_tensor.py::TestSymbolicTracing::test_guard_upperbound_range_refinement, test/test_proxy_tensor.py::TestSymbolicTracing::test_guard_upperbound_range_refinement_multivariate, test/test_proxy_tensor.py::TestSymbolicTracing::test_guards_equal, test/test_proxy_tensor.py::TestSymbolicTracing::test_int_input, 
test/test_proxy_tensor.py::TestSymbolicTracing::test_invalidate_nonzero, test/test_proxy_tensor.py::TestSymbolicTracing::test_invalidate_nonzero_propagate_real_tensors, test/test_proxy_tensor.py::TestSymbolicTracing::test_item, test/test_proxy_tensor.py::TestSymbolicTracing::test_item_to_constructor, test/test_proxy_tensor.py::TestSymbolicTracing::test_make_fx_with_custom_tracer_preserving_nn_module_stack, test/test_proxy_tensor.py::TestSymbolicTracing::test_mega_guard, test/test_proxy_tensor.py::TestSymbolicTracing::test_metadata, test/test_proxy_tensor.py::TestSymbolicTracing::test_metadata_fresh, test/test_proxy_tensor.py::TestSymbolicTracing::test_mod_gcd_unbacked, test/test_proxy_tensor.py::TestSymbolicTracing::test_multiply_shape, test/test_proxy_tensor.py::TestSymbolicTracing::test_neg_shape, test/test_proxy_tensor.py::TestSymbolicTracing::test_new_empty, test/test_proxy_tensor.py::TestSymbolicTracing::test_non_deduped_shape, test/test_proxy_tensor.py::TestSymbolicTracing::test_non_symint_size_spec, test/test_proxy_tensor.py::TestSymbolicTracing::test_nonidentity_transitive_guards, test/test_proxy_tensor.py::TestSymbolicTracing::test_reflect_r_over_x, test/test_proxy_tensor.py::TestSymbolicTracing::test_repeat_interleave, test/test_proxy_tensor.py::TestSymbolicTracing::test_repeat_interleave_unbacked_output_size, test/test_proxy_tensor.py::TestSymbolicTracing::test_reshape_divisibility_unbacked, test/test_proxy_tensor.py::TestSymbolicTracing::test_resize_from_zero, test/test_proxy_tensor.py::TestSymbolicTracing::test_return_symint, test/test_proxy_tensor.py::TestSymbolicTracing::test_rmethod, test/test_proxy_tensor.py::TestSymbolicTracing::test_setitem_symint, test/test_proxy_tensor.py::TestSymbolicTracing::test_size_with_tensor, test/test_proxy_tensor.py::TestSymbolicTracing::test_split_unbacked_sizes, test/test_proxy_tensor.py::TestSymbolicTracing::test_sqrt_size, test/test_proxy_tensor.py::TestSymbolicTracing::test_sym_storage_offset, 
test/test_proxy_tensor.py::TestSymbolicTracing::test_symbolic_repeat_interleave, test/test_proxy_tensor.py::TestSymbolicTracing::test_symint_to_tensor, test/test_proxy_tensor.py::TestSymbolicTracing::test_tensor_symfloat, test/test_proxy_tensor.py::TestSymbolicTracing::test_unary, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_batch_resnet, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_slice, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_unification, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_unify_dependency_violation, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_unify_guard, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_unify_guard_transitivity, test/test_proxy_tensor.py::TestSymbolicTracing::test_view_divisibility_unbacked, test/test_proxy_tensor.py::TestSymbolicTracing::test_view_divisibility_unbacked_relatively_prime 2025-12-04T12:39:40.8388792Z 2025-12-04T12:39:40.8388899Z Finished test_proxy_tensor 1/1 ... [2025-12-04 12:39:40.833550][2201443.294536892], took 0.65min 2025-12-04T12:39:40.8389326Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:39:40.8400928Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:39:40.8402773Z Running test_matmul_cuda 1/1 ... [2025-12-04 12:39:40.840168][2201443.301157182] 2025-12-04T12:39:40.8402983Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:39:40.8404586Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_matmul_cuda.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:39:40.840367] 2025-12-04T12:47:20.2348145Z 2025-12-04T12:47:20.2349078Z test_matmul_cuda 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_matmul_cuda_1.1_519d8b2e48f8f6b7_.log 2025-12-04T12:47:20.2668949Z Running 1584 items in this shard: test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size0_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size0_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_1_broadcast_self_False_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_False_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_addmm_baddmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_1_broadcast_self_True_high_precision_self_True_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_alignment_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_bias_shapes_size_128_backend_cublas_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_bias_shapes_size_128_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_bias_shapes_size_128_backend_cublas_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_bias_shapes_size_128_backend_cublaslt_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_bias_shapes_size_128_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_bias_shapes_size_128_backend_cublaslt_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_no_reduced_precision_small_size_4_size_32768_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_no_reduced_precision_small_size_4_size_32768_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_no_reduced_precision_small_size_8_size_32768_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_no_reduced_precision_small_size_8_size_32768_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_10000_backend_cublas_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_10000_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_10000_backend_cublaslt_cuda_bfloat16, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_10000_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_1000_backend_cublas_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_1000_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_1000_backend_cublaslt_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_1000_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_100_backend_cublas_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_100_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_100_backend_cublaslt_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_fp16_accumulate_size_100_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_10000_backend_cublas_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_10000_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_10000_backend_cublaslt_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_10000_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_1000_backend_cublas_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_1000_backend_cublas_cuda_float16, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_1000_backend_cublaslt_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_1000_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_100_backend_cublas_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_100_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_100_backend_cublaslt_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_reduced_precision_size_100_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_10000_backend_cublas_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_10000_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_10000_backend_cublas_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_10000_backend_cublaslt_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_10000_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_10000_backend_cublaslt_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_1000_backend_cublas_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_1000_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_1000_backend_cublas_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_1000_backend_cublaslt_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_1000_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_1000_backend_cublaslt_cuda_float32, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_100_backend_cublas_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_100_backend_cublas_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_100_backend_cublas_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_100_backend_cublaslt_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_100_backend_cublaslt_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_addmm_size_100_backend_cublaslt_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_and_lt_reduced_precision_fp16_accumulate_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_1_10000_10000_10000_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_1_10000_10000_10000_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_1_10000_10000_10000_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_1_10000_1000_10000_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_1_10000_1000_10000_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_1_10000_1000_10000_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_2_1000_1000_1000_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_2_1000_1000_1000_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_2_1000_1000_1000_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_2_100_100_100_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_2_100_100_100_cuda_float16, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_baddbmm_large_input_2_100_100_100_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_batch_invariance_blackwell_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_batch_invariance_blackwell_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_1024_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_1024_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_1024_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_128_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_128_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_128_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_2048_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_2048_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_2048_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_256_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_256_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_256_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_32_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_32_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_32_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_4096_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_4096_cuda_float16, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_4096_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_512_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_512_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_512_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_64_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_64_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_64_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_8192_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_8192_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_cublas_deterministic_shape_8192_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_fp16_accum_and_fp32_out_failure_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_fp16_accum_and_fp32_out_failure_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_fp16_accum_and_fp32_out_failure_batch_size_32_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_fp16_accum_and_fp32_out_failure_batch_size_32_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_greencontext_carveout_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_False_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_False_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_False_b_row_major_False_cuda_float32, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_False_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_False_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_False_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_True_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_True_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_True_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_True_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_True_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_False_a_row_major_True_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_False_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_False_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_False_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_False_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_False_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_False_b_row_major_True_cuda_float32, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_True_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_True_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_True_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_True_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_True_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_2d_strided_True_a_row_major_True_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_False_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_False_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_False_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_False_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_False_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_False_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_True_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_True_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_True_b_row_major_False_cuda_float32, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_True_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_True_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_False_a_row_major_True_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_False_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_False_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_False_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_False_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_False_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_False_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_True_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_True_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_True_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_True_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_True_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_2d_3d_strided_True_a_row_major_True_b_row_major_True_cuda_float32, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_False_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_False_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_False_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_False_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_False_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_False_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_True_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_True_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_True_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_True_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_True_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_False_a_row_major_True_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_False_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_False_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_False_b_row_major_False_cuda_float32, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_False_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_False_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_False_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_True_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_True_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_True_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_True_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_True_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_2d_strided_True_a_row_major_True_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_False_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_False_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_False_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_False_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_False_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_False_b_row_major_True_cuda_float32, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_True_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_True_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_True_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_True_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_True_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_False_a_row_major_True_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_False_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_False_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_False_b_row_major_False_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_False_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_False_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_False_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_True_b_row_major_False_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_True_b_row_major_False_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_True_b_row_major_False_cuda_float32, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_True_b_row_major_True_cuda_bfloat16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_True_b_row_major_True_cuda_float16, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_3d_3d_strided_True_a_row_major_True_b_row_major_True_cuda_float32, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/2d_a_row_major_False_b_row_major_False_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/2d_a_row_major_False_b_row_major_False_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/2d_a_row_major_False_b_row_major_True_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/2d_a_row_major_False_b_row_major_True_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/2d_a_row_major_True_b_row_major_False_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/2d_a_row_major_True_b_row_major_False_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/2d_a_row_major_True_b_row_major_True_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/2d_a_row_major_True_b_row_major_True_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/3d_a_row_major_False_b_row_major_False_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/3d_a_row_major_False_b_row_major_False_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/3d_a_row_major_False_b_row_major_True_max_autotune_False_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/3d_a_row_major_False_b_row_major_True_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/3d_a_row_major_True_b_row_major_False_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/3d_a_row_major_True_b_row_major_False_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/3d_a_row_major_True_b_row_major_True_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_2d/3d_a_row_major_True_b_row_major_True_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/2d_a_row_major_False_b_row_major_False_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/2d_a_row_major_False_b_row_major_False_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/2d_a_row_major_False_b_row_major_True_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/2d_a_row_major_False_b_row_major_True_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/2d_a_row_major_True_b_row_major_False_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/2d_a_row_major_True_b_row_major_False_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/2d_a_row_major_True_b_row_major_True_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/2d_a_row_major_True_b_row_major_True_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/3d_a_row_major_False_b_row_major_False_max_autotune_False_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/3d_a_row_major_False_b_row_major_False_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/3d_a_row_major_False_b_row_major_True_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/3d_a_row_major_False_b_row_major_True_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/3d_a_row_major_True_b_row_major_False_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/3d_a_row_major_True_b_row_major_False_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/3d_a_row_major_True_b_row_major_True_max_autotune_False_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_grouped_gemm_compiled_op_3d/3d_a_row_major_True_b_row_major_True_max_autotune_True_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_input_dimension_checking_out_dtype_ops0_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_input_dimension_checking_out_dtype_ops1_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_input_dimension_checking_out_dtype_ops2_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_input_dimension_checking_out_dtype_ops3_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_1_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_1_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_64_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_32_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_1_N_64_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_1_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_32_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_32_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_32_N_64_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_1_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_64_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_32_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_bfloat16_M_64_N_64_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_32_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_1_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_1_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_32_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_1_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_1_N_64_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_64_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_1_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_32_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_32_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_32_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_32_N_64_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_1_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_1_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_64_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_32_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_64_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float16_M_64_N_64_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_1_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_32_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_32_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_1_N_64_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_1_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_1_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_64_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_32_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_32_N_64_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_32_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_1_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_1_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_1_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_64_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_32_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_1_batch_size0_backend_cublas_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_1_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_1_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_32_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_32_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_16_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_32_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_64_batch_size0_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_64_batch_size0_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_16_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_16_backend_cublaslt_cuda, 
test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_1_backend_cublas_cuda, test/test_matmul_cuda.py::TestMatmulCudaCUDA::test_mm_bmm_dtype_overload_float32_M_64_N_64_K_64_batch_size_1_backend_cublaslt_cuda, test/test_matmul_cuda.py::TestMixedDtypesLinearCudaCUDA::test_mixed_dtypes_linear_cuda_bfloat16, test/test_matmul_cuda.py::TestMixedDtypesLinearCudaCUDA::test_mixed_dtypes_linear_cuda_float16 2025-12-04T12:47:20.2998325Z 2025-12-04T12:47:20.2998450Z Finished test_matmul_cuda 1/1 ... [2025-12-04 12:47:20.236075][2201902.697060132], took 7.66min 2025-12-04T12:47:20.2998872Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:47:20.2999267Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:47:20.2999495Z Running xpu/test_gemm 1/1 ... [2025-12-04 12:47:20.242246][2201902.703235971] 2025-12-04T12:47:20.2999682Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:47:20.3000117Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'xpu/test_gemm.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:47:20.242465] 2025-12-04T12:47:22.4981002Z 2025-12-04T12:47:22.4981449Z xpu/test_gemm 1/1 was successful, full logs can be found in artifacts with path test/test-reports/xpu.test_gemm_1.1_fdfd0412cafcf8dd_.log 2025-12-04T12:47:22.4981901Z Running 0 items in this shard: 2025-12-04T12:47:22.4982028Z 2025-12-04T12:47:22.4982188Z Finished xpu/test_gemm 1/1 ... 
[2025-12-04 12:47:22.497839][2201904.958821653], took 0.04min 2025-12-04T12:47:22.4997621Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:47:22.5045085Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:47:22.5047114Z Running test_fx_passes 1/1 ... [2025-12-04 12:47:22.504578][2201904.965568055] 2025-12-04T12:47:22.5047390Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:47:22.5049100Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_fx_passes.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:47:22.504797] 2025-12-04T12:47:24.9228387Z 2025-12-04T12:47:24.9229391Z test_fx_passes 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_fx_passes_1.1_380fef43f9102adf_.log 2025-12-04T12:47:24.9238285Z Running 53 items in this shard: test/test_fx_passes.py::TestFXGraphPasses::test_fuser_pass_deep_model, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition0, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition1, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition10, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition11, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition2, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition3, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition4, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition5, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition6, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition7, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition8, 
test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_partition9, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_xfail_partition0, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_xfail_partition1, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_xfail_partition2, test/test_fx_passes.py::TestFXGraphPasses::test_fuser_util_xfail_partition3, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn0_expected_partition0_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn10_expected_partition10_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn11_expected_partition11_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn12_expected_partition12_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn13_expected_partition13_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn14_expected_partition14_bookend_non_compute_pass_True, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn15_expected_partition15_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn16_expected_partition16_bookend_non_compute_pass_True, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn17_expected_partition17_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn18_expected_partition18_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn1_expected_partition1_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn2_expected_partition2_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn3_expected_partition3_bookend_non_compute_pass_False, 
test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn4_expected_partition4_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn5_expected_partition5_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn6_expected_partition6_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn7_expected_partition7_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn8_expected_partition8_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_fn9_expected_partition9_bookend_non_compute_pass_False, test/test_fx_passes.py::TestFXGraphPasses::test_partitioner_independent_output_fn0_expected_partition0, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model0, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model1, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model10, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model11, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model12, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model13, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model14, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model15, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model2, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model3, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model4, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model5, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model6, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model7, test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model8, 
test/test_fx_passes.py::TestFXMatcherUtils::test_subgraph_matcher_test_model9 2025-12-04T12:47:24.9245198Z 2025-12-04T12:47:24.9245307Z Finished test_fx_passes 1/1 ... [2025-12-04 12:47:24.922524][2201907.383509944], took 0.04min 2025-12-04T12:47:24.9245779Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:47:24.9286969Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:47:24.9288554Z Running functorch/test_logging 1/1 ... [2025-12-04 12:47:24.928729][2201907.389718823] 2025-12-04T12:47:24.9291084Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:47:24.9292016Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'functorch/test_logging.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:47:24.928944] 2025-12-04T12:47:27.3473996Z 2025-12-04T12:47:27.3474705Z functorch/test_logging 1/1 was successful, full logs can be found in artifacts with path test/test-reports/functorch.test_logging_1.1_5cdf72398c48bdcc_.log 2025-12-04T12:47:27.3475881Z Running 1 items in this shard: test/functorch/test_logging.py::TestAOTLogging::test_logging 2025-12-04T12:47:27.3476043Z 2025-12-04T12:47:27.3476177Z Finished functorch/test_logging 1/1 ... [2025-12-04 12:47:27.347095][2201909.80808071], took 0.04min 2025-12-04T12:47:27.3485728Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:47:27.3532252Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:47:27.3534316Z Running higher_order_ops/test_local_map 1/1 ... 
[2025-12-04 12:47:27.353308][2201909.814296678] 2025-12-04T12:47:27.3534537Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:47:27.3536345Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'higher_order_ops/test_local_map.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:47:27.353531] 2025-12-04T12:47:36.0358056Z 2025-12-04T12:47:36.0358801Z higher_order_ops/test_local_map 1/1 was successful, full logs can be found in artifacts with path test/test-reports/higher_order_ops.test_local_map_1.1_d3e58bb472a2af02_.log 2025-12-04T12:47:36.0361067Z Running 12 items in this shard: test/higher_order_ops/test_local_map.py::TestLocalMap::test_filtered_gradients, test/higher_order_ops/test_local_map.py::TestLocalMap::test_fx_annotations, test/higher_order_ops/test_local_map.py::TestLocalMap::test_local_map_dynamo_mismatch_placements, test/higher_order_ops/test_local_map.py::TestLocalMap::test_local_map_dynamo_reordered_inputs, test/higher_order_ops/test_local_map.py::TestLocalMap::test_local_map_with_local_shapes_dynamo_tracing, test/higher_order_ops/test_local_map.py::TestLocalMap::test_local_map_with_local_shapes_hop_tracing, test/higher_order_ops/test_local_map.py::TestLocalMap::test_none_gradients, test/higher_order_ops/test_local_map.py::TestLocalMap::test_none_placements, test/higher_order_ops/test_local_map.py::TestLocalMap::test_sac, test/higher_order_ops/test_local_map.py::TestLocalMap::test_sac_deferred, test/higher_order_ops/test_local_map.py::TestLocalMap::test_simple, test/higher_order_ops/test_local_map.py::TestLocalMap::test_symint_activations 2025-12-04T12:47:36.0362458Z 2025-12-04T12:47:36.0362590Z Finished higher_order_ops/test_local_map 1/1 ... 
[2025-12-04 12:47:36.035409][2201918.496395378], took 0.14min 2025-12-04T12:47:36.0369358Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:47:36.0419727Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:47:36.0422694Z Running test_tensorexpr 1/1 ... [2025-12-04 12:47:36.042190][2201918.503177089] 2025-12-04T12:47:36.0423821Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:47:36.0427437Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_tensorexpr.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:47:36.042595] 2025-12-04T12:48:04.1119667Z 2025-12-04T12:48:04.1120849Z test_tensorexpr 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_tensorexpr_1.1_d93664e0fe650c5a_.log 2025-12-04T12:48:04.1134231Z Running 74 items in this shard: test/test_tensorexpr.py::TestTensorExprFuser::test_add_const_rhs, test/test_tensorexpr.py::TestTensorExprFuser::test_add_sub, test/test_tensorexpr.py::TestTensorExprFuser::test_alias_analysis_input_and_module, test/test_tensorexpr.py::TestTensorExprFuser::test_alias_analysis_inputs, test/test_tensorexpr.py::TestTensorExprFuser::test_alias_analysis_module, test/test_tensorexpr.py::TestTensorExprFuser::test_all_combos, test/test_tensorexpr.py::TestTensorExprFuser::test_alpha, test/test_tensorexpr.py::TestTensorExprFuser::test_binary_ops, test/test_tensorexpr.py::TestTensorExprFuser::test_bitwise_ops, test/test_tensorexpr.py::TestTensorExprFuser::test_broadcast, test/test_tensorexpr.py::TestTensorExprFuser::test_broadcast3, test/test_tensorexpr.py::TestTensorExprFuser::test_broadcast_2, test/test_tensorexpr.py::TestTensorExprFuser::test_broadcast_big2, 
test/test_tensorexpr.py::TestTensorExprFuser::test_cat, test/test_tensorexpr.py::TestTensorExprFuser::test_cat_empty_tensors, test/test_tensorexpr.py::TestTensorExprFuser::test_cat_negative_dim, test/test_tensorexpr.py::TestTensorExprFuser::test_cat_only, test/test_tensorexpr.py::TestTensorExprFuser::test_cat_promote_inputs, test/test_tensorexpr.py::TestTensorExprFuser::test_cat_with_constant_dim, test/test_tensorexpr.py::TestTensorExprFuser::test_char, test/test_tensorexpr.py::TestTensorExprFuser::test_chunk, test/test_tensorexpr.py::TestTensorExprFuser::test_clamp, test/test_tensorexpr.py::TestTensorExprFuser::test_constant, test/test_tensorexpr.py::TestTensorExprFuser::test_double, test/test_tensorexpr.py::TestTensorExprFuser::test_double_intrinsics, test/test_tensorexpr.py::TestTensorExprFuser::test_dynamic_shape, test/test_tensorexpr.py::TestTensorExprFuser::test_easy, test/test_tensorexpr.py::TestTensorExprFuser::test_eq, test/test_tensorexpr.py::TestTensorExprFuser::test_exp_pow, test/test_tensorexpr.py::TestTensorExprFuser::test_four_arg, test/test_tensorexpr.py::TestTensorExprFuser::test_ge, test/test_tensorexpr.py::TestTensorExprFuser::test_gt, test/test_tensorexpr.py::TestTensorExprFuser::test_guard_fails, test/test_tensorexpr.py::TestTensorExprFuser::test_half_bn_relu, test/test_tensorexpr.py::TestTensorExprFuser::test_half_gelu, test/test_tensorexpr.py::TestTensorExprFuser::test_int64_promotion, test/test_tensorexpr.py::TestTensorExprFuser::test_int_output, test/test_tensorexpr.py::TestTensorExprFuser::test_le, test/test_tensorexpr.py::TestTensorExprFuser::test_loop, test/test_tensorexpr.py::TestTensorExprFuser::test_lt, test/test_tensorexpr.py::TestTensorExprFuser::test_mask, test/test_tensorexpr.py::TestTensorExprFuser::test_min_max, test/test_tensorexpr.py::TestTensorExprFuser::test_min_max_reduction, test/test_tensorexpr.py::TestTensorExprFuser::test_min_max_reduction2, test/test_tensorexpr.py::TestTensorExprFuser::test_min_max_reduction_dim1, 
test/test_tensorexpr.py::TestTensorExprFuser::test_min_max_reduction_dim1_2, test/test_tensorexpr.py::TestTensorExprFuser::test_multi_rand, test/test_tensorexpr.py::TestTensorExprFuser::test_multioutput, test/test_tensorexpr.py::TestTensorExprFuser::test_multiple_outputs, test/test_tensorexpr.py::TestTensorExprFuser::test_nans, test/test_tensorexpr.py::TestTensorExprFuser::test_ne, test/test_tensorexpr.py::TestTensorExprFuser::test_promotion, test/test_tensorexpr.py::TestTensorExprFuser::test_propagated_mem_layout, test/test_tensorexpr.py::TestTensorExprFuser::test_rand_like, test/test_tensorexpr.py::TestTensorExprFuser::test_rank_two, test/test_tensorexpr.py::TestTensorExprFuser::test_relu, test/test_tensorexpr.py::TestTensorExprFuser::test_remainder, test/test_tensorexpr.py::TestTensorExprFuser::test_reps, test/test_tensorexpr.py::TestTensorExprFuser::test_round_2, test/test_tensorexpr.py::TestTensorExprFuser::test_scalar, test/test_tensorexpr.py::TestTensorExprFuser::test_short, test/test_tensorexpr.py::TestTensorExprFuser::test_simple_add, test/test_tensorexpr.py::TestTensorExprFuser::test_sin_pow, test/test_tensorexpr.py::TestTensorExprFuser::test_slice, test/test_tensorexpr.py::TestTensorExprFuser::test_sliced_stride, test/test_tensorexpr.py::TestTensorExprFuser::test_softmax_cpu, test/test_tensorexpr.py::TestTensorExprFuser::test_softmax_cuda, test/test_tensorexpr.py::TestTensorExprFuser::test_strided_output_preserved, test/test_tensorexpr.py::TestTensorExprFuser::test_three_arg, test/test_tensorexpr.py::TestTensorExprFuser::test_three_arg2, test/test_tensorexpr.py::TestTensorExprFuser::test_transpose, test/test_tensorexpr.py::TestTensorExprFuser::test_unary_ops, test/test_tensorexpr.py::TestTensorExprFuser::test_unsqueeze, test/test_tensorexpr.py::TestTensorExprFuser::test_where 2025-12-04T12:48:04.1140976Z 2025-12-04T12:48:04.1141087Z Finished test_tensorexpr 1/1 ... 
[2025-12-04 12:48:04.111747][2201946.57273096], took 0.47min 2025-12-04T12:48:04.1141473Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:48:04.1184998Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:48:04.1187688Z Running test_jiterator 1/1 ... [2025-12-04 12:48:04.118610][2201946.57959965] 2025-12-04T12:48:04.1187862Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:48:04.1189120Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_jiterator.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:48:04.118797] 2025-12-04T12:48:16.3069959Z 2025-12-04T12:48:16.3071148Z test_jiterator 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_jiterator_1.1_8fa53495e815e7e1_.log 2025-12-04T12:48:16.3124848Z Running 289 items in this shard: test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_int16, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_bfloat16_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex128_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_complex128, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_complex64_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_int32, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float16_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float32_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_complex64, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_float64_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_int8, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int16_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int32_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_float64, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int64_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_int8_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_complex128, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_contiguous_shape_strides0_cuda_uint8_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_int32, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_bfloat16_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex128_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_complex64, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_complex64_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_int64, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float16_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float32_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_float16, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_float64_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_int8, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int16_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int32_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_float32, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int64_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_int8_uint8, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_complex128, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_complex64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_int16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_int32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_int64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_int8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_all_dtype_noncontiguous_shape_strides0_cuda_uint8_uint8, test/test_jiterator.py::TestPythonJiteratorCUDA::test_bool_extra_args_is_train_False_cuda, test/test_jiterator.py::TestPythonJiteratorCUDA::test_bool_extra_args_is_train_True_cuda, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta2_cuda_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta2_cuda_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta2_cuda_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta2_cuda_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta_-4_2_cuda_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta_-4_2_cuda_float16, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta_-4_2_cuda_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta_-4_2_cuda_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta_3_cuda_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta_3_cuda_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta_3_cuda_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha2_beta_3_cuda_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta2_cuda_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta2_cuda_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta2_cuda_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta2_cuda_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta_-4_2_cuda_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta_-4_2_cuda_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta_-4_2_cuda_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta_-4_2_cuda_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta_3_cuda_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta_3_cuda_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta_3_cuda_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_-1_beta_3_cuda_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta2_cuda_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta2_cuda_float16, 
test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta2_cuda_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta2_cuda_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta_-4_2_cuda_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta_-4_2_cuda_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta_-4_2_cuda_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta_-4_2_cuda_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta_3_cuda_bfloat16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta_3_cuda_float16, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta_3_cuda_float32, test/test_jiterator.py::TestPythonJiteratorCUDA::test_extra_args_alpha_2_0_beta_3_cuda_float64, test/test_jiterator.py::TestPythonJiteratorCUDA::test_invalid_function_name_code_string_template T my _kernel(T x) { return x; }_cuda, test/test_jiterator.py::TestPythonJiteratorCUDA::test_invalid_function_name_code_string_template Tmy_kernel(T x) { return x; }_cuda, test/test_jiterator.py::TestPythonJiteratorCUDA::test_multiple_functors_cuda, test/test_jiterator.py::TestPythonJiteratorCUDA::test_various_num_inputs_num_inputs_1_cuda, test/test_jiterator.py::TestPythonJiteratorCUDA::test_various_num_inputs_num_inputs_5_cuda, test/test_jiterator.py::TestPythonJiteratorCUDA::test_various_num_inputs_num_inputs_8_cuda, test/test_jiterator.py::TestPythonJiteratorCUDA::test_various_num_outputs_num_outputs_1_cuda, test/test_jiterator.py::TestPythonJiteratorCUDA::test_various_num_outputs_num_outputs_4_cuda, test/test_jiterator.py::TestPythonJiteratorCUDA::test_various_num_outputs_num_outputs_8_cuda 2025-12-04T12:48:16.3166816Z 2025-12-04T12:48:16.3166923Z Finished test_jiterator 1/1 ... 
[2025-12-04 12:48:16.306873][2201958.767859386], took 0.20min 2025-12-04T12:48:16.3167321Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:48:16.3167682Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:48:16.3167907Z Running test_native_functions 1/1 ... [2025-12-04 12:48:16.313527][2201958.774516688] 2025-12-04T12:48:16.3168088Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:48:16.3168480Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_native_functions.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:48:16.313719] 2025-12-04T12:48:18.5844741Z 2025-12-04T12:48:18.5845584Z test_native_functions 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_native_functions_1.1_e8cfec4db153d9a2_.log 2025-12-04T12:48:18.5847372Z Running 11 items in this shard: test/test_native_functions.py::TestNativeFunctions::test_intlist_error_with_overload, test/test_native_functions.py::TestNativeFunctions::test_optional_filled_intlist, test/test_native_functions.py::TestNativeFunctions::test_optional_floatlist, test/test_native_functions.py::TestNativeFunctions::test_optional_floatlist_invalid, test/test_native_functions.py::TestNativeFunctions::test_optional_intlist, test/test_native_functions.py::TestNativeFunctions::test_optional_intlist_invalid, test/test_native_functions.py::TestNativeFunctions::test_string_defaults, test/test_native_functions.py::TestNativeFunctions::test_symintlist_error, test/test_native_functions.py::TestNativeFunctions::test_symintlist_error_with_overload, test/test_native_functions.py::TestNativeFunctions::test_symintlist_error_with_overload_but_is_unique, 
test/test_native_functions.py::TestNativeFunctions::test_vararg_symintlist_error 2025-12-04T12:48:18.5848794Z 2025-12-04T12:48:18.5848911Z Finished test_native_functions 1/1 ... [2025-12-04 12:48:18.584276][2201961.045260621], took 0.04min 2025-12-04T12:48:18.5862249Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:48:18.5912027Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:48:18.5915474Z Running test_typing 1/1 ... [2025-12-04 12:48:18.591303][2201961.052292758] 2025-12-04T12:48:18.5915652Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:48:18.5916103Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_typing.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:48:18.591494] 2025-12-04T12:49:11.9429725Z 2025-12-04T12:49:11.9430967Z test_typing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_typing_1.1_616b0d655dabcfaa_.log 2025-12-04T12:49:11.9435663Z Running 18 items in this shard: test/test_typing.py::TestTyping::test_fail_arithmetic_ops.py, test/test_typing.py::TestTyping::test_fail_creation_ops.py, test/test_typing.py::TestTyping::test_fail_random.py, test/test_typing.py::TestTyping::test_fail_torch_size.py, test/test_typing.py::TestTyping::test_reveal_module_list.py, test/test_typing.py::TestTyping::test_reveal_namedtuple.py, test/test_typing.py::TestTyping::test_reveal_opt_size.py, test/test_typing.py::TestTyping::test_reveal_size.py, test/test_typing.py::TestTyping::test_reveal_tensor_constructors.py, test/test_typing.py::TestTyping::test_reveal_tensor_copy.py, test/test_typing.py::TestTyping::test_reveal_tensor_sampling.py, test/test_typing.py::TestTyping::test_reveal_torch_optim.py, 
test/test_typing.py::TestTyping::test_success_arithmetic_ops.py, test/test_typing.py::TestTyping::test_success_creation_ops.py, test/test_typing.py::TestTyping::test_success_cuda_steam.py, test/test_typing.py::TestTyping::test_success_distributions.py, test/test_typing.py::TestTyping::test_success_math_ops.py, test/test_typing.py::TestTyping::test_success_torch_size.py 2025-12-04T12:49:11.9439333Z 2025-12-04T12:49:11.9439458Z Finished test_typing 1/1 ... [2025-12-04 12:49:11.942650][2202014.403636828], took 0.89min 2025-12-04T12:49:11.9442637Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:49:11.9492055Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:49:11.9494118Z Running higher_order_ops/test_invoke_subgraph 1/1 ... [2025-12-04 12:49:11.949285][2202014.41027442] 2025-12-04T12:49:11.9494346Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:49:11.9495790Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'higher_order_ops/test_invoke_subgraph.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 12:49:11.949477] 2025-12-04T12:49:45.9753231Z 2025-12-04T12:49:45.9754106Z higher_order_ops/test_invoke_subgraph 1/1 was successful, full logs can be found in artifacts with path test/test-reports/higher_order_ops.test_invoke_subgraph_1.1_aef27fbbcdcc55f3_.log 2025-12-04T12:49:45.9778998Z Running 73 items in this shard: test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraph::test_aot_function, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraph::test_multiple, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraph::test_simple, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_ac, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_ac_rng, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_ac_rng_cudagraphs, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_auto_functionalize, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_autograd_function, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_buffer_mutation_errors_under_training, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_buffer_mutation_works_under_no_grad, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_bwd_partitioning, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_complex, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_const_tensor, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_dce, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_dce_recursive, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_dedupe, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_different_strides_in_backward, 
test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_different_symint, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_differing_strides_for_grad_outs, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_div, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_dropout_checks_joint_graph, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_dropout_checks_joint_graph_inference, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_dynamic, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_fail_with_direct_invoke_subgraph, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_fake_tensor_checking, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_gen_schema, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_gen_schema_with_buffer_mutation, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_grad_accuracy_check, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_input_input_aliasing, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_input_mutation, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_input_mutation_inference_mode, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_input_mutation_mutiple_times, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_input_mutation_mutiple_times_fake_tensor_cahche_hit, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_input_output_aliasing, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_kwargs_only, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_list, 
test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_mod_attr_aliasing, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_module, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_module_forward, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_module_method, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_nonlocal_list_mutation_hidden, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_nonlocal_update, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_normalize_gm, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_output_output_aliasing, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_pending_unbacked, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_preserves_output_strides, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_preserves_strides, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_redundant_compile_region, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_return_none, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_return_none_from_fwd, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_return_size, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_sdpa, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_simple, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_simple_module, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_symint_from_fwd_to_bwd, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_triton_kernel_native, 
test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_tuple_of_tuple, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_udf_output, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_unbacked1, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_unbacked2, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_unbacked_symbol, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_view_to_reshape, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphExportNonstrict::test_multiple_module, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphExportNonstrict::test_pending_unbacked, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphExportNonstrict::test_simple_func, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphExportNonstrict::test_simple_method, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphExportNonstrict::test_unbacked, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphExportStrict::test_multiple_module, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphExportStrict::test_pending_unbacked, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphExportStrict::test_simple_func, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphExportStrict::test_simple_method, test/higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphExportStrict::test_unbacked, test/higher_order_ops/test_invoke_subgraph.py::NegativeTesting::test_graph_break 2025-12-04T12:49:45.9791490Z 2025-12-04T12:49:45.9791637Z Finished higher_order_ops/test_invoke_subgraph 1/1 ... 
[2025-12-04 12:49:45.974958][2202048.435941515], took 0.57min 2025-12-04T12:49:45.9792075Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T12:49:45.9818103Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:49:45.9820375Z Running test_decomp 3/12 ... [2025-12-04 12:49:45.981868][2202048.442857683] 2025-12-04T12:49:45.9820554Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:49:45.9822119Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_decomp.py', '--shard-id=3', '--num-shards=12', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:49:45.982087] 2025-12-04T13:06:57.2217951Z 2025-12-04T13:06:57.2218977Z test_decomp 3/12 was successful, full logs can be found in artifacts with path test/test-reports/test_decomp_3.12_05a1b1c5ac18e726_.log 2025-12-04T13:06:57.2310706Z Running 752 items in this shard: test/test_decomp.py::TestDecompCUDA::test_comprehensive_H_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_H_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_H_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___getitem___cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rand___cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmul___cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmul___cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive___ror___cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___ror___cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___ror___cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rsub___cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__chunk_cat_cuda_bool, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive__chunk_cat_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive__segment_reduce_lengths_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acosh_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acosh_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addcmul_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_all_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_all_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_allclose_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amax_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_aminmax_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_aminmax_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_angle_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_angle_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_arange_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argmax_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argsort_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argwhere_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_partial_views_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_partial_views_cuda_int8, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_scatter_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_scatter_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan2_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_1d_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_1d_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_2d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_3d_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_baddbmm_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bernoulli_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bfloat16_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_not_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_not_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_right_shift_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bool_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_broadcast_tensors_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_broadcast_tensors_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cdouble_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cdouble_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ceil_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cfloat_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chalf_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_char_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_char_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_max_cuda_int32, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_min_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_combinations_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_physical_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_copysign_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_corrcoef_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cos_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cosh_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_count_nonzero_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cross_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumprod_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumsum_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumsum_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumsum_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumulative_trapezoid_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_embed_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagflat_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_scatter_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_scatter_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diff_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diff_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_digamma_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_floor_rounding_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_no_rounding_mode_cuda_float16, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_no_rounding_mode_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_double_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_einsum_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_like_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_strided_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eq_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_equal_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_equal_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exp_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eye_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fft2_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fft2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftn_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft2_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfftn_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifft2_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifft2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifftshift_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifftshift_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifftshift_cuda_uint8, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfft2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfft_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfftn_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfftn_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_rfft_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fill_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flatten_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flatten_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flatten_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flatten_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fliplr_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fliplr_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flipud_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_power_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_floor_divide_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_floor_divide_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_floor_divide_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_floor_divide_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmax_cuda_int32, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmax_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmin_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_frexp_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_like_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_like_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_like_cuda_uint16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gather_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ge_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ge_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_grid_sampler_3d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_heaviside_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_heaviside_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hsplit_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hsplit_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hypot_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_put_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_amax_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_mean_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_mean_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_prod_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_select_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_int_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isclose_cuda_int64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_isfinite_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isinf_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isnan_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isposinf_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isposinf_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isreal_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_item_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_unary_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_unary_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_unary_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kron_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kron_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kthvalue_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ldexp_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ldexp_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_le_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cholesky_ex_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cross_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cross_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_det_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_diagonal_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_diagonal_cuda_complex128, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_eig_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_ldl_factor_ex_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_ldl_solve_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lstsq_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lu_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lu_solve_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_matrix_norm_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_matrix_power_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_norm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_svdvals_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_tensorsolve_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_vecdot_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_vector_norm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_tensor_overload_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log10_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_normal_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_softmax_with_dtype_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logaddexp2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logcumsumexp_cuda_complex128, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_logcumsumexp_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_not_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_or_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_tensor_overload_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_tensor_overload_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logsumexp_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_long_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lt_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lt_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mT_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mT_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_argmin_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_cumprod_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_cumsum_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_fill_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_fill_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_fill_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_logsumexp_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_scatter_cuda_float16, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_scatter_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_select_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_var_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_var_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_var_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_matmul_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_matrix_exp_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_max_reduction_with_dim_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_maximum_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mean_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_meshgrid_variadic_tensors_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_no_dim_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_with_dim_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_minimum_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mode_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mode_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mode_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_movedim_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_movedim_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_movedim_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_msort_cuda_int64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_mul_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_multinomial_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nanmedian_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nansum_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nansum_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nansum_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ne_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ne_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_neg_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_strided_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_strided_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_strided_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_full_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_ones_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_ones_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_zeros_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_zeros_cuda_float64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_adaptive_avg_pool2d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_alpha_dropout_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_avg_pool3d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv1d_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv2d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv2d_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose1d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose3d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_ctc_loss_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_embedding_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_without_train_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_fractional_max_pool2d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_gaussian_nll_loss_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hardshrink_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hardsigmoid_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hardsigmoid_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hardswish_cuda_float16, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hardtanh_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_instance_norm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_trilinear_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_local_response_norm_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_logsigmoid_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_logsigmoid_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool1d_grad_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool2d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool3d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool3d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_mse_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multi_head_attention_forward_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multi_head_attention_forward_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_nll_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_circular_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_replicate_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_float64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_unshuffle_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu6_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu6_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_silu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_smooth_l1_loss_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softmin_with_dtype_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softsign_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softsign_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_upsample_bilinear_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_upsample_nearest_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_static_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_static_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_cuda_float64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_fro_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_inf_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_normal_in_place_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_like_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_like_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_outer_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_pinverse_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_0_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_0_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_positive_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_pow_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rand_like_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rand_like_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randint_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randn_like_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_remainder_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_renorm_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_interleave_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resize__cuda_float64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_resize__cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_roll_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_roll_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rot90_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_decimals_3_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_decimals_3_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_decimals_neg_3_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scalar_tensor_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_add_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_add_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_amax_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_amax_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_amax_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_prod_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_searchsorted_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_select_scatter_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_short_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sign_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signal_windows_gaussian_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signbit_cuda_float32, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_signbit_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sin_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinc_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinh_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_scatter_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_softmax_with_dtype_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_softmax_with_dtype_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_softmax_with_dtype_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_y1_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_y1_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_u_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_v_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_hermite_polynomial_h_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_hermite_polynomial_h_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_hermite_polynomial_he_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i0e_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1e_cuda_int32, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1e_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_log_ndtr_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_i1_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_k1_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtr_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtr_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtri_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_scaled_modified_bessel_k1_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_spherical_bessel_j0_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_xlog1py_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_list_args_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_list_args_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sqrt_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sqrt_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_copy_cuda_int32, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_multiple_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_stack_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_mean_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_unbiased_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_unbiased_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sum_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sum_to_size_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_take_along_dim_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_take_along_dim_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tan_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tan_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tan_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tan_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tensor_split_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tile_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trace_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trace_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trace_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_transpose_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapezoid_cuda_int16, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_tril_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triu_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triu_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unflatten_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_uniform_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_consecutive_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_consecutive_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_chunk_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_chunk_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_split_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsqueeze_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsqueeze_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsqueeze_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_var_mean_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vdot_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_as_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vsplit_cuda_complex64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_vsplit_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vstack_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vstack_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_where_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_where_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_xlogy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_xlogy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zero__cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zeros_like_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick__chunk_cat_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick__softmax_backward_data_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_abs_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_add_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_add_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_addmv_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_addmv_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_addr_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_alias_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_alias_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_amin_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_aminmax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_any_cuda_bfloat16, 
test/test_decomp.py::TestDecompCUDA::test_quick_arange_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_arange_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_arange_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_asin_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_asin_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_asinh_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_atan2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_atan2_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_atan_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_atan_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_atan_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_atan_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_baddbmm_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_bernoulli_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_left_shift_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_not_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_right_shift_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_xor_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_bucketize_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_cat_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_cat_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_cat_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_cat_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_clone_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_clone_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_conj_physical_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_constant_pad_nd_cuda_bfloat16, 
test/test_decomp.py::TestDecompCUDA::test_quick_constant_pad_nd_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_bernoulli_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_index_fill_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_linalg_cross_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_nansum_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_select_scatter_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_transpose_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_cumprod_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_cumprod_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_cumprod_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_cumsum_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_deg2rad_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_diag_embed_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_scatter_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_dist_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_div_no_rounding_mode_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_div_trunc_rounding_cuda_bfloat16, 
test/test_decomp.py::TestDecompCUDA::test_quick_dot_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_empty_like_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_erf_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_erf_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_erfc_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_erfinv_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_exp_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_expm1_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft2_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fftn_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft2_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfftn_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfftn_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft2_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfft_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfftn_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfftn_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfft_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfftn_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfft_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfft_cuda_int32, 
test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfft_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_floor_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_floor_divide_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_fmin_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_fmin_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_fmin_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_frexp_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_gt_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_heaviside_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_heaviside_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_i0_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_i0_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_i0_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_index_add_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_index_add_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_index_fill_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_index_select_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_index_select_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_isin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_isneginf_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_item_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_diagonal_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_diagonal_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_vector_norm_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_vector_norm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_tensor_overload_cuda_complex64, 
test/test_decomp.py::TestDecompCUDA::test_quick_linspace_tensor_overload_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_log10_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_log10_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_log1p_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_log2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_log_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_log_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_log_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_logaddexp2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_logaddexp_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_logical_and_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_logical_and_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_logical_not_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_logical_not_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_logical_or_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_logical_or_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_or_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_logit_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_masked_fill_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_maximum_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_maximum_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_list_of_tensors_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_mul_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_mul_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_mul_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_mv_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_3_cuda_int16, 
test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_narrow_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_native_dropout_backward_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_ne_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_strided_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_strided_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_new_full_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_new_zeros_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_new_zeros_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_new_zeros_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_nextafter_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_hardshrink_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_leaky_relu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_mse_loss_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_pad_constant_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_pad_constant_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_silu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_unfold_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_norm_fro_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_norm_inf_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_normal_cuda_float64, 
test/test_decomp.py::TestDecompCUDA::test_quick_normal_in_place_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_ones_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_ones_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_ones_like_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_ones_like_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_ones_like_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_permute_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_pow_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_pow_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_prod_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_rad2deg_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_rad2deg_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_rad2deg_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_rad2deg_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_reciprocal_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_reciprocal_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_remainder_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_repeat_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_repeat_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_rot90_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_rsqrt_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_rsqrt_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_sigmoid_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_sigmoid_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_sigmoid_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_sigmoid_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_signbit_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_sin_cuda_complex128, 
test/test_decomp.py::TestDecompCUDA::test_quick_sin_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_sinc_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_slice_scatter_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_special_erfcx_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1e_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1e_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1e_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_special_log_ndtr_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_special_log_ndtr_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_split_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_split_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_split_list_args_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_split_list_args_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_copy_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_multiple_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_std_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_std_mean_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_std_mean_unbiased_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_sub_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_sum_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_t_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_take_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_tan_cuda_complex128, 
test/test_decomp.py::TestDecompCUDA::test_quick_tanh_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_tanh_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_trace_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_tril_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_tril_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_triu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_triu_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_triu_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_trunc_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_uniform_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_unsafe_split_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_var_unbiased_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_vdot_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_view_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_view_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_view_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_xlogy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_rnn_decomp_module_nn_GRU_eval_mode_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_rnn_decomp_module_nn_GRU_train_mode_cuda_float64, 
test/test_decomp.py::DecompOneOffTestsCUDA::test_sdpa_nn_functional_scaled_dot_product_attention_cuda_float16 2025-12-04T13:06:57.2387038Z 2025-12-04T13:06:57.2387145Z Finished test_decomp 3/12 ... [2025-12-04 13:06:57.222110][2203079.683094297], took 17.19min 2025-12-04T13:06:57.2387522Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T13:06:57.2387880Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:06:57.2388101Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T13:06:57.2388288Z Uploading artifacts took 0.00 seconds 2025-12-04T13:06:57.2388448Z Running test_decomp 9/12 ... [2025-12-04 13:06:57.228731][2203079.689720966] 2025-12-04T13:06:57.2388616Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:06:57.2388990Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_decomp.py', '--shard-id=9', '--num-shards=12', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 13:06:57.228944] 2025-12-04T13:17:47.5746769Z 2025-12-04T13:17:47.5747555Z test_decomp 9/12 was successful, full logs can be found in artifacts with path test/test-reports/test_decomp_9.12_113ff9da7871c4aa_.log 2025-12-04T13:17:47.5833130Z Running 753 items in this shard: test/test_decomp.py::TestDecompCUDA::test_comprehensive_H_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_H_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_T_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_T_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive___radd___cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rand___cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rdiv___cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rdiv___cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rdiv___cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rdiv___cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmod___cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rsub___cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rsub___cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__chunk_cat_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive__chunk_cat_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acos_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acosh_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acosh_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addcmul_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addcmul_cuda_int16, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_addmm_decomposed_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addmm_decomposed_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addmv_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_alias_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_allclose_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amax_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amax_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_angle_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_angle_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_any_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_arange_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_arange_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argmin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argsort_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_partial_views_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_partial_views_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_partial_views_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asin_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_2d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_3d_cuda_int32, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_3d_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_3d_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bernoulli_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_or_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_block_diag_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_block_diag_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_broadcast_tensors_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cat_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cdouble_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cfloat_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chalf_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chunk_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chunk_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chunk_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chunk_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_min_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clone_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_column_stack_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_column_stack_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_combinations_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_combinations_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_cuda_float64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_constant_pad_nd_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_constant_pad_nd_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_contiguous_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_contiguous_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cosh_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_count_nonzero_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cov_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_embed_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagflat_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diff_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_floor_rounding_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_no_rounding_mode_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_no_rounding_mode_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_double_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_double_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_double_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dsplit_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dsplit_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dsplit_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dsplit_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dsplit_cuda_int8, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_einsum_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_like_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_permuted_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eq_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eq_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eq_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_equal_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_equal_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_equal_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erfinv_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eye_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eye_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftn_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftn_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifft2_cuda_complex128, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifftn_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifftn_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfft_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfftn_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_rfft2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_rfft2_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fill_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flatten_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fliplr_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_floor_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_floor_divide_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmax_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmax_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmax_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmin_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmod_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gather_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ge_cuda_bool, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_ge_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ge_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_geometric_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gradient_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hash_tensor_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hstack_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hstack_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_i0_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_igamma_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_put_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_prod_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_prod_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_select_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_select_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_inner_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isclose_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isclose_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isneginf_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isneginf_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isposinf_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isreal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_item_cuda_bool, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_item_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_item_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_4inputs_with_extra_args_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_binary_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_unary_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_unary_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kron_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kthvalue_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lcm_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_le_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lerp_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cholesky_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cholesky_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cross_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_det_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_diagonal_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_eigvals_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_householder_product_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_ldl_factor_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lu_factor_cuda_complex128, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lu_solve_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_matrix_norm_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_multi_dot_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_norm_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_solve_triangular_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_tensorinv_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_tensorsolve_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_tensorsolve_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_vander_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_vander_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_tensor_overload_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log10_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log1p_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log2_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_softmax_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_softmax_with_dtype_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logaddexp2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logaddexp_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_and_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_xor_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_xor_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_xor_cuda_uint8, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_logit_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logit_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logsumexp_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_long_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_long_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_long_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lt_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mH_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mH_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mT_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_amin_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_normalize_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_prod_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_scatter_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_softmax_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_std_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_std_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_sum_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_sum_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_matmul_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_max_reduction_no_dim_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mean_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_median_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_median_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_median_cuda_uint8, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_binary_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_no_dim_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_no_dim_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_with_dim_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_minimum_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_minimum_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mode_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_movedim_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_msort_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mul_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mv_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mv_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nan_to_num_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nan_to_num_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nanmean_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nanmedian_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nanmedian_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nansum_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_float16, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_full_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_full_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_full_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_ones_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_zeros_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nextafter_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_adaptive_max_pool1d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_alpha_dropout_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_avg_pool2d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_batch_norm_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_channel_shuffle_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_channel_shuffle_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_channel_shuffle_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv1d_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv1d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv3d_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv3d_cuda_complex32, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose1d_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose2d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose3d_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose3d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cross_entropy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cross_entropy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cross_entropy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_elu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_embedding_bag_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_gaussian_nll_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_glu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_grid_sample_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_instance_norm_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_bilinear_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_bilinear_cuda_float64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_linear_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_nearest-exact_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_kl_div_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_l1_loss_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_leaky_relu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_linear_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_local_response_norm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_local_response_norm_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_pool2d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool1d_grad_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_mish_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_mse_loss_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multi_margin_loss_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multilabel_margin_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multilabel_margin_loss_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_normalize_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_constant_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_int16, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pairwise_distance_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_rrelu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softmin_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softmin_with_dtype_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_tanhshrink_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_tanhshrink_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_tanhshrink_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_loss_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_loss_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_with_distance_loss_cuda_int64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_with_distance_loss_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_unfold_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_upsample_bilinear_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_static_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_static_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_fro_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_inf_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_inf_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_normal_in_place_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_outer_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_1_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_int32, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_positive_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_positive_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_positive_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_pow_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_prod_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_prod_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_put_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rad2deg_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rand_like_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randint_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randn_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randn_like_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randn_like_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randn_like_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ravel_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_real_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_real_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reciprocal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_renorm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_interleave_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resize_as__cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resolve_neg_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resolve_neg_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_roll_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_roll_cuda_int8, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_rot90_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rot90_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rot90_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rsqrt_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scalar_tensor_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_amin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_amin_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_mean_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_prod_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_searchsorted_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_searchsorted_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_select_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_select_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_select_scatter_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sgn_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sgn_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_short_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_short_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_short_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sigmoid_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sigmoid_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signal_windows_hann_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sin_cuda_int8, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinc_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_softmax_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sort_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_airy_ai_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_j1_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_y0_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_v_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_hermite_polynomial_he_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_hermite_polynomial_he_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_legendre_polynomial_p_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_log_ndtr_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_i0_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_k0_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_k0_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtr_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtri_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtri_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_polygamma_special_polygamma_n_0_cuda_bool, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_scaled_modified_bessel_k0_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_xlog1py_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_xlog1py_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_list_args_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_list_args_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sqrt_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_square_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_multiple_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_multiple_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_stack_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_stack_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_stack_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_unbiased_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sum_to_size_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_svd_lowrank_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_svd_lowrank_cuda_float64, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_take_along_dim_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_take_along_dim_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_take_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tan_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tan_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tanh_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tensor_split_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tensordot_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tile_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tile_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_to_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_to_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_to_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_to_sparse_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_transpose_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapz_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triangular_solve_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tril_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tril_indices_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_true_divide_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trunc_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_copy_cuda_float32, 
test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unflatten_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_chunk_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_chunk_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsqueeze_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsqueeze_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_var_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_var_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_as_complex_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_as_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vsplit_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_where_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_where_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zero__cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zero__cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick__upsample_bilinear2d_aa_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_acosh_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_acosh_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_acosh_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_add_cuda_int16, 
test/test_decomp.py::TestDecompCUDA::test_quick_addcdiv_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_addmm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_addmv_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_addr_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_alias_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_alias_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_alias_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_amin_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_amin_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_any_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_asin_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_asin_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_asin_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_asin_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_atan2_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_atan2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_atanh_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_baddbmm_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_and_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_left_shift_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_or_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_block_diag_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_block_diag_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_block_diag_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_cat_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_cat_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_cauchy_cuda_float32, 
test/test_decomp.py::TestDecompCUDA::test_quick_ceil_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_clamp_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_clamp_min_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_complex_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_conj_physical_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_conj_physical_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_conj_physical_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_conj_physical_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_constant_pad_nd_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_copysign_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_addcdiv_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_diag_embed_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_mv_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_nn_functional_glu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_nn_functional_max_unpool3d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_nn_functional_max_unpool3d_grad_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_norm_fro_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_rot90_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_special_log_ndtr_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_unsqueeze_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_cumsum_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_diag_cuda_int16, 
test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_scatter_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_scatter_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_digamma_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_dist_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_div_trunc_rounding_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_div_trunc_rounding_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_empty_like_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_exp2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_exp_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_expm1_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_expm1_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fftn_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fftn_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft2_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfftn_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifftn_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifftn_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfft_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfft2_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfft2_cuda_int8, 
test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfftn_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_flip_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_flip_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_floor_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fmax_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_fmax_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_full_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_full_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_gcd_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_gcd_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_ge_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_geometric_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_geometric_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_grid_sampler_2d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_heaviside_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_heaviside_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_index_add_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_index_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_index_fill_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_isinf_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_isnan_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_isnan_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_isneginf_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_isposinf_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_le_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_lgamma_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_diagonal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_vector_norm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_cuda_complex64, 
test/test_decomp.py::TestDecompCUDA::test_quick_linspace_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_tensor_overload_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_log1p_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_log_softmax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_log_softmax_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_and_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_not_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_not_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_logical_xor_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_logit_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_logit_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_tensor_overload_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_logsumexp_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_logsumexp_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_masked_fill_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_maximum_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_maximum_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_variadic_tensors_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_mul_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_mv_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_5_cuda_int16, 
test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_nan_to_num_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nan_to_num_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_narrow_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_ne_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_ne_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_neg_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_new_full_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_new_zeros_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_binary_cross_entropy_with_logits_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_hardtanh_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_max_unpool2d_grad_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_max_unpool3d_grad_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_mish_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_mish_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_pad_constant_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu6_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_unfold_cuda_bool, 
test/test_decomp.py::TestDecompCUDA::test_quick_norm_fro_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_norm_fro_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_normal_in_place_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_ones_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_ones_like_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_permute_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_pow_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_pow_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_prod_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_randn_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_randn_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_reciprocal_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_remainder_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_rot90_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_round_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_round_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_round_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_round_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_round_decimals_0_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_round_decimals_0_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_select_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_select_scatter_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_select_scatter_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_sgn_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_signbit_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_sinc_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_sinh_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_special_entr_cuda_float16, 
test/test_decomp.py::TestDecompCUDA::test_quick_special_entr_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_special_i0e_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_special_i0e_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_special_xlog1py_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_special_zeta_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_split_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_sqrt_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_sqrt_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_sqrt_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_multiple_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_sub_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_sub_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_sum_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_t_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_t_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_take_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_take_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_tan_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_tanh_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_tril_cuda_complex64, 
test/test_decomp.py::TestDecompCUDA::test_quick_tril_indices_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_triu_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_unsafe_split_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_var_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_var_mean_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_var_mean_unbiased_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_var_unbiased_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_view_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_where_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_zero__cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_like_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_rnn_decomp_module_nn_LSTM_train_mode_cuda_float64, test/test_decomp.py::DecompOneOffTestsCUDA::test_amp_batch_norm_backward_cuda, test/test_decomp.py::DecompOneOffTestsCUDA::test_sdpa_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_decomp.py::DecompOneOffTestsCUDA::test_sdpa_nn_functional_scaled_dot_product_attention_cuda_float64 2025-12-04T13:17:47.5910612Z 2025-12-04T13:17:47.5910723Z Finished test_decomp 9/12 ... 
[2025-12-04 13:17:47.574811][2203730.035795105], took 10.84min 2025-12-04T13:17:47.5911102Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T13:17:47.5911458Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:17:47.5911667Z Running test_legacy_vmap 1/1 ... [2025-12-04 13:17:47.581508][2203730.042495499] 2025-12-04T13:17:47.5911841Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:17:47.5912217Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_legacy_vmap.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:17:47.581714] 2025-12-04T13:20:09.7675204Z 2025-12-04T13:20:09.7675918Z test_legacy_vmap 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_legacy_vmap_1.1_a7fed1ed0df52024_.log 2025-12-04T13:20:09.7688946Z Running 124 items in this shard: test/test_legacy_vmap.py::TestVmapAPILegacy::test_accepts_nested_inputs, test/test_legacy_vmap.py::TestVmapAPILegacy::test_backward_unsupported_interaction, test/test_legacy_vmap.py::TestVmapAPILegacy::test_batched_gradient_basic, test/test_legacy_vmap.py::TestVmapAPILegacy::test_constant_function, test/test_legacy_vmap.py::TestVmapAPILegacy::test_different_map_dim_size_raises, test/test_legacy_vmap.py::TestVmapAPILegacy::test_fallback_atan2, test/test_legacy_vmap.py::TestVmapAPILegacy::test_fallback_does_not_warn_by_default, test/test_legacy_vmap.py::TestVmapAPILegacy::test_fallback_masked_fill, test/test_legacy_vmap.py::TestVmapAPILegacy::test_fallback_multiple_returns, test/test_legacy_vmap.py::TestVmapAPILegacy::test_fallback_warns_when_warnings_are_enabled, test/test_legacy_vmap.py::TestVmapAPILegacy::test_fallback_with_undefined_grad, 
test/test_legacy_vmap.py::TestVmapAPILegacy::test_fallback_zero_dim, test/test_legacy_vmap.py::TestVmapAPILegacy::test_func_with_no_inputs, test/test_legacy_vmap.py::TestVmapAPILegacy::test_functools_partial, test/test_legacy_vmap.py::TestVmapAPILegacy::test_grad_unsupported_interaction, test/test_legacy_vmap.py::TestVmapAPILegacy::test_in_dim_not_in_tensor_err_msg, test/test_legacy_vmap.py::TestVmapAPILegacy::test_in_dims_wrong_type_err_msg, test/test_legacy_vmap.py::TestVmapAPILegacy::test_inplace_fallback_nary_different_levels, test/test_legacy_vmap.py::TestVmapAPILegacy::test_inplace_fallback_nary_same_levels, test/test_legacy_vmap.py::TestVmapAPILegacy::test_inplace_fallback_unary, test/test_legacy_vmap.py::TestVmapAPILegacy::test_integer_in_dim_but_not_tensor_input_err_msg, test/test_legacy_vmap.py::TestVmapAPILegacy::test_multiple_inputs, test/test_legacy_vmap.py::TestVmapAPILegacy::test_multiple_out_dims, test/test_legacy_vmap.py::TestVmapAPILegacy::test_multiple_outputs, test/test_legacy_vmap.py::TestVmapAPILegacy::test_multiple_outputs_error_cases, test/test_legacy_vmap.py::TestVmapAPILegacy::test_nested_non_default_in_dims, test/test_legacy_vmap.py::TestVmapAPILegacy::test_nested_out_dims, test/test_legacy_vmap.py::TestVmapAPILegacy::test_nested_with_different_map_dim, test/test_legacy_vmap.py::TestVmapAPILegacy::test_nested_with_same_map_dim, test/test_legacy_vmap.py::TestVmapAPILegacy::test_nn_module, test/test_legacy_vmap.py::TestVmapAPILegacy::test_non_default_in_dims_out_dims, test/test_legacy_vmap.py::TestVmapAPILegacy::test_non_tensor_output_raises, test/test_legacy_vmap.py::TestVmapAPILegacy::test_non_zero_in_dims, test/test_legacy_vmap.py::TestVmapAPILegacy::test_none_in_dims, test/test_legacy_vmap.py::TestVmapAPILegacy::test_nonzero_out_dims, test/test_legacy_vmap.py::TestVmapAPILegacy::test_noop_in_inner_vmap, test/test_legacy_vmap.py::TestVmapAPILegacy::test_not_enough_in_dims_err_msg, 
test/test_legacy_vmap.py::TestVmapAPILegacy::test_out_dim_out_of_bounds_err_msg, test/test_legacy_vmap.py::TestVmapAPILegacy::test_out_dims_and_num_outputs_mismatch_err_msg, test/test_legacy_vmap.py::TestVmapAPILegacy::test_out_dims_edge_case, test/test_legacy_vmap.py::TestVmapAPILegacy::test_out_dims_must_be_int_or_tuple_of_int_err_msg, test/test_legacy_vmap.py::TestVmapAPILegacy::test_single_input, test/test_legacy_vmap.py::TestVmapAPILegacy::test_unsupported_op_err_msg, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_T_numpy, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_as_strided, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_binary_pointwise_ops, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_bmm, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_cat, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_chunk, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_clamp, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_clone, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_comparison_ops, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_conj, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_contiguous, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_diagonal, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_dot, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_expand_as, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_fill_and_zero_inplace, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_imag, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_is_complex, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_is_contiguous, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_is_floating_point, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_mm, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_movedim, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_mv, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_narrow, 
test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_new_empty, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_new_empty_strided, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_new_zeros, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_no_random_op_support, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_real, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_reshape, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_reshape_as, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_result_type, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_select, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_slice, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_split, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_squeeze, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_stack, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_stride, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_sum_dim, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_t, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_tensor_split, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_to, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_trace, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_transpose, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_unary_pointwise_ops, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_unbind, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_unfold, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_view, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_view_as, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_view_as_complex, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_view_as_real, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_vmap_fallback_check, test/test_legacy_vmap.py::TestVmapOperatorsLegacy::test_vmap_fallback_check_ok, 
test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_add_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_binary_cross_entropy_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_diagonal_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_div_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_expand_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_index_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_inplace_manyview_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_inplace_on_view_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_lgamma_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_log1p_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_log_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_logsumexp_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_max_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_median_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_min_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_mul_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_permute_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_reshape_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_select_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_sigmoid_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_slice_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_stack_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_sub_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_threshold_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_trace_cuda, 
test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_unrelated_output_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_unrelated_output_multiple_grad_cuda, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_vmap_fallback_check, test/test_legacy_vmap.py::TestVmapBatchedGradientLegacyCUDA::test_vmap_fallback_check_ok 2025-12-04T13:20:09.7701317Z 2025-12-04T13:20:09.7701424Z Finished test_legacy_vmap 1/1 ... [2025-12-04 13:20:09.767258][2203872.228244595], took 2.37min 2025-12-04T13:20:09.7701811Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T13:20:09.7739091Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:20:09.7740698Z Running higher_order_ops/test_print 1/1 ... [2025-12-04 13:20:09.773949][2203872.234938344] 2025-12-04T13:20:09.7740902Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:20:09.7742621Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'higher_order_ops/test_print.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 13:20:09.774150] 2025-12-04T13:20:12.3925575Z 2025-12-04T13:20:12.3926555Z higher_order_ops/test_print 1/1 was successful, full logs can be found in artifacts with path test/test-reports/higher_order_ops.test_print_1.1_9e8eb674b2a37b4e_.log 2025-12-04T13:20:12.3929488Z Running 10 items in this shard: test/higher_order_ops/test_print.py::TestHopPrint::test_base_print, test/higher_order_ops/test_print.py::TestHopPrint::test_constant_mutation_backend_aot_eager, test/higher_order_ops/test_print.py::TestHopPrint::test_constant_mutation_backend_eager, test/higher_order_ops/test_print.py::TestHopPrint::test_para_print, test/higher_order_ops/test_print.py::TestHopPrint::test_print_gen_schema, test/higher_order_ops/test_print.py::TestHopPrint::test_print_with_input_mutations, test/higher_order_ops/test_print.py::TestHopPrint::test_print_with_proxy_graph, test/higher_order_ops/test_print.py::TestHopPrint::test_print_with_side_effect, test/higher_order_ops/test_print.py::TestHopPrint::test_reorder_print_no_graph_break_backend_aot_eager, test/higher_order_ops/test_print.py::TestHopPrint::test_reorder_print_no_graph_break_backend_eager 2025-12-04T13:20:12.3932292Z 2025-12-04T13:20:12.3932550Z Finished higher_order_ops/test_print 1/1 ... [2025-12-04 13:20:12.392179][2203874.853166674], took 0.04min 2025-12-04T13:20:12.3937068Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T13:20:12.3989983Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:20:12.3990394Z Running test_per_overload_api 1/1 ... 
[2025-12-04 13:20:12.398869][2203874.859858894] 2025-12-04T13:20:12.3990675Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:20:12.3992131Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_per_overload_api.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:20:12.399068] 2025-12-04T13:20:14.5670563Z 2025-12-04T13:20:14.5671487Z test_per_overload_api 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_per_overload_api_1.1_b7c1915eb0c61b59_.log 2025-12-04T13:20:14.5672937Z Running 3 items in this shard: test/test_per_overload_api.py::TestPerOverloadAPI::test_basics_opoverload, test/test_per_overload_api.py::TestPerOverloadAPI::test_basics_opoverloadpacket, test/test_per_overload_api.py::TestPerOverloadAPI::test_decompose 2025-12-04T13:20:14.5673823Z 2025-12-04T13:20:14.5674082Z Finished test_per_overload_api 1/1 ... [2025-12-04 13:20:14.566775][2203877.02776049], took 0.04min 2025-12-04T13:20:14.5692564Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T13:20:14.5744232Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:20:14.5745141Z Running test_multiprocessing 1/1 ... [2025-12-04 13:20:14.574305][2203877.035294705] 2025-12-04T13:20:14.5745454Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:20:14.5746146Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_multiprocessing.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 13:20:14.574491] 2025-12-04T13:21:04.1707531Z 2025-12-04T13:21:04.1708431Z test_multiprocessing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_multiprocessing_1.1_4739a36a3a57172b_.log 2025-12-04T13:21:04.1718800Z Running 42 items in this shard: test/test_multiprocessing.py::TestMultiprocessing::test_autograd_errors, test/test_multiprocessing.py::TestMultiprocessing::test_autograd_fine_with_spawn, test/test_multiprocessing.py::TestMultiprocessing::test_cuda_bad_call, test/test_multiprocessing.py::TestMultiprocessing::test_cuda_ipc_deadlock, test/test_multiprocessing.py::TestMultiprocessing::test_cuda_memory_allocation, test/test_multiprocessing.py::TestMultiprocessing::test_cuda_parameter_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_cuda_send_many, test/test_multiprocessing.py::TestMultiprocessing::test_cuda_simple, test/test_multiprocessing.py::TestMultiprocessing::test_cuda_small_tensors, test/test_multiprocessing.py::TestMultiprocessing::test_cuda_variable_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_empty_shared, test/test_multiprocessing.py::TestMultiprocessing::test_empty_tensor_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_empty_tensor_sharing_cuda, test/test_multiprocessing.py::TestMultiprocessing::test_empty_tensor_sharing_meta, test/test_multiprocessing.py::TestMultiprocessing::test_event, test/test_multiprocessing.py::TestMultiprocessing::test_event_handle_exporter, test/test_multiprocessing.py::TestMultiprocessing::test_event_handle_importer, test/test_multiprocessing.py::TestMultiprocessing::test_event_handle_multi_gpu, test/test_multiprocessing.py::TestMultiprocessing::test_event_multiprocess, test/test_multiprocessing.py::TestMultiprocessing::test_fd_pool, test/test_multiprocessing.py::TestMultiprocessing::test_fd_preserve_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_fd_sharing, 
test/test_multiprocessing.py::TestMultiprocessing::test_fs, test/test_multiprocessing.py::TestMultiprocessing::test_fs_is_shared, test/test_multiprocessing.py::TestMultiprocessing::test_fs_pool, test/test_multiprocessing.py::TestMultiprocessing::test_fs_preserve_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_fs_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_inherit_tensor, test/test_multiprocessing.py::TestMultiprocessing::test_integer_parameter_serialization_cpu, test/test_multiprocessing.py::TestMultiprocessing::test_integer_parameter_serialization_cuda, test/test_multiprocessing.py::TestMultiprocessing::test_is_shared, test/test_multiprocessing.py::TestMultiprocessing::test_is_shared_cuda, test/test_multiprocessing.py::TestMultiprocessing::test_leaf_variable_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_meta_simple, test/test_multiprocessing.py::TestMultiprocessing::test_mixed_types_cuda_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_non_leaf_variable_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_parameter_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_rebuild_cuda_tensor, test/test_multiprocessing.py::TestMultiprocessing::test_set_thread_name, test/test_multiprocessing.py::TestMultiprocessing::test_tensor_sharing_meta, test/test_multiprocessing.py::TestMultiprocessing::test_variable_sharing, test/test_multiprocessing.py::TestMultiprocessing::test_wrong_cuda_fork 2025-12-04T13:21:04.1726025Z 2025-12-04T13:21:04.1726203Z Finished test_multiprocessing 1/1 ... 
[2025-12-04 13:21:04.170472][2203926.631454065], took 0.83min 2025-12-04T13:21:04.1727038Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T13:21:04.1770139Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:21:04.1771425Z Running test_meta 2/3 ... [2025-12-04 13:21:04.177025][2203926.638014929] 2025-12-04T13:21:04.1771656Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:21:04.1774008Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_meta.py', '--shard-id=2', '--num-shards=3', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:21:04.177240] 2025-12-04T14:02:33.0593593Z 2025-12-04T14:02:33.0593968Z PRINTING LOG FILE of test_meta 2/3 (test/test-reports/test_meta_2.3_cccb03203fa43a3b_.log) 2025-12-04T14:02:33.0594578Z Test results will be stored in test-reports/python-pytest/test_meta/test_meta-49e839ce31e1f1d7.xml 2025-12-04T14:02:33.0595519Z ============================= test session starts ============================== 2025-12-04T14:02:33.0595976Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:02:33.0596376Z cachedir: .pytest_cache 2025-12-04T14:02:33.0596876Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:02:33.0597385Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T14:02:33.0597633Z configfile: pytest.ini 2025-12-04T14:02:33.0598111Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T14:02:33.0599179Z collecting ... 
/var/lib/jenkins/pytorch/test/test_meta.py:0: PytestCollectionWarning: cannot collect test class 'TestExpect' because it has a __new__ constructor (from: test/test_meta.py) 2025-12-04T14:02:33.0599941Z collected 40725 items 2025-12-04T14:02:33.0600389Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T14:02:33.2211963Z Running 13396 items in this shard: test/test_meta.py::TestMetaConverter::test_empty_strided_non_dense_leaf, test/test_meta.py::TestMetaConverter::test_imag, test/test_meta.py::TestMetaConverter::test_tensor_outlives_converter, test/test_meta.py::TestMetaConverter::test_view_as_complex, test/test_meta.py::TestMetaConverter::test_view_as_real, test/test_meta.py::TestMetaCUDA::test_batch_norm_backward_output_mask2_cuda, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_atan2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_div_no_rounding_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_float_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_fmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_fmod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_gt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_igamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_isclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_logical_and_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_logical_xor_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_special_xlog1py_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_special_zeta_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_xlogy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_float_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_floor_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_fmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_ge_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_gt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_igamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_igammac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_jiterator_binary_return_by_ref_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_ldexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_logical_xor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_lt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_min_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_polar_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_remainder_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_rsub_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_hermite_polynomial_h_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_hermite_polynomial_he_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_legendre_polynomial_p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_xlog1py_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rand___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___ror___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___ror___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rxor___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__batch_norm_with_update_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__batch_norm_with_update_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__batch_norm_with_update_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_frac_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__native_batch_norm_legit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_offsets_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_offsets_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__upsample_bilinear2d_aa_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__upsample_bilinear2d_aa_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__upsample_bilinear2d_aa_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__upsample_bilinear2d_aa_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcdiv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcdiv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_allclose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_allclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_allclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argsort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argsort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bernoulli_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bernoulli_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bincount_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bincount_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_left_shift_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_not_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_right_shift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_right_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_right_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cauchy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_inverse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_inverse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_complex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfinv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfinv_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfinv_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exponential_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_divide_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gcd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geqrf_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geqrf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geqrf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_histc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_imag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_imag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_mean_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_inner_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_inner_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_istft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lcm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lcm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cond_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cond_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_det_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_det_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eig_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eig_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eig_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eig_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvals_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvalsh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_householder_product_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_householder_product_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_inv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_solve_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_grad_oriented_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_grad_oriented_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_power_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_hermitian_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_singular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_singular_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_triangular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_triangular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svdvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorsolve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log10_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log10_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logdet_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logdet_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_unpack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_unpack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_normalize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_pool2d_with_indices_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_no_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_multinomial_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_multinomial_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_multinomial_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmedian_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanquantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_dropout_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_layer_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nextafter_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_alpha_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_batch_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_embedding_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_similarity_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_similarity_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cross_entropy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_ctc_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_ctc_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_bag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_bag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_gelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_glu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_grid_sample_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_group_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardswish_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardswish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardswish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_huber_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_huber_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_huber_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_instance_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_area_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_area_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_area_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_bicubic_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_bicubic_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_trilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_trilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_kl_div_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_leaky_relu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_linear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_linear_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_local_response_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_local_response_norm_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_logsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_logsigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_grad_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mse_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multi_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_one_hot_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_circular_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_circular_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_circular_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_prelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_rms_norm_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_selu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_selu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_selu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_complex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_threshold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_threshold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_fro_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_fro_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_inf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_nuc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_nuc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ormqr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ormqr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pca_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pinverse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polar_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_qr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_neg_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_neg_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_bartlett_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_exponential_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_general_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_hann_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_nuttall_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_sampled_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_sampled_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_w_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_erfcx_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_erfcx_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_erfcx_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k0_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensordot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensordot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__flash_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triangular_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triangular_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unravel_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unravel_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unravel_index_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_real_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___ror___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___ror___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rxor___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rxor___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__batch_norm_with_update_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__batch_norm_with_update_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__native_batch_norm_legit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_lengths_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__softmax_backward_data_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addbmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcdiv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_decomposed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bernoulli_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bincount_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_left_shift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_left_shift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_right_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_xor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cauchy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_inverse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_complex_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_complex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfinv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfinv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_float8_e4m3fnuz, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frac_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gcd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gcd_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hypot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_igammac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_igammac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_imag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_inner_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isneginf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lcm_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_det_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eig_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eig_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eig_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_householder_product_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lstsq_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lstsq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lstsq_grad_oriented_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_multi_dot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_multi_dot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_norm_subgradients_at_zero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_singular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_singular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_qr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_slogdet_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_slogdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_triangular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_triangular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svdvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svdvals_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svdvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_tensorinv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_tensorsolve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_normal_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logcumsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_solve_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_unpack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_unpack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_mean_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_normalize_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_normalize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmin_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_pool2d_with_indices_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_dropout_backward_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_dropout_backward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_layer_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv3d_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_similarity_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cross_entropy_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_ctc_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_elu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_bag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_glu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_glu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_grid_sample_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_grid_sample_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_grid_sample_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_group_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_instance_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bicubic_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_trilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_leaky_relu_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_linear_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_local_response_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_logsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_logsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_logsigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool2d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mish_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mse_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mse_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_head_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_normalize_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pdist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_prelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rms_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rms_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rms_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rrelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_silu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_smooth_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softplus_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softplus_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_nearest_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_inf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_inf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_inf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_in_place_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_in_place_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_in_place_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_number_mean_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_number_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ormqr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pca_lowrank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pca_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pinverse_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_qr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rand_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rand_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_renorm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_renorm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_renorm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_renorm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_neg_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_neg_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_bartlett_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_blackman_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_blackman_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_general_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_hann_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_hann_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_kaiser_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_sampled_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_v_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_lowrank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensordot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_indices_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unravel_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unravel_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rand___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmatmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmatmul___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmatmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_lengths_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_offsets_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__softmax_backward_data_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__upsample_bilinear2d_aa_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_decomposed_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_H_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rsub___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_ceil_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_round_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_rsqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__segment_reduce_lengths_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__segment_reduce_offsets_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__unsafe_masked_index_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__upsample_bilinear2d_aa_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_abs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_addmm_decomposed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_addr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_arange_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_argwhere_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_as_strided_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_as_strided_partial_views_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_atanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_atleast_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bernoulli_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bincount_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bitwise_not_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bitwise_or_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_broadcast_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cfloat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_column_stack_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_combinations_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_conj_physical_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_count_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cov_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cumulative_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diagonal_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diff_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_digamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_empty_permuted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_eq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_erfinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_expand_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fliplr_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_float_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_float_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_grid_sampler_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_half_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_hash_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_int_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_isclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_isfinite_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_isin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_isreal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_istft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_jiterator_4inputs_with_extra_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_lcm_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_cholesky_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_det_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_eig_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_ldl_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_lstsq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_lu_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_matrix_rank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_multi_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_norm_subgradients_at_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_pinv_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_slogdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_vecdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logical_or_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logical_xor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_lt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_matmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_meshgrid_variadic_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mode_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_movedim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_msort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nanmedian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nansum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_native_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_native_dropout_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nextafter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_alpha_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_celu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_channel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv_transpose3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_cosine_similarity_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_glu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_group_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_hardsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_hinge_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_interpolate_area_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_interpolate_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_multilabel_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pad_circular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pad_reflect_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pad_replicate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pixel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pixel_unshuffle_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_rms_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_selu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_softplus_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_softshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_triplet_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_upsample_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_normal_in_place_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_permute_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_polygamma_polygamma_n_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_repeat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_repeat_interleave_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_resize__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_roll_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_round_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_round_decimals_neg_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_bartlett_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_blackman_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_gaussian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_general_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sinc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_slice_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_airy_ai_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_hermite_polynomial_h_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_i1e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_legendre_polynomial_p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_modified_bessel_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_modified_bessel_k1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_zeta_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_squeeze_multiple_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_t_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_take_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_tanh_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_tensor_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_tensordot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_topk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_trace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_triu_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unbind_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unbind_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unravel_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unsafe_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_vdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_view_as_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_view_as_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_view_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_xlogy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_zeros_like_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_allclose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_baddbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_baddbmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bernoulli_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_and_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_and_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_left_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_not_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_not_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_right_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_right_shift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_right_shift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_shapes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cauchy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cauchy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_inverse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_inverse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_solve_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_complex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_copysign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dist_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dist_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dist_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dist_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_einsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exponential_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frac_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gcd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gcd_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geqrf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geqrf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geqrf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_imag_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_imag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_inner_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_inner_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_inner_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_istft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lcm_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lcm_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cond_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_det_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_det_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_det_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eig_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eig_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigvalsh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_householder_product_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_householder_product_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lstsq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lstsq_grad_oriented_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_hermitian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_multi_dot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_hermitian_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_hermitian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_singular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_singular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_qr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_slogdet_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_slogdet_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_triangular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorinv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorinv_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorsolve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorsolve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vander_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vander_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vecdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vecdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logcumsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logcumsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logcumsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logdet_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_unpack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logaddexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matmul_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matrix_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_pool2d_with_indices_backward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_multinomial_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_dropout_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_dropout_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_layer_norm_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_alpha_dropout_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_celu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_celu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_similarity_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_similarity_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_similarity_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_elu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_gaussian_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_glu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_glu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_glu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_grid_sample_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_group_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_group_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardswish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_huber_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_huber_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_instance_norm_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_instance_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_instance_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_area_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_area_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_area_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_area_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bicubic_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_linear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_kl_div_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_l1_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_l1_loss_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_layer_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_leaky_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_leaky_relu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_local_response_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_logsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool2d_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool2d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mish_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multi_head_attention_forward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multi_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multi_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rrelu_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_selu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_selu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_silu_complex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_silu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_silu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_smooth_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_smooth_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_soft_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softplus_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softplus_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softplus_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_threshold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_threshold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_threshold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_threshold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_unfold_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_unfold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_in_place_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_number_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ormqr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ormqr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ormqr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pinverse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_qr_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_quantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_renorm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_renorm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_neg_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_bartlett_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_bartlett_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_blackman_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_gaussian_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_gaussian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_general_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_general_cosine_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_hann_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_kaiser_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_nuttall_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_nuttall_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_mm_reduce_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_mm_reduce_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_mm_reduce_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_sampled_addmm_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_v_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_w_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtri_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtri_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtri_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_spherical_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_spherical_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_spherical_bessel_j0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_zeta_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_zeta_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_svd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_svd_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensordot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensordot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensordot_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensordot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triangular_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_indices_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_uniform_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unravel_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_complex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_real_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rand___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmatmul___cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___ror___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___ror___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___ror___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rxor___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rxor___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rxor___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__native_batch_norm_legit_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_lengths_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__softmax_backward_data_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__softmax_backward_data_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__softmax_backward_data_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__upsample_bilinear2d_aa_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__upsample_bilinear2d_aa_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcdiv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_decomposed_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_decomposed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_H_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___getitem___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rsub___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_add_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_div_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__segment_reduce_lengths_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__segment_reduce_offsets_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__unsafe_masked_index_put_accumulate_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addcmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_argsort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_as_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_as_strided_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atleast_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bernoulli_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bitwise_right_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bool_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_broadcast_shapes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_broadcast_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cdouble_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cholesky_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_clamp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_column_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_contiguous_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_copysign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_count_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diag_embed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diagonal_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diff_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_dist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_div_floor_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_div_no_rounding_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_empty_like_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_empty_permuted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_exp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_eye_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ihfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_irfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_rfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_flatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_flip_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_float_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_float_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_frexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_gather_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ge_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_grid_sampler_2d_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_gt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_histc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_igammac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_imag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_index_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_index_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_int_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_binary_return_by_ref_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_unary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_kthvalue_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_cholesky_ex_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_inv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_ldl_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lstsq_grad_oriented_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lu_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_matrix_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_matrix_rank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_solve_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_svdvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_vander_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logical_not_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mH_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_matrix_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_max_pool2d_with_indices_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_max_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_meshgrid_list_of_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_min_reduction_no_dim_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_min_reduction_with_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_movedim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_multinomial_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_native_dropout_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_new_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_alpha_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_channel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv3d_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv_transpose1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv_transpose2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv_transpose3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_dropout3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hardtanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_area_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_bicubic_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_trilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_l1_loss_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_local_response_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_unpool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_unpool2d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_unpool3d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_multi_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pairwise_distance_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pixel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_softmin_with_dtype_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_softshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_tanhshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_upsample_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_upsample_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_norm_fro_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_normal_in_place_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_pca_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_permute_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_permute_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_rand_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_randn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_remainder_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_repeat_interleave_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_resize__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_resize_as__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_resolve_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_roll_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_rsqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_general_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_kaiser_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signbit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_slice_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sparse_mm_reduce_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_airy_ai_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_bessel_j1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_erfcx_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_hermite_polynomial_h_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_modified_bessel_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_modified_bessel_k1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_ndtri_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_zeta_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_squeeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_squeeze_multiple_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_std_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_std_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_t_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_take_along_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tensor_split_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_trace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unflatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unravel_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unsafe_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unsqueeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unsqueeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_view_as_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_vsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_xlogy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_zero__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_allclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_allclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bernoulli_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bincount_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_left_shift_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_left_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_right_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cauchy_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_min_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_complex_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_floor_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_floor_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_floor_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_einsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exponential_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exponential_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float8_e5m2, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float8_e5m2fnuz, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gcd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gcd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gcd_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_histc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_histc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_igamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_igammac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_istft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lcm_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cholesky_ex_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cond_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eig_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eig_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_householder_product_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_inv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_solve_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_grad_oriented_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_grad_oriented_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_rank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_rank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_rank_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_rank_hermitian_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_multi_dot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_multi_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_singular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_singular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svdvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorsolve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorsolve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vector_norm_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vector_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vector_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logdet_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_unpack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_unpack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logaddexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_softmax_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matrix_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matrix_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matrix_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_minimum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_minimum_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_multinomial_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_multinomial_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mv_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_dropout_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nextafter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nextafter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nextafter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_alpha_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_alpha_dropout_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_celu_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_channel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_channel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_similarity_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cross_entropy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_ctc_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_ctc_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_bag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gaussian_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_glu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_group_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_group_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_huber_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_instance_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_instance_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_instance_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bicubic_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_trilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_trilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_kl_div_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_l1_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_l1_loss_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_layer_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_leaky_relu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_linear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_linear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_local_response_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_local_response_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_logsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool2d_grad_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool2d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool2d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multi_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_normalize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_normalize_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_normalize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_one_hot_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pdist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_prelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rms_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rms_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rrelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_selu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_silu_complex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_silu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_silu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_smooth_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_smooth_l1_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softplus_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_unfold_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_nuc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ormqr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pca_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pinverse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polar_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polar_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_quantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rand_like_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_remainder_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_remainder_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rot90_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_neg_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_bartlett_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_blackman_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_gaussian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_general_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_general_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_hann_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_kaiser_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sort_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_sampled_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_sampled_addmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_airy_ai_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_airy_ai_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_h_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtri_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtri_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_spherical_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_spherical_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_spherical_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_spherical_bessel_j0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_topk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_topk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__flash_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_uint64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unravel_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unravel_index_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vdot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_complex_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_embedding_bag_byte_prepack_cuda, test/test_meta.py::TestMetaCUDA::test_embedding_bag_dense_backward_mode_1_cuda, test/test_meta.py::TestMetaCUDA::test_fill__alias_relationship_cuda, test/test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask3_cuda, test/test_meta.py::TestMetaCUDA::test_huber_loss_backward_cuda, test/test_meta.py::TestMetaCUDA::test_layer_norm_backward_output_mask1_cuda, test/test_meta.py::TestMetaCUDA::test_layer_norm_backward_output_mask4_cuda, test/test_meta.py::TestMetaCUDA::test_layer_norm_backward_output_mask5_cuda, test/test_meta.py::TestMetaCUDA::test_map_location_deserialize_cuda, test/test_meta.py::TestMetaCUDA::test_meta_autograd_no_error_cuda, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rand___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rand___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmatmul___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmatmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___ror___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace___ror___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___ror___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rxor___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rxor___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__batch_norm_with_update_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__native_batch_norm_legit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__native_batch_norm_legit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__segment_reduce_offsets_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__softmax_backward_data_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__softmax_backward_data_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__softmax_backward_data_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__upsample_bilinear2d_aa_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__upsample_bilinear2d_aa_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_decomposed_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_decomposed_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_allclose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_allclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bincount_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bincount_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_left_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_left_shift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_not_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_or_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_or_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_xor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cauchy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_inverse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_inverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_inverse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_complex_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dist_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exponential_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float8_e4m3fn, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float8_e5m2fnuz, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_frexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gcd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_histc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_histc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hypot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_igamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_igamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_igammac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_inner_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_inner_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_istft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lcm_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cholesky_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cholesky_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cond_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cond_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_det_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_det_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvals_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvalsh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvalsh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_householder_product_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_householder_product_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_householder_product_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lstsq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lstsq_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lstsq_grad_oriented_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_power_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_multi_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_hermitian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_singular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_singular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_singular_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_qr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_slogdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_slogdet_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_triangular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_triangular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svd_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svdvals_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svdvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorsolve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorsolve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vander_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vander_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vecdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vecdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_unpack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_unpack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_unpack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logaddexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_normalize_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_matmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_matmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_matmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_matmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_matrix_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_matrix_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_multinomial_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_multinomial_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanmedian_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanquantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanquantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_batch_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_batch_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_dropout_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nextafter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nextafter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_binary_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_celu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_similarity_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_similarity_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_ctc_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_elu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_elu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_elu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_glu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_grid_sample_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_grid_sample_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_group_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_group_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardswish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardswish_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardswish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_huber_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_huber_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_huber_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_area_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bicubic_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bicubic_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_linear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_trilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_kl_div_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_leaky_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_leaky_relu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_linear_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_linear_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_linear_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_local_response_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_local_response_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_logsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_logsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_logsigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool2d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mish_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mse_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_head_attention_forward_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_normalize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pdist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_prelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_prelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rms_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rms_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rrelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rrelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_selu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_selu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_complex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_soft_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_soft_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softplus_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_nearest_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_inf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_inf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_nuc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_nuc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_in_place_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_in_place_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pca_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pinverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polar_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_quantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_quantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_renorm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_neg_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_exponential_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_general_cosine_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_general_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_general_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_kaiser_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_mm_reduce_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_mm_reduce_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_sampled_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_sampled_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_laguerre_polynomial_l_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stft_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_svd_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensordot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensordot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensordot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_topk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_topk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_topk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_topk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_topk_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch__scaled_mm_cuda_float8_e4m3fn, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trace_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triangular_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_uniform_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_uniform_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_uniform_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_uniform_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unravel_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unravel_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unravel_index_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_vdot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmatmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___ror___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__batch_norm_with_update_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__batch_norm_with_update_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__native_batch_norm_legit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_lengths_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_offsets_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__softmax_backward_data_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__upsample_bilinear2d_aa_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_decomposed_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_decomposed_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_allclose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bernoulli_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bernoulli_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bincount_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bincount_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_and_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_and_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_not_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cauchy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cdist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_inverse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_inverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_inverse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_complex_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_contiguous_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_contiguous_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dist_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exponential_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exponential_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exponential_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float8_e4m3fn, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float8_e5m2, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_frexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_frexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_frexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_int16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geqrf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geqrf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geqrf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_grid_sampler_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_grid_sampler_3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_inner_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isposinf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isposinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_istft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_istft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kthvalue_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kthvalue_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lerp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lerp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_ex_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_det_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eig_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvals_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvalsh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvalsh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_householder_product_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_householder_product_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_householder_product_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lstsq_grad_oriented_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lstsq_grad_oriented_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_rank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_multi_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_multi_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_multi_dot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_hermitian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_singular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_slogdet_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_slogdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_solve_triangular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_solve_triangular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_solve_triangular_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svd_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svdvals_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svdvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorinv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorsolve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorsolve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vecdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_normal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logdet_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_unpack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_unpack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_unpack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_median_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_normalize_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_normalize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmean_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmedian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmedian_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmedian_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanquantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_native_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_native_dropout_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_native_dropout_backward_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_native_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nextafter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_similarity_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_similarity_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_ctc_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout3d_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_elu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_bag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_fractional_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_glu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_glu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_grid_sample_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_group_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_group_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardswish_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_instance_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_instance_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_area_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_area_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_area_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_bicubic_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_bicubic_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_linear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_linear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_layer_norm_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_leaky_relu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_linear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_linear_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_local_response_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_logsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mse_loss_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_one_hot_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_complex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_smooth_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_soft_margin_loss_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softplus_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softplus_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_upsample_nearest_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_nuc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_in_place_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_in_place_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_number_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ormqr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ormqr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pca_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pinverse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polar_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randn_like_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_renorm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_complex64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_uint8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_int8, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_bfloat16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_gaussian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_general_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_general_cosine_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_general_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_nuttall_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_complex32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y1_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_svd_lowrank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_bool, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__flash_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_float64, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triangular_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triangular_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_indices_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_float16, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_uniform_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_uniform_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_int32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_uint64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unravel_index_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_complex128, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_where_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_where_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_where_cuda_float32, 
test/test_meta.py::TestMetaCUDA::test_meta_outplace_where_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_mixed_dtype_for_native_layer_norm_backward_float16_float32_cuda, test/test_meta.py::TestMetaCUDA::test_mixed_dtype_for_native_layer_norm_backward_float32_bias_dtype2_cuda, test/test_meta.py::TestMetaCUDA::test_mixed_dtype_for_native_layer_norm_backward_float32_float32_cuda, test/test_meta.py::TestMetaCUDA::test_quantized_embedding_bag_cuda 2025-12-04T14:02:33.3690322Z 2025-12-04T14:02:33.3690556Z test_meta.py::TestMetaConverter::test_empty_strided_non_dense_leaf PASSED [0.0021s] [ 0%] 2025-12-04T14:02:33.3690888Z test_meta.py::TestMetaConverter::test_imag PASSED [0.0015s] [ 0%] 2025-12-04T14:02:33.3691132Z test_meta.py::TestMetaConverter::test_tensor_outlives_converter PASSED [0.0006s] [ 0%] 2025-12-04T14:02:33.3691381Z test_meta.py::TestMetaConverter::test_view_as_complex PASSED [0.0010s] [ 0%] 2025-12-04T14:02:33.3691607Z test_meta.py::TestMetaConverter::test_view_as_real PASSED [0.0009s] [ 0%] 2025-12-04T14:02:33.3691890Z test_meta.py::TestMetaCUDA::test_batch_norm_backward_output_mask2_cuda SKIPPED [0.0489s] (Only runs on cpu) [ 0%] 2025-12-04T14:02:33.3692203Z 
test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_atan2_cuda_float32 PASSED [1.0840s] [ 0%] 2025-12-04T14:02:33.3692501Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_clamp_max_cuda_float32 PASSED [0.8966s] [ 0%] 2025-12-04T14:02:33.3692862Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_div_no_rounding_mode_cuda_float32 PASSED [0.8637s] [ 0%] 2025-12-04T14:02:33.3693191Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_div_trunc_rounding_cuda_float32 PASSED [0.8367s] [ 0%] 2025-12-04T14:02:33.3693507Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_float_power_cuda_float32 PASSED [0.8601s] [ 0%] 2025-12-04T14:02:33.3693804Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_fmin_cuda_float32 PASSED [0.8334s] [ 0%] 2025-12-04T14:02:33.3694525Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_fmod_cuda_float32 PASSED [0.8596s] [ 0%] 2025-12-04T14:02:33.3694813Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_gt_cuda_float32 PASSED [0.8596s] [ 0%] 2025-12-04T14:02:33.3695108Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_heaviside_cuda_float32 PASSED [0.0651s] [ 0%] 2025-12-04T14:02:33.3695408Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_igamma_cuda_float32 PASSED [0.0103s] [ 0%] 2025-12-04T14:02:33.3695696Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_isclose_cuda_float32 XFAIL [0.0052s] [ 0%] 2025-12-04T14:02:33.3695985Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_logical_and_cuda_float32 PASSED [0.8617s] [ 0%] 2025-12-04T14:02:33.3696284Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_logical_xor_cuda_float32 PASSED [0.0075s] [ 0%] 2025-12-04T14:02:33.3696586Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_special_xlog1py_cuda_float32 PASSED [0.0245s] [ 0%] 2025-12-04T14:02:33.3696887Z 
test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_special_zeta_cuda_float32 PASSED [0.8378s] [ 0%] 2025-12-04T14:02:33.3697174Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_sub_cuda_float32 PASSED [0.0210s] [ 0%] 2025-12-04T14:02:33.3697454Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_xlogy_cuda_float32 PASSED [0.0080s] [ 0%] 2025-12-04T14:02:33.3697732Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_clamp_min_cuda_float32 PASSED [0.0047s] [ 0%] 2025-12-04T14:02:33.3698076Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_float_power_cuda_float32 PASSED [0.8295s] [ 0%] 2025-12-04T14:02:33.3698425Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_floor_divide_cuda_float32 PASSED [0.0245s] [ 0%] 2025-12-04T14:02:33.3698700Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_fmax_cuda_float32 PASSED [0.8422s] [ 0%] 2025-12-04T14:02:33.3698967Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_ge_cuda_float32 PASSED [0.8601s] [ 0%] 2025-12-04T14:02:33.3699227Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_gt_cuda_float32 PASSED [0.8340s] [ 0%] 2025-12-04T14:02:33.3699491Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_heaviside_cuda_float32 XFAIL [0.0062s] [ 0%] 2025-12-04T14:02:33.3699935Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_igamma_cuda_float32 PASSED [1.6684s] [ 0%] 2025-12-04T14:02:33.3700246Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_igammac_cuda_float32 PASSED [0.8482s] [ 0%] 2025-12-04T14:02:33.3700553Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_jiterator_binary_return_by_ref_cuda_float32 PASSED [0.9998s] [ 0%] 2025-12-04T14:02:33.3700853Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_ldexp_cuda_float32 PASSED [0.8339s] [ 0%] 2025-12-04T14:02:33.3701130Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_logical_xor_cuda_float32 PASSED [0.8317s] [ 0%] 
2025-12-04T14:02:33.3701400Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_lt_cuda_float32 PASSED [0.8455s] [ 0%] 2025-12-04T14:02:33.3701668Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_min_binary_cuda_float32 PASSED [0.8517s] [ 0%] 2025-12-04T14:02:33.3701939Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_ne_cuda_float32 PASSED [0.8228s] [ 0%] 2025-12-04T14:02:33.3702203Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_polar_cuda_float32 XFAIL [0.0043s] [ 0%] 2025-12-04T14:02:33.3702473Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_remainder_cuda_float32 PASSED [0.8202s] [ 0%] 2025-12-04T14:02:33.3702744Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_rsub_cuda_float32 PASSED [0.0040s] [ 0%] 2025-12-04T14:02:33.3703040Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_hermite_polynomial_h_cuda_float32 PASSED [0.8393s] [ 0%] 2025-12-04T14:02:33.3703395Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_hermite_polynomial_he_cuda_float32 PASSED [0.0065s] [ 0%] 2025-12-04T14:02:33.3703728Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_legendre_polynomial_p_cuda_float32 PASSED [0.0049s] [ 0%] 2025-12-04T14:02:33.3704071Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_shifted_chebyshev_polynomial_u_cuda_float32 PASSED [0.8377s] [ 0%] 2025-12-04T14:02:33.3704423Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_shifted_chebyshev_polynomial_v_cuda_float32 PASSED [0.0063s] [ 0%] 2025-12-04T14:02:33.3704751Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_xlog1py_cuda_float32 PASSED [0.0255s] [ 0%] 2025-12-04T14:02:33.3705036Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_sub_cuda_float32 PASSED [0.8338s] [ 0%] 2025-12-04T14:02:33.3705309Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_true_divide_cuda_float32 PASSED [0.0050s] [ 0%] 
2025-12-04T14:02:33.3705629Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3705985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3706329Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3706669Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3707032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3707385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3707721Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3708054Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3708383Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3708781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3709160Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3709522Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3709873Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3710269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3710622Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3710969Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3711312Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3711660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rand___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3712015Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3712396Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3712751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3713100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3713460Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3713831Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3714193Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3714552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3714899Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3715244Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3715587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3715961Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3716333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3716685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3717032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___ror___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3717374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___ror___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3717745Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3718106Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3718461Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3718811Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3719153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3719495Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3719845Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3720249Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3720595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rxor___cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3720967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__batch_norm_with_update_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3721389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__batch_norm_with_update_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3721791Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__batch_norm_with_update_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3722172Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3722537Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 
2025-12-04T14:02:33.3722900Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.3723225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_complex128 XFAIL [0.0063s] [ 0%] 2025-12-04T14:02:33.3723509Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_float32 PASSED [0.8924s] [ 0%] 2025-12-04T14:02:33.3723788Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_int32 PASSED [0.0209s] [ 0%] 2025-12-04T14:02:33.3724065Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_bfloat16 PASSED [0.0151s] [ 0%] 2025-12-04T14:02:33.3724348Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_float16 PASSED [0.0150s] [ 0%] 2025-12-04T14:02:33.3724621Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_int8 XFAIL [0.0057s] [ 0%] 2025-12-04T14:02:33.3724914Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_bfloat16 PASSED [0.1057s] [ 0%] 2025-12-04T14:02:33.3725217Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_bool XFAIL [0.0103s] [ 0%] 2025-12-04T14:02:33.3725490Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_float16 PASSED [0.9228s] [ 0%] 2025-12-04T14:02:33.3725766Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_int32 PASSED [0.0581s] [ 0%] 2025-12-04T14:02:33.3726038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_int8 PASSED [0.0577s] [ 0%] 2025-12-04T14:02:33.3726328Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_uint8 PASSED [0.0573s] [ 0%] 2025-12-04T14:02:33.3726615Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_bfloat16 PASSED [0.1570s] [ 0%] 2025-12-04T14:02:33.3726902Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_bool XFAIL [0.0072s] [ 0%] 2025-12-04T14:02:33.3727189Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_complex64 PASSED [1.0303s] [ 0%] 2025-12-04T14:02:33.3727483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_float16 PASSED [0.1515s] [ 0%] 2025-12-04T14:02:33.3727768Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_int16 XFAIL [0.0074s] [ 0%] 2025-12-04T14:02:33.3728048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_int32 XFAIL [0.8468s] [ 0%] 2025-12-04T14:02:33.3728327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_uint8 XFAIL [0.8488s] [ 0%] 2025-12-04T14:02:33.3728618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_complex128 PASSED [1.0369s] [ 0%] 2025-12-04T14:02:33.3728916Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_complex64 PASSED [0.2047s] [ 0%] 2025-12-04T14:02:33.3729209Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_float16 PASSED [0.1509s] [ 0%] 2025-12-04T14:02:33.3729498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_float64 PASSED [0.1508s] [ 0%] 2025-12-04T14:02:33.3729797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_int16 PASSED [0.1253s] [ 0%] 2025-12-04T14:02:33.3730078Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_int64 PASSED [0.1038s] [ 0%] 2025-12-04T14:02:33.3730405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_int8 PASSED [0.1038s] [ 0%] 2025-12-04T14:02:33.3730681Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_bool XFAIL [0.0059s] [ 0%] 2025-12-04T14:02:33.3730960Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_complex128 PASSED [0.0149s] 
[ 0%] 2025-12-04T14:02:33.3731245Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_uint8 XFAIL [0.0056s] [ 0%] 2025-12-04T14:02:33.3731523Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_bfloat16 PASSED [0.0150s] [ 0%] 2025-12-04T14:02:33.3731807Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_float16 PASSED [0.0148s] [ 0%] 2025-12-04T14:02:33.3732084Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_int64 XFAIL [0.0056s] [ 0%] 2025-12-04T14:02:33.3732378Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_float16 PASSED [0.8467s] [ 0%] 2025-12-04T14:02:33.3732668Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_float32 PASSED [0.0151s] [ 0%] 2025-12-04T14:02:33.3732954Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_int16 PASSED [0.0147s] [ 0%] 2025-12-04T14:02:33.3733249Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_bfloat16 PASSED [0.1517s] [ 1%] 2025-12-04T14:02:33.3733578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_complex64 XFAIL [0.0067s] [ 1%] 2025-12-04T14:02:33.3733890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_bool XFAIL [0.8420s] [ 1%] 2025-12-04T14:02:33.3734190Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_complex128 XFAIL [0.8469s] [ 1%] 2025-12-04T14:02:33.3734495Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_float16 PASSED [0.9832s] [ 1%] 2025-12-04T14:02:33.3734797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_float64 PASSED [0.1498s] [ 1%] 2025-12-04T14:02:33.3735108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_int8 PASSED [0.1049s] [ 1%] 2025-12-04T14:02:33.3735401Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_uint8 PASSED [0.1030s] [ 1%] 2025-12-04T14:02:33.3735695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_bfloat16 PASSED [0.0158s] [ 1%] 2025-12-04T14:02:33.3735991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_complex64 PASSED [0.0156s] [ 1%] 2025-12-04T14:02:33.3736284Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_float16 PASSED [0.0154s] [ 1%] 2025-12-04T14:02:33.3736576Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_bfloat16 PASSED [0.0149s] [ 1%] 2025-12-04T14:02:33.3736864Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_complex64 PASSED [0.0148s] [ 1%] 2025-12-04T14:02:33.3737153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_float16 PASSED [0.0149s] [ 1%] 2025-12-04T14:02:33.3737437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_int8 XFAIL [0.0057s] [ 1%] 2025-12-04T14:02:33.3737722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_bfloat16 PASSED [0.0149s] [ 1%] 2025-12-04T14:02:33.3738014Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_float32 PASSED [0.0146s] [ 1%] 2025-12-04T14:02:33.3738307Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_float64 PASSED [0.0146s] [ 1%] 2025-12-04T14:02:33.3738667Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_bool XFAIL [0.0066s] [ 1%] 2025-12-04T14:02:33.3738973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_complex128 PASSED [0.9497s] [ 1%] 2025-12-04T14:02:33.3739267Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_complex64 PASSED [0.0970s] [ 1%] 2025-12-04T14:02:33.3739559Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_uint8 XFAIL [0.0067s] [ 1%] 
2025-12-04T14:02:33.3739840Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_bool XFAIL [0.8381s] [ 1%] 2025-12-04T14:02:33.3740158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_float16 PASSED [0.8500s] [ 1%] 2025-12-04T14:02:33.3740450Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_float32 PASSED [0.0151s] [ 1%] 2025-12-04T14:02:33.3740742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_float64 PASSED [0.0147s] [ 1%] 2025-12-04T14:02:33.3741029Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_int16 XFAIL [0.0058s] [ 1%] 2025-12-04T14:02:33.3741313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_bfloat16 PASSED [0.8534s] [ 1%] 2025-12-04T14:02:33.3741598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_bool XFAIL [0.0061s] [ 1%] 2025-12-04T14:02:33.3741884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_complex128 XFAIL [0.8568s] [ 1%] 2025-12-04T14:02:33.3742175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_float32 PASSED [0.8558s] [ 1%] 2025-12-04T14:02:33.3742488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_int16 XFAIL [0.0064s] [ 1%] 2025-12-04T14:02:33.3742785Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_int8 XFAIL [0.8437s] [ 1%] 2025-12-04T14:02:33.3743071Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_float32 PASSED [0.8699s] [ 1%] 2025-12-04T14:02:33.3743353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_int32 XFAIL [0.0061s] [ 1%] 2025-12-04T14:02:33.3743628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_int8 XFAIL [0.8354s] [ 1%] 2025-12-04T14:02:33.3743901Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_uint8 XFAIL [0.8406s] [ 1%] 
2025-12-04T14:02:33.3744204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_float64 PASSED [0.8433s] [ 1%] 2025-12-04T14:02:33.3744492Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_int32 XFAIL [0.0062s] [ 1%] 2025-12-04T14:02:33.3744777Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_uint8 XFAIL [0.8404s] [ 1%] 2025-12-04T14:02:33.3745067Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_complex64 XFAIL [0.8406s] [ 1%] 2025-12-04T14:02:33.3745363Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_int64 PASSED [0.8547s] [ 1%] 2025-12-04T14:02:33.3745648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_frac_cuda_int32 XFAIL [0.0064s] [ 1%] 2025-12-04T14:02:33.3745932Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_bfloat16 PASSED [0.9530s] [ 1%] 2025-12-04T14:02:33.3746222Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_bool XFAIL [0.0077s] [ 1%] 2025-12-04T14:02:33.3746511Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_complex128 PASSED [0.9815s] [ 1%] 2025-12-04T14:02:33.3746803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_float16 PASSED [0.1041s] [ 1%] 2025-12-04T14:02:33.3747090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_float32 PASSED [0.1063s] [ 1%] 2025-12-04T14:02:33.3747377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_float64 PASSED [0.1042s] [ 1%] 2025-12-04T14:02:33.3747686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_int32 XFAIL [0.0076s] [ 1%] 2025-12-04T14:02:33.3747975Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_bfloat16 PASSED [0.8709s] [ 1%] 2025-12-04T14:02:33.3748274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_complex128 XFAIL 
[0.0062s] [ 1%] 2025-12-04T14:02:33.3748571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_float64 PASSED [0.8482s] [ 1%] 2025-12-04T14:02:33.3748862Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_int16 XFAIL [0.0063s] [ 1%] 2025-12-04T14:02:33.3749149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_int32 XFAIL [0.8470s] [ 1%] 2025-12-04T14:02:33.3749433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_int8 XFAIL [0.8570s] [ 1%] 2025-12-04T14:02:33.3749723Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_bfloat16 PASSED [0.8432s] [ 1%] 2025-12-04T14:02:33.3750030Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_complex128 PASSED [0.0153s] [ 1%] 2025-12-04T14:02:33.3750366Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_float16 PASSED [0.0150s] [ 1%] 2025-12-04T14:02:33.3750660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_float32 PASSED [0.0148s] [ 1%] 2025-12-04T14:02:33.3750951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_int16 XFAIL [0.0058s] [ 1%] 2025-12-04T14:02:33.3751241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_int32 XFAIL [0.0057s] [ 1%] 2025-12-04T14:02:33.3751542Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_int8 XFAIL [0.8498s] [ 1%] 2025-12-04T14:02:33.3751852Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_complex64 PASSED [0.8519s] [ 1%] 2025-12-04T14:02:33.3752149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_int32 XFAIL [0.0062s] [ 1%] 2025-12-04T14:02:33.3752443Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_complex64 PASSED [0.8850s] [ 1%] 2025-12-04T14:02:33.3752739Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_float32 PASSED [0.0151s] [ 1%] 2025-12-04T14:02:33.3753046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_int64 XFAIL [0.0059s] [ 1%] 2025-12-04T14:02:33.3753327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_int8 XFAIL [0.0057s] [ 1%] 2025-12-04T14:02:33.3753613Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_bfloat16 PASSED [0.8513s] [ 1%] 2025-12-04T14:02:33.3753896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_bool XFAIL [0.0062s] [ 1%] 2025-12-04T14:02:33.3754178Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_float16 PASSED [0.8510s] [ 1%] 2025-12-04T14:02:33.3754462Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_int64 XFAIL [0.0061s] [ 1%] 2025-12-04T14:02:33.3754739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_uint8 XFAIL [0.8376s] [ 1%] 2025-12-04T14:02:33.3755071Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_complex128 SKIPPED [0.8337s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.3755453Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_int16 SKIPPED [0.0015s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.3755822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.3756163Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_complex128 XFAIL [0.0079s] [ 1%] 2025-12-04T14:02:33.3756465Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_float16 PASSED [0.9944s] [ 1%] 2025-12-04T14:02:33.3756784Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_float64 PASSED [0.1506s] [ 1%] 2025-12-04T14:02:33.3757083Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_complex128 XFAIL [0.0071s] [ 1%] 2025-12-04T14:02:33.3757384Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_complex64 XFAIL [0.8476s] [ 1%] 2025-12-04T14:02:33.3757683Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_float16 PASSED [0.9887s] [ 1%] 2025-12-04T14:02:33.3757976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_uint8 PASSED [0.1027s] [ 1%] 2025-12-04T14:02:33.3758269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_mul_cuda_complex64 PASSED [0.0975s] [ 1%] 2025-12-04T14:02:33.3758555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_bool XFAIL [0.0058s] [ 1%] 2025-12-04T14:02:33.3758838Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_float16 PASSED [0.0151s] [ 1%] 2025-12-04T14:02:33.3759129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_float32 PASSED [0.0147s] [ 1%] 2025-12-04T14:02:33.3759414Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_float64 PASSED [0.0147s] [ 1%] 2025-12-04T14:02:33.3759699Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_int64 PASSED [0.0146s] [ 1%] 2025-12-04T14:02:33.3760031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.3760467Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.3760866Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.3761244Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 
2025-12-04T14:02:33.3761578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_complex128 PASSED [0.1122s] [ 1%] 2025-12-04T14:02:33.3761870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_float16 PASSED [0.0854s] [ 1%] 2025-12-04T14:02:33.3762178Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_float32 PASSED [0.0840s] [ 1%] 2025-12-04T14:02:33.3762465Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_float64 PASSED [0.0835s] [ 1%] 2025-12-04T14:02:33.3762749Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_int64 PASSED [0.0574s] [ 1%] 2025-12-04T14:02:33.3763031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_int8 PASSED [0.0573s] [ 1%] 2025-12-04T14:02:33.3763311Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_uint8 PASSED [0.0572s] [ 1%] 2025-12-04T14:02:33.3763610Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_complex128 PASSED [0.0149s] [ 1%] 2025-12-04T14:02:33.3763921Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_int64 XFAIL [0.0057s] [ 1%] 2025-12-04T14:02:33.3764223Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_uint8 XFAIL [0.0055s] [ 1%] 2025-12-04T14:02:33.3764521Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_bfloat16 PASSED [0.0145s] [ 1%] 2025-12-04T14:02:33.3764810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_bool XFAIL [0.0056s] [ 1%] 2025-12-04T14:02:33.3765103Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_complex128 XFAIL [0.0056s] [ 1%] 2025-12-04T14:02:33.3765398Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_float32 PASSED [0.8573s] [ 1%] 2025-12-04T14:02:33.3765716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_complex64 
PASSED [0.0154s] [ 1%] 2025-12-04T14:02:33.3766012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_float16 PASSED [0.0149s] [ 1%] 2025-12-04T14:02:33.3766300Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_int16 XFAIL [0.0059s] [ 1%] 2025-12-04T14:02:33.3766598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_complex128 PASSED [0.8702s] [ 1%] 2025-12-04T14:02:33.3766900Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_int16 XFAIL [0.0060s] [ 1%] 2025-12-04T14:02:33.3767187Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_bool PASSED [0.0263s] [ 1%] 2025-12-04T14:02:33.3767474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_float32 PASSED [0.0147s] [ 1%] 2025-12-04T14:02:33.3767760Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_int32 PASSED [0.0147s] [ 1%] 2025-12-04T14:02:33.3768046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_int64 PASSED [0.0148s] [ 1%] 2025-12-04T14:02:33.3768327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_int8 PASSED [0.0145s] [ 1%] 2025-12-04T14:02:33.3768609Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_uint8 PASSED [0.0147s] [ 1%] 2025-12-04T14:02:33.3768898Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_complex64 PASSED [0.0149s] [ 1%] 2025-12-04T14:02:33.3769186Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_float32 PASSED [0.0146s] [ 1%] 2025-12-04T14:02:33.3769493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_float64 PASSED [0.0145s] [ 1%] 2025-12-04T14:02:33.3769786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_int16 XFAIL [0.0057s] [ 1%] 2025-12-04T14:02:33.3770065Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_int32 
XFAIL [0.0055s] [ 1%] 2025-12-04T14:02:33.3770405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_int64 XFAIL [0.0056s] [ 1%] 2025-12-04T14:02:33.3770681Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_int8 XFAIL [0.8562s] [ 1%] 2025-12-04T14:02:33.3772320Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_uint8 XFAIL [0.8445s] [ 1%] 2025-12-04T14:02:33.3772991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_bfloat16 PASSED [0.8549s] [ 1%] 2025-12-04T14:02:33.3773363Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_bool XFAIL [0.0062s] [ 1%] 2025-12-04T14:02:33.3773724Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_complex128 PASSED [0.8778s] [ 1%] 2025-12-04T14:02:33.3774083Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_float16 PASSED [0.0153s] [ 1%] 2025-12-04T14:02:33.3774447Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_float64 PASSED [0.0148s] [ 1%] 2025-12-04T14:02:33.3774800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_complex64 PASSED [0.0150s] [ 1%] 2025-12-04T14:02:33.3775153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_float16 PASSED [0.0149s] [ 1%] 2025-12-04T14:02:33.3775501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_int16 XFAIL [0.0058s] [ 2%] 2025-12-04T14:02:33.3775836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_float16 XFAIL [0.8660s] [ 2%] 2025-12-04T14:02:33.3776184Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_float64 XFAIL [0.8511s] [ 2%] 2025-12-04T14:02:33.3776528Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_int32 XFAIL [0.8390s] [ 2%] 2025-12-04T14:02:33.3776856Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_uint8 XFAIL [0.8602s] 
[ 2%] 2025-12-04T14:02:33.3777281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_complex128 PASSED [0.8811s] [ 2%] 2025-12-04T14:02:33.3777636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_complex64 PASSED [0.0153s] [ 2%] 2025-12-04T14:02:33.3777984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_int64 XFAIL [0.0059s] [ 2%] 2025-12-04T14:02:33.3778335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_float16 PASSED [0.0151s] [ 2%] 2025-12-04T14:02:33.3778687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_int64 XFAIL [0.0056s] [ 2%] 2025-12-04T14:02:33.3779026Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_int8 XFAIL [0.0055s] [ 2%] 2025-12-04T14:02:33.3779376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_uint8 XFAIL [0.8669s] [ 2%] 2025-12-04T14:02:33.3779734Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_complex128 XFAIL [0.8481s] [ 2%] 2025-12-04T14:02:33.3780154Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_complex64 XFAIL [0.8361s] [ 2%] 2025-12-04T14:02:33.3780511Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_float16 PASSED [0.8648s] [ 2%] 2025-12-04T14:02:33.3780868Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_int32 PASSED [0.0150s] [ 2%] 2025-12-04T14:02:33.3781216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_int64 PASSED [0.0146s] [ 2%] 2025-12-04T14:02:33.3781622Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_bfloat16 PASSED [0.0127s] [ 2%] 2025-12-04T14:02:33.3782060Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_int32 PASSED [0.0135s] [ 2%] 2025-12-04T14:02:33.3782372Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_int64 PASSED [0.0136s] 
[ 2%] 2025-12-04T14:02:33.3782733Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__native_batch_norm_legit_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3783156Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3783594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_offsets_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3784013Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_offsets_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3784430Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3784844Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3785254Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3785654Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3786073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3786517Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3786965Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 
2%] 2025-12-04T14:02:33.3787454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3787910Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3788341Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__upsample_bilinear2d_aa_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3788759Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__upsample_bilinear2d_aa_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3789175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__upsample_bilinear2d_aa_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3789591Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__upsample_bilinear2d_aa_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3789940Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_bfloat16 PASSED [0.0078s] [ 2%] 2025-12-04T14:02:33.3790253Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_bool PASSED [0.8494s] [ 2%] 2025-12-04T14:02:33.3790524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_float16 PASSED [0.0043s] [ 2%] 2025-12-04T14:02:33.3790797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_float64 PASSED [0.8388s] [ 2%] 2025-12-04T14:02:33.3791165Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_bool SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T14:02:33.3791677Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_complex32 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 
2025-12-04T14:02:33.3792146Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T14:02:33.3792516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_bfloat16 PASSED [0.8423s] [ 2%] 2025-12-04T14:02:33.3792904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_complex64 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T14:02:33.3793368Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_int16 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T14:02:33.3793819Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T14:02:33.3794273Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T14:02:33.3794643Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_bfloat16 PASSED [0.0074s] [ 2%] 2025-12-04T14:02:33.3794918Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_complex128 PASSED [0.0063s] [ 2%] 2025-12-04T14:02:33.3795190Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_float64 PASSED [0.0061s] [ 2%] 2025-12-04T14:02:33.3795462Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addbmm_cuda_bfloat16 PASSED [1.2420s] [ 2%] 2025-12-04T14:02:33.3795741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcdiv_cuda_bfloat16 PASSED [0.0518s] [ 2%] 2025-12-04T14:02:33.3796025Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcdiv_cuda_complex128 PASSED [0.3631s] [ 2%] 2025-12-04T14:02:33.3796303Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_int16 PASSED [0.0081s] [ 2%] 2025-12-04T14:02:33.3796573Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_int8 PASSED [0.0076s] [ 2%] 2025-12-04T14:02:33.3796868Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmm_cuda_complex128 PASSED [0.0691s] [ 2%] 2025-12-04T14:02:33.3797145Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmm_cuda_complex64 PASSED [0.0728s] [ 2%] 2025-12-04T14:02:33.3797423Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmm_cuda_float32 PASSED [0.7885s] [ 2%] 2025-12-04T14:02:33.3797694Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmv_cuda_bfloat16 PASSED [0.0053s] [ 2%] 2025-12-04T14:02:33.3797964Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_complex128 PASSED [0.0143s] [ 2%] 2025-12-04T14:02:33.3798234Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_float32 PASSED [0.8414s] [ 2%] 2025-12-04T14:02:33.3798500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_float64 PASSED [0.0066s] [ 2%] 2025-12-04T14:02:33.3798767Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_int16 PASSED [0.0046s] [ 2%] 2025-12-04T14:02:33.3799035Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_int64 PASSED [0.0042s] [ 2%] 2025-12-04T14:02:33.3799356Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3799739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3800153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3800530Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_bool SKIPPED [0.0009s] (No inplace variable for 
this op) [ 2%] 2025-12-04T14:02:33.3800901Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3801258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3801608Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3801970Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_allclose_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3802362Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_allclose_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3802733Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_allclose_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3803095Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3803445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3803795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3804151Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3804507Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3804862Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3805213Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3805566Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3805949Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3806297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3806652Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3807014Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3807374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3807733Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3808096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3808458Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3808814Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3809171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3809555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_float16 
SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3809920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3810440Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3810818Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3811180Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3811561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3811923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3812286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3812645Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3813000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3813356Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3813710Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3814073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argsort_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 
2025-12-04T14:02:33.3814439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argsort_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3814817Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3815216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3815595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3815966Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3816331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3816694Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3817053Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3817420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3817799Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3818185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3818571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 
2025-12-04T14:02:33.3818971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3819361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3819737Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3820154Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3820528Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.3820885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_complex128 PASSED [0.0045s] [ 2%] 2025-12-04T14:02:33.3821177Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_complex32 PASSED [0.0042s] [ 2%] 2025-12-04T14:02:33.3821468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_complex64 PASSED [0.8545s] [ 2%] 2025-12-04T14:02:33.3821753Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_float32 PASSED [0.0060s] [ 2%] 2025-12-04T14:02:33.3822055Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_bfloat16 XFAIL [0.0045s] [ 2%] 2025-12-04T14:02:33.3822372Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_bool XFAIL [0.8694s] [ 2%] 2025-12-04T14:02:33.3822689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_complex128 XFAIL [0.8588s] [ 2%] 2025-12-04T14:02:33.3823010Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_complex64 XFAIL [0.8456s] [ 2%] 2025-12-04T14:02:33.3823328Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_float32 XFAIL [0.8609s] [ 2%] 2025-12-04T14:02:33.3823644Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_int16 XFAIL [0.8553s] [ 2%] 2025-12-04T14:02:33.3823956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_int64 XFAIL [0.8411s] [ 3%] 2025-12-04T14:02:33.3824340Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_complex128 SKIPPED [0.8585s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3824746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_complex64 SKIPPED [0.0015s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3825145Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3825480Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_bfloat16 PASSED [0.8612s] [ 3%] 2025-12-04T14:02:33.3825854Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_complex32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3826321Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_complex64 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3826783Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_int32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3827232Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3827685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_int16 SKIPPED [0.0009s] (Op promotes to 
float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3828158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3828540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_float32 PASSED [0.0068s] [ 3%] 2025-12-04T14:02:33.3828904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3829355Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3829717Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_float16 PASSED [0.8590s] [ 3%] 2025-12-04T14:02:33.3830002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_float32 PASSED [0.0042s] [ 3%] 2025-12-04T14:02:33.3830311Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_float64 PASSED [0.8357s] [ 3%] 2025-12-04T14:02:33.3830674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_int32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3831124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_bool SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3831486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_float64 PASSED [0.0083s] [ 3%] 2025-12-04T14:02:33.3831849Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3832313Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T14:02:33.3832732Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3833113Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3833509Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3833872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3834233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3834596Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3834956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3835325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3835699Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3836066Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3836426Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 
2025-12-04T14:02:33.3836751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_float16 PASSED [0.3636s] [ 3%] 2025-12-04T14:02:33.3837028Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_float32 PASSED [0.8640s] [ 3%] 2025-12-04T14:02:33.3838391Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bernoulli_cuda_float32 SKIPPED [0.0015s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3839211Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bernoulli_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3839590Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3839957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3840375Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3840860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3841239Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3841603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3841967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bincount_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3842326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bincount_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3842658Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_and_cuda_int8 PASSED [0.0066s] [ 3%] 2025-12-04T14:02:33.3842945Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_and_cuda_uint8 PASSED [0.0056s] [ 3%] 2025-12-04T14:02:33.3843237Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_left_shift_cuda_int8 PASSED [0.0146s] [ 3%] 2025-12-04T14:02:33.3843528Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_not_cuda_int32 PASSED [0.8814s] [ 3%] 2025-12-04T14:02:33.3843808Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_not_cuda_int64 PASSED [0.8532s] [ 3%] 2025-12-04T14:02:33.3844085Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_bool PASSED [0.0080s] [ 3%] 2025-12-04T14:02:33.3844440Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_right_shift_cuda_int32 PASSED [0.0060s] [ 3%] 2025-12-04T14:02:33.3844744Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_right_shift_cuda_int64 PASSED [0.0056s] [ 3%] 2025-12-04T14:02:33.3845045Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_right_shift_cuda_int8 PASSED [0.0056s] [ 3%] 2025-12-04T14:02:33.3845384Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3845762Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3846147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3846520Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3846889Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 
2025-12-04T14:02:33.3847247Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3847624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3847980Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3848377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3848756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3849151Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3849547Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3850014Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3850448Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3850834Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_uint8 SKIPPED [0.0012s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3851215Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3851593Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 
2025-12-04T14:02:33.3851968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3852335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3852690Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3853055Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3853415Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3853785Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3854134Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3854502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3854890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3855272Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3855647Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3856026Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3856391Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3856747Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3857100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3857435Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cauchy_cuda_float16 PASSED [0.8640s] [ 3%] 2025-12-04T14:02:33.3857767Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdist_cuda_float32 SKIPPED [0.0015s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3858124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdist_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3858484Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3858847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3859230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3859593Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3859953Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3860310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_float16 PASSED [0.8623s] [ 3%] 2025-12-04T14:02:33.3860584Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_int64 PASSED [0.8467s] [ 3%] 2025-12-04T14:02:33.3860895Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_bool SKIPPED [0.0014s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3861253Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3861614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3861968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3862325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3862681Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3863047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3863398Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3863751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3864100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3864464Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3864840Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3865215Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3865599Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_inverse_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3865992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_inverse_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3866376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_solve_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3866775Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_solve_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3867170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3867528Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3867880Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3868193Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_cuda_float16 PASSED [0.0097s] [ 3%] 2025-12-04T14:02:33.3868487Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_float32 PASSED [0.8622s] [ 3%] 2025-12-04T14:02:33.3868769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_int16 PASSED [0.0088s] [ 3%] 2025-12-04T14:02:33.3869048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_int64 PASSED [0.8463s] [ 3%] 2025-12-04T14:02:33.3869326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_bool PASSED [0.0089s] [ 3%] 2025-12-04T14:02:33.3869601Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_float16 PASSED [0.8586s] [ 3%] 2025-12-04T14:02:33.3869879Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_int8 PASSED [0.0085s] [ 3%] 2025-12-04T14:02:33.3870189Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_uint8 PASSED [0.8570s] [ 3%] 2025-12-04T14:02:33.3870505Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_int8 SKIPPED [0.0015s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3870880Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_bfloat16 SKIPPED [0.0012s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3871266Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3871642Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3872045Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3872415Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3872785Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3873161Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3873540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.3873912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_int8 SKIPPED [0.0010s] (No inplace variable 
for this op) [ 3%] 2025-12-04T14:02:33.3874281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_complex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3874642Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_complex_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3875005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3875365Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3875720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3876092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3876436Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_complex128 PASSED [0.8706s] [ 4%] 2025-12-04T14:02:33.3876738Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_complex64 PASSED [0.8509s] [ 4%] 2025-12-04T14:02:33.3877033Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_float64 PASSED [0.0041s] [ 4%] 2025-12-04T14:02:33.3877321Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_int64 PASSED [0.8487s] [ 4%] 2025-12-04T14:02:33.3877673Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_bfloat16 SKIPPED [0.0015s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3878060Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_complex64 SKIPPED [0.0013s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3878446Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_float16 SKIPPED 
[0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3878824Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3879197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3879571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3879959Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3880376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3880709Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_float16 PASSED [0.0148s] [ 4%] 2025-12-04T14:02:33.3880988Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_float32 PASSED [0.8548s] [ 4%] 2025-12-04T14:02:33.3881381Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_int64 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3881847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_int8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3882264Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3882632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3882997Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_float64 
SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3883361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3883722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3884132Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3884499Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_float16 PASSED [0.0096s] [ 4%] 2025-12-04T14:02:33.3884857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3885332Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3885713Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_float32 PASSED [0.8758s] [ 4%] 2025-12-04T14:02:33.3885984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_float64 PASSED [0.0056s] [ 4%] 2025-12-04T14:02:33.3886342Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3886773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3887147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3887519Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3887884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3888239Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3888593Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3888938Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3889282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3889634Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3889992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3890402Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3890755Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3891107Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3891457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3891816Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_float64 
SKIPPED [0.0009s] (Function is in dispatch early skips) [ 4%] 2025-12-04T14:02:33.3892178Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_int64 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 4%] 2025-12-04T14:02:33.3892533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_int8 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 4%] 2025-12-04T14:02:33.3892893Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_float64 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 4%] 2025-12-04T14:02:33.3893255Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_int8 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 4%] 2025-12-04T14:02:33.3893635Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3894037Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3894473Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3894871Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3895262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3895655Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3896005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_float16 PASSED [0.8568s] [ 4%] 2025-12-04T14:02:33.3896276Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_float32 PASSED [0.0045s] [ 4%] 2025-12-04T14:02:33.3896545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_float64 PASSED [0.8516s] [ 4%] 2025-12-04T14:02:33.3896857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_bfloat16 SKIPPED [0.0015s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3897213Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3897569Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3897917Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3898261Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3898602Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3898957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3899326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3899743Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3900144Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3900510Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_int64 SKIPPED [0.0008s] (No inplace 
variable for this op) [ 4%] 2025-12-04T14:02:33.3900869Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3901230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3901594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3901953Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3902328Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3902720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3903102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3903495Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3903887Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3904261Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3904629Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3905018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_float16 SKIPPED [0.0009s] (No inplace 
variable for this op) [ 4%] 2025-12-04T14:02:33.3905376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3905744Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3906117Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3906492Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3906851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3907166Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_bfloat16 PASSED [0.8581s] [ 4%] 2025-12-04T14:02:33.3907532Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_int32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3907990Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_int64 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3908445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3908865Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dist_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3909190Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_float32 PASSED [0.0135s] [ 4%] 2025-12-04T14:02:33.3909488Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_int8 PASSED [0.0081s] [ 4%] 2025-12-04T14:02:33.3909781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_uint8 PASSED [0.0173s] [ 4%] 2025-12-04T14:02:33.3910080Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_bfloat16 PASSED [0.0060s] [ 4%] 2025-12-04T14:02:33.3910438Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_float16 PASSED [0.0057s] [ 4%] 2025-12-04T14:02:33.3934573Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3935066Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3935546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3936022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.3936491Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_float64 PASSED [0.0063s] [ 4%] 2025-12-04T14:02:33.3936819Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dot_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3937175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dot_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3937527Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dot_cuda_float64 SKIPPED [0.0010s] (No inplace variable for 
this op) [ 4%] 2025-12-04T14:02:33.3937899Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3938256Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3938607Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3938955Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3939299Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3939648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3940005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3940394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3940744Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3941092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3941473Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3941830Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3942187Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3942541Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3942894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3943246Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3943592Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3943934Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3944275Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3944621Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3944980Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3945376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3945764Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3946127Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3946496Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.3946837Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_float16 PASSED [0.0063s] [ 4%] 2025-12-04T14:02:33.3947094Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_float64 PASSED [0.0061s] [ 5%] 2025-12-04T14:02:33.3947344Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_int8 PASSED [0.0061s] [ 5%] 2025-12-04T14:02:33.3947591Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_uint8 PASSED [0.0060s] [ 5%] 2025-12-04T14:02:33.3947885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3948230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3948571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3948869Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_float64 PASSED [0.8697s] [ 5%] 2025-12-04T14:02:33.3949217Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_uint8 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3949658Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_bool SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3950006Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_float32 PASSED [0.0063s] [ 5%] 2025-12-04T14:02:33.3950412Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3950852Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3951289Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3951641Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfinv_cuda_float16 PASSED [0.8494s] [ 5%] 2025-12-04T14:02:33.3951904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfinv_cuda_float32 PASSED [0.0065s] [ 5%] 2025-12-04T14:02:33.3952258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfinv_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3952701Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfinv_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3953141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3953578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3954009Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3954484Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3954933Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is 
impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3955381Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T14:02:33.3955748Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_float32 PASSED [0.8466s] [ 5%] 2025-12-04T14:02:33.3956004Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_float64 PASSED [0.0055s] [ 5%] 2025-12-04T14:02:33.3956315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3956672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3957018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3957374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3957736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3958093Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3958443Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3958790Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3959148Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 
2025-12-04T14:02:33.3959493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3959835Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3960216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3960556Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3960859Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_bfloat16 PASSED [0.8548s] [ 5%] 2025-12-04T14:02:33.3961118Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_float64 PASSED [0.0042s] [ 5%] 2025-12-04T14:02:33.3961388Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exponential_cuda_bfloat16 PASSED [0.0089s] [ 5%] 2025-12-04T14:02:33.3961698Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3962035Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3962384Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3962737Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3963114Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3963456Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 
2025-12-04T14:02:33.3963804Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3964158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3964530Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3964872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3965212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3965561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3965911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3966250Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3966609Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3966978Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3967342Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3967695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 
2025-12-04T14:02:33.3968067Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3968427Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3968783Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3969133Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3969482Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3969829Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3970208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3970549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3970899Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3971259Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3971628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3971990Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 
2025-12-04T14:02:33.3972337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3972680Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3973021Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3973374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3973715Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3974064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3974415Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3974766Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3975113Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3975466Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3975826Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3976181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3976533Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3976894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3977240Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3977587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3977930Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3978270Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3978622Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3978976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3979327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3979675Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3980018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3980405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3980764Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3981124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3981494Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3981859Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3982237Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3982604Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3982956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3983304Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3983650Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3983999Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3984345Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3984687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3985033Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3985383Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3985751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3986099Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3986445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3986797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3987156Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3987507Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3987860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3988212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3988563Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3988920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3989306Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3989676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3990024Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3990404Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3990765Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3991108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3991453Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3991751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_bool PASSED [0.8390s] [ 5%] 2025-12-04T14:02:33.3992009Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_complex128 PASSED [0.0052s] [ 5%] 2025-12-04T14:02:33.3992273Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_complex32 PASSED [0.0037s] [ 5%] 2025-12-04T14:02:33.3992527Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_int16 PASSED [0.8667s] [ 5%] 2025-12-04T14:02:33.3992823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3993168Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_float16 SKIPPED [0.0013s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3993516Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3993870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.3994238Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3994582Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3994922Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3995271Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3995625Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3995980Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3996336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3996686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3997034Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3997386Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3997734Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3998097Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3998469Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3998818Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3999163Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3999525Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.3999936Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 6%] 2025-12-04T14:02:33.4000394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_float16 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 6%] 2025-12-04T14:02:33.4000770Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_float64 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 6%] 2025-12-04T14:02:33.4001191Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 6%] 2025-12-04T14:02:33.4001556Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_int32 PASSED [0.8668s] [ 6%] 2025-12-04T14:02:33.4001821Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_int64 PASSED [0.0043s] [ 6%] 2025-12-04T14:02:33.4002097Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_divide_cuda_bfloat16 PASSED [0.0127s] [ 6%] 2025-12-04T14:02:33.4002382Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_divide_cuda_float32 PASSED [0.0117s] [ 6%] 2025-12-04T14:02:33.4002695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4003041Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4003404Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4003750Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4004096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4004438Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4004786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4005095Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmod_cuda_float32 PASSED [0.0061s] [ 6%] 2025-12-04T14:02:33.4005361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmod_cuda_float64 PASSED [0.0061s] [ 6%] 2025-12-04T14:02:33.4005623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmod_cuda_int32 PASSED [0.0061s] [ 6%] 2025-12-04T14:02:33.4005926Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4006282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_float32 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4006626Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4006984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4007343Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4007700Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4008055Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4008404Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4008774Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4009081Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gcd_cuda_int32 PASSED [0.0078s] [ 6%] 2025-12-04T14:02:33.4009336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_bool PASSED [0.0056s] [ 6%] 2025-12-04T14:02:33.4009601Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_bfloat16 PASSED [0.0082s] [ 6%] 2025-12-04T14:02:33.4009878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_float64 PASSED [0.0043s] [ 6%] 2025-12-04T14:02:33.4010194Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_int32 PASSED [0.0042s] [ 6%] 2025-12-04T14:02:33.4010460Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_int64 PASSED [0.0042s] [ 6%] 2025-12-04T14:02:33.4010777Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geqrf_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4011134Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geqrf_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4011492Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geqrf_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4011852Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4012238Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4012601Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4012957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4013321Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4013673Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_3d_cuda_bfloat16 SKIPPED [0.0001s] (Skipped!) 
[ 6%] 2025-12-04T14:02:33.4013965Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_float16 PASSED [0.0057s] [ 6%] 2025-12-04T14:02:33.4014221Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_float64 PASSED [0.0056s] [ 6%] 2025-12-04T14:02:33.4014529Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4014880Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4015223Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4015580Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4015966Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4016345Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4016702Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4017016Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_bool PASSED [0.0229s] [ 6%] 2025-12-04T14:02:33.4017285Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_int16 PASSED [0.0073s] [ 6%] 2025-12-04T14:02:33.4017617Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_histc_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4017971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 
2025-12-04T14:02:33.4018324Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4018682Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4019042Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4019391Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4019739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4020087Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4020443Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_bfloat16 PASSED [0.0059s] [ 6%] 2025-12-04T14:02:33.4020712Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_float16 PASSED [0.0057s] [ 6%] 2025-12-04T14:02:33.4020977Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_float32 PASSED [0.0055s] [ 6%] 2025-12-04T14:02:33.4021255Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_float64 PASSED [0.0055s] [ 6%] 2025-12-04T14:02:33.4021513Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_float16 PASSED [0.8716s] [ 6%] 2025-12-04T14:02:33.4021769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_float64 PASSED [0.0065s] [ 6%] 2025-12-04T14:02:33.4022121Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 6%] 2025-12-04T14:02:33.4022565Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 6%] 2025-12-04T14:02:33.4022962Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_imag_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4023316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_imag_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4023634Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_complex128 PASSED [0.0613s] [ 6%] 2025-12-04T14:02:33.4023912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_float16 PASSED [0.0081s] [ 6%] 2025-12-04T14:02:33.4024184Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_int16 PASSED [0.0077s] [ 6%] 2025-12-04T14:02:33.4024452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_int32 PASSED [0.8584s] [ 6%] 2025-12-04T14:02:33.4024738Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_int64 PASSED [0.0103s] [ 6%] 2025-12-04T14:02:33.4025026Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_complex32 PASSED [0.8616s] [ 6%] 2025-12-04T14:02:33.4025308Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_complex64 PASSED [0.8532s] [ 6%] 2025-12-04T14:02:33.4025587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_float32 PASSED [0.0059s] [ 6%] 2025-12-04T14:02:33.4025860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_int8 PASSED [0.0042s] [ 6%] 2025-12-04T14:02:33.4026130Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_float32 PASSED [0.0058s] [ 6%] 2025-12-04T14:02:33.4026415Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_int16 PASSED [0.0053s] [ 6%] 2025-12-04T14:02:33.4026690Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_complex64 PASSED [0.0154s] [ 6%] 2025-12-04T14:02:33.4026981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_float16 PASSED [0.0066s] [ 6%] 2025-12-04T14:02:33.4027279Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_float32 PASSED [0.0063s] [ 6%] 2025-12-04T14:02:33.4027572Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_int64 PASSED [0.0062s] [ 6%] 2025-12-04T14:02:33.4027859Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_int8 PASSED [0.0063s] [ 6%] 2025-12-04T14:02:33.4028149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_bfloat16 PASSED [0.0063s] [ 6%] 2025-12-04T14:02:33.4028444Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_float64 PASSED [0.0063s] [ 6%] 2025-12-04T14:02:33.4028734Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_uint8 PASSED [0.0062s] [ 6%] 2025-12-04T14:02:33.4029026Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_mean_cuda_bfloat16 PASSED [0.0068s] [ 6%] 2025-12-04T14:02:33.4029319Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_mean_cuda_float16 PASSED [0.0066s] [ 6%] 2025-12-04T14:02:33.4029611Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_mean_cuda_float64 PASSED [0.0065s] [ 6%] 2025-12-04T14:02:33.4029911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_mean_cuda_int32 PASSED [0.0066s] [ 6%] 2025-12-04T14:02:33.4030225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_float64 PASSED [0.0062s] [ 6%] 2025-12-04T14:02:33.4030514Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_int16 PASSED [0.0061s] [ 6%] 2025-12-04T14:02:33.4030846Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_complex32 
SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4031220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4031587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4031953Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4032316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_inner_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4032671Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_inner_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4033020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4033364Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4033730Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4034084Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4034431Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4034789Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4035164Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 
6%] 2025-12-04T14:02:33.4035519Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4035877Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4036231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4036590Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4036947Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4037300Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4037655Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4038006Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4038353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.4038697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4039058Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4039405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4039757Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4040131Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4040483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4040839Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4041185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4041527Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4041866Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4042212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4042597Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4042968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4043328Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4043680Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4044049Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4044399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4044750Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4045096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4045441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4045793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4046141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4046483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4046823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4047165Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4047517Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_istft_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4047890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4048231Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4048574Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4048924Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4049270Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4049610Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4049981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4050425Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4050837Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4051258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4051715Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4052162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4052587Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4053003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4053408Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4053795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4054181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4054561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4054959Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4055380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4055795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4056205Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4056601Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4057003Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4057369Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4057728Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4058098Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4058467Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4058829Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4059142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lcm_cuda_int32 PASSED [0.0091s] [ 7%] 2025-12-04T14:02:33.4059406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lcm_cuda_int64 PASSED [0.3153s] [ 7%] 2025-12-04T14:02:33.4059781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_complex128 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T14:02:33.4060285Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T14:02:33.4060764Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T14:02:33.4061236Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float 
input) [ 7%] 2025-12-04T14:02:33.4061606Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_float16 PASSED [0.0059s] [ 7%] 2025-12-04T14:02:33.4061876Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_float64 PASSED [0.0056s] [ 7%] 2025-12-04T14:02:33.4062137Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_int32 PASSED [0.0055s] [ 7%] 2025-12-04T14:02:33.4062408Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_int8 PASSED [0.0055s] [ 7%] 2025-12-04T14:02:33.4062674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_bfloat16 PASSED [0.0144s] [ 7%] 2025-12-04T14:02:33.4062948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_complex128 PASSED [0.7332s] [ 7%] 2025-12-04T14:02:33.4063223Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_complex32 PASSED [1.6062s] [ 7%] 2025-12-04T14:02:33.4063494Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_complex64 PASSED [0.7307s] [ 7%] 2025-12-04T14:02:33.4063767Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_float32 PASSED [0.0051s] [ 7%] 2025-12-04T14:02:33.4064128Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T14:02:33.4064586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T14:02:33.4065012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4065405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4065797Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4066218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_ex_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4066616Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_ex_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4067001Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cond_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4067378Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cond_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4067756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4068135Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4068517Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4068895Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_det_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4069265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_det_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4069639Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4070039Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_float32 SKIPPED [0.0010s] (No inplace variable 
for this op) [ 7%] 2025-12-04T14:02:33.4070469Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eig_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4070637Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eig_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4070807Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eig_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4070983Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eig_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4071159Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigh_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4071323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigh_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4071502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvals_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4071672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvalsh_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4071908Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_householder_product_cuda_complex128 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 7%] 2025-12-04T14:02:33.4072141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_householder_product_cuda_float64 SKIPPED [0.0006s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 7%] 2025-12-04T14:02:33.4072310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_inv_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4072496Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4072691Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4072913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_solve_cuda_complex64 SKIPPED [0.0007s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 7%] 2025-12-04T14:02:33.4073126Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_solve_cuda_float64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 7%] 2025-12-04T14:02:33.4073302Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4073470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4073667Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_grad_oriented_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4073861Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_grad_oriented_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4074036Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4074214Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_solve_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4074395Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_power_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4074595Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_power_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4074789Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4074988Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_hermitian_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4075177Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_hermitian_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4075372Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4075552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4075720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4075895Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4076093Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_subgradients_at_zero_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4076295Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4076489Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4076710Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_singular_cuda_complex64 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 7%] 2025-12-04T14:02:33.4076923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_singular_cuda_float32 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 7%] 2025-12-04T14:02:33.4077101Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_qr_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4077279Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_ex_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4077461Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_triangular_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4077648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_triangular_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4077817Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svd_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4077996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svdvals_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4078173Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorinv_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4078353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorsolve_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4078524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4078694Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4078888Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4079057Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4079235Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.4079403Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4079595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4079769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4079949Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4080146Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4080345Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4080535Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4080716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_int64 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4080904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4081023Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log10_cuda_float16 PASSED [0.0038s] [ 8%] 2025-12-04T14:02:33.4081249Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log10_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.4081455Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log10_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.4081580Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_float16 PASSED [0.8515s] [ 8%] 2025-12-04T14:02:33.4081695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_float32 PASSED [0.0044s] [ 8%] 2025-12-04T14:02:33.4081905Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.4082116Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_int8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.4082326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.4082534Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.4082649Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_bfloat16 PASSED [0.8457s] [ 8%] 2025-12-04T14:02:33.4082769Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_float32 PASSED [0.0055s] [ 8%] 2025-12-04T14:02:33.4082984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.4083203Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.4083327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_normal_cuda_bfloat16 PASSED [0.0088s] [ 8%] 2025-12-04T14:02:33.4083452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_normal_cuda_float32 PASSED [0.0045s] [ 8%] 2025-12-04T14:02:33.4083619Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4083817Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4084004Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4084181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4084353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp2_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4084518Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp2_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4084687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4084851Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4085029Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4085200Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4085383Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4085556Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4085717Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logdet_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4085882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logdet_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4086039Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logdet_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4086171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_bfloat16 PASSED [0.8601s] [ 8%] 2025-12-04T14:02:33.4086296Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_complex64 PASSED [0.2753s] [ 8%] 2025-12-04T14:02:33.4086422Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_float32 PASSED [0.0057s] [ 8%] 2025-12-04T14:02:33.4086540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_int64 PASSED [0.0053s] [ 8%] 2025-12-04T14:02:33.4086664Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_uint8 PASSED [0.0052s] [ 8%] 2025-12-04T14:02:33.4086782Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_bool PASSED [0.8585s] [ 8%] 2025-12-04T14:02:33.4086910Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_float32 PASSED [0.0052s] [ 8%] 2025-12-04T14:02:33.4087044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_float64 PASSED [0.0037s] [ 8%] 2025-12-04T14:02:33.4087179Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_int64 PASSED [0.8716s] [ 8%] 2025-12-04T14:02:33.4087303Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_int8 PASSED [0.0052s] [ 8%] 2025-12-04T14:02:33.4087421Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_uint8 PASSED [0.0036s] [ 8%] 2025-12-04T14:02:33.4087553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_complex64 PASSED [0.2723s] [ 8%] 2025-12-04T14:02:33.4087672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_float32 PASSED [0.0055s] [ 8%] 2025-12-04T14:02:33.4087810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_float64 PASSED [0.0052s] [ 8%] 2025-12-04T14:02:33.4087927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_int64 PASSED [0.0051s] [ 8%] 2025-12-04T14:02:33.4088053Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_int64 PASSED [0.0052s] [ 8%] 2025-12-04T14:02:33.4088171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_uint8 PASSED [0.0051s] [ 8%] 2025-12-04T14:02:33.4088296Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_bfloat16 PASSED [0.0058s] [ 8%] 2025-12-04T14:02:33.4088506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.4088676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_int32 SKIPPED [0.0009s] (No inplace 
variable for this op) [ 8%] 2025-12-04T14:02:33.4088837Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4089005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4089201Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4089387Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4089592Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4089772Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4089961Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4090248Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4090422Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4090584Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4090748Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4090911Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4091069Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4091231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4091399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4091705Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4091861Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4091980Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_bfloat16 PASSED [0.8602s] [ 8%] 2025-12-04T14:02:33.4092091Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_float16 PASSED [0.0077s] [ 8%] 2025-12-04T14:02:33.4092219Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_int16 PASSED [0.0058s] [ 8%] 2025-12-04T14:02:33.4092326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_int32 PASSED [0.0056s] [ 8%] 2025-12-04T14:02:33.4092438Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_int64 PASSED [0.0056s] [ 8%] 2025-12-04T14:02:33.4092599Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4092758Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4092929Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_solve_cuda_complex128 SKIPPED [0.0010s] (No inplace 
variable for this op) [ 8%] 2025-12-04T14:02:33.4093094Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_solve_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4093261Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_solve_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4093423Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_unpack_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4093589Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_unpack_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4093745Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4093922Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4094075Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4094232Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4094381Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4094535Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4094695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4094848Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4095002Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4095171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4095341Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4095516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4095693Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4095855Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4096020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4096181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4096357Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4096533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4096698Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4096869Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4097036Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4097203Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4097376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4097549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4097716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4097898Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4098069Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4098234Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4098403Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4098567Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4098736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4098899Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 
2025-12-04T14:02:33.4099029Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_complex64 PASSED [0.0068s] [ 8%] 2025-12-04T14:02:33.4099149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_float16 PASSED [0.0063s] [ 8%] 2025-12-04T14:02:33.4099272Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_int32 PASSED [0.0063s] [ 8%] 2025-12-04T14:02:33.4099390Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_int64 PASSED [0.0063s] [ 8%] 2025-12-04T14:02:33.4099521Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_int8 PASSED [0.0063s] [ 8%] 2025-12-04T14:02:33.4099716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_log_softmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4099888Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logaddexp_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4100073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4100278Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4100468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4100636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4100814Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4100978Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_float16 SKIPPED [0.0009s] (No inplace 
variable for this op) [ 8%] 2025-12-04T14:02:33.4101151Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.4101318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4101498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_normalize_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4101672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4101836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4101967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_bool PASSED [0.8868s] [ 9%] 2025-12-04T14:02:33.4102111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_complex128 PASSED [0.0064s] [ 9%] 2025-12-04T14:02:33.4102245Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_float32 PASSED [0.0043s] [ 9%] 2025-12-04T14:02:33.4102371Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_int64 PASSED [0.0041s] [ 9%] 2025-12-04T14:02:33.4102501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_int8 PASSED [0.8636s] [ 9%] 2025-12-04T14:02:33.4102625Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_uint8 PASSED [0.0061s] [ 9%] 2025-12-04T14:02:33.4102802Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4102976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for 
this op) [ 9%] 2025-12-04T14:02:33.4103149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4103324Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4103495Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_softmax_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4103673Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_softmin_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4103856Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4104045Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4104209Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4104378Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4104555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4104723Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4104884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4105058Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_complex64 SKIPPED [0.0010s] (No inplace 
variable for this op) [ 9%] 2025-12-04T14:02:33.4105227Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4105390Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4105557Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4105718Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matmul_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4105883Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matmul_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4106047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4106233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4106400Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4106570Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4106741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4106901Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4107068Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_int32 SKIPPED [0.0009s] (No inplace variable for 
this op) [ 9%] 2025-12-04T14:02:33.4107262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_pool2d_with_indices_backward_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4107439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_no_dim_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4107620Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4107800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4107983Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4108158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4108316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4108482Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4108655Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4108812Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4108975Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mean_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4109131Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mean_cuda_float32 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4109291Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mean_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4109451Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4109614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4109770Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4109957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4110178Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4110375Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4110559Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4110742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4110932Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4111096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4111264Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4111424Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4111604Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4111779Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4111956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4112147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4112317Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4112484Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4112640Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4112801Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4112971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4113129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4113286Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4113446Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4113612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4113767Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4113926Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4114084Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4114246Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4114402Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4114572Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4114690Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_float32 PASSED [0.8583s] [ 9%] 2025-12-04T14:02:33.4114808Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_uint8 PASSED [0.0069s] [ 9%] 2025-12-04T14:02:33.4114979Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_multinomial_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4115152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_multinomial_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 
2025-12-04T14:02:33.4115328Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_multinomial_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4115468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16 PASSED [0.8903s] [ 9%] 2025-12-04T14:02:33.4115611Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float16 PASSED [0.0143s] [ 9%] 2025-12-04T14:02:33.4115843Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 9%] 2025-12-04T14:02:33.4115988Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float16 PASSED [0.0104s] [ 9%] 2025-12-04T14:02:33.4116122Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float32 PASSED [0.0103s] [ 9%] 2025-12-04T14:02:33.4116377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 9%] 2025-12-04T14:02:33.4116522Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float16 PASSED [0.0102s] [ 9%] 2025-12-04T14:02:33.4116754Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 9%] 2025-12-04T14:02:33.4116987Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 9%] 2025-12-04T14:02:33.4117229Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 9%] 2025-12-04T14:02:33.4117465Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 9%] 2025-12-04T14:02:33.4117587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_float64 PASSED [0.8835s] [ 9%] 2025-12-04T14:02:33.4117715Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_int8 PASSED [0.0050s] [ 9%] 2025-12-04T14:02:33.4117833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_uint8 PASSED [0.0036s] [ 9%] 2025-12-04T14:02:33.4118002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4118170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4118338Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4118501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4118674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmedian_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4118857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanquantile_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4119016Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4119181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4119336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_int8 SKIPPED 
[0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4119500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4119667Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4119843Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4120008Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4120215Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4120376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4120558Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4120741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4120900Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4121065Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4121231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4121409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_batch_norm_cuda_float32 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4121592Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_dropout_backward_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4121775Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_layer_norm_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4121948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_layer_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4122064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_bool PASSED [0.0059s] [ 9%] 2025-12-04T14:02:33.4122186Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_complex64 PASSED [0.0057s] [ 9%] 2025-12-04T14:02:33.4122300Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_float64 PASSED [0.0056s] [ 9%] 2025-12-04T14:02:33.4122414Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_int32 PASSED [0.0056s] [ 9%] 2025-12-04T14:02:33.4122522Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_int64 PASSED [0.0056s] [ 9%] 2025-12-04T14:02:33.4122642Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_bfloat16 PASSED [0.8637s] [ 9%] 2025-12-04T14:02:33.4122751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_int64 PASSED [0.0044s] [ 9%] 2025-12-04T14:02:33.4122885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_uint8 PASSED [0.8543s] [ 9%] 2025-12-04T14:02:33.4123044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4123209Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_int64 SKIPPED [0.0013s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4123378Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_bool SKIPPED [0.0011s] (No 
inplace variable for this op) [ 9%] 2025-12-04T14:02:33.4123562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_complex32 SKIPPED [0.0012s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4123739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4123919Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4124096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4124262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4124432Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4124598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4124771Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4124926Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4125086Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4125249Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4125430Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_complex128 
SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4125595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4125763Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4125927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4126086Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4126212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nextafter_cuda_float32 PASSED [0.0068s] [ 10%] 2025-12-04T14:02:33.4126409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4126613Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4126807Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4127020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4127214Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4127414Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 
2025-12-04T14:02:33.4127612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4127807Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4128005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4128200Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4128353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_alpha_dropout_cuda_bfloat16 PASSED [0.0253s] [ 10%] 2025-12-04T14:02:33.4128537Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4128726Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_batch_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4128921Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_batch_norm_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4129117Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_bilinear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4129321Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4129545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16 SKIPPED [0.0009s] (No inplace 
variable for this op) [ 10%] 2025-12-04T14:02:33.4129763Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4129973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4130152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_bfloat16 PASSED [0.0115s] [ 10%] 2025-12-04T14:02:33.4130287Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_float16 PASSED [0.0039s] [ 10%] 2025-12-04T14:02:33.4130485Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4130682Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4130876Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4131071Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4131261Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4131474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4131652Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 
10%] 2025-12-04T14:02:33.4131838Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv1d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4132015Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv1d_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4132206Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv2d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4132385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4132569Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4132772Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4132967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_complex32 SKIPPED [0.0008s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4133181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4133385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4133582Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4133779Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_complex64 SKIPPED [0.0009s] (No inplace 
variable for this op) [ 10%] 2025-12-04T14:02:33.4133991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4134191Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_embedding_loss_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4134385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_similarity_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4134585Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_similarity_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4134777Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cross_entropy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4134962Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_ctc_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4135141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_ctc_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4135291Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout2d_cuda_bfloat16 PASSED [0.0100s] [ 10%] 2025-12-04T14:02:33.4135432Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout2d_cuda_float16 PASSED [0.8712s] [ 10%] 2025-12-04T14:02:33.4135576Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout2d_cuda_float64 PASSED [0.0119s] [ 10%] 2025-12-04T14:02:33.4135723Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_float32 PASSED [0.0117s] [ 10%] 2025-12-04T14:02:33.4135867Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_float64 PASSED [0.0112s] [ 10%] 2025-12-04T14:02:33.4136011Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout_cuda_bfloat16 PASSED [0.0127s] [ 10%] 2025-12-04T14:02:33.4136147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout_cuda_float16 PASSED [0.0125s] [ 10%] 2025-12-04T14:02:33.4136342Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_bag_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4136531Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_bag_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4136707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16 PASSED [0.0080s] [ 10%] 2025-12-04T14:02:33.4136874Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float16 PASSED [0.0079s] [ 10%] 2025-12-04T14:02:33.4137047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32 PASSED [0.8656s] [ 10%] 2025-12-04T14:02:33.4137220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16 PASSED [0.0079s] [ 10%] 2025-12-04T14:02:33.4137410Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex64 PASSED [0.0065s] [ 10%] 2025-12-04T14:02:33.4137596Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int64 PASSED [0.0062s] [ 10%] 2025-12-04T14:02:33.4137797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 
2025-12-04T14:02:33.4138000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4138208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4138404Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4138596Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4138777Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_gelu_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4138948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_glu_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4139137Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_grid_sample_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4139324Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_group_norm_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4139508Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardshrink_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4139698Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardshrink_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4139853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardsigmoid_cuda_float16 PASSED [0.8736s] [ 10%] 
2025-12-04T14:02:33.4140038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardswish_cuda_float16 SKIPPED [0.0015s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4140254Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardswish_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4140441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardswish_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4140624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4140810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4140994Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4141171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4141350Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4141527Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4141743Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4141951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 
2025-12-04T14:02:33.4142139Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_huber_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4142320Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_huber_loss_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4142521Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_huber_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4142712Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_instance_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4142905Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_area_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4143108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_area_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4143301Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_area_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4143506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_bicubic_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4143702Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_bicubic_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4143908Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_bilinear_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4144105Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_linear_cuda_float16 SKIPPED 
[0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4144335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4144551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4144748Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4144958Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_trilinear_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4145159Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_trilinear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4145368Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_trilinear_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4145547Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_kl_div_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4145734Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4145915Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4146109Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4146309Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_layer_norm_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4146493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_layer_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4146641Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_leaky_relu_cuda_float16 PASSED [0.0158s] [ 10%] 2025-12-04T14:02:33.4146836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_linear_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4147025Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_linear_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4147221Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_local_response_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4147421Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_local_response_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4147605Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_logsigmoid_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4147796Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_logsigmoid_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4147997Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4148188Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4148386Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T14:02:33.4148586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4148773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4148962Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool1d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4149147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4149337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4149523Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4149712Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4149896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4150131Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_grad_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4150336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_grad_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 
2025-12-04T14:02:33.4150545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_grad_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4150731Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4150923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4151130Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_grad_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4151316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4151507Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4151698Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4151882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mse_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4152074Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multi_margin_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4152279Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_margin_loss_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4152478Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float32 
SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4152684Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4152913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4153117Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4153327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4153529Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4153716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_nll_loss_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4153895Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_nll_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4154078Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_nll_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4154265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_normalize_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4154452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_normalize_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4154642Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_one_hot_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4154838Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_circular_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4155027Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_circular_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4155210Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_circular_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4155411Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4155594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4155786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4155972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4156152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4156339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4156534Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 
2025-12-04T14:02:33.4156735Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4156930Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4157136Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4157330Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4157525Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4157718Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4157910Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4158102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4158288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4158477Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4158658Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_int64 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4159092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4159290Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4159484Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4159676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4159872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4160072Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4160300Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4160494Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4160683Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4160872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4161052Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_prelu_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4161236Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4161424Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4161611Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4161792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4161963Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4162144Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4162317Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4162507Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_rms_norm_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4162716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4162931Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4163075Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_selu_cuda_bfloat16 PASSED [0.8657s] [ 11%] 2025-12-04T14:02:33.4163208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_selu_cuda_float16 PASSED [0.0065s] [ 11%] 2025-12-04T14:02:33.4163359Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_selu_cuda_float64 PASSED [0.0046s] [ 11%] 2025-12-04T14:02:33.4163522Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_complex_cuda_complex64 PASSED [0.0125s] [ 11%] 2025-12-04T14:02:33.4163659Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_cuda_bfloat16 PASSED [0.8687s] [ 11%] 2025-12-04T14:02:33.4163791Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_cuda_float64 PASSED [0.0053s] [ 11%] 2025-12-04T14:02:33.4163980Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4164171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4164374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4164571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4164772Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4164962Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softshrink_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4165145Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softshrink_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4165334Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4165520Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4165708Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4165896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4166080Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4166258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4166439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4166622Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4166805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4166992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4167172Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4167356Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4167536Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4167689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_threshold_cuda_float64 PASSED [0.0180s] [ 11%] 2025-12-04T14:02:33.4167833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_threshold_cuda_int8 PASSED [0.0048s] [ 11%] 2025-12-04T14:02:33.4168035Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4168239Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4171859Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4172054Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4172247Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4172462Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4172673Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4172883Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4173087Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4173295Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4173521Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4173700Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4173878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4174072Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_bilinear_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4174265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_bilinear_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4174426Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4174586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_int8 SKIPPED [0.0009s] 
(No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4174737Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_bfloat16 SKIPPED [0.0005s] (Only runs on cpu) [ 11%] 2025-12-04T14:02:33.4174888Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_float64 SKIPPED [0.0005s] (Only runs on cpu) [ 11%] 2025-12-04T14:02:33.4175038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_int32 SKIPPED [0.0005s] (Only runs on cpu) [ 11%] 2025-12-04T14:02:33.4175209Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4175376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4175528Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4175691Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_fro_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4175852Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_fro_cuda_complex64 SKIPPED [0.0008s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4176023Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_inf_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4176186Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_nuc_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4176345Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_nuc_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4176502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 
2025-12-04T14:02:33.4176660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4176819Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T14:02:33.4176946Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_bfloat16 PASSED [0.8661s] [ 11%] 2025-12-04T14:02:33.4177073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_float32 PASSED [0.0055s] [ 12%] 2025-12-04T14:02:33.4177229Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4177388Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4177554Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4177708Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4177868Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4178020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4178187Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4178352Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4178510Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4178669Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ormqr_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4178825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ormqr_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4178976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4179131Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4179292Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4179453Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4179617Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pca_lowrank_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4179784Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4179960Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4180160Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4180316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4180478Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4180640Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4180794Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4180950Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4181111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pinverse_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4181267Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polar_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4181406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_bfloat16 PASSED [0.8844s] [ 12%] 2025-12-04T14:02:33.4181655Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4181795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_float32 PASSED [0.0101s] [ 12%] 2025-12-04T14:02:33.4182022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4182247Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4182470Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4182607Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_float32 PASSED [0.0065s] [ 12%] 2025-12-04T14:02:33.4182833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4183057Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4183281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4183524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4183674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_float32 PASSED [0.8798s] [ 12%] 2025-12-04T14:02:33.4183898Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4184154Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_int64 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4184290Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_float64 PASSED [0.0103s] [ 12%] 2025-12-04T14:02:33.4184515Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4184739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4184903Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4185068Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4185231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4185391Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4185548Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4185703Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4185827Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_float32 PASSED [0.0059s] [ 12%] 2025-12-04T14:02:33.4185938Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_float64 PASSED [0.0056s] [ 12%] 2025-12-04T14:02:33.4186045Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_int64 PASSED [0.0056s] [ 12%] 2025-12-04T14:02:33.4186150Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_int8 PASSED [0.0055s] [ 12%] 2025-12-04T14:02:33.4186256Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_uint8 
PASSED [0.0055s] [ 12%] 2025-12-04T14:02:33.4186414Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4186570Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4186722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4186874Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4186981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_float32 PASSED [0.0155s] [ 12%] 2025-12-04T14:02:33.4187090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_float64 PASSED [0.0148s] [ 12%] 2025-12-04T14:02:33.4187196Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_int16 PASSED [0.0149s] [ 12%] 2025-12-04T14:02:33.4187315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_uint8 PASSED [0.0149s] [ 12%] 2025-12-04T14:02:33.4187481Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_qr_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4187598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_bfloat16 PASSED [0.8762s] [ 12%] 2025-12-04T14:02:33.4187805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_bool SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4188010Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_int8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4188225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_uint8 SKIPPED [0.0011s] (Op promotes to 
float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4188385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_float16 SKIPPED [0.0012s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4188546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4188705Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4188863Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4189019Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4189172Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4189326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4189486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_like_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4189648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4189805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4189957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4190150Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_bfloat16 SKIPPED [0.0009s] 
(No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4190315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4190468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4190624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4190778Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4190931Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4191081Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4191231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4191394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4191527Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_bfloat16 PASSED [0.0045s] [ 12%] 2025-12-04T14:02:33.4191740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4191859Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_float16 PASSED [0.0036s] [ 12%] 2025-12-04T14:02:33.4191978Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_float64 PASSED [0.8686s] [ 12%] 2025-12-04T14:02:33.4192203Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_int8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T14:02:33.4192321Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_bfloat16 PASSED [0.8900s] [ 12%] 2025-12-04T14:02:33.4192436Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_int64 PASSED [0.0080s] [ 12%] 2025-12-04T14:02:33.4192551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_int8 PASSED [0.0063s] [ 12%] 2025-12-04T14:02:33.4192663Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_bfloat16 PASSED [0.0231s] [ 12%] 2025-12-04T14:02:33.4192777Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_float32 PASSED [0.8723s] [ 12%] 2025-12-04T14:02:33.4192937Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_complex64 SKIPPED [0.0016s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4193097Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_float16 SKIPPED [0.0013s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4193252Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4193405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_int64 SKIPPED [0.0012s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4193574Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4193758Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4193930Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 
2025-12-04T14:02:33.4194099Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4194264Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4194427Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4194587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4194745Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4194903Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4195059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4195215Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4195383Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4195552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4195706Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4195857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 
2025-12-04T14:02:33.4196022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4196141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_complex128 PASSED [0.0045s] [ 12%] 2025-12-04T14:02:33.4196258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_complex64 PASSED [0.8709s] [ 12%] 2025-12-04T14:02:33.4196370Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_float16 PASSED [0.0053s] [ 12%] 2025-12-04T14:02:33.4196483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_int32 PASSED [0.0038s] [ 12%] 2025-12-04T14:02:33.4196595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_bool PASSED [0.8642s] [ 12%] 2025-12-04T14:02:33.4196710Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_float32 PASSED [0.0054s] [ 12%] 2025-12-04T14:02:33.4196827Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_float64 PASSED [0.0038s] [ 12%] 2025-12-04T14:02:33.4196939Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_int32 PASSED [0.8589s] [ 12%] 2025-12-04T14:02:33.4197100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_bool SKIPPED [0.0015s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4197269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_complex128 SKIPPED [0.0013s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4197439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4197619Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_float16 SKIPPED [0.0012s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4197777Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_int8 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4197945Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4198110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4198259Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T14:02:33.4198417Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4198570Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4198722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4198875Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4199022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4199188Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4199347Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4199501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4199652Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 
2025-12-04T14:02:33.4199803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4199925Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_float32 PASSED [0.8855s] [ 13%] 2025-12-04T14:02:33.4200038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_float64 PASSED [0.0044s] [ 13%] 2025-12-04T14:02:33.4200220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_neg_3_cuda_float16 PASSED [0.0043s] [ 13%] 2025-12-04T14:02:33.4200350Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_neg_3_cuda_float32 PASSED [0.8667s] [ 13%] 2025-12-04T14:02:33.4200462Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_float16 PASSED [0.0056s] [ 13%] 2025-12-04T14:02:33.4200616Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4200765Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4200914Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4201063Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4201228Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4201393Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4201574Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 
2025-12-04T14:02:33.4201737Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4201900Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4202059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4202185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_complex128 PASSED [0.0145s] [ 13%] 2025-12-04T14:02:33.4202305Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_float64 PASSED [0.0070s] [ 13%] 2025-12-04T14:02:33.4202419Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_int8 PASSED [0.0069s] [ 13%] 2025-12-04T14:02:33.4202535Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_uint8 PASSED [0.0069s] [ 13%] 2025-12-04T14:02:33.4202646Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_bool PASSED [0.0121s] [ 13%] 2025-12-04T14:02:33.4202757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_float64 PASSED [0.0161s] [ 13%] 2025-12-04T14:02:33.4202867Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_int32 PASSED [0.0118s] [ 13%] 2025-12-04T14:02:33.4202974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_int64 PASSED [0.0119s] [ 13%] 2025-12-04T14:02:33.4203114Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amax_cuda_int32 PASSED [0.0138s] [ 13%] 2025-12-04T14:02:33.4203251Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amax_cuda_int8 PASSED [0.0136s] [ 13%] 2025-12-04T14:02:33.4203382Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_bfloat16 PASSED [0.0146s] [ 13%] 2025-12-04T14:02:33.4203511Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_float32 PASSED [0.0146s] [ 13%] 2025-12-04T14:02:33.4203638Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_float64 PASSED [0.0147s] [ 13%] 2025-12-04T14:02:33.4203775Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_int8 PASSED [0.0145s] [ 13%] 2025-12-04T14:02:33.4203902Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_float32 PASSED [0.0137s] [ 13%] 2025-12-04T14:02:33.4204028Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_int16 PASSED [0.0136s] [ 13%] 2025-12-04T14:02:33.4204154Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_int64 PASSED [0.0136s] [ 13%] 2025-12-04T14:02:33.4204277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_int16 PASSED [0.0136s] [ 13%] 2025-12-04T14:02:33.4204399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_int32 PASSED [0.0136s] [ 13%] 2025-12-04T14:02:33.4204521Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_int8 PASSED [0.0136s] [ 13%] 2025-12-04T14:02:33.4204688Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4204851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4205013Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4205175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4205340Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_bool 
SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4205500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4205654Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4205806Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4205968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4206131Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4206243Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_bfloat16 PASSED [0.8922s] [ 13%] 2025-12-04T14:02:33.4206357Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_complex128 PASSED [0.0065s] [ 13%] 2025-12-04T14:02:33.4206468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_complex32 PASSED [0.9987s] [ 13%] 2025-12-04T14:02:33.4206576Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_complex64 PASSED [0.0066s] [ 13%] 2025-12-04T14:02:33.4206683Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_int64 PASSED [0.8566s] [ 13%] 2025-12-04T14:02:33.4206839Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_bfloat16 SKIPPED [0.0016s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4207003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_float16 SKIPPED [0.0014s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4207165Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_int32 SKIPPED [0.0013s] (No inplace variable for this op) [ 13%] 
2025-12-04T14:02:33.4207315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_int8 SKIPPED [0.0014s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4207466Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_uint8 SKIPPED [0.0012s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4207591Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_bfloat16 PASSED [0.0056s] [ 13%] 2025-12-04T14:02:33.4207797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4207912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_float32 PASSED [0.8662s] [ 13%] 2025-12-04T14:02:33.4208120Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4208325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_uint8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4208435Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_int16 PASSED [0.0038s] [ 13%] 2025-12-04T14:02:33.4208543Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_int32 PASSED [0.8676s] [ 13%] 2025-12-04T14:02:33.4208651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_int64 PASSED [0.0043s] [ 13%] 2025-12-04T14:02:33.4208757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_int8 PASSED [0.8776s] [ 13%] 2025-12-04T14:02:33.4208863Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_uint8 PASSED [0.0044s] [ 13%] 2025-12-04T14:02:33.4209044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_bartlett_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 13%] 
2025-12-04T14:02:33.4209233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_cosine_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4209416Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_exponential_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4209599Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_exponential_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4209785Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_general_cosine_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4209957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_hann_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4210170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_nuttall_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4210329Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4210439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_float32 PASSED [0.8717s] [ 13%] 2025-12-04T14:02:33.4210548Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_float64 PASSED [0.0044s] [ 13%] 2025-12-04T14:02:33.4210755Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4210971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_uint8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4211190Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_complex128 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4211397Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4211594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4211812Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4212011Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4212217Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4212327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_float32 PASSED [0.8781s] [ 13%] 2025-12-04T14:02:33.4212526Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_int32 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4212722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_int8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4212920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T14:02:33.4213084Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4213241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4213407Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4213557Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4213724Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4213884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4214048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4214213Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4214377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4214537Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4214697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4214853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4215031Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_with_dtype_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4215197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4215350Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4215498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4215655Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4215803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4215953Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4216132Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_sampled_addmm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4216307Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4216469Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4216632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4216792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4216957Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4217121Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4217298Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4217465Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4217628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4217792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4217955Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4218124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4218287Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4218451Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4218638Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4218826Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4219024Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T14:02:33.4219216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4219402Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4219586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4219784Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4219966Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4220189Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_w_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4220354Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4220517Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4220680Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4220839Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4221000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4221164Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_erfcx_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4221339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_erfcx_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4221499Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_erfcx_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4221683Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4221867Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4222049Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4222211Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4222371Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4222531Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4222687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_int16 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4222847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4223020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4223191Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4223348Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4223502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4223657Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4223822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4223980Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4224144Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4224342Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4224528Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4224721Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4224906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4225100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4225269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4225452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4225618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4225792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4225984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4226166Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4226356Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4226539Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4226724Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4226905Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4227095Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4227287Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4227487Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4227673Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4227853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4228047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4228214Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4228384Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4228548Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4228718Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4228887Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4229058Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4229227Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4229392Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4229610Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4229812Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4230020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4230252Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4230458Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4230658Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 
2025-12-04T14:02:33.4230853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4231050Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4231243Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k0_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4231456Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k1_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4231662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k1_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4231870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4232069Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4232284Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4232483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4232689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4232898Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4233100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4233307Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4233506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4233713Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_bool SKIPPED [0.0008s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4233912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4234133Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4234338Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4234521Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4234712Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4234896Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4235071Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4235243Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4235420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4235590Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4235772Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4235951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4236117Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4236285Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4236439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4236616Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4236775Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 
2025-12-04T14:02:33.4236940Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4237095Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4237257Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4237412Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4237590Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4237763Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4237933Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4238111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4238288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4238459Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4238633Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4238822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_complex128 SKIPPED [0.0010s] (No inplace 
variable for this op) [ 14%] 2025-12-04T14:02:33.4239003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4239187Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4239367Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4239541Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4239720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4239892Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4240123Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4240299Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4240477Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4240649Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4240836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4241049Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T14:02:33.4241255Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T14:02:33.4241464Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T14:02:33.4241584Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_bfloat16 PASSED [0.8684s] [ 14%] 2025-12-04T14:02:33.4241705Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_bool XFAIL [0.0046s] [ 14%] 2025-12-04T14:02:33.4241827Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_complex128 PASSED [0.8733s] [ 14%] 2025-12-04T14:02:33.4241951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_float64 PASSED [0.0041s] [ 14%] 2025-12-04T14:02:33.4242065Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_int16 PASSED [0.8671s] [ 14%] 2025-12-04T14:02:33.4242184Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_int64 PASSED [0.0057s] [ 14%] 2025-12-04T14:02:33.4242374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4242547Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4242710Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T14:02:33.4242878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for 
this op) [ 14%] 2025-12-04T14:02:33.4242997Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_bool PASSED [0.8811s] [ 14%] 2025-12-04T14:02:33.4243118Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_complex64 PASSED [0.0067s] [ 15%] 2025-12-04T14:02:33.4243241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_float16 PASSED [0.8848s] [ 15%] 2025-12-04T14:02:33.4243354Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_int8 PASSED [0.0072s] [ 15%] 2025-12-04T14:02:33.4243473Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_uint8 PASSED [0.8788s] [ 15%] 2025-12-04T14:02:33.4243605Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_complex32 PASSED [0.0069s] [ 15%] 2025-12-04T14:02:33.4243740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_float16 PASSED [0.0048s] [ 15%] 2025-12-04T14:02:33.4243869Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_float64 PASSED [0.0046s] [ 15%] 2025-12-04T14:02:33.4244022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_int16 PASSED [0.0045s] [ 15%] 2025-12-04T14:02:33.4244163Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_int64 PASSED [0.8720s] [ 15%] 2025-12-04T14:02:33.4244331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_bfloat16 SKIPPED [0.0015s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4244495Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_complex128 SKIPPED [0.0013s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4244660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4244833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 
2025-12-04T14:02:33.4244996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4245171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4245350Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_unbiased_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4245529Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4245696Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4245870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4245985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_bfloat16 PASSED [0.0072s] [ 15%] 2025-12-04T14:02:33.4246108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_float16 PASSED [0.0062s] [ 15%] 2025-12-04T14:02:33.4246220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_int16 PASSED [0.0061s] [ 15%] 2025-12-04T14:02:33.4246336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_int64 PASSED [0.0061s] [ 15%] 2025-12-04T14:02:33.4246462Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_uint8 PASSED [0.0061s] [ 15%] 2025-12-04T14:02:33.4246624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4246785Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4246944Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4247106Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4247262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4247420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4247582Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4247756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4247921Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4248090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4248266Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4248433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4248598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4248752Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4248927Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4249035Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_bool PASSED [0.8934s] [ 15%] 2025-12-04T14:02:33.4249156Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_complex128 PASSED [0.0053s] [ 15%] 2025-12-04T14:02:33.4249267Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_float64 PASSED [0.0037s] [ 15%] 2025-12-04T14:02:33.4249379Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_int8 PASSED [0.8712s] [ 15%] 2025-12-04T14:02:33.4249550Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_float16 SKIPPED [0.0015s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4249720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_int8 SKIPPED [0.0013s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4249883Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4250046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4250236Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4250391Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4250565Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4250718Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4250928Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4251133Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_int32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4251255Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_bfloat16 PASSED [0.8895s] [ 15%] 2025-12-04T14:02:33.4251468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_complex128 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4251589Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_float16 PASSED [0.8761s] [ 15%] 2025-12-04T14:02:33.4251799Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4252001Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_int8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4252182Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4252364Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4252546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4252710Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4252882Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensordot_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4253059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensordot_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4253226Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4253383Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4253545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4253705Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4253860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4254018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4254183Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4254346Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4254509Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4254672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4254838Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4254999Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4255158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4255310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4255572Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_bfloat16 SKIPPED [0.0005s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 15%] 2025-12-04T14:02:33.4255822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float32 SKIPPED [0.0005s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 15%] 2025-12-04T14:02:33.4256030Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__flash_attention_forward_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4256231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__flash_attention_forward_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4256433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4256634Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4256842Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int16 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4257040Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4257193Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4257359Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4257524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4257700Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4257866Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_int16 SKIPPED [0.0008s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4258040Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4258208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4258336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_int64 PASSED [0.0064s] [ 15%] 2025-12-04T14:02:33.4258461Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_uint8 PASSED [0.0053s] [ 15%] 2025-12-04T14:02:33.4258628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4258798Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 
2025-12-04T14:02:33.4258976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4259151Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4259313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4259477Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4259655Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triangular_solve_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4259834Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triangular_solve_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4259951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_complex32 PASSED [0.0148s] [ 15%] 2025-12-04T14:02:33.4260073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_float16 PASSED [0.0077s] [ 15%] 2025-12-04T14:02:33.4260220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_float64 PASSED [0.0077s] [ 15%] 2025-12-04T14:02:33.4260337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_int16 PASSED [0.0077s] [ 15%] 2025-12-04T14:02:33.4260457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_uint8 PASSED [0.0076s] [ 15%] 2025-12-04T14:02:33.4260623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_indices_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4260769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_float16 PASSED [0.8898s] [ 15%] 2025-12-04T14:02:33.4260883Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_float32 PASSED [0.0104s] [ 15%] 2025-12-04T14:02:33.4261107Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_bool SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4261331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4261476Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_float32 PASSED [0.8706s] [ 15%] 2025-12-04T14:02:33.4261599Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_float64 PASSED [0.0080s] [ 15%] 2025-12-04T14:02:33.4261823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4262037Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4262258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4262474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T14:02:33.4262590Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_float16 PASSED [0.8727s] [ 15%] 2025-12-04T14:02:33.4262710Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_float64 PASSED [0.0044s] [ 15%] 2025-12-04T14:02:33.4262824Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_int16 PASSED [0.8726s] [ 15%] 
2025-12-04T14:02:33.4262943Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_int8 PASSED [0.0044s] [ 15%] 2025-12-04T14:02:33.4263120Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4263295Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4263457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4263625Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4263783Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4263956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4264130Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4264293Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4264458Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4264628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4264803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T14:02:33.4264990Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4265157Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4265319Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4265485Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4265664Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4265825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_bfloat16 SKIPPED [0.0008s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4265989Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4266147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4266313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4266486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4266665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4266838Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4267005Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4267162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4267342Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unravel_index_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4267512Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unravel_index_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4267677Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unravel_index_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4267851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4268018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4268187Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4268350Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4268524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4268695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4268861Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 
2025-12-04T14:02:33.4269048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4269228Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4269401Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4269564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_int16 SKIPPED [0.0008s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4269743Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4269917Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4270136Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4270307Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4270482Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4270648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4270823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4270953Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_bfloat16 PASSED [0.0061s] [ 
16%] 2025-12-04T14:02:33.4271074Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_float32 PASSED [0.0057s] [ 16%] 2025-12-04T14:02:33.4271198Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_float64 PASSED [0.0056s] [ 16%] 2025-12-04T14:02:33.4271315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_int16 PASSED [0.0056s] [ 16%] 2025-12-04T14:02:33.4271456Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_int32 PASSED [0.0057s] [ 16%] 2025-12-04T14:02:33.4271571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_uint8 PASSED [0.0056s] [ 16%] 2025-12-04T14:02:33.4271737Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4271896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4272072Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4272234Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4272400Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4272567Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4272742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4272925Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 
2025-12-04T14:02:33.4273085Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vdot_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4273267Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4273445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4273609Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4273780Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_real_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4273974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_real_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4274140Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4274308Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4274476Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4274636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4274804Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4274960Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 
2025-12-04T14:02:33.4275122Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4275280Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4275445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4275612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4275774Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4275938Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4276102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4276264Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4276422Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4276581Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4276736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4276858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_bfloat16 PASSED [0.0073s] [ 16%] 2025-12-04T14:02:33.4277065Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_bool 
SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 16%] 2025-12-04T14:02:33.4277277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 16%] 2025-12-04T14:02:33.4277414Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_bfloat16 PASSED [0.8830s] [ 16%] 2025-12-04T14:02:33.4277539Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_complex128 PASSED [0.0052s] [ 16%] 2025-12-04T14:02:33.4277701Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4277865Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4278048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4278216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4278388Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4278552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4278718Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4278878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4279044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_uint8 
SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T14:02:33.4279160Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_complex128 PASSED [0.0035s] [ 16%] 2025-12-04T14:02:33.4279278Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_float16 PASSED [0.8912s] [ 16%] 2025-12-04T14:02:33.4279387Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_int16 PASSED [0.0044s] [ 16%] 2025-12-04T14:02:33.4279500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_bool PASSED [0.8966s] [ 16%] 2025-12-04T14:02:33.4279617Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_complex32 PASSED [0.0045s] [ 16%] 2025-12-04T14:02:33.4279733Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_float16 PASSED [0.8985s] [ 16%] 2025-12-04T14:02:33.4279842Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_float64 PASSED [0.0045s] [ 16%] 2025-12-04T14:02:33.4279945Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_int64 PASSED [0.8997s] [ 16%] 2025-12-04T14:02:33.4280051Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_uint8 PASSED [0.0045s] [ 16%] 2025-12-04T14:02:33.4280212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_bfloat16 PASSED [0.0099s] [ 16%] 2025-12-04T14:02:33.4280339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_complex128 PASSED [0.0116s] [ 16%] 2025-12-04T14:02:33.4280459Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_float16 PASSED [0.0083s] [ 16%] 2025-12-04T14:02:33.4280578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_float32 PASSED [0.0083s] [ 16%] 2025-12-04T14:02:33.4280696Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_float64 PASSED [0.0081s] [ 16%] 2025-12-04T14:02:33.4280813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_int64 PASSED [0.0081s] [ 16%] 
2025-12-04T14:02:33.4280928Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_uint8 PASSED [0.0081s] [ 16%] 2025-12-04T14:02:33.4281046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_complex64 PASSED [0.0092s] [ 16%] 2025-12-04T14:02:33.4281158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_float64 PASSED [0.0087s] [ 16%] 2025-12-04T14:02:33.4281285Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_int32 PASSED [0.0088s] [ 16%] 2025-12-04T14:02:33.4281411Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_int32 PASSED [0.0086s] [ 16%] 2025-12-04T14:02:33.4281524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_int8 PASSED [0.0085s] [ 16%] 2025-12-04T14:02:33.4281637Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_uint8 PASSED [0.0085s] [ 16%] 2025-12-04T14:02:33.4281748Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_float16 PASSED [0.0127s] [ 16%] 2025-12-04T14:02:33.4281872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_float32 PASSED [0.0122s] [ 16%] 2025-12-04T14:02:33.4281981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_int16 PASSED [0.0122s] [ 16%] 2025-12-04T14:02:33.4282090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_int64 PASSED [0.0123s] [ 16%] 2025-12-04T14:02:33.4282214Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_complex128 PASSED [0.0620s] [ 16%] 2025-12-04T14:02:33.4282337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_complex64 PASSED [0.0218s] [ 16%] 2025-12-04T14:02:33.4282455Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_float64 PASSED [0.1009s] [ 16%] 2025-12-04T14:02:33.4282566Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_int64 PASSED [0.0092s] [ 16%] 2025-12-04T14:02:33.4282676Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_int8 PASSED [0.0087s] [ 16%] 2025-12-04T14:02:33.4282789Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_float32 PASSED [0.0088s] [ 16%] 2025-12-04T14:02:33.4282899Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_int16 PASSED [0.0087s] [ 16%] 2025-12-04T14:02:33.4283012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_int32 PASSED [0.0086s] [ 16%] 2025-12-04T14:02:33.4283123Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_int8 PASSED [0.0086s] [ 16%] 2025-12-04T14:02:33.4283235Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_uint8 PASSED [0.0086s] [ 16%] 2025-12-04T14:02:33.4283361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___ror___cuda_int16 PASSED [0.0086s] [ 16%] 2025-12-04T14:02:33.4283470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___ror___cuda_int8 PASSED [0.0087s] [ 16%] 2025-12-04T14:02:33.4283589Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_complex128 PASSED [0.0090s] [ 16%] 2025-12-04T14:02:33.4283707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_complex64 PASSED [0.0087s] [ 16%] 2025-12-04T14:02:33.4283820Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_int8 PASSED [0.0086s] [ 16%] 2025-12-04T14:02:33.4283932Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_uint8 PASSED [0.0088s] [ 16%] 2025-12-04T14:02:33.4284050Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_complex64 PASSED [0.0089s] [ 16%] 2025-12-04T14:02:33.4284162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_float32 PASSED [0.0087s] [ 16%] 2025-12-04T14:02:33.4284277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_float64 PASSED [0.0088s] [ 16%] 2025-12-04T14:02:33.4284386Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rxor___cuda_bool PASSED [0.0085s] [ 16%] 2025-12-04T14:02:33.4284496Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rxor___cuda_int16 PASSED [0.0086s] [ 16%] 2025-12-04T14:02:33.4284632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__batch_norm_with_update_cuda_bfloat16 PASSED [0.9061s] [ 16%] 2025-12-04T14:02:33.4284769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__batch_norm_with_update_cuda_float32 PASSED [0.2409s] [ 17%] 2025-12-04T14:02:33.4284900Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_complex32 PASSED [0.0240s] [ 17%] 2025-12-04T14:02:33.4285028Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_float32 PASSED [0.0113s] [ 17%] 2025-12-04T14:02:33.4285147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_float64 PASSED [0.0112s] [ 17%] 2025-12-04T14:02:33.4285262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_int32 PASSED [0.0111s] [ 17%] 2025-12-04T14:02:33.4285374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_uint8 PASSED [0.0112s] [ 17%] 2025-12-04T14:02:33.4285498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_bool PASSED [0.0080s] [ 17%] 2025-12-04T14:02:33.4285619Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_float32 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4285736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_int64 PASSED [0.0080s] [ 17%] 2025-12-04T14:02:33.4285853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_int8 PASSED [0.0080s] [ 17%] 2025-12-04T14:02:33.4285976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_bfloat16 PASSED [0.0084s] [ 17%] 2025-12-04T14:02:33.4286100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_float16 PASSED [0.0083s] [ 17%] 
2025-12-04T14:02:33.4286219Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_float16 PASSED [0.0502s] [ 17%] 2025-12-04T14:02:33.4286335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_int32 PASSED [0.0382s] [ 17%] 2025-12-04T14:02:33.4286454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_int64 PASSED [0.0382s] [ 17%] 2025-12-04T14:02:33.4286572Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_uint8 PASSED [0.0380s] [ 17%] 2025-12-04T14:02:33.4286700Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_bfloat16 PASSED [0.0979s] [ 17%] 2025-12-04T14:02:33.4286832Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_complex128 PASSED [0.1289s] [ 17%] 2025-12-04T14:02:33.4286957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_float32 PASSED [0.0973s] [ 17%] 2025-12-04T14:02:33.4287089Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_int16 XFAIL [0.0052s] [ 17%] 2025-12-04T14:02:33.4287210Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_int32 XFAIL [0.9260s] [ 17%] 2025-12-04T14:02:33.4287338Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_complex64 PASSED [1.0248s] [ 17%] 2025-12-04T14:02:33.4287464Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_float64 PASSED [0.0979s] [ 17%] 2025-12-04T14:02:33.4287585Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_int16 PASSED [0.0679s] [ 17%] 2025-12-04T14:02:33.4287708Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_uint8 PASSED [0.0676s] [ 17%] 2025-12-04T14:02:33.4287832Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_complex64 PASSED [0.0084s] [ 17%] 2025-12-04T14:02:33.4287950Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_int8 PASSED [0.0084s] [ 17%] 2025-12-04T14:02:33.4288069Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_uint8 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4288192Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_complex128 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4288315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_complex64 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4288435Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_int64 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4288568Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_bfloat16 PASSED [0.0084s] [ 17%] 2025-12-04T14:02:33.4288707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_float16 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4288824Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_int64 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4288956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_complex128 XFAIL [0.0046s] [ 17%] 2025-12-04T14:02:33.4289081Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_int32 PASSED [0.9841s] [ 17%] 2025-12-04T14:02:33.4289638Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_uint8 PASSED [0.0757s] [ 17%] 2025-12-04T14:02:33.4289762Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_complex64 PASSED [0.0093s] [ 17%] 2025-12-04T14:02:33.4289882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_float64 PASSED [0.0088s] [ 17%] 2025-12-04T14:02:33.4290002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_int64 PASSED [0.0085s] [ 17%] 2025-12-04T14:02:33.4290154Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_float32 
PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4290271Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_int32 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4290395Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_complex128 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4290519Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_complex64 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4290639Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_float16 PASSED [0.0084s] [ 17%] 2025-12-04T14:02:33.4290762Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_float32 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4290882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_int32 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4291000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_int64 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4291117Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_int8 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4291252Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_uint8 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4291375Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_float64 PASSED [0.0555s] [ 17%] 2025-12-04T14:02:33.4291490Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_int16 PASSED [0.0432s] [ 17%] 2025-12-04T14:02:33.4291608Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_int64 PASSED [0.0433s] [ 17%] 2025-12-04T14:02:33.4291724Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_uint8 PASSED [0.0433s] [ 17%] 2025-12-04T14:02:33.4291840Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_bool PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4291959Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_float64 PASSED [0.0080s] [ 17%] 2025-12-04T14:02:33.4292077Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_int32 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4292190Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_int8 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4292310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_float64 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4292428Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_int32 PASSED [0.0097s] [ 17%] 2025-12-04T14:02:33.4292544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_int16 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4292672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_uint8 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4292811Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_complex128 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4292937Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_complex64 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4293059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_float16 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4293184Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_float64 PASSED [0.0080s] [ 17%] 2025-12-04T14:02:33.4293305Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_int64 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4293439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_int8 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4293555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_bool XFAIL [0.0036s] [ 17%] 2025-12-04T14:02:33.4293679Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_float16 PASSED [0.9446s] [ 17%] 
2025-12-04T14:02:33.4293800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_float32 PASSED [0.0192s] [ 17%] 2025-12-04T14:02:33.4293920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_int64 XFAIL [0.0038s] [ 17%] 2025-12-04T14:02:33.4294034Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_int8 XFAIL [0.9044s] [ 17%] 2025-12-04T14:02:33.4294149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_bool XFAIL [0.8979s] [ 17%] 2025-12-04T14:02:33.4294268Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_float32 PASSED [0.9602s] [ 17%] 2025-12-04T14:02:33.4294386Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_int8 XFAIL [0.0057s] [ 17%] 2025-12-04T14:02:33.4294502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_uint8 XFAIL [0.9140s] [ 17%] 2025-12-04T14:02:33.4294627Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_bfloat16 PASSED [0.8987s] [ 17%] 2025-12-04T14:02:33.4294754Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_complex128 XFAIL [0.0045s] [ 17%] 2025-12-04T14:02:33.4294887Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_complex64 XFAIL [0.9002s] [ 17%] 2025-12-04T14:02:33.4295011Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_float16 PASSED [0.9123s] [ 17%] 2025-12-04T14:02:33.4295132Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_float32 PASSED [0.0087s] [ 17%] 2025-12-04T14:02:33.4295260Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_complex128 PASSED [0.0086s] [ 17%] 2025-12-04T14:02:33.4295380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_float16 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4295502Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_float64 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4295624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_float16 PASSED [0.0084s] [ 17%] 2025-12-04T14:02:33.4295743Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_float32 PASSED [0.0080s] [ 17%] 2025-12-04T14:02:33.4295864Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_float64 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4295985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_float64 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4296101Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_int8 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4296222Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_uint8 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4296355Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_bfloat16 PASSED [0.0084s] [ 17%] 2025-12-04T14:02:33.4296488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_complex128 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4296611Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_complex64 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4296732Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_float16 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4296852Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_float32 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4296970Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_bfloat16 PASSED [0.0117s] [ 17%] 2025-12-04T14:02:33.4297102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_complex128 XFAIL [0.0036s] [ 17%] 2025-12-04T14:02:33.4297223Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_float32 PASSED 
[0.9147s] [ 17%] 2025-12-04T14:02:33.4297340Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_int32 PASSED [0.9174s] [ 17%] 2025-12-04T14:02:33.4297468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_complex64 XFAIL [0.0064s] [ 17%] 2025-12-04T14:02:33.4297590Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_int16 PASSED [0.9776s] [ 17%] 2025-12-04T14:02:33.4297717Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_bfloat16 PASSED [0.1118s] [ 17%] 2025-12-04T14:02:33.4297838Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_int32 PASSED [0.0757s] [ 17%] 2025-12-04T14:02:33.4297959Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_int8 PASSED [0.0756s] [ 17%] 2025-12-04T14:02:33.4298079Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_uint8 PASSED [0.0754s] [ 17%] 2025-12-04T14:02:33.4298198Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_bool PASSED [0.0415s] [ 17%] 2025-12-04T14:02:33.4298317Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_complex64 PASSED [0.0691s] [ 17%] 2025-12-04T14:02:33.4298440Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_bfloat16 PASSED [0.0084s] [ 17%] 2025-12-04T14:02:33.4298566Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_bool XFAIL [0.0036s] [ 17%] 2025-12-04T14:02:33.4298690Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_complex64 PASSED [0.9282s] [ 17%] 2025-12-04T14:02:33.4298805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_int16 PASSED [0.0085s] [ 17%] 2025-12-04T14:02:33.4298922Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_bool XFAIL [0.0038s] [ 17%] 2025-12-04T14:02:33.4299041Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_float32 PASSED [0.9898s] [ 17%] 2025-12-04T14:02:33.4299163Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_float64 PASSED [0.0744s] [ 17%] 2025-12-04T14:02:33.4299280Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_int16 XFAIL [0.0037s] [ 17%] 2025-12-04T14:02:33.4299398Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_int32 XFAIL [0.9071s] [ 17%] 2025-12-04T14:02:33.4299518Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_bfloat16 PASSED [0.9507s] [ 17%] 2025-12-04T14:02:33.4299634Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_bool XFAIL [0.0048s] [ 17%] 2025-12-04T14:02:33.4299757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_complex128 PASSED [0.0605s] [ 17%] 2025-12-04T14:02:33.4299878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_float64 PASSED [0.0466s] [ 17%] 2025-12-04T14:02:33.4299996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_int32 PASSED [0.0334s] [ 17%] 2025-12-04T14:02:33.4300172Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_bool PASSED [0.0084s] [ 17%] 2025-12-04T14:02:33.4300322Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_complex128 PASSED [0.0083s] [ 17%] 2025-12-04T14:02:33.4300454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_float64 PASSED [0.0081s] [ 17%] 2025-12-04T14:02:33.4300581Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_int16 PASSED [0.0084s] [ 17%] 2025-12-04T14:02:33.4300704Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_int8 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4300839Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_bfloat16 PASSED [0.0082s] [ 17%] 2025-12-04T14:02:33.4300956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_bool XFAIL [0.0036s] [ 17%] 2025-12-04T14:02:33.4301082Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_complex128 XFAIL [0.9280s] [ 17%] 2025-12-04T14:02:33.4301202Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_int32 PASSED [0.9346s] [ 17%] 2025-12-04T14:02:33.4301323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_int8 PASSED [0.9129s] [ 17%] 2025-12-04T14:02:33.4301449Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_complex128 PASSED [0.0107s] [ 17%] 2025-12-04T14:02:33.4301571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_float32 PASSED [0.0085s] [ 17%] 2025-12-04T14:02:33.4301693Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_float64 PASSED [0.0082s] [ 18%] 2025-12-04T14:02:33.4301813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_int32 PASSED [0.0084s] [ 18%] 2025-12-04T14:02:33.4301932Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_int8 PASSED [0.0083s] [ 18%] 2025-12-04T14:02:33.4302056Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_float16 PASSED [0.0075s] [ 18%] 2025-12-04T14:02:33.4302181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_float64 PASSED [0.0074s] [ 18%] 2025-12-04T14:02:33.4302313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_int16 PASSED [0.0075s] [ 18%] 2025-12-04T14:02:33.4302436Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_int32 PASSED [0.0073s] [ 18%] 2025-12-04T14:02:33.4302556Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_int64 
PASSED [0.0074s] [ 18%] 2025-12-04T14:02:33.4302678Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_int8 PASSED [0.0074s] [ 18%] 2025-12-04T14:02:33.4302796Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_int16 PASSED [0.0083s] [ 18%] 2025-12-04T14:02:33.4302915Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_int64 PASSED [0.0082s] [ 18%] 2025-12-04T14:02:33.4303034Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_bfloat16 PASSED [0.0084s] [ 18%] 2025-12-04T14:02:33.4303149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_bool PASSED [0.0082s] [ 18%] 2025-12-04T14:02:33.4303269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_complex64 PASSED [0.0082s] [ 18%] 2025-12-04T14:02:33.4303389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_float64 PASSED [0.0081s] [ 18%] 2025-12-04T14:02:33.4303513Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_complex128 PASSED [0.0083s] [ 18%] 2025-12-04T14:02:33.4303633Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_float16 PASSED [0.0083s] [ 18%] 2025-12-04T14:02:33.4303752Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_int32 PASSED [0.0082s] [ 18%] 2025-12-04T14:02:33.4303904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_uint8 PASSED [0.0083s] [ 18%] 2025-12-04T14:02:33.4304036Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_bfloat16 PASSED [0.0083s] [ 18%] 2025-12-04T14:02:33.4304155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_int8 PASSED [0.0083s] [ 18%] 2025-12-04T14:02:33.4304274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_uint8 PASSED [0.0083s] [ 18%] 2025-12-04T14:02:33.4304394Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_complex128 XFAIL [0.0111s] [ 18%] 2025-12-04T14:02:33.4304521Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_float16 XFAIL [0.0111s] [ 18%] 2025-12-04T14:02:33.4304638Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_float64 XFAIL [0.9286s] [ 18%] 2025-12-04T14:02:33.4304754Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_int32 XFAIL [0.9041s] [ 18%] 2025-12-04T14:02:33.4304867Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_int8 XFAIL [0.9164s] [ 18%] 2025-12-04T14:02:33.4304982Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_bool PASSED [0.9173s] [ 18%] 2025-12-04T14:02:33.4305103Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_complex64 PASSED [0.0088s] [ 18%] 2025-12-04T14:02:33.4305221Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_int32 PASSED [0.0084s] [ 18%] 2025-12-04T14:02:33.4305336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_int8 PASSED [0.0083s] [ 18%] 2025-12-04T14:02:33.4305452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_uint8 PASSED [0.0082s] [ 18%] 2025-12-04T14:02:33.4305576Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_complex128 PASSED [0.0084s] [ 18%] 2025-12-04T14:02:33.4305696Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_float16 PASSED [0.0082s] [ 18%] 2025-12-04T14:02:33.4305814Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_int32 PASSED [0.0083s] [ 18%] 2025-12-04T14:02:33.4305935Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_float32 PASSED [0.0082s] [ 18%] 2025-12-04T14:02:33.4306062Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_int8 PASSED [0.0082s] [ 18%] 
2025-12-04T14:02:33.4306183Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_bfloat16 PASSED [0.0058s] [ 18%] 2025-12-04T14:02:33.4306307Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_complex64 PASSED [0.0059s] [ 18%] 2025-12-04T14:02:33.4306426Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_float16 PASSED [0.0057s] [ 18%] 2025-12-04T14:02:33.4306547Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_float64 PASSED [0.0058s] [ 18%] 2025-12-04T14:02:33.4306684Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__native_batch_norm_legit_cuda_float16 PASSED [0.0138s] [ 18%] 2025-12-04T14:02:33.4306820Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_lengths_cuda_bfloat16 PASSED [0.1033s] [ 18%] 2025-12-04T14:02:33.4306956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__softmax_backward_data_cuda_float16 PASSED [0.0246s] [ 18%] 2025-12-04T14:02:33.4307086Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_bfloat16 PASSED [0.0141s] [ 18%] 2025-12-04T14:02:33.4307239Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bfloat16 PASSED [0.0146s] [ 18%] 2025-12-04T14:02:33.4307388Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float64 PASSED [0.0143s] [ 18%] 2025-12-04T14:02:33.4307545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int8 PASSED [0.0142s] [ 18%] 2025-12-04T14:02:33.4307669Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_complex32 PASSED [0.9123s] [ 18%] 2025-12-04T14:02:33.4307782Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_complex64 PASSED [0.0064s] [ 18%] 2025-12-04T14:02:33.4307892Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_float16 PASSED [0.9189s] [ 18%] 
2025-12-04T14:02:33.4308001Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_float64 PASSED [0.0045s] [ 18%] 2025-12-04T14:02:33.4308108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_int32 PASSED [0.9001s] [ 18%] 2025-12-04T14:02:33.4308224Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_bool PASSED [0.0139s] [ 18%] 2025-12-04T14:02:33.4308338Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_complex128 PASSED [0.0041s] [ 18%] 2025-12-04T14:02:33.4308451Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_complex32 PASSED [0.9118s] [ 18%] 2025-12-04T14:02:33.4308562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_complex64 PASSED [0.0058s] [ 18%] 2025-12-04T14:02:33.4308672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_float32 PASSED [0.0040s] [ 18%] 2025-12-04T14:02:33.4308781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_int32 PASSED [0.9011s] [ 18%] 2025-12-04T14:02:33.4308889Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_bool PASSED [0.0061s] [ 18%] 2025-12-04T14:02:33.4309003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_complex32 PASSED [0.0041s] [ 18%] 2025-12-04T14:02:33.4309115Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_complex64 PASSED [0.9055s] [ 18%] 2025-12-04T14:02:33.4309224Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_int16 PASSED [0.0058s] [ 18%] 2025-12-04T14:02:33.4309331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_int8 PASSED [0.0041s] [ 18%] 2025-12-04T14:02:33.4309444Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_complex128 PASSED [0.0106s] [ 18%] 2025-12-04T14:02:33.4309553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_complex64 PASSED [0.0101s] [ 18%] 2025-12-04T14:02:33.4309673Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_float16 PASSED [0.9184s] [ 18%] 2025-12-04T14:02:33.4309780Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_float64 PASSED [0.0122s] [ 18%] 2025-12-04T14:02:33.4309894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addbmm_cuda_complex64 PASSED [0.0125s] [ 18%] 2025-12-04T14:02:33.4310006Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcdiv_cuda_float64 PASSED [0.0129s] [ 18%] 2025-12-04T14:02:33.4310160Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_bfloat16 PASSED [0.0125s] [ 18%] 2025-12-04T14:02:33.4310273Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_float64 PASSED [0.0124s] [ 18%] 2025-12-04T14:02:33.4310385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_cuda_bfloat16 PASSED [0.0096s] [ 18%] 2025-12-04T14:02:33.4310498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_cuda_complex128 PASSED [0.0096s] [ 18%] 2025-12-04T14:02:33.4310626Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_decomposed_cuda_float32 PASSED [0.0092s] [ 18%] 2025-12-04T14:02:33.4310739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_complex128 PASSED [0.0070s] [ 18%] 2025-12-04T14:02:33.4310847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_float16 PASSED [0.0090s] [ 18%] 2025-12-04T14:02:33.4310958Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_float64 PASSED [0.0088s] [ 18%] 2025-12-04T14:02:33.4311070Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_bool PASSED [0.9221s] [ 18%] 2025-12-04T14:02:33.4311204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_float64 PASSED [0.0051s] [ 18%] 2025-12-04T14:02:33.4311330Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_int8 PASSED [0.0035s] [ 18%] 2025-12-04T14:02:33.4311436Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_bool PASSED [0.0184s] [ 18%] 2025-12-04T14:02:33.4311544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_float64 PASSED [0.0187s] [ 18%] 2025-12-04T14:02:33.4311651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_int16 PASSED [0.0186s] [ 18%] 2025-12-04T14:02:33.4311757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_uint8 PASSED [0.0185s] [ 18%] 2025-12-04T14:02:33.4311890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_complex128 PASSED [0.0307s] [ 18%] 2025-12-04T14:02:33.4312003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_float16 PASSED [0.9502s] [ 18%] 2025-12-04T14:02:33.4312111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_bool PASSED [0.0138s] [ 18%] 2025-12-04T14:02:33.4312221Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_float16 PASSED [0.0133s] [ 18%] 2025-12-04T14:02:33.4312331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amin_cuda_float32 PASSED [0.9244s] [ 18%] 2025-12-04T14:02:33.4312441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amin_cuda_int64 PASSED [0.0143s] [ 18%] 2025-12-04T14:02:33.4312553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_bfloat16 PASSED [0.0106s] [ 18%] 2025-12-04T14:02:33.4312666Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_float16 PASSED [0.0053s] [ 18%] 2025-12-04T14:02:33.4312778Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_float64 PASSED [0.0047s] [ 18%] 2025-12-04T14:02:33.4312890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_complex32 PASSED [1.0370s] [ 18%] 2025-12-04T14:02:33.4313000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_float64 PASSED [0.0045s] [ 18%] 2025-12-04T14:02:33.4313110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_int64 PASSED 
[0.9083s] [ 18%] 2025-12-04T14:02:33.4313220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_complex128 PASSED [0.0174s] [ 18%] 2025-12-04T14:02:33.4313339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_uint8 PASSED [0.0164s] [ 18%] 2025-12-04T14:02:33.4313451Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_bfloat16 PASSED [0.0137s] [ 18%] 2025-12-04T14:02:33.4313561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_float32 PASSED [0.0132s] [ 18%] 2025-12-04T14:02:33.4313672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_int16 PASSED [0.0105s] [ 18%] 2025-12-04T14:02:33.4313781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_int8 PASSED [0.0102s] [ 18%] 2025-12-04T14:02:33.4313893Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_float64 PASSED [0.0131s] [ 18%] 2025-12-04T14:02:33.4314003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_int16 PASSED [0.0078s] [ 18%] 2025-12-04T14:02:33.4314110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_int64 PASSED [0.9194s] [ 18%] 2025-12-04T14:02:33.4314220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_uint8 PASSED [0.0100s] [ 18%] 2025-12-04T14:02:33.4314332Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_float16 PASSED [0.0140s] [ 18%] 2025-12-04T14:02:33.4314443Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_float32 PASSED [0.0080s] [ 18%] 2025-12-04T14:02:33.4314554Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_float64 PASSED [0.9196s] [ 18%] 2025-12-04T14:02:33.4314662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_int64 PASSED [0.0101s] [ 18%] 2025-12-04T14:02:33.4314780Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_int8 PASSED [0.0081s] [ 18%] 2025-12-04T14:02:33.4314899Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_bool PASSED [0.1625s] [ 18%] 2025-12-04T14:02:33.4315010Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_int64 PASSED [0.0288s] [ 18%] 2025-12-04T14:02:33.4315119Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_int8 PASSED [0.0253s] [ 18%] 2025-12-04T14:02:33.4315230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_uint8 PASSED [0.0253s] [ 18%] 2025-12-04T14:02:33.4315340Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_bool PASSED [0.9955s] [ 18%] 2025-12-04T14:02:33.4315470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_complex128 PASSED [0.0067s] [ 18%] 2025-12-04T14:02:33.4315584Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_float16 PASSED [0.0045s] [ 18%] 2025-12-04T14:02:33.4315698Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_float32 PASSED [0.9063s] [ 18%] 2025-12-04T14:02:33.4315812Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_float64 PASSED [0.0061s] [ 18%] 2025-12-04T14:02:33.4315922Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_int64 PASSED [0.0045s] [ 18%] 2025-12-04T14:02:33.4316049Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_complex64 PASSED [0.0053s] [ 18%] 2025-12-04T14:02:33.4316173Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_float64 PASSED [0.0048s] [ 18%] 2025-12-04T14:02:33.4316295Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_int16 PASSED [0.0047s] [ 18%] 2025-12-04T14:02:33.4316415Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_int8 PASSED [0.0047s] [ 18%] 2025-12-04T14:02:33.4316536Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_uint8 PASSED [0.0047s] [ 18%] 2025-12-04T14:02:33.4316654Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_float32 PASSED [0.9129s] [ 18%] 2025-12-04T14:02:33.4316767Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_int16 PASSED [0.0058s] [ 18%] 2025-12-04T14:02:33.4316920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_bfloat16 PASSED [0.0037s] [ 18%] 2025-12-04T14:02:33.4317062Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_complex128 PASSED [0.9085s] [ 18%] 2025-12-04T14:02:33.4317197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_float16 PASSED [0.0054s] [ 18%] 2025-12-04T14:02:33.4317333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_float64 PASSED [0.0036s] [ 18%] 2025-12-04T14:02:33.4317465Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_int16 PASSED [0.9054s] [ 18%] 2025-12-04T14:02:33.4317598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_int64 PASSED [0.0053s] [ 19%] 2025-12-04T14:02:33.4317725Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_float32 PASSED [0.0074s] [ 19%] 2025-12-04T14:02:33.4317850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_int16 PASSED [0.0066s] [ 19%] 2025-12-04T14:02:33.4317973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_int8 PASSED [0.0065s] [ 19%] 2025-12-04T14:02:33.4318085Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_bfloat16 PASSED [0.0029s] [ 19%] 2025-12-04T14:02:33.4318199Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_complex128 PASSED [0.9179s] [ 19%] 2025-12-04T14:02:33.4318311Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_complex64 PASSED [0.0046s] [ 19%] 2025-12-04T14:02:33.4318432Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_float32 PASSED [0.9026s] [ 19%] 2025-12-04T14:02:33.4318551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_float64 PASSED [0.0046s] [ 19%] 2025-12-04T14:02:33.4318659Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_int64 PASSED [0.9141s] [ 19%] 2025-12-04T14:02:33.4318767Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_int8 PASSED [0.0046s] [ 19%] 2025-12-04T14:02:33.4318876Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_int8 PASSED [0.9187s] [ 19%] 2025-12-04T14:02:33.4318983Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_uint8 PASSED [0.0046s] [ 19%] 2025-12-04T14:02:33.4319100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_bool PASSED [0.0095s] [ 19%] 2025-12-04T14:02:33.4319210Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_float16 PASSED [0.0087s] [ 19%] 2025-12-04T14:02:33.4319322Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_bfloat16 PASSED [0.0028s] [ 19%] 2025-12-04T14:02:33.4319434Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_complex128 PASSED [1.1051s] [ 19%] 2025-12-04T14:02:33.4319546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_complex32 PASSED [0.1740s] [ 19%] 2025-12-04T14:02:33.4319658Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_complex64 PASSED [0.9187s] [ 19%] 2025-12-04T14:02:33.4319768Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_float16 PASSED [0.0046s] [ 19%] 2025-12-04T14:02:33.4319876Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_float64 PASSED [0.9072s] [ 19%] 2025-12-04T14:02:33.4319985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_int32 PASSED [0.0046s] [ 19%] 2025-12-04T14:02:33.4320133Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_int64 PASSED [0.9197s] [ 19%] 
2025-12-04T14:02:33.4320246Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_complex32 PASSED [0.1722s] [ 19%] 2025-12-04T14:02:33.4320360Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_float16 PASSED [0.9227s] [ 19%] 2025-12-04T14:02:33.4320468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_int16 PASSED [0.0046s] [ 19%] 2025-12-04T14:02:33.4320577Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_int32 PASSED [0.9052s] [ 19%] 2025-12-04T14:02:33.4320709Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_int8 PASSED [0.0046s] [ 19%] 2025-12-04T14:02:33.4320834Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_complex32 PASSED [0.8909s] [ 19%] 2025-12-04T14:02:33.4320957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_complex64 PASSED [0.0051s] [ 19%] 2025-12-04T14:02:33.4321079Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_float32 PASSED [0.9024s] [ 19%] 2025-12-04T14:02:33.4321193Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_uint8 PASSED [0.0050s] [ 19%] 2025-12-04T14:02:33.4321315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_bfloat16 PASSED [0.9032s] [ 19%] 2025-12-04T14:02:33.4321427Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_bool PASSED [0.0055s] [ 19%] 2025-12-04T14:02:33.4321547Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_float16 PASSED [0.0040s] [ 19%] 2025-12-04T14:02:33.4321666Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_float32 PASSED [0.9036s] [ 19%] 2025-12-04T14:02:33.4321778Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_int32 PASSED [0.0055s] [ 19%] 2025-12-04T14:02:33.4321893Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_int64 PASSED [0.0039s] [ 19%] 2025-12-04T14:02:33.4322005Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_uint8 PASSED [0.9073s] [ 19%] 2025-12-04T14:02:33.4322137Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_bfloat16 PASSED [0.0062s] [ 19%] 2025-12-04T14:02:33.4322270Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_complex32 PASSED [0.0046s] [ 19%] 2025-12-04T14:02:33.4322387Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_float16 PASSED [0.0043s] [ 19%] 2025-12-04T14:02:33.4322500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_bfloat16 PASSED [0.0074s] [ 19%] 2025-12-04T14:02:33.4322614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_float64 PASSED [0.0067s] [ 19%] 2025-12-04T14:02:33.4322730Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bernoulli_cuda_bfloat16 PASSED [0.0054s] [ 19%] 2025-12-04T14:02:33.4322857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_bfloat16 PASSED [0.9102s] [ 19%] 2025-12-04T14:02:33.4322971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_int32 PASSED [0.0042s] [ 19%] 2025-12-04T14:02:33.4323085Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_int64 PASSED [0.9320s] [ 19%] 2025-12-04T14:02:33.4323197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bincount_cuda_int8 PASSED [0.0167s] [ 19%] 2025-12-04T14:02:33.4323313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_int16 PASSED [0.0091s] [ 19%] 2025-12-04T14:02:33.4323431Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_int32 PASSED [0.0085s] [ 19%] 2025-12-04T14:02:33.4323544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_int8 PASSED [0.0084s] [ 19%] 2025-12-04T14:02:33.4323660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_uint8 PASSED [0.0084s] [ 19%] 2025-12-04T14:02:33.4323787Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_left_shift_cuda_int32 PASSED [0.0085s] [ 19%] 2025-12-04T14:02:33.4323913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_left_shift_cuda_uint8 PASSED [0.0084s] [ 19%] 2025-12-04T14:02:33.4324029Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_not_cuda_int32 PASSED [0.0038s] [ 19%] 2025-12-04T14:02:33.4324145Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_bool PASSED [0.0085s] [ 19%] 2025-12-04T14:02:33.4324258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_int32 PASSED [0.0084s] [ 19%] 2025-12-04T14:02:33.4324380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_int64 PASSED [0.0085s] [ 19%] 2025-12-04T14:02:33.4324492Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_int8 PASSED [0.0084s] [ 19%] 2025-12-04T14:02:33.4324621Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_right_shift_cuda_int8 PASSED [0.0085s] [ 19%] 2025-12-04T14:02:33.4324733Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_xor_cuda_bool PASSED [0.0083s] [ 19%] 2025-12-04T14:02:33.4324848Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_xor_cuda_int8 PASSED [0.0083s] [ 19%] 2025-12-04T14:02:33.4324971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_complex128 PASSED [0.0069s] [ 19%] 2025-12-04T14:02:33.4325090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_float16 PASSED [0.0066s] [ 19%] 2025-12-04T14:02:33.4325208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_float32 PASSED [0.0067s] [ 19%] 2025-12-04T14:02:33.4325323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_float64 PASSED [0.0067s] [ 19%] 2025-12-04T14:02:33.4325437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_int16 PASSED [0.0067s] [ 19%] 2025-12-04T14:02:33.4325549Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_int32 PASSED [0.0067s] [ 19%] 2025-12-04T14:02:33.4325660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bmm_cuda_float32 PASSED [0.0030s] [ 19%] 2025-12-04T14:02:33.4325776Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_bool PASSED [0.9367s] [ 19%] 2025-12-04T14:02:33.4325897Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_float64 PASSED [0.0040s] [ 19%] 2025-12-04T14:02:33.4326005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_int64 PASSED [0.9121s] [ 19%] 2025-12-04T14:02:33.4326114Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_int8 PASSED [0.0042s] [ 19%] 2025-12-04T14:02:33.4326222Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_uint8 PASSED [0.9193s] [ 19%] 2025-12-04T14:02:33.4326353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_bfloat16 PASSED [0.0046s] [ 19%] 2025-12-04T14:02:33.4326488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_int32 PASSED [0.0033s] [ 19%] 2025-12-04T14:02:33.4326614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_int8 PASSED [0.9055s] [ 19%] 2025-12-04T14:02:33.4326739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_complex128 PASSED [0.9159s] [ 19%] 2025-12-04T14:02:33.4326864Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_complex64 PASSED [0.9084s] [ 19%] 2025-12-04T14:02:33.4326985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_float16 PASSED [0.8993s] [ 19%] 2025-12-04T14:02:33.4327108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_float32 PASSED [0.9105s] [ 19%] 2025-12-04T14:02:33.4327227Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_int32 PASSED [0.9156s] [ 19%] 2025-12-04T14:02:33.4327343Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_int64 PASSED [0.9054s] [ 19%] 2025-12-04T14:02:33.4327464Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_bfloat16 PASSED [0.0225s] [ 19%] 2025-12-04T14:02:33.4327579Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_float32 PASSED [0.0161s] [ 19%] 2025-12-04T14:02:33.4327697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_float64 PASSED [0.0160s] [ 19%] 2025-12-04T14:02:33.4327805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_bool PASSED [0.0030s] [ 19%] 2025-12-04T14:02:33.4327918Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_complex64 PASSED [0.9229s] [ 19%] 2025-12-04T14:02:33.4328054Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_bfloat16 PASSED [0.0101s] [ 19%] 2025-12-04T14:02:33.4328183Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_complex128 PASSED [0.0080s] [ 19%] 2025-12-04T14:02:33.4328304Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_int16 PASSED [0.0077s] [ 19%] 2025-12-04T14:02:33.4328425Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_int8 PASSED [0.0076s] [ 19%] 2025-12-04T14:02:33.4328534Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_float32 PASSED [0.0084s] [ 19%] 2025-12-04T14:02:33.4328643Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_int32 PASSED [0.0081s] [ 19%] 2025-12-04T14:02:33.4328752Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_uint8 PASSED [0.9239s] [ 19%] 2025-12-04T14:02:33.4328867Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cauchy_cuda_float16 PASSED [0.0068s] [ 19%] 2025-12-04T14:02:33.4328981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_bfloat16 PASSED [0.9193s] [ 19%] 2025-12-04T14:02:33.4329093Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_float32 PASSED [0.0042s] [ 19%] 2025-12-04T14:02:33.4329207Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_bfloat16 PASSED [0.9030s] [ 19%] 2025-12-04T14:02:33.4329318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_float64 PASSED [0.0045s] [ 19%] 2025-12-04T14:02:33.4329438Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_int16 PASSED [0.9018s] [ 19%] 2025-12-04T14:02:33.4329545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_int64 PASSED [0.0045s] [ 19%] 2025-12-04T14:02:33.4329663Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_uint8 PASSED [0.8913s] [ 19%] 2025-12-04T14:02:33.4329771Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_bool PASSED [0.0042s] [ 19%] 2025-12-04T14:02:33.4329889Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_complex128 PASSED [0.9076s] [ 19%] 2025-12-04T14:02:33.4330000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_float16 PASSED [0.0042s] [ 19%] 2025-12-04T14:02:33.4330174Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_int16 PASSED [0.9206s] [ 19%] 2025-12-04T14:02:33.4330281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_int32 PASSED [0.0041s] [ 19%] 2025-12-04T14:02:33.4330391Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_float32 PASSED [0.9064s] [ 19%] 2025-12-04T14:02:33.4330499Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_int32 PASSED [0.0042s] [ 19%] 2025-12-04T14:02:33.4330618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_cuda_complex64 PASSED [0.0267s] [ 19%] 2025-12-04T14:02:33.4330749Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_inverse_cuda_complex64 PASSED [0.1058s] [ 19%] 2025-12-04T14:02:33.4330876Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_solve_cuda_complex64 PASSED [0.0124s] [ 19%] 2025-12-04T14:02:33.4330990Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_complex64 PASSED [0.9139s] [ 19%] 2025-12-04T14:02:33.4331102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_float32 PASSED [0.0050s] [ 19%] 2025-12-04T14:02:33.4331213Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_float64 PASSED [0.8970s] [ 19%] 2025-12-04T14:02:33.4331323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_int16 PASSED [0.0050s] [ 19%] 2025-12-04T14:02:33.4331433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_uint8 PASSED [0.9230s] [ 19%] 2025-12-04T14:02:33.4331544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_float32 PASSED [0.0153s] [ 19%] 2025-12-04T14:02:33.4331664Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_int64 PASSED [0.9391s] [ 19%] 2025-12-04T14:02:33.4331772Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_int8 PASSED [0.0140s] [ 19%] 2025-12-04T14:02:33.4331891Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_bfloat16 PASSED [0.0123s] [ 19%] 2025-12-04T14:02:33.4332008Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_float32 PASSED [0.0117s] [ 19%] 2025-12-04T14:02:33.4332123Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_int16 PASSED [0.0116s] [ 19%] 2025-12-04T14:02:33.4332237Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_int32 PASSED [0.0116s] [ 19%] 2025-12-04T14:02:33.4332350Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_int64 PASSED [0.0116s] [ 19%] 2025-12-04T14:02:33.4332466Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_bfloat16 PASSED [0.0117s] [ 19%] 2025-12-04T14:02:33.4332580Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_int16 PASSED [0.0116s] [ 19%] 2025-12-04T14:02:33.4332689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_bool PASSED [0.0034s] [ 19%] 2025-12-04T14:02:33.4332800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_float32 PASSED [0.9264s] [ 19%] 2025-12-04T14:02:33.4332910Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_int32 PASSED [0.0052s] [ 19%] 2025-12-04T14:02:33.4333032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_float64 PASSED [0.0058s] [ 19%] 2025-12-04T14:02:33.4333166Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_int16 PASSED [0.0052s] [ 19%] 2025-12-04T14:02:33.4333297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_int32 PASSED [0.0051s] [ 20%] 2025-12-04T14:02:33.4333414Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_int8 PASSED [0.0050s] [ 20%] 2025-12-04T14:02:33.4333530Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_bool PASSED [0.0798s] [ 20%] 2025-12-04T14:02:33.4333656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_complex128 PASSED [0.0788s] [ 20%] 2025-12-04T14:02:33.4333773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_int16 PASSED [0.0791s] [ 20%] 2025-12-04T14:02:33.4333896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_complex_cuda_float16 PASSED [0.0129s] [ 20%] 2025-12-04T14:02:33.4334009Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_complex_cuda_float32 PASSED [0.0082s] [ 20%] 2025-12-04T14:02:33.4334123Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_complex_cuda_float64 PASSED [0.0080s] [ 20%] 2025-12-04T14:02:33.4334233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_bfloat16 PASSED [0.0024s] [ 20%] 2025-12-04T14:02:33.4334347Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_complex64 PASSED [0.9264s] [ 20%] 2025-12-04T14:02:33.4334455Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_int32 PASSED [0.0036s] [ 20%] 2025-12-04T14:02:33.4334564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_int64 PASSED [0.8950s] [ 20%] 2025-12-04T14:02:33.4334694Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_complex128 PASSED [0.0047s] [ 20%] 2025-12-04T14:02:33.4334817Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_float16 PASSED [0.9071s] [ 20%] 2025-12-04T14:02:33.4334937Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_int16 PASSED [0.0040s] [ 20%] 2025-12-04T14:02:33.4335056Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_int64 PASSED [0.9142s] [ 20%] 2025-12-04T14:02:33.4335175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_uint8 PASSED [0.0039s] [ 20%] 2025-12-04T14:02:33.4335310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_bfloat16 PASSED [0.0286s] [ 20%] 2025-12-04T14:02:33.4335439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_complex128 PASSED [0.0278s] [ 20%] 2025-12-04T14:02:33.4335565Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_complex64 PASSED [0.0276s] [ 20%] 2025-12-04T14:02:33.4335690Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_float32 PASSED [0.0271s] [ 20%] 2025-12-04T14:02:33.4335810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_int32 PASSED [0.9401s] [ 20%] 2025-12-04T14:02:33.4335927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_int64 PASSED [0.0034s] [ 20%] 2025-12-04T14:02:33.4336040Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_int16 PASSED [0.0129s] [ 20%] 
2025-12-04T14:02:33.4336153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_int8 PASSED [0.0116s] [ 20%] 2025-12-04T14:02:33.4336264Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_int64 PASSED [0.8522s] [ 20%] 2025-12-04T14:02:33.4336372Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_bool PASSED [0.9162s] [ 20%] 2025-12-04T14:02:33.4336484Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_complex128 PASSED [0.1993s] [ 20%] 2025-12-04T14:02:33.4336595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_complex32 PASSED [0.2131s] [ 20%] 2025-12-04T14:02:33.4336706Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_complex64 PASSED [0.9210s] [ 20%] 2025-12-04T14:02:33.4336823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_float16 PASSED [0.0059s] [ 20%] 2025-12-04T14:02:33.4336941Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_int16 PASSED [0.0041s] [ 20%] 2025-12-04T14:02:33.4337046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_int32 PASSED [0.9228s] [ 20%] 2025-12-04T14:02:33.4337154Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_uint8 PASSED [0.0059s] [ 20%] 2025-12-04T14:02:33.4337267Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_complex32 PASSED [0.2107s] [ 20%] 2025-12-04T14:02:33.4337375Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_int8 PASSED [0.9179s] [ 20%] 2025-12-04T14:02:33.4337510Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_bfloat16 PASSED [0.0187s] [ 20%] 2025-12-04T14:02:33.4337630Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_bool PASSED [0.9167s] [ 20%] 2025-12-04T14:02:33.4337757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_complex128 PASSED [0.0182s] [ 20%] 2025-12-04T14:02:33.4337880Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_float16 PASSED [0.9284s] [ 20%] 2025-12-04T14:02:33.4337998Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_int16 PASSED [0.0185s] [ 20%] 2025-12-04T14:02:33.4338118Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_int8 PASSED [0.9138s] [ 20%] 2025-12-04T14:02:33.4338229Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_float16 PASSED [0.6238s] [ 20%] 2025-12-04T14:02:33.4338336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_float32 PASSED [0.3124s] [ 20%] 2025-12-04T14:02:33.4338446Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_float64 PASSED [0.3692s] [ 20%] 2025-12-04T14:02:33.4338552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_uint8 PASSED [0.3167s] [ 20%] 2025-12-04T14:02:33.4338665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_bfloat16 PASSED [0.9304s] [ 20%] 2025-12-04T14:02:33.4338776Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_float16 PASSED [0.0060s] [ 20%] 2025-12-04T14:02:33.4338886Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_float32 PASSED [0.0043s] [ 20%] 2025-12-04T14:02:33.4339002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_int16 PASSED [0.0041s] [ 20%] 2025-12-04T14:02:33.4339110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_int64 PASSED [0.9048s] [ 20%] 2025-12-04T14:02:33.4339218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_int32 PASSED [0.0105s] [ 20%] 2025-12-04T14:02:33.4339334Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_bfloat16 PASSED [0.9263s] [ 20%] 2025-12-04T14:02:33.4339442Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_bool PASSED [0.0051s] [ 20%] 2025-12-04T14:02:33.4339556Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_float32 PASSED [0.9154s] [ 20%] 2025-12-04T14:02:33.4339668Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_float64 PASSED [0.0048s] [ 20%] 2025-12-04T14:02:33.4339780Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_int16 PASSED [0.8979s] [ 20%] 2025-12-04T14:02:33.4339889Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_int64 PASSED [0.0050s] [ 20%] 2025-12-04T14:02:33.4339999Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_int8 PASSED [0.9065s] [ 20%] 2025-12-04T14:02:33.4340168Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_complex128 PASSED [0.0261s] [ 20%] 2025-12-04T14:02:33.4340286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_complex128 PASSED [0.0058s] [ 20%] 2025-12-04T14:02:33.4340399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_float32 PASSED [0.0054s] [ 20%] 2025-12-04T14:02:33.4340531Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_float64 PASSED [0.0053s] [ 20%] 2025-12-04T14:02:33.4340654Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_uint8 PASSED [0.0055s] [ 20%] 2025-12-04T14:02:33.4340787Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_float32 PASSED [0.0319s] [ 20%] 2025-12-04T14:02:33.4340917Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_int8 PASSED [0.9591s] [ 20%] 2025-12-04T14:02:33.4341027Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_bool PASSED [0.0050s] [ 20%] 2025-12-04T14:02:33.4341142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_float16 PASSED [0.9296s] [ 20%] 2025-12-04T14:02:33.4341269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_float64 PASSED [0.0048s] [ 20%] 2025-12-04T14:02:33.4341380Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_int8 PASSED [0.9282s] [ 20%] 2025-12-04T14:02:33.4341494Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_complex128 PASSED [0.0132s] [ 20%] 2025-12-04T14:02:33.4341607Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_complex64 PASSED [0.0104s] [ 20%] 2025-12-04T14:02:33.4341716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_float32 PASSED [0.0101s] [ 20%] 2025-12-04T14:02:33.4341826Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_float64 PASSED [0.9187s] [ 20%] 2025-12-04T14:02:33.4341936Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_int16 PASSED [0.0124s] [ 20%] 2025-12-04T14:02:33.4342047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_bool PASSED [0.0187s] [ 20%] 2025-12-04T14:02:33.4342162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_int32 PASSED [0.9414s] [ 20%] 2025-12-04T14:02:33.4342276Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_float32 PASSED [0.0112s] [ 20%] 2025-12-04T14:02:33.4342389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_int16 PASSED [0.0086s] [ 20%] 2025-12-04T14:02:33.4342499Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_int32 PASSED [0.0084s] [ 20%] 2025-12-04T14:02:33.4342624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_bfloat16 PASSED [0.9197s] [ 20%] 2025-12-04T14:02:33.4342762Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_complex32 PASSED [0.0117s] [ 20%] 2025-12-04T14:02:33.4342886Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_float32 PASSED [0.0096s] [ 20%] 2025-12-04T14:02:33.4343005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_complex64 PASSED [0.0064s] [ 20%] 2025-12-04T14:02:33.4343119Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_float16 PASSED [0.0061s] [ 20%] 2025-12-04T14:02:33.4343233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_float64 PASSED [0.0061s] [ 20%] 2025-12-04T14:02:33.4343346Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_int16 PASSED [0.0061s] [ 20%] 2025-12-04T14:02:33.4343458Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_int32 PASSED [0.0061s] [ 20%] 2025-12-04T14:02:33.4343586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_float32 PASSED [0.0115s] [ 20%] 2025-12-04T14:02:33.4343715Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_float64 PASSED [0.0112s] [ 20%] 2025-12-04T14:02:33.4343837Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_int8 PASSED [0.9314s] [ 20%] 2025-12-04T14:02:33.4343948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_float64 PASSED [0.2112s] [ 20%] 2025-12-04T14:02:33.4344055Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_int32 PASSED [0.2093s] [ 20%] 2025-12-04T14:02:33.4344181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_bfloat16 PASSED [0.9092s] [ 20%] 2025-12-04T14:02:33.4344307Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_bool PASSED [0.0076s] [ 20%] 2025-12-04T14:02:33.4344418Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_int32 PASSED [0.0041s] [ 20%] 2025-12-04T14:02:33.4344529Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_int64 PASSED [0.9161s] [ 20%] 2025-12-04T14:02:33.4344638Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_float16 PASSED [0.0576s] [ 20%] 2025-12-04T14:02:33.4344747Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_float32 PASSED [0.0541s] [ 20%] 2025-12-04T14:02:33.4344883Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_bool PASSED [0.0093s] [ 20%] 2025-12-04T14:02:33.4345017Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_complex128 PASSED [0.0092s] [ 20%] 2025-12-04T14:02:33.4345148Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_float16 PASSED [0.0092s] [ 20%] 2025-12-04T14:02:33.4345275Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_int8 PASSED [0.0092s] [ 20%] 2025-12-04T14:02:33.4345405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_bfloat16 PASSED [0.0099s] [ 20%] 2025-12-04T14:02:33.4345533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_float16 PASSED [0.0097s] [ 20%] 2025-12-04T14:02:33.4345657Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_int32 PASSED [0.0091s] [ 20%] 2025-12-04T14:02:33.4345773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dot_cuda_complex64 PASSED [0.0042s] [ 20%] 2025-12-04T14:02:33.4345883Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dot_cuda_float64 PASSED [0.9295s] [ 20%] 2025-12-04T14:02:33.4345999Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_complex64 PASSED [0.0043s] [ 20%] 2025-12-04T14:02:33.4346111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_float16 PASSED [0.9299s] [ 20%] 2025-12-04T14:02:33.4346224Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_float64 PASSED [0.0040s] [ 20%] 2025-12-04T14:02:33.4346342Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_int8 PASSED [0.9059s] [ 20%] 2025-12-04T14:02:33.4346452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_bool PASSED [0.0050s] [ 20%] 2025-12-04T14:02:33.4346566Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_complex64 PASSED [0.0037s] [ 20%] 
2025-12-04T14:02:33.4346679Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_float32 PASSED [0.9301s] [ 20%] 2025-12-04T14:02:33.4346788Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_int16 PASSED [0.0050s] [ 20%] 2025-12-04T14:02:33.4346898Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_int32 PASSED [0.0036s] [ 20%] 2025-12-04T14:02:33.4347006Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_int8 PASSED [0.9189s] [ 20%] 2025-12-04T14:02:33.4347117Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_uint8 PASSED [0.0050s] [ 20%] 2025-12-04T14:02:33.4347230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_bfloat16 PASSED [0.0074s] [ 20%] 2025-12-04T14:02:33.4347345Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_complex32 PASSED [0.0067s] [ 20%] 2025-12-04T14:02:33.4347454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_int32 PASSED [0.0065s] [ 20%] 2025-12-04T14:02:33.4347563Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_uint8 PASSED [0.9228s] [ 20%] 2025-12-04T14:02:33.4347676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_bfloat16 PASSED [0.2960s] [ 20%] 2025-12-04T14:02:33.4347800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_float32 PASSED [0.3254s] [ 20%] 2025-12-04T14:02:33.4347911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_float64 PASSED [0.0811s] [ 20%] 2025-12-04T14:02:33.4348036Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_complex64 PASSED [0.0042s] [ 20%] 2025-12-04T14:02:33.4348152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_float16 PASSED [0.0037s] [ 20%] 2025-12-04T14:02:33.4348263Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_float64 PASSED [0.0038s] [ 20%] 2025-12-04T14:02:33.4348374Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_int16 PASSED [0.0038s] [ 20%] 2025-12-04T14:02:33.4348496Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_bool PASSED [0.0060s] [ 20%] 2025-12-04T14:02:33.4348618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_complex64 PASSED [0.0061s] [ 20%] 2025-12-04T14:02:33.4348732Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_int16 PASSED [0.0060s] [ 20%] 2025-12-04T14:02:33.4348847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_int8 PASSED [0.0060s] [ 20%] 2025-12-04T14:02:33.4348959Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_uint8 PASSED [0.0059s] [ 21%] 2025-12-04T14:02:33.4349085Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_bfloat16 PASSED [0.0139s] [ 21%] 2025-12-04T14:02:33.4349205Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_bool PASSED [0.0138s] [ 21%] 2025-12-04T14:02:33.4349333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_complex128 PASSED [0.0138s] [ 21%] 2025-12-04T14:02:33.4349454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_int8 PASSED [0.0137s] [ 21%] 2025-12-04T14:02:33.4349575Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_complex64 XFAIL [0.0045s] [ 21%] 2025-12-04T14:02:33.4349694Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_int64 XFAIL [0.9141s] [ 21%] 2025-12-04T14:02:33.4349810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_int8 XFAIL [0.9155s] [ 21%] 2025-12-04T14:02:33.4349927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_uint8 XFAIL [0.9191s] [ 21%] 2025-12-04T14:02:33.4350041Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_bool PASSED [0.9424s] [ 21%] 2025-12-04T14:02:33.4350204Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_complex64 PASSED [0.0098s] [ 21%] 2025-12-04T14:02:33.4350314Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_float16 PASSED [0.0095s] [ 21%] 2025-12-04T14:02:33.4350421Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_int16 PASSED [0.9244s] [ 21%] 2025-12-04T14:02:33.4350524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_int8 PASSED [0.0114s] [ 21%] 2025-12-04T14:02:33.4350639Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_complex64 PASSED [0.0054s] [ 21%] 2025-12-04T14:02:33.4350751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_float16 PASSED [0.0049s] [ 21%] 2025-12-04T14:02:33.4350860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_int16 PASSED [0.9343s] [ 21%] 2025-12-04T14:02:33.4350965Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_int8 PASSED [0.0047s] [ 21%] 2025-12-04T14:02:33.4351074Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_bool PASSED [0.0044s] [ 21%] 2025-12-04T14:02:33.4351182Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_int32 PASSED [0.9116s] [ 21%] 2025-12-04T14:02:33.4351295Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfinv_cuda_bfloat16 PASSED [0.0060s] [ 21%] 2025-12-04T14:02:33.4351409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfinv_cuda_float16 PASSED [0.9328s] [ 21%] 2025-12-04T14:02:33.4351537Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_complex64 PASSED [0.2026s] [ 21%] 2025-12-04T14:02:33.4351658Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_float16 PASSED [0.0051s] [ 21%] 2025-12-04T14:02:33.4351767Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_float32 PASSED [0.9268s] [ 21%] 2025-12-04T14:02:33.4351873Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_bool PASSED [0.0059s] [ 21%] 
2025-12-04T14:02:33.4351984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_complex128 PASSED [0.0057s] [ 21%] 2025-12-04T14:02:33.4352096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_complex32 PASSED [1.1187s] [ 21%] 2025-12-04T14:02:33.4352216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_float16 PASSED [0.0058s] [ 21%] 2025-12-04T14:02:33.4352324Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_int16 PASSED [0.0040s] [ 21%] 2025-12-04T14:02:33.4352430Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_int8 PASSED [0.9309s] [ 21%] 2025-12-04T14:02:33.4352551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_bfloat16 PASSED [0.0049s] [ 21%] 2025-12-04T14:02:33.4352662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_bool PASSED [0.9307s] [ 21%] 2025-12-04T14:02:33.4352786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_complex128 PASSED [0.0050s] [ 21%] 2025-12-04T14:02:33.4352906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_complex64 PASSED [0.9361s] [ 21%] 2025-12-04T14:02:33.4353023Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_float64 PASSED [0.0049s] [ 21%] 2025-12-04T14:02:33.4353143Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_float16 PASSED [0.0073s] [ 21%] 2025-12-04T14:02:33.4353260Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_float64 PASSED [0.0066s] [ 21%] 2025-12-04T14:02:33.4353376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_int8 PASSED [0.0066s] [ 21%] 2025-12-04T14:02:33.4353486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_bool PASSED [0.0046s] [ 21%] 2025-12-04T14:02:33.4353598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_float16 PASSED [0.0046s] [ 21%] 2025-12-04T14:02:33.4353720Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_float32 PASSED [0.9160s] [ 21%] 2025-12-04T14:02:33.4353832Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_float64 PASSED [0.0064s] [ 21%] 2025-12-04T14:02:33.4353940Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_int16 PASSED [0.0049s] [ 21%] 2025-12-04T14:02:33.4354050Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_int8 PASSED [0.0047s] [ 21%] 2025-12-04T14:02:33.4354160Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_float16 PASSED [0.9332s] [ 21%] 2025-12-04T14:02:33.4354274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_float32 PASSED [0.0046s] [ 21%] 2025-12-04T14:02:33.4354383Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_uint8 PASSED [0.9396s] [ 21%] 2025-12-04T14:02:33.4354504Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exponential_cuda_float32 PASSED [0.0064s] [ 21%] 2025-12-04T14:02:33.4354612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_bfloat16 PASSED [0.0412s] [ 21%] 2025-12-04T14:02:33.4354724Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_complex128 PASSED [0.0401s] [ 21%] 2025-12-04T14:02:33.4354835Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_complex64 PASSED [0.0402s] [ 21%] 2025-12-04T14:02:33.4354943Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_float16 PASSED [0.0402s] [ 21%] 2025-12-04T14:02:33.4355063Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_float8_e4m3fnuz PASSED [0.0403s] [ 21%] 2025-12-04T14:02:33.4355180Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_uint8 PASSED [0.9678s] [ 21%] 2025-12-04T14:02:33.4355310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_complex32 PASSED [2.8782s] [ 21%] 2025-12-04T14:02:33.4355419Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_int32 PASSED [2.4106s] [ 21%] 2025-12-04T14:02:33.4355531Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_int8 PASSED [0.0076s] [ 21%] 2025-12-04T14:02:33.4355639Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_bool PASSED [0.9339s] [ 21%] 2025-12-04T14:02:33.4355757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_complex128 PASSED [2.9498s] [ 21%] 2025-12-04T14:02:33.4355878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_float32 PASSED [0.0083s] [ 21%] 2025-12-04T14:02:33.4355992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_int16 PASSED [0.0073s] [ 21%] 2025-12-04T14:02:33.4356102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_int64 PASSED [0.0071s] [ 21%] 2025-12-04T14:02:33.4356213Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_uint8 PASSED [0.0071s] [ 21%] 2025-12-04T14:02:33.4356332Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_complex128 PASSED [1.8163s] [ 21%] 2025-12-04T14:02:33.4356445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_int32 PASSED [2.0268s] [ 21%] 2025-12-04T14:02:33.4356555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_int64 PASSED [0.0086s] [ 21%] 2025-12-04T14:02:33.4356682Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_complex128 PASSED [0.0110s] [ 21%] 2025-12-04T14:02:33.4356807Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_complex64 PASSED [0.8616s] [ 21%] 2025-12-04T14:02:33.4356927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_float16 PASSED [0.0091s] [ 21%] 2025-12-04T14:02:33.4357046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_int32 PASSED [0.8738s] [ 21%] 2025-12-04T14:02:33.4357164Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_complex32 PASSED [2.4395s] [ 21%] 2025-12-04T14:02:33.4357288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_float32 PASSED [2.3323s] [ 21%] 2025-12-04T14:02:33.4357400Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_int16 PASSED [0.0149s] [ 21%] 2025-12-04T14:02:33.4357512Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_int32 PASSED [0.0127s] [ 21%] 2025-12-04T14:02:33.4357624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_int64 PASSED [0.0125s] [ 21%] 2025-12-04T14:02:33.4357736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_int8 PASSED [0.0123s] [ 21%] 2025-12-04T14:02:33.4357853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_complex32 PASSED [0.2964s] [ 21%] 2025-12-04T14:02:33.4357968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_float16 PASSED [0.0105s] [ 21%] 2025-12-04T14:02:33.4358079Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_float64 PASSED [1.3748s] [ 21%] 2025-12-04T14:02:33.4358202Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_complex128 PASSED [0.8513s] [ 21%] 2025-12-04T14:02:33.4358320Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_complex32 PASSED [0.0292s] [ 21%] 2025-12-04T14:02:33.4358432Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_int64 PASSED [0.3394s] [ 21%] 2025-12-04T14:02:33.4358545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_int8 PASSED [0.8910s] [ 21%] 2025-12-04T14:02:33.4358657Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_uint8 PASSED [0.0164s] [ 21%] 2025-12-04T14:02:33.4358792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_complex32 PASSED [0.0075s] [ 21%] 2025-12-04T14:02:33.4358919Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_float16 PASSED [0.0075s] [ 21%] 2025-12-04T14:02:33.4359037Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_float32 PASSED [0.0071s] [ 21%] 2025-12-04T14:02:33.4406904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_float64 PASSED [0.3079s] [ 21%] 2025-12-04T14:02:33.4407079Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_int32 PASSED [0.0075s] [ 21%] 2025-12-04T14:02:33.4407216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_uint8 PASSED [0.0070s] [ 21%] 2025-12-04T14:02:33.4407415Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_float16 PASSED [1.1926s] [ 21%] 2025-12-04T14:02:33.4407550Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_float32 PASSED [0.0118s] [ 21%] 2025-12-04T14:02:33.4407670Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_int32 PASSED [0.0117s] [ 21%] 2025-12-04T14:02:33.4407814Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_float16 PASSED [0.5977s] [ 21%] 2025-12-04T14:02:33.4407933Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_float32 PASSED [0.0087s] [ 21%] 2025-12-04T14:02:33.4408064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_float64 PASSED [0.0083s] [ 21%] 2025-12-04T14:02:33.4408181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_int32 PASSED [0.0081s] [ 21%] 2025-12-04T14:02:33.4408316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_float32 PASSED [0.0069s] [ 21%] 2025-12-04T14:02:33.4408442Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_int16 PASSED [0.0067s] [ 21%] 2025-12-04T14:02:33.4408574Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_int32 PASSED [0.0067s] [ 21%] 2025-12-04T14:02:33.4408699Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_int64 PASSED [0.0067s] [ 21%] 2025-12-04T14:02:33.4408834Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_int8 PASSED [0.0067s] [ 21%] 2025-12-04T14:02:33.4408970Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft2_cuda_bool PASSED [0.3248s] [ 21%] 2025-12-04T14:02:33.4409098Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft2_cuda_int64 PASSED [0.0112s] [ 21%] 2025-12-04T14:02:33.4409218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft2_cuda_int8 PASSED [0.0107s] [ 21%] 2025-12-04T14:02:33.4409349Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft2_cuda_uint8 PASSED [0.9083s] [ 21%] 2025-12-04T14:02:33.4409479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_float64 PASSED [0.7475s] [ 21%] 2025-12-04T14:02:33.4409597Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_int16 PASSED [0.8830s] [ 21%] 2025-12-04T14:02:33.4409719Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_uint8 PASSED [0.0138s] [ 21%] 2025-12-04T14:02:33.4409838Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfftn_cuda_bool PASSED [0.8985s] [ 21%] 2025-12-04T14:02:33.4409964Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfftn_cuda_int16 PASSED [0.0150s] [ 21%] 2025-12-04T14:02:33.4410079Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_bool PASSED [1.3257s] [ 21%] 2025-12-04T14:02:33.4410240Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_float32 PASSED [0.0094s] [ 21%] 2025-12-04T14:02:33.4410360Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_uint8 PASSED [0.0076s] [ 21%] 2025-12-04T14:02:33.4410493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_float32 PASSED [0.1433s] [ 21%] 2025-12-04T14:02:33.4410631Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_int32 PASSED [0.0073s] [ 21%] 2025-12-04T14:02:33.4410800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_int64 PASSED [0.0072s] [ 21%] 2025-12-04T14:02:33.4410916Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_bool PASSED [0.0086s] [ 21%] 2025-12-04T14:02:33.4411048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_float32 PASSED [0.0086s] [ 21%] 2025-12-04T14:02:33.4411164Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_int8 PASSED [0.8874s] [ 21%] 2025-12-04T14:02:33.4411291Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft2_cuda_bool PASSED [0.4372s] [ 21%] 2025-12-04T14:02:33.4411427Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft2_cuda_uint8 PASSED [0.8805s] [ 21%] 2025-12-04T14:02:33.4411550Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_bool PASSED [0.0094s] [ 21%] 2025-12-04T14:02:33.4411677Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_int64 PASSED [0.8780s] [ 21%] 2025-12-04T14:02:33.4411793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_uint8 PASSED [0.0090s] [ 21%] 2025-12-04T14:02:33.4411924Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfftn_cuda_float64 PASSED [1.5620s] [ 21%] 2025-12-04T14:02:33.4412041Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfftn_cuda_int8 PASSED [0.0112s] [ 21%] 2025-12-04T14:02:33.4412165Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfftn_cuda_uint8 PASSED [0.0087s] [ 21%] 2025-12-04T14:02:33.4412281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_complex64 PASSED [0.0040s] [ 21%] 2025-12-04T14:02:33.4412403Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_float32 PASSED [0.0037s] [ 21%] 2025-12-04T14:02:33.4412523Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_float64 PASSED [0.8674s] [ 21%] 2025-12-04T14:02:33.4412648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_int32 PASSED [0.0055s] [ 21%] 2025-12-04T14:02:33.4412762Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_uint8 PASSED [0.0039s] [ 21%] 2025-12-04T14:02:33.4412885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_float16 PASSED [0.8794s] [ 22%] 2025-12-04T14:02:33.4413018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_float64 PASSED [0.0053s] [ 22%] 2025-12-04T14:02:33.4413144Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_int32 PASSED [0.0038s] [ 22%] 2025-12-04T14:02:33.4413259Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_int64 PASSED [0.8581s] [ 22%] 2025-12-04T14:02:33.4413382Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_uint8 PASSED [0.0054s] [ 22%] 2025-12-04T14:02:33.4413507Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_complex128 PASSED [0.0082s] [ 22%] 2025-12-04T14:02:33.4413624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_float16 PASSED [0.0075s] [ 22%] 2025-12-04T14:02:33.4413746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_float32 PASSED [0.0074s] [ 22%] 2025-12-04T14:02:33.4413858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_int32 PASSED [0.0073s] [ 22%] 2025-12-04T14:02:33.4413980Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_int64 PASSED [0.0074s] [ 22%] 2025-12-04T14:02:33.4414104Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_bfloat16 PASSED [0.0032s] [ 22%] 2025-12-04T14:02:33.4414228Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_complex64 PASSED [0.0032s] [ 22%] 2025-12-04T14:02:33.4414341Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_int32 PASSED 
[0.0032s] [ 22%] 2025-12-04T14:02:33.4414470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_bfloat16 PASSED [0.0032s] [ 22%] 2025-12-04T14:02:33.4414595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_bool PASSED [0.0031s] [ 22%] 2025-12-04T14:02:33.4414736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_complex128 PASSED [0.0033s] [ 22%] 2025-12-04T14:02:33.4414851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_float16 PASSED [0.0033s] [ 22%] 2025-12-04T14:02:33.4414975Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_float32 PASSED [0.0031s] [ 22%] 2025-12-04T14:02:33.4415093Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_int8 PASSED [0.8767s] [ 22%] 2025-12-04T14:02:33.4415215Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_uint8 PASSED [0.0042s] [ 22%] 2025-12-04T14:02:33.4415365Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_bfloat16 PASSED [0.0102s] [ 22%] 2025-12-04T14:02:33.4415494Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_complex128 PASSED [0.0087s] [ 22%] 2025-12-04T14:02:33.4415626Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_float16 PASSED [0.0091s] [ 22%] 2025-12-04T14:02:33.4415754Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_int32 PASSED [0.0092s] [ 22%] 2025-12-04T14:02:33.4415882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_int8 PASSED [0.0091s] [ 22%] 2025-12-04T14:02:33.4416002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_uint8 PASSED [0.0092s] [ 22%] 2025-12-04T14:02:33.4416125Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_float32 PASSED [0.8778s] [ 22%] 2025-12-04T14:02:33.4416242Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_int16 PASSED [0.0045s] [ 22%] 2025-12-04T14:02:33.4416365Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_int32 PASSED [0.8711s] [ 22%] 2025-12-04T14:02:33.4416479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_int8 PASSED [0.0044s] [ 22%] 2025-12-04T14:02:33.4416614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_bfloat16 PASSED [0.0316s] [ 22%] 2025-12-04T14:02:33.4416743Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_float16 PASSED [0.0306s] [ 22%] 2025-12-04T14:02:33.4416873Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_int16 PASSED [0.0160s] [ 22%] 2025-12-04T14:02:33.4417007Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_int8 PASSED [0.0159s] [ 22%] 2025-12-04T14:02:33.4417139Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_uint8 PASSED [0.0094s] [ 22%] 2025-12-04T14:02:33.4417258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_bfloat16 PASSED [0.0087s] [ 22%] 2025-12-04T14:02:33.4417381Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_float16 PASSED [0.0092s] [ 22%] 2025-12-04T14:02:33.4417504Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_int64 PASSED [0.0090s] [ 22%] 2025-12-04T14:02:33.4417618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_float64 PASSED [0.0088s] [ 22%] 2025-12-04T14:02:33.4417739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_int16 PASSED [0.0085s] [ 22%] 2025-12-04T14:02:33.4417853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_int64 PASSED [0.0085s] [ 22%] 2025-12-04T14:02:33.4417978Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_bfloat16 PASSED [0.0094s] [ 22%] 2025-12-04T14:02:33.4418088Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_int32 PASSED [0.0090s] [ 22%] 2025-12-04T14:02:33.4418207Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_int64 PASSED [0.0090s] [ 22%] 2025-12-04T14:02:33.4418325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frac_cuda_float16 PASSED [0.8617s] [ 22%] 2025-12-04T14:02:33.4418446Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frac_cuda_float32 PASSED [0.0049s] [ 22%] 2025-12-04T14:02:33.4418574Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frac_cuda_float64 PASSED [0.0035s] [ 22%] 2025-12-04T14:02:33.4418713Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frexp_cuda_float32 PASSED [0.8781s] [ 22%] 2025-12-04T14:02:33.4418830Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_float64 PASSED [0.0057s] [ 22%] 2025-12-04T14:02:33.4418961Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_int32 PASSED [0.0067s] [ 22%] 2025-12-04T14:02:33.4419077Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_int8 PASSED [0.0063s] [ 22%] 2025-12-04T14:02:33.4419208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_complex128 PASSED [0.0073s] [ 22%] 2025-12-04T14:02:33.4419349Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_float32 PASSED [0.0068s] [ 22%] 2025-12-04T14:02:33.4419465Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_int32 PASSED [0.0068s] [ 22%] 2025-12-04T14:02:33.4419585Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_int64 PASSED [0.0068s] [ 22%] 2025-12-04T14:02:33.4419699Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_uint8 PASSED [0.0068s] [ 22%] 2025-12-04T14:02:33.4419817Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gcd_cuda_int64 PASSED [0.0106s] [ 22%] 2025-12-04T14:02:33.4419928Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gcd_cuda_int8 PASSED [0.0097s] [ 22%] 2025-12-04T14:02:33.4420050Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_bfloat16 PASSED [0.0086s] 
[ 22%] 2025-12-04T14:02:33.4420203Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_float32 PASSED [0.0085s] [ 22%] 2025-12-04T14:02:33.4420325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_float64 PASSED [0.0084s] [ 22%] 2025-12-04T14:02:33.4420437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_int32 PASSED [0.0084s] [ 22%] 2025-12-04T14:02:33.4420554Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_int64 PASSED [0.0084s] [ 22%] 2025-12-04T14:02:33.4420662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_int8 PASSED [0.0084s] [ 22%] 2025-12-04T14:02:33.4420793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_float16 PASSED [0.8723s] [ 22%] 2025-12-04T14:02:33.4420928Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_int64 PASSED [0.0062s] [ 22%] 2025-12-04T14:02:33.4421063Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_complex128 PASSED [0.2843s] [ 22%] 2025-12-04T14:02:33.4421182Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_float16 PASSED [0.1725s] [ 22%] 2025-12-04T14:02:33.4421307Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_int16 PASSED [0.1707s] [ 22%] 2025-12-04T14:02:33.4421431Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_int32 PASSED [0.1729s] [ 22%] 2025-12-04T14:02:33.4421550Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_int8 PASSED [0.1708s] [ 22%] 2025-12-04T14:02:33.4421709Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_3d_cuda_bfloat16 SKIPPED [0.0002s] (Skipped!) [ 22%] 2025-12-04T14:02:33.4421854Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_3d_cuda_float16 SKIPPED [0.0001s] (Skipped!) 
[ 22%] 2025-12-04T14:02:33.4421972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_int16 PASSED [0.0087s] [ 22%] 2025-12-04T14:02:33.4422080Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_int32 PASSED [0.0086s] [ 22%] 2025-12-04T14:02:33.4422194Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_int64 PASSED [0.0085s] [ 22%] 2025-12-04T14:02:33.4422302Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_int8 PASSED [0.0085s] [ 22%] 2025-12-04T14:02:33.4422426Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_bfloat16 PASSED [0.8739s] [ 22%] 2025-12-04T14:02:33.4422564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_float16 PASSED [0.0040s] [ 22%] 2025-12-04T14:02:33.4422697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_float64 PASSED [0.8755s] [ 22%] 2025-12-04T14:02:33.4422821Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_bfloat16 PASSED [0.0132s] [ 22%] 2025-12-04T14:02:33.4422951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_float64 PASSED [0.0110s] [ 22%] 2025-12-04T14:02:33.4423073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_int32 PASSED [0.0108s] [ 22%] 2025-12-04T14:02:33.4423199Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_int64 PASSED [0.0106s] [ 22%] 2025-12-04T14:02:33.4423337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_int8 PASSED [0.8800s] [ 22%] 2025-12-04T14:02:33.4423458Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_float32 PASSED [0.0169s] [ 22%] 2025-12-04T14:02:33.4423586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_int32 PASSED [0.0148s] [ 22%] 2025-12-04T14:02:33.4423703Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_float32 PASSED [0.0501s] [ 22%] 2025-12-04T14:02:33.4423826Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_int64 PASSED [0.0496s] [ 22%] 2025-12-04T14:02:33.4423939Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_uint8 PASSED [0.0491s] [ 22%] 2025-12-04T14:02:33.4424066Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_complex64 PASSED [0.8754s] [ 22%] 2025-12-04T14:02:33.4424186Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_float64 PASSED [0.0049s] [ 22%] 2025-12-04T14:02:33.4424309Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_int64 PASSED [0.0035s] [ 22%] 2025-12-04T14:02:33.4424423Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_bool PASSED [0.8653s] [ 22%] 2025-12-04T14:02:33.4424551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_float64 PASSED [0.0056s] [ 22%] 2025-12-04T14:02:33.4424670Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hypot_cuda_float64 PASSED [0.0091s] [ 22%] 2025-12-04T14:02:33.4424797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_bfloat16 PASSED [1.0488s] [ 22%] 2025-12-04T14:02:33.4424920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_igammac_cuda_float32 PASSED [0.0104s] [ 22%] 2025-12-04T14:02:33.4425036Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_igammac_cuda_float64 PASSED [0.0087s] [ 22%] 2025-12-04T14:02:33.4425159Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_imag_cuda_complex64 PASSED [0.8816s] [ 22%] 2025-12-04T14:02:33.4425276Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_bool PASSED [0.0089s] [ 22%] 2025-12-04T14:02:33.4425406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_complex128 PASSED [0.0102s] [ 22%] 2025-12-04T14:02:33.4425529Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_complex32 PASSED [0.8819s] [ 22%] 2025-12-04T14:02:33.4425654Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_float64 PASSED [0.0120s] [ 22%] 2025-12-04T14:02:33.4425777Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_bfloat16 PASSED [0.0048s] [ 22%] 2025-12-04T14:02:33.4425906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_complex32 PASSED [0.0045s] [ 22%] 2025-12-04T14:02:33.4426032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_float64 PASSED [0.0044s] [ 22%] 2025-12-04T14:02:33.4426315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_int16 PASSED [0.0044s] [ 22%] 2025-12-04T14:02:33.4426452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_int32 PASSED [0.0043s] [ 22%] 2025-12-04T14:02:33.4426612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_uint8 PASSED [0.0044s] [ 22%] 2025-12-04T14:02:33.4426764Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_complex128 PASSED [0.0069s] [ 22%] 2025-12-04T14:02:33.4426908Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_int8 PASSED [0.0066s] [ 22%] 2025-12-04T14:02:33.4427031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_bfloat16 PASSED [0.0058s] [ 22%] 2025-12-04T14:02:33.4427218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_complex32 PASSED [0.0055s] [ 22%] 2025-12-04T14:02:33.4427343Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_int32 PASSED [0.0053s] [ 22%] 2025-12-04T14:02:33.4427504Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_int64 PASSED [0.0054s] [ 22%] 2025-12-04T14:02:33.4427646Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_float64 PASSED [0.0077s] [ 22%] 2025-12-04T14:02:33.4427797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_uint8 PASSED [0.0074s] [ 22%] 2025-12-04T14:02:33.4427988Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amin_cuda_float16 PASSED [0.0076s] [ 22%] 2025-12-04T14:02:33.4428158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amin_cuda_float32 PASSED [0.0075s] [ 22%] 2025-12-04T14:02:33.4428315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amin_cuda_int8 PASSED [0.8811s] [ 22%] 2025-12-04T14:02:33.4428453Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_float16 PASSED [0.0101s] [ 22%] 2025-12-04T14:02:33.4428598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_int8 PASSED [0.0084s] [ 22%] 2025-12-04T14:02:33.4428761Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_float32 PASSED [0.0077s] [ 22%] 2025-12-04T14:02:33.4428929Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_int64 PASSED [0.0075s] [ 22%] 2025-12-04T14:02:33.4429067Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_bfloat16 PASSED [0.0044s] [ 22%] 2025-12-04T14:02:33.4429213Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_bool PASSED [0.0040s] [ 22%] 2025-12-04T14:02:33.4429357Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_int32 PASSED [0.8840s] [ 22%] 2025-12-04T14:02:33.4429532Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_int64 PASSED [0.0060s] [ 22%] 2025-12-04T14:02:33.4429687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_inner_cuda_complex64 PASSED [0.0056s] [ 22%] 2025-12-04T14:02:33.4429815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_float16 PASSED [0.8718s] [ 22%] 2025-12-04T14:02:33.4429954Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_float32 PASSED [0.0041s] [ 22%] 2025-12-04T14:02:33.4430081Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_int16 PASSED [0.8696s] [ 22%] 
2025-12-04T14:02:33.4430285Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_int8 PASSED [0.0041s] [ 22%] 2025-12-04T14:02:33.4430424Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_bool PASSED [0.0892s] [ 23%] 2025-12-04T14:02:33.4430570Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_complex128 PASSED [0.0938s] [ 23%] 2025-12-04T14:02:33.4430696Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_float32 PASSED [0.0927s] [ 23%] 2025-12-04T14:02:33.4430838Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_int64 PASSED [0.0871s] [ 23%] 2025-12-04T14:02:33.4430955Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_uint8 PASSED [0.0873s] [ 23%] 2025-12-04T14:02:33.4431141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_bfloat16 PASSED [0.0081s] [ 23%] 2025-12-04T14:02:33.4431288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_bool PASSED [0.8811s] [ 23%] 2025-12-04T14:02:33.4431450Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_complex64 PASSED [0.0195s] [ 23%] 2025-12-04T14:02:33.4431581Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_float32 PASSED [0.0081s] [ 23%] 2025-12-04T14:02:33.4431721Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_int8 PASSED [0.0038s] [ 23%] 2025-12-04T14:02:33.4431894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_float16 PASSED [0.0346s] [ 23%] 2025-12-04T14:02:33.4432017Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_int16 PASSED [0.8918s] [ 23%] 2025-12-04T14:02:33.4432170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_int64 PASSED [0.0056s] [ 23%] 2025-12-04T14:02:33.4432304Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_float16 PASSED [0.0034s] [ 23%] 2025-12-04T14:02:33.4432435Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_float32 PASSED [0.8722s] [ 23%] 2025-12-04T14:02:33.4432583Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_int32 PASSED [0.0042s] [ 23%] 2025-12-04T14:02:33.4432732Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_bfloat16 PASSED [0.8691s] [ 23%] 2025-12-04T14:02:33.4432855Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_int16 PASSED [0.0044s] [ 23%] 2025-12-04T14:02:33.4432997Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_int32 PASSED [0.8653s] [ 23%] 2025-12-04T14:02:33.4433118Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_int64 PASSED [0.0045s] [ 23%] 2025-12-04T14:02:33.4433269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_uint8 PASSED [0.8753s] [ 23%] 2025-12-04T14:02:33.4433406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isneginf_cuda_float16 PASSED [0.0048s] [ 23%] 2025-12-04T14:02:33.4433548Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_bool PASSED [0.8761s] [ 23%] 2025-12-04T14:02:33.4433697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_float16 PASSED [0.0047s] [ 23%] 2025-12-04T14:02:33.4433829Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_complex128 PASSED [0.0061s] [ 23%] 2025-12-04T14:02:33.4434002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_float64 PASSED [0.0039s] [ 23%] 2025-12-04T14:02:33.4434137Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_complex32 PASSED [0.8859s] [ 23%] 2025-12-04T14:02:33.4434285Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_complex64 PASSED [0.0056s] [ 23%] 2025-12-04T14:02:33.4434409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_float32 PASSED [0.8647s] [ 23%] 2025-12-04T14:02:33.4434545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_int32 
PASSED [0.0053s] [ 23%] 2025-12-04T14:02:33.4434660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_int64 PASSED [0.8709s] [ 23%] 2025-12-04T14:02:33.4434823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_int8 PASSED [0.0057s] [ 23%] 2025-12-04T14:02:33.4434982Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_complex128 PASSED [0.2721s] [ 23%] 2025-12-04T14:02:33.4435160Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_float64 PASSED [0.2564s] [ 23%] 2025-12-04T14:02:33.4435312Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_int16 PASSED [0.2961s] [ 23%] 2025-12-04T14:02:33.4435473Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_int64 PASSED [0.0054s] [ 23%] 2025-12-04T14:02:33.4435685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_bfloat16 PASSED [0.3122s] [ 23%] 2025-12-04T14:02:33.4435863Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_bool PASSED [0.2835s] [ 23%] 2025-12-04T14:02:33.4436059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_complex128 PASSED [0.0063s] [ 23%] 2025-12-04T14:02:33.4436223Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float16 PASSED [0.2972s] [ 23%] 2025-12-04T14:02:33.4436393Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float32 PASSED [0.2545s] [ 23%] 2025-12-04T14:02:33.4436575Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float64 PASSED [0.2537s] [ 23%] 2025-12-04T14:02:33.4436774Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_int16 PASSED [0.0057s] [ 23%] 2025-12-04T14:02:33.4436932Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_int8 PASSED [0.2648s] [ 23%] 2025-12-04T14:02:33.4437107Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_uint8 PASSED [0.2673s] [ 23%] 2025-12-04T14:02:33.4437248Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_float16 PASSED [0.2910s] [ 23%] 2025-12-04T14:02:33.4437442Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_uint8 PASSED [0.2473s] [ 23%] 2025-12-04T14:02:33.4437620Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_int16 PASSED [0.2886s] [ 23%] 2025-12-04T14:02:33.4437777Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_int8 PASSED [0.0057s] [ 23%] 2025-12-04T14:02:33.4437948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_uint8 PASSED [0.2461s] [ 23%] 2025-12-04T14:02:33.4438092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_complex128 PASSED [0.8948s] [ 23%] 2025-12-04T14:02:33.4438279Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_float64 PASSED [0.0052s] [ 23%] 2025-12-04T14:02:33.4438411Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_bfloat16 PASSED [0.0045s] [ 23%] 2025-12-04T14:02:33.4438547Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_bool PASSED [0.0037s] [ 23%] 2025-12-04T14:02:33.4438680Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_float32 PASSED [0.8660s] [ 23%] 2025-12-04T14:02:33.4438825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_float16 PASSED [0.0194s] [ 23%] 2025-12-04T14:02:33.4438945Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_float64 PASSED [0.0058s] [ 23%] 2025-12-04T14:02:33.4439112Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lcm_cuda_int8 PASSED [0.3331s] [ 23%] 2025-12-04T14:02:33.4439248Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_float64 PASSED [0.0159s] [ 23%] 2025-12-04T14:02:33.4439370Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_int16 PASSED [0.0126s] [ 23%] 2025-12-04T14:02:33.4439504Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_bool PASSED [0.0085s] [ 23%] 2025-12-04T14:02:33.4439615Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_int16 PASSED [0.0084s] [ 23%] 2025-12-04T14:02:33.4439778Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_int64 PASSED [0.0085s] [ 23%] 2025-12-04T14:02:33.4439891Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_int8 PASSED [0.0085s] [ 23%] 2025-12-04T14:02:33.4440020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_uint8 PASSED [0.0084s] [ 23%] 2025-12-04T14:02:33.4440185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lerp_cuda_float16 PASSED [0.0122s] [ 23%] 2025-12-04T14:02:33.4440313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lerp_cuda_float64 PASSED [0.0121s] [ 23%] 2025-12-04T14:02:33.4440470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lgamma_cuda_int8 PASSED [0.0053s] [ 23%] 2025-12-04T14:02:33.4440645Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_cuda_complex64 PASSED [0.0176s] [ 23%] 2025-12-04T14:02:33.4440785Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_ex_cuda_float64 PASSED [0.0158s] [ 23%] 2025-12-04T14:02:33.4440942Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_complex64 PASSED [0.0041s] [ 23%] 2025-12-04T14:02:33.4441074Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_float16 PASSED [0.0040s] [ 23%] 2025-12-04T14:02:33.4441237Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_uint8 PASSED [0.8667s] [ 23%] 2025-12-04T14:02:33.4441398Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_det_cuda_float32 PASSED [1.0473s] [ 23%] 2025-12-04T14:02:33.4441540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_bfloat16 PASSED [0.0069s] [ 23%] 2025-12-04T14:02:33.4441688Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_bool PASSED [0.0064s] [ 23%] 2025-12-04T14:02:33.4441828Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_complex64 PASSED [0.0063s] [ 23%] 2025-12-04T14:02:33.4441997Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_float32 PASSED [0.0062s] [ 23%] 2025-12-04T14:02:33.4442141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_int64 PASSED [0.0062s] [ 23%] 2025-12-04T14:02:33.4442302Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eig_cuda_complex128 PASSED [0.0916s] [ 23%] 2025-12-04T14:02:33.4442434Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eig_cuda_complex64 PASSED [0.0447s] [ 23%] 2025-12-04T14:02:33.4442578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eig_cuda_float64 PASSED [0.0838s] [ 23%] 2025-12-04T14:02:33.4442703Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigh_cuda_float32 PASSED [0.0549s] [ 23%] 2025-12-04T14:02:33.4442989Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_householder_product_cuda_float32 SKIPPED [0.0008s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 23%] 2025-12-04T14:02:33.4443152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_cuda_complex64 PASSED [0.0455s] [ 23%] 2025-12-04T14:02:33.4443286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_ex_cuda_float32 PASSED [0.9391s] [ 23%] 2025-12-04T14:02:33.4443443Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_cuda_complex64 PASSED [1.1014s] [ 23%] 2025-12-04T14:02:33.4443575Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_cuda_float64 PASSED [0.0075s] [ 23%] 2025-12-04T14:02:33.4443845Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_solve_cuda_complex128 SKIPPED [0.0007s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 23%] 2025-12-04T14:02:33.4444067Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_solve_cuda_complex64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 23%] 2025-12-04T14:02:33.4444299Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_solve_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 23%] 2025-12-04T14:02:33.4444435Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lstsq_cuda_complex128 PASSED [1.0470s] [ 23%] 2025-12-04T14:02:33.4444578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lstsq_cuda_complex64 PASSED [0.1339s] [ 23%] 2025-12-04T14:02:33.4444795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lstsq_grad_oriented_cuda_complex128 PASSED [1.0449s] [ 23%] 2025-12-04T14:02:33.4444932Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_cuda_float64 PASSED [0.0792s] [ 23%] 2025-12-04T14:02:33.4445092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_solve_cuda_float64 PASSED [0.0874s] [ 23%] 2025-12-04T14:02:33.4445246Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_complex128 PASSED [0.0977s] [ 23%] 2025-12-04T14:02:33.4445401Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_complex64 PASSED [0.0925s] [ 23%] 2025-12-04T14:02:33.4445568Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_float32 PASSED [0.1037s] [ 
23%] 2025-12-04T14:02:33.4445729Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_power_cuda_complex64 PASSED [0.1115s] [ 23%] 2025-12-04T14:02:33.4445878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_multi_dot_cuda_bfloat16 PASSED [0.0121s] [ 23%] 2025-12-04T14:02:33.4446031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_multi_dot_cuda_float16 PASSED [0.0112s] [ 23%] 2025-12-04T14:02:33.4446168Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_norm_cuda_float64 PASSED [0.1329s] [ 23%] 2025-12-04T14:02:33.4446363Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_norm_subgradients_at_zero_cuda_float16 PASSED [0.0661s] [ 23%] 2025-12-04T14:02:33.4446516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_cuda_complex64 PASSED [0.0397s] [ 23%] 2025-12-04T14:02:33.4446649Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_cuda_float32 PASSED [0.0398s] [ 23%] 2025-12-04T14:02:33.4446908Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_cuda_float64 PASSED [0.0381s] [ 23%] 2025-12-04T14:02:33.4447056Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_hermitian_cuda_complex128 PASSED [0.0148s] [ 23%] 2025-12-04T14:02:33.4447317Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_singular_cuda_complex64 SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 23%] 2025-12-04T14:02:33.4447539Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_singular_cuda_float32 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 23%] 2025-12-04T14:02:33.4447684Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_qr_cuda_complex128 PASSED [0.0334s] [ 23%] 2025-12-04T14:02:33.4447824Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_qr_cuda_float32 PASSED [0.0600s] [ 23%] 2025-12-04T14:02:33.4447975Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_slogdet_cuda_complex128 PASSED [1.1390s] [ 23%] 2025-12-04T14:02:33.4448101Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_slogdet_cuda_float32 PASSED [0.0111s] [ 23%] 2025-12-04T14:02:33.4448281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_ex_cuda_complex128 PASSED [0.0242s] [ 23%] 2025-12-04T14:02:33.4448429Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_ex_cuda_complex64 PASSED [0.0179s] [ 23%] 2025-12-04T14:02:33.4448567Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_ex_cuda_float32 PASSED [0.0175s] [ 23%] 2025-12-04T14:02:33.4448728Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_triangular_cuda_complex64 PASSED [0.2729s] [ 23%] 2025-12-04T14:02:33.4448871Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_triangular_cuda_float64 PASSED [0.0907s] [ 23%] 2025-12-04T14:02:33.4449040Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svd_cuda_float32 PASSED [0.1135s] [ 23%] 2025-12-04T14:02:33.4449176Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svdvals_cuda_complex128 PASSED [0.0239s] [ 23%] 2025-12-04T14:02:33.4449331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svdvals_cuda_complex64 PASSED [0.0237s] [ 23%] 2025-12-04T14:02:33.4449463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svdvals_cuda_float32 PASSED [0.0288s] [ 23%] 2025-12-04T14:02:33.4449618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_tensorinv_cuda_complex64 PASSED [0.9323s] [ 23%] 2025-12-04T14:02:33.4449789Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_tensorsolve_cuda_complex128 PASSED [0.0199s] [ 23%] 2025-12-04T14:02:33.4449948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_complex128 PASSED [0.0251s] [ 23%] 2025-12-04T14:02:33.4450348Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_float32 PASSED [0.0242s] [ 23%] 2025-12-04T14:02:33.4450478Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_int16 PASSED [0.0244s] [ 23%] 2025-12-04T14:02:33.4450612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_int32 PASSED [0.0242s] [ 23%] 2025-12-04T14:02:33.4450789Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_complex64 PASSED [0.1285s] [ 23%] 2025-12-04T14:02:33.4450941Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_float64 PASSED [0.0204s] [ 23%] 2025-12-04T14:02:33.4451066Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_int32 PASSED [0.0203s] [ 23%] 2025-12-04T14:02:33.4451201Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_int64 PASSED [0.0203s] [ 23%] 2025-12-04T14:02:33.4451353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_complex128 PASSED [0.1065s] [ 23%] 2025-12-04T14:02:33.4451545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_complex64 PASSED [0.1061s] [ 24%] 2025-12-04T14:02:33.4451701Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_float32 PASSED [0.1058s] [ 24%] 2025-12-04T14:02:33.4451857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_int64 PASSED [0.1055s] [ 24%] 2025-12-04T14:02:33.4451977Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_int64 PASSED [0.9497s] [ 24%] 2025-12-04T14:02:33.4452109Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_int8 PASSED [0.0057s] [ 24%] 2025-12-04T14:02:33.4452279Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_float64 PASSED [0.0031s] [ 24%] 2025-12-04T14:02:33.4452404Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_int8 PASSED [0.9324s] [ 
24%] 2025-12-04T14:02:33.4452549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_bool PASSED [0.0059s] [ 24%] 2025-12-04T14:02:33.4452673Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_complex128 PASSED [0.1547s] [ 24%] 2025-12-04T14:02:33.4452805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_int16 PASSED [0.9313s] [ 24%] 2025-12-04T14:02:33.4452923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_int64 PASSED [0.0059s] [ 24%] 2025-12-04T14:02:33.4453082Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_int8 PASSED [0.0041s] [ 24%] 2025-12-04T14:02:33.4453205Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_complex128 PASSED [0.9292s] [ 24%] 2025-12-04T14:02:33.4453334Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_float32 PASSED [0.0059s] [ 24%] 2025-12-04T14:02:33.4453463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_normal_cuda_float64 PASSED [0.0051s] [ 24%] 2025-12-04T14:02:33.4453618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_bool PASSED [0.0097s] [ 24%] 2025-12-04T14:02:33.4453795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_float16 PASSED [0.0092s] [ 24%] 2025-12-04T14:02:33.4453950Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_int32 PASSED [0.0091s] [ 24%] 2025-12-04T14:02:33.4454081Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_int64 PASSED [0.0090s] [ 24%] 2025-12-04T14:02:33.4454206Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp2_cuda_bfloat16 PASSED [0.9423s] [ 24%] 2025-12-04T14:02:33.4454339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp2_cuda_float32 PASSED [0.0063s] [ 24%] 2025-12-04T14:02:33.4454474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp2_cuda_float64 PASSED [0.0047s] [ 24%] 
2025-12-04T14:02:33.4454592Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp_cuda_bfloat16 PASSED [0.0223s] [ 24%] 2025-12-04T14:02:33.4454717Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logcumsumexp_cuda_float64 PASSED [0.0162s] [ 24%] 2025-12-04T14:02:33.4454837Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_bfloat16 PASSED [0.0111s] [ 24%] 2025-12-04T14:02:33.4454972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_complex128 PASSED [0.3846s] [ 24%] 2025-12-04T14:02:33.4455092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_float64 PASSED [0.0117s] [ 24%] 2025-12-04T14:02:33.4455211Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_uint8 PASSED [0.0113s] [ 24%] 2025-12-04T14:02:33.4455326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_bool PASSED [0.0040s] [ 24%] 2025-12-04T14:02:33.4455452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_complex128 PASSED [0.0040s] [ 24%] 2025-12-04T14:02:33.4455575Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_complex64 PASSED [0.9249s] [ 24%] 2025-12-04T14:02:33.4455697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_float32 PASSED [0.0061s] [ 24%] 2025-12-04T14:02:33.4455815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_int64 PASSED [0.0043s] [ 24%] 2025-12-04T14:02:33.4455927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_bool PASSED [0.0093s] [ 24%] 2025-12-04T14:02:33.4456047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_float32 PASSED [0.0109s] [ 24%] 2025-12-04T14:02:33.4456162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_int64 PASSED [0.0107s] [ 24%] 2025-12-04T14:02:33.4456278Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_int8 PASSED [0.0110s] [ 24%] 
2025-12-04T14:02:33.4456392Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_uint8 PASSED [0.0126s] [ 24%] 2025-12-04T14:02:33.4456525Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_bfloat16 PASSED [0.0114s] [ 24%] 2025-12-04T14:02:33.4456644Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_float16 PASSED [0.0111s] [ 24%] 2025-12-04T14:02:33.4456761Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_int8 PASSED [0.0107s] [ 24%] 2025-12-04T14:02:33.4456870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_int8 PASSED [0.9353s] [ 24%] 2025-12-04T14:02:33.4456992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_complex128 PASSED [0.1208s] [ 24%] 2025-12-04T14:02:33.4457110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_complex64 PASSED [0.1182s] [ 24%] 2025-12-04T14:02:33.4457227Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_int64 PASSED [0.1108s] [ 24%] 2025-12-04T14:02:33.4457338Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_int8 PASSED [0.0463s] [ 24%] 2025-12-04T14:02:33.4457482Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_complex64 PASSED [0.6877s] [ 24%] 2025-12-04T14:02:33.4457617Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_int64 PASSED [0.6470s] [ 24%] 2025-12-04T14:02:33.4457750Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_int8 PASSED [0.2611s] [ 24%] 2025-12-04T14:02:33.4457866Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_bool PASSED [0.0106s] [ 24%] 2025-12-04T14:02:33.4457992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_float16 PASSED [0.0110s] [ 24%] 2025-12-04T14:02:33.4458116Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_float16 PASSED [0.9336s] 
[ 24%] 2025-12-04T14:02:33.4458225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_int16 PASSED [0.0042s] [ 24%] 2025-12-04T14:02:33.4458337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_int8 PASSED [0.9365s] [ 24%] 2025-12-04T14:02:33.4458444Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_float32 PASSED [0.0107s] [ 24%] 2025-12-04T14:02:33.4458552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_int32 PASSED [0.0088s] [ 24%] 2025-12-04T14:02:33.4458665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_int8 PASSED [0.0086s] [ 24%] 2025-12-04T14:02:33.4458781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_cuda_complex128 PASSED [0.1537s] [ 24%] 2025-12-04T14:02:33.4458892Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_cuda_complex64 PASSED [0.0546s] [ 24%] 2025-12-04T14:02:33.4459008Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_solve_cuda_float32 PASSED [0.0307s] [ 24%] 2025-12-04T14:02:33.4459126Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_unpack_cuda_complex64 PASSED [0.0230s] [ 24%] 2025-12-04T14:02:33.4459245Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_unpack_cuda_float32 PASSED [0.0216s] [ 24%] 2025-12-04T14:02:33.4459356Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_complex32 PASSED [0.9424s] [ 24%] 2025-12-04T14:02:33.4459465Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_complex64 PASSED [0.0065s] [ 24%] 2025-12-04T14:02:33.4459580Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_complex128 PASSED [0.0038s] [ 24%] 2025-12-04T14:02:33.4459685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_int16 PASSED [0.9366s] [ 24%] 2025-12-04T14:02:33.4459788Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_int8 PASSED [0.0051s] [ 24%] 2025-12-04T14:02:33.4459903Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amax_cuda_int32 PASSED [0.1142s] [ 24%] 2025-12-04T14:02:33.4460019Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amax_cuda_int8 PASSED [0.1128s] [ 24%] 2025-12-04T14:02:33.4460186Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amax_cuda_uint8 PASSED [0.1124s] [ 24%] 2025-12-04T14:02:33.4460310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_bfloat16 PASSED [0.1387s] [ 24%] 2025-12-04T14:02:33.4460424Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_int8 PASSED [0.1141s] [ 24%] 2025-12-04T14:02:33.4460550Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_float64 PASSED [0.0791s] [ 24%] 2025-12-04T14:02:33.4460667Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_int32 PASSED [0.0668s] [ 24%] 2025-12-04T14:02:33.4460790Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_uint8 PASSED [0.0665s] [ 24%] 2025-12-04T14:02:33.4460912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_bfloat16 PASSED [0.0788s] [ 24%] 2025-12-04T14:02:33.4461033Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_float16 PASSED [0.0790s] [ 24%] 2025-12-04T14:02:33.4461155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_float64 PASSED [0.0788s] [ 24%] 2025-12-04T14:02:33.4461277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_int64 PASSED [0.0659s] [ 24%] 2025-12-04T14:02:33.4461394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_int8 PASSED [0.0669s] [ 24%] 2025-12-04T14:02:33.4461522Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_complex128 PASSED [0.0374s] [ 24%] 2025-12-04T14:02:33.4461645Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_float16 PASSED [0.0386s] [ 24%] 
2025-12-04T14:02:33.4461786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_float64 PASSED [0.0373s] [ 24%] 2025-12-04T14:02:33.4461919Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_int32 PASSED [0.0372s] [ 24%] 2025-12-04T14:02:33.4462039Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_uint8 PASSED [0.0371s] [ 24%] 2025-12-04T14:02:33.4462161Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_bfloat16 PASSED [0.0386s] [ 24%] 2025-12-04T14:02:33.4462283Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_complex128 PASSED [0.0374s] [ 24%] 2025-12-04T14:02:33.4462414Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_float16 PASSED [0.0385s] [ 24%] 2025-12-04T14:02:33.4462534Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_float64 PASSED [0.0372s] [ 24%] 2025-12-04T14:02:33.4462653Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_int16 PASSED [0.0370s] [ 24%] 2025-12-04T14:02:33.4462773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_int32 PASSED [0.0371s] [ 24%] 2025-12-04T14:02:33.4462892Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_int64 PASSED [0.0370s] [ 24%] 2025-12-04T14:02:33.4463011Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_int8 PASSED [0.0370s] [ 24%] 2025-12-04T14:02:33.4463128Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_bfloat16 PASSED [0.0099s] [ 24%] 2025-12-04T14:02:33.4463247Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_float64 PASSED [0.0096s] [ 24%] 2025-12-04T14:02:33.4463361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_int16 PASSED [0.0096s] [ 24%] 2025-12-04T14:02:33.4463474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_int64 
PASSED [0.0096s] [ 24%] 2025-12-04T14:02:33.4463601Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_log_softmax_cuda_float16 PASSED [0.0456s] [ 24%] 2025-12-04T14:02:33.4463726Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_float32 PASSED [0.2165s] [ 24%] 2025-12-04T14:02:33.4463861Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_float64 PASSED [0.2176s] [ 24%] 2025-12-04T14:02:33.4463983Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_int16 PASSED [0.1874s] [ 24%] 2025-12-04T14:02:33.4464102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_int32 PASSED [0.1890s] [ 24%] 2025-12-04T14:02:33.4464225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_mean_cuda_bfloat16 PASSED [0.2337s] [ 24%] 2025-12-04T14:02:33.4464344Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_mean_cuda_complex128 PASSED [0.1931s] [ 24%] 2025-12-04T14:02:33.4464463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_mean_cuda_float32 PASSED [0.2260s] [ 24%] 2025-12-04T14:02:33.4464584Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_median_cuda_bfloat16 PASSED [0.0255s] [ 24%] 2025-12-04T14:02:33.4464703Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_norm_cuda_float16 PASSED [0.7953s] [ 24%] 2025-12-04T14:02:33.4464833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_normalize_cuda_complex64 PASSED [0.0695s] [ 24%] 2025-12-04T14:02:33.4464956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_normalize_cuda_float32 PASSED [0.0678s] [ 24%] 2025-12-04T14:02:33.4465084Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_normalize_cuda_float64 PASSED [0.0676s] [ 24%] 2025-12-04T14:02:33.4465202Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_bfloat16 PASSED [1.0437s] [ 24%] 2025-12-04T14:02:33.4465317Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_int32 PASSED [0.1398s] [ 24%] 2025-12-04T14:02:33.4465440Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_int64 PASSED [0.1333s] [ 24%] 2025-12-04T14:02:33.4465566Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_int8 PASSED [0.1347s] [ 24%] 2025-12-04T14:02:33.4465682Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_uint8 PASSED [0.1346s] [ 24%] 2025-12-04T14:02:33.4465807Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_bfloat16 PASSED [0.0056s] [ 24%] 2025-12-04T14:02:33.4465927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_float16 PASSED [0.0053s] [ 24%] 2025-12-04T14:02:33.4466056Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_int32 PASSED [0.0053s] [ 24%] 2025-12-04T14:02:33.4466174Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_int64 PASSED [0.0052s] [ 24%] 2025-12-04T14:02:33.4466303Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_complex128 PASSED [0.0069s] [ 24%] 2025-12-04T14:02:33.4466424Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_float16 PASSED [0.0066s] [ 24%] 2025-12-04T14:02:33.4466543Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_float64 PASSED [0.0067s] [ 24%] 2025-12-04T14:02:33.4466664Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_int16 PASSED [0.0066s] [ 24%] 2025-12-04T14:02:33.4466780Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_int8 PASSED [0.0066s] [ 24%] 2025-12-04T14:02:33.4466897Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_uint8 PASSED [0.0066s] [ 24%] 2025-12-04T14:02:33.4467019Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmax_cuda_bfloat16 PASSED [0.0343s] [ 24%] 
2025-12-04T14:02:33.4467143Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmin_cuda_float32 PASSED [0.0428s] [ 24%] 2025-12-04T14:02:33.4467256Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_int64 PASSED [0.3912s] [ 24%] 2025-12-04T14:02:33.4467377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_complex64 PASSED [0.1155s] [ 24%] 2025-12-04T14:02:33.4467490Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_int64 PASSED [0.1144s] [ 24%] 2025-12-04T14:02:33.4467621Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_complex128 PASSED [0.4152s] [ 24%] 2025-12-04T14:02:33.4467741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_complex64 PASSED [0.4099s] [ 24%] 2025-12-04T14:02:33.4467858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_float16 PASSED [0.3751s] [ 24%] 2025-12-04T14:02:33.4467970Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_int16 PASSED [0.3736s] [ 25%] 2025-12-04T14:02:33.4468083Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matmul_cuda_complex64 PASSED [0.0256s] [ 25%] 2025-12-04T14:02:33.4468203Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_complex128 PASSED [0.9572s] [ 25%] 2025-12-04T14:02:33.4468325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_complex64 PASSED [0.0090s] [ 25%] 2025-12-04T14:02:33.4468443Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_float16 PASSED [0.0065s] [ 25%] 2025-12-04T14:02:33.4468560Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_bfloat16 PASSED [0.0090s] [ 25%] 2025-12-04T14:02:33.4468677Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_float16 PASSED [0.0086s] [ 25%] 2025-12-04T14:02:33.4468793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_float64 PASSED [0.0085s] [ 
25%] 2025-12-04T14:02:33.4468907Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_int16 PASSED [0.0085s] [ 25%] 2025-12-04T14:02:33.4469062Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_pool2d_with_indices_backward_cuda_float16 PASSED [2.1707s] [ 25%] 2025-12-04T14:02:33.4469204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_bfloat16 PASSED [0.9307s] [ 25%] 2025-12-04T14:02:33.4469332Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_bool PASSED [0.0050s] [ 25%] 2025-12-04T14:02:33.4469462Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_int16 PASSED [0.0035s] [ 25%] 2025-12-04T14:02:33.4469587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_uint8 PASSED [0.9196s] [ 25%] 2025-12-04T14:02:33.4469731Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_bfloat16 PASSED [0.0050s] [ 25%] 2025-12-04T14:02:33.4469864Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_float16 PASSED [0.0036s] [ 25%] 2025-12-04T14:02:33.4469996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_float32 PASSED [0.9222s] [ 25%] 2025-12-04T14:02:33.4470172Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_int16 PASSED [0.0055s] [ 25%] 2025-12-04T14:02:33.4470301Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_uint8 PASSED [0.9185s] [ 25%] 2025-12-04T14:02:33.4470415Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mean_cuda_complex128 PASSED [0.0243s] [ 25%] 2025-12-04T14:02:33.4470525Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mean_cuda_float16 PASSED [0.0146s] [ 25%] 2025-12-04T14:02:33.4470633Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mean_cuda_float32 PASSED [0.0143s] [ 25%] 2025-12-04T14:02:33.4470741Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mean_cuda_float64 PASSED [0.0142s] [ 25%] 2025-12-04T14:02:33.4470850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_int32 PASSED [0.0101s] [ 25%] 2025-12-04T14:02:33.4470958Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_int8 PASSED [0.0096s] [ 25%] 2025-12-04T14:02:33.4471099Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_complex64 PASSED [0.0104s] [ 25%] 2025-12-04T14:02:33.4471234Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_float16 PASSED [0.0102s] [ 25%] 2025-12-04T14:02:33.4471385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_float64 PASSED [0.0101s] [ 25%] 2025-12-04T14:02:33.4471516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_int16 PASSED [0.0101s] [ 25%] 2025-12-04T14:02:33.4471649Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_bool PASSED [0.0101s] [ 25%] 2025-12-04T14:02:33.4471791Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_complex128 PASSED [0.0103s] [ 25%] 2025-12-04T14:02:33.4471929Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_float64 PASSED [0.0103s] [ 25%] 2025-12-04T14:02:33.4472064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_int8 PASSED [0.0101s] [ 25%] 2025-12-04T14:02:33.4472196Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_uint8 PASSED [0.0101s] [ 25%] 2025-12-04T14:02:33.4472316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_float64 PASSED [0.0085s] [ 25%] 2025-12-04T14:02:33.4472428Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_int16 PASSED [0.0085s] [ 25%] 2025-12-04T14:02:33.4472558Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_bfloat16 PASSED [0.9242s] [ 25%] 2025-12-04T14:02:33.4472685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_int16 PASSED [0.0050s] [ 25%] 2025-12-04T14:02:33.4472828Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_int64 PASSED [0.0034s] [ 25%] 2025-12-04T14:02:33.4472974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_bfloat16 PASSED [0.9233s] [ 25%] 2025-12-04T14:02:33.4473101Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_bool PASSED [0.0054s] [ 25%] 2025-12-04T14:02:33.4473232Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_float16 PASSED [0.9264s] [ 25%] 2025-12-04T14:02:33.4473361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_int64 PASSED [0.0054s] [ 25%] 2025-12-04T14:02:33.4473486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_int8 PASSED [0.9337s] [ 25%] 2025-12-04T14:02:33.4473610Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_bool PASSED [0.0104s] [ 25%] 2025-12-04T14:02:33.4473722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_int64 PASSED [0.0088s] [ 25%] 2025-12-04T14:02:33.4473831Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_int8 PASSED [0.0085s] [ 25%] 2025-12-04T14:02:33.4473943Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_uint8 PASSED [0.0084s] [ 25%] 2025-12-04T14:02:33.4474052Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mm_cuda_complex128 PASSED [0.0049s] [ 25%] 2025-12-04T14:02:33.4474163Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mm_cuda_complex64 PASSED [0.0047s] [ 25%] 2025-12-04T14:02:33.4474271Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mm_cuda_float16 PASSED [0.0041s] [ 
25%] 2025-12-04T14:02:33.4474382Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_int16 PASSED [1.0576s] [ 25%] 2025-12-04T14:02:33.4474489Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_int32 PASSED [0.0070s] [ 25%] 2025-12-04T14:02:33.4474599Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_uint8 PASSED [0.0051s] [ 25%] 2025-12-04T14:02:33.4474717Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_complex64 PASSED [0.9391s] [ 25%] 2025-12-04T14:02:33.4474835Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_float16 PASSED [0.0044s] [ 25%] 2025-12-04T14:02:33.4474946Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_int16 PASSED [0.9404s] [ 25%] 2025-12-04T14:02:33.4475075Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_int32 PASSED [0.0045s] [ 25%] 2025-12-04T14:02:33.4475188Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_bfloat16 PASSED [0.0047s] [ 25%] 2025-12-04T14:02:33.4475299Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_int32 PASSED [0.9232s] [ 25%] 2025-12-04T14:02:33.4475408Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_uint8 PASSED [0.0057s] [ 25%] 2025-12-04T14:02:33.4475524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_complex128 PASSED [0.0091s] [ 25%] 2025-12-04T14:02:33.4475638Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_complex64 PASSED [0.0087s] [ 25%] 2025-12-04T14:02:33.4475746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_int32 PASSED [0.0085s] [ 25%] 2025-12-04T14:02:33.4475856Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_uint8 PASSED [0.0084s] [ 25%] 2025-12-04T14:02:33.4475968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mv_cuda_complex128 PASSED [0.0031s] [ 25%] 2025-12-04T14:02:33.4476079Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mv_cuda_float16 PASSED [0.9222s] [ 25%] 2025-12-04T14:02:33.4476212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0168s] [ 25%] 2025-12-04T14:02:33.4476342Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int16 PASSED [0.0141s] [ 25%] 2025-12-04T14:02:33.4476470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int8 PASSED [0.0139s] [ 25%] 2025-12-04T14:02:33.4476619Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int16 PASSED [0.0139s] [ 25%] 2025-12-04T14:02:33.4476745Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_uint8 PASSED [0.0139s] [ 25%] 2025-12-04T14:02:33.4476882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_bfloat16 PASSED [0.0140s] [ 25%] 2025-12-04T14:02:33.4477012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float16 PASSED [0.9430s] [ 25%] 2025-12-04T14:02:33.4477144Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float32 PASSED [0.0165s] [ 25%] 2025-12-04T14:02:33.4477282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float64 PASSED [0.0151s] [ 25%] 2025-12-04T14:02:33.4477413Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int16 PASSED [0.0139s] [ 25%] 2025-12-04T14:02:33.4477545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int32 PASSED [0.0138s] [ 25%] 2025-12-04T14:02:33.4477665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_float32 PASSED [0.0038s] [ 25%] 2025-12-04T14:02:33.4477784Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_int32 PASSED [0.9298s] [ 25%] 2025-12-04T14:02:33.4477895Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_uint8 PASSED [0.0055s] [ 25%] 2025-12-04T14:02:33.4478014Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_complex128 PASSED [1.1407s] [ 25%] 2025-12-04T14:02:33.4478130Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_complex32 PASSED [0.0741s] [ 25%] 2025-12-04T14:02:33.4478243Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_float32 PASSED [0.0683s] [ 25%] 2025-12-04T14:02:33.4478357Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_int16 PASSED [0.9442s] [ 25%] 2025-12-04T14:02:33.4478473Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_int8 PASSED [0.0121s] [ 25%] 2025-12-04T14:02:33.4478586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_uint8 PASSED [0.0101s] [ 25%] 2025-12-04T14:02:33.4478708Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_bool PASSED [0.9288s] [ 25%] 2025-12-04T14:02:33.4478822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_complex64 PASSED [1.0328s] [ 25%] 2025-12-04T14:02:33.4478933Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_float16 PASSED [0.0187s] [ 25%] 2025-12-04T14:02:33.4479043Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_float32 PASSED [0.0183s] [ 25%] 2025-12-04T14:02:33.4479156Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_int64 PASSED [0.0134s] [ 25%] 2025-12-04T14:02:33.4479277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_bfloat16 XFAIL [0.0031s] [ 25%] 2025-12-04T14:02:33.4479399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_complex128 XFAIL [0.0028s] [ 25%] 2025-12-04T14:02:33.4479522Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_complex32 XFAIL [0.9380s] [ 25%] 2025-12-04T14:02:33.4479643Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_complex64 XFAIL [0.9229s] [ 25%] 2025-12-04T14:02:33.4479761Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_float16 XFAIL [0.9314s] [ 25%] 2025-12-04T14:02:33.4479876Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_float64 XFAIL [0.9324s] [ 25%] 2025-12-04T14:02:33.4479989Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_int16 XFAIL [0.9309s] [ 25%] 2025-12-04T14:02:33.4480133Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_int32 XFAIL [0.9379s] [ 25%] 2025-12-04T14:02:33.4480265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_int64 XFAIL [0.9313s] [ 25%] 2025-12-04T14:02:33.4480390Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_uint8 XFAIL [0.9236s] [ 25%] 2025-12-04T14:02:33.4480507Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_complex128 PASSED [0.9258s] [ 25%] 2025-12-04T14:02:33.4480622Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_complex64 PASSED [0.9249s] [ 25%] 2025-12-04T14:02:33.4480759Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_dropout_backward_cuda_float32 PASSED [0.0230s] [ 25%] 2025-12-04T14:02:33.4480906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_dropout_backward_cuda_float64 PASSED [0.0100s] [ 25%] 2025-12-04T14:02:33.4481036Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_layer_norm_cuda_bfloat16 PASSED [0.0390s] [ 25%] 2025-12-04T14:02:33.4481165Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_layer_norm_cuda_float16 PASSED [0.0326s] [ 25%] 2025-12-04T14:02:33.4481272Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_bool PASSED [0.0084s] [ 25%] 2025-12-04T14:02:33.4481384Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_complex128 PASSED [0.0085s] [ 25%] 
2025-12-04T14:02:33.4481493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_float16 PASSED [0.0086s] [ 25%] 2025-12-04T14:02:33.4481600Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_int16 PASSED [0.0084s] [ 25%] 2025-12-04T14:02:33.4481704Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_int32 PASSED [0.0084s] [ 25%] 2025-12-04T14:02:33.4481818Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_complex128 PASSED [0.9371s] [ 25%] 2025-12-04T14:02:33.4481928Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_complex32 PASSED [0.1189s] [ 25%] 2025-12-04T14:02:33.4482037Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_float32 PASSED [0.9286s] [ 25%] 2025-12-04T14:02:33.4482144Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_float64 PASSED [0.0045s] [ 25%] 2025-12-04T14:02:33.4482253Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_int32 PASSED [0.9259s] [ 25%] 2025-12-04T14:02:33.4482377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_bool PASSED [0.0080s] [ 25%] 2025-12-04T14:02:33.4482498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_complex128 PASSED [0.9353s] [ 25%] 2025-12-04T14:02:33.4482615Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_complex32 PASSED [0.0081s] [ 25%] 2025-12-04T14:02:33.4482727Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_int64 PASSED [0.9416s] [ 25%] 2025-12-04T14:02:33.4482856Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_bfloat16 PASSED [0.0082s] [ 25%] 2025-12-04T14:02:33.4482985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_complex128 PASSED [0.9347s] [ 25%] 2025-12-04T14:02:33.4483111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_float16 PASSED [0.0082s] [ 25%] 2025-12-04T14:02:33.4483235Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_float64 PASSED [0.9348s] [ 25%] 2025-12-04T14:02:33.4483358Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_int16 PASSED [0.0085s] [ 25%] 2025-12-04T14:02:33.4483481Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_int64 PASSED [0.9280s] [ 25%] 2025-12-04T14:02:33.4483601Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_complex128 PASSED [0.0085s] [ 25%] 2025-12-04T14:02:33.4483715Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_float16 PASSED [0.9225s] [ 25%] 2025-12-04T14:02:33.4483826Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_int16 PASSED [0.0086s] [ 25%] 2025-12-04T14:02:33.4483947Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_int64 PASSED [0.9307s] [ 25%] 2025-12-04T14:02:33.4484069Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_uint8 PASSED [0.0083s] [ 25%] 2025-12-04T14:02:33.4484185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_complex64 PASSED [0.9362s] [ 25%] 2025-12-04T14:02:33.4484297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_int16 PASSED [0.0083s] [ 26%] 2025-12-04T14:02:33.4484406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_int32 PASSED [0.9363s] [ 26%] 2025-12-04T14:02:33.4484519Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_float64 PASSED [0.0085s] [ 26%] 2025-12-04T14:02:33.4484652Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_int8 PASSED [0.9217s] [ 26%] 2025-12-04T14:02:33.4484762Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_uint8 PASSED [0.0080s] [ 26%] 2025-12-04T14:02:33.4484915Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16 PASSED [0.0191s] [ 26%] 
2025-12-04T14:02:33.4485065Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_bfloat16 PASSED [0.0171s] [ 26%] 2025-12-04T14:02:33.4485215Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float32 PASSED [0.0168s] [ 26%] 2025-12-04T14:02:33.4485361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float64 PASSED [0.0169s] [ 26%] 2025-12-04T14:02:33.4485510Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_bfloat16 PASSED [0.0140s] [ 26%] 2025-12-04T14:02:33.4485656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [0.9480s] [ 26%] 2025-12-04T14:02:33.4485805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float64 PASSED [0.0123s] [ 26%] 2025-12-04T14:02:33.4485951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float16 PASSED [0.9489s] [ 26%] 2025-12-04T14:02:33.4486097Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float64 PASSED [0.0182s] [ 26%] 2025-12-04T14:02:33.4486254Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_bfloat16 PASSED [0.0199s] [ 26%] 2025-12-04T14:02:33.4486402Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float32 PASSED [0.0140s] [ 26%] 2025-12-04T14:02:33.4486538Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool1d_cuda_float32 PASSED [0.0148s] [ 26%] 2025-12-04T14:02:33.4486675Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool3d_cuda_bfloat16 PASSED [0.0115s] [ 26%] 2025-12-04T14:02:33.4486813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool3d_cuda_float16 PASSED [0.0066s] [ 
26%] 2025-12-04T14:02:33.4486947Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool3d_cuda_float64 PASSED [0.0066s] [ 26%] 2025-12-04T14:02:33.4487082Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_batch_norm_cuda_float32 PASSED [0.0176s] [ 26%] 2025-12-04T14:02:33.4487237Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float16 PASSED [0.0243s] [ 26%] 2025-12-04T14:02:33.4487384Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_cuda_float32 XFAIL [0.0283s] [ 26%] 2025-12-04T14:02:33.4487529Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_cuda_float64 XFAIL [0.9529s] [ 26%] 2025-12-04T14:02:33.4487697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16 PASSED [0.9848s] [ 26%] 2025-12-04T14:02:33.4487850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_bfloat16 PASSED [0.0042s] [ 26%] 2025-12-04T14:02:33.4488000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_bool PASSED [0.0039s] [ 26%] 2025-12-04T14:02:33.4488149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_complex64 PASSED [0.9222s] [ 26%] 2025-12-04T14:02:33.4488289Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_float16 PASSED [0.0060s] [ 26%] 2025-12-04T14:02:33.4488448Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_float64 PASSED [0.0042s] [ 26%] 2025-12-04T14:02:33.4488577Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv1d_cuda_float16 PASSED [0.1303s] [ 26%] 2025-12-04T14:02:33.4488892Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv2d_cuda_bfloat16 MIOpen(HIP): Warning 
[IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 1200, provided ptr: 0x72d251200c00 size: 768 2025-12-04T14:02:33.4489076Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 1200, provided ptr: 0x72d251200c00 size: 768 2025-12-04T14:02:33.4489120Z PASSED [0.0911s] [ 26%] 2025-12-04T14:02:33.4489432Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv3d_cuda_float16 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 26400, provided ptr: 0x72d280203a00 size: 5888 2025-12-04T14:02:33.4489617Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 26400, provided ptr: 0x72d280203a00 size: 5888 2025-12-04T14:02:33.4489812Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 168960, provided ptr: 0x72d280203600 size: 6656 2025-12-04T14:02:33.4489995Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 168960, provided ptr: 0x72d280203600 size: 6656 2025-12-04T14:02:33.4490039Z PASSED [0.0513s] [ 26%] 2025-12-04T14:02:33.4490416Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv3d_cuda_float32 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x72d250206400 size: 11008 2025-12-04T14:02:33.4490597Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x72d250206400 size: 11008 2025-12-04T14:02:33.4490792Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x72d250206200 size: 12544 2025-12-04T14:02:33.4490976Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x72d250206200 size: 12544 2025-12-04T14:02:33.4491015Z PASSED [0.0610s] [ 26%] 
2025-12-04T14:02:33.4491149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv3d_cuda_float64 PASSED [0.9766s] [ 26%] 2025-12-04T14:02:33.4491297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_bfloat16 PASSED [0.6649s] [ 26%] 2025-12-04T14:02:33.4491442Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_float32 PASSED [0.9468s] [ 26%] 2025-12-04T14:02:33.4491589Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose2d_cuda_complex32 PASSED [0.4886s] [ 26%] 2025-12-04T14:02:33.4491733Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_bfloat16 PASSED [0.1261s] [ 26%] 2025-12-04T14:02:33.4491880Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_complex128 PASSED [0.0884s] [ 26%] 2025-12-04T14:02:33.4492038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_complex64 PASSED [0.1214s] [ 26%] 2025-12-04T14:02:33.4492192Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_float32 PASSED [0.0132s] [ 26%] 2025-12-04T14:02:33.4492334Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_float64 PASSED [0.0125s] [ 26%] 2025-12-04T14:02:33.4492488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float16 PASSED [0.0571s] [ 26%] 2025-12-04T14:02:33.4492648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int16 PASSED [0.0560s] [ 26%] 2025-12-04T14:02:33.4492795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int32 PASSED [0.0556s] [ 26%] 2025-12-04T14:02:33.4492941Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int64 PASSED [0.9154s] [ 
26%] 2025-12-04T14:02:33.4493088Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int8 PASSED [0.0583s] [ 26%] 2025-12-04T14:02:33.4493230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_similarity_cuda_float16 PASSED [0.0524s] [ 26%] 2025-12-04T14:02:33.4493375Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cross_entropy_cuda_float16 PASSED [0.0642s] [ 26%] 2025-12-04T14:02:33.4493514Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cross_entropy_cuda_float64 PASSED [0.0574s] [ 26%] 2025-12-04T14:02:33.4493648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_ctc_loss_cuda_float32 PASSED [0.0351s] [ 26%] 2025-12-04T14:02:33.4493782Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout3d_cuda_float16 PASSED [0.8903s] [ 26%] 2025-12-04T14:02:33.4493917Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout3d_cuda_float64 PASSED [0.0158s] [ 26%] 2025-12-04T14:02:33.4494051Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout_cuda_float32 PASSED [0.9042s] [ 26%] 2025-12-04T14:02:33.4494176Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_elu_cuda_float32 PASSED [0.0074s] [ 26%] 2025-12-04T14:02:33.4494326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_bag_cuda_float16 PASSED [0.0772s] [ 26%] 2025-12-04T14:02:33.4494460Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_cuda_bfloat16 PASSED [0.0097s] [ 26%] 2025-12-04T14:02:33.4494594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_cuda_float32 PASSED [0.0095s] [ 26%] 2025-12-04T14:02:33.4494758Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float16 PASSED [0.0102s] [ 26%] 2025-12-04T14:02:33.4494926Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int64 PASSED [0.0045s] [ 26%] 2025-12-04T14:02:33.4495091Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int8 PASSED [0.8649s] [ 26%] 2025-12-04T14:02:33.4495235Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float16 PASSED [1.7314s] [ 26%] 2025-12-04T14:02:33.4495364Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gelu_cuda_bfloat16 PASSED [0.0243s] [ 26%] 2025-12-04T14:02:33.4495491Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gelu_cuda_float32 PASSED [0.0124s] [ 26%] 2025-12-04T14:02:33.4495619Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_glu_cuda_bfloat16 PASSED [0.0428s] [ 26%] 2025-12-04T14:02:33.4495747Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_glu_cuda_float16 PASSED [0.0346s] [ 26%] 2025-12-04T14:02:33.4495894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_grid_sample_cuda_float16 PASSED [0.5937s] [ 26%] 2025-12-04T14:02:33.4496038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_grid_sample_cuda_float32 PASSED [0.5914s] [ 26%] 2025-12-04T14:02:33.4496175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_grid_sample_cuda_float64 PASSED [0.5936s] [ 26%] 2025-12-04T14:02:33.4496310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_group_norm_cuda_float32 PASSED [0.0664s] [ 26%] 2025-12-04T14:02:33.4496444Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardshrink_cuda_float16 PASSED [0.0136s] [ 26%] 2025-12-04T14:02:33.4497064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_bfloat16 PASSED [0.0090s] [ 26%] 2025-12-04T14:02:33.4497202Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_float64 PASSED [0.0087s] [ 26%] 2025-12-04T14:02:33.4497337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_int32 PASSED [0.0086s] [ 26%] 2025-12-04T14:02:33.4497490Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [0.0483s] [ 26%] 2025-12-04T14:02:33.4497643Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float64 PASSED [0.0462s] [ 26%] 2025-12-04T14:02:33.4497782Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_instance_norm_cuda_float32 PASSED [0.2525s] [ 26%] 2025-12-04T14:02:33.4497932Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bicubic_cuda_float64 PASSED [0.8302s] [ 26%] 2025-12-04T14:02:33.4498090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_uint8 PASSED [0.0346s] [ 26%] 2025-12-04T14:02:33.4498243Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest_cuda_bfloat16 PASSED [0.0249s] [ 26%] 2025-12-04T14:02:33.4498392Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest_cuda_float16 PASSED [0.0245s] [ 26%] 2025-12-04T14:02:33.4498542Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest_cuda_float32 PASSED [0.0246s] [ 26%] 2025-12-04T14:02:33.4498700Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest_cuda_float64 PASSED [0.0245s] [ 26%] 2025-12-04T14:02:33.4498855Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_trilinear_cuda_bfloat16 PASSED [0.2389s] [ 26%] 2025-12-04T14:02:33.4499010Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_trilinear_cuda_float16 PASSED [1.1044s] [ 26%] 
2025-12-04T14:02:33.4499143Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_l1_loss_cuda_float32 PASSED [0.0127s] [ 26%] 2025-12-04T14:02:33.4499281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_layer_norm_cuda_float64 PASSED [0.8733s] [ 26%] 2025-12-04T14:02:33.4499417Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_leaky_relu_cuda_float32 PASSED [0.0119s] [ 26%] 2025-12-04T14:02:33.4499555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_linear_cuda_complex128 PASSED [0.0703s] [ 26%] 2025-12-04T14:02:33.4499685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_linear_cuda_float32 PASSED [0.8849s] [ 26%] 2025-12-04T14:02:33.4499834Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_local_response_norm_cuda_float32 PASSED [0.0423s] [ 26%] 2025-12-04T14:02:33.4499969Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_logsigmoid_cuda_float16 PASSED [0.0068s] [ 26%] 2025-12-04T14:02:33.4500147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_logsigmoid_cuda_float32 PASSED [0.0060s] [ 26%] 2025-12-04T14:02:33.4500294Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_logsigmoid_cuda_float64 PASSED [0.0061s] [ 26%] 2025-12-04T14:02:33.4500456Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_float64 PASSED [0.0642s] [ 26%] 2025-12-04T14:02:33.4500604Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_int32 PASSED [0.0644s] [ 26%] 2025-12-04T14:02:33.4500746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_int64 PASSED [0.0646s] [ 26%] 2025-12-04T14:02:33.4500890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_uint8 PASSED [0.0644s] [ 26%] 2025-12-04T14:02:33.4501035Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool1d_cuda_float32 PASSED [0.8980s] [ 26%] 2025-12-04T14:02:33.4501172Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool1d_cuda_float64 PASSED [0.8992s] [ 26%] 2025-12-04T14:02:33.4501306Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool2d_cuda_float32 PASSED [0.8487s] [ 26%] 2025-12-04T14:02:33.4501441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool3d_cuda_float32 PASSED [0.3870s] [ 26%] 2025-12-04T14:02:33.4501575Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool3d_cuda_float64 PASSED [0.3824s] [ 26%] 2025-12-04T14:02:33.4501721Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float16 PASSED [0.0691s] [ 26%] 2025-12-04T14:02:33.4501864Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float32 PASSED [0.0638s] [ 26%] 2025-12-04T14:02:33.4502011Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool2d_grad_cuda_bfloat16 PASSED [0.0621s] [ 26%] 2025-12-04T14:02:33.4502149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_cuda_bfloat16 PASSED [0.1123s] [ 26%] 2025-12-04T14:02:33.4502292Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_cuda_float32 PASSED [0.1124s] [ 26%] 2025-12-04T14:02:33.4502438Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_grad_cuda_bfloat16 PASSED [0.0277s] [ 26%] 2025-12-04T14:02:33.4502594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float32 PASSED [0.0273s] [ 26%] 2025-12-04T14:02:33.4502723Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mish_cuda_bfloat16 PASSED [0.0140s] [ 26%] 2025-12-04T14:02:33.4502849Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mish_cuda_float16 PASSED [0.0052s] [ 26%] 2025-12-04T14:02:33.4502977Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mish_cuda_float32 PASSED [0.0051s] [ 26%] 2025-12-04T14:02:33.4503103Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mish_cuda_float64 PASSED [0.0051s] [ 26%] 2025-12-04T14:02:33.4503238Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mse_loss_cuda_float16 PASSED [0.0086s] [ 26%] 2025-12-04T14:02:33.4503371Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mse_loss_cuda_float64 PASSED [0.0083s] [ 26%] 2025-12-04T14:02:33.4503534Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_head_attention_forward_cuda_bfloat16 PASSED [2.8378s] [ 26%] 2025-12-04T14:02:33.4503692Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float64 PASSED [2.8174s] [ 26%] 2025-12-04T14:02:33.4503839Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_margin_loss_cuda_bfloat16 PASSED [0.9013s] [ 26%] 2025-12-04T14:02:33.4503992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float16 PASSED [0.0268s] [ 26%] 2025-12-04T14:02:33.4504155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float32 PASSED [0.0192s] [ 26%] 2025-12-04T14:02:33.4504323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float64 PASSED [0.0190s] [ 26%] 2025-12-04T14:02:33.4504456Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_nll_loss_cuda_float32 PASSED [0.0971s] [ 26%] 2025-12-04T14:02:33.4504591Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_nll_loss_cuda_float64 PASSED [0.0965s] [ 26%] 2025-12-04T14:02:33.4504727Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_normalize_cuda_complex64 PASSED [0.0236s] [ 26%] 2025-12-04T14:02:33.4504877Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_bfloat16 PASSED [0.0200s] [ 26%] 2025-12-04T14:02:33.4505018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_complex128 PASSED [0.0199s] [ 26%] 2025-12-04T14:02:33.4505156Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_float16 PASSED [0.0198s] [ 26%] 2025-12-04T14:02:33.4505298Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_complex128 PASSED [0.0280s] [ 26%] 2025-12-04T14:02:33.4505441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_complex64 PASSED [0.0277s] [ 26%] 2025-12-04T14:02:33.4505578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_float64 PASSED [0.0276s] [ 26%] 2025-12-04T14:02:33.4505711Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_int8 PASSED [0.0273s] [ 26%] 2025-12-04T14:02:33.4505849Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_bfloat16 PASSED [0.0154s] [ 26%] 2025-12-04T14:02:33.4505984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_float16 PASSED [0.8903s] [ 27%] 2025-12-04T14:02:33.4506120Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_float32 PASSED [0.0100s] [ 27%] 2025-12-04T14:02:33.4506254Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_float64 PASSED [0.0083s] [ 27%] 2025-12-04T14:02:33.4506398Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_int16 PASSED [0.0082s] [ 27%] 2025-12-04T14:02:33.4506530Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_int8 PASSED [0.0080s] [ 27%] 2025-12-04T14:02:33.4506664Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_uint8 PASSED [0.8707s] [ 27%] 2025-12-04T14:02:33.4506809Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_complex128 PASSED [0.0171s] [ 27%] 2025-12-04T14:02:33.4506951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_complex64 PASSED [0.0083s] [ 27%] 2025-12-04T14:02:33.4507088Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_float16 PASSED [0.0082s] [ 27%] 2025-12-04T14:02:33.4507224Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_int16 PASSED [0.0081s] [ 27%] 2025-12-04T14:02:33.4507360Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_int32 PASSED [0.8841s] [ 27%] 2025-12-04T14:02:33.4507496Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_int64 PASSED [0.0098s] [ 27%] 2025-12-04T14:02:33.4507631Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_uint8 PASSED [0.0082s] [ 27%] 2025-12-04T14:02:33.4507788Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_complex128 PASSED [0.0053s] [ 27%] 2025-12-04T14:02:33.4507939Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_int16 PASSED [0.0052s] [ 27%] 2025-12-04T14:02:33.4508106Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_complex64 PASSED [0.0158s] [ 27%] 2025-12-04T14:02:33.4508249Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_float16 PASSED [0.8875s] [ 27%] 2025-12-04T14:02:33.4508393Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_float32 PASSED [0.8774s] [ 27%] 2025-12-04T14:02:33.4508534Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_int32 PASSED [0.8882s] [ 27%] 2025-12-04T14:02:33.4508683Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_int64 PASSED [0.8946s] [ 27%] 2025-12-04T14:02:33.4508823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_int8 PASSED [0.8828s] [ 27%] 2025-12-04T14:02:33.4508963Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_uint8 PASSED [0.8807s] [ 27%] 2025-12-04T14:02:33.4509093Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pdist_cuda_float32 PASSED [0.0154s] [ 27%] 2025-12-04T14:02:33.4509221Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pdist_cuda_float64 PASSED [0.8712s] [ 27%] 2025-12-04T14:02:33.4509361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_bfloat16 PASSED [0.0063s] [ 27%] 2025-12-04T14:02:33.4509500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_float16 PASSED [0.0049s] [ 27%] 2025-12-04T14:02:33.4509637Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_int16 PASSED [0.0048s] [ 27%] 2025-12-04T14:02:33.4509774Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_int64 PASSED [0.0047s] [ 27%] 2025-12-04T14:02:33.4509910Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_int8 PASSED [0.0047s] [ 27%] 2025-12-04T14:02:33.4510049Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_int8 PASSED [0.0048s] [ 27%] 2025-12-04T14:02:33.4510245Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_bfloat16 PASSED [0.3086s] [ 27%] 2025-12-04T14:02:33.4510390Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_float16 PASSED [0.3082s] [ 27%] 2025-12-04T14:02:33.4510532Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_float32 PASSED [0.3033s] [ 27%] 2025-12-04T14:02:33.4510663Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_prelu_cuda_bfloat16 PASSED [0.0286s] [ 27%] 2025-12-04T14:02:33.4510791Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_bfloat16 PASSED [0.8914s] [ 27%] 2025-12-04T14:02:33.4510918Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_int16 PASSED [0.0080s] [ 27%] 2025-12-04T14:02:33.4511045Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_int8 PASSED [0.8879s] [ 27%] 2025-12-04T14:02:33.4511170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_uint8 PASSED [0.0083s] [ 27%] 2025-12-04T14:02:33.4511296Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_float32 PASSED [0.8739s] [ 27%] 2025-12-04T14:02:33.4511420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_int16 PASSED [0.0075s] [ 27%] 2025-12-04T14:02:33.4511543Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_int32 PASSED [0.0055s] [ 27%] 2025-12-04T14:02:33.4511676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rms_norm_cuda_bfloat16 PASSED [0.0068s] [ 27%] 2025-12-04T14:02:33.4511827Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rms_norm_cuda_complex64 PASSED [0.2479s] [ 27%] 2025-12-04T14:02:33.4511972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rms_norm_cuda_float16 PASSED [0.8848s] [ 27%] 
2025-12-04T14:02:33.4512099Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rrelu_cuda_float64 PASSED [0.0151s] [ 27%] 2025-12-04T14:02:33.4512262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float16 PASSED [0.1442s] [ 27%] 2025-12-04T14:02:33.4512421Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float64 PASSED [0.1786s] [ 27%] 2025-12-04T14:02:33.4512559Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_silu_cuda_bfloat16 PASSED [0.0040s] [ 27%] 2025-12-04T14:02:33.4512699Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_smooth_l1_loss_cuda_float16 PASSED [0.0151s] [ 27%] 2025-12-04T14:02:33.4512841Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_soft_margin_loss_cuda_float32 PASSED [0.0088s] [ 27%] 2025-12-04T14:02:33.4512973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_cuda_bfloat16 PASSED [0.9043s] [ 27%] 2025-12-04T14:02:33.4513105Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_cuda_float16 PASSED [0.9073s] [ 27%] 2025-12-04T14:02:33.4513251Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_bfloat16 PASSED [0.8839s] [ 27%] 2025-12-04T14:02:33.4513400Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex128 PASSED [0.8829s] [ 27%] 2025-12-04T14:02:33.4513545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_float16 PASSED [0.8744s] [ 27%] 2025-12-04T14:02:33.4513687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_int32 PASSED [0.8831s] [ 27%] 2025-12-04T14:02:33.4513832Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_uint8 PASSED [0.8907s] [ 27%] 
2025-12-04T14:02:33.4513967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softplus_cuda_bfloat16 PASSED [0.0178s] [ 27%] 2025-12-04T14:02:33.4514110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softplus_cuda_float16 PASSED [0.9051s] [ 27%] 2025-12-04T14:02:33.4514246Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softshrink_cuda_float32 PASSED [0.0175s] [ 27%] 2025-12-04T14:02:33.4514382Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_complex128 PASSED [0.0070s] [ 27%] 2025-12-04T14:02:33.4514514Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_float16 PASSED [0.0067s] [ 27%] 2025-12-04T14:02:33.4514645Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_float64 PASSED [0.8893s] [ 27%] 2025-12-04T14:02:33.4514773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_int64 PASSED [0.0085s] [ 27%] 2025-12-04T14:02:33.4514913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_complex128 PASSED [0.0058s] [ 27%] 2025-12-04T14:02:33.4515044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_int16 PASSED [0.0054s] [ 27%] 2025-12-04T14:02:33.4515177Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_uint8 PASSED [0.0053s] [ 27%] 2025-12-04T14:02:33.4515310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_float16 PASSED [0.0058s] [ 27%] 2025-12-04T14:02:33.4515441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_int32 PASSED [0.0055s] [ 27%] 2025-12-04T14:02:33.4515570Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_uint8 PASSED [0.0055s] [ 27%] 2025-12-04T14:02:33.4515731Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_complex64 PASSED [0.0445s] [ 27%] 2025-12-04T14:02:33.4515888Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_float64 PASSED [0.0426s] [ 27%] 2025-12-04T14:02:33.4516035Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_int16 PASSED [0.0428s] [ 27%] 2025-12-04T14:02:33.4516179Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_int64 PASSED [0.0428s] [ 27%] 2025-12-04T14:02:33.4516330Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_uint8 PASSED [0.9360s] [ 27%] 2025-12-04T14:02:33.4516504Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16 PASSED [0.0459s] [ 27%] 2025-12-04T14:02:33.4516671Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64 PASSED [0.0441s] [ 27%] 2025-12-04T14:02:33.4516835Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64 PASSED [0.0430s] [ 27%] 2025-12-04T14:02:33.4516996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int32 PASSED [0.0430s] [ 27%] 2025-12-04T14:02:33.4517157Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_uint8 PASSED [0.0431s] [ 27%] 2025-12-04T14:02:33.4517288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_unfold_cuda_bfloat16 PASSED [0.1752s] [ 27%] 2025-12-04T14:02:33.4517418Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_unfold_cuda_float32 PASSED [0.1744s] [ 27%] 2025-12-04T14:02:33.4517563Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_bilinear_cuda_bfloat16 PASSED [0.9202s] [ 27%] 2025-12-04T14:02:33.4517707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_bilinear_cuda_float16 PASSED [0.0265s] [ 27%] 2025-12-04T14:02:33.4517851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_nearest_cuda_bfloat16 PASSED [0.0147s] [ 27%] 2025-12-04T14:02:33.4518002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_nearest_cuda_float16 PASSED [0.0144s] [ 27%] 2025-12-04T14:02:33.4518145Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_nearest_cuda_float64 PASSED [0.0143s] [ 27%] 2025-12-04T14:02:33.4518260Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_bfloat16 PASSED [0.0139s] [ 27%] 2025-12-04T14:02:33.4518374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_bool PASSED [0.0134s] [ 27%] 2025-12-04T14:02:33.4518490Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_complex64 PASSED [0.0135s] [ 27%] 2025-12-04T14:02:33.4518603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_float16 PASSED [0.0134s] [ 27%] 2025-12-04T14:02:33.4518715Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_float64 PASSED [0.0134s] [ 27%] 2025-12-04T14:02:33.4518827Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_int64 PASSED [0.0132s] [ 27%] 2025-12-04T14:02:33.4518938Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_uint8 PASSED [0.0134s] [ 27%] 2025-12-04T14:02:33.4519087Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_float32 SKIPPED [0.0005s] (Only runs on cpu) [ 27%] 2025-12-04T14:02:33.4519235Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_int16 SKIPPED [0.0005s] (Only runs on cpu) [ 27%] 2025-12-04T14:02:33.4519345Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_bfloat16 PASSED [0.0371s] [ 27%] 2025-12-04T14:02:33.4519475Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_complex64 PASSED [0.0375s] [ 27%] 2025-12-04T14:02:33.4519596Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_float16 PASSED [0.0369s] [ 27%] 2025-12-04T14:02:33.4519714Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_complex128 PASSED [0.0050s] [ 27%] 2025-12-04T14:02:33.4519827Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_float16 PASSED [0.0048s] [ 27%] 2025-12-04T14:02:33.4519941Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_inf_cuda_bfloat16 PASSED [0.0048s] [ 27%] 2025-12-04T14:02:33.4520053Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_inf_cuda_float16 PASSED [0.0048s] [ 27%] 2025-12-04T14:02:33.4520222Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_inf_cuda_float32 PASSED [0.0047s] [ 27%] 2025-12-04T14:02:33.4520335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_cuda_bfloat16 PASSED [0.0089s] [ 27%] 2025-12-04T14:02:33.4520451Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_cuda_float32 PASSED [0.0086s] [ 27%] 2025-12-04T14:02:33.4520562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_cuda_float64 PASSED [0.0086s] [ 27%] 2025-12-04T14:02:33.4520688Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_in_place_cuda_bfloat16 PASSED [0.8820s] [ 27%] 2025-12-04T14:02:33.4520812Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_in_place_cuda_float16 PASSED [0.0055s] [ 27%] 2025-12-04T14:02:33.4520939Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_in_place_cuda_float64 PASSED [0.8831s] [ 27%] 2025-12-04T14:02:33.4521065Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_number_mean_cuda_float16 PASSED [0.0070s] [ 27%] 2025-12-04T14:02:33.4521197Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_number_mean_cuda_float32 PASSED [0.0051s] [ 27%] 2025-12-04T14:02:33.4521308Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_complex128 PASSED [0.8789s] [ 27%] 2025-12-04T14:02:33.4521422Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_float32 PASSED [0.0044s] [ 27%] 2025-12-04T14:02:33.4521533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_float64 PASSED [0.8930s] [ 27%] 2025-12-04T14:02:33.4521641Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_int64 PASSED [0.0042s] [ 27%] 2025-12-04T14:02:33.4521764Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_int8 PASSED [0.8783s] [ 27%] 2025-12-04T14:02:33.4521882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_complex32 PASSED [0.0084s] [ 27%] 2025-12-04T14:02:33.4522000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_float64 PASSED [0.9001s] [ 27%] 2025-12-04T14:02:33.4522111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_int64 PASSED [0.0082s] [ 27%] 2025-12-04T14:02:33.4522225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ormqr_cuda_complex128 PASSED [0.1119s] [ 27%] 2025-12-04T14:02:33.4522333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_bool PASSED [0.0036s] [ 27%] 2025-12-04T14:02:33.4522444Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_int64 PASSED [0.8908s] [ 27%] 2025-12-04T14:02:33.4522551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_int8 PASSED [0.0052s] [ 27%] 2025-12-04T14:02:33.4522676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pca_lowrank_cuda_complex64 PASSED [0.2354s] [ 27%] 2025-12-04T14:02:33.4522792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pca_lowrank_cuda_float32 PASSED [0.2303s] [ 27%] 2025-12-04T14:02:33.4522914Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_float32 PASSED [0.0045s] [ 27%] 2025-12-04T14:02:33.4523034Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_float64 PASSED [0.0042s] [ 27%] 2025-12-04T14:02:33.4523161Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_complex32 PASSED [0.8852s] [ 27%] 2025-12-04T14:02:33.4523288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_float16 PASSED [0.0051s] [ 27%] 2025-12-04T14:02:33.4523400Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_float32 PASSED [0.8749s] [ 27%] 2025-12-04T14:02:33.4523511Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_int16 PASSED [0.0052s] [ 27%] 2025-12-04T14:02:33.4523621Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_int64 PASSED [0.8681s] [ 27%] 2025-12-04T14:02:33.4523739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pinverse_cuda_complex64 PASSED [0.0228s] [ 27%] 2025-12-04T14:02:33.4523886Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_bfloat16 PASSED [0.0075s] [ 28%] 2025-12-04T14:02:33.4524019Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_bool PASSED [0.0095s] [ 28%] 2025-12-04T14:02:33.4524153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_int32 PASSED [0.0071s] [ 28%] 2025-12-04T14:02:33.4524289Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_uint8 PASSED [0.8827s] [ 28%] 2025-12-04T14:02:33.4524424Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_float16 PASSED [0.0117s] [ 28%] 2025-12-04T14:02:33.4524559Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_int64 PASSED [0.0072s] [ 28%] 2025-12-04T14:02:33.4524689Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_int8 PASSED [0.0071s] [ 28%] 2025-12-04T14:02:33.4524824Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_float16 PASSED [0.0071s] [ 28%] 2025-12-04T14:02:33.4524956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_int32 PASSED [0.0070s] [ 28%] 2025-12-04T14:02:33.4525086Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_uint8 PASSED [0.8785s] [ 28%] 2025-12-04T14:02:33.4525221Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0090s] [ 28%] 2025-12-04T14:02:33.4525353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_float64 PASSED [0.0073s] [ 28%] 2025-12-04T14:02:33.4525496Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_int16 PASSED [0.0071s] [ 28%] 2025-12-04T14:02:33.4525626Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_int32 PASSED [0.0070s] [ 28%] 2025-12-04T14:02:33.4525757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_int64 PASSED [0.0070s] [ 28%] 2025-12-04T14:02:33.4525887Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_bool PASSED [0.8863s] [ 28%] 2025-12-04T14:02:33.4526018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_int16 PASSED [0.0090s] [ 28%] 2025-12-04T14:02:33.4526148Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_int32 PASSED [0.0074s] [ 28%] 2025-12-04T14:02:33.4526278Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_int64 PASSED [0.0071s] [ 28%] 2025-12-04T14:02:33.4526407Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_int8 PASSED [0.0070s] [ 28%] 
2025-12-04T14:02:33.4526520Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_bfloat16 PASSED [0.0088s] [ 28%] 2025-12-04T14:02:33.4526626Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_int16 PASSED [0.0085s] [ 28%] 2025-12-04T14:02:33.4526736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_uint8 PASSED [0.0084s] [ 28%] 2025-12-04T14:02:33.4526847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_bfloat16 PASSED [0.0261s] [ 28%] 2025-12-04T14:02:33.4526962Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_bool PASSED [0.8939s] [ 28%] 2025-12-04T14:02:33.4527086Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_complex128 PASSED [0.0252s] [ 28%] 2025-12-04T14:02:33.4527195Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_float16 PASSED [0.9065s] [ 28%] 2025-12-04T14:02:33.4527303Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_int8 PASSED [0.2714s] [ 28%] 2025-12-04T14:02:33.4527410Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_bfloat16 PASSED [0.9010s] [ 28%] 2025-12-04T14:02:33.4527515Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_bool PASSED [0.0212s] [ 28%] 2025-12-04T14:02:33.4527639Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_complex64 PASSED [0.0202s] [ 28%] 2025-12-04T14:02:33.4527747Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_float16 PASSED [0.0198s] [ 28%] 2025-12-04T14:02:33.4527855Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_float32 PASSED [0.0194s] [ 28%] 2025-12-04T14:02:33.4527964Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_float64 PASSED [0.0192s] [ 28%] 2025-12-04T14:02:33.4528069Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_uint8 PASSED [0.0191s] [ 28%] 2025-12-04T14:02:33.4528179Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_qr_cuda_complex128 PASSED 
[0.0236s] [ 28%] 2025-12-04T14:02:33.4528287Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_qr_cuda_complex64 PASSED [0.0224s] [ 28%] 2025-12-04T14:02:33.4528394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_qr_cuda_float64 PASSED [0.0215s] [ 28%] 2025-12-04T14:02:33.4528508Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_bfloat16 PASSED [0.0030s] [ 28%] 2025-12-04T14:02:33.4528619Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_int32 PASSED [0.8973s] [ 28%] 2025-12-04T14:02:33.4528738Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rand_like_cuda_bfloat16 PASSED [0.0124s] [ 28%] 2025-12-04T14:02:33.4528858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rand_like_cuda_complex128 PASSED [0.0100s] [ 28%] 2025-12-04T14:02:33.4528968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_cuda_int32 PASSED [0.0103s] [ 28%] 2025-12-04T14:02:33.4529097Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_float16 PASSED [0.0129s] [ 28%] 2025-12-04T14:02:33.4529217Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_float64 PASSED [0.0125s] [ 28%] 2025-12-04T14:02:33.4529334Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_int16 PASSED [0.8934s] [ 28%] 2025-12-04T14:02:33.4529447Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_bfloat16 PASSED [0.0062s] [ 28%] 2025-12-04T14:02:33.4529558Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_complex32 PASSED [0.0043s] [ 28%] 2025-12-04T14:02:33.4529670Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_float16 PASSED [0.8908s] [ 28%] 2025-12-04T14:02:33.4529788Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_like_cuda_bfloat16 PASSED [0.0124s] [ 28%] 2025-12-04T14:02:33.4529905Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_like_cuda_float32 PASSED [0.0100s] [ 28%] 
2025-12-04T14:02:33.4530014Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_bool PASSED [0.0039s] [ 28%] 2025-12-04T14:02:33.4530167Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_complex128 PASSED [0.8943s] [ 28%] 2025-12-04T14:02:33.4530278Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_complex32 PASSED [0.0055s] [ 28%] 2025-12-04T14:02:33.4530388Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_float32 PASSED [0.8872s] [ 28%] 2025-12-04T14:02:33.4530496Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_int32 PASSED [0.0036s] [ 28%] 2025-12-04T14:02:33.4530616Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_int64 PASSED [0.8942s] [ 28%] 2025-12-04T14:02:33.4530756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_complex128 PASSED [0.0059s] [ 28%] 2025-12-04T14:02:33.4530878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_complex64 PASSED [0.0041s] [ 28%] 2025-12-04T14:02:33.4530996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_float64 PASSED [0.8859s] [ 28%] 2025-12-04T14:02:33.4531109Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_int32 PASSED [0.0059s] [ 28%] 2025-12-04T14:02:33.4531236Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_int64 PASSED [0.0041s] [ 28%] 2025-12-04T14:02:33.4531348Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_int8 PASSED [0.8891s] [ 28%] 2025-12-04T14:02:33.4531463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_uint8 PASSED [0.0057s] [ 28%] 2025-12-04T14:02:33.4531581Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_bfloat16 PASSED [0.0099s] [ 28%] 2025-12-04T14:02:33.4531696Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_float32 PASSED [0.0092s] [ 28%] 2025-12-04T14:02:33.4531811Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_float64 PASSED [0.0091s] [ 28%] 2025-12-04T14:02:33.4531923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_int32 PASSED [0.0090s] [ 28%] 2025-12-04T14:02:33.4532034Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_int8 PASSED [0.0090s] [ 28%] 2025-12-04T14:02:33.4532147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_renorm_cuda_bfloat16 PASSED [0.0122s] [ 28%] 2025-12-04T14:02:33.4532259Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_renorm_cuda_complex64 PASSED [0.0090s] [ 28%] 2025-12-04T14:02:33.4532370Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_renorm_cuda_float16 PASSED [0.0096s] [ 28%] 2025-12-04T14:02:33.4532482Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_renorm_cuda_float32 PASSED [0.0084s] [ 28%] 2025-12-04T14:02:33.4532598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_complex128 PASSED [0.0236s] [ 28%] 2025-12-04T14:02:33.4532732Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_float16 PASSED [0.0234s] [ 28%] 2025-12-04T14:02:33.4532841Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_float64 PASSED [0.0233s] [ 28%] 2025-12-04T14:02:33.4532951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_int16 PASSED [0.0232s] [ 28%] 2025-12-04T14:02:33.4533059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_uint8 PASSED [0.0233s] [ 28%] 2025-12-04T14:02:33.4533190Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_complex128 PASSED [0.0131s] [ 28%] 2025-12-04T14:02:33.4533314Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_int32 PASSED [0.8872s] [ 28%] 2025-12-04T14:02:33.4533438Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_int64 PASSED [0.0101s] [ 28%] 2025-12-04T14:02:33.4533560Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_uint8 PASSED [0.0082s] [ 28%] 2025-12-04T14:02:33.4533680Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_bfloat16 PASSED [0.0035s] [ 28%] 2025-12-04T14:02:33.4533800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_complex128 PASSED [0.8942s] [ 28%] 2025-12-04T14:02:33.4533913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_int32 PASSED [0.0053s] [ 28%] 2025-12-04T14:02:33.4534026Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_int8 PASSED [0.0038s] [ 28%] 2025-12-04T14:02:33.4534138Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_bfloat16 PASSED [0.8993s] [ 28%] 2025-12-04T14:02:33.4534271Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_complex128 PASSED [0.0055s] [ 28%] 2025-12-04T14:02:33.4534391Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_int32 PASSED [0.0042s] [ 28%] 2025-12-04T14:02:33.4534502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_uint8 PASSED [0.8786s] [ 28%] 2025-12-04T14:02:33.4534610Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_bool PASSED [0.0062s] [ 28%] 2025-12-04T14:02:33.4534722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_float64 PASSED [0.0045s] [ 28%] 2025-12-04T14:02:33.4534832Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_int64 PASSED [0.0042s] [ 28%] 2025-12-04T14:02:33.4534967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_complex64 PASSED [0.0043s] [ 28%] 2025-12-04T14:02:33.4535079Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_int16 PASSED [0.0042s] [ 28%] 2025-12-04T14:02:33.4535191Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_int8 PASSED [0.0042s] [ 28%] 2025-12-04T14:02:33.4535304Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_uint8 PASSED [0.0043s] [ 28%] 2025-12-04T14:02:33.4535421Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_int64 PASSED [0.8846s] [ 28%] 2025-12-04T14:02:33.4535542Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_complex128 PASSED [0.0031s] [ 28%] 2025-12-04T14:02:33.4535663Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_complex32 PASSED [0.8770s] [ 28%] 2025-12-04T14:02:33.4535781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_float64 PASSED [0.0034s] [ 28%] 2025-12-04T14:02:33.4535896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_int32 PASSED [0.8751s] [ 28%] 2025-12-04T14:02:33.4536011Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_uint8 PASSED [0.0034s] [ 28%] 2025-12-04T14:02:33.4536117Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_bool PASSED [0.0149s] [ 28%] 2025-12-04T14:02:33.4536225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_int32 PASSED [0.0138s] [ 28%] 2025-12-04T14:02:33.4536341Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_uint8 PASSED [0.0137s] [ 28%] 2025-12-04T14:02:33.4536453Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_bfloat16 PASSED [0.0208s] [ 28%] 2025-12-04T14:02:33.4536565Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_complex128 PASSED [0.0208s] [ 28%] 2025-12-04T14:02:33.4536677Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_complex64 PASSED [0.0208s] [ 28%] 2025-12-04T14:02:33.4536786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_float64 PASSED [0.0207s] [ 28%] 2025-12-04T14:02:33.4536895Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_int8 PASSED [0.0205s] [ 28%] 2025-12-04T14:02:33.4537005Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_float32 PASSED [0.8822s] [ 28%] 2025-12-04T14:02:33.4537115Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_float64 PASSED [0.0045s] [ 28%] 2025-12-04T14:02:33.4537222Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_int16 PASSED [0.8814s] [ 28%] 2025-12-04T14:02:33.4537330Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_uint8 PASSED [0.0045s] [ 28%] 2025-12-04T14:02:33.4537455Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_3_cuda_float16 PASSED [0.0049s] [ 28%] 2025-12-04T14:02:33.4537580Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_3_cuda_float64 PASSED [0.8787s] [ 28%] 2025-12-04T14:02:33.4537712Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_neg_3_cuda_bfloat16 PASSED [0.0061s] [ 28%] 2025-12-04T14:02:33.4537852Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_neg_3_cuda_float32 PASSED [0.0042s] [ 28%] 2025-12-04T14:02:33.4537974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_bfloat16 PASSED [0.0040s] [ 28%] 2025-12-04T14:02:33.4538081Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_bool PASSED [0.8782s] [ 28%] 2025-12-04T14:02:33.4538197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_complex128 PASSED [0.0080s] [ 28%] 2025-12-04T14:02:33.4538308Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_complex32 PASSED [0.2315s] [ 28%] 2025-12-04T14:02:33.4538415Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_int8 PASSED [0.8848s] [ 28%] 2025-12-04T14:02:33.4538536Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_complex128 PASSED [0.0133s] [ 28%] 2025-12-04T14:02:33.4538648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_complex64 PASSED [0.0108s] [ 28%] 2025-12-04T14:02:33.4538754Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_int64 PASSED [0.0106s] [ 28%] 2025-12-04T14:02:33.4538880Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_complex64 PASSED [0.8978s] [ 28%] 2025-12-04T14:02:33.4539004Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_float64 PASSED [0.0048s] [ 28%] 2025-12-04T14:02:33.4539125Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_float16 PASSED [0.0093s] [ 28%] 2025-12-04T14:02:33.4539239Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_int32 PASSED [0.8851s] [ 28%] 2025-12-04T14:02:33.4539352Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_int64 PASSED [0.0103s] [ 28%] 2025-12-04T14:02:33.4539464Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_bool PASSED [0.0155s] [ 28%] 2025-12-04T14:02:33.4539574Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_int16 PASSED [0.8886s] [ 28%] 2025-12-04T14:02:33.4539703Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_float64 PASSED [0.0203s] [ 29%] 2025-12-04T14:02:33.4539829Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_int16 PASSED [0.9142s] [ 29%] 2025-12-04T14:02:33.4539976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_bfloat16 PASSED [0.0201s] [ 29%] 2025-12-04T14:02:33.4540138Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_float16 PASSED [0.9020s] [ 29%] 2025-12-04T14:02:33.4540274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_mean_cuda_float64 PASSED [0.0218s] [ 29%] 2025-12-04T14:02:33.4540406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_float32 PASSED [0.9016s] [ 29%] 2025-12-04T14:02:33.4540538Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_int32 PASSED 
[0.0194s] [ 29%] 2025-12-04T14:02:33.4540668Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_int64 PASSED [0.9127s] [ 29%] 2025-12-04T14:02:33.4540797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_bool PASSED [0.0200s] [ 29%] 2025-12-04T14:02:33.4540925Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_float32 PASSED [0.8921s] [ 29%] 2025-12-04T14:02:33.4541055Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_int16 PASSED [0.0203s] [ 29%] 2025-12-04T14:02:33.4541181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_int64 PASSED [0.9054s] [ 29%] 2025-12-04T14:02:33.4541309Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_int8 PASSED [0.0200s] [ 29%] 2025-12-04T14:02:33.4541438Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_float16 PASSED [0.1941s] [ 29%] 2025-12-04T14:02:33.4541574Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_uint8 PASSED [0.1945s] [ 29%] 2025-12-04T14:02:33.4541709Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_complex128 PASSED [0.0043s] [ 29%] 2025-12-04T14:02:33.4541820Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_int16 PASSED [0.8979s] [ 29%] 2025-12-04T14:02:33.4541936Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_int8 PASSED [0.0057s] [ 29%] 2025-12-04T14:02:33.4542061Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_bfloat16 PASSED [0.0070s] [ 29%] 2025-12-04T14:02:33.4542188Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_float32 PASSED [0.0063s] [ 29%] 2025-12-04T14:02:33.4542322Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_int16 PASSED [0.0069s] [ 29%] 2025-12-04T14:02:33.4542448Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_uint8 PASSED [0.0081s] [ 29%] 2025-12-04T14:02:33.4542562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_complex64 PASSED [0.0035s] [ 29%] 2025-12-04T14:02:33.4542676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_float64 PASSED [0.8739s] [ 29%] 2025-12-04T14:02:33.4542784Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_int16 PASSED [0.0045s] [ 29%] 2025-12-04T14:02:33.4542906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_complex128 PASSED [0.9046s] [ 29%] 2025-12-04T14:02:33.4543017Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_float64 PASSED [0.0041s] [ 29%] 2025-12-04T14:02:33.4543130Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_int64 PASSED [0.9046s] [ 29%] 2025-12-04T14:02:33.4543245Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_int8 PASSED [0.0042s] [ 29%] 2025-12-04T14:02:33.4543358Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_bool PASSED [0.0045s] [ 29%] 2025-12-04T14:02:33.4543481Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_complex32 PASSED [1.1077s] [ 29%] 2025-12-04T14:02:33.4543597Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_float32 PASSED [0.0052s] [ 29%] 2025-12-04T14:02:33.4543716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_int64 PASSED [0.0039s] [ 29%] 2025-12-04T14:02:33.4543839Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_int8 PASSED [0.8998s] [ 29%] 2025-12-04T14:02:33.4543954Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_bfloat16 PASSED [0.0043s] [ 29%] 2025-12-04T14:02:33.4544062Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_bool PASSED [0.8953s] [ 29%] 2025-12-04T14:02:33.4544175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_int64 PASSED 
[0.0042s] [ 29%] 2025-12-04T14:02:33.4544283Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_int8 PASSED [0.8983s] [ 29%] 2025-12-04T14:02:33.4544425Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_bartlett_cuda_float64 PASSED [0.0147s] [ 29%] 2025-12-04T14:02:33.4544566Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_blackman_cuda_float32 PASSED [0.0284s] [ 29%] 2025-12-04T14:02:33.4544705Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_blackman_cuda_float64 PASSED [0.0277s] [ 29%] 2025-12-04T14:02:33.4544839Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_cosine_cuda_float32 PASSED [0.0095s] [ 29%] 2025-12-04T14:02:33.4544989Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_general_hamming_cuda_float32 PASSED [0.9080s] [ 29%] 2025-12-04T14:02:33.4545127Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_hamming_cuda_float64 PASSED [0.0320s] [ 29%] 2025-12-04T14:02:33.4545258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_hann_cuda_float32 PASSED [0.9264s] [ 29%] 2025-12-04T14:02:33.4545404Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_hann_cuda_float64 PASSED [0.0302s] [ 29%] 2025-12-04T14:02:33.4545547Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_kaiser_cuda_float32 PASSED [0.0537s] [ 29%] 2025-12-04T14:02:33.4545668Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_float32 PASSED [0.8800s] [ 29%] 2025-12-04T14:02:33.4545781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_int16 PASSED [0.0044s] [ 29%] 2025-12-04T14:02:33.4545899Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_int32 PASSED [0.8936s] [ 29%] 2025-12-04T14:02:33.4546009Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_int64 PASSED [0.0045s] [ 29%] 2025-12-04T14:02:33.4546146Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_uint8 PASSED [0.8861s] [ 29%] 2025-12-04T14:02:33.4546259Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_complex128 PASSED [0.0063s] [ 29%] 2025-12-04T14:02:33.4546376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_complex64 PASSED [0.8986s] [ 29%] 2025-12-04T14:02:33.4546486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_int8 PASSED [0.0044s] [ 29%] 2025-12-04T14:02:33.4546597Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_uint8 PASSED [0.8823s] [ 29%] 2025-12-04T14:02:33.4546706Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_bool PASSED [0.0090s] [ 29%] 2025-12-04T14:02:33.4546825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_complex64 PASSED [1.1469s] [ 29%] 2025-12-04T14:02:33.4546939Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_float64 PASSED [0.0080s] [ 29%] 2025-12-04T14:02:33.4547345Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_int8 PASSED [0.0052s] [ 29%] 2025-12-04T14:02:33.4547461Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_bfloat16 PASSED [0.0029s] [ 29%] 2025-12-04T14:02:33.4547570Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_bool PASSED [0.9037s] [ 29%] 2025-12-04T14:02:33.4547685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_float32 PASSED [0.0044s] [ 29%] 2025-12-04T14:02:33.4547795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_float64 PASSED [0.8866s] [ 29%] 2025-12-04T14:02:33.4547921Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_int32 PASSED [0.0043s] [ 29%] 2025-12-04T14:02:33.4548030Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_uint8 PASSED [0.8860s] [ 29%] 2025-12-04T14:02:33.4548143Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_int16 PASSED [0.0050s] [ 29%] 
2025-12-04T14:02:33.4548255Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_int32 PASSED [0.8902s] [ 29%] 2025-12-04T14:02:33.4548383Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_float16 PASSED [0.0180s] [ 29%] 2025-12-04T14:02:33.4548507Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_float32 PASSED [0.0153s] [ 29%] 2025-12-04T14:02:33.4548632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_float64 PASSED [0.0151s] [ 29%] 2025-12-04T14:02:33.4548753Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_int32 PASSED [0.0149s] [ 29%] 2025-12-04T14:02:33.4548872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_cuda_float32 PASSED [0.0057s] [ 29%] 2025-12-04T14:02:33.4548985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_cuda_float64 PASSED [0.0055s] [ 29%] 2025-12-04T14:02:33.4549114Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_bool PASSED [0.0057s] [ 29%] 2025-12-04T14:02:33.4549246Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_float32 PASSED [0.0057s] [ 29%] 2025-12-04T14:02:33.4549358Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_bfloat16 PASSED [0.0146s] [ 29%] 2025-12-04T14:02:33.4549483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_float16 PASSED [0.9063s] [ 29%] 2025-12-04T14:02:33.4549601Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_int8 PASSED [0.0170s] [ 29%] 2025-12-04T14:02:33.4549756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0002s] (Skipped!) 
[ 29%] 2025-12-04T14:02:33.4549878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_int32 PASSED [0.2208s] [ 29%] 2025-12-04T14:02:33.4550005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_bool PASSED [0.2034s] [ 29%] 2025-12-04T14:02:33.4550195Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_int32 PASSED [0.8985s] [ 29%] 2025-12-04T14:02:33.4550324Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_uint8 PASSED [0.0059s] [ 29%] 2025-12-04T14:02:33.4550451Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j1_cuda_int32 PASSED [0.0059s] [ 29%] 2025-12-04T14:02:33.4550581Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j1_cuda_uint8 PASSED [0.8998s] [ 29%] 2025-12-04T14:02:33.4550716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y0_cuda_float32 PASSED [1.0572s] [ 29%] 2025-12-04T14:02:33.4550842Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_bool PASSED [0.9183s] [ 29%] 2025-12-04T14:02:33.4550968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_float64 PASSED [1.1007s] [ 29%] 2025-12-04T14:02:33.4551093Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_int16 PASSED [0.8871s] [ 29%] 2025-12-04T14:02:33.4551219Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_int64 PASSED [0.0053s] [ 29%] 2025-12-04T14:02:33.4551341Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_int8 PASSED [0.0039s] [ 29%] 2025-12-04T14:02:33.4551470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_uint8 PASSED [0.0037s] [ 29%] 2025-12-04T14:02:33.4551619Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_float64 PASSED [0.0105s] [ 29%] 2025-12-04T14:02:33.4551779Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_int32 PASSED [0.0101s] [ 29%] 2025-12-04T14:02:33.4551923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_uint8 PASSED [0.0082s] [ 29%] 2025-12-04T14:02:33.4552075Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_float32 PASSED [0.0095s] [ 29%] 2025-12-04T14:02:33.4552222Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_float64 PASSED [0.0093s] [ 29%] 2025-12-04T14:02:33.4552367Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_int16 PASSED [0.0100s] [ 29%] 2025-12-04T14:02:33.4552511Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_int32 PASSED [0.0083s] [ 29%] 2025-12-04T14:02:33.4552655Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_v_cuda_int64 PASSED [0.0100s] [ 29%] 2025-12-04T14:02:33.4552802Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_v_cuda_int8 PASSED [0.0082s] [ 29%] 2025-12-04T14:02:33.4552947Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_float32 PASSED [0.4704s] [ 29%] 2025-12-04T14:02:33.4553092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_int32 PASSED [0.0107s] [ 29%] 2025-12-04T14:02:33.4553233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_int64 PASSED [0.0083s] [ 29%] 2025-12-04T14:02:33.4553374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_float16 PASSED [0.0055s] [ 29%] 2025-12-04T14:02:33.4553508Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_float32 PASSED [0.0051s] [ 29%] 2025-12-04T14:02:33.4553630Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_int16 PASSED [0.0055s] [ 29%] 2025-12-04T14:02:33.4553748Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_int64 PASSED [0.0046s] [ 29%] 2025-12-04T14:02:33.4553870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_bool PASSED [0.9055s] [ 29%] 2025-12-04T14:02:33.4553991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_float32 PASSED [0.0075s] [ 29%] 2025-12-04T14:02:33.4554124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_int32 PASSED [0.0041s] [ 29%] 2025-12-04T14:02:33.4554244Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_int64 PASSED [0.8804s] [ 29%] 2025-12-04T14:02:33.4554390Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_float32 PASSED [0.0126s] [ 29%] 2025-12-04T14:02:33.4554532Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_int16 PASSED [0.0100s] [ 29%] 2025-12-04T14:02:33.4554676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_int32 PASSED [0.0101s] [ 29%] 2025-12-04T14:02:33.4554819Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_int64 PASSED [0.0084s] [ 29%] 2025-12-04T14:02:33.4554957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_uint8 PASSED [0.0083s] [ 29%] 2025-12-04T14:02:33.4555100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_bool PASSED [0.0097s] [ 29%] 2025-12-04T14:02:33.4555239Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_int8 PASSED [0.0081s] [ 29%] 2025-12-04T14:02:33.4555384Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_uint8 PASSED [0.0083s] [ 29%] 2025-12-04T14:02:33.4555502Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_bool PASSED [0.8916s] [ 29%] 2025-12-04T14:02:33.4555635Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_bfloat16 PASSED [0.0067s] [ 29%] 2025-12-04T14:02:33.4555752Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_float16 PASSED [0.0045s] [ 29%] 2025-12-04T14:02:33.4555870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_int64 PASSED [0.8938s] [ 29%] 2025-12-04T14:02:33.4555985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_uint8 PASSED [0.0052s] [ 29%] 2025-12-04T14:02:33.4556102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_bool PASSED [0.0052s] [ 29%] 2025-12-04T14:02:33.4556218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_uint8 PASSED [0.8908s] [ 29%] 2025-12-04T14:02:33.4556344Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_bool PASSED [0.0105s] [ 29%] 2025-12-04T14:02:33.4556470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_int16 PASSED [0.0067s] [ 29%] 2025-12-04T14:02:33.4556593Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_int32 PASSED [0.0064s] [ 29%] 2025-12-04T14:02:33.4556717Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_int8 PASSED [0.0063s] [ 29%] 2025-12-04T14:02:33.4556839Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_uint8 PASSED [0.9024s] [ 29%] 2025-12-04T14:02:33.4556982Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_float64 PASSED [0.0071s] [ 29%] 2025-12-04T14:02:33.4557119Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_int16 PASSED [0.1631s] [ 30%] 2025-12-04T14:02:33.4557269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_int32 
PASSED [0.0041s] [ 30%] 2025-12-04T14:02:33.4557416Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_int64 PASSED [0.8919s] [ 30%] 2025-12-04T14:02:33.4557556Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_int8 PASSED [0.0056s] [ 30%] 2025-12-04T14:02:33.4557691Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_uint8 PASSED [0.0039s] [ 30%] 2025-12-04T14:02:33.4557832Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_float32 PASSED [0.0051s] [ 30%] 2025-12-04T14:02:33.4557981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_float64 PASSED [1.0480s] [ 30%] 2025-12-04T14:02:33.4558122Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_int32 PASSED [0.0074s] [ 30%] 2025-12-04T14:02:33.4558261Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_int8 PASSED [0.0041s] [ 30%] 2025-12-04T14:02:33.4558397Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k0_cuda_int16 PASSED [0.0053s] [ 30%] 2025-12-04T14:02:33.4558538Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k0_cuda_int64 PASSED [0.8977s] [ 30%] 2025-12-04T14:02:33.4558675Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_float32 PASSED [0.1539s] [ 30%] 2025-12-04T14:02:33.4558810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_int8 PASSED [0.0053s] [ 30%] 2025-12-04T14:02:33.4558935Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_float32 PASSED [0.0073s] [ 30%] 2025-12-04T14:02:33.4559057Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_int32 PASSED [0.8960s] [ 30%] 2025-12-04T14:02:33.4559179Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_int16 PASSED [0.0073s] [ 30%] 2025-12-04T14:02:33.4559301Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_int8 PASSED [0.0039s] [ 30%] 2025-12-04T14:02:33.4559420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_uint8 PASSED [0.0038s] [ 30%] 2025-12-04T14:02:33.4559588Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bool PASSED [0.0072s] [ 30%] 2025-12-04T14:02:33.4559750Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_float32 PASSED [0.0072s] [ 30%] 2025-12-04T14:02:33.4559911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int16 PASSED [0.0071s] [ 30%] 2025-12-04T14:02:33.4560069Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int32 PASSED [0.0070s] [ 30%] 2025-12-04T14:02:33.4560264Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_bool PASSED [0.9019s] [ 30%] 2025-12-04T14:02:33.4560414Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_int64 PASSED [0.0056s] [ 30%] 2025-12-04T14:02:33.4560559Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_uint8 PASSED [0.0040s] [ 30%] 2025-12-04T14:02:33.4560707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_int32 PASSED [0.0054s] [ 30%] 2025-12-04T14:02:33.4560851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_int64 PASSED [0.9005s] [ 30%] 2025-12-04T14:02:33.4561008Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int32 PASSED [0.0133s] [ 30%] 2025-12-04T14:02:33.4561178Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int64 PASSED [0.0084s] [ 30%] 2025-12-04T14:02:33.4561356Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_bool PASSED [0.0093s] [ 30%] 2025-12-04T14:02:33.4561513Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_float32 PASSED [0.0095s] [ 30%] 2025-12-04T14:02:33.4561670Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int64 PASSED [0.0083s] [ 30%] 2025-12-04T14:02:33.4561828Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float32 PASSED [0.4661s] [ 30%] 2025-12-04T14:02:33.4561998Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float64 PASSED [0.3544s] [ 30%] 2025-12-04T14:02:33.4562143Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_float32 PASSED [0.8861s] [ 30%] 2025-12-04T14:02:33.4562286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_float64 PASSED [0.1463s] [ 30%] 2025-12-04T14:02:33.4562417Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_float32 PASSED [0.0137s] [ 30%] 2025-12-04T14:02:33.4562544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_float64 PASSED [0.0130s] [ 30%] 2025-12-04T14:02:33.4562671Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_int16 PASSED [0.0131s] [ 30%] 2025-12-04T14:02:33.4562794Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_uint8 PASSED [0.0131s] [ 30%] 2025-12-04T14:02:33.4562916Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_bool PASSED [0.0108s] [ 30%] 2025-12-04T14:02:33.4563037Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_float32 PASSED [0.0093s] [ 30%] 2025-12-04T14:02:33.4563163Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_float64 PASSED [0.0097s] [ 30%] 2025-12-04T14:02:33.4563283Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_int32 PASSED [0.0086s] [ 30%] 2025-12-04T14:02:33.4563398Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_bool PASSED [0.8957s] [ 30%] 2025-12-04T14:02:33.4563527Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_complex32 PASSED [0.0045s] [ 30%] 2025-12-04T14:02:33.4563644Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_int8 PASSED [0.8976s] [ 30%] 2025-12-04T14:02:33.4563759Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_uint8 PASSED [0.0047s] [ 30%] 2025-12-04T14:02:33.4563890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_complex128 PASSED [0.8957s] [ 30%] 2025-12-04T14:02:33.4564021Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_complex64 PASSED [0.0048s] [ 30%] 2025-12-04T14:02:33.4564146Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_float16 PASSED [0.8891s] [ 30%] 2025-12-04T14:02:33.4564273Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_int16 PASSED [0.0050s] [ 30%] 2025-12-04T14:02:33.4564409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_complex128 PASSED [0.0044s] [ 30%] 2025-12-04T14:02:33.4564549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_complex64 PASSED [0.8966s] [ 30%] 2025-12-04T14:02:33.4564678Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_int32 PASSED [0.0056s] [ 30%] 2025-12-04T14:02:33.4564811Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_int8 PASSED 
[0.0040s] [ 30%] 2025-12-04T14:02:33.4564937Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_cuda_bfloat16 PASSED [0.8967s] [ 30%] 2025-12-04T14:02:33.4565079Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_cuda_complex32 PASSED [0.0051s] [ 30%] 2025-12-04T14:02:33.4565214Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_cuda_float16 PASSED [0.0037s] [ 30%] 2025-12-04T14:02:33.4565331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_complex128 PASSED [0.9046s] [ 30%] 2025-12-04T14:02:33.4565450Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_complex64 PASSED [0.0061s] [ 30%] 2025-12-04T14:02:33.4565561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_int64 PASSED [0.8863s] [ 30%] 2025-12-04T14:02:33.4565675Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_int8 PASSED [0.0045s] [ 30%] 2025-12-04T14:02:33.4565795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_bool PASSED [0.0050s] [ 30%] 2025-12-04T14:02:33.4565916Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_complex128 PASSED [0.0043s] [ 30%] 2025-12-04T14:02:33.4566032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_complex64 PASSED [0.8868s] [ 30%] 2025-12-04T14:02:33.4566149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_float32 PASSED [0.0062s] [ 30%] 2025-12-04T14:02:33.4566259Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_int16 PASSED [0.0045s] [ 30%] 2025-12-04T14:02:33.4566389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_complex128 PASSED [0.0066s] [ 30%] 2025-12-04T14:02:33.4566512Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_float64 PASSED [0.0061s] [ 30%] 2025-12-04T14:02:33.4566634Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_uint8 PASSED [0.0061s] [ 30%] 
2025-12-04T14:02:33.4566749Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_bfloat16 PASSED [0.0044s] [ 30%] 2025-12-04T14:02:33.4566869Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_complex64 PASSED [0.8960s] [ 30%] 2025-12-04T14:02:33.4566982Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_int64 PASSED [0.0058s] [ 30%] 2025-12-04T14:02:33.4567118Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_complex32 PASSED [0.0042s] [ 30%] 2025-12-04T14:02:33.4567244Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_float32 PASSED [0.8949s] [ 30%] 2025-12-04T14:02:33.4567389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_float64 PASSED [0.0053s] [ 30%] 2025-12-04T14:02:33.4567515Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_int8 PASSED [0.0040s] [ 30%] 2025-12-04T14:02:33.4567630Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_bfloat16 PASSED [0.0091s] [ 30%] 2025-12-04T14:02:33.4567749Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_complex32 PASSED [0.0086s] [ 30%] 2025-12-04T14:02:33.4567863Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_float16 PASSED [0.0085s] [ 30%] 2025-12-04T14:02:33.4567977Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_int16 PASSED [0.0085s] [ 30%] 2025-12-04T14:02:33.4568090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_bfloat16 PASSED [0.9116s] [ 30%] 2025-12-04T14:02:33.4568208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_cuda_bfloat16 PASSED [0.0173s] [ 30%] 2025-12-04T14:02:33.4568337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_unbiased_cuda_complex64 PASSED [0.8987s] [ 30%] 2025-12-04T14:02:33.4568466Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_unbiased_cuda_float64 PASSED [0.0060s] [ 30%] 
2025-12-04T14:02:33.4568592Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_unbiased_cuda_complex128 PASSED [0.0041s] [ 30%] 2025-12-04T14:02:33.4568709Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stft_cuda_complex128 PASSED [0.6507s] [ 30%] 2025-12-04T14:02:33.4568832Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stft_cuda_complex64 PASSED [1.2151s] [ 30%] 2025-12-04T14:02:33.4568959Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stft_cuda_float32 PASSED [0.3525s] [ 30%] 2025-12-04T14:02:33.4569069Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stft_cuda_float64 PASSED [0.3335s] [ 30%] 2025-12-04T14:02:33.4569185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_complex32 PASSED [0.0109s] [ 30%] 2025-12-04T14:02:33.4569302Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_complex64 PASSED [0.0101s] [ 30%] 2025-12-04T14:02:33.4569410Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_uint8 PASSED [0.9011s] [ 30%] 2025-12-04T14:02:33.4569534Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_complex64 PASSED [0.0148s] [ 30%] 2025-12-04T14:02:33.4569644Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_float16 PASSED [0.0141s] [ 30%] 2025-12-04T14:02:33.4569756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_uint8 PASSED [0.9079s] [ 30%] 2025-12-04T14:02:33.4569872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_bool PASSED [0.0123s] [ 30%] 2025-12-04T14:02:33.4569991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_uint8 PASSED [0.0105s] [ 30%] 2025-12-04T14:02:33.4570158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_lowrank_cuda_complex64 PASSED [0.3024s] [ 30%] 2025-12-04T14:02:33.4570281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_lowrank_cuda_float32 PASSED [0.3008s] [ 30%] 2025-12-04T14:02:33.4570399Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_lowrank_cuda_float64 PASSED [0.2952s] [ 30%] 2025-12-04T14:02:33.4570511Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_bool PASSED [0.8944s] [ 30%] 2025-12-04T14:02:33.4570625Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_complex64 PASSED [0.0055s] [ 30%] 2025-12-04T14:02:33.4570741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_float16 PASSED [0.0039s] [ 30%] 2025-12-04T14:02:33.4570851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_int16 PASSED [0.9036s] [ 30%] 2025-12-04T14:02:33.4570963Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_int64 PASSED [0.0055s] [ 30%] 2025-12-04T14:02:33.4571091Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_int8 PASSED [0.0039s] [ 30%] 2025-12-04T14:02:33.4571201Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_uint8 PASSED [0.9002s] [ 30%] 2025-12-04T14:02:33.4571313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_complex64 PASSED [0.0048s] [ 30%] 2025-12-04T14:02:33.4571421Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_float32 PASSED [0.8946s] [ 30%] 2025-12-04T14:02:33.4571531Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_float64 PASSED [0.0048s] [ 30%] 2025-12-04T14:02:33.4571636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_int16 PASSED [0.8955s] [ 30%] 2025-12-04T14:02:33.4571741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_uint8 PASSED [0.0049s] [ 30%] 2025-12-04T14:02:33.4571865Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_float64 PASSED [0.0124s] [ 30%] 2025-12-04T14:02:33.4571991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_int64 PASSED [0.0110s] [ 30%] 2025-12-04T14:02:33.4572101Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_float16 PASSED 
[0.0079s] [ 30%] 2025-12-04T14:02:33.4572211Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_float32 PASSED [0.0076s] [ 30%] 2025-12-04T14:02:33.4572319Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_float64 PASSED [0.9130s] [ 30%] 2025-12-04T14:02:33.4572432Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_int32 PASSED [0.0099s] [ 30%] 2025-12-04T14:02:33.4572553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_int8 PASSED [0.0080s] [ 30%] 2025-12-04T14:02:33.4572676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_bfloat16 PASSED [0.0031s] [ 30%] 2025-12-04T14:02:33.4572782Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_bool PASSED [0.8927s] [ 30%] 2025-12-04T14:02:33.4572897Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_complex32 PASSED [0.0045s] [ 30%] 2025-12-04T14:02:33.4573008Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_float64 PASSED [0.8863s] [ 30%] 2025-12-04T14:02:33.4573114Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_int8 PASSED [0.0045s] [ 30%] 2025-12-04T14:02:33.4573242Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_complex32 PASSED [0.9000s] [ 30%] 2025-12-04T14:02:33.4573355Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_complex64 PASSED [0.0045s] [ 30%] 2025-12-04T14:02:33.4573469Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_float64 PASSED [0.8815s] [ 30%] 2025-12-04T14:02:33.4573579Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_int16 PASSED [0.0045s] [ 30%] 2025-12-04T14:02:33.4573690Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_uint8 PASSED [0.8916s] [ 30%] 2025-12-04T14:02:33.4573814Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_bfloat16 PASSED [0.0096s] [ 30%] 2025-12-04T14:02:33.4573939Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_float16 PASSED [0.0077s] [ 30%] 2025-12-04T14:02:33.4574060Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_float32 PASSED [0.0075s] [ 30%] 2025-12-04T14:02:33.4574183Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_int32 PASSED [0.9002s] [ 31%] 2025-12-04T14:02:33.4574300Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_int64 PASSED [0.0094s] [ 31%] 2025-12-04T14:02:33.4574422Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_uint8 PASSED [0.0077s] [ 31%] 2025-12-04T14:02:33.4574540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensordot_cuda_bfloat16 PASSED [0.0121s] [ 31%] 2025-12-04T14:02:33.4574656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_complex128 PASSED [0.0307s] [ 31%] 2025-12-04T14:02:33.4574776Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_int64 PASSED [0.0300s] [ 31%] 2025-12-04T14:02:33.4574889Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_bfloat16 PASSED [0.8937s] [ 31%] 2025-12-04T14:02:33.4574995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_bool PASSED [0.0072s] [ 31%] 2025-12-04T14:02:33.4575109Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_complex128 PASSED [0.9048s] [ 31%] 2025-12-04T14:02:33.4575225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_bool PASSED [0.0499s] [ 31%] 2025-12-04T14:02:33.4575341Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_float64 PASSED [0.9065s] [ 31%] 2025-12-04T14:02:33.4575457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_int16 PASSED [0.0053s] [ 31%] 2025-12-04T14:02:33.4575570Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_uint8 PASSED [0.9049s] [ 31%] 2025-12-04T14:02:33.4575684Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_float32 PASSED [0.0220s] [ 31%] 2025-12-04T14:02:33.4575795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_float64 PASSED [0.0067s] [ 31%] 2025-12-04T14:02:33.4576052Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_float32 SKIPPED [0.0006s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 31%] 2025-12-04T14:02:33.4576204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16 PASSED [0.0086s] [ 31%] 2025-12-04T14:02:33.4576373Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bool PASSED [0.0079s] [ 31%] 2025-12-04T14:02:33.4576537Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_float16 PASSED [0.0078s] [ 31%] 2025-12-04T14:02:33.4576688Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_int8 PASSED [0.0077s] [ 31%] 2025-12-04T14:02:33.4576805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_complex128 PASSED [0.0029s] [ 31%] 2025-12-04T14:02:33.4576916Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_float32 PASSED [0.8934s] [ 31%] 2025-12-04T14:02:33.4577038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_int32 PASSED [0.0046s] [ 31%] 2025-12-04T14:02:33.4577146Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_int64 PASSED [0.8960s] [ 31%] 2025-12-04T14:02:33.4577276Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_bfloat16 PASSED [0.0084s] [ 31%] 2025-12-04T14:02:33.4577404Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_complex64 PASSED [0.8975s] [ 31%] 2025-12-04T14:02:33.4577531Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_float16 PASSED 
[0.0081s] [ 31%] 2025-12-04T14:02:33.4577654Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_float32 PASSED [0.9066s] [ 31%] 2025-12-04T14:02:33.4577780Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_float64 PASSED [0.0081s] [ 31%] 2025-12-04T14:02:33.4577902Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_int32 PASSED [0.8993s] [ 31%] 2025-12-04T14:02:33.4578027Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_int64 PASSED [0.0079s] [ 31%] 2025-12-04T14:02:33.4578148Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_uint8 PASSED [0.8882s] [ 31%] 2025-12-04T14:02:33.4578270Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_complex32 PASSED [0.0055s] [ 31%] 2025-12-04T14:02:33.4578385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_int16 PASSED [0.0044s] [ 31%] 2025-12-04T14:02:33.4578502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_int64 PASSED [0.0043s] [ 31%] 2025-12-04T14:02:33.4578635Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_bfloat16 PASSED [0.0321s] [ 31%] 2025-12-04T14:02:33.4578756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_complex128 PASSED [0.0307s] [ 31%] 2025-12-04T14:02:33.4578875Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_float16 PASSED [0.9368s] [ 31%] 2025-12-04T14:02:33.4578988Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_int16 PASSED [0.0332s] [ 31%] 2025-12-04T14:02:33.4579100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_bfloat16 PASSED [0.0315s] [ 31%] 2025-12-04T14:02:33.4579215Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_complex128 PASSED [0.0306s] [ 31%] 2025-12-04T14:02:33.4579331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_complex64 PASSED 
[0.0305s] [ 31%] 2025-12-04T14:02:33.4579441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_int32 PASSED [0.9286s] [ 31%] 2025-12-04T14:02:33.4579551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_int8 PASSED [0.0334s] [ 31%] 2025-12-04T14:02:33.4579660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_uint8 PASSED [0.0312s] [ 31%] 2025-12-04T14:02:33.4579768Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_bool PASSED [0.0105s] [ 31%] 2025-12-04T14:02:33.4579881Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_complex64 PASSED [0.0101s] [ 31%] 2025-12-04T14:02:33.4579991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_int16 PASSED [0.0099s] [ 31%] 2025-12-04T14:02:33.4580159Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_int32 PASSED [0.0099s] [ 31%] 2025-12-04T14:02:33.4580282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_uint8 PASSED [0.0099s] [ 31%] 2025-12-04T14:02:33.4580402Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_indices_cuda_int32 PASSED [0.0157s] [ 31%] 2025-12-04T14:02:33.4580512Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_bfloat16 PASSED [0.0100s] [ 31%] 2025-12-04T14:02:33.4580621Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_bool PASSED [0.0101s] [ 31%] 2025-12-04T14:02:33.4580734Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_complex128 PASSED [0.0100s] [ 31%] 2025-12-04T14:02:33.4580861Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_complex32 PASSED [0.9043s] [ 31%] 2025-12-04T14:02:33.4580969Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_float16 PASSED [0.0120s] [ 31%] 2025-12-04T14:02:33.4581078Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_int8 PASSED [0.0102s] [ 31%] 2025-12-04T14:02:33.4581196Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_indices_cuda_int64 PASSED [0.9022s] [ 31%] 2025-12-04T14:02:33.4581318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_float64 PASSED [0.0110s] [ 31%] 2025-12-04T14:02:33.4581433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_int16 PASSED [0.0094s] [ 31%] 2025-12-04T14:02:33.4581549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_int32 PASSED [0.0093s] [ 31%] 2025-12-04T14:02:33.4581661Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trunc_cuda_float64 PASSED [0.8792s] [ 31%] 2025-12-04T14:02:33.4581771Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trunc_cuda_int8 PASSED [0.0044s] [ 31%] 2025-12-04T14:02:33.4581894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_complex128 PASSED [0.0053s] [ 31%] 2025-12-04T14:02:33.4582009Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_bfloat16 PASSED [0.8802s] [ 31%] 2025-12-04T14:02:33.4582120Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_int32 PASSED [0.0054s] [ 31%] 2025-12-04T14:02:33.4582245Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_uint8 PASSED [0.0042s] [ 31%] 2025-12-04T14:02:33.4585280Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_bfloat16 PASSED [0.0051s] [ 31%] 2025-12-04T14:02:33.4585420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_int16 PASSED [0.0052s] [ 31%] 2025-12-04T14:02:33.4585541Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_int64 PASSED [0.0056s] [ 31%] 2025-12-04T14:02:33.4585662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_int8 PASSED [0.0050s] [ 31%] 2025-12-04T14:02:33.4585789Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_complex128 PASSED [0.0132s] [ 31%] 2025-12-04T14:02:33.4585919Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_complex32 PASSED [0.0129s] [ 31%] 2025-12-04T14:02:33.4586042Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_float16 PASSED [0.0127s] [ 31%] 2025-12-04T14:02:33.4586170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_float64 PASSED [0.0128s] [ 31%] 2025-12-04T14:02:33.4586288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_int32 PASSED [0.0127s] [ 31%] 2025-12-04T14:02:33.4586406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_bfloat16 PASSED [0.0081s] [ 31%] 2025-12-04T14:02:33.4586525Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_complex128 PASSED [0.0082s] [ 31%] 2025-12-04T14:02:33.4586643Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_complex64 PASSED [0.0081s] [ 31%] 2025-12-04T14:02:33.4586786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_float32 PASSED [0.0080s] [ 31%] 2025-12-04T14:02:33.4586914Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_int16 PASSED [0.0080s] [ 31%] 2025-12-04T14:02:33.4587033Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_complex128 PASSED [0.0036s] [ 31%] 2025-12-04T14:02:33.4587152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_float16 PASSED [0.8919s] [ 31%] 2025-12-04T14:02:33.4587271Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_float64 PASSED [0.0051s] [ 31%] 2025-12-04T14:02:33.4587398Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_bool PASSED [0.0966s] [ 31%] 2025-12-04T14:02:33.4587544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_float16 PASSED [0.0984s] [ 31%] 2025-12-04T14:02:33.4587674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_float64 PASSED [0.0974s] [ 31%] 2025-12-04T14:02:33.4587804Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_int32 PASSED [0.0968s] [ 31%] 2025-12-04T14:02:33.4587931Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_int64 PASSED [0.0975s] [ 31%] 2025-12-04T14:02:33.4588061Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_int8 PASSED [0.0979s] [ 31%] 2025-12-04T14:02:33.4588171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_bool PASSED [0.1955s] [ 31%] 2025-12-04T14:02:33.4588287Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_float16 PASSED [0.2137s] [ 31%] 2025-12-04T14:02:33.4588399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_float32 PASSED [0.2043s] [ 31%] 2025-12-04T14:02:33.4588515Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_float64 PASSED [0.2059s] [ 31%] 2025-12-04T14:02:33.4588628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_int32 PASSED [0.2009s] [ 31%] 2025-12-04T14:02:33.4588740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_int64 PASSED [0.2016s] [ 31%] 2025-12-04T14:02:33.4588852Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_int8 PASSED [0.2001s] [ 31%] 2025-12-04T14:02:33.4588987Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unravel_index_cuda_int16 PASSED [0.0427s] [ 31%] 2025-12-04T14:02:33.4589111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unravel_index_cuda_int64 PASSED [0.0416s] [ 31%] 2025-12-04T14:02:33.4589233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_bfloat16 PASSED [0.8931s] [ 31%] 2025-12-04T14:02:33.4589364Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_complex128 PASSED [0.0053s] [ 31%] 2025-12-04T14:02:33.4589485Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_float64 PASSED [0.8897s] [ 31%] 2025-12-04T14:02:33.4589609Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_bfloat16 PASSED [0.0048s] [ 31%] 2025-12-04T14:02:33.4589737Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_complex128 PASSED [0.8901s] [ 31%] 2025-12-04T14:02:33.4589859Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_float16 PASSED [0.0048s] [ 31%] 2025-12-04T14:02:33.4589977Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_int64 PASSED [0.8831s] [ 31%] 2025-12-04T14:02:33.4590259Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_bfloat16 PASSED [0.0086s] [ 31%] 2025-12-04T14:02:33.4590378Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_bool PASSED [0.0066s] [ 31%] 2025-12-04T14:02:33.4590500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_float32 PASSED [0.0046s] [ 31%] 2025-12-04T14:02:33.4590616Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_int32 PASSED [0.8900s] [ 31%] 2025-12-04T14:02:33.4590747Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_cuda_complex64 PASSED [0.0120s] [ 31%] 2025-12-04T14:02:33.4590872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_cuda_float32 PASSED [0.0096s] [ 31%] 2025-12-04T14:02:33.4590985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_cuda_float64 PASSED [0.8976s] [ 31%] 2025-12-04T14:02:33.4591108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_cuda_complex64 PASSED [0.0170s] [ 31%] 2025-12-04T14:02:33.4591224Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_cuda_float16 PASSED [0.0153s] [ 31%] 2025-12-04T14:02:33.4591353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_unbiased_cuda_float16 PASSED [0.8939s] [ 31%] 2025-12-04T14:02:33.4591495Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_unbiased_cuda_float64 PASSED [0.0057s] [ 31%] 2025-12-04T14:02:33.4591621Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_unbiased_cuda_complex64 PASSED [0.0037s] [ 31%] 2025-12-04T14:02:33.4591742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_unbiased_cuda_float16 PASSED [0.8813s] [ 31%] 2025-12-04T14:02:33.4591862Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vdot_cuda_complex64 PASSED [0.0062s] [ 31%] 2025-12-04T14:02:33.4591984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_complex128 PASSED [0.0036s] [ 31%] 2025-12-04T14:02:33.4592101Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_complex64 PASSED [0.8933s] [ 31%] 2025-12-04T14:02:33.4592212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_int64 PASSED [0.0051s] [ 31%] 2025-12-04T14:02:33.4592325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_int8 PASSED [0.0036s] [ 31%] 2025-12-04T14:02:33.4592446Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_real_cuda_complex64 PASSED [0.9136s] [ 31%] 2025-12-04T14:02:33.4592571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_complex64 PASSED [0.0074s] [ 31%] 2025-12-04T14:02:33.4592686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_int64 PASSED [0.8997s] [ 31%] 2025-12-04T14:02:33.4592801Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_int8 PASSED [0.0078s] [ 31%] 2025-12-04T14:02:33.4592935Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_uint8 PASSED [0.8943s] [ 31%] 2025-12-04T14:02:33.4593050Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_bool PASSED [0.0060s] [ 31%] 2025-12-04T14:02:33.4593166Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_complex32 PASSED [0.0045s] [ 31%] 2025-12-04T14:02:33.4593276Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_float16 PASSED [0.8817s] [ 31%] 2025-12-04T14:02:33.4593385Z 
test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_int8 PASSED [0.0062s] [ 31%] 2025-12-04T14:02:33.4593495Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_uint8 PASSED [0.0044s] [ 31%] 2025-12-04T14:02:33.4593610Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_bool PASSED [0.8888s] [ 32%] 2025-12-04T14:02:33.4593719Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_int16 PASSED [0.0050s] [ 32%] 2025-12-04T14:02:33.4593830Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_uint8 PASSED [0.0037s] [ 32%] 2025-12-04T14:02:33.4593947Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_complex128 PASSED [0.0053s] [ 32%] 2025-12-04T14:02:33.4594060Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_float64 PASSED [0.0048s] [ 32%] 2025-12-04T14:02:33.4594170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_int64 PASSED [0.0048s] [ 32%] 2025-12-04T14:02:33.4594283Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_uint8 PASSED [0.0048s] [ 32%] 2025-12-04T14:02:33.4594401Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_bool PASSED [0.0075s] [ 32%] 2025-12-04T14:02:33.4594526Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_int64 PASSED [0.0073s] [ 32%] 2025-12-04T14:02:33.4594634Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_uint8 PASSED [0.0072s] [ 32%] 2025-12-04T14:02:33.4594744Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_int64 PASSED [0.0135s] [ 32%] 2025-12-04T14:02:33.4594860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_float16 PASSED [0.0043s] [ 32%] 2025-12-04T14:02:33.4594972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_float64 PASSED [0.0041s] [ 32%] 2025-12-04T14:02:33.4595092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_int16 PASSED [0.0041s] [ 32%] 
2025-12-04T14:02:33.4595199Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_int32 PASSED [0.0041s] [ 32%] 2025-12-04T14:02:33.4595312Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_float64 PASSED [0.9099s] [ 32%] 2025-12-04T14:02:33.4595423Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_int8 PASSED [0.0045s] [ 32%] 2025-12-04T14:02:33.4595535Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_uint8 PASSED [0.8992s] [ 32%] 2025-12-04T14:02:33.4595657Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_complex64 PASSED [0.0085s] [ 32%] 2025-12-04T14:02:33.4595774Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_int32 PASSED [0.8998s] [ 32%] 2025-12-04T14:02:33.4595940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_bfloat16 SKIPPED [0.0015s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4596107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_float16 SKIPPED [0.0012s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4596268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4596438Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4596600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4596770Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4596933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4597104Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4597289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4597459Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4597635Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4597804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4597973Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4598139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4598305Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4598502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4598667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rand___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4598832Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4599006Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4599179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4599357Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmatmul___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4599535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmatmul___cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4599708Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmatmul___cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4599879Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4600048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4600347Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4600512Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4600677Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4600856Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4601024Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4601189Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4601352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4601519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4601685Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4601849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4602015Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4602176Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4602343Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4602520Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4602699Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4602862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_int32 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4603025Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4603204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4603379Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4603555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4603724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4603894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T14:02:33.4604027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_float16 PASSED [0.0296s] [ 32%] 2025-12-04T14:02:33.4604161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_float32 PASSED [0.0210s] [ 32%] 2025-12-04T14:02:33.4604293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_int16 PASSED [0.0209s] [ 32%] 2025-12-04T14:02:33.4604424Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_uint8 PASSED [0.0208s] [ 32%] 2025-12-04T14:02:33.4604555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_float32 PASSED [0.0210s] [ 32%] 2025-12-04T14:02:33.4604687Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_int64 XFAIL [0.0074s] [ 32%] 2025-12-04T14:02:33.4604822Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_int8 XFAIL [0.9004s] [ 32%] 2025-12-04T14:02:33.4604958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_complex64 PASSED [1.0404s] [ 32%] 2025-12-04T14:02:33.4605088Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_float32 PASSED [0.1120s] [ 32%] 2025-12-04T14:02:33.4605216Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_int32 PASSED [0.0824s] [ 32%] 2025-12-04T14:02:33.4605344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_int64 PASSED [0.0828s] [ 32%] 2025-12-04T14:02:33.4605472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_uint8 PASSED [0.0824s] [ 32%] 2025-12-04T14:02:33.4605606Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_bool XFAIL [0.0086s] [ 32%] 2025-12-04T14:02:33.4605753Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_complex64 PASSED [1.1654s] [ 32%] 2025-12-04T14:02:33.4605890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_float64 PASSED [0.1949s] [ 32%] 2025-12-04T14:02:33.4606022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_bool XFAIL [0.0104s] [ 32%] 2025-12-04T14:02:33.4606161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_complex64 PASSED [1.1564s] [ 32%] 2025-12-04T14:02:33.4606296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_int8 PASSED [0.1315s] [ 32%] 2025-12-04T14:02:33.4606438Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_uint8 PASSED [0.1312s] [ 32%] 2025-12-04T14:02:33.4606584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_bfloat16 PASSED [0.0281s] [ 32%] 
2025-12-04T14:02:33.4606712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_bool XFAIL [0.0074s] [ 32%] 2025-12-04T14:02:33.4606849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_complex128 PASSED [0.0212s] [ 32%] 2025-12-04T14:02:33.4606983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_complex64 PASSED [0.0236s] [ 32%] 2025-12-04T14:02:33.4607134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_float32 PASSED [0.0213s] [ 32%] 2025-12-04T14:02:33.4607260Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_int64 XFAIL [0.0076s] [ 32%] 2025-12-04T14:02:33.4607394Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_float64 PASSED [0.9126s] [ 32%] 2025-12-04T14:02:33.4607522Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_int64 XFAIL [0.0078s] [ 32%] 2025-12-04T14:02:33.4607647Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_int8 XFAIL [0.0075s] [ 32%] 2025-12-04T14:02:33.4607776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_bool XFAIL [0.8975s] [ 32%] 2025-12-04T14:02:33.4607906Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_float16 PASSED [0.9100s] [ 32%] 2025-12-04T14:02:33.4608039Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_float32 PASSED [0.0211s] [ 32%] 2025-12-04T14:02:33.4608168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_float64 PASSED [0.0209s] [ 32%] 2025-12-04T14:02:33.4608298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_int16 PASSED [0.0208s] [ 32%] 2025-12-04T14:02:33.4608427Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_int64 PASSED [0.0208s] [ 32%] 
2025-12-04T14:02:33.4608557Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_uint8 PASSED [0.0206s] [ 32%] 2025-12-04T14:02:33.4608698Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_bool XFAIL [0.0185s] [ 32%] 2025-12-04T14:02:33.4608839Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_float32 PASSED [1.0824s] [ 32%] 2025-12-04T14:02:33.4608976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_float64 PASSED [0.1967s] [ 32%] 2025-12-04T14:02:33.4609118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_bfloat16 PASSED [0.2450s] [ 32%] 2025-12-04T14:02:33.4609252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_bool XFAIL [0.0186s] [ 32%] 2025-12-04T14:02:33.4609387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_int16 PASSED [1.0196s] [ 32%] 2025-12-04T14:02:33.4609521Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_int8 PASSED [0.1310s] [ 32%] 2025-12-04T14:02:33.4609654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_bfloat16 PASSED [0.0213s] [ 32%] 2025-12-04T14:02:33.4609782Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_int8 PASSED [0.0209s] [ 32%] 2025-12-04T14:02:33.4609914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_bfloat16 PASSED [0.0279s] [ 32%] 2025-12-04T14:02:33.4610050Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_complex128 PASSED [0.0210s] [ 32%] 2025-12-04T14:02:33.4610220Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_int16 XFAIL [0.0074s] [ 32%] 2025-12-04T14:02:33.4610360Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_int64 XFAIL [0.9043s] [ 32%] 2025-12-04T14:02:33.4610484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_uint8 XFAIL [0.8953s] [ 32%] 2025-12-04T14:02:33.4610618Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_bfloat16 PASSED [0.9149s] [ 32%] 2025-12-04T14:02:33.4610748Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_float16 PASSED [0.0280s] [ 32%] 2025-12-04T14:02:33.4610875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_int16 XFAIL [0.0075s] [ 32%] 2025-12-04T14:02:33.4611013Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_int32 XFAIL [0.0072s] [ 32%] 2025-12-04T14:02:33.4611139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_uint8 XFAIL [0.9061s] [ 32%] 2025-12-04T14:02:33.4611264Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_bool XFAIL [0.8934s] [ 32%] 2025-12-04T14:02:33.4611398Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_complex128 PASSED [1.0309s] [ 32%] 2025-12-04T14:02:33.4611530Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_float32 PASSED [0.1117s] [ 32%] 2025-12-04T14:02:33.4611659Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_float64 PASSED [0.1119s] [ 32%] 2025-12-04T14:02:33.4611790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_float16 PASSED [0.0280s] [ 32%] 2025-12-04T14:02:33.4611920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_float64 PASSED [0.0208s] [ 32%] 2025-12-04T14:02:33.4612046Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_int8 XFAIL [0.0074s] [ 32%] 2025-12-04T14:02:33.4612172Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_uint8 XFAIL [0.0073s] [ 32%] 2025-12-04T14:02:33.4612300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_bool XFAIL [0.8969s] [ 32%] 2025-12-04T14:02:33.4612445Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_float32 PASSED [0.9108s] [ 32%] 2025-12-04T14:02:33.4612579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_float64 PASSED [0.0212s] [ 32%] 2025-12-04T14:02:33.4612707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_int32 XFAIL [0.0076s] [ 32%] 2025-12-04T14:02:33.4612833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_uint8 XFAIL [0.0075s] [ 32%] 2025-12-04T14:02:33.4612963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_float64 PASSED [0.9201s] [ 33%] 2025-12-04T14:02:33.4613087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_int8 XFAIL [0.0077s] [ 33%] 2025-12-04T14:02:33.4613221Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_bfloat16 PASSED [0.0281s] [ 33%] 2025-12-04T14:02:33.4613346Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_bool XFAIL [0.0074s] [ 33%] 2025-12-04T14:02:33.4613484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_complex64 PASSED [0.9174s] [ 33%] 2025-12-04T14:02:33.4613616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_bfloat16 PASSED [0.0281s] [ 33%] 2025-12-04T14:02:33.4613751Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_complex64 XFAIL [0.0076s] [ 33%] 2025-12-04T14:02:33.4613884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_float32 PASSED [0.9098s] [ 33%] 
2025-12-04T14:02:33.4614032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_float64 PASSED [0.0211s] [ 33%] 2025-12-04T14:02:33.4614173Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_int32 PASSED [0.0209s] [ 33%] 2025-12-04T14:02:33.4614301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_bool XFAIL [0.0075s] [ 33%] 2025-12-04T14:02:33.4614436Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_complex128 XFAIL [0.0074s] [ 33%] 2025-12-04T14:02:33.4614567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_complex64 XFAIL [0.8928s] [ 33%] 2025-12-04T14:02:33.4614706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_int16 XFAIL [0.9064s] [ 33%] 2025-12-04T14:02:33.4614832Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_int64 XFAIL [0.9108s] [ 33%] 2025-12-04T14:02:33.4614958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_int8 XFAIL [0.9073s] [ 33%] 2025-12-04T14:02:33.4615092Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_complex64 PASSED [1.0867s] [ 33%] 2025-12-04T14:02:33.4615224Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_float16 PASSED [0.1379s] [ 33%] 2025-12-04T14:02:33.4615350Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_uint8 XFAIL [0.0093s] [ 33%] 2025-12-04T14:02:33.4615556Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_bool SKIPPED [0.0001s] (In-place lgamma not supported for integral tensors) [ 33%] 2025-12-04T14:02:33.4615692Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_complex64 XFAIL [0.9118s] [ 33%] 2025-12-04T14:02:33.4615826Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_float16 PASSED [0.9325s] [ 33%] 2025-12-04T14:02:33.4615960Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_float64 PASSED [0.0214s] [ 33%] 2025-12-04T14:02:33.4616088Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_bool XFAIL [0.0076s] [ 33%] 2025-12-04T14:02:33.4616225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_complex128 PASSED [0.9120s] [ 33%] 2025-12-04T14:02:33.4616369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_float16 PASSED [0.0287s] [ 33%] 2025-12-04T14:02:33.4616497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_int8 XFAIL [0.0078s] [ 33%] 2025-12-04T14:02:33.4616626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_uint8 XFAIL [0.9088s] [ 33%] 2025-12-04T14:02:33.4616760Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_bfloat16 PASSED [0.9374s] [ 33%] 2025-12-04T14:02:33.4616892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_float32 PASSED [0.0213s] [ 33%] 2025-12-04T14:02:33.4617026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_float64 PASSED [0.0210s] [ 33%] 2025-12-04T14:02:33.4617153Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_int16 XFAIL [0.0074s] [ 33%] 2025-12-04T14:02:33.4617286Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_float16 PASSED [0.9203s] [ 33%] 2025-12-04T14:02:33.4617415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_float64 PASSED [0.0213s] [ 33%] 2025-12-04T14:02:33.4617547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_bfloat16 PASSED [0.0282s] [ 33%] 
2025-12-04T14:02:33.4617675Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_float16 PASSED [0.0281s] [ 33%] 2025-12-04T14:02:33.4617815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_float32 PASSED [0.0209s] [ 33%] 2025-12-04T14:02:33.4617950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_int16 XFAIL [0.0074s] [ 33%] 2025-12-04T14:02:33.4618075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_uint8 XFAIL [0.9034s] [ 33%] 2025-12-04T14:02:33.4618248Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_bool SKIPPED [0.8971s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4618427Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_complex64 SKIPPED [0.0014s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4618613Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4618751Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_bfloat16 PASSED [0.2487s] [ 33%] 2025-12-04T14:02:33.4618884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_bool XFAIL [0.0189s] [ 33%] 2025-12-04T14:02:33.4619019Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_float64 PASSED [1.0809s] [ 33%] 2025-12-04T14:02:33.4619154Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_int64 PASSED [0.1321s] [ 33%] 2025-12-04T14:02:33.4619285Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_uint8 PASSED [0.1315s] [ 33%] 2025-12-04T14:02:33.4619423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_bfloat16 PASSED [0.2472s] [ 33%] 
2025-12-04T14:02:33.4619553Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_bool XFAIL [0.0188s] [ 33%] 2025-12-04T14:02:33.4619687Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_float64 PASSED [1.0989s] [ 33%] 2025-12-04T14:02:33.4619821Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_int64 PASSED [0.1324s] [ 33%] 2025-12-04T14:02:33.4619953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_uint8 PASSED [0.1346s] [ 33%] 2025-12-04T14:02:33.4620145Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_bfloat16 PASSED [0.1136s] [ 33%] 2025-12-04T14:02:33.4620276Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_complex64 PASSED [0.1426s] [ 33%] 2025-12-04T14:02:33.4620407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_float16 PASSED [0.1120s] [ 33%] 2025-12-04T14:02:33.4620535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_int16 PASSED [0.0827s] [ 33%] 2025-12-04T14:02:33.4620667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_uint8 PASSED [0.0209s] [ 33%] 2025-12-04T14:02:33.4620851Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4621031Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4621203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4621332Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_int8 PASSED [0.0877s] [ 33%] 
2025-12-04T14:02:33.4621474Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_float16 PASSED [0.0278s] [ 33%] 2025-12-04T14:02:33.4621609Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_int16 XFAIL [0.0075s] [ 33%] 2025-12-04T14:02:33.4621757Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_int64 XFAIL [0.0073s] [ 33%] 2025-12-04T14:02:33.4621909Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_complex64 XFAIL [0.9110s] [ 33%] 2025-12-04T14:02:33.4622044Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_float16 PASSED [0.9148s] [ 33%] 2025-12-04T14:02:33.4622175Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_float64 PASSED [0.0209s] [ 33%] 2025-12-04T14:02:33.4622304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_bool XFAIL [0.0074s] [ 33%] 2025-12-04T14:02:33.4622452Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_complex128 PASSED [0.0211s] [ 33%] 2025-12-04T14:02:33.4622587Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_float16 PASSED [0.0279s] [ 33%] 2025-12-04T14:02:33.4622715Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_int16 XFAIL [0.0074s] [ 33%] 2025-12-04T14:02:33.4622844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_int8 XFAIL [0.9095s] [ 33%] 2025-12-04T14:02:33.4622972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_uint8 XFAIL [0.9010s] [ 33%] 2025-12-04T14:02:33.4623111Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_bfloat16 PASSED [0.9264s] [ 33%] 2025-12-04T14:02:33.4623248Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_float64 PASSED [0.0303s] [ 33%] 2025-12-04T14:02:33.4623379Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_int16 XFAIL [0.0076s] [ 33%] 2025-12-04T14:02:33.4623510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_int32 XFAIL [0.9034s] [ 33%] 2025-12-04T14:02:33.4623639Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_int8 XFAIL [0.8938s] [ 33%] 2025-12-04T14:02:33.4623774Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_bfloat16 PASSED [0.9203s] [ 33%] 2025-12-04T14:02:33.4623902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_int16 PASSED [0.0213s] [ 33%] 2025-12-04T14:02:33.4624041Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_int64 PASSED [0.0210s] [ 33%] 2025-12-04T14:02:33.4624168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_int32 XFAIL [0.0075s] [ 33%] 2025-12-04T14:02:33.4624294Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_uint8 XFAIL [0.8843s] [ 33%] 2025-12-04T14:02:33.4624428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_complex64 PASSED [0.9082s] [ 33%] 2025-12-04T14:02:33.4624555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_int32 XFAIL [0.0078s] [ 33%] 2025-12-04T14:02:33.4624686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_float16 PASSED [0.0283s] [ 33%] 2025-12-04T14:02:33.4624817Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_float32 PASSED [0.0211s] [ 33%] 2025-12-04T14:02:33.4624944Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_int8 XFAIL [0.0074s] [ 33%] 
2025-12-04T14:02:33.4625076Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_complex128 XFAIL [0.9147s] [ 33%] 2025-12-04T14:02:33.4625208Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_complex64 XFAIL [0.9206s] [ 33%] 2025-12-04T14:02:33.4625336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_float32 XFAIL [0.9593s] [ 33%] 2025-12-04T14:02:33.4625463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_int64 XFAIL [0.9527s] [ 33%] 2025-12-04T14:02:33.4625596Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_uint8 XFAIL [0.9343s] [ 33%] 2025-12-04T14:02:33.4625732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_bool XFAIL [0.9358s] [ 33%] 2025-12-04T14:02:33.4625861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_float16 PASSED [0.9419s] [ 33%] 2025-12-04T14:02:33.4625991Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_float64 PASSED [0.0211s] [ 33%] 2025-12-04T14:02:33.4626114Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_int16 XFAIL [0.0075s] [ 33%] 2025-12-04T14:02:33.4626248Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_int32 XFAIL [0.9286s] [ 33%] 2025-12-04T14:02:33.4626372Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_uint8 XFAIL [0.9265s] [ 33%] 2025-12-04T14:02:33.4626505Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_bfloat16 PASSED [0.9436s] [ 33%] 2025-12-04T14:02:33.4626631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_bool XFAIL [0.0077s] [ 33%] 2025-12-04T14:02:33.4626767Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_complex128 PASSED [0.9409s] [ 33%] 
2025-12-04T14:02:33.4626902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_complex64 PASSED [0.0212s] [ 33%] 2025-12-04T14:02:33.4627027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_int8 XFAIL [0.0075s] [ 33%] 2025-12-04T14:02:33.4627156Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_uint8 XFAIL [0.0073s] [ 33%] 2025-12-04T14:02:33.4627284Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_bool XFAIL [0.9312s] [ 33%] 2025-12-04T14:02:33.4627419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_complex64 XFAIL [0.9245s] [ 33%] 2025-12-04T14:02:33.4627552Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_float64 PASSED [0.9520s] [ 33%] 2025-12-04T14:02:33.4627683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_int32 PASSED [0.0210s] [ 33%] 2025-12-04T14:02:33.4627824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_int8 PASSED [0.0207s] [ 33%] 2025-12-04T14:02:33.4627955Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_uint8 PASSED [0.0207s] [ 33%] 2025-12-04T14:02:33.4628091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_complex128 PASSED [0.0177s] [ 33%] 2025-12-04T14:02:33.4628226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_complex64 PASSED [0.0176s] [ 33%] 2025-12-04T14:02:33.4628360Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_float64 PASSED [0.0175s] [ 33%] 2025-12-04T14:02:33.4628488Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_int16 PASSED [0.0175s] [ 33%] 2025-12-04T14:02:33.4628617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_int64 PASSED 
[0.0174s] [ 33%] 2025-12-04T14:02:33.4628809Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_lengths_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4629001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_offsets_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4629190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__softmax_backward_data_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4629382Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4629584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4629771Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4629956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4630183Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4630366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4630545Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4630754Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4630955Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4631147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__upsample_bilinear2d_aa_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T14:02:33.4633762Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_complex128 XFAIL [0.0030s] [ 33%] 2025-12-04T14:02:33.4633888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_complex32 XFAIL [0.9251s] [ 33%] 2025-12-04T14:02:33.4634013Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_int64 PASSED [1.8433s] [ 33%] 2025-12-04T14:02:33.4634137Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_int8 PASSED [0.0044s] [ 33%] 2025-12-04T14:02:33.4634290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_bfloat16 PASSED [0.0045s] [ 33%] 2025-12-04T14:02:33.4634519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T14:02:33.4634742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T14:02:33.4634994Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T14:02:33.4635226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_complex64 SKIPPED 
[0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T14:02:33.4635444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T14:02:33.4635664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T14:02:33.4635889Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T14:02:33.4636014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_bfloat16 PASSED [0.0072s] [ 34%] 2025-12-04T14:02:33.4636137Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_bool PASSED [0.9331s] [ 34%] 2025-12-04T14:02:33.4636278Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_complex128 PASSED [0.0096s] [ 34%] 2025-12-04T14:02:33.4636402Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_int8 PASSED [0.0073s] [ 34%] 2025-12-04T14:02:33.4636533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_bfloat16 PASSED [0.0088s] [ 34%] 2025-12-04T14:02:33.4636669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_complex64 PASSED [0.3644s] [ 34%] 2025-12-04T14:02:33.4636807Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_float16 PASSED [0.0087s] [ 34%] 2025-12-04T14:02:33.4636941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_float32 PASSED [0.9338s] [ 34%] 2025-12-04T14:02:33.4637069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_bfloat16 PASSED [0.0113s] [ 34%] 2025-12-04T14:02:33.4637200Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_uint8 PASSED [0.0087s] [ 34%] 2025-12-04T14:02:33.4637325Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_bfloat16 PASSED [0.0069s] [ 34%] 2025-12-04T14:02:33.4637457Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_complex64 PASSED [0.9358s] [ 34%] 2025-12-04T14:02:33.4637580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_float16 PASSED [0.0095s] [ 34%] 2025-12-04T14:02:33.4637707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_float32 PASSED [0.0061s] [ 34%] 2025-12-04T14:02:33.4637854Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_decomposed_cuda_float16 PASSED [0.0065s] [ 34%] 2025-12-04T14:02:33.4638053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_bfloat16 PASSED [0.9463s] [ 34%] 2025-12-04T14:02:33.4638182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_float16 PASSED [0.0076s] [ 34%] 2025-12-04T14:02:33.4638306Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_float32 PASSED [0.9299s] [ 34%] 2025-12-04T14:02:33.4638458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_float64 PASSED [0.0073s] [ 34%] 2025-12-04T14:02:33.4638582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_float32 PASSED [0.0060s] [ 34%] 2025-12-04T14:02:33.4638709Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_int16 PASSED [0.0049s] [ 34%] 2025-12-04T14:02:33.4638835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_int32 PASSED [0.0047s] [ 34%] 2025-12-04T14:02:33.4638963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_int64 PASSED [0.0047s] [ 34%] 2025-12-04T14:02:33.4639139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_bool 
SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4639324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4639501Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4639680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4639860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4640033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4640253Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4640432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4640614Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4640779Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4640963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4641130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4641318Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_H_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4641512Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___radd___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4641698Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rmod___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4641890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rsub___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4642043Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_add_cuda_float32 PASSED [0.1126s] [ 34%] 2025-12-04T14:02:33.4642225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_atan_cuda_float32 PASSED [0.0225s] [ 34%] 2025-12-04T14:02:33.4642377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_ceil_cuda_float32 PASSED [0.0217s] [ 34%] 2025-12-04T14:02:33.4642543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_clamp_min_cuda_float32 PASSED [0.1922s] [ 34%] 2025-12-04T14:02:33.4642704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_cos_cuda_float32 PASSED [0.0210s] [ 34%] 2025-12-04T14:02:33.4642862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_cosh_cuda_float32 PASSED [0.0209s] [ 34%] 2025-12-04T14:02:33.4643010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_erf_cuda_float32 PASSED [0.0208s] [ 34%] 2025-12-04T14:02:33.4643169Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_expm1_cuda_float32 PASSED [0.0209s] [ 34%] 2025-12-04T14:02:33.4643328Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_floor_cuda_float32 PASSED [0.0207s] [ 34%] 2025-12-04T14:02:33.4643478Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_frac_cuda_float32 PASSED [0.0336s] [ 34%] 2025-12-04T14:02:33.4643636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_lerp_cuda_float32 PASSED [0.1368s] [ 34%] 2025-12-04T14:02:33.4643787Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_log1p_cuda_float32 PASSED [0.0207s] [ 34%] 2025-12-04T14:02:33.4643941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_log_cuda_float32 PASSED [0.0208s] [ 34%] 2025-12-04T14:02:33.4644090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_pow_cuda_float32 PASSED [0.1258s] [ 34%] 2025-12-04T14:02:33.4644248Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_round_cuda_float32 PASSED [0.0205s] [ 34%] 2025-12-04T14:02:33.4644398Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_rsqrt_cuda_float32 PASSED [0.0208s] [ 34%] 2025-12-04T14:02:33.4644564Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_tanh_cuda_float32 PASSED [0.0208s] [ 34%] 2025-12-04T14:02:33.4644715Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_trunc_cuda_float32 PASSED [0.0207s] [ 34%] 2025-12-04T14:02:33.4644870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_zero_cuda_float32 PASSED [0.0175s] [ 34%] 2025-12-04T14:02:33.4645087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__segment_reduce_lengths_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4645307Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__segment_reduce_offsets_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4645516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__unsafe_masked_index_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4645724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__upsample_bilinear2d_aa_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4645871Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_abs_cuda_float32 PASSED [0.0040s] [ 34%] 2025-12-04T14:02:33.4646013Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_addcdiv_cuda_float32 PASSED [0.0758s] [ 34%] 2025-12-04T14:02:33.4646175Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_addmm_decomposed_cuda_float32 PASSED [0.0755s] [ 34%] 2025-12-04T14:02:33.4646326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_addr_cuda_float32 PASSED [0.0267s] [ 34%] 2025-12-04T14:02:33.4646514Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_amin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4646707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_arange_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4646902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_argmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4647093Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_argmin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4647281Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_argwhere_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4647437Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_as_strided_cuda_float32 PASSED [0.0076s] [ 34%] 2025-12-04T14:02:33.4647599Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_as_strided_partial_views_cuda_float32 XFAIL [0.0047s] [ 34%] 2025-12-04T14:02:33.4647746Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_asinh_cuda_float32 PASSED [0.9376s] [ 34%] 2025-12-04T14:02:33.4647885Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_atan_cuda_float32 PASSED [0.0046s] [ 34%] 2025-12-04T14:02:33.4648029Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_atanh_cuda_float32 PASSED [0.0042s] [ 34%] 2025-12-04T14:02:33.4648223Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_atleast_2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4648421Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bernoulli_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4648612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bincount_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4648768Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bitwise_not_cuda_int64 PASSED [0.0051s] [ 34%] 2025-12-04T14:02:33.4648920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bitwise_or_cuda_int64 PASSED [0.0185s] [ 34%] 2025-12-04T14:02:33.4649101Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bmm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4649316Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_broadcast_tensors_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4649497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cat_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4649685Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cdist_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4649870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cfloat_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4650065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cholesky_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4650280Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_chunk_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4650425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_clamp_max_cuda_float32 PASSED [0.0239s] [ 34%] 2025-12-04T14:02:33.4650590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_clamp_min_cuda_float32 PASSED [0.0251s] [ 34%] 2025-12-04T14:02:33.4650784Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_column_stack_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4650983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_combinations_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4651176Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_conj_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4651332Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_conj_physical_cuda_float32 PASSED [0.0039s] [ 34%] 2025-12-04T14:02:33.4651526Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_count_nonzero_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4651713Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cov_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4651900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cross_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4652106Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cumulative_trapezoid_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4652307Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diagonal_scatter_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4652488Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diff_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4652638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_digamma_cuda_float32 PASSED [0.9323s] [ 34%] 2025-12-04T14:02:33.4652819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_dot_cuda_float32 SKIPPED [0.0015s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4653027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_dstack_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4653225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_empty_permuted_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4653369Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_eq_cuda_float32 PASSED [0.0279s] [ 34%] 2025-12-04T14:02:33.4653514Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_erf_cuda_float32 PASSED [0.0041s] [ 34%] 2025-12-04T14:02:33.4653667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_erfinv_cuda_float32 PASSED [0.0040s] [ 34%] 2025-12-04T14:02:33.4653860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_expand_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4654011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_exponential_cuda_float32 PASSED [0.0093s] [ 34%] 2025-12-04T14:02:33.4654203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fliplr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4654385Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_float_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4654587Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_float_power_cuda_float32 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 34%] 2025-12-04T14:02:33.4654748Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_frac_cuda_float32 PASSED [0.0047s] [ 34%] 2025-12-04T14:02:33.4654948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_grid_sampler_2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4655135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_half_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4655338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_hash_tensor_cuda_float32 SKIPPED [0.0009s] (No inplace variable 
for this op) [ 34%] 2025-12-04T14:02:33.4655492Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_heaviside_cuda_float32 PASSED [0.0296s] [ 34%] 2025-12-04T14:02:33.4655640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_copy_cuda_float32 PASSED [0.0169s] [ 34%] 2025-12-04T14:02:33.4655793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_put_cuda_float32 PASSED [0.0201s] [ 34%] 2025-12-04T14:02:33.4655987Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_select_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4656175Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_int_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4656362Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_isclose_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4656556Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_isfinite_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4656744Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_isin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4656929Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_isreal_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4657121Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_istft_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T14:02:33.4657352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_jiterator_4inputs_with_extra_args_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 
2025-12-04T14:02:33.4657495Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_lcm_cuda_int64 PASSED [0.0259s] [ 35%] 2025-12-04T14:02:33.4657694Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_cholesky_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4657901Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_cross_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4658091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_det_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4658288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_eig_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4658533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_ldl_solve_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 35%] 2025-12-04T14:02:33.4658726Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_lstsq_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4658935Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_lu_factor_ex_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4659147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_matrix_rank_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4659353Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_multi_dot_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4659553Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4659777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_norm_subgradients_at_zero_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4659986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_pinv_hermitian_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4660543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_qr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4660745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_slogdet_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4660940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_vecdot_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4661086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log_cuda_float32 PASSED [0.0050s] [ 35%] 2025-12-04T14:02:33.4661289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4661444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logical_or_cuda_float32 PASSED [0.0153s] [ 35%] 2025-12-04T14:02:33.4661594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logical_xor_cuda_float32 PASSED [0.0152s] [ 35%] 2025-12-04T14:02:33.4661790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logspace_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 
2025-12-04T14:02:33.4662020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logspace_tensor_overload_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4662157Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_lt_cuda_float32 PASSED [0.9641s] [ 35%] 2025-12-04T14:02:33.4662354Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_amin_cuda_float32 SKIPPED [0.0015s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4662572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_argmin_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4662774Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_cumprod_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4662965Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_norm_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4663166Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_select_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4663360Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_softmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4663555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_var_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4663762Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_matmul_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4663950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_maximum_cuda_float32 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4664142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_median_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4664367Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_meshgrid_variadic_tensors_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4664557Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_minimum_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4664737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mm_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4664921Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mode_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4665107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_movedim_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4665296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_msort_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4665490Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nanmedian_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4665674Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nansum_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4665877Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_native_batch_norm_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4666081Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_native_dropout_backward_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4666284Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_new_zeros_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4666431Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nextafter_cuda_float32 PASSED [0.9553s] [ 35%] 2025-12-04T14:02:33.4666654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_adaptive_avg_pool1d_cuda_float32 SKIPPED [0.0015s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4666886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_adaptive_avg_pool2d_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4667109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_adaptive_avg_pool3d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4667284Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_alpha_dropout_cuda_float32 PASSED [0.0295s] [ 35%] 2025-12-04T14:02:33.4667442Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_celu_cuda_float32 PASSED [0.0062s] [ 35%] 2025-12-04T14:02:33.4667661Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_channel_shuffle_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4667864Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4668096Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv_transpose3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4668313Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_cosine_similarity_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4668525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_glu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4668741Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_group_norm_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4668911Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_hardsigmoid_cuda_float32 PASSED [0.0084s] [ 35%] 2025-12-04T14:02:33.4669137Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_hinge_embedding_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4669353Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_interpolate_area_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4669581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_interpolate_nearest_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4669788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_layer_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4670001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_pool2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 
2025-12-04T14:02:33.4670258Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4670506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_multilabel_margin_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4670720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_normalize_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4670931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pad_circular_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4671168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pad_reflect_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4671382Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pad_replicate_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4672586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pixel_shuffle_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4672803Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pixel_unshuffle_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4673014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_rms_norm_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4673179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_selu_cuda_float32 PASSED [0.0069s] [ 
35%] 2025-12-04T14:02:33.4673401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_softplus_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4673615Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_softshrink_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4673861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_triplet_margin_loss_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4674070Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_unfold_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4674290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_upsample_bilinear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4674487Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nonzero_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4674642Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_normal_in_place_cuda_float32 PASSED [0.0068s] [ 35%] 2025-12-04T14:02:33.4674843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_permute_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4675009Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0128s] [ 35%] 2025-12-04T14:02:33.4675164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_reciprocal_cuda_float32 PASSED [0.9458s] [ 35%] 2025-12-04T14:02:33.4675356Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_repeat_cuda_float32 SKIPPED [0.0015s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4675555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_repeat_interleave_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4675703Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_resize__cuda_float32 PASSED [0.0063s] [ 35%] 2025-12-04T14:02:33.4675886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_roll_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4676034Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_round_cuda_float32 PASSED [0.9317s] [ 35%] 2025-12-04T14:02:33.4676194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_round_decimals_neg_3_cuda_float32 PASSED [0.0077s] [ 35%] 2025-12-04T14:02:33.4676355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_cuda_float32 PASSED [0.3163s] [ 35%] 2025-12-04T14:02:33.4676514Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_amin_cuda_float32 PASSED [0.2737s] [ 35%] 2025-12-04T14:02:33.4676677Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_prod_cuda_float32 PASSED [0.2720s] [ 35%] 2025-12-04T14:02:33.4676888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_select_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4677037Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sigmoid_cuda_float32 PASSED [0.0061s] [ 35%] 2025-12-04T14:02:33.4677181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sign_cuda_float32 PASSED [0.0039s] [ 35%] 2025-12-04T14:02:33.4677390Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_bartlett_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4677616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_blackman_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4677820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_cosine_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4678039Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_exponential_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4678258Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_gaussian_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4678477Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_general_cosine_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4678623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sinc_cuda_float32 PASSED [0.1483s] [ 35%] 2025-12-04T14:02:33.4678816Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_slice_scatter_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4679022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4679218Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_airy_ai_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4679420Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_bessel_j0_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4679636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_hermite_polynomial_h_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4679834Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_i1e_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4680049Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_legendre_polynomial_p_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4680307Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_modified_bessel_i0_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4680523Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_modified_bessel_k1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4680764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_shifted_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4680999Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_shifted_chebyshev_polynomial_v_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4681210Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_zeta_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4681400Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_split_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 
2025-12-04T14:02:33.4681555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_squeeze_multiple_cuda_float32 PASSED [0.0117s] [ 35%] 2025-12-04T14:02:33.4681700Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sub_cuda_float32 PASSED [0.9427s] [ 35%] 2025-12-04T14:02:33.4681886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sum_cuda_float32 SKIPPED [0.0015s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4682086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_t_copy_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4682275Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_take_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4682426Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_tan_cuda_float32 PASSED [0.9420s] [ 35%] 2025-12-04T14:02:33.4682570Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_tanh_cuda_float32 PASSED [0.0062s] [ 35%] 2025-12-04T14:02:33.4682762Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_tensor_split_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4682958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_tensordot_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4683140Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_topk_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4683328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_trace_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4683524Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_transpose_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4683715Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_trapezoid_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4683910Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_triu_indices_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4684058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_true_divide_cuda_float32 PASSED [0.0161s] [ 35%] 2025-12-04T14:02:33.4684254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unbind_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4684440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unbind_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4684638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unravel_index_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4684829Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unsafe_split_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4685026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_vdot_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T14:02:33.4685228Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_view_as_complex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4685432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_view_as_real_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this 
op) [ 36%] 2025-12-04T14:02:33.4685619Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_view_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4685760Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_xlogy_cuda_float32 PASSED [0.0265s] [ 36%] 2025-12-04T14:02:33.4685948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_zeros_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4686152Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_zeros_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4686332Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_allclose_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4686501Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4686692Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4686870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4687045Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4687224Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4687393Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4687569Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4687735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4687915Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4688086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4688263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4688432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4688608Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4688778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4688948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4689130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4689296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4689470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_int64 SKIPPED [0.0009s] (No inplace variable for 
this op) [ 36%] 2025-12-04T14:02:33.4689646Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4689823Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4689989Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4690194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4690380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4690547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmin_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4690721Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4690899Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmin_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4691074Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4691249Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4691425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4691606Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_complex128 SKIPPED [0.0008s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4691785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4691955Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4692130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4692306Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4692487Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4692628Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_complex32 PASSED [0.0047s] [ 36%] 2025-12-04T14:02:33.4692764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_complex64 PASSED [0.0046s] [ 36%] 2025-12-04T14:02:33.4692903Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_float16 PASSED [0.0046s] [ 36%] 2025-12-04T14:02:33.4693032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_int32 PASSED [0.0046s] [ 36%] 2025-12-04T14:02:33.4693167Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_uint8 PASSED [0.0046s] [ 36%] 2025-12-04T14:02:33.4693329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_bfloat16 XFAIL [0.0039s] [ 36%] 2025-12-04T14:02:33.4693488Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_complex64 XFAIL [0.0038s] [ 36%] 2025-12-04T14:02:33.4693632Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_int8 XFAIL [0.0038s] [ 36%] 2025-12-04T14:02:33.4693801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_uint8 XFAIL [0.0039s] [ 36%] 2025-12-04T14:02:33.4693997Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_bfloat16 SKIPPED [0.9331s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4694182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_float16 SKIPPED [0.0015s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4694369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4694562Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4694743Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4694959Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T14:02:33.4695185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T14:02:33.4695412Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 
2025-12-04T14:02:33.4695538Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_float16 PASSED [0.0042s] [ 36%] 2025-12-04T14:02:33.4695663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_float64 PASSED [0.9272s] [ 36%] 2025-12-04T14:02:33.4695786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_bfloat16 PASSED [0.0051s] [ 36%] 2025-12-04T14:02:33.4696010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_complex128 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T14:02:33.4696226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T14:02:33.4696440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T14:02:33.4696660Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T14:02:33.4696786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_float64 PASSED [0.9331s] [ 36%] 2025-12-04T14:02:33.4697001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_int64 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T14:02:33.4697179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4697355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4697536Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4697709Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4697875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4698058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4698224Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4698404Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4698576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4698760Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4698936Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4699112Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4699282Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4699448Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4699619Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4699747Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_baddbmm_cuda_bfloat16 PASSED [0.0058s] [ 36%] 2025-12-04T14:02:33.4699875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_baddbmm_cuda_float16 PASSED [0.9438s] [ 36%] 2025-12-04T14:02:33.4700049Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bernoulli_cuda_float16 SKIPPED [0.0015s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4700263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_complex32 SKIPPED [0.0012s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4700433Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4700601Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4700735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_and_cuda_int16 PASSED [0.0076s] [ 36%] 2025-12-04T14:02:33.4700863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_and_cuda_int32 PASSED [0.0064s] [ 36%] 2025-12-04T14:02:33.4701002Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_left_shift_cuda_int64 PASSED [0.0064s] [ 36%] 2025-12-04T14:02:33.4701132Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_not_cuda_int16 PASSED [0.0037s] [ 36%] 2025-12-04T14:02:33.4701264Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_not_cuda_int32 PASSED [0.9394s] [ 36%] 
2025-12-04T14:02:33.4701390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_not_cuda_int8 PASSED [0.0060s] [ 36%] 2025-12-04T14:02:33.4701532Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_not_cuda_uint8 PASSED [0.0041s] [ 36%] 2025-12-04T14:02:33.4701672Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_right_shift_cuda_int16 PASSED [0.0067s] [ 36%] 2025-12-04T14:02:33.4701813Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_right_shift_cuda_int32 PASSED [0.0064s] [ 36%] 2025-12-04T14:02:33.4701966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_right_shift_cuda_uint8 PASSED [0.0064s] [ 36%] 2025-12-04T14:02:33.4702095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_bool PASSED [0.0064s] [ 36%] 2025-12-04T14:02:33.4702223Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_int16 PASSED [0.0064s] [ 36%] 2025-12-04T14:02:33.4702348Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_int32 PASSED [0.0063s] [ 36%] 2025-12-04T14:02:33.4702475Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_uint8 PASSED [0.0063s] [ 36%] 2025-12-04T14:02:33.4702664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4702844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4703020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4703205Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 
2025-12-04T14:02:33.4703373Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4703548Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4703712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4703880Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4704045Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4704206Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4704376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4704544Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4704711Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4704892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_shapes_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4705079Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4705267Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4705452Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4705645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4705825Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4706018Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4706193Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4706369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4706540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4706745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4706916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4707088Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4707268Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4707433Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4707600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4707764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4707949Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T14:02:33.4708131Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4708312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4708485Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4708662Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4708839Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4708999Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4709169Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4709332Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4709496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4709667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4709831Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4709991Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4710207Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4710337Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cauchy_cuda_bfloat16 PASSED [0.0056s] [ 37%] 2025-12-04T14:02:33.4710462Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cauchy_cuda_float16 PASSED [0.0053s] [ 37%] 2025-12-04T14:02:33.4710634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4710805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4710988Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4711156Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4711338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4711503Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4711669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4711793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_float16 PASSED [0.0030s] [ 37%] 2025-12-04T14:02:33.4711917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_float32 PASSED [0.9323s] [ 37%] 2025-12-04T14:02:33.4712082Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_int8 SKIPPED [0.0015s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4712245Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_uint8 SKIPPED [0.0013s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4712417Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4712583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_float16 SKIPPED [0.0012s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4712751Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4712913Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4713077Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4713245Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4713408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4713567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4713742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4713904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4714065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4714250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4714437Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_inverse_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4714621Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_inverse_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4714941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_solve_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4715134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_solve_cuda_float32 
SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4715300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4715471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4715646Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4715807Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4715939Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_bfloat16 PASSED [0.0096s] [ 37%] 2025-12-04T14:02:33.4716067Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_float32 PASSED [0.0075s] [ 37%] 2025-12-04T14:02:33.4716191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_int8 PASSED [0.0075s] [ 37%] 2025-12-04T14:02:33.4716314Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_bool PASSED [0.0074s] [ 37%] 2025-12-04T14:02:33.4716444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_float32 PASSED [0.0074s] [ 37%] 2025-12-04T14:02:33.4716567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_int64 PASSED [0.0074s] [ 37%] 2025-12-04T14:02:33.4716739Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4716908Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4717071Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4717250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4717429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4717605Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4717776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4717969Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4718146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4718336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4718509Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4718679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_complex_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4718848Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4719023Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4719187Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4719347Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4719497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_complex128 PASSED [0.9411s] [ 37%] 2025-12-04T14:02:33.4719632Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_complex64 PASSED [0.0046s] [ 37%] 2025-12-04T14:02:33.4719765Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_float16 PASSED [0.9311s] [ 37%] 2025-12-04T14:02:33.4719895Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_int16 PASSED [0.0046s] [ 37%] 2025-12-04T14:02:33.4720071Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4720282Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4720464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4720640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4720812Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4720982Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4721156Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4721330Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4721499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4721723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_copysign_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T14:02:33.4721894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4722086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4722261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4722443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4722612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4722732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_bfloat16 PASSED [0.0046s] [ 37%] 2025-12-04T14:02:33.4722953Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T14:02:33.4723087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_float32 PASSED [0.0038s] [ 37%] 2025-12-04T14:02:33.4723298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T14:02:33.4723509Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T14:02:33.4723733Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T14:02:33.4723954Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T14:02:33.4724171Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T14:02:33.4724383Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T14:02:33.4724555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4724734Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4724907Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4725080Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4725247Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4725413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4725576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4725738Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4725898Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4726067Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4726227Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4726395Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4726571Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4726733Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for 
this op) [ 37%] 2025-12-04T14:02:33.4726897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4727065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4727238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4727401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4727564Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4727750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_bfloat16 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 37%] 2025-12-04T14:02:33.4727928Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_complex128 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 37%] 2025-12-04T14:02:33.4728098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_int16 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 37%] 2025-12-04T14:02:33.4728270Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_bfloat16 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 37%] 2025-12-04T14:02:33.4728447Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_complex64 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 37%] 2025-12-04T14:02:33.4728617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_int64 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 37%] 2025-12-04T14:02:33.4728783Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_int8 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 37%] 2025-12-04T14:02:33.4728950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_uint8 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 37%] 2025-12-04T14:02:33.4729140Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4729336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4729519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4729703Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4729830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_bfloat16 PASSED [0.0032s] [ 37%] 2025-12-04T14:02:33.4729956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_float16 PASSED [0.9429s] [ 37%] 2025-12-04T14:02:33.4730226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T14:02:33.4730443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_int32 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T14:02:33.4730623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4730791Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4730956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T14:02:33.4731119Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4731295Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4731467Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4731636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4731824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4731992Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4732159Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4732330Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4732506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4732675Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_float32 
SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4732843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4733022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4733198Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4733378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4733558Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4733732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4733905Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4734074Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4734257Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4734425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4734599Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 
2025-12-04T14:02:33.4734789Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4734950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4735110Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4735331Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4735468Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_float16 PASSED [0.0063s] [ 38%] 2025-12-04T14:02:33.4735593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_float32 PASSED [0.9356s] [ 38%] 2025-12-04T14:02:33.4735819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_int32 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4736036Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4736251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4736420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dist_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4736589Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dist_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 
2025-12-04T14:02:33.4736757Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dist_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4736921Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dist_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4737060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_bfloat16 PASSED [0.0161s] [ 38%] 2025-12-04T14:02:33.4737200Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_float16 PASSED [0.0145s] [ 38%] 2025-12-04T14:02:33.4737338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_float32 PASSED [0.9435s] [ 38%] 2025-12-04T14:02:33.4737476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_float64 PASSED [0.0163s] [ 38%] 2025-12-04T14:02:33.4737611Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_int8 PASSED [0.0104s] [ 38%] 2025-12-04T14:02:33.4737753Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_bfloat16 PASSED [0.0062s] [ 38%] 2025-12-04T14:02:33.4737892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_float16 PASSED [0.0062s] [ 38%] 2025-12-04T14:02:33.4738126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4738274Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_bfloat16 PASSED [0.0079s] [ 38%] 2025-12-04T14:02:33.4738411Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_float32 PASSED [0.9442s] [ 38%] 2025-12-04T14:02:33.4738547Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_int32 PASSED [0.0100s] [ 38%] 2025-12-04T14:02:33.4738692Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_int8 PASSED [0.9578s] [ 38%] 2025-12-04T14:02:33.4738860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dot_cuda_complex64 SKIPPED [0.0015s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4739021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dot_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4739185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4739364Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4739532Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4739696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4739872Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4740036Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4740236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4740400Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4740564Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4740728Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4740893Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_einsum_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4741062Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_complex32 SKIPPED [0.0008s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4741229Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4741393Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4741553Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4741729Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4741901Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4742071Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4742255Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4742421Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_int8 SKIPPED 
[0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4742602Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4742805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4742986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4743163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4743339Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4743529Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4743703Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4743892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4744070Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4744246Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4744420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_int64 SKIPPED [0.0010s] (No inplace 
variable for this op) [ 38%] 2025-12-04T14:02:33.4744590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4744713Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_complex128 PASSED [0.0089s] [ 38%] 2025-12-04T14:02:33.4744834Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_float32 PASSED [0.0075s] [ 38%] 2025-12-04T14:02:33.4744951Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_int32 PASSED [0.0074s] [ 38%] 2025-12-04T14:02:33.4745066Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_int64 PASSED [0.9428s] [ 38%] 2025-12-04T14:02:33.4745181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_int8 PASSED [0.0099s] [ 38%] 2025-12-04T14:02:33.4745296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_uint8 PASSED [0.0077s] [ 38%] 2025-12-04T14:02:33.4745466Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4745634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4745798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4746008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4746128Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_float32 PASSED [0.9461s] [ 38%] 2025-12-04T14:02:33.4746352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_int16 SKIPPED [0.0015s] (Op promotes to 
float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4746562Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4746697Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_bfloat16 PASSED [0.0064s] [ 38%] 2025-12-04T14:02:33.4746819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_float16 PASSED [0.9410s] [ 38%] 2025-12-04T14:02:33.4747031Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4747242Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_bool SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4747467Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4747679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_uint8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4747803Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_bfloat16 PASSED [1.0729s] [ 38%] 2025-12-04T14:02:33.4747942Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_float16 PASSED [0.9342s] [ 38%] 2025-12-04T14:02:33.4748063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_float32 PASSED [0.0059s] [ 38%] 2025-12-04T14:02:33.4748276Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float 
input) [ 38%] 2025-12-04T14:02:33.4748395Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_float16 PASSED [0.9429s] [ 38%] 2025-12-04T14:02:33.4748604Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_int32 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4748812Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_int8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4749022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4749194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4749370Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4749540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4749709Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4749876Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4750047Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4750254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 
2025-12-04T14:02:33.4750438Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4750566Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_bfloat16 PASSED [0.0041s] [ 38%] 2025-12-04T14:02:33.4750792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4750917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_float16 PASSED [0.0032s] [ 38%] 2025-12-04T14:02:33.4751130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T14:02:33.4751264Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exponential_cuda_float32 PASSED [0.0052s] [ 38%] 2025-12-04T14:02:33.4751444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4751610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4751773Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4751946Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4752114Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4752279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_int32 SKIPPED [0.0010s] (No inplace 
variable for this op) [ 38%] 2025-12-04T14:02:33.4752445Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4752610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4752776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4752938Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4753104Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4753281Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4753450Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4753618Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T14:02:33.4753780Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4753957Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4754132Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4755234Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4755406Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4755579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4755764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4755935Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4756111Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4756284Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4756465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4756631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4756801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4756976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4757141Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4757309Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4757481Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4757646Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4757821Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4757990Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4758154Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4758320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4758484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4758650Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4758817Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4758981Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_int16 
SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4759145Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4759317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4759491Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4759660Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4759835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4760011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4760223Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4760390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4760574Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4760745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4760922Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 
2025-12-04T14:02:33.4761087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4761253Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4761423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4761589Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4761754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4761927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4762100Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4762266Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4762434Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4762600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4762771Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4762942Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4763109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4763289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4763457Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4763624Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4763800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4763965Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4764132Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4764300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4764485Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4764652Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4764819Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4764948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_int32 PASSED [0.9456s] [ 39%] 2025-12-04T14:02:33.4765122Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_complex128 SKIPPED [0.0015s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4765290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4765458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4765622Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4765786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4765952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4766117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4766286Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4766449Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4766617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 
2025-12-04T14:02:33.4766785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4766950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4767113Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4767288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4767461Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4767625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4767798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4767962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4768126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4768287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_int32 SKIPPED [0.0008s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4768528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 39%] 2025-12-04T14:02:33.4768707Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_float32 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 39%] 2025-12-04T14:02:33.4768943Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 39%] 2025-12-04T14:02:33.4769165Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 39%] 2025-12-04T14:02:33.4769285Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_int16 PASSED [0.9394s] [ 39%] 2025-12-04T14:02:33.4769406Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_int64 PASSED [0.0047s] [ 39%] 2025-12-04T14:02:33.4769524Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_int8 PASSED [0.0035s] [ 39%] 2025-12-04T14:02:33.4769642Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_uint8 PASSED [0.9424s] [ 39%] 2025-12-04T14:02:33.4769773Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_float16 PASSED [0.0174s] [ 39%] 2025-12-04T14:02:33.4769902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_int16 PASSED [0.0101s] [ 39%] 2025-12-04T14:02:33.4770027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_int32 PASSED [0.0099s] [ 39%] 2025-12-04T14:02:33.4770192Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_int8 PASSED [0.0097s] [ 39%] 2025-12-04T14:02:33.4770359Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4770519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_int32 SKIPPED [0.0009s] (No inplace 
variable for this op) [ 39%] 2025-12-04T14:02:33.4770679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4770845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4771005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4771128Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmod_cuda_bfloat16 PASSED [0.0076s] [ 39%] 2025-12-04T14:02:33.4771267Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmod_cuda_float64 PASSED [0.9426s] [ 39%] 2025-12-04T14:02:33.4771384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmod_cuda_int16 PASSED [0.0096s] [ 39%] 2025-12-04T14:02:33.4771500Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmod_cuda_uint8 PASSED [0.9415s] [ 39%] 2025-12-04T14:02:33.4771620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frac_cuda_bfloat16 PASSED [0.0053s] [ 39%] 2025-12-04T14:02:33.4771751Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frac_cuda_float32 PASSED [0.0033s] [ 39%] 2025-12-04T14:02:33.4771870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frac_cuda_float64 PASSED [0.9570s] [ 39%] 2025-12-04T14:02:33.4772037Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frexp_cuda_bfloat16 SKIPPED [0.0015s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4772203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frexp_cuda_float16 SKIPPED [0.0012s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4772381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 
2025-12-04T14:02:33.4772540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_bool SKIPPED [0.0012s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4772707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4772884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4773046Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4773220Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4773389Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4773559Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4773727Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4773891Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4774058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4774227Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4774393Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4774554Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4774717Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4774833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gcd_cuda_int8 PASSED [0.0078s] [ 39%] 2025-12-04T14:02:33.4774950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gcd_cuda_uint8 PASSED [0.2788s] [ 39%] 2025-12-04T14:02:33.4775067Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_bfloat16 PASSED [0.0075s] [ 39%] 2025-12-04T14:02:33.4775194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int16 PASSED [0.0067s] [ 39%] 2025-12-04T14:02:33.4775310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int8 PASSED [0.0067s] [ 39%] 2025-12-04T14:02:33.4775436Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_float32 PASSED [0.0050s] [ 39%] 2025-12-04T14:02:33.4775573Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_float64 PASSED [0.0048s] [ 39%] 2025-12-04T14:02:33.4775697Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_int16 PASSED [0.0052s] [ 39%] 2025-12-04T14:02:33.4775820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_int32 PASSED [0.0052s] [ 39%] 2025-12-04T14:02:33.4775942Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_uint8 PASSED [0.0052s] [ 39%] 2025-12-04T14:02:33.4776113Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geqrf_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this 
op) [ 39%] 2025-12-04T14:02:33.4776291Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geqrf_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T14:02:33.4776456Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geqrf_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4776631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4776808Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4776977Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4777144Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4777311Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4777488Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4777666Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_2d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4777783Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_bool PASSED [0.0063s] [ 40%] 2025-12-04T14:02:33.4777902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_float64 PASSED [0.0066s] [ 40%] 2025-12-04T14:02:33.4778015Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_uint8 PASSED [0.0066s] [ 40%] 
2025-12-04T14:02:33.4778185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4778351Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4778512Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4778688Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4778856Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4779030Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4779210Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4779381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4779564Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4779735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4779904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4780035Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_bfloat16 PASSED [0.0092s] [ 40%] 2025-12-04T14:02:33.4780204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_bool PASSED [0.0091s] [ 40%] 2025-12-04T14:02:33.4780344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_int16 PASSED [0.0086s] [ 40%] 2025-12-04T14:02:33.4780468Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_int32 PASSED [0.0086s] [ 40%] 2025-12-04T14:02:33.4780591Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_int64 PASSED [0.0079s] [ 40%] 2025-12-04T14:02:33.4780767Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4780930Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4781102Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4781272Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4781439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4781605Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4781770Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4781940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_complex32 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4782105Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4782271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4782435Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4782557Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_bfloat16 PASSED [0.9352s] [ 40%] 2025-12-04T14:02:33.4782675Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_float32 PASSED [0.0053s] [ 40%] 2025-12-04T14:02:33.4782793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_float64 PASSED [0.0036s] [ 40%] 2025-12-04T14:02:33.4783005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 40%] 2025-12-04T14:02:33.4783230Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_int32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 40%] 2025-12-04T14:02:33.4783439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 40%] 2025-12-04T14:02:33.4783621Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_imag_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4783788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_imag_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4783916Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_bfloat16 PASSED [0.0101s] [ 40%] 2025-12-04T14:02:33.4784043Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_float32 PASSED [0.9451s] [ 40%] 2025-12-04T14:02:33.4784170Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_float64 PASSED [0.0114s] [ 40%] 2025-12-04T14:02:33.4784304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_int32 PASSED [0.0089s] [ 40%] 2025-12-04T14:02:33.4784427Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_uint8 PASSED [0.0086s] [ 40%] 2025-12-04T14:02:33.4784553Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_bool PASSED [0.9454s] [ 40%] 2025-12-04T14:02:33.4784679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_float32 PASSED [0.0062s] [ 40%] 2025-12-04T14:02:33.4784811Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_int8 PASSED [0.0045s] [ 40%] 2025-12-04T14:02:33.4784935Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_int16 PASSED [0.0064s] [ 40%] 2025-12-04T14:02:33.4785058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_int16 PASSED [0.0053s] [ 40%] 2025-12-04T14:02:33.4785180Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_uint8 PASSED [0.0050s] [ 40%] 2025-12-04T14:02:33.4785319Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_bfloat16 PASSED [0.0071s] [ 40%] 2025-12-04T14:02:33.4785457Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_float16 PASSED [0.9439s] [ 40%] 2025-12-04T14:02:33.4785590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_int32 PASSED [0.0090s] [ 40%] 2025-12-04T14:02:33.4785725Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_uint8 PASSED [0.0072s] [ 40%] 2025-12-04T14:02:33.4785857Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_int64 PASSED [0.0070s] [ 40%] 2025-12-04T14:02:33.4785996Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_bfloat16 PASSED [0.9449s] [ 40%] 2025-12-04T14:02:33.4786128Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_int64 PASSED [0.0097s] [ 40%] 2025-12-04T14:02:33.4786261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_int8 PASSED [0.0076s] [ 40%] 2025-12-04T14:02:33.4786392Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_uint8 PASSED [0.0074s] [ 40%] 2025-12-04T14:02:33.4786530Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_float32 PASSED [0.9456s] [ 40%] 2025-12-04T14:02:33.4786703Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4786880Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_complex32 SKIPPED [0.0013s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4787066Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4787238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_int16 SKIPPED [0.0012s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4787408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4787588Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_inner_cuda_complex128 
SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4787754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_inner_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4787917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_inner_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4788078Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4788252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4788415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4788573Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4788742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4788908Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4789081Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4789246Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4789407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4789573Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4789743Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4789918Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4790085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4790292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4790456Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4790616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4790775Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4790935Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4791120Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4791289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4791465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_float16 SKIPPED [0.0010s] (No 
inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4791630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4791791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4791953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4792115Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4792293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4792463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4792640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4792805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4792971Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4793134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4793305Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_istft_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4793470Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4793637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4793802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4793965Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4794124Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4794316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4794516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4794709Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4794902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4795110Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4795308Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 
2025-12-04T14:02:33.4795511Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4795723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4795925Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4796107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4796302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4796484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4796677Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4796889Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4797080Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4797263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4797445Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4797621Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4797799Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4797968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4798137Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4798301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T14:02:33.4798464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4798626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4798786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4798946Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4799126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4799295Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_float16 SKIPPED 
[0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4799462Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4799637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4799803Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4799967Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4800084Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lcm_cuda_int8 PASSED [0.0101s] [ 41%] 2025-12-04T14:02:33.4800247Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lcm_cuda_uint8 PASSED [0.2998s] [ 41%] 2025-12-04T14:02:33.4800382Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_float32 PASSED [0.0081s] [ 41%] 2025-12-04T14:02:33.4800598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_int32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4800824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4800945Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_bfloat16 PASSED [0.0073s] [ 41%] 2025-12-04T14:02:33.4801058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_bool PASSED [0.0063s] [ 41%] 2025-12-04T14:02:33.4801173Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_int16 PASSED [0.0066s] [ 41%] 2025-12-04T14:02:33.4801286Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_int32 PASSED [0.0066s] [ 41%] 2025-12-04T14:02:33.4801399Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_int64 PASSED [0.0066s] [ 41%] 2025-12-04T14:02:33.4801510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_int8 PASSED [0.0066s] [ 41%] 2025-12-04T14:02:33.4801624Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_uint8 PASSED [0.0066s] [ 41%] 2025-12-04T14:02:33.4801745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_bfloat16 PASSED [0.0085s] [ 41%] 2025-12-04T14:02:33.4801868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_complex32 PASSED [0.9519s] [ 41%] 2025-12-04T14:02:33.4801990Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_complex64 PASSED [0.0161s] [ 41%] 2025-12-04T14:02:33.4802109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_float16 PASSED [0.0087s] [ 41%] 2025-12-04T14:02:33.4802233Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_float16 PASSED [0.0040s] [ 41%] 2025-12-04T14:02:33.4802353Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_float32 PASSED [0.9497s] [ 41%] 2025-12-04T14:02:33.4802572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_int32 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4802785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_int8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4803000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4803198Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4803378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4803567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cond_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4803746Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4803919Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4804089Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4804266Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4804440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_det_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4804612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_det_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4804792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_det_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4807175Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4807366Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4807554Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4807736Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4807917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4808090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eig_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4808265Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eig_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4808445Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigh_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4808628Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigvals_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4808818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigvalsh_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4809065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_householder_product_cuda_complex128 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 41%] 2025-12-04T14:02:33.4809309Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_householder_product_cuda_complex64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't 
currently work on the ROCm stack) [ 41%] 2025-12-04T14:02:33.4809512Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4809686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4809862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_ex_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4810061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4810290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_ex_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4810517Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_solve_cuda_complex64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 41%] 2025-12-04T14:02:33.4810759Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_solve_cuda_float64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 41%] 2025-12-04T14:02:33.4810934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lstsq_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4811129Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lstsq_grad_oriented_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4811325Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4811503Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4811674Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4811856Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_solve_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4812039Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4812233Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4812418Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_norm_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4812601Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4812787Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4812986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_hermitian_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4813184Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_hermitian_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4813363Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_multi_dot_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 
41%] 2025-12-04T14:02:33.4813541Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4813735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4813911Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4814118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_subgradients_at_zero_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4814314Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4814490Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4814680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_hermitian_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4814870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_hermitian_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4815105Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_singular_cuda_complex64 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 41%] 2025-12-04T14:02:33.4815323Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_singular_cuda_float64 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 41%] 2025-12-04T14:02:33.4815511Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_qr_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4815681Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_qr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4815867Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_slogdet_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4816047Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_slogdet_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4816223Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4816405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_ex_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4816587Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_ex_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4816776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_triangular_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4816958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svd_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4817146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorinv_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4817326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorinv_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4817518Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorsolve_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4817701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorsolve_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4817885Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vander_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4818058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vander_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4818241Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vecdot_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4818425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vecdot_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4818610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4818797Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4818972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4819159Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4819328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4819500Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4819680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4819848Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4820040Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4820324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4820515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4820706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4820894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T14:02:33.4821017Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_bfloat16 PASSED [0.9641s] [ 41%] 2025-12-04T14:02:33.4821139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_float32 PASSED [0.0053s] [ 41%] 2025-12-04T14:02:33.4821355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4821570Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_bool SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4821793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4821918Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_float16 PASSED [0.0032s] [ 41%] 2025-12-04T14:02:33.4822053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_float32 PASSED [0.9413s] [ 41%] 2025-12-04T14:02:33.4822268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4822483Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4822715Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4822841Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_float64 PASSED [0.9404s] [ 41%] 2025-12-04T14:02:33.4823053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_int8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4823189Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_bfloat16 PASSED [0.9403s] [ 41%] 2025-12-04T14:02:33.4823399Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_bool SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 
2025-12-04T14:02:33.4823622Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_complex128 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4823852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4823974Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_float16 PASSED [0.9454s] [ 41%] 2025-12-04T14:02:33.4824095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_float32 PASSED [0.0055s] [ 41%] 2025-12-04T14:02:33.4824214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_float64 PASSED [0.0039s] [ 41%] 2025-12-04T14:02:33.4824428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T14:02:33.4824639Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 42%] 2025-12-04T14:02:33.4824770Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_normal_cuda_float64 PASSED [0.0054s] [ 42%] 2025-12-04T14:02:33.4824948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4825142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4825328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4825501Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4825676Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4825855Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logcumsumexp_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4826038Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logcumsumexp_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4826223Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logcumsumexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4826398Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logdet_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4826529Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_float32 PASSED [0.9488s] [ 42%] 2025-12-04T14:02:33.4826670Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_int32 PASSED [0.0083s] [ 42%] 2025-12-04T14:02:33.4826801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_int64 PASSED [0.9492s] [ 42%] 2025-12-04T14:02:33.4826927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_bool PASSED [0.0054s] [ 42%] 2025-12-04T14:02:33.4827065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_complex128 PASSED [0.0039s] [ 42%] 2025-12-04T14:02:33.4827195Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_float32 PASSED [0.9422s] [ 42%] 2025-12-04T14:02:33.4827344Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_float64 PASSED [0.0055s] [ 42%] 2025-12-04T14:02:33.4827476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_complex128 PASSED [0.2719s] [ 42%] 2025-12-04T14:02:33.4827606Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_float32 PASSED [0.0061s] [ 42%] 2025-12-04T14:02:33.4827739Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_int16 PASSED [0.0058s] [ 42%] 2025-12-04T14:02:33.4827866Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_int32 PASSED [0.0056s] [ 42%] 2025-12-04T14:02:33.4827991Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_bool PASSED [0.0057s] [ 42%] 2025-12-04T14:02:33.4828125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_complex64 PASSED [0.2714s] [ 42%] 2025-12-04T14:02:33.4828254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_float32 PASSED [0.0060s] [ 42%] 2025-12-04T14:02:33.4828383Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_int16 PASSED [0.0057s] [ 42%] 2025-12-04T14:02:33.4828509Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_int8 PASSED [0.0056s] [ 42%] 2025-12-04T14:02:33.4828629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_float16 PASSED [0.0068s] [ 42%] 2025-12-04T14:02:33.4828750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_float64 PASSED [0.0060s] [ 42%] 2025-12-04T14:02:33.4828927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4829103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 
42%] 2025-12-04T14:02:33.4829275Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4829448Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4829618Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4829814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4830009Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4830277Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4830470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4830670Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4830860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4831046Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4831220Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_float64 SKIPPED [0.0010s] (No inplace 
variable for this op) [ 42%] 2025-12-04T14:02:33.4831401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4831572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4831737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4831919Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4832087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4832251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4832413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4832536Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_float64 PASSED [0.0067s] [ 42%] 2025-12-04T14:02:33.4832651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_int32 PASSED [0.0066s] [ 42%] 2025-12-04T14:02:33.4832767Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_int64 PASSED [0.0066s] [ 42%] 2025-12-04T14:02:33.4832882Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_int8 PASSED [0.0065s] [ 42%] 2025-12-04T14:02:33.4833045Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4833219Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_solve_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4833390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_solve_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4833560Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_unpack_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4833724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4833884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4834046Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4834218Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4834376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4834537Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4834706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4834865Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4835040Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for 
this op) [ 42%] 2025-12-04T14:02:33.4835215Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4835396Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4835565Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4835734Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4835916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4836085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4836253Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4836434Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4836608Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4836785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4836958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4837138Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4837315Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4837491Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4837667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumsum_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4837800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_bfloat16 PASSED [0.0075s] [ 42%] 2025-12-04T14:02:33.4837933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_complex32 PASSED [0.0073s] [ 42%] 2025-12-04T14:02:33.4838062Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_float64 PASSED [0.0073s] [ 42%] 2025-12-04T14:02:33.4838200Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_int32 PASSED [0.0071s] [ 42%] 2025-12-04T14:02:33.4838326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_int64 PASSED [0.9432s] [ 42%] 2025-12-04T14:02:33.4838452Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_int8 PASSED [0.0097s] [ 42%] 2025-12-04T14:02:33.4838645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_log_softmax_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4838827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logaddexp_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4839009Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4839187Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_mean_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4839374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_mean_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4839545Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_mean_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4839723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_median_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4839906Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4840135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_normalize_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4840309Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4840479Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4840651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4840822Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_int16 SKIPPED [0.0008s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4840997Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4841164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4841296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_bool PASSED [0.9633s] [ 42%] 2025-12-04T14:02:33.4841430Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_float64 PASSED [0.0065s] [ 42%] 2025-12-04T14:02:33.4841566Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_int16 PASSED [0.0047s] [ 42%] 2025-12-04T14:02:33.4841696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_int32 PASSED [0.0045s] [ 42%] 2025-12-04T14:02:33.4841828Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_uint8 PASSED [0.0044s] [ 42%] 2025-12-04T14:02:33.4842004Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4842181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4842374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4842547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4842747Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4842914Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4843085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4843253Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4843444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4843614Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4843785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4843967Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4844134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4844305Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4844471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4844638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4844804Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4844972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4845139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matmul_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4845306Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matmul_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4845478Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matrix_exp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4845647Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4845819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4845986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4846188Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_pool2d_with_indices_backward_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4846397Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_pool2d_with_indices_backward_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4846582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 42%] 2025-12-04T14:02:33.4846769Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4846953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4847135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4847320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4847516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4847698Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4847869Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_bfloat16 SKIPPED [0.0008s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4848045Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4848214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4848380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4848546Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4848710Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4848875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4849042Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4849204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4849368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4849533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4849697Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4849860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4850026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4850255Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4850454Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4850644Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4850844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4851033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4851219Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4851406Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4851603Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4851790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4851986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4852196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4852388Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4852579Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4852768Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4852956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4853124Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4853295Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4853463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4853644Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4853823Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4854011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4854196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4854384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 
2025-12-04T14:02:33.4854578Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4854762Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4854953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4855125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4855290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4855453Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4855630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4855791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4855956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4856126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4856286Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4856445Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4856614Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4856774Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4856940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4857102Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4857261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4857421Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4857581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4857747Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4857919Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4858091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4858263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for 
this op) [ 43%] 2025-12-04T14:02:33.4858430Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4858612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4858776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4858953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4859118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4859279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4859401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_bfloat16 PASSED [0.0059s] [ 43%] 2025-12-04T14:02:33.4859518Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_bool PASSED [0.0057s] [ 43%] 2025-12-04T14:02:33.4859650Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_float16 PASSED [0.0057s] [ 43%] 2025-12-04T14:02:33.4859829Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_multinomial_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4859990Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4860201Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4860362Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4860508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16 PASSED [0.0153s] [ 43%] 2025-12-04T14:02:33.4860745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int16 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 43%] 2025-12-04T14:02:33.4860890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_bfloat16 PASSED [0.0141s] [ 43%] 2025-12-04T14:02:33.4861035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float16 PASSED [0.0137s] [ 43%] 2025-12-04T14:02:33.4861268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 43%] 2025-12-04T14:02:33.4861413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_bfloat16 PASSED [0.9580s] [ 43%] 2025-12-04T14:02:33.4861540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_bool PASSED [0.0056s] [ 43%] 2025-12-04T14:02:33.4861667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_int32 PASSED [0.0038s] [ 43%] 2025-12-04T14:02:33.4861838Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4862011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4862180Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) 
[ 43%] 2025-12-04T14:02:33.4862347Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4862531Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4862703Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4862870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4863049Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4863217Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4863384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4863551Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4863727Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4863890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4864053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4864232Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4864403Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4864572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4864735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4864907Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4865073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4865238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4865400Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4865564Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4865728Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4865908Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_batch_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4866099Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_dropout_backward_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4866288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_dropout_backward_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4866470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_layer_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4866659Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_layer_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4866778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_bool PASSED [0.0068s] [ 43%] 2025-12-04T14:02:33.4866896Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_float64 PASSED [0.0066s] [ 43%] 2025-12-04T14:02:33.4867020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_int16 PASSED [0.0066s] [ 43%] 2025-12-04T14:02:33.4867135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_int32 PASSED [0.0067s] [ 43%] 2025-12-04T14:02:33.4867249Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_int64 PASSED [0.0066s] [ 43%] 2025-12-04T14:02:33.4867361Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_uint8 PASSED [0.0066s] [ 43%] 2025-12-04T14:02:33.4867485Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_complex64 PASSED [0.0042s] [ 43%] 2025-12-04T14:02:33.4867610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_int16 PASSED [0.9509s] [ 43%] 2025-12-04T14:02:33.4867724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_int8 PASSED [0.0050s] [ 43%] 2025-12-04T14:02:33.4867895Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4868071Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4868256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4868439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4868621Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4868798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4868969Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4869143Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4869310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T14:02:33.4869477Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4869641Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4869816Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4869980Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4870192Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4870356Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4870548Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4870717Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4870888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4871066Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4871271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4871475Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4871676Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4871891Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4872092Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4872316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4872517Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4872719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4872920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4873119Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4873321Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4873472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_alpha_dropout_cuda_float16 PASSED [0.0163s] [ 44%] 2025-12-04T14:02:33.4873664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4873855Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4874044Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4874233Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_batch_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4874422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_batch_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4874633Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4874850Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4875038Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_bilinear_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4875239Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_bilinear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4875443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4875643Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4875863Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4876020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_celu_cuda_bfloat16 PASSED [0.0045s] [ 44%] 2025-12-04T14:02:33.4876160Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_celu_cuda_float64 PASSED [0.0041s] [ 44%] 2025-12-04T14:02:33.4876374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4876567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4876762Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4876948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv1d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4877139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4877324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4877515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv3d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4877700Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv3d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 
44%] 2025-12-04T14:02:33.4877885Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4878070Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4878269Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4878470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4878669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4878881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4879085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4879287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4879494Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4879690Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4879888Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4880129Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4880332Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4880532Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_similarity_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4880743Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_similarity_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4880943Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_similarity_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4881143Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cross_entropy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4881291Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout2d_cuda_float16 PASSED [0.0117s] [ 44%] 2025-12-04T14:02:33.4881439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout3d_cuda_float32 PASSED [0.0133s] [ 44%] 2025-12-04T14:02:33.4881584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout_cuda_float64 PASSED [0.0140s] [ 44%] 2025-12-04T14:02:33.4881723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_elu_cuda_float64 PASSED [0.0046s] [ 44%] 2025-12-04T14:02:33.4881901Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32 PASSED [0.0089s] [ 44%] 2025-12-04T14:02:33.4882075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float64 PASSED [0.9538s] [ 44%] 2025-12-04T14:02:33.4882252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_bool PASSED [0.0090s] [ 44%] 2025-12-04T14:02:33.4882431Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float16 PASSED [0.9745s] [ 44%] 2025-12-04T14:02:33.4882609Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [0.0087s] [ 44%] 2025-12-04T14:02:33.4882786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_uint8 PASSED [0.9639s] [ 44%] 2025-12-04T14:02:33.4883002Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float16 SKIPPED [0.0015s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4883203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_gaussian_nll_loss_cuda_bfloat16 SKIPPED [0.0013s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4883384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_glu_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4883578Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_glu_cuda_float16 SKIPPED [0.0012s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4883758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_glu_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 
2025-12-04T14:02:33.4883952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_grid_sample_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4884144Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_group_norm_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4884344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_group_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4884535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardshrink_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4884733Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardshrink_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4884924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardshrink_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4885075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardsigmoid_cuda_float32 PASSED [0.0064s] [ 44%] 2025-12-04T14:02:33.4885266Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardswish_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4885454Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4885639Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4885843Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hinge_embedding_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4886033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_huber_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4886224Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_huber_loss_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4886418Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_instance_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4886613Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_instance_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4886806Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_instance_norm_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4887006Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_area_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4887214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_area_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4887412Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_area_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4887608Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_area_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4887822Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bicubic_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4888026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bilinear_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4888229Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bilinear_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4888451Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_linear_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4888652Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_linear_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4888875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4889088Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4889290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4889477Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_kl_div_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4889665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_l1_loss_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4889854Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_l1_loss_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4890044Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_layer_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4890261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_layer_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4890414Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_leaky_relu_cuda_bfloat16 PASSED [0.0114s] [ 44%] 2025-12-04T14:02:33.4890561Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_leaky_relu_cuda_float16 PASSED [0.0109s] [ 44%] 2025-12-04T14:02:33.4890752Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4890944Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4891131Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4891315Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4891515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4891715Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_local_response_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 
2025-12-04T14:02:33.4891920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_logsigmoid_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4892123Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4892323Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4892524Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4892734Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4892924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4893126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4893318Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4893507Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4893695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4893883Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4894071Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool3d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4894264Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4894463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_grad_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4894663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool2d_grad_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4894854Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4895054Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T14:02:33.4895198Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mish_cuda_float16 PASSED [0.0049s] [ 45%] 2025-12-04T14:02:33.4895416Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multi_head_attention_forward_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4895629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multi_margin_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4895827Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multi_margin_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4896042Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4896247Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4896437Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_nll_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4896633Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4896857Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4897048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4897252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4897449Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4897642Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4897833Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4898026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4898219Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4898408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4898595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4898788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4898983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4899170Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4899357Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4899552Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4899757Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4899954Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4900208Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4900428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4900638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4900843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4901060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4901260Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4901476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4901679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4901877Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4902076Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4902276Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4902473Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4902657Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pdist_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4902856Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4903053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4903247Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4903439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4903629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4903819Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4904031Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4904229Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4904425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4904636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4904833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4905028Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4905226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4905406Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4905590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4905779Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4905960Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4906142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4906319Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4906509Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4906696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4906884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4907023Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rrelu_cuda_float64 PASSED [0.0062s] [ 45%] 2025-12-04T14:02:33.4907238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4907376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_selu_cuda_bfloat16 PASSED [0.9544s] [ 45%] 2025-12-04T14:02:33.4907517Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_selu_cuda_float16 PASSED [0.0071s] [ 45%] 2025-12-04T14:02:33.4907670Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_silu_complex_cuda_complex128 PASSED [0.0040s] [ 45%] 2025-12-04T14:02:33.4907808Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_silu_cuda_bfloat16 PASSED [0.9525s] [ 45%] 2025-12-04T14:02:33.4907945Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_silu_cuda_float64 PASSED [0.0059s] [ 45%] 2025-12-04T14:02:33.4908152Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_smooth_l1_loss_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4908346Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_smooth_l1_loss_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4908546Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_smooth_l1_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4908742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_soft_margin_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4908940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_soft_margin_loss_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4909135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_soft_margin_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4909333Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4909531Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 
2025-12-04T14:02:33.4909737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4909931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4910154Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softplus_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4910345Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softplus_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4910533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softplus_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4910723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softshrink_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4910913Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4911097Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4911289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4911474Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4911666Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4911862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4912008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_threshold_cuda_float16 PASSED [0.0062s] [ 45%] 2025-12-04T14:02:33.4912153Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_threshold_cuda_float64 PASSED [0.0053s] [ 45%] 2025-12-04T14:02:33.4912308Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_threshold_cuda_int16 PASSED [0.0052s] [ 45%] 2025-12-04T14:02:33.4912451Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_threshold_cuda_int8 PASSED [0.0053s] [ 45%] 2025-12-04T14:02:33.4912655Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4912869Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4913091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4913311Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4913540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 
45%] 2025-12-04T14:02:33.4913722Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_unfold_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4913919Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_unfold_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4914103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_unfold_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4914304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_bilinear_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4914503Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_nearest_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4914672Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4914846Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4915016Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4915182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4915337Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_bool SKIPPED [0.0007s] (Only runs on cpu) [ 45%] 2025-12-04T14:02:33.4915493Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_int32 SKIPPED [0.0005s] (Only runs on cpu) [ 45%] 2025-12-04T14:02:33.4915647Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_int64 SKIPPED [0.0005s] (Only runs on cpu) [ 45%] 2025-12-04T14:02:33.4915813Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4915983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4916149Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4916326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4916499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4916665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4916841Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4917008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4917147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_in_place_cuda_complex128 PASSED [0.0039s] [ 45%] 2025-12-04T14:02:33.4917329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_number_mean_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4917499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_bool SKIPPED [0.0009s] (No inplace 
variable for this op) [ 45%] 2025-12-04T14:02:33.4917668Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4917835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4918005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4918165Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4918326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4918499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4918667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4918833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4919000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4919168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ormqr_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4919338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ormqr_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4919503Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ormqr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4919667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4919829Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4920005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T14:02:33.4920222Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4920417Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4920586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4920759Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4920941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4921111Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4921278Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4921443Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4921625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4921789Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4921962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pinverse_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4922123Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_bfloat16 PASSED [0.9673s] [ 46%] 2025-12-04T14:02:33.4922360Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4922508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_float16 PASSED [0.0099s] [ 46%] 2025-12-04T14:02:33.4922654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_float64 PASSED [0.0079s] [ 46%] 2025-12-04T14:02:33.4922892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4923125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4923358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible 
for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4923592Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4923827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4923977Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_bfloat16 PASSED [0.0077s] [ 46%] 2025-12-04T14:02:33.4924123Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_float32 PASSED [0.0076s] [ 46%] 2025-12-04T14:02:33.4924269Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_bfloat16 PASSED [0.9574s] [ 46%] 2025-12-04T14:02:33.4924430Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0103s] [ 46%] 2025-12-04T14:02:33.4924576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_float64 PASSED [0.0079s] [ 46%] 2025-12-04T14:02:33.4924809Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4924968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_float16 PASSED [0.0078s] [ 46%] 2025-12-04T14:02:33.4925204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4925382Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4925550Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4925692Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_complex128 PASSED [0.0066s] [ 46%] 2025-12-04T14:02:33.4925818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_complex32 PASSED [0.4756s] [ 46%] 2025-12-04T14:02:33.4925939Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_float64 PASSED [0.9550s] [ 46%] 2025-12-04T14:02:33.4926065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_int16 PASSED [0.0090s] [ 46%] 2025-12-04T14:02:33.4926235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4926399Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4926561Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4926724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4926847Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_complex128 PASSED [0.0172s] [ 46%] 2025-12-04T14:02:33.4926971Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_complex64 PASSED [0.0166s] [ 46%] 2025-12-04T14:02:33.4927091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_float64 PASSED [0.9683s] [ 46%] 2025-12-04T14:02:33.4927210Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_uint8 PASSED [0.0191s] [ 46%] 2025-12-04T14:02:33.4927371Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_qr_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4927543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_quantile_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4927763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4927888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_float32 PASSED [0.9541s] [ 46%] 2025-12-04T14:02:33.4928107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_int64 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4928277Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_bfloat16 SKIPPED [0.0013s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4928455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4928623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4928789Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4928964Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4929128Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_uint8 SKIPPED 
[0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4929303Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4929473Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_like_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4929650Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4929814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4929980Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4930198Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4930373Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4930546Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4930718Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4930884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4931053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 
2025-12-04T14:02:33.4931218Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4931382Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4931546Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4931710Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4931870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4932038Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4932204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4932367Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4932542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4932704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4932936Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4933080Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_float32 PASSED [0.9604s] [ 46%] 2025-12-04T14:02:33.4933302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_int64 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4933523Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_uint8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4933665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_int16 PASSED [0.0086s] [ 46%] 2025-12-04T14:02:33.4933790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_int32 PASSED [0.0070s] [ 46%] 2025-12-04T14:02:33.4933913Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_int8 PASSED [0.0069s] [ 46%] 2025-12-04T14:02:33.4934039Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_renorm_cuda_complex128 PASSED [0.0082s] [ 46%] 2025-12-04T14:02:33.4934180Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_renorm_cuda_float32 PASSED [0.0072s] [ 46%] 2025-12-04T14:02:33.4934350Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4934515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4934698Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4934876Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4935063Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4935249Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4935434Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4935609Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4935783Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4935954Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4936125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4936290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4936459Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4936632Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4936805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4936981Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4937147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4937269Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_bool PASSED [0.0036s] [ 46%] 2025-12-04T14:02:33.4937391Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_int16 PASSED [0.9564s] [ 46%] 2025-12-04T14:02:33.4937523Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_complex64 PASSED [0.0059s] [ 46%] 2025-12-04T14:02:33.4937659Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_float16 PASSED [0.0041s] [ 46%] 2025-12-04T14:02:33.4937783Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_int16 PASSED [0.0039s] [ 46%] 2025-12-04T14:02:33.4937907Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_int32 PASSED [0.9696s] [ 46%] 2025-12-04T14:02:33.4938030Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_int64 PASSED [0.0058s] [ 46%] 2025-12-04T14:02:33.4938161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_uint8 PASSED [0.0041s] [ 46%] 2025-12-04T14:02:33.4938333Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4938507Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4938685Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4938854Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4939022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4939187Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4939346Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4939515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4939681Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4939841Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4940000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4940160Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_bfloat16 PASSED [0.9583s] [ 46%] 2025-12-04T14:02:33.4940280Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_int32 PASSED [0.0046s] [ 46%] 2025-12-04T14:02:33.4940416Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_0_cuda_float16 PASSED [0.0042s] [ 46%] 2025-12-04T14:02:33.4940567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_3_cuda_bfloat16 PASSED [0.9579s] [ 46%] 2025-12-04T14:02:33.4940707Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_neg_3_cuda_float16 PASSED [0.0060s] [ 46%] 2025-12-04T14:02:33.4940931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_complex128 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4941167Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_complex64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4941289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_float16 PASSED [0.9573s] [ 46%] 2025-12-04T14:02:33.4941501Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_int64 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4941728Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T14:02:33.4941895Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4942063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4942235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4942394Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4942556Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T14:02:33.4942732Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4942902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4943073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4943199Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_bool PASSED [0.0093s] [ 47%] 2025-12-04T14:02:33.4943328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_float64 PASSED [0.0080s] [ 47%] 2025-12-04T14:02:33.4943455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_int64 PASSED [0.9637s] [ 47%] 2025-12-04T14:02:33.4943583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_complex64 PASSED [0.0162s] [ 47%] 2025-12-04T14:02:33.4943704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_float16 PASSED [0.0199s] [ 47%] 2025-12-04T14:02:33.4943826Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_float64 PASSED [0.0192s] [ 47%] 2025-12-04T14:02:33.4943946Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_int16 PASSED [0.0139s] [ 47%] 2025-12-04T14:02:33.4944067Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_int8 PASSED [0.9789s] [ 47%] 2025-12-04T14:02:33.4944185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_uint8 PASSED [0.0165s] [ 47%] 2025-12-04T14:02:33.4944325Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_float16 PASSED [0.0169s] [ 47%] 2025-12-04T14:02:33.4944474Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_float32 PASSED [0.0166s] [ 47%] 2025-12-04T14:02:33.4944614Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_float64 PASSED [0.0165s] [ 47%] 2025-12-04T14:02:33.4944754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_bfloat16 PASSED [0.0164s] [ 47%] 2025-12-04T14:02:33.4944899Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_int16 PASSED [0.0164s] [ 47%] 2025-12-04T14:02:33.4945034Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_int8 PASSED [0.0164s] [ 47%] 2025-12-04T14:02:33.4945170Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_uint8 PASSED [0.0165s] [ 47%] 2025-12-04T14:02:33.4945309Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_bfloat16 PASSED [0.0180s] [ 47%] 2025-12-04T14:02:33.4945455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_float16 PASSED [0.0184s] [ 47%] 2025-12-04T14:02:33.4945593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_float32 PASSED [0.0177s] [ 47%] 2025-12-04T14:02:33.4945727Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_int16 PASSED [0.0175s] [ 47%] 2025-12-04T14:02:33.4945862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_int32 PASSED [0.0176s] [ 47%] 2025-12-04T14:02:33.4946011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_bfloat16 PASSED [0.0165s] [ 47%] 2025-12-04T14:02:33.4946150Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_float64 PASSED [0.0164s] [ 47%] 2025-12-04T14:02:33.4946286Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_int32 PASSED [0.0164s] [ 47%] 2025-12-04T14:02:33.4946422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_uint8 PASSED [0.0164s] [ 47%] 2025-12-04T14:02:33.4946555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_int32 PASSED [0.0164s] [ 47%] 2025-12-04T14:02:33.4946688Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_int8 PASSED [0.0163s] [ 47%] 2025-12-04T14:02:33.4946864Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4947037Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4947207Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4947370Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4947534Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4947696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4947871Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4948042Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4948216Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4948346Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_float32 PASSED [0.9663s] [ 47%] 2025-12-04T14:02:33.4948464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_int16 PASSED [0.0048s] [ 47%] 2025-12-04T14:02:33.4948580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_uint8 PASSED [0.0035s] [ 47%] 2025-12-04T14:02:33.4948755Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4948919Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4949044Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_float32 PASSED [0.9719s] [ 47%] 2025-12-04T14:02:33.4949167Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_float64 PASSED [0.0063s] [ 47%] 2025-12-04T14:02:33.4949384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T14:02:33.4949515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_bool PASSED [0.9485s] [ 47%] 2025-12-04T14:02:33.4949634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_float16 PASSED [0.0051s] [ 47%] 2025-12-04T14:02:33.4949754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_float64 PASSED [0.0035s] [ 47%] 2025-12-04T14:02:33.4949879Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_int32 PASSED [0.9528s] [ 47%] 2025-12-04T14:02:33.4950070Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_bartlett_cuda_float32 SKIPPED [0.0015s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4950301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_bartlett_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4950491Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_blackman_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4950678Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_cosine_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4950866Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_gaussian_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4951054Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_gaussian_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4951250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_general_cosine_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4951447Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_general_cosine_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4951629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_hann_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4951815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_kaiser_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4952003Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_nuttall_cuda_float32 
SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4952188Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_nuttall_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4952371Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4952539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4952704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4952882Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4953094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T14:02:33.4953312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T14:02:33.4953541Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T14:02:33.4953750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_int16 SKIPPED [0.0008s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T14:02:33.4953957Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 
2025-12-04T14:02:33.4954177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T14:02:33.4954384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T14:02:33.4954507Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_float64 PASSED [0.0038s] [ 47%] 2025-12-04T14:02:33.4954673Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4954841Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4955005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4955183Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4955354Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4955529Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4955702Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4955873Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4956042Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4956207Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_cuda_float16 SKIPPED [0.0015s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4956405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4956590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4956768Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4956955Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4957133Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4957302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4957461Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4957631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4957790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_mm_reduce_cuda_bfloat16 SKIPPED [0.0005s] (Only runs on cpu) [ 47%] 2025-12-04T14:02:33.4957950Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_mm_reduce_cuda_float16 SKIPPED [0.0005s] (Only runs on cpu) [ 47%] 2025-12-04T14:02:33.4958117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_mm_reduce_cuda_float32 SKIPPED [0.0005s] (Only runs on cpu) [ 47%] 2025-12-04T14:02:33.4958301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4958476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4958648Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4958821Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4959000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4959177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4959351Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4959528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4959705Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4959884Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4960061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4960258Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4960447Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4960622Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4960801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4960988Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4961163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4961338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4961537Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4961748Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_v_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 
2025-12-04T14:02:33.4961943Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_v_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4962150Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_w_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4962325Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4962499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4962672Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4962842Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4963014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4963189Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4963361Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4963531Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4963724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_int32 SKIPPED [0.0008s] (No inplace variable for this op) [ 47%] 
2025-12-04T14:02:33.4963914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4964108Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4964302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4964504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4964679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4964851Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4965038Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T14:02:33.4965208Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4965375Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4965541Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4965719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 
48%] 2025-12-04T14:02:33.4965891Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4966059Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4966238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4966433Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4966628Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4966827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4967024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4967217Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4967411Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4967586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4967766Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_float32 
SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4967944Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4968117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4968292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4968484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4968684Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4968875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4969073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4969263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4969452Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4969640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4969837Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4970023Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4970239Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4970443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4970618Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4970791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4970961Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtri_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4971139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtri_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4971312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtri_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4971522Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4971732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float64 SKIPPED [0.0009s] (No inplace 
variable for this op) [ 48%] 2025-12-04T14:02:33.4971938Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4972138Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4972338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4972533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4972740Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4972938Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4973135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4973340Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k1_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4973546Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4973751Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int64 SKIPPED [0.0010s] (No inplace variable 
for this op) [ 48%] 2025-12-04T14:02:33.4973970Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4974175Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4974378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4974598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4974802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4975007Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4975210Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4975418Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4975621Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4975810Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_spherical_bessel_j0_cuda_bool SKIPPED [0.0009s] (No inplace 
variable for this op) [ 48%] 2025-12-04T14:02:33.4976000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_spherical_bessel_j0_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4976190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_spherical_bessel_j0_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4976370Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4976544Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4976719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4976900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_zeta_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4977072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_zeta_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4977238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4977415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4977580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4977741Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 
2025-12-04T14:02:33.4977902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4978075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4978250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4978424Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4978617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4978798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4978980Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4979163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4979336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4979557Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 48%] 2025-12-04T14:02:33.4979776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is 
impossible for inplace with non-float input) [ 48%] 2025-12-04T14:02:33.4979900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_float16 PASSED [0.9522s] [ 48%] 2025-12-04T14:02:33.4980026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_bfloat16 PASSED [0.0063s] [ 48%] 2025-12-04T14:02:33.4980184Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_bool XFAIL [0.0032s] [ 48%] 2025-12-04T14:02:33.4980310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_complex128 PASSED [0.9669s] [ 48%] 2025-12-04T14:02:33.4980436Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_complex64 PASSED [0.0043s] [ 48%] 2025-12-04T14:02:33.4980615Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4980784Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4980973Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4981143Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4981274Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_complex32 PASSED [0.9568s] [ 48%] 2025-12-04T14:02:33.4981410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_float16 PASSED [0.0082s] [ 48%] 2025-12-04T14:02:33.4981535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_float32 PASSED [0.0064s] [ 48%] 2025-12-04T14:02:33.4981656Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_float64 PASSED [0.0062s] [ 48%] 
2025-12-04T14:02:33.4981779Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_int32 PASSED [0.0061s] [ 48%] 2025-12-04T14:02:33.4981900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_uint8 PASSED [0.0061s] [ 48%] 2025-12-04T14:02:33.4982055Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_float16 PASSED [0.0053s] [ 48%] 2025-12-04T14:02:33.4982189Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_int32 PASSED [0.0052s] [ 48%] 2025-12-04T14:02:33.4982323Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_int64 PASSED [0.0052s] [ 48%] 2025-12-04T14:02:33.4982455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_uint8 PASSED [0.0052s] [ 48%] 2025-12-04T14:02:33.4982633Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4982802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4982971Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4983138Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4983302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4983466Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4983626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_cuda_float32 
SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4983796Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4983966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4984134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4984319Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4984499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4984680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4984853Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_unbiased_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4984989Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_complex32 PASSED [0.0072s] [ 48%] 2025-12-04T14:02:33.4985106Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_int32 PASSED [0.0069s] [ 48%] 2025-12-04T14:02:33.4985274Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4985441Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4985618Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4985791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4985961Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4986139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4986303Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_svd_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4986468Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_svd_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4986656Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_svd_lowrank_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4986820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4986992Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4987160Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4987325Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4987487Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4987648Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4987808Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4987928Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_bfloat16 PASSED [0.9593s] [ 48%] 2025-12-04T14:02:33.4988049Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_complex128 PASSED [0.0057s] [ 48%] 2025-12-04T14:02:33.4988165Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_float16 PASSED [0.0041s] [ 48%] 2025-12-04T14:02:33.4988279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_int64 PASSED [0.9647s] [ 48%] 2025-12-04T14:02:33.4988452Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 48%] 2025-12-04T14:02:33.4988630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4988800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4988972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4989131Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4989304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_int32 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4989464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4989625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4989746Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_bfloat16 PASSED [0.0042s] [ 49%] 2025-12-04T14:02:33.4989965Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 49%] 2025-12-04T14:02:33.4990150Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_float16 PASSED [0.0031s] [ 49%] 2025-12-04T14:02:33.4990364Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 49%] 2025-12-04T14:02:33.4990588Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 49%] 2025-12-04T14:02:33.4990709Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_bfloat16 PASSED [0.9558s] [ 49%] 2025-12-04T14:02:33.4990932Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_complex128 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 49%] 2025-12-04T14:02:33.4991150Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_complex64 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 49%] 2025-12-04T14:02:33.4991271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_float32 PASSED [0.0042s] [ 49%] 
2025-12-04T14:02:33.4991480Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 49%] 2025-12-04T14:02:33.4991689Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 49%] 2025-12-04T14:02:33.4991901Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 49%] 2025-12-04T14:02:33.4992083Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4992254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4992422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4992595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4992766Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensordot_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4992953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensordot_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4993123Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensordot_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4993293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensordot_cuda_float64 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4993474Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4993643Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4993805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4993965Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4994142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4994308Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4994478Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4994658Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4994827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4994994Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4995160Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4995320Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4995579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float16 SKIPPED [0.0005s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 49%] 2025-12-04T14:02:33.4995837Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float32 SKIPPED [0.0005s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 49%] 2025-12-04T14:02:33.4996040Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4996244Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4996442Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4996611Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4996775Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4996950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4997134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4997309Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4997449Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_bfloat16 PASSED [0.9631s] [ 49%] 2025-12-04T14:02:33.4997579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_complex64 PASSED [0.0082s] [ 49%] 2025-12-04T14:02:33.4997705Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_int64 PASSED [0.0065s] [ 49%] 2025-12-04T14:02:33.4997876Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4998043Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4998216Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4998381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4998545Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4998733Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triangular_solve_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4998857Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_complex32 PASSED [0.0107s] [ 49%] 2025-12-04T14:02:33.4998977Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_float32 PASSED [0.0100s] [ 49%] 2025-12-04T14:02:33.4999096Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_int16 PASSED 
[0.9804s] [ 49%] 2025-12-04T14:02:33.4999267Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_indices_cuda_int32 SKIPPED [0.0015s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.4999389Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_bfloat16 PASSED [0.0123s] [ 49%] 2025-12-04T14:02:33.4999511Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_complex128 PASSED [0.9584s] [ 49%] 2025-12-04T14:02:33.4999634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_complex64 PASSED [0.0125s] [ 49%] 2025-12-04T14:02:33.4999752Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_float32 PASSED [0.0103s] [ 49%] 2025-12-04T14:02:33.4999870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_int32 PASSED [0.0101s] [ 49%] 2025-12-04T14:02:33.4999985Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_int8 PASSED [0.9592s] [ 49%] 2025-12-04T14:02:33.5000154Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_float32 PASSED [0.0085s] [ 49%] 2025-12-04T14:02:33.5000378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 49%] 2025-12-04T14:02:33.5000499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_bfloat16 PASSED [0.9623s] [ 49%] 2025-12-04T14:02:33.5000618Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_int64 PASSED [0.0047s] [ 49%] 2025-12-04T14:02:33.5000792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5000974Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 
2025-12-04T14:02:33.5001142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5001312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5001493Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5001658Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5001820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5001996Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5002183Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5002354Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5002522Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5002712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5002890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5003065Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5003239Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5003410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5003579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5003750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5003917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5004081Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5004249Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5004415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5004577Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5004702Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_uniform_cuda_float32 PASSED [0.9578s] [ 49%] 2025-12-04T14:02:33.5004886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_float32 SKIPPED [0.0015s] (No 
inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5005074Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_int32 SKIPPED [0.0013s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5005254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5005428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5005593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_uint32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5005765Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unravel_index_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5005941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5006109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5006299Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5006474Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5006653Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5006824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 
2025-12-04T14:02:33.5007002Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5007179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5007354Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5007528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5007698Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5007876Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5008049Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5008235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5008414Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5008593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5008771Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 
2025-12-04T14:02:33.5008946Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5009135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5009308Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5009450Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_complex32 PASSED [0.0076s] [ 49%] 2025-12-04T14:02:33.5009580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_complex64 PASSED [0.0066s] [ 49%] 2025-12-04T14:02:33.5009708Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_float16 PASSED [0.0066s] [ 49%] 2025-12-04T14:02:33.5009834Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_float64 PASSED [0.9716s] [ 49%] 2025-12-04T14:02:33.5009958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_int16 PASSED [0.0085s] [ 49%] 2025-12-04T14:02:33.5010152Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5010343Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5010512Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5010692Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5010876Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_unbiased_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5011057Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_unbiased_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T14:02:33.5011237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_unbiased_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5011410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_unbiased_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5011589Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_unbiased_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5011755Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5011916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5012080Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5012242Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5012419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_complex_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5012586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5012757Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5012926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5013104Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5013271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5013436Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5013612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5013788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_real_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5013952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5014117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5014292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5014458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5014623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_float32 SKIPPED 
[0.0010s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5014791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5014956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5015126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5015297Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5015465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5015631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5015797Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5015960Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5016129Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5016297Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5016463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5016634Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5016800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5016962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5017134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5017300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5017463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5017635Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5017850Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 50%] 2025-12-04T14:02:33.5018065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 50%] 2025-12-04T14:02:33.5018289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 50%] 2025-12-04T14:02:33.5018414Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_complex128 PASSED [0.9649s] [ 50%] 2025-12-04T14:02:33.5018537Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_float16 PASSED [0.0056s] [ 50%] 2025-12-04T14:02:33.5018657Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_int16 PASSED [0.0039s] [ 50%] 2025-12-04T14:02:33.5018785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_int32 PASSED [0.9747s] [ 50%] 2025-12-04T14:02:33.5018951Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_bfloat16 SKIPPED [0.0015s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5019113Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_bool SKIPPED [0.0013s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5019281Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5019445Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5019620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5019791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5019961Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 50%] 2025-12-04T14:02:33.5020080Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_bfloat16 PASSED [0.9559s] [ 50%] 2025-12-04T14:02:33.5020231Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_bool PASSED [0.0045s] [ 50%] 2025-12-04T14:02:33.5020351Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_complex128 PASSED [0.9733s] [ 50%] 
2025-12-04T14:02:33.5020472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_complex32 PASSED [0.0048s] [ 50%] 2025-12-04T14:02:33.5020587Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_float16 PASSED [0.9492s] [ 50%] 2025-12-04T14:02:33.5020704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_float64 PASSED [0.0046s] [ 50%] 2025-12-04T14:02:33.5020818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_int16 PASSED [0.9545s] [ 50%] 2025-12-04T14:02:33.5020931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_bool PASSED [0.0046s] [ 50%] 2025-12-04T14:02:33.5021063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_complex128 PASSED [0.9732s] [ 50%] 2025-12-04T14:02:33.5021184Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_complex64 PASSED [0.0046s] [ 50%] 2025-12-04T14:02:33.5021296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int16 PASSED [0.9654s] [ 50%] 2025-12-04T14:02:33.5021435Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_bool PASSED [0.0108s] [ 50%] 2025-12-04T14:02:33.5021572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_complex32 PASSED [0.0090s] [ 50%] 2025-12-04T14:02:33.5021704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_complex64 PASSED [0.0088s] [ 50%] 2025-12-04T14:02:33.5021834Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_float16 PASSED [0.0086s] [ 50%] 2025-12-04T14:02:33.5021962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_float32 PASSED [0.0085s] [ 50%] 2025-12-04T14:02:33.5022105Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_float64 PASSED [0.0086s] [ 50%] 2025-12-04T14:02:33.5022230Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_uint8 PASSED [0.0085s] [ 50%] 2025-12-04T14:02:33.5022352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_bool PASSED [0.0100s] [ 50%] 2025-12-04T14:02:33.5022479Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_complex128 PASSED [0.0098s] [ 50%] 2025-12-04T14:02:33.5022614Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_int16 PASSED [0.0096s] [ 50%] 2025-12-04T14:02:33.5022735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rand___cuda_int8 PASSED [0.0096s] [ 50%] 2025-12-04T14:02:33.5022865Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_complex128 PASSED [0.0137s] [ 50%] 2025-12-04T14:02:33.5022989Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_float16 PASSED [0.0191s] [ 50%] 2025-12-04T14:02:33.5023113Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_float32 PASSED [0.0135s] [ 50%] 2025-12-04T14:02:33.5023236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_int16 PASSED [0.0163s] [ 50%] 2025-12-04T14:02:33.5023356Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_int64 PASSED [0.0161s] [ 50%] 2025-12-04T14:02:33.5023476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_int8 PASSED [0.0161s] [ 50%] 2025-12-04T14:02:33.5023610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmatmul___cuda_complex128 PASSED [0.0248s] [ 50%] 2025-12-04T14:02:33.5023735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_float64 PASSED [0.0097s] [ 50%] 2025-12-04T14:02:33.5023860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_bfloat16 PASSED [0.0128s] [ 50%] 2025-12-04T14:02:33.5023989Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_complex128 PASSED [0.0098s] [ 50%] 2025-12-04T14:02:33.5024115Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_complex64 PASSED [0.0098s] [ 50%] 2025-12-04T14:02:33.5024241Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_float32 PASSED [0.0097s] [ 50%] 2025-12-04T14:02:33.5024363Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_float64 PASSED [0.0097s] [ 50%] 2025-12-04T14:02:33.5024484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_int64 PASSED [0.0097s] [ 50%] 2025-12-04T14:02:33.5024603Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_int8 PASSED [0.0098s] [ 50%] 2025-12-04T14:02:33.5024733Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___ror___cuda_bool PASSED [0.0096s] [ 50%] 2025-12-04T14:02:33.5024854Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___ror___cuda_int16 PASSED [0.0095s] [ 50%] 2025-12-04T14:02:33.5024975Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___ror___cuda_uint8 PASSED [0.0096s] [ 50%] 2025-12-04T14:02:33.5025113Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_int64 PASSED [0.0098s] [ 50%] 2025-12-04T14:02:33.5025232Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_int8 PASSED [0.0097s] [ 50%] 2025-12-04T14:02:33.5025359Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_bfloat16 PASSED [0.0131s] [ 50%] 2025-12-04T14:02:33.5025479Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_int32 PASSED [0.0097s] [ 50%] 2025-12-04T14:02:33.5025601Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_int64 PASSED [0.0097s] [ 50%] 2025-12-04T14:02:33.5025732Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rxor___cuda_int16 PASSED [0.0096s] [ 50%] 2025-12-04T14:02:33.5025853Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rxor___cuda_int32 PASSED [0.0094s] [ 50%] 2025-12-04T14:02:33.5025972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rxor___cuda_int64 PASSED [0.0095s] [ 50%] 2025-12-04T14:02:33.5026103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_bfloat16 PASSED [0.9735s] [ 50%] 2025-12-04T14:02:33.5026237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_bool PASSED [0.0166s] [ 50%] 2025-12-04T14:02:33.5026369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_complex64 PASSED [0.9813s] [ 50%] 2025-12-04T14:02:33.5026493Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_int32 PASSED [0.0169s] [ 50%] 2025-12-04T14:02:33.5026629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_complex64 PASSED [0.9662s] [ 50%] 2025-12-04T14:02:33.5026758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_int16 PASSED [0.0107s] [ 50%] 2025-12-04T14:02:33.5026885Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_uint8 PASSED [0.0089s] [ 50%] 2025-12-04T14:02:33.5027021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_bfloat16 PASSED [0.0159s] [ 50%] 2025-12-04T14:02:33.5027153Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_float16 PASSED [0.0155s] [ 50%] 2025-12-04T14:02:33.5027280Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_bool PASSED [0.0489s] [ 50%] 2025-12-04T14:02:33.5027410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_float32 PASSED [0.0528s] [ 50%] 2025-12-04T14:02:33.5027542Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_float64 PASSED [0.0529s] [ 50%] 2025-12-04T14:02:33.5027671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_int64 PASSED [0.0405s] [ 50%] 2025-12-04T14:02:33.5027808Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_float32 PASSED [0.1007s] [ 50%] 2025-12-04T14:02:33.5027940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_int64 XFAIL [0.0052s] [ 50%] 2025-12-04T14:02:33.5028081Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_bfloat16 PASSED [1.0651s] [ 50%] 2025-12-04T14:02:33.5028222Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_complex128 PASSED [0.1351s] [ 50%] 2025-12-04T14:02:33.5028358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_float16 PASSED [0.1008s] [ 50%] 2025-12-04T14:02:33.5028504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_float64 PASSED [0.1002s] [ 50%] 2025-12-04T14:02:33.5028638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_uint8 PASSED [0.0695s] [ 50%] 2025-12-04T14:02:33.5028771Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_float32 PASSED [0.0087s] [ 50%] 2025-12-04T14:02:33.5028911Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_int16 PASSED [0.0126s] [ 50%] 2025-12-04T14:02:33.5029042Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_bool PASSED [0.0125s] [ 50%] 2025-12-04T14:02:33.5029177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_complex128 PASSED [0.0087s] [ 50%] 2025-12-04T14:02:33.5029306Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_int64 PASSED 
[0.0124s] [ 50%] 2025-12-04T14:02:33.5029439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_float16 PASSED [0.0155s] [ 50%] 2025-12-04T14:02:33.5029578Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_int8 PASSED [0.0087s] [ 50%] 2025-12-04T14:02:33.5029710Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_bool XFAIL [0.0182s] [ 50%] 2025-12-04T14:02:33.5029852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_complex64 XFAIL [0.9713s] [ 50%] 2025-12-04T14:02:33.5030000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_float64 PASSED [0.1184s] [ 50%] 2025-12-04T14:02:33.5030166Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_int32 PASSED [0.0807s] [ 50%] 2025-12-04T14:02:33.5030301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_uint8 PASSED [0.0806s] [ 50%] 2025-12-04T14:02:33.5030441Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_bfloat16 PASSED [0.1680s] [ 50%] 2025-12-04T14:02:33.5030574Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_bool XFAIL [0.0179s] [ 51%] 2025-12-04T14:02:33.5030707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_int16 PASSED [1.0377s] [ 51%] 2025-12-04T14:02:33.5030843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_int32 PASSED [0.0806s] [ 51%] 2025-12-04T14:02:33.5030976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_float16 PASSED [0.0093s] [ 51%] 2025-12-04T14:02:33.5031105Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_int16 PASSED [0.0091s] [ 51%] 2025-12-04T14:02:33.5031237Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_bfloat16 PASSED [0.0155s] [ 51%] 2025-12-04T14:02:33.5031365Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_bool PASSED [0.0124s] [ 51%] 2025-12-04T14:02:33.5031496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_float32 PASSED [0.0087s] [ 51%] 2025-12-04T14:02:33.5031626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_uint8 PASSED [0.0124s] [ 51%] 2025-12-04T14:02:33.5031755Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_bool PASSED [0.0124s] [ 51%] 2025-12-04T14:02:33.5031887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_float32 PASSED [0.0086s] [ 51%] 2025-12-04T14:02:33.5032016Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_int16 PASSED [0.0124s] [ 51%] 2025-12-04T14:02:33.5032142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_int8 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5032289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_float16 PASSED [0.1141s] [ 51%] 2025-12-04T14:02:33.5032418Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_int16 PASSED [0.0702s] [ 51%] 2025-12-04T14:02:33.5032548Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_int64 PASSED [0.0703s] [ 51%] 2025-12-04T14:02:33.5035771Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_uint8 PASSED [0.0702s] [ 51%] 2025-12-04T14:02:33.5035926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_complex64 XFAIL [0.0037s] [ 51%] 2025-12-04T14:02:33.5036061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_float64 PASSED [0.9854s] [ 51%] 
2025-12-04T14:02:33.5036190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_int16 PASSED [0.0134s] [ 51%] 2025-12-04T14:02:33.5036320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_bool PASSED [0.0149s] [ 51%] 2025-12-04T14:02:33.5036483Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_int8 PASSED [0.0133s] [ 51%] 2025-12-04T14:02:33.5036616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_bfloat16 PASSED [0.0158s] [ 51%] 2025-12-04T14:02:33.5036753Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_complex128 PASSED [0.0089s] [ 51%] 2025-12-04T14:02:33.5036891Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_complex64 PASSED [0.0087s] [ 51%] 2025-12-04T14:02:33.5037039Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_float64 PASSED [0.0085s] [ 51%] 2025-12-04T14:02:33.5037168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_int64 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5037299Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_bool PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5037437Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_complex128 PASSED [0.0087s] [ 51%] 2025-12-04T14:02:33.5037572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_float32 PASSED [0.0085s] [ 51%] 2025-12-04T14:02:33.5037707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_bfloat16 PASSED [0.0155s] [ 51%] 2025-12-04T14:02:33.5037841Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_float16 PASSED [0.0154s] [ 51%] 2025-12-04T14:02:33.5037975Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_float64 PASSED [0.0086s] [ 51%] 2025-12-04T14:02:33.5038107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_int16 PASSED [0.0087s] [ 51%] 2025-12-04T14:02:33.5038238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_int64 PASSED [0.0086s] [ 51%] 2025-12-04T14:02:33.5038371Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_int8 PASSED [0.0086s] [ 51%] 2025-12-04T14:02:33.5038499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_int16 XFAIL [0.0036s] [ 51%] 2025-12-04T14:02:33.5038627Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_int64 XFAIL [0.9819s] [ 51%] 2025-12-04T14:02:33.5038756Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_bool XFAIL [0.9661s] [ 51%] 2025-12-04T14:02:33.5038889Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_float16 PASSED [1.0229s] [ 51%] 2025-12-04T14:02:33.5039016Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_int8 XFAIL [0.0057s] [ 51%] 2025-12-04T14:02:33.5039164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_bool PASSED [0.9722s] [ 51%] 2025-12-04T14:02:33.5039301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_int8 PASSED [0.0130s] [ 51%] 2025-12-04T14:02:33.5039429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_bool PASSED [0.0128s] [ 51%] 2025-12-04T14:02:33.5039561Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_int32 PASSED [0.0126s] [ 51%] 2025-12-04T14:02:33.5039712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_bfloat16 PASSED [0.0155s] [ 51%] 
2025-12-04T14:02:33.5039845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_int16 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5039975Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_int64 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5040142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_int8 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5040293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_uint8 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5040422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_bool PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5040558Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_complex64 PASSED [0.0087s] [ 51%] 2025-12-04T14:02:33.5040687Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_int32 PASSED [0.0124s] [ 51%] 2025-12-04T14:02:33.5040829Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_bool PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5040962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_complex64 PASSED [0.0087s] [ 51%] 2025-12-04T14:02:33.5041091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_int32 PASSED [0.0124s] [ 51%] 2025-12-04T14:02:33.5041220Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_int8 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5041350Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_uint8 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5041479Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_float32 PASSED [0.0053s] [ 51%] 2025-12-04T14:02:33.5041609Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_int16 PASSED [0.0051s] [ 
51%] 2025-12-04T14:02:33.5041740Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_int32 PASSED [0.0051s] [ 51%] 2025-12-04T14:02:33.5041870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_bool XFAIL [0.0180s] [ 51%] 2025-12-04T14:02:33.5042008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_complex64 XFAIL [0.9668s] [ 51%] 2025-12-04T14:02:33.5042145Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_int16 PASSED [0.0821s] [ 51%] 2025-12-04T14:02:33.5042279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_bool XFAIL [0.0184s] [ 51%] 2025-12-04T14:02:33.5042416Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_complex64 XFAIL [0.9699s] [ 51%] 2025-12-04T14:02:33.5042555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_float64 PASSED [0.1186s] [ 51%] 2025-12-04T14:02:33.5042687Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_int8 PASSED [0.0803s] [ 51%] 2025-12-04T14:02:33.5042817Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_float16 PASSED [0.1132s] [ 51%] 2025-12-04T14:02:33.5042947Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_float32 PASSED [0.0592s] [ 51%] 2025-12-04T14:02:33.5043090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_float64 PASSED [0.0590s] [ 51%] 2025-12-04T14:02:33.5043218Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_int64 PASSED [0.0445s] [ 51%] 2025-12-04T14:02:33.5043348Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_int8 PASSED [0.0445s] [ 51%] 2025-12-04T14:02:33.5043500Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_bfloat16 PASSED [0.0154s] [ 51%] 2025-12-04T14:02:33.5043627Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_bool XFAIL [0.0036s] [ 51%] 2025-12-04T14:02:33.5043757Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_float16 PASSED [0.9740s] [ 51%] 2025-12-04T14:02:33.5043885Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_int64 PASSED [0.0090s] [ 51%] 2025-12-04T14:02:33.5044014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_uint8 PASSED [0.0086s] [ 51%] 2025-12-04T14:02:33.5044164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_complex64 PASSED [0.2115s] [ 51%] 2025-12-04T14:02:33.5044292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_int64 XFAIL [0.0036s] [ 51%] 2025-12-04T14:02:33.5044417Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_int8 XFAIL [0.9722s] [ 51%] 2025-12-04T14:02:33.5044551Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_bfloat16 PASSED [1.0673s] [ 51%] 2025-12-04T14:02:33.5044693Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_float16 PASSED [0.1032s] [ 51%] 2025-12-04T14:02:33.5044824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_float64 PASSED [0.0558s] [ 51%] 2025-12-04T14:02:33.5044952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_int16 PASSED [0.0421s] [ 51%] 2025-12-04T14:02:33.5045079Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_int8 PASSED [0.0419s] [ 51%] 2025-12-04T14:02:33.5045215Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_bool PASSED [0.0124s] [ 51%] 
2025-12-04T14:02:33.5045358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_float16 PASSED [0.0154s] [ 51%] 2025-12-04T14:02:33.5045498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_int16 PASSED [0.0124s] [ 51%] 2025-12-04T14:02:33.5045636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_uint8 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5045767Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_uint8 PASSED [0.0085s] [ 51%] 2025-12-04T14:02:33.5045901Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_bfloat16 PASSED [0.0154s] [ 51%] 2025-12-04T14:02:33.5046035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_float16 PASSED [0.0154s] [ 51%] 2025-12-04T14:02:33.5046165Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_int16 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5046297Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_int32 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5046426Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_uint8 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5046562Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_bfloat16 PASSED [0.0154s] [ 51%] 2025-12-04T14:02:33.5046695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_complex64 XFAIL [0.0037s] [ 51%] 2025-12-04T14:02:33.5046844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_float32 PASSED [0.9906s] [ 51%] 2025-12-04T14:02:33.5046974Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_int16 PASSED [0.0090s] [ 51%] 2025-12-04T14:02:33.5047104Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_uint8 PASSED [0.0087s] [ 51%] 2025-12-04T14:02:33.5047243Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_bool PASSED [0.0126s] [ 51%] 2025-12-04T14:02:33.5047378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_complex128 PASSED [0.0087s] [ 51%] 2025-12-04T14:02:33.5047508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_int64 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5047634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_int8 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5047769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_float32 PASSED [0.0086s] [ 51%] 2025-12-04T14:02:33.5047914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_int8 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5048042Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_bool PASSED [0.0124s] [ 51%] 2025-12-04T14:02:33.5048172Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_int32 PASSED [0.0125s] [ 51%] 2025-12-04T14:02:33.5048300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_uint8 PASSED [0.0124s] [ 51%] 2025-12-04T14:02:33.5048447Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_complex128 XFAIL [0.0116s] [ 51%] 2025-12-04T14:02:33.5048576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_float16 XFAIL [0.9800s] [ 51%] 2025-12-04T14:02:33.5048707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_bfloat16 PASSED [0.9929s] [ 51%] 2025-12-04T14:02:33.5048845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_complex128 PASSED [0.0090s] [ 51%] 
2025-12-04T14:02:33.5048972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_int64 PASSED [0.0127s] [ 51%] 2025-12-04T14:02:33.5049098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_int8 PASSED [0.0126s] [ 51%] 2025-12-04T14:02:33.5049235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_complex128 PASSED [0.0087s] [ 51%] 2025-12-04T14:02:33.5049368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_float32 PASSED [0.0085s] [ 51%] 2025-12-04T14:02:33.5049496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_int16 PASSED [0.0126s] [ 51%] 2025-12-04T14:02:33.5049629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_bfloat16 PASSED [0.0155s] [ 51%] 2025-12-04T14:02:33.5049770Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_complex128 XFAIL [0.0037s] [ 51%] 2025-12-04T14:02:33.5049901Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_uint8 PASSED [0.9647s] [ 51%] 2025-12-04T14:02:33.5050034Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_bfloat16 PASSED [0.0063s] [ 51%] 2025-12-04T14:02:33.5050192Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_bool PASSED [0.0059s] [ 51%] 2025-12-04T14:02:33.5050328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_complex64 PASSED [0.0059s] [ 51%] 2025-12-04T14:02:33.5050459Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_float64 PASSED [0.0058s] [ 51%] 2025-12-04T14:02:33.5050586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_int8 PASSED [0.0058s] [ 51%] 2025-12-04T14:02:33.5050743Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_uint8 
PASSED [0.0058s] [ 51%] 2025-12-04T14:02:33.5050897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__native_batch_norm_legit_cuda_float32 PASSED [0.0164s] [ 51%] 2025-12-04T14:02:33.5051044Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_lengths_cuda_float16 PASSED [0.0732s] [ 51%] 2025-12-04T14:02:33.5051207Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__softmax_backward_data_cuda_bfloat16 PASSED [0.0073s] [ 51%] 2025-12-04T14:02:33.5051352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__softmax_backward_data_cuda_float32 PASSED [0.0060s] [ 51%] 2025-12-04T14:02:33.5051496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__softmax_backward_data_cuda_float64 PASSED [0.0060s] [ 51%] 2025-12-04T14:02:33.5051638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_int64 PASSED [0.0145s] [ 51%] 2025-12-04T14:02:33.5051819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bfloat16 PASSED [0.0149s] [ 52%] 2025-12-04T14:02:33.5051980Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float64 PASSED [0.0147s] [ 52%] 2025-12-04T14:02:33.5052141Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int16 PASSED [0.0146s] [ 52%] 2025-12-04T14:02:33.5052316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int64 PASSED [0.0146s] [ 52%] 2025-12-04T14:02:33.5052473Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_uint8 PASSED [0.0147s] [ 52%] 2025-12-04T14:02:33.5052622Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__upsample_bilinear2d_aa_cuda_bfloat16 PASSED [0.0061s] [ 52%] 2025-12-04T14:02:33.5052771Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__upsample_bilinear2d_aa_cuda_float64 PASSED [0.0057s] [ 52%] 2025-12-04T14:02:33.5052888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_bool PASSED [0.9597s] [ 52%] 2025-12-04T14:02:33.5053013Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_complex128 PASSED [0.0048s] [ 52%] 2025-12-04T14:02:33.5053139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_complex128 PASSED [0.0046s] [ 52%] 2025-12-04T14:02:33.5053263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_complex32 PASSED [0.9562s] [ 52%] 2025-12-04T14:02:33.5053386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_complex64 PASSED [0.0058s] [ 52%] 2025-12-04T14:02:33.5053507Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_float32 PASSED [0.0041s] [ 52%] 2025-12-04T14:02:33.5053626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_int16 PASSED [0.9672s] [ 52%] 2025-12-04T14:02:33.5053745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_int32 PASSED [0.0061s] [ 52%] 2025-12-04T14:02:33.5053861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_bool PASSED [0.0044s] [ 52%] 2025-12-04T14:02:33.5053987Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_complex64 PASSED [0.9637s] [ 52%] 2025-12-04T14:02:33.5054105Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_int16 PASSED [0.0058s] [ 52%] 2025-12-04T14:02:33.5054226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_int32 PASSED [0.0042s] [ 52%] 2025-12-04T14:02:33.5054345Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_int64 PASSED [0.9602s] [ 52%] 2025-12-04T14:02:33.5054465Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_bfloat16 PASSED [0.0179s] [ 52%] 2025-12-04T14:02:33.5054603Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_complex64 PASSED [0.9740s] [ 52%] 2025-12-04T14:02:33.5054721Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_float32 PASSED [0.0137s] [ 52%] 2025-12-04T14:02:33.5054838Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_int32 PASSED [0.0115s] [ 52%] 2025-12-04T14:02:33.5054968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_uint8 PASSED [0.0112s] [ 52%] 2025-12-04T14:02:33.5055099Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_complex128 PASSED [0.0132s] [ 52%] 2025-12-04T14:02:33.5055221Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_float32 PASSED [0.0080s] [ 52%] 2025-12-04T14:02:33.5055344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcdiv_cuda_float32 PASSED [0.0141s] [ 52%] 2025-12-04T14:02:33.5055470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcdiv_cuda_float64 PASSED [0.0140s] [ 52%] 2025-12-04T14:02:33.5055608Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_float16 PASSED [0.0140s] [ 52%] 2025-12-04T14:02:33.5055731Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_int16 PASSED [0.0138s] [ 52%] 2025-12-04T14:02:33.5055852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_int64 PASSED [0.0138s] [ 52%] 2025-12-04T14:02:33.5055979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_complex128 PASSED [0.0104s] [ 52%] 2025-12-04T14:02:33.5056116Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_complex64 PASSED [0.0102s] [ 52%] 2025-12-04T14:02:33.5056237Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_float32 PASSED [0.0101s] [ 52%] 2025-12-04T14:02:33.5056378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_decomposed_cuda_bfloat16 PASSED [0.0130s] [ 52%] 2025-12-04T14:02:33.5056522Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_decomposed_cuda_complex128 PASSED [0.0102s] [ 52%] 2025-12-04T14:02:33.5056647Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmv_cuda_complex128 PASSED [0.0112s] [ 52%] 2025-12-04T14:02:33.5056768Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmv_cuda_float64 PASSED [0.0108s] [ 52%] 2025-12-04T14:02:33.5056890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_float64 PASSED [0.0104s] [ 52%] 2025-12-04T14:02:33.5057008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_uint8 PASSED [0.0079s] [ 52%] 2025-12-04T14:02:33.5057138Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_bfloat16 PASSED [0.0033s] [ 52%] 2025-12-04T14:02:33.5057261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_bool PASSED [0.9737s] [ 52%] 2025-12-04T14:02:33.5057391Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_float16 PASSED [0.0048s] [ 52%] 2025-12-04T14:02:33.5057519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_float32 PASSED [0.0035s] [ 52%] 2025-12-04T14:02:33.5057646Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_int32 PASSED [0.9651s] [ 52%] 2025-12-04T14:02:33.5057768Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_complex128 PASSED [0.0270s] [ 52%] 2025-12-04T14:02:33.5057890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_float32 PASSED [0.0239s] [ 52%] 2025-12-04T14:02:33.5058009Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_float64 PASSED [0.0238s] [ 52%] 2025-12-04T14:02:33.5058126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_int32 PASSED [0.0236s] [ 52%] 2025-12-04T14:02:33.5058257Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_int64 PASSED [0.0236s] [ 52%] 2025-12-04T14:02:33.5058393Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_H_cuda_float32 PASSED [0.0039s] [ 52%] 2025-12-04T14:02:33.5058541Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___getitem___cuda_float32 PASSED [0.0177s] [ 52%] 2025-12-04T14:02:33.5058699Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rmod___cuda_float32 PASSED [0.0474s] [ 52%] 2025-12-04T14:02:33.5058840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rsub___cuda_float32 PASSED [0.0482s] [ 52%] 2025-12-04T14:02:33.5058987Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_add_cuda_float32 PASSED [0.0530s] [ 52%] 2025-12-04T14:02:33.5059141Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_addcdiv_cuda_float32 PASSED [0.1008s] [ 52%] 2025-12-04T14:02:33.5059288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_div_cuda_float32 PASSED [0.0601s] [ 52%] 2025-12-04T14:02:33.5059450Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_exp_cuda_float32 PASSED [0.0086s] [ 52%] 2025-12-04T14:02:33.5059598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_log1p_cuda_float32 PASSED [0.0086s] [ 52%] 2025-12-04T14:02:33.5059746Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_log_cuda_float32 PASSED [0.0085s] [ 52%] 2025-12-04T14:02:33.5059905Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_max_cuda_float32 PASSED [0.0053s] [ 52%] 2025-12-04T14:02:33.5060057Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_minimum_cuda_float32 PASSED [0.1170s] [ 52%] 2025-12-04T14:02:33.5060244Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_mul_cuda_float32 PASSED [0.0593s] [ 52%] 2025-12-04T14:02:33.5060394Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_norm_cuda_float32 PASSED [0.1121s] [ 52%] 2025-12-04T14:02:33.5060540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_pow_cuda_float32 PASSED [0.0532s] [ 52%] 2025-12-04T14:02:33.5060689Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_round_cuda_float32 PASSED [0.0085s] [ 52%] 2025-12-04T14:02:33.5060842Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_sigmoid_cuda_float32 PASSED [0.0078s] [ 52%] 2025-12-04T14:02:33.5060987Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_sub_cuda_float32 XFAIL [0.0113s] [ 52%] 2025-12-04T14:02:33.5061133Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_tan_cuda_float32 PASSED [0.9838s] [ 52%] 2025-12-04T14:02:33.5061294Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__segment_reduce_lengths_cuda_float32 PASSED [0.1856s] [ 52%] 2025-12-04T14:02:33.5061456Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__segment_reduce_offsets_cuda_float32 PASSED [0.1554s] [ 52%] 2025-12-04T14:02:33.5061633Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__unsafe_masked_index_put_accumulate_cuda_float32 PASSED [0.3140s] [ 52%] 2025-12-04T14:02:33.5061772Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_add_cuda_float32 PASSED [0.0602s] [ 52%] 2025-12-04T14:02:33.5061910Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addbmm_cuda_float32 PASSED [0.1066s] [ 52%] 2025-12-04T14:02:33.5062050Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addcdiv_cuda_float32 PASSED [0.1498s] [ 52%] 2025-12-04T14:02:33.5062202Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addcmul_cuda_float32 PASSED [0.1509s] [ 52%] 2025-12-04T14:02:33.5062340Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addr_cuda_float32 PASSED [0.0734s] [ 52%] 2025-12-04T14:02:33.5062478Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_argsort_cuda_float32 PASSED [0.0690s] [ 52%] 2025-12-04T14:02:33.5062633Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_as_strided_cuda_float32 PASSED [0.0053s] [ 52%] 2025-12-04T14:02:33.5062786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_as_strided_scatter_cuda_float32 PASSED [0.0351s] [ 52%] 2025-12-04T14:02:33.5062924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_asinh_cuda_float32 PASSED [0.0040s] [ 52%] 2025-12-04T14:02:33.5063061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atan_cuda_float32 PASSED [0.9740s] [ 52%] 2025-12-04T14:02:33.5063205Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atleast_2d_cuda_float32 PASSED [0.0063s] [ 52%] 2025-12-04T14:02:33.5063362Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bernoulli_cuda_float32 PASSED [0.0096s] [ 52%] 2025-12-04T14:02:33.5063515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bitwise_right_shift_cuda_int64 PASSED [0.0460s] [ 52%] 2025-12-04T14:02:33.5063652Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bmm_cuda_float32 PASSED [0.0078s] [ 52%] 2025-12-04T14:02:33.5063811Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bool_cuda_float32 PASSED [0.9656s] [ 52%] 2025-12-04T14:02:33.5063962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_broadcast_shapes_cuda_float32 PASSED [0.0041s] [ 52%] 2025-12-04T14:02:33.5064115Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_broadcast_tensors_cuda_float32 PASSED [0.0406s] [ 52%] 2025-12-04T14:02:33.5064255Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cdouble_cuda_float32 PASSED [0.9598s] [ 52%] 2025-12-04T14:02:33.5064405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cholesky_solve_cuda_float32 PASSED [0.0920s] [ 52%] 2025-12-04T14:02:33.5064541Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_chunk_cuda_float32 PASSED [0.0052s] [ 52%] 2025-12-04T14:02:33.5064679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_clamp_cuda_float32 PASSED [0.1995s] [ 52%] 2025-12-04T14:02:33.5064826Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_column_stack_cuda_float32 PASSED [0.0055s] [ 52%] 2025-12-04T14:02:33.5064971Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_contiguous_cuda_float32 PASSED [0.9644s] [ 52%] 2025-12-04T14:02:33.5065112Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_copysign_cuda_float32 PASSED [0.0841s] [ 52%] 2025-12-04T14:02:33.5065262Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_count_nonzero_cuda_float32 PASSED [0.0523s] [ 52%] 2025-12-04T14:02:33.5065398Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diag_cuda_float32 PASSED [0.0278s] [ 52%] 2025-12-04T14:02:33.5065540Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diag_embed_cuda_float32 PASSED [0.0687s] [ 52%] 2025-12-04T14:02:33.5065682Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diagonal_cuda_float32 PASSED [0.0127s] [ 52%] 2025-12-04T14:02:33.5065833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diagonal_scatter_cuda_float32 PASSED [0.0675s] [ 52%] 2025-12-04T14:02:33.5065968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diff_cuda_float32 PASSED [3.8029s] [ 52%] 2025-12-04T14:02:33.5066114Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_dist_cuda_float32 PASSED [0.3359s] [ 52%] 2025-12-04T14:02:33.5066269Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_div_floor_rounding_cuda_float32 PASSED [0.2249s] [ 52%] 2025-12-04T14:02:33.5066423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_div_no_rounding_mode_cuda_float32 PASSED [0.0473s] [ 52%] 2025-12-04T14:02:33.5066586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_div_trunc_rounding_cuda_float32 PASSED [0.0516s] [ 52%] 2025-12-04T14:02:33.5066725Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_dstack_cuda_float32 PASSED [0.0072s] [ 52%] 2025-12-04T14:02:33.5066870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_empty_like_cuda_float32 PASSED [0.9680s] [ 52%] 2025-12-04T14:02:33.5067017Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_empty_permuted_cuda_float32 PASSED [0.0174s] [ 52%] 2025-12-04T14:02:33.5067164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_erf_cuda_float32 PASSED [0.0045s] [ 52%] 2025-12-04T14:02:33.5067301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_exp2_cuda_float32 PASSED [0.1439s] [ 52%] 
2025-12-04T14:02:33.5067448Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_exponential_cuda_float32 PASSED [0.0090s] [ 52%] 2025-12-04T14:02:33.5067584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_eye_cuda_float32 PASSED [0.0611s] [ 52%] 2025-12-04T14:02:33.5067736Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifft2_cuda_float32 PASSED [0.2782s] [ 52%] 2025-12-04T14:02:33.5067878Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifftn_cuda_float32 PASSED [0.0253s] [ 52%] 2025-12-04T14:02:33.5068021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ihfftn_cuda_float32 PASSED [0.0664s] [ 52%] 2025-12-04T14:02:33.5068163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_irfft_cuda_float32 PASSED [0.9805s] [ 52%] 2025-12-04T14:02:33.5068302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_rfftn_cuda_float32 PASSED [0.0372s] [ 52%] 2025-12-04T14:02:33.5068442Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_flatten_cuda_float32 PASSED [0.9705s] [ 52%] 2025-12-04T14:02:33.5068577Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_flip_cuda_float32 PASSED [0.0207s] [ 52%] 2025-12-04T14:02:33.5068715Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_float_cuda_float32 PASSED [0.9804s] [ 52%] 2025-12-04T14:02:33.5068860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_float_power_cuda_float32 PASSED [0.0452s] [ 52%] 2025-12-04T14:02:33.5068997Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_floor_cuda_float32 PASSED [0.0042s] [ 52%] 2025-12-04T14:02:33.5069135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_frexp_cuda_float32 PASSED [0.0042s] [ 52%] 2025-12-04T14:02:33.5069270Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_gather_cuda_float32 PASSED [0.0242s] [ 52%] 2025-12-04T14:02:33.5069406Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ge_cuda_float32 PASSED [0.0469s] [ 52%] 2025-12-04T14:02:33.5069559Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_grid_sampler_2d_cuda_float32 PASSED [11.9236s] [ 52%] 2025-12-04T14:02:33.5069727Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_grid_sampler_3d_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 52%] 2025-12-04T14:02:33.5069860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_gt_cuda_float32 PASSED [0.0469s] [ 52%] 2025-12-04T14:02:33.5070015Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_heaviside_cuda_float32 PASSED [0.0996s] [ 52%] 2025-12-04T14:02:33.5070189Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_histc_cuda_float32 PASSED [0.1253s] [ 52%] 2025-12-04T14:02:33.5070326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_hypot_cuda_float32 PASSED [0.0463s] [ 52%] 2025-12-04T14:02:33.5070475Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_i0_cuda_float32 PASSED [1.1312s] [ 52%] 2025-12-04T14:02:33.5070613Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_igammac_cuda_float32 PASSED [0.0478s] [ 53%] 2025-12-04T14:02:33.5070753Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_imag_cuda_complex64 PASSED [0.0050s] [ 53%] 2025-12-04T14:02:33.5070894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_index_add_cuda_float32 PASSED [0.0765s] [ 53%] 2025-12-04T14:02:33.5071039Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_index_copy_cuda_float32 PASSED [0.9781s] [ 53%] 2025-12-04T14:02:33.5071186Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_int_cuda_float32 PASSED [0.0047s] [ 53%] 2025-12-04T14:02:33.5071323Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isin_cuda_float32 PASSED [0.0100s] [ 53%] 2025-12-04T14:02:33.5071477Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_binary_cuda_float32 PASSED [0.2583s] [ 53%] 2025-12-04T14:02:33.5071660Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_binary_return_by_ref_cuda_float32 PASSED [0.2553s] [ 53%] 2025-12-04T14:02:33.5071810Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_unary_cuda_float32 PASSED [0.2400s] [ 53%] 2025-12-04T14:02:33.5071953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_kthvalue_cuda_float32 PASSED [0.0101s] [ 53%] 2025-12-04T14:02:33.5072103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_cholesky_cuda_float32 PASSED [0.0474s] [ 53%] 2025-12-04T14:02:33.5072259Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_cholesky_ex_cuda_float32 PASSED [0.9838s] [ 53%] 2025-12-04T14:02:33.5072407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_cross_cuda_float32 PASSED [0.9771s] [ 53%] 2025-12-04T14:02:33.5072557Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_diagonal_cuda_float32 PASSED [0.0150s] [ 53%] 2025-12-04T14:02:33.5072702Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_inv_cuda_float32 PASSED [0.0282s] [ 53%] 2025-12-04T14:02:33.5072939Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_ldl_solve_cuda_float32 SKIPPED [0.0007s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 53%] 2025-12-04T14:02:33.5073104Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lstsq_grad_oriented_cuda_float32 PASSED [2.7255s] [ 53%] 2025-12-04T14:02:33.5073259Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lu_factor_ex_cuda_float32 PASSED [0.0447s] [ 53%] 2025-12-04T14:02:33.5073411Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lu_solve_cuda_float32 PASSED [1.5631s] [ 53%] 2025-12-04T14:02:33.5073565Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_matrix_norm_cuda_float32 PASSED [0.3345s] [ 53%] 2025-12-04T14:02:33.5073720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_matrix_rank_cuda_float32 PASSED [0.5091s] [ 53%] 2025-12-04T14:02:33.5073870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_solve_ex_cuda_float32 PASSED [0.0964s] [ 53%] 2025-12-04T14:02:33.5074027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_svd_cuda_float32 PASSED [1.2139s] [ 53%] 2025-12-04T14:02:33.5074176Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_svdvals_cuda_float32 PASSED [0.0892s] [ 53%] 2025-12-04T14:02:33.5074334Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_vander_cuda_float32 PASSED [0.0625s] [ 53%] 2025-12-04T14:02:33.5074496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linspace_tensor_overload_cuda_float32 PASSED [0.1118s] [ 53%] 2025-12-04T14:02:33.5074634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_log1p_cuda_float32 PASSED [0.0040s] [ 53%] 2025-12-04T14:02:33.5074770Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_log_cuda_float32 PASSED [0.0055s] [ 53%] 2025-12-04T14:02:33.5074907Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logdet_cuda_float32 PASSED [0.0655s] [ 53%] 2025-12-04T14:02:33.5075062Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logical_not_cuda_float32 PASSED [0.0060s] [ 53%] 2025-12-04T14:02:33.5075195Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mH_cuda_float32 PASSED [0.0061s] [ 53%] 2025-12-04T14:02:33.5075344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_cumprod_cuda_float32 PASSED [0.1236s] [ 53%] 2025-12-04T14:02:33.5075498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_fill_cuda_float32 PASSED [0.0456s] [ 53%] 2025-12-04T14:02:33.5075651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_log_softmax_cuda_float32 PASSED [0.1420s] [ 53%] 2025-12-04T14:02:33.5075804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_logaddexp_cuda_float32 PASSED [0.4217s] [ 53%] 2025-12-04T14:02:33.5075950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_mean_cuda_float32 PASSED [0.9230s] [ 53%] 2025-12-04T14:02:33.5076098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_scatter_cuda_float32 PASSED [0.0604s] [ 53%] 2025-12-04T14:02:33.5076245Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_select_cuda_float32 PASSED [0.0254s] [ 53%] 2025-12-04T14:02:33.5076393Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_softmax_cuda_float32 PASSED [0.1006s] [ 53%] 2025-12-04T14:02:33.5076542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_softmin_cuda_float32 PASSED [0.1269s] [ 53%] 2025-12-04T14:02:33.5076685Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_var_cuda_float32 PASSED [1.2519s] [ 53%] 
2025-12-04T14:02:33.5076828Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_matrix_exp_cuda_float32 PASSED [0.0130s] [ 53%] 2025-12-04T14:02:33.5077001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_max_pool2d_with_indices_backward_cuda_float32 PASSED [5.9630s] [ 53%] 2025-12-04T14:02:33.5077157Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_max_reduction_no_dim_cuda_float32 PASSED [0.9743s] [ 53%] 2025-12-04T14:02:33.5077297Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_maximum_cuda_float32 PASSED [0.0479s] [ 53%] 2025-12-04T14:02:33.5077459Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_meshgrid_list_of_tensors_cuda_float32 PASSED [0.0125s] [ 53%] 2025-12-04T14:02:33.5077613Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_min_reduction_no_dim_cuda_float32 PASSED [0.0044s] [ 53%] 2025-12-04T14:02:33.5077774Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_min_reduction_with_dim_cuda_float32 PASSED [0.9746s] [ 53%] 2025-12-04T14:02:33.5077925Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_movedim_cuda_float32 PASSED [0.0062s] [ 53%] 2025-12-04T14:02:33.5078063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mul_cuda_float32 PASSED [0.0469s] [ 53%] 2025-12-04T14:02:33.5078209Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_multinomial_cuda_float32 PASSED [0.9900s] [ 53%] 2025-12-04T14:02:33.5078380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0350s] [ 53%] 2025-12-04T14:02:33.5078539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mvlgamma_mvlgamma_p_3_cuda_float32 PASSED [0.0321s] [ 53%] 2025-12-04T14:02:33.5078700Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_native_dropout_backward_cuda_float32 PASSED [0.0712s] [ 53%] 2025-12-04T14:02:33.5078835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ne_cuda_float32 PASSED [0.0461s] [ 53%] 2025-12-04T14:02:33.5078993Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_new_empty_cuda_float32 PASSED [0.0108s] [ 53%] 2025-12-04T14:02:33.5079135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_new_zeros_cuda_float32 PASSED [0.0110s] [ 53%] 2025-12-04T14:02:33.5079310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_adaptive_avg_pool3d_cuda_float32 PASSED [0.0277s] [ 53%] 2025-12-04T14:02:33.5079486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_alpha_dropout_cuda_float32 PASSED [0.9996s] [ 53%] 2025-12-04T14:02:33.5079646Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_avg_pool3d_cuda_float32 PASSED [0.0214s] [ 53%] 2025-12-04T14:02:33.5079837Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_binary_cross_entropy_with_logits_cuda_float32 PASSED [0.4777s] [ 53%] 2025-12-04T14:02:33.5080004Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_channel_shuffle_cuda_float32 XFAIL [0.0038s] [ 53%] 2025-12-04T14:02:33.5080385Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv2d_cuda_float32 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x72d203201400 size: 1024 2025-12-04T14:02:33.5080571Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x72d203201400 size: 1024 2025-12-04T14:02:33.5080614Z PASSED [0.8031s] [ 53%] 2025-12-04T14:02:33.5080774Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv3d_cuda_float32 PASSED [0.5003s] [ 53%] 2025-12-04T14:02:33.5080944Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv_transpose1d_cuda_float32 PASSED [0.1507s] [ 53%] 2025-12-04T14:02:33.5081115Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv_transpose2d_cuda_float32 PASSED [0.2823s] [ 53%] 2025-12-04T14:02:33.5081282Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv_transpose3d_cuda_float32 PASSED [0.2768s] [ 53%] 2025-12-04T14:02:33.5081450Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_cross_entropy_cuda_float32 PASSED [0.6070s] [ 53%] 2025-12-04T14:02:33.5081611Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_dropout3d_cuda_float32 PASSED [0.0469s] [ 53%] 2025-12-04T14:02:33.5081770Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_dropout_cuda_float32 PASSED [0.0240s] [ 53%] 2025-12-04T14:02:33.5081972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_feature_alpha_dropout_with_train_cuda_float32 PASSED [0.0342s] [ 53%] 2025-12-04T14:02:33.5082134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hardtanh_cuda_float32 PASSED [0.0192s] [ 53%] 2025-12-04T14:02:33.5082305Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_area_cuda_float32 PASSED [0.1176s] [ 53%] 2025-12-04T14:02:33.5082494Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_bicubic_cuda_float32 PASSED [3.9144s] [ 53%] 2025-12-04T14:02:33.5082668Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_linear_cuda_float32 PASSED [0.1897s] [ 53%] 2025-12-04T14:02:33.5082852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_nearest-exact_cuda_float32 PASSED [0.1085s] [ 53%] 2025-12-04T14:02:33.5083031Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_nearest_cuda_float32 PASSED [0.1082s] [ 53%] 2025-12-04T14:02:33.5083222Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_trilinear_cuda_float32 PASSED [1.2866s] [ 53%] 2025-12-04T14:02:33.5083383Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_l1_loss_cuda_float32 PASSED [0.0499s] [ 53%] 2025-12-04T14:02:33.5083548Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_layer_norm_cuda_float32 PASSED [0.1907s] [ 53%] 2025-12-04T14:02:33.5083717Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_linear_cuda_float32 PASSED [0.3517s] [ 53%] 2025-12-04T14:02:33.5083895Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_local_response_norm_cuda_float32 PASSED [0.1253s] [ 53%] 2025-12-04T14:02:33.5084059Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_pool2d_cuda_float32 PASSED [2.1498s] [ 53%] 2025-12-04T14:02:33.5084223Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_pool3d_cuda_float32 PASSED [1.8084s] [ 53%] 2025-12-04T14:02:33.5084387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_unpool2d_cuda_float32 PASSED [3.5751s] [ 53%] 2025-12-04T14:02:33.5084562Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_unpool2d_grad_cuda_float32 PASSED [0.3327s] [ 53%] 2025-12-04T14:02:33.5084735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_unpool3d_grad_cuda_float32 PASSED [0.3085s] [ 53%] 2025-12-04T14:02:33.5084909Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_multi_margin_loss_cuda_float32 PASSED [0.1092s] [ 53%] 2025-12-04T14:02:33.5085098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.1570s] [ 53%] 2025-12-04T14:02:33.5085261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_normalize_cuda_float32 PASSED [0.0860s] [ 53%] 2025-12-04T14:02:33.5085436Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pairwise_distance_cuda_float32 PASSED [0.0755s] [ 53%] 2025-12-04T14:02:33.5085592Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pdist_cuda_float32 PASSED [0.0203s] [ 53%] 2025-12-04T14:02:33.5085760Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pixel_shuffle_cuda_float32 PASSED [0.0139s] [ 53%] 2025-12-04T14:02:33.5085929Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_soft_margin_loss_cuda_float32 PASSED [0.0384s] [ 53%] 2025-12-04T14:02:33.5086115Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_softmin_with_dtype_cuda_float32 PASSED [0.0189s] [ 53%] 2025-12-04T14:02:33.5086280Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_softshrink_cuda_float32 PASSED [0.0168s] [ 53%] 2025-12-04T14:02:33.5086456Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_tanhshrink_cuda_float32 PASSED [0.0091s] [ 53%] 2025-12-04T14:02:33.5086630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_upsample_bilinear_cuda_float32 PASSED [0.1195s] [ 53%] 2025-12-04T14:02:33.5086799Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_upsample_nearest_cuda_float32 PASSED [0.0581s] [ 53%] 2025-12-04T14:02:33.5086945Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nonzero_cuda_float32 PASSED [0.0310s] [ 53%] 2025-12-04T14:02:33.5087089Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_norm_fro_cuda_float32 PASSED [0.0102s] [ 53%] 2025-12-04T14:02:33.5087254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_normal_in_place_cuda_float32 PASSED [0.0059s] [ 53%] 2025-12-04T14:02:33.5087393Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ones_cuda_float32 PASSED [0.9700s] [ 53%] 2025-12-04T14:02:33.5087543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_pca_lowrank_cuda_float32 PASSED [2.2188s] [ 53%] 2025-12-04T14:02:33.5087701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_permute_copy_cuda_float32 PASSED [0.0092s] [ 53%] 2025-12-04T14:02:33.5087844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_permute_cuda_float32 PASSED [0.0053s] [ 53%] 2025-12-04T14:02:33.5087984Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_pow_cuda_float32 PASSED [0.0461s] [ 53%] 2025-12-04T14:02:33.5088126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_prod_cuda_float32 PASSED [0.0554s] [ 53%] 2025-12-04T14:02:33.5088264Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_qr_cuda_float32 PASSED [0.0466s] [ 
53%] 2025-12-04T14:02:33.5088407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_rand_like_cuda_float32 PASSED [0.9837s] [ 53%] 2025-12-04T14:02:33.5088550Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_randn_cuda_float32 PASSED [0.0063s] [ 53%] 2025-12-04T14:02:33.5088698Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_reciprocal_cuda_float32 PASSED [0.0060s] [ 53%] 2025-12-04T14:02:33.5088845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_remainder_cuda_float32 PASSED [0.0465s] [ 53%] 2025-12-04T14:02:33.5089001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_repeat_interleave_cuda_float32 PASSED [0.0184s] [ 53%] 2025-12-04T14:02:33.5089144Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_resize__cuda_float32 PASSED [0.0061s] [ 53%] 2025-12-04T14:02:33.5089290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_resize_as__cuda_float32 PASSED [0.9793s] [ 53%] 2025-12-04T14:02:33.5089442Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_resolve_conj_cuda_float32 PASSED [0.0035s] [ 53%] 2025-12-04T14:02:33.5089580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_roll_cuda_float32 PASSED [0.0394s] [ 53%] 2025-12-04T14:02:33.5089721Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_round_cuda_float32 PASSED [0.0041s] [ 53%] 2025-12-04T14:02:33.5089858Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_rsqrt_cuda_float32 PASSED [0.0055s] [ 53%] 2025-12-04T14:02:33.5090011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_select_cuda_float32 PASSED [0.0062s] [ 53%] 2025-12-04T14:02:33.5090191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sigmoid_cuda_float32 PASSED [0.0053s] [ 53%] 
2025-12-04T14:02:33.5090360Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_general_hamming_cuda_float32 PASSED [0.0317s] [ 53%] 2025-12-04T14:02:33.5090538Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_hamming_cuda_float32 PASSED [0.0311s] [ 53%] 2025-12-04T14:02:33.5090697Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_kaiser_cuda_float32 PASSED [0.0567s] [ 53%] 2025-12-04T14:02:33.5090839Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signbit_cuda_float32 PASSED [0.0047s] [ 53%] 2025-12-04T14:02:33.5090977Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_slice_cuda_float32 PASSED [0.0050s] [ 53%] 2025-12-04T14:02:33.5091171Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sparse_mm_reduce_cuda_float32 SKIPPED [0.0005s] (Only runs on cpu) [ 53%] 2025-12-04T14:02:33.5091322Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_airy_ai_cuda_float32 PASSED [0.4255s] [ 53%] 2025-12-04T14:02:33.5091481Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_bessel_j1_cuda_float32 PASSED [0.1673s] [ 53%] 2025-12-04T14:02:33.5091643Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_erfcx_cuda_float32 PASSED [0.1963s] [ 53%] 2025-12-04T14:02:33.5091815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_hermite_polynomial_h_cuda_float32 PASSED [0.0441s] [ 53%] 2025-12-04T14:02:33.5091982Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_modified_bessel_i1_cuda_float32 PASSED [1.1207s] [ 53%] 2025-12-04T14:02:33.5092147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_modified_bessel_k1_cuda_float32 PASSED [0.1705s] [ 53%] 
2025-12-04T14:02:33.5092299Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_ndtr_cuda_float32 PASSED [0.9885s] [ 54%] 2025-12-04T14:02:33.5092447Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_ndtri_cuda_float32 PASSED [0.1628s] [ 54%] 2025-12-04T14:02:33.5092634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_shifted_chebyshev_polynomial_u_cuda_float32 PASSED [0.0454s] [ 54%] 2025-12-04T14:02:33.5092781Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_zeta_cuda_float32 PASSED [0.0458s] [ 54%] 2025-12-04T14:02:33.5092922Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sqrt_cuda_float32 PASSED [0.9740s] [ 54%] 2025-12-04T14:02:33.5093063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_squeeze_cuda_float32 PASSED [0.0108s] [ 54%] 2025-12-04T14:02:33.5093218Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_squeeze_multiple_cuda_float32 PASSED [0.0071s] [ 54%] 2025-12-04T14:02:33.5093356Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_stack_cuda_float32 PASSED [0.0101s] [ 54%] 2025-12-04T14:02:33.5093501Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_std_mean_cuda_float32 PASSED [1.0003s] [ 54%] 2025-12-04T14:02:33.5093657Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_std_mean_unbiased_cuda_float32 PASSED [0.0094s] [ 54%] 2025-12-04T14:02:33.5093794Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_svd_cuda_float32 PASSED [0.3491s] [ 54%] 2025-12-04T14:02:33.5093935Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_t_copy_cuda_float32 PASSED [0.0056s] [ 54%] 2025-12-04T14:02:33.5094095Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_take_along_dim_cuda_float32 XFAIL [0.0508s] [ 54%] 2025-12-04T14:02:33.5094236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tanh_cuda_float32 PASSED [0.9798s] [ 54%] 2025-12-04T14:02:33.5094384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tensor_split_cuda_float32 PASSED [0.0233s] [ 54%] 2025-12-04T14:02:33.5094540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_trace_cuda_float32 PASSED [0.0044s] [ 54%] 2025-12-04T14:02:33.5094684Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unflatten_cuda_float32 PASSED [0.0092s] [ 54%] 2025-12-04T14:02:33.5094824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unfold_cuda_float32 PASSED [0.0180s] [ 54%] 2025-12-04T14:02:33.5094972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unravel_index_cuda_int64 PASSED [0.1142s] [ 54%] 2025-12-04T14:02:33.5095135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unsafe_chunk_cuda_float32 PASSED [0.9718s] [ 54%] 2025-12-04T14:02:33.5095287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unsqueeze_copy_cuda_float32 PASSED [0.0161s] [ 54%] 2025-12-04T14:02:33.5095432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unsqueeze_cuda_float32 PASSED [0.0078s] [ 54%] 2025-12-04T14:02:33.5095572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_var_cuda_float32 PASSED [0.0235s] [ 54%] 2025-12-04T14:02:33.5095737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_view_as_real_cuda_complex64 PASSED [0.9596s] [ 54%] 2025-12-04T14:02:33.5095880Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_vsplit_cuda_float32 PASSED [0.0077s] [ 54%] 2025-12-04T14:02:33.5096020Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_xlogy_cuda_float32 PASSED [0.0839s] [ 54%] 2025-12-04T14:02:33.5096161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_zero__cuda_float32 PASSED [0.0070s] [ 54%] 2025-12-04T14:02:33.5096290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_allclose_cuda_bfloat16 PASSED [0.0517s] [ 54%] 2025-12-04T14:02:33.5096424Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_allclose_cuda_float32 PASSED [0.0370s] [ 54%] 2025-12-04T14:02:33.5096547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_float32 PASSED [0.0126s] [ 54%] 2025-12-04T14:02:33.5096673Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_float64 PASSED [0.0126s] [ 54%] 2025-12-04T14:02:33.5096795Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_int32 PASSED [0.0126s] [ 54%] 2025-12-04T14:02:33.5096917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_uint8 PASSED [0.0125s] [ 54%] 2025-12-04T14:02:33.5097039Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_int8 PASSED [0.0125s] [ 54%] 2025-12-04T14:02:33.5097166Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_bfloat16 PASSED [0.0058s] [ 54%] 2025-12-04T14:02:33.5097294Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_float16 PASSED [0.0058s] [ 54%] 2025-12-04T14:02:33.5097417Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_int8 PASSED [0.0048s] [ 54%] 2025-12-04T14:02:33.5097540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_bool PASSED [0.9664s] [ 54%] 2025-12-04T14:02:33.5097665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_complex32 PASSED [0.0045s] [ 54%] 2025-12-04T14:02:33.5097788Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_int16 PASSED [0.9598s] [ 54%] 2025-12-04T14:02:33.5097918Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_int64 PASSED [0.0044s] [ 54%] 2025-12-04T14:02:33.5098042Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_int8 PASSED [0.9655s] [ 54%] 2025-12-04T14:02:33.5098162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_uint8 PASSED [0.0044s] [ 54%] 2025-12-04T14:02:33.5098295Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_float32 PASSED [0.0217s] [ 54%] 2025-12-04T14:02:33.5098412Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_int32 PASSED [0.0205s] [ 54%] 2025-12-04T14:02:33.5098539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_float16 PASSED [0.0152s] [ 54%] 2025-12-04T14:02:33.5098662Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_float32 PASSED [0.0147s] [ 54%] 2025-12-04T14:02:33.5098788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_float16 PASSED [0.0081s] [ 54%] 2025-12-04T14:02:33.5098927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_float64 PASSED [0.9659s] [ 54%] 2025-12-04T14:02:33.5099049Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_int16 PASSED [0.0101s] [ 54%] 2025-12-04T14:02:33.5099172Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_int32 PASSED [0.0083s] [ 54%] 2025-12-04T14:02:33.5099293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_int8 PASSED [0.0081s] [ 54%] 2025-12-04T14:02:33.5099428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_float32 PASSED [0.9760s] [ 54%] 2025-12-04T14:02:33.5099549Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_int32 PASSED 
[0.0102s] [ 54%] 2025-12-04T14:02:33.5099671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_uint8 PASSED [0.0083s] [ 54%] 2025-12-04T14:02:33.5099799Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_bfloat16 PASSED [0.0283s] [ 54%] 2025-12-04T14:02:33.5099927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_int32 PASSED [1.0104s] [ 54%] 2025-12-04T14:02:33.5100054Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_float16 PASSED [0.0065s] [ 54%] 2025-12-04T14:02:33.5100225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_uint8 PASSED [0.0045s] [ 54%] 2025-12-04T14:02:33.5100366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_complex128 PASSED [0.0054s] [ 54%] 2025-12-04T14:02:33.5100508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_complex32 PASSED [0.0051s] [ 54%] 2025-12-04T14:02:33.5100642Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_complex32 PASSED [0.9762s] [ 54%] 2025-12-04T14:02:33.5100768Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_int16 PASSED [0.0057s] [ 54%] 2025-12-04T14:02:33.5100897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_int32 PASSED [0.0041s] [ 54%] 2025-12-04T14:02:33.5101042Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_bool PASSED [0.9666s] [ 54%] 2025-12-04T14:02:33.5101193Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_float16 PASSED [0.0051s] [ 54%] 2025-12-04T14:02:33.5101342Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_float32 PASSED [0.0035s] [ 54%] 2025-12-04T14:02:33.5101492Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_int16 PASSED [0.9592s] [ 54%] 2025-12-04T14:02:33.5101636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_complex32 PASSED [0.0100s] [ 54%] 2025-12-04T14:02:33.5101794Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_complex64 PASSED [0.0077s] [ 54%] 2025-12-04T14:02:33.5101934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_float16 PASSED [0.0074s] [ 54%] 2025-12-04T14:02:33.5102073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_float64 PASSED [0.0073s] [ 54%] 2025-12-04T14:02:33.5102222Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_uint8 PASSED [0.0073s] [ 54%] 2025-12-04T14:02:33.5102349Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_bool PASSED [0.0032s] [ 54%] 2025-12-04T14:02:33.5102475Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_complex64 PASSED [0.9706s] [ 54%] 2025-12-04T14:02:33.5102596Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_float16 PASSED [0.0053s] [ 54%] 2025-12-04T14:02:33.5102719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_int16 PASSED [0.0036s] [ 54%] 2025-12-04T14:02:33.5102849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_uint8 PASSED [0.9662s] [ 54%] 2025-12-04T14:02:33.5102970Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_bool PASSED [0.0050s] [ 54%] 2025-12-04T14:02:33.5103093Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_float32 PASSED [0.0032s] [ 54%] 2025-12-04T14:02:33.5103218Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_float64 PASSED [0.9734s] [ 54%] 2025-12-04T14:02:33.5103349Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_int32 PASSED [0.0048s] [ 54%] 2025-12-04T14:02:33.5103471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_int8 PASSED [0.0034s] [ 54%] 2025-12-04T14:02:33.5103589Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_bool PASSED [0.0123s] [ 54%] 2025-12-04T14:02:33.5103714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_float16 PASSED [0.0129s] [ 54%] 2025-12-04T14:02:33.5103833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_int8 PASSED [0.0119s] [ 54%] 2025-12-04T14:02:33.5103955Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_uint8 PASSED [0.0119s] [ 54%] 2025-12-04T14:02:33.5104081Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_complex128 PASSED [0.9656s] [ 54%] 2025-12-04T14:02:33.5104206Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_float64 PASSED [0.0046s] [ 54%] 2025-12-04T14:02:33.5104327Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_int32 PASSED [0.0037s] [ 54%] 2025-12-04T14:02:33.5104447Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_int64 PASSED [0.9592s] [ 54%] 2025-12-04T14:02:33.5104574Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_bfloat16 PASSED [0.0052s] [ 54%] 2025-12-04T14:02:33.5104693Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_bool PASSED [0.0035s] [ 54%] 2025-12-04T14:02:33.5104815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_int32 PASSED [0.9710s] [ 54%] 2025-12-04T14:02:33.5104934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_uint8 PASSED [0.0051s] [ 54%] 2025-12-04T14:02:33.5105069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_bfloat16 PASSED 
[0.0041s] [ 54%] 2025-12-04T14:02:33.5105202Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_complex32 PASSED [0.9740s] [ 54%] 2025-12-04T14:02:33.5105334Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_float16 PASSED [0.0051s] [ 54%] 2025-12-04T14:02:33.5105463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_float32 PASSED [0.9729s] [ 54%] 2025-12-04T14:02:33.5105605Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_float64 PASSED [0.0049s] [ 54%] 2025-12-04T14:02:33.5105731Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_int16 PASSED [0.9692s] [ 54%] 2025-12-04T14:02:33.5105859Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_uint8 PASSED [0.0050s] [ 54%] 2025-12-04T14:02:33.5106002Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_bfloat16 PASSED [0.9670s] [ 54%] 2025-12-04T14:02:33.5106129Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_bool PASSED [0.0055s] [ 54%] 2025-12-04T14:02:33.5106264Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_complex32 PASSED [0.0040s] [ 54%] 2025-12-04T14:02:33.5106392Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_float64 PASSED [0.9657s] [ 54%] 2025-12-04T14:02:33.5106519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_int16 PASSED [0.0057s] [ 54%] 2025-12-04T14:02:33.5106655Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_int64 PASSED [0.0041s] [ 54%] 2025-12-04T14:02:33.5106783Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_uint8 PASSED [0.9729s] [ 54%] 2025-12-04T14:02:33.5106915Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_bfloat16 PASSED [0.0065s] [ 54%] 
2025-12-04T14:02:33.5107053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_bool PASSED [0.0048s] [ 54%] 2025-12-04T14:02:33.5107186Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_complex32 PASSED [0.0047s] [ 54%] 2025-12-04T14:02:33.5107317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_float32 PASSED [0.9675s] [ 54%] 2025-12-04T14:02:33.5107442Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_int64 PASSED [0.0065s] [ 54%] 2025-12-04T14:02:33.5107574Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bernoulli_cuda_float32 PASSED [0.0057s] [ 54%] 2025-12-04T14:02:33.5107700Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_int16 PASSED [0.9719s] [ 54%] 2025-12-04T14:02:33.5107823Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_int32 PASSED [0.0041s] [ 54%] 2025-12-04T14:02:33.5107948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bincount_cuda_int32 PASSED [0.0087s] [ 54%] 2025-12-04T14:02:33.5108076Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_and_cuda_uint8 PASSED [0.0100s] [ 54%] 2025-12-04T14:02:33.5108217Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_left_shift_cuda_int64 PASSED [0.0096s] [ 54%] 2025-12-04T14:02:33.5108353Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_left_shift_cuda_int8 PASSED [0.0095s] [ 54%] 2025-12-04T14:02:33.5108484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_not_cuda_int32 PASSED [0.9695s] [ 54%] 2025-12-04T14:02:33.5108610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_or_cuda_int16 PASSED [0.0117s] [ 54%] 2025-12-04T14:02:33.5108739Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_or_cuda_uint8 PASSED [0.0098s] [ 54%] 
2025-12-04T14:02:33.5108880Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_right_shift_cuda_int16 PASSED [0.0096s] [ 54%] 2025-12-04T14:02:33.5109008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_bool PASSED [0.0094s] [ 54%] 2025-12-04T14:02:33.5109135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_int32 PASSED [0.0095s] [ 54%] 2025-12-04T14:02:33.5109269Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_bfloat16 PASSED [0.0076s] [ 54%] 2025-12-04T14:02:33.5109418Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_complex32 PASSED [0.0149s] [ 54%] 2025-12-04T14:02:33.5109544Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bmm_cuda_complex128 PASSED [0.9858s] [ 54%] 2025-12-04T14:02:33.5109667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bmm_cuda_float64 PASSED [0.0046s] [ 54%] 2025-12-04T14:02:33.5109802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_complex128 PASSED [0.9772s] [ 54%] 2025-12-04T14:02:33.5109929Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_complex32 PASSED [0.0042s] [ 54%] 2025-12-04T14:02:33.5110049Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_float32 PASSED [0.9786s] [ 54%] 2025-12-04T14:02:33.5110209Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_float64 PASSED [0.0042s] [ 55%] 2025-12-04T14:02:33.5110329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_int16 PASSED [0.9782s] [ 55%] 2025-12-04T14:02:33.5110465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_int32 PASSED [0.0042s] [ 55%] 2025-12-04T14:02:33.5110584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_uint8 PASSED [0.9725s] [ 55%] 2025-12-04T14:02:33.5110730Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_bfloat16 PASSED [0.0049s] [ 55%] 2025-12-04T14:02:33.5110875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_complex128 PASSED [0.0035s] [ 55%] 2025-12-04T14:02:33.5111033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_complex64 PASSED [0.9745s] [ 55%] 2025-12-04T14:02:33.5111172Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_float64 PASSED [0.0048s] [ 55%] 2025-12-04T14:02:33.5111312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_int32 PASSED [0.0033s] [ 55%] 2025-12-04T14:02:33.5111452Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_int8 PASSED [0.9670s] [ 55%] 2025-12-04T14:02:33.5111584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_float32 PASSED [0.9664s] [ 55%] 2025-12-04T14:02:33.5111722Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_float64 PASSED [0.9681s] [ 55%] 2025-12-04T14:02:33.5111850Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_int8 PASSED [0.9647s] [ 55%] 2025-12-04T14:02:33.5111983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_bfloat16 PASSED [0.0190s] [ 55%] 2025-12-04T14:02:33.5112111Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_float32 PASSED [0.0170s] [ 55%] 2025-12-04T14:02:33.5112238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_int64 PASSED [0.0168s] [ 55%] 2025-12-04T14:02:33.5112364Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_uint8 PASSED [0.0166s] [ 55%] 2025-12-04T14:02:33.5112486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_float16 PASSED [0.9670s] [ 55%] 
2025-12-04T14:02:33.5112617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_bool PASSED [0.0107s] [ 55%] 2025-12-04T14:02:33.5112755Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_float16 PASSED [0.0088s] [ 55%] 2025-12-04T14:02:33.5112887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_int32 PASSED [0.0085s] [ 55%] 2025-12-04T14:02:33.5113022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_int8 PASSED [0.0085s] [ 55%] 2025-12-04T14:02:33.5113142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_bool PASSED [0.0086s] [ 55%] 2025-12-04T14:02:33.5113283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_complex128 PASSED [0.0087s] [ 55%] 2025-12-04T14:02:33.5113408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_complex32 PASSED [0.9753s] [ 55%] 2025-12-04T14:02:33.5113529Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_float16 PASSED [0.0112s] [ 55%] 2025-12-04T14:02:33.5113665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_float32 PASSED [0.0089s] [ 55%] 2025-12-04T14:02:33.5113789Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cauchy_cuda_float64 PASSED [0.0050s] [ 55%] 2025-12-04T14:02:33.5113921Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_complex128 PASSED [0.9738s] [ 55%] 2025-12-04T14:02:33.5114049Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_complex32 PASSED [0.0043s] [ 55%] 2025-12-04T14:02:33.5114177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_float64 PASSED [0.9687s] [ 55%] 2025-12-04T14:02:33.5114309Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_int16 PASSED [0.0040s] [ 55%] 2025-12-04T14:02:33.5114432Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_uint8 PASSED [0.9657s] [ 55%] 2025-12-04T14:02:33.5114552Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_int16 PASSED [0.0045s] [ 55%] 2025-12-04T14:02:33.5114673Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_int32 PASSED [0.9747s] [ 55%] 2025-12-04T14:02:33.5114810Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_complex128 PASSED [0.0041s] [ 55%] 2025-12-04T14:02:33.5114936Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_float16 PASSED [0.9650s] [ 55%] 2025-12-04T14:02:33.5115062Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_float64 PASSED [0.0041s] [ 55%] 2025-12-04T14:02:33.5115184Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_int32 PASSED [0.9736s] [ 55%] 2025-12-04T14:02:33.5115308Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_int64 PASSED [0.0040s] [ 55%] 2025-12-04T14:02:33.5115432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_bfloat16 PASSED [0.9722s] [ 55%] 2025-12-04T14:02:33.5115555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_float32 PASSED [0.0040s] [ 55%] 2025-12-04T14:02:33.5115676Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_int32 PASSED [0.9814s] [ 55%] 2025-12-04T14:02:33.5115798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_uint8 PASSED [0.0042s] [ 55%] 2025-12-04T14:02:33.5115915Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_bool PASSED [0.9698s] [ 55%] 2025-12-04T14:02:33.5116043Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_complex128 PASSED [0.0043s] [ 55%] 2025-12-04T14:02:33.5116164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_float16 
PASSED [0.9646s] [ 55%] 2025-12-04T14:02:33.5116286Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_float64 PASSED [0.0042s] [ 55%] 2025-12-04T14:02:33.5116405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_int32 PASSED [0.9708s] [ 55%] 2025-12-04T14:02:33.5116525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_int64 PASSED [0.0042s] [ 55%] 2025-12-04T14:02:33.5116648Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_uint8 PASSED [0.9572s] [ 55%] 2025-12-04T14:02:33.5116777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_cuda_complex64 PASSED [0.0154s] [ 55%] 2025-12-04T14:02:33.5116904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_bfloat16 PASSED [0.0036s] [ 55%] 2025-12-04T14:02:33.5117032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_bool PASSED [0.9738s] [ 55%] 2025-12-04T14:02:33.5117161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_complex32 PASSED [0.0048s] [ 55%] 2025-12-04T14:02:33.5117286Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_complex64 PASSED [0.9760s] [ 55%] 2025-12-04T14:02:33.5117420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_float64 PASSED [0.0052s] [ 55%] 2025-12-04T14:02:33.5117542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_int16 PASSED [0.9713s] [ 55%] 2025-12-04T14:02:33.5117665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_int16 PASSED [0.0158s] [ 55%] 2025-12-04T14:02:33.5117785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_int32 PASSED [0.0130s] [ 55%] 2025-12-04T14:02:33.5117907Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_int8 PASSED [0.0133s] [ 55%] 2025-12-04T14:02:33.5118043Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_int16 PASSED [0.0137s] [ 55%] 2025-12-04T14:02:33.5118168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_int8 PASSED [0.0137s] [ 55%] 2025-12-04T14:02:33.5118293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_uint8 PASSED [0.0137s] [ 55%] 2025-12-04T14:02:33.5118420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_min_cuda_uint8 PASSED [0.0136s] [ 55%] 2025-12-04T14:02:33.5118555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_bfloat16 PASSED [0.9749s] [ 55%] 2025-12-04T14:02:33.5118680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_complex64 PASSED [0.0057s] [ 55%] 2025-12-04T14:02:33.5118805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_float64 PASSED [0.0039s] [ 55%] 2025-12-04T14:02:33.5118927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_int16 PASSED [0.0036s] [ 55%] 2025-12-04T14:02:33.5119063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_bfloat16 PASSED [0.0056s] [ 55%] 2025-12-04T14:02:33.5119191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_bool PASSED [0.0053s] [ 55%] 2025-12-04T14:02:33.5119329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_complex32 PASSED [0.0054s] [ 55%] 2025-12-04T14:02:33.5119463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_complex64 PASSED [0.0054s] [ 55%] 2025-12-04T14:02:33.5119594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_int16 PASSED [0.0053s] [ 55%] 2025-12-04T14:02:33.5119725Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_int32 PASSED [0.0053s] [ 55%] 2025-12-04T14:02:33.5119859Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_uint8 PASSED [0.0053s] [ 55%] 2025-12-04T14:02:33.5119995Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_bfloat16 PASSED [0.0906s] [ 55%] 2025-12-04T14:02:33.5120166Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_int32 PASSED [0.0898s] [ 55%] 2025-12-04T14:02:33.5120299Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_int64 PASSED [0.0899s] [ 55%] 2025-12-04T14:02:33.5120428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_int8 PASSED [0.0898s] [ 55%] 2025-12-04T14:02:33.5120560Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_uint8 PASSED [0.0899s] [ 55%] 2025-12-04T14:02:33.5120684Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_complex_cuda_float16 PASSED [0.0116s] [ 55%] 2025-12-04T14:02:33.5120820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_bool PASSED [0.9733s] [ 55%] 2025-12-04T14:02:33.5120945Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_complex32 PASSED [0.0038s] [ 55%] 2025-12-04T14:02:33.5121069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_float16 PASSED [0.9673s] [ 55%] 2025-12-04T14:02:33.5121202Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_float32 PASSED [0.0035s] [ 55%] 2025-12-04T14:02:33.5121324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_int32 PASSED [0.9659s] [ 55%] 2025-12-04T14:02:33.5121442Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_int64 PASSED [0.0034s] [ 55%] 2025-12-04T14:02:33.5121563Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_uint8 PASSED [0.9585s] [ 55%] 2025-12-04T14:02:33.5121702Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_complex128 PASSED [0.0046s] [ 55%] 2025-12-04T14:02:33.5121851Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_float64 PASSED [0.9808s] [ 55%] 2025-12-04T14:02:33.5121985Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_uint8 PASSED [0.0040s] [ 55%] 2025-12-04T14:02:33.5122127Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_bfloat16 PASSED [0.0305s] [ 55%] 2025-12-04T14:02:33.5122268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_complex64 PASSED [0.0295s] [ 55%] 2025-12-04T14:02:33.5122421Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_int64 PASSED [0.0289s] [ 55%] 2025-12-04T14:02:33.5122555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_int8 PASSED [0.0289s] [ 55%] 2025-12-04T14:02:33.5122688Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_uint8 PASSED [0.9992s] [ 55%] 2025-12-04T14:02:33.5122824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_bfloat16 PASSED [0.0034s] [ 55%] 2025-12-04T14:02:33.5122950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_bool PASSED [0.9687s] [ 55%] 2025-12-04T14:02:33.5123086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_complex32 PASSED [0.0035s] [ 55%] 2025-12-04T14:02:33.5123211Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_int8 PASSED [0.9721s] [ 55%] 2025-12-04T14:02:33.5123342Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_uint8 PASSED [0.0034s] [ 55%] 2025-12-04T14:02:33.5123464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_bool PASSED [0.0178s] [ 55%] 
2025-12-04T14:02:33.5123594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_float16 PASSED [0.0185s] [ 55%] 2025-12-04T14:02:33.5123720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_float32 PASSED [0.0157s] [ 55%] 2025-12-04T14:02:33.5123850Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_float64 PASSED [0.0334s] [ 55%] 2025-12-04T14:02:33.5123975Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_uint8 PASSED [0.0348s] [ 55%] 2025-12-04T14:02:33.5124097Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_bfloat16 PASSED [0.9824s] [ 55%] 2025-12-04T14:02:33.5124225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_complex32 PASSED [0.0066s] [ 55%] 2025-12-04T14:02:33.5124342Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_int64 PASSED [0.0044s] [ 55%] 2025-12-04T14:02:33.5124466Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_bfloat16 PASSED [0.9720s] [ 55%] 2025-12-04T14:02:33.5124594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_bool PASSED [0.0063s] [ 55%] 2025-12-04T14:02:33.5124720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_float32 PASSED [0.0042s] [ 55%] 2025-12-04T14:02:33.5124853Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_float16 PASSED [0.0226s] [ 55%] 2025-12-04T14:02:33.5124998Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_float32 PASSED [0.9895s] [ 55%] 2025-12-04T14:02:33.5125130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_float64 PASSED [0.0230s] [ 55%] 2025-12-04T14:02:33.5125256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_complex128 PASSED [0.3600s] [ 55%] 2025-12-04T14:02:33.5125374Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_uint8 PASSED [0.3743s] [ 55%] 2025-12-04T14:02:33.5125500Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_bfloat16 PASSED [0.0043s] [ 55%] 2025-12-04T14:02:33.5125636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_complex128 PASSED [0.9670s] [ 55%] 2025-12-04T14:02:33.5125761Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_float32 PASSED [0.0058s] [ 55%] 2025-12-04T14:02:33.5125883Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_uint8 PASSED [0.0044s] [ 55%] 2025-12-04T14:02:33.5126004Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_bool PASSED [0.9817s] [ 55%] 2025-12-04T14:02:33.5126139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_float16 PASSED [0.0047s] [ 55%] 2025-12-04T14:02:33.5126261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_int16 PASSED [0.9759s] [ 55%] 2025-12-04T14:02:33.5126385Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_bool PASSED [0.0052s] [ 55%] 2025-12-04T14:02:33.5126506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_int32 PASSED [0.9716s] [ 55%] 2025-12-04T14:02:33.5126629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_uint8 PASSED [0.0052s] [ 55%] 2025-12-04T14:02:33.5126759Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_complex128 PASSED [0.0154s] [ 55%] 2025-12-04T14:02:33.5126888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_float32 PASSED [0.0141s] [ 55%] 2025-12-04T14:02:33.5127010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_float16 PASSED [0.9841s] [ 55%] 2025-12-04T14:02:33.5127136Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_float64 PASSED [0.0096s] [ 55%] 2025-12-04T14:02:33.5127256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_int32 PASSED [0.0079s] [ 55%] 2025-12-04T14:02:33.5127380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_int64 PASSED [0.9793s] [ 55%] 2025-12-04T14:02:33.5127527Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_float64 PASSED [0.0417s] [ 56%] 2025-12-04T14:02:33.5127668Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_int32 PASSED [0.0400s] [ 56%] 2025-12-04T14:02:33.5127792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_bool PASSED [0.0034s] [ 56%] 2025-12-04T14:02:33.5127917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_float16 PASSED [0.9835s] [ 56%] 2025-12-04T14:02:33.5128046Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_float64 PASSED [0.0047s] [ 56%] 2025-12-04T14:02:33.5128167Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_int64 PASSED [0.0037s] [ 56%] 2025-12-04T14:02:33.5128292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_int8 PASSED [0.9689s] [ 56%] 2025-12-04T14:02:33.5128429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_complex32 PASSED [0.0137s] [ 56%] 2025-12-04T14:02:33.5128550Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_float64 PASSED [0.0116s] [ 56%] 2025-12-04T14:02:33.5128667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_int16 PASSED [0.0113s] [ 56%] 2025-12-04T14:02:33.5128797Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_int32 PASSED [0.9799s] [ 56%] 2025-12-04T14:02:33.5128929Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_complex32 PASSED [0.0282s] [ 56%] 2025-12-04T14:02:33.5129059Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_float64 PASSED [0.0252s] [ 56%] 2025-12-04T14:02:33.5129185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_int64 PASSED [0.0250s] [ 56%] 2025-12-04T14:02:33.5129310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_int8 PASSED [0.9935s] [ 56%] 2025-12-04T14:02:33.5129446Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_bfloat16 PASSED [0.0132s] [ 56%] 2025-12-04T14:02:33.5129567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_bool PASSED [0.0109s] [ 56%] 2025-12-04T14:02:33.5129691Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_int64 PASSED [0.0104s] [ 56%] 2025-12-04T14:02:33.5129812Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_int8 PASSED [0.9755s] [ 56%] 2025-12-04T14:02:33.5129956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_float32 PASSED [0.0124s] [ 56%] 2025-12-04T14:02:33.5130086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_int8 PASSED [0.0104s] [ 56%] 2025-12-04T14:02:33.5130255Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_uint8 PASSED [0.0101s] [ 56%] 2025-12-04T14:02:33.5130386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_complex128 PASSED [0.0065s] [ 56%] 2025-12-04T14:02:33.5130513Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_float16 PASSED [0.0064s] [ 56%] 2025-12-04T14:02:33.5130637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_int16 PASSED [0.0063s] [ 56%] 2025-12-04T14:02:33.5130775Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_scatter_cuda_float16 PASSED [0.0129s] [ 56%] 2025-12-04T14:02:33.5130896Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_bfloat16 PASSED [0.3088s] [ 56%] 2025-12-04T14:02:33.5131014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_int64 PASSED [0.2212s] [ 56%] 2025-12-04T14:02:33.5131138Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_float64 PASSED [0.0051s] [ 56%] 2025-12-04T14:02:33.5131259Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_int64 PASSED [0.0040s] [ 56%] 2025-12-04T14:02:33.5131381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_bfloat16 PASSED [0.0894s] [ 56%] 2025-12-04T14:02:33.5131503Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_complex128 PASSED [1.0351s] [ 56%] 2025-12-04T14:02:33.5131627Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_complex64 PASSED [0.0711s] [ 56%] 2025-12-04T14:02:33.5131764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_floor_rounding_cuda_float64 PASSED [0.0370s] [ 56%] 2025-12-04T14:02:33.5131900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_floor_rounding_cuda_int16 PASSED [0.0218s] [ 56%] 2025-12-04T14:02:33.5132034Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_floor_rounding_cuda_int8 PASSED [0.0215s] [ 56%] 2025-12-04T14:02:33.5132193Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_complex64 PASSED [0.0103s] [ 56%] 2025-12-04T14:02:33.5132333Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_float32 PASSED [0.0101s] [ 56%] 2025-12-04T14:02:33.5132472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_float64 PASSED [0.0101s] [ 56%] 
2025-12-04T14:02:33.5132621Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_int16 PASSED [0.0124s] [ 56%] 2025-12-04T14:02:33.5132759Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_int64 PASSED [0.0124s] [ 56%] 2025-12-04T14:02:33.5132899Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_uint8 PASSED [0.0124s] [ 56%] 2025-12-04T14:02:33.5133033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_float32 PASSED [0.0108s] [ 56%] 2025-12-04T14:02:33.5133169Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_int64 PASSED [0.0100s] [ 56%] 2025-12-04T14:02:33.5133303Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dot_cuda_bfloat16 PASSED [0.9767s] [ 56%] 2025-12-04T14:02:33.5133425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dot_cuda_complex128 PASSED [0.0066s] [ 56%] 2025-12-04T14:02:33.5133545Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dot_cuda_float32 PASSED [0.0033s] [ 56%] 2025-12-04T14:02:33.5133678Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dot_cuda_float64 PASSED [0.9733s] [ 56%] 2025-12-04T14:02:33.5133802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_bfloat16 PASSED [0.0041s] [ 56%] 2025-12-04T14:02:33.5133929Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_complex128 PASSED [0.9814s] [ 56%] 2025-12-04T14:02:33.5134051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_float16 PASSED [0.0042s] [ 56%] 2025-12-04T14:02:33.5134174Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_float32 PASSED [0.9958s] [ 56%] 2025-12-04T14:02:33.5134293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_int64 PASSED [0.0040s] [ 56%] 
2025-12-04T14:02:33.5134412Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_int8 PASSED [0.9894s] [ 56%] 2025-12-04T14:02:33.5134536Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_bfloat16 PASSED [0.0051s] [ 56%] 2025-12-04T14:02:33.5134664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_complex128 PASSED [0.0035s] [ 56%] 2025-12-04T14:02:33.5134790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_complex32 PASSED [0.9723s] [ 56%] 2025-12-04T14:02:33.5134912Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_float16 PASSED [0.0049s] [ 56%] 2025-12-04T14:02:33.5135035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_float32 PASSED [0.0035s] [ 56%] 2025-12-04T14:02:33.5135154Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_int16 PASSED [0.9724s] [ 56%] 2025-12-04T14:02:33.5135274Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_int32 PASSED [0.0050s] [ 56%] 2025-12-04T14:02:33.5135396Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_float16 PASSED [0.0079s] [ 56%] 2025-12-04T14:02:33.5135517Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_float32 PASSED [0.0071s] [ 56%] 2025-12-04T14:02:33.5135640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_einsum_cuda_bfloat16 PASSED [0.0280s] [ 56%] 2025-12-04T14:02:33.5135765Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_complex128 PASSED [0.0037s] [ 56%] 2025-12-04T14:02:33.5135898Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_complex32 PASSED [0.0038s] [ 56%] 2025-12-04T14:02:33.5136024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_complex64 PASSED [0.0038s] [ 56%] 2025-12-04T14:02:33.5136142Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_int8 PASSED [0.0036s] [ 56%] 2025-12-04T14:02:33.5136286Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_complex128 PASSED [0.9778s] [ 56%] 2025-12-04T14:02:33.5136415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_float32 PASSED [0.0082s] [ 56%] 2025-12-04T14:02:33.5136541Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_int64 PASSED [0.9878s] [ 56%] 2025-12-04T14:02:33.5136678Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_complex32 PASSED [0.0168s] [ 56%] 2025-12-04T14:02:33.5136815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_complex64 PASSED [0.0149s] [ 56%] 2025-12-04T14:02:33.5136962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_int16 PASSED [0.0149s] [ 56%] 2025-12-04T14:02:33.5137094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_uint8 PASSED [0.0149s] [ 56%] 2025-12-04T14:02:33.5137230Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_complex128 XFAIL [0.0046s] [ 56%] 2025-12-04T14:02:33.5137364Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_complex64 XFAIL [0.9791s] [ 56%] 2025-12-04T14:02:33.5137504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_float16 XFAIL [0.9829s] [ 56%] 2025-12-04T14:02:33.5137631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_int32 XFAIL [0.9685s] [ 56%] 2025-12-04T14:02:33.5137759Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_int64 XFAIL [0.9749s] [ 56%] 2025-12-04T14:02:33.5137887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_uint8 XFAIL [0.9685s] [ 56%] 
2025-12-04T14:02:33.5138003Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_bool PASSED [0.9796s] [ 56%] 2025-12-04T14:02:33.5138125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_complex32 PASSED [0.0137s] [ 56%] 2025-12-04T14:02:33.5138239Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_int32 PASSED [0.0104s] [ 56%] 2025-12-04T14:02:33.5138355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_uint8 PASSED [0.9998s] [ 56%] 2025-12-04T14:02:33.5138473Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_bool PASSED [0.0069s] [ 56%] 2025-12-04T14:02:33.5138599Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_complex64 PASSED [0.0053s] [ 56%] 2025-12-04T14:02:33.5138720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_float16 PASSED [0.0049s] [ 56%] 2025-12-04T14:02:33.5138842Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_int16 PASSED [0.0048s] [ 56%] 2025-12-04T14:02:33.5138961Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_bfloat16 PASSED [0.9732s] [ 56%] 2025-12-04T14:02:33.5139080Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_int16 PASSED [0.0052s] [ 56%] 2025-12-04T14:02:33.5139195Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_int32 PASSED [0.0036s] [ 56%] 2025-12-04T14:02:33.5139312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_int64 PASSED [0.9669s] [ 56%] 2025-12-04T14:02:33.5139425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_int8 PASSED [0.0051s] [ 56%] 2025-12-04T14:02:33.5139556Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_float16 PASSED [0.0049s] [ 56%] 2025-12-04T14:02:33.5139676Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_float64 PASSED 
[0.0053s] [ 56%] 2025-12-04T14:02:33.5139793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_int16 PASSED [0.9753s] [ 56%] 2025-12-04T14:02:33.5139916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_float16 PASSED [0.0054s] [ 56%] 2025-12-04T14:02:33.5140048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_float32 PASSED [0.0034s] [ 56%] 2025-12-04T14:02:33.5140212Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_int32 PASSED [0.9866s] [ 56%] 2025-12-04T14:02:33.5140331Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_int64 PASSED [0.0051s] [ 56%] 2025-12-04T14:02:33.5140450Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_int8 PASSED [0.0037s] [ 56%] 2025-12-04T14:02:33.5140571Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_bfloat16 PASSED [1.0020s] [ 56%] 2025-12-04T14:02:33.5140701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_bool PASSED [0.0079s] [ 56%] 2025-12-04T14:02:33.5140819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_int16 PASSED [0.0044s] [ 56%] 2025-12-04T14:02:33.5140936Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_int8 PASSED [0.9825s] [ 56%] 2025-12-04T14:02:33.5141055Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_bfloat16 PASSED [0.0064s] [ 56%] 2025-12-04T14:02:33.5141191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_complex128 PASSED [0.0042s] [ 56%] 2025-12-04T14:02:33.5141313Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_complex64 PASSED [0.9742s] [ 56%] 2025-12-04T14:02:33.5141432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_float16 PASSED [0.0064s] [ 56%] 2025-12-04T14:02:33.5141551Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_float32 PASSED [0.0042s] [ 56%] 2025-12-04T14:02:33.5141667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_int16 PASSED [0.9704s] [ 56%] 2025-12-04T14:02:33.5141782Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_int64 PASSED [0.0059s] [ 56%] 2025-12-04T14:02:33.5141913Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_complex128 PASSED [0.0035s] [ 56%] 2025-12-04T14:02:33.5142037Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_int16 PASSED [0.9778s] [ 56%] 2025-12-04T14:02:33.5142159Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_uint8 PASSED [0.0052s] [ 56%] 2025-12-04T14:02:33.5142290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_float64 PASSED [0.0083s] [ 56%] 2025-12-04T14:02:33.5142416Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_int64 PASSED [0.0077s] [ 56%] 2025-12-04T14:02:33.5142543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_complex64 PASSED [0.0051s] [ 56%] 2025-12-04T14:02:33.5142664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_float32 PASSED [0.0050s] [ 56%] 2025-12-04T14:02:33.5142785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_int32 PASSED [0.9818s] [ 56%] 2025-12-04T14:02:33.5142903Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_int64 PASSED [0.0070s] [ 56%] 2025-12-04T14:02:33.5143026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_bfloat16 PASSED [0.0037s] [ 56%] 2025-12-04T14:02:33.5143151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_complex128 PASSED [0.9807s] [ 56%] 2025-12-04T14:02:33.5143288Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_complex64 PASSED [0.0047s] [ 56%] 2025-12-04T14:02:33.5143410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_float16 PASSED [0.0039s] [ 56%] 2025-12-04T14:02:33.5143528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_int64 PASSED [0.9686s] [ 56%] 2025-12-04T14:02:33.5143660Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exponential_cuda_bfloat16 PASSED [0.0078s] [ 56%] 2025-12-04T14:02:33.5143804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exponential_cuda_float16 PASSED [0.0055s] [ 56%] 2025-12-04T14:02:33.5143935Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exponential_cuda_float32 PASSED [0.0046s] [ 56%] 2025-12-04T14:02:33.5144049Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_bool PASSED [0.0516s] [ 56%] 2025-12-04T14:02:33.5144172Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_complex128 PASSED [0.0603s] [ 56%] 2025-12-04T14:02:33.5144296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float8_e5m2 PASSED [0.0602s] [ 56%] 2025-12-04T14:02:33.5144437Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float8_e5m2fnuz PASSED [1.0305s] [ 56%] 2025-12-04T14:02:33.5144557Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_bool PASSED [0.0103s] [ 57%] 2025-12-04T14:02:33.5144687Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_complex32 PASSED [0.9943s] [ 57%] 2025-12-04T14:02:33.5144818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_int32 PASSED [0.0108s] [ 57%] 2025-12-04T14:02:33.5144945Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_complex64 PASSED [0.9858s] [ 57%] 2025-12-04T14:02:33.5145066Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_int16 PASSED [0.0123s] [ 57%] 2025-12-04T14:02:33.5145191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_int64 PASSED [0.0098s] [ 57%] 2025-12-04T14:02:33.5145326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_complex32 PASSED [0.0079s] [ 57%] 2025-12-04T14:02:33.5145456Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_float64 PASSED [0.0074s] [ 57%] 2025-12-04T14:02:33.5145585Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_int16 PASSED [0.0074s] [ 57%] 2025-12-04T14:02:33.5145706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_bool PASSED [0.0163s] [ 57%] 2025-12-04T14:02:33.5145836Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_complex128 PASSED [0.0240s] [ 57%] 2025-12-04T14:02:33.5145964Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_complex64 PASSED [0.0158s] [ 57%] 2025-12-04T14:02:33.5146090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_float64 PASSED [0.0161s] [ 57%] 2025-12-04T14:02:33.5146214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_int32 PASSED [0.9947s] [ 57%] 2025-12-04T14:02:33.5146342Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_complex128 PASSED [0.0171s] [ 57%] 2025-12-04T14:02:33.5146470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_complex32 PASSED [0.0132s] [ 57%] 2025-12-04T14:02:33.5146594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_float16 PASSED [0.9856s] [ 57%] 2025-12-04T14:02:33.5146719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_float64 PASSED [0.0174s] [ 57%] 2025-12-04T14:02:33.5146841Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_int16 PASSED [0.0151s] [ 57%] 2025-12-04T14:02:33.5146963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_int64 PASSED [0.0146s] [ 57%] 2025-12-04T14:02:33.5147094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_uint8 PASSED [0.9936s] [ 57%] 2025-12-04T14:02:33.5147224Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_complex128 PASSED [0.0216s] [ 57%] 2025-12-04T14:02:33.5147352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_complex64 PASSED [0.0189s] [ 57%] 2025-12-04T14:02:33.5147489Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_float16 PASSED [0.0168s] [ 57%] 2025-12-04T14:02:33.5147612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_int32 PASSED [0.0184s] [ 57%] 2025-12-04T14:02:33.5147735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_uint8 PASSED [0.0182s] [ 57%] 2025-12-04T14:02:33.5147855Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_bool PASSED [0.0080s] [ 57%] 2025-12-04T14:02:33.5147987Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_complex128 PASSED [0.0077s] [ 57%] 2025-12-04T14:02:33.5148121Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_float32 PASSED [0.0078s] [ 57%] 2025-12-04T14:02:33.5148245Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_int16 PASSED [0.0078s] [ 57%] 2025-12-04T14:02:33.5148366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_int8 PASSED [0.0078s] [ 57%] 2025-12-04T14:02:33.5148503Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_complex32 PASSED [0.0079s] [ 57%] 2025-12-04T14:02:33.5148629Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_float32 PASSED [0.0138s] [ 57%] 2025-12-04T14:02:33.5148750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_int64 PASSED [0.9888s] [ 57%] 2025-12-04T14:02:33.5148871Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_int8 PASSED [0.0171s] [ 57%] 2025-12-04T14:02:33.5148993Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_bool PASSED [0.0100s] [ 57%] 2025-12-04T14:02:33.5149118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_float32 PASSED [0.0097s] [ 57%] 2025-12-04T14:02:33.5149242Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_int16 PASSED [0.0096s] [ 57%] 2025-12-04T14:02:33.5149365Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_int64 PASSED [0.9884s] [ 57%] 2025-12-04T14:02:33.5149499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_bfloat16 PASSED [0.0101s] [ 57%] 2025-12-04T14:02:33.5149635Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_complex64 PASSED [0.9802s] [ 57%] 2025-12-04T14:02:33.5149768Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_float16 PASSED [0.0096s] [ 57%] 2025-12-04T14:02:33.5149900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_int16 PASSED [0.9795s] [ 57%] 2025-12-04T14:02:33.5150029Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_int64 PASSED [0.0102s] [ 57%] 2025-12-04T14:02:33.5150200Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_int8 PASSED [0.9999s] [ 57%] 2025-12-04T14:02:33.5150329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_uint8 PASSED [0.0096s] [ 57%] 2025-12-04T14:02:33.5150454Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_bool PASSED [0.9908s] [ 57%] 2025-12-04T14:02:33.5150580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_float32 PASSED [0.0158s] [ 57%] 2025-12-04T14:02:33.5150704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_int16 PASSED [0.9995s] [ 57%] 2025-12-04T14:02:33.5150843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_uint8 PASSED [0.0168s] [ 57%] 2025-12-04T14:02:33.5150967Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_uint8 PASSED [1.0016s] [ 57%] 2025-12-04T14:02:33.5151092Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_int32 PASSED [0.0192s] [ 57%] 2025-12-04T14:02:33.5151234Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_int64 PASSED [0.0157s] [ 57%] 2025-12-04T14:02:33.5151358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_bool PASSED [0.0087s] [ 57%] 2025-12-04T14:02:33.5151482Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_int64 PASSED [0.0084s] [ 57%] 2025-12-04T14:02:33.5151603Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_bool PASSED [0.0086s] [ 57%] 2025-12-04T14:02:33.5151734Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_complex128 PASSED [0.0083s] [ 57%] 2025-12-04T14:02:33.5151880Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_complex64 PASSED [0.0083s] [ 57%] 2025-12-04T14:02:33.5152005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_float16 PASSED [0.0089s] [ 57%] 2025-12-04T14:02:33.5152130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_int16 PASSED [0.0086s] [ 57%] 2025-12-04T14:02:33.5152263Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_int8 PASSED [0.0085s] [ 57%] 2025-12-04T14:02:33.5152387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_uint8 PASSED [0.0085s] [ 57%] 2025-12-04T14:02:33.5152510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_bool PASSED [0.0100s] [ 57%] 2025-12-04T14:02:33.5152642Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_complex128 PASSED [1.2748s] [ 57%] 2025-12-04T14:02:33.5152770Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_float32 PASSED [0.0128s] [ 57%] 2025-12-04T14:02:33.5152894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_uint8 PASSED [0.0105s] [ 57%] 2025-12-04T14:02:33.5153016Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_bool PASSED [0.9721s] [ 57%] 2025-12-04T14:02:33.5153140Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_float16 PASSED [0.0223s] [ 57%] 2025-12-04T14:02:33.5153265Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_float64 PASSED [0.9908s] [ 57%] 2025-12-04T14:02:33.5153385Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_int16 PASSED [0.0110s] [ 57%] 2025-12-04T14:02:33.5153506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_int64 PASSED [0.9826s] [ 57%] 2025-12-04T14:02:33.5153629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_bool PASSED [0.0127s] [ 57%] 2025-12-04T14:02:33.5153756Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_float16 PASSED [0.4721s] [ 57%] 2025-12-04T14:02:33.5153882Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_float64 PASSED [0.0096s] [ 57%] 2025-12-04T14:02:33.5154005Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_int32 PASSED [0.0097s] [ 57%] 2025-12-04T14:02:33.5154128Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_int8 PASSED [0.9936s] [ 57%] 2025-12-04T14:02:33.5154248Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_bfloat16 PASSED [0.0059s] [ 57%] 2025-12-04T14:02:33.5154368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_float16 PASSED [0.0041s] [ 57%] 2025-12-04T14:02:33.5154499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_float64 PASSED [0.9772s] [ 57%] 2025-12-04T14:02:33.5154620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_int16 PASSED [0.0059s] [ 57%] 2025-12-04T14:02:33.5154736Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_int32 PASSED [0.0041s] [ 57%] 2025-12-04T14:02:33.5154874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_complex64 PASSED [0.9801s] [ 57%] 2025-12-04T14:02:33.5154996Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_float64 PASSED [0.0057s] [ 57%] 2025-12-04T14:02:33.5155117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_uint8 PASSED [0.0040s] [ 57%] 2025-12-04T14:02:33.5155236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_bfloat16 PASSED [0.0082s] [ 57%] 2025-12-04T14:02:33.5155356Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_float64 PASSED [0.0078s] [ 57%] 2025-12-04T14:02:33.5155484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_int16 PASSED [0.0078s] [ 57%] 2025-12-04T14:02:33.5155611Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_complex128 PASSED [0.0033s] [ 57%] 2025-12-04T14:02:33.5155733Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_float64 PASSED [0.0032s] [ 57%] 2025-12-04T14:02:33.5155852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_int16 PASSED [0.0032s] [ 57%] 2025-12-04T14:02:33.5155982Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_int32 PASSED [0.0033s] [ 57%] 2025-12-04T14:02:33.5156103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_int8 PASSED [0.0033s] [ 57%] 2025-12-04T14:02:33.5156225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_float16 PASSED [0.9983s] [ 57%] 2025-12-04T14:02:33.5156344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_int32 PASSED [0.0041s] [ 57%] 2025-12-04T14:02:33.5156463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_int8 PASSED [0.9812s] [ 57%] 2025-12-04T14:02:33.5156581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_uint8 PASSED [0.0042s] [ 57%] 2025-12-04T14:02:33.5156712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_bfloat16 PASSED [0.0114s] [ 57%] 2025-12-04T14:02:33.5156838Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_int32 PASSED [0.0102s] [ 57%] 2025-12-04T14:02:33.5156957Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_cuda_int16 PASSED [0.9813s] [ 57%] 2025-12-04T14:02:33.5157075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_cuda_int8 PASSED [0.0045s] [ 57%] 2025-12-04T14:02:33.5157206Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_float32 PASSED [0.0372s] [ 57%] 2025-12-04T14:02:33.5157333Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_int16 PASSED [0.0218s] [ 57%] 2025-12-04T14:02:33.5157461Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_int64 PASSED [0.0215s] [ 57%] 2025-12-04T14:02:33.5157582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_float16 PASSED [0.0127s] [ 57%] 2025-12-04T14:02:33.5157701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_float32 PASSED [0.0094s] [ 57%] 2025-12-04T14:02:33.5157820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_float64 PASSED [0.0094s] [ 57%] 2025-12-04T14:02:33.5157937Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_int16 PASSED [0.0094s] [ 57%] 2025-12-04T14:02:33.5158055Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_int32 PASSED [0.0093s] [ 57%] 2025-12-04T14:02:33.5158185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_bfloat16 PASSED [0.0128s] [ 57%] 2025-12-04T14:02:33.5158304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_bool PASSED [0.0093s] [ 57%] 2025-12-04T14:02:33.5158422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_float16 PASSED [0.0126s] [ 57%] 2025-12-04T14:02:33.5158551Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_float64 PASSED [0.0095s] [ 57%] 2025-12-04T14:02:33.5158668Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_int64 PASSED [0.0094s] [ 57%] 2025-12-04T14:02:33.5158786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_uint8 PASSED [0.0094s] [ 57%] 2025-12-04T14:02:33.5158902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_int32 PASSED [0.0100s] [ 57%] 2025-12-04T14:02:33.5159020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_int8 PASSED [0.0100s] [ 57%] 2025-12-04T14:02:33.5159156Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_uint8 PASSED [0.0100s] [ 57%] 
2025-12-04T14:02:33.5159274Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frac_cuda_float32 PASSED [0.9886s] [ 57%] 2025-12-04T14:02:33.5159395Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frexp_cuda_float32 PASSED [0.0048s] [ 57%] 2025-12-04T14:02:33.5159516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frexp_cuda_float64 PASSED [0.9753s] [ 57%] 2025-12-04T14:02:33.5159646Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_bfloat16 PASSED [0.0060s] [ 57%] 2025-12-04T14:02:33.5159769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_complex64 PASSED [0.0043s] [ 57%] 2025-12-04T14:02:33.5159888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_float16 PASSED [0.9749s] [ 57%] 2025-12-04T14:02:33.5160010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_bool PASSED [0.0092s] [ 57%] 2025-12-04T14:02:33.5160190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_float16 PASSED [0.9778s] [ 57%] 2025-12-04T14:02:33.5160316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_float64 PASSED [0.0087s] [ 57%] 2025-12-04T14:02:33.5160443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_uint32 PASSED [0.9862s] [ 57%] 2025-12-04T14:02:33.5160567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_uint8 PASSED [0.0093s] [ 57%] 2025-12-04T14:02:33.5160688Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_bool PASSED [0.9929s] [ 57%] 2025-12-04T14:02:33.5160810Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_float16 PASSED [0.0092s] [ 57%] 2025-12-04T14:02:33.5160931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_int16 PASSED [0.9872s] [ 57%] 2025-12-04T14:02:33.5161051Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_int32 PASSED [0.0094s] [ 57%] 2025-12-04T14:02:33.5161170Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_int8 PASSED [1.0009s] [ 57%] 2025-12-04T14:02:33.5161286Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gcd_cuda_int16 PASSED [0.0137s] [ 57%] 2025-12-04T14:02:33.5161401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gcd_cuda_int8 PASSED [0.0096s] [ 58%] 2025-12-04T14:02:33.5161518Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gcd_cuda_uint8 PASSED [0.0095s] [ 58%] 2025-12-04T14:02:33.5161635Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_float64 PASSED [0.0095s] [ 58%] 2025-12-04T14:02:33.5161749Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_int64 PASSED [0.0094s] [ 58%] 2025-12-04T14:02:33.5161891Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_float16 PASSED [0.0053s] [ 58%] 2025-12-04T14:02:33.5162020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_float32 PASSED [0.9922s] [ 58%] 2025-12-04T14:02:33.5162146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_float64 PASSED [0.0070s] [ 58%] 2025-12-04T14:02:33.5162282Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_int16 PASSED [0.0055s] [ 58%] 2025-12-04T14:02:33.5162404Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_int64 PASSED [0.0051s] [ 58%] 2025-12-04T14:02:33.5162531Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_uint8 PASSED [0.0050s] [ 58%] 2025-12-04T14:02:33.5162652Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_int32 PASSED [0.2020s] [ 58%] 2025-12-04T14:02:33.5162775Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_int64 
PASSED [0.2015s] [ 58%] 2025-12-04T14:02:33.5162942Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_3d_cuda_float16 SKIPPED [0.0001s] (Skipped!) [ 58%] 2025-12-04T14:02:33.5163091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_3d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 58%] 2025-12-04T14:02:33.5163213Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_bfloat16 PASSED [0.0119s] [ 58%] 2025-12-04T14:02:33.5163332Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_float32 PASSED [0.9743s] [ 58%] 2025-12-04T14:02:33.5163462Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_int64 PASSED [0.0043s] [ 58%] 2025-12-04T14:02:33.5163579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_int8 PASSED [0.9592s] [ 58%] 2025-12-04T14:02:33.5163696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_uint8 PASSED [0.0042s] [ 58%] 2025-12-04T14:02:33.5163827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_bfloat16 PASSED [0.0119s] [ 58%] 2025-12-04T14:02:33.5163952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_bool PASSED [0.0109s] [ 58%] 2025-12-04T14:02:33.5164080Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_float64 PASSED [0.0108s] [ 58%] 2025-12-04T14:02:33.5164206Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_int8 PASSED [0.9930s] [ 58%] 2025-12-04T14:02:33.5164332Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_float16 PASSED [0.0228s] [ 58%] 2025-12-04T14:02:33.5164459Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_float32 PASSED [0.0176s] [ 58%] 2025-12-04T14:02:33.5164583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_int16 PASSED [0.0187s] [ 
58%] 2025-12-04T14:02:33.5164710Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_int64 PASSED [0.0156s] [ 58%] 2025-12-04T14:02:33.5164833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_histc_cuda_int32 PASSED [0.0515s] [ 58%] 2025-12-04T14:02:33.5164951Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_histc_cuda_int64 PASSED [0.0512s] [ 58%] 2025-12-04T14:02:33.5165071Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_bool PASSED [0.0069s] [ 58%] 2025-12-04T14:02:33.5165197Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_complex64 PASSED [0.9676s] [ 58%] 2025-12-04T14:02:33.5165321Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_float64 PASSED [0.0051s] [ 58%] 2025-12-04T14:02:33.5165439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_int16 PASSED [0.0036s] [ 58%] 2025-12-04T14:02:33.5165574Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_int8 PASSED [0.9845s] [ 58%] 2025-12-04T14:02:33.5165695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_uint8 PASSED [0.0055s] [ 58%] 2025-12-04T14:02:33.5165818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_bfloat16 PASSED [0.0046s] [ 58%] 2025-12-04T14:02:33.5165947Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_bool PASSED [0.0040s] [ 58%] 2025-12-04T14:02:33.5166073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_complex64 PASSED [0.9732s] [ 58%] 2025-12-04T14:02:33.5166193Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_int16 PASSED [0.0060s] [ 58%] 2025-12-04T14:02:33.5166312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_int32 PASSED [0.0043s] [ 58%] 2025-12-04T14:02:33.5166431Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_int8 PASSED [0.0040s] [ 58%] 2025-12-04T14:02:33.5166550Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_uint8 PASSED [0.9786s] [ 58%] 2025-12-04T14:02:33.5166676Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_int32 PASSED [0.0071s] [ 58%] 2025-12-04T14:02:33.5166790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_int64 PASSED [0.0036s] [ 58%] 2025-12-04T14:02:33.5166905Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_int8 PASSED [0.9768s] [ 58%] 2025-12-04T14:02:33.5167027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_igamma_cuda_float32 PASSED [0.0120s] [ 58%] 2025-12-04T14:02:33.5167161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_igammac_cuda_float32 PASSED [0.0098s] [ 58%] 2025-12-04T14:02:33.5167289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_bfloat16 PASSED [0.9871s] [ 58%] 2025-12-04T14:02:33.5167420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_complex128 PASSED [0.0132s] [ 58%] 2025-12-04T14:02:33.5167550Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_complex32 PASSED [0.0128s] [ 58%] 2025-12-04T14:02:33.5167679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_complex64 PASSED [0.9941s] [ 58%] 2025-12-04T14:02:33.5167805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_float16 PASSED [0.0156s] [ 58%] 2025-12-04T14:02:33.5167928Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_int64 PASSED [0.0110s] [ 58%] 2025-12-04T14:02:33.5168051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_uint8 PASSED [0.0106s] [ 58%] 2025-12-04T14:02:33.5168180Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_bfloat16 PASSED [0.9916s] [ 58%] 2025-12-04T14:02:33.5168303Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_bool PASSED [0.0069s] [ 58%] 2025-12-04T14:02:33.5168435Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_complex32 PASSED [0.0049s] [ 58%] 2025-12-04T14:02:33.5168560Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_int16 PASSED [0.0046s] [ 58%] 2025-12-04T14:02:33.5168682Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_int8 PASSED [0.0045s] [ 58%] 2025-12-04T14:02:33.5168814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_complex64 PASSED [0.0077s] [ 58%] 2025-12-04T14:02:33.5168945Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_float64 PASSED [0.0074s] [ 58%] 2025-12-04T14:02:33.5169070Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_int16 PASSED [0.0073s] [ 58%] 2025-12-04T14:02:33.5169194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_int32 PASSED [0.0073s] [ 58%] 2025-12-04T14:02:33.5169328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_int64 PASSED [0.0073s] [ 58%] 2025-12-04T14:02:33.5169451Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_int8 PASSED [0.0072s] [ 58%] 2025-12-04T14:02:33.5169579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_bfloat16 PASSED [0.0059s] [ 58%] 2025-12-04T14:02:33.5169719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_complex32 PASSED [0.0056s] [ 58%] 2025-12-04T14:02:33.5169842Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_int32 PASSED [0.0055s] [ 58%] 2025-12-04T14:02:33.5169966Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_int64 PASSED [0.0055s] [ 58%] 2025-12-04T14:02:33.5170086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_int8 PASSED [0.0055s] [ 58%] 2025-12-04T14:02:33.5170259Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_int32 PASSED [0.0077s] [ 58%] 2025-12-04T14:02:33.5170413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_bfloat16 PASSED [0.0077s] [ 58%] 2025-12-04T14:02:33.5170554Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_float64 PASSED [0.0082s] [ 58%] 2025-12-04T14:02:33.5170689Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_int32 PASSED [0.0081s] [ 58%] 2025-12-04T14:02:33.5170836Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_int16 PASSED [0.9917s] [ 58%] 2025-12-04T14:02:33.5170970Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_int64 PASSED [0.0098s] [ 58%] 2025-12-04T14:02:33.5171104Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_int8 PASSED [0.0080s] [ 58%] 2025-12-04T14:02:33.5171232Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_bool PASSED [0.0047s] [ 58%] 2025-12-04T14:02:33.5171363Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_float16 PASSED [0.0043s] [ 58%] 2025-12-04T14:02:33.5171494Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_float32 PASSED [0.0042s] [ 58%] 2025-12-04T14:02:33.5171623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_int32 PASSED [0.9930s] [ 58%] 2025-12-04T14:02:33.5171752Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_int64 PASSED [0.0064s] [ 58%] 
2025-12-04T14:02:33.5171880Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_uint8 PASSED [0.0047s] [ 58%] 2025-12-04T14:02:33.5172005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_bfloat16 PASSED [0.0063s] [ 58%] 2025-12-04T14:02:33.5172129Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_float16 PASSED [0.0058s] [ 58%] 2025-12-04T14:02:33.5172250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_float16 PASSED [0.0029s] [ 58%] 2025-12-04T14:02:33.5172369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_float64 PASSED [0.9971s] [ 58%] 2025-12-04T14:02:33.5172488Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_int16 PASSED [0.0041s] [ 58%] 2025-12-04T14:02:33.5172606Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_int32 PASSED [0.9808s] [ 58%] 2025-12-04T14:02:33.5172725Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_uint8 PASSED [0.0042s] [ 58%] 2025-12-04T14:02:33.5172845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_bool PASSED [0.0971s] [ 58%] 2025-12-04T14:02:33.5172969Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int16 PASSED [0.0984s] [ 58%] 2025-12-04T14:02:33.5173112Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int32 PASSED [0.0983s] [ 58%] 2025-12-04T14:02:33.5173238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int8 PASSED [0.1016s] [ 58%] 2025-12-04T14:02:33.5173362Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_uint8 PASSED [0.0986s] [ 58%] 2025-12-04T14:02:33.5173504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_float16 PASSED [0.0090s] [ 58%] 2025-12-04T14:02:33.5173624Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_int8 PASSED [0.9836s] [ 58%] 2025-12-04T14:02:33.5173746Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_uint8 PASSED [0.0059s] [ 58%] 2025-12-04T14:02:33.5173867Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_int64 PASSED [0.0046s] [ 58%] 2025-12-04T14:02:33.5173986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_int8 PASSED [0.9791s] [ 58%] 2025-12-04T14:02:33.5174124Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_bfloat16 PASSED [0.0056s] [ 58%] 2025-12-04T14:02:33.5174243Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_int16 PASSED [0.0035s] [ 58%] 2025-12-04T14:02:33.5174361Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_int64 PASSED [0.9932s] [ 58%] 2025-12-04T14:02:33.5174480Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_uint8 PASSED [0.0045s] [ 58%] 2025-12-04T14:02:33.5174610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_bool PASSED [0.9871s] [ 58%] 2025-12-04T14:02:33.5174735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_complex64 PASSED [0.0047s] [ 58%] 2025-12-04T14:02:33.5174856Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_int8 PASSED [0.9741s] [ 58%] 2025-12-04T14:02:33.5174975Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_uint8 PASSED [0.0047s] [ 58%] 2025-12-04T14:02:33.5175098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_bool PASSED [0.9825s] [ 58%] 2025-12-04T14:02:33.5175222Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_int16 PASSED [0.0046s] [ 58%] 2025-12-04T14:02:33.5175347Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_int32 PASSED 
[0.9729s] [ 58%] 2025-12-04T14:02:33.5175470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_int64 PASSED [0.0046s] [ 58%] 2025-12-04T14:02:33.5175596Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_bfloat16 PASSED [0.0037s] [ 58%] 2025-12-04T14:02:33.5175718Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_bool PASSED [0.9724s] [ 58%] 2025-12-04T14:02:33.5175843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_float16 PASSED [0.0049s] [ 58%] 2025-12-04T14:02:33.5175968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_float64 PASSED [0.0031s] [ 58%] 2025-12-04T14:02:33.5176091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_bfloat16 PASSED [0.9802s] [ 58%] 2025-12-04T14:02:33.5176219Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_complex128 PASSED [0.0076s] [ 58%] 2025-12-04T14:02:33.5176341Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_float16 PASSED [0.9799s] [ 58%] 2025-12-04T14:02:33.5176461Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_int32 PASSED [0.0060s] [ 58%] 2025-12-04T14:02:33.5176579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_int64 PASSED [0.0042s] [ 58%] 2025-12-04T14:02:33.5176698Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_int8 PASSED [0.9846s] [ 58%] 2025-12-04T14:02:33.5176836Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_istft_cuda_complex64 PASSED [0.8243s] [ 58%] 2025-12-04T14:02:33.5176958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_complex32 PASSED [0.9851s] [ 58%] 2025-12-04T14:02:33.5177111Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_bfloat16 PASSED [0.3233s] [ 58%] 
2025-12-04T14:02:33.5177270Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_bool PASSED [0.2569s] [ 58%] 2025-12-04T14:02:33.5177421Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_int16 PASSED [0.0043s] [ 58%] 2025-12-04T14:02:33.5177573Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_uint8 PASSED [0.2523s] [ 58%] 2025-12-04T14:02:33.5177737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_complex128 PASSED [0.0043s] [ 58%] 2025-12-04T14:02:33.5177908Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float32 PASSED [0.0040s] [ 58%] 2025-12-04T14:02:33.5178065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_uint8 PASSED [0.0037s] [ 58%] 2025-12-04T14:02:33.5178200Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_cuda_int16 PASSED [0.0052s] [ 58%] 2025-12-04T14:02:33.5178333Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_cuda_int32 PASSED [0.2493s] [ 58%] 2025-12-04T14:02:33.5178487Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_cuda_int64 PASSED [0.2494s] [ 58%] 2025-12-04T14:02:33.5178643Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_bfloat16 PASSED [0.3042s] [ 59%] 2025-12-04T14:02:33.5178796Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_bool PASSED [0.2467s] [ 59%] 2025-12-04T14:02:33.5178954Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_complex128 PASSED [0.0065s] [ 59%] 2025-12-04T14:02:33.5179111Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_complex64 PASSED [0.2686s] [ 59%] 2025-12-04T14:02:33.5179264Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_float16 PASSED [0.2883s] [ 59%] 2025-12-04T14:02:33.5179420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_float64 PASSED [0.2491s] [ 59%] 2025-12-04T14:02:33.5179571Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_int64 PASSED [0.2492s] [ 59%] 2025-12-04T14:02:33.5179708Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_bool PASSED [1.0952s] [ 59%] 2025-12-04T14:02:33.5179846Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_float16 PASSED [0.0052s] [ 59%] 2025-12-04T14:02:33.5179968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_bfloat16 PASSED [0.0058s] [ 59%] 2025-12-04T14:02:33.5180085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_bool PASSED [0.0043s] [ 59%] 2025-12-04T14:02:33.5180255Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_complex64 PASSED [0.0041s] [ 59%] 2025-12-04T14:02:33.5180374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_int32 PASSED [0.9870s] [ 59%] 2025-12-04T14:02:33.5180490Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_int8 PASSED [0.0060s] [ 59%] 2025-12-04T14:02:33.5180617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_bfloat16 PASSED [0.0059s] [ 59%] 2025-12-04T14:02:33.5180756Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_float16 PASSED [0.0055s] [ 59%] 2025-12-04T14:02:33.5180882Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_float32 PASSED [0.0075s] 
[ 59%] 2025-12-04T14:02:33.5181003Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_int16 PASSED [0.9852s] [ 59%] 2025-12-04T14:02:33.5181130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lcm_cuda_int8 PASSED [0.0208s] [ 59%] 2025-12-04T14:02:33.5181247Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_bool PASSED [0.0161s] [ 59%] 2025-12-04T14:02:33.5181369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_float32 PASSED [0.0130s] [ 59%] 2025-12-04T14:02:33.5181488Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_float64 PASSED [0.0145s] [ 59%] 2025-12-04T14:02:33.5181608Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_int32 PASSED [0.0156s] [ 59%] 2025-12-04T14:02:33.5181739Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_int64 PASSED [0.0156s] [ 59%] 2025-12-04T14:02:33.5181857Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_int8 PASSED [0.0156s] [ 59%] 2025-12-04T14:02:33.5181979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_uint8 PASSED [0.0158s] [ 59%] 2025-12-04T14:02:33.5182100Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_bfloat16 PASSED [0.0119s] [ 59%] 2025-12-04T14:02:33.5182228Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_bool PASSED [0.0093s] [ 59%] 2025-12-04T14:02:33.5182345Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_float64 PASSED [0.0095s] [ 59%] 2025-12-04T14:02:33.5182458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_int32 PASSED [0.0094s] [ 59%] 2025-12-04T14:02:33.5182572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_int64 PASSED [0.0094s] [ 59%] 2025-12-04T14:02:33.5182686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_int8 
PASSED [0.0093s] [ 59%] 2025-12-04T14:02:33.5182806Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_bfloat16 PASSED [1.0126s] [ 59%] 2025-12-04T14:02:33.5182931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_complex128 PASSED [0.0212s] [ 59%] 2025-12-04T14:02:33.5183053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_complex64 PASSED [1.0015s] [ 59%] 2025-12-04T14:02:33.5183174Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_float16 PASSED [0.0154s] [ 59%] 2025-12-04T14:02:33.5183293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_float64 PASSED [0.0136s] [ 59%] 2025-12-04T14:02:33.5183415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_float32 PASSED [0.0041s] [ 59%] 2025-12-04T14:02:33.5183536Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_int16 PASSED [0.9886s] [ 59%] 2025-12-04T14:02:33.5183656Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_int32 PASSED [0.0062s] [ 59%] 2025-12-04T14:02:33.5183774Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_int64 PASSED [0.0044s] [ 59%] 2025-12-04T14:02:33.5183917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cholesky_cuda_complex64 PASSED [0.0180s] [ 59%] 2025-12-04T14:02:33.5184058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cholesky_ex_cuda_float32 PASSED [0.9854s] [ 59%] 2025-12-04T14:02:33.5184191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cond_cuda_complex128 PASSED [0.9974s] [ 59%] 2025-12-04T14:02:33.5184324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_bfloat16 PASSED [0.0057s] [ 59%] 2025-12-04T14:02:33.5184466Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_complex64 PASSED [0.0043s] [ 59%] 
2025-12-04T14:02:33.5184601Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_float32 PASSED [0.0041s] [ 59%] 2025-12-04T14:02:33.5184731Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_float64 PASSED [0.9843s] [ 59%] 2025-12-04T14:02:33.5184878Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_bfloat16 PASSED [0.0081s] [ 59%] 2025-12-04T14:02:33.5185017Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_complex128 PASSED [0.0066s] [ 59%] 2025-12-04T14:02:33.5185156Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_complex32 PASSED [0.0065s] [ 59%] 2025-12-04T14:02:33.5185292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_complex64 PASSED [0.0065s] [ 59%] 2025-12-04T14:02:33.5185428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_float32 PASSED [0.0064s] [ 59%] 2025-12-04T14:02:33.5185574Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_int64 PASSED [0.0064s] [ 59%] 2025-12-04T14:02:33.5185704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eig_cuda_complex64 PASSED [0.0478s] [ 59%] 2025-12-04T14:02:33.5185834Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eig_cuda_float64 PASSED [0.0401s] [ 59%] 2025-12-04T14:02:33.5185975Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvals_cuda_float32 PASSED [0.0420s] [ 59%] 2025-12-04T14:02:33.5186109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvals_cuda_float64 PASSED [0.0409s] [ 59%] 2025-12-04T14:02:33.5186348Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_householder_product_cuda_float32 SKIPPED [0.0011s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 59%] 
2025-12-04T14:02:33.5186482Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_inv_cuda_complex128 PASSED [0.0125s] [ 59%] 2025-12-04T14:02:33.5186623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_ex_cuda_float32 PASSED [0.0043s] [ 59%] 2025-12-04T14:02:33.5186846Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_solve_cuda_complex64 SKIPPED [0.0006s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 59%] 2025-12-04T14:02:33.5187065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_solve_cuda_float64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 59%] 2025-12-04T14:02:33.5187218Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_grad_oriented_cuda_complex64 PASSED [1.8084s] [ 59%] 2025-12-04T14:02:33.5187368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_grad_oriented_cuda_float64 PASSED [0.1156s] [ 59%] 2025-12-04T14:02:33.5187496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_cuda_float32 PASSED [0.0197s] [ 59%] 2025-12-04T14:02:33.5187634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_cuda_complex64 PASSED [0.0399s] [ 59%] 2025-12-04T14:02:33.5187769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_cuda_float32 PASSED [0.0386s] [ 59%] 2025-12-04T14:02:33.5187905Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_cuda_float64 PASSED [0.0367s] [ 59%] 2025-12-04T14:02:33.5188048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_ex_cuda_complex64 PASSED [0.0184s] [ 59%] 2025-12-04T14:02:33.5188187Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_solve_cuda_complex128 PASSED [0.0940s] [ 59%] 2025-12-04T14:02:33.5188339Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_bfloat16 PASSED [0.0816s] [ 59%] 2025-12-04T14:02:33.5188485Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_complex128 PASSED [0.1100s] [ 59%] 2025-12-04T14:02:33.5188626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_complex64 PASSED [0.1107s] [ 59%] 2025-12-04T14:02:33.5188773Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_float16 PASSED [0.0794s] [ 59%] 2025-12-04T14:02:33.5188911Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_float32 PASSED [0.1176s] [ 59%] 2025-12-04T14:02:33.5189048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_float64 PASSED [0.1177s] [ 59%] 2025-12-04T14:02:33.5189190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_rank_cuda_complex128 PASSED [0.1674s] [ 59%] 2025-12-04T14:02:33.5189331Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_rank_cuda_complex64 PASSED [0.1699s] [ 59%] 2025-12-04T14:02:33.5189498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_rank_hermitian_cuda_complex128 PASSED [0.0212s] [ 59%] 2025-12-04T14:02:33.5189649Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_rank_hermitian_cuda_float32 PASSED [0.0218s] [ 59%] 2025-12-04T14:02:33.5189791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_multi_dot_cuda_complex128 PASSED [0.0119s] [ 59%] 2025-12-04T14:02:33.5189937Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_multi_dot_cuda_float32 PASSED [0.0118s] [ 59%] 2025-12-04T14:02:33.5190068Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_cuda_float64 PASSED [0.1497s] [ 59%] 2025-12-04T14:02:33.5190232Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_cuda_complex128 PASSED [0.0585s] [ 59%] 2025-12-04T14:02:33.5190453Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_singular_cuda_float32 SKIPPED [0.0007s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 59%] 2025-12-04T14:02:33.5190672Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_singular_cuda_float64 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 59%] 2025-12-04T14:02:33.5190804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_qr_cuda_complex64 PASSED [0.0230s] [ 59%] 2025-12-04T14:02:33.5190932Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_qr_cuda_float64 PASSED [0.0219s] [ 59%] 2025-12-04T14:02:33.5191060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svd_cuda_float32 PASSED [0.1182s] [ 59%] 2025-12-04T14:02:33.5191198Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svdvals_cuda_complex128 PASSED [0.0245s] [ 59%] 2025-12-04T14:02:33.5191343Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorsolve_cuda_complex128 PASSED [0.0139s] [ 59%] 2025-12-04T14:02:33.5191485Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorsolve_cuda_complex64 PASSED [1.6454s] [ 59%] 2025-12-04T14:02:33.5191620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_complex64 PASSED [0.0323s] [ 59%] 2025-12-04T14:02:33.5191754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_float32 PASSED [1.6369s] [ 59%] 2025-12-04T14:02:33.5191882Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_int8 PASSED [0.0379s] [ 59%] 2025-12-04T14:02:33.5192015Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_bfloat16 PASSED 
[0.0612s] [ 59%] 2025-12-04T14:02:33.5192151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_complex128 PASSED [0.0715s] [ 59%] 2025-12-04T14:02:33.5192297Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_float32 PASSED [0.0451s] [ 59%] 2025-12-04T14:02:33.5192431Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_float64 PASSED [1.6889s] [ 59%] 2025-12-04T14:02:33.5192568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vector_norm_cuda_float16 PASSED [0.1920s] [ 59%] 2025-12-04T14:02:33.5192730Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vector_norm_cuda_float32 PASSED [0.1455s] [ 59%] 2025-12-04T14:02:33.5192869Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vector_norm_cuda_float64 PASSED [0.1454s] [ 59%] 2025-12-04T14:02:33.5192996Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_float16 PASSED [0.0218s] [ 59%] 2025-12-04T14:02:33.5193118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_int32 PASSED [0.0216s] [ 59%] 2025-12-04T14:02:33.5193260Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_int64 PASSED [0.0215s] [ 59%] 2025-12-04T14:02:33.5193409Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_bfloat16 PASSED [0.1104s] [ 59%] 2025-12-04T14:02:33.5193558Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_float64 PASSED [0.1107s] [ 59%] 2025-12-04T14:02:33.5193681Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_bfloat16 PASSED [0.0042s] [ 59%] 2025-12-04T14:02:33.5193812Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_bool PASSED [1.6140s] [ 59%] 2025-12-04T14:02:33.5193933Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_float32 PASSED [0.0064s] [ 59%] 2025-12-04T14:02:33.5194054Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_int16 PASSED [0.0046s] [ 59%] 2025-12-04T14:02:33.5194173Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_int8 PASSED [1.6386s] [ 59%] 2025-12-04T14:02:33.5194299Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_complex128 PASSED [0.0050s] [ 59%] 2025-12-04T14:02:33.5194425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_complex64 PASSED [1.6330s] [ 59%] 2025-12-04T14:02:33.5194546Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_float16 PASSED [0.0057s] [ 59%] 2025-12-04T14:02:33.5194664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_int8 PASSED [0.0037s] [ 59%] 2025-12-04T14:02:33.5194781Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_bool PASSED [1.6376s] [ 59%] 2025-12-04T14:02:33.5194900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_int32 PASSED [0.0065s] [ 59%] 2025-12-04T14:02:33.5195018Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_int64 PASSED [0.0045s] [ 59%] 2025-12-04T14:02:33.5195136Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_uint8 PASSED [1.6390s] [ 59%] 2025-12-04T14:02:33.5195256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_complex64 PASSED [0.1418s] [ 59%] 2025-12-04T14:02:33.5195373Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_int64 PASSED [0.0046s] [ 59%] 2025-12-04T14:02:33.5195488Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_uint8 PASSED [1.6277s] [ 59%] 2025-12-04T14:02:33.5195616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_normal_cuda_float32 PASSED [0.0069s] 
[ 59%] 2025-12-04T14:02:33.5195745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_cuda_float16 PASSED [0.0128s] [ 59%] 2025-12-04T14:02:33.5195873Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_cuda_float64 PASSED [0.0103s] [ 59%] 2025-12-04T14:02:33.5196032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_complex64 PASSED [0.0112s] [ 59%] 2025-12-04T14:02:33.5196177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_float32 PASSED [1.6381s] [ 59%] 2025-12-04T14:02:33.5196317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_int8 PASSED [1.6232s] [ 59%] 2025-12-04T14:02:33.5196455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_bfloat16 PASSED [0.0312s] [ 59%] 2025-12-04T14:02:33.5196586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_complex32 PASSED [0.5892s] [ 59%] 2025-12-04T14:02:33.5196723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_complex128 PASSED [0.0060s] [ 59%] 2025-12-04T14:02:33.5196859Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_complex64 PASSED [0.0055s] [ 59%] 2025-12-04T14:02:33.5196983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logdet_cuda_float32 PASSED [0.0271s] [ 60%] 2025-12-04T14:02:33.5197119Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logdet_cuda_float64 PASSED [1.6273s] [ 60%] 2025-12-04T14:02:33.5197250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_bfloat16 PASSED [0.0179s] [ 60%] 2025-12-04T14:02:33.5197384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_complex64 PASSED [0.4076s] [ 60%] 2025-12-04T14:02:33.5197522Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_float16 PASSED [0.0152s] [ 60%] 2025-12-04T14:02:33.5197652Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_float64 PASSED [0.0124s] [ 60%] 2025-12-04T14:02:33.5197777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_int8 PASSED [0.0123s] [ 60%] 2025-12-04T14:02:33.5197903Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_uint8 PASSED [0.0121s] [ 60%] 2025-12-04T14:02:33.5198038Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_complex128 PASSED [1.5993s] [ 60%] 2025-12-04T14:02:33.5198166Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_float32 PASSED [0.0062s] [ 60%] 2025-12-04T14:02:33.5198296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_float64 PASSED [0.0045s] [ 60%] 2025-12-04T14:02:33.5198422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_int32 PASSED [0.0043s] [ 60%] 2025-12-04T14:02:33.5198547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_bool PASSED [0.0102s] [ 60%] 2025-12-04T14:02:33.5198678Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_complex128 PASSED [0.4107s] [ 60%] 2025-12-04T14:02:33.5198806Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_float64 PASSED [0.0125s] [ 60%] 2025-12-04T14:02:33.5198931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_int16 PASSED [0.0120s] [ 60%] 2025-12-04T14:02:33.5199056Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_bool PASSED [0.0101s] [ 60%] 2025-12-04T14:02:33.5199190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_complex64 PASSED [0.0146s] [ 60%] 2025-12-04T14:02:33.5199322Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_float32 PASSED [0.0121s] [ 60%] 2025-12-04T14:02:33.5199454Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_float64 PASSED [0.0121s] [ 60%] 2025-12-04T14:02:33.5199571Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_bool PASSED [1.6178s] [ 60%] 2025-12-04T14:02:33.5199702Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_float16 PASSED [0.0116s] [ 60%] 2025-12-04T14:02:33.5199824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_float32 PASSED [0.0077s] [ 60%] 2025-12-04T14:02:33.5199944Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_float64 PASSED [1.6356s] [ 60%] 2025-12-04T14:02:33.5200064Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_int64 PASSED [0.0117s] [ 60%] 2025-12-04T14:02:33.5200244Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_bfloat16 PASSED [0.1270s] [ 60%] 2025-12-04T14:02:33.5200374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_complex128 PASSED [0.1289s] [ 60%] 2025-12-04T14:02:33.5200503Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_complex64 PASSED [0.1260s] [ 60%] 2025-12-04T14:02:33.5200628Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_float16 PASSED [0.1257s] [ 60%] 2025-12-04T14:02:33.5200752Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_int64 PASSED [0.1179s] [ 60%] 2025-12-04T14:02:33.5200914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_float16 PASSED [0.7280s] [ 60%] 2025-12-04T14:02:33.5201062Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_float32 PASSED [0.7267s] [ 60%] 2025-12-04T14:02:33.5201210Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_float64 PASSED [0.7303s] [ 60%] 2025-12-04T14:02:33.5201366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_uint8 PASSED [0.2243s] [ 60%] 2025-12-04T14:02:33.5201491Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_bool PASSED [0.0126s] [ 60%] 2025-12-04T14:02:33.5201617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_float32 PASSED [0.0258s] [ 60%] 2025-12-04T14:02:33.5201745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_float64 PASSED [0.0255s] [ 60%] 2025-12-04T14:02:33.5201868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_int8 PASSED [0.0113s] [ 60%] 2025-12-04T14:02:33.5201992Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_complex128 PASSED [1.6495s] [ 60%] 2025-12-04T14:02:33.5202114Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_complex32 PASSED [0.0043s] [ 60%] 2025-12-04T14:02:33.5202233Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_int32 PASSED [1.6285s] [ 60%] 2025-12-04T14:02:33.5202350Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_float16 PASSED [0.0146s] [ 60%] 2025-12-04T14:02:33.5202468Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_float64 PASSED [0.0099s] [ 60%] 2025-12-04T14:02:33.5202583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_uint8 PASSED [0.0096s] [ 60%] 2025-12-04T14:02:33.5202713Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_complex128 PASSED [0.0304s] [ 60%] 2025-12-04T14:02:33.5202837Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_float32 PASSED [0.0295s] [ 60%] 2025-12-04T14:02:33.5202969Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_unpack_cuda_complex64 PASSED [0.0222s] [ 60%] 2025-12-04T14:02:33.5203094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_unpack_cuda_float64 PASSED [0.0216s] [ 60%] 2025-12-04T14:02:33.5203214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_bfloat16 PASSED [1.6314s] [ 60%] 2025-12-04T14:02:33.5203329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_bool PASSED [0.0059s] [ 60%] 2025-12-04T14:02:33.5203449Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_float16 PASSED [0.0042s] [ 60%] 2025-12-04T14:02:33.5203581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_float32 PASSED [1.6287s] [ 60%] 2025-12-04T14:02:33.5203696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_int16 PASSED [0.0089s] [ 60%] 2025-12-04T14:02:33.5203818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_complex128 PASSED [0.0051s] [ 60%] 2025-12-04T14:02:33.5203947Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_float64 PASSED [1.6494s] [ 60%] 2025-12-04T14:02:33.5204062Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_int32 PASSED [0.0063s] [ 60%] 2025-12-04T14:02:33.5204174Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_int64 PASSED [0.0043s] [ 60%] 2025-12-04T14:02:33.5204305Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_bfloat16 PASSED [0.1752s] [ 60%] 2025-12-04T14:02:33.5204434Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_float16 PASSED [0.1733s] [ 60%] 2025-12-04T14:02:33.5204570Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_int8 PASSED [0.1267s] [ 60%] 2025-12-04T14:02:33.5204699Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_float64 PASSED [0.1491s] [ 60%] 2025-12-04T14:02:33.5204826Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_int16 PASSED [0.1271s] [ 60%] 2025-12-04T14:02:33.5204952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_uint8 PASSED [0.1306s] [ 60%] 2025-12-04T14:02:33.5205094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_bfloat16 PASSED [0.0881s] [ 60%] 2025-12-04T14:02:33.5205226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_float16 PASSED [0.0873s] [ 60%] 2025-12-04T14:02:33.5205358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_float64 PASSED [0.0873s] [ 60%] 2025-12-04T14:02:33.5205489Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_int16 PASSED [0.0741s] [ 60%] 2025-12-04T14:02:33.5205621Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_float16 PASSED [0.0873s] [ 60%] 2025-12-04T14:02:33.5205754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_uint8 PASSED [0.0779s] [ 60%] 2025-12-04T14:02:33.5205892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_complex64 PASSED [0.0467s] [ 60%] 2025-12-04T14:02:33.5206026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_float16 PASSED [0.0501s] [ 60%] 2025-12-04T14:02:33.5206158Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_float32 PASSED [0.0458s] [ 60%] 2025-12-04T14:02:33.5206290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_int32 PASSED [0.0453s] [ 60%] 2025-12-04T14:02:33.5206423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_int64 PASSED [0.0456s] [ 60%] 
2025-12-04T14:02:33.5206551Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_int8 PASSED [0.0453s] [ 60%] 2025-12-04T14:02:33.5206688Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_complex64 PASSED [0.0457s] [ 60%] 2025-12-04T14:02:33.5206818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_float16 PASSED [0.0499s] [ 60%] 2025-12-04T14:02:33.5206951Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_bfloat16 PASSED [1.6466s] [ 60%] 2025-12-04T14:02:33.5207076Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_bool PASSED [0.0133s] [ 60%] 2025-12-04T14:02:33.5207217Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_complex32 PASSED [0.0111s] [ 60%] 2025-12-04T14:02:33.5207347Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_float16 PASSED [0.0106s] [ 60%] 2025-12-04T14:02:33.5207476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_float32 PASSED [0.0105s] [ 60%] 2025-12-04T14:02:33.5207600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_int8 PASSED [0.0103s] [ 60%] 2025-12-04T14:02:33.5207744Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_uint8 PASSED [1.6630s] [ 60%] 2025-12-04T14:02:33.5207885Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_bfloat16 PASSED [0.0629s] [ 60%] 2025-12-04T14:02:33.5208024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_float32 PASSED [0.0531s] [ 60%] 2025-12-04T14:02:33.5208160Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logaddexp_cuda_float64 PASSED [0.0538s] [ 60%] 2025-12-04T14:02:33.5208311Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_complex128 PASSED [0.2358s] [ 60%] 2025-12-04T14:02:33.5208449Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_complex64 PASSED [0.2353s] [ 60%] 2025-12-04T14:02:33.5208579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_bfloat16 PASSED [0.3197s] [ 60%] 2025-12-04T14:02:33.5208713Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_complex128 PASSED [0.2392s] [ 60%] 2025-12-04T14:02:33.5208852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_float16 PASSED [0.3148s] [ 60%] 2025-12-04T14:02:33.5208982Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_float32 PASSED [0.2813s] [ 60%] 2025-12-04T14:02:33.5209110Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_float64 PASSED [0.2795s] [ 60%] 2025-12-04T14:02:33.5209240Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_norm_cuda_float32 PASSED [2.5536s] [ 60%] 2025-12-04T14:02:33.5209380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_normalize_cuda_complex128 PASSED [0.0911s] [ 60%] 2025-12-04T14:02:33.5209516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_normalize_cuda_float32 PASSED [0.0801s] [ 60%] 2025-12-04T14:02:33.5209642Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_bool PASSED [0.1520s] [ 60%] 2025-12-04T14:02:33.5209776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_complex128 PASSED [2.3820s] [ 60%] 2025-12-04T14:02:33.5209907Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_float64 PASSED [0.7855s] [ 60%] 2025-12-04T14:02:33.5210034Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_int16 PASSED [0.1541s] [ 
60%] 2025-12-04T14:02:33.5210203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_uint8 PASSED [0.1511s] [ 60%] 2025-12-04T14:02:33.5210334Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_int16 PASSED [0.0060s] [ 60%] 2025-12-04T14:02:33.5210464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_int32 PASSED [0.0056s] [ 60%] 2025-12-04T14:02:33.5210595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_int64 PASSED [0.0057s] [ 60%] 2025-12-04T14:02:33.5210725Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_bool PASSED [0.0069s] [ 60%] 2025-12-04T14:02:33.5210856Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_float32 PASSED [0.0068s] [ 60%] 2025-12-04T14:02:33.5210985Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_int32 PASSED [0.0068s] [ 60%] 2025-12-04T14:02:33.5211124Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_int8 PASSED [0.0067s] [ 60%] 2025-12-04T14:02:33.5211254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_uint8 PASSED [0.0068s] [ 60%] 2025-12-04T14:02:33.5211387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_softmax_cuda_bfloat16 PASSED [0.0383s] [ 60%] 2025-12-04T14:02:33.5211533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_softmax_cuda_float32 PASSED [0.0381s] [ 60%] 2025-12-04T14:02:33.5211667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_softmin_cuda_float16 PASSED [0.0535s] [ 60%] 2025-12-04T14:02:33.5211794Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_float16 PASSED [0.5662s] [ 60%] 2025-12-04T14:02:33.5211921Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_float64 PASSED 
[0.4573s] [ 60%] 2025-12-04T14:02:33.5212047Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_int32 PASSED [0.4884s] [ 60%] 2025-12-04T14:02:33.5212190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_bfloat16 PASSED [0.1750s] [ 60%] 2025-12-04T14:02:33.5212321Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_complex64 PASSED [0.1281s] [ 60%] 2025-12-04T14:02:33.5212447Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_int32 PASSED [0.1431s] [ 60%] 2025-12-04T14:02:33.5212582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_int64 PASSED [1.7673s] [ 60%] 2025-12-04T14:02:33.5212714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_complex128 PASSED [0.4935s] [ 60%] 2025-12-04T14:02:33.5212844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_complex64 PASSED [0.4874s] [ 60%] 2025-12-04T14:02:33.5212969Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_int16 PASSED [0.4647s] [ 60%] 2025-12-04T14:02:33.5213094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_int32 PASSED [0.4675s] [ 60%] 2025-12-04T14:02:33.5213218Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matmul_cuda_bfloat16 PASSED [0.0371s] [ 60%] 2025-12-04T14:02:33.5213341Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matmul_cuda_float64 PASSED [0.0307s] [ 60%] 2025-12-04T14:02:33.5213468Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matrix_exp_cuda_bfloat16 PASSED [1.6276s] [ 60%] 2025-12-04T14:02:33.5213600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matrix_exp_cuda_complex128 PASSED [0.0092s] [ 60%] 2025-12-04T14:02:33.5213731Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matrix_exp_cuda_complex64 PASSED [0.0063s] [ 
60%] 2025-12-04T14:02:33.5213855Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_int16 PASSED [0.0100s] [ 60%] 2025-12-04T14:02:33.5213979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_int32 PASSED [0.0095s] [ 60%] 2025-12-04T14:02:33.5214138Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_pool2d_with_indices_backward_cuda_bfloat16 PASSED [2.2933s] [ 60%] 2025-12-04T14:02:33.5214278Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_bool PASSED [1.6472s] [ 60%] 2025-12-04T14:02:33.5214421Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_float64 PASSED [0.0054s] [ 60%] 2025-12-04T14:02:33.5216221Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_int32 PASSED [0.0040s] [ 61%] 2025-12-04T14:02:33.5216348Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_float16 PASSED [0.0135s] [ 61%] 2025-12-04T14:02:33.5216494Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_int32 PASSED [0.0095s] [ 61%] 2025-12-04T14:02:33.5216620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_int64 PASSED [0.0095s] [ 61%] 2025-12-04T14:02:33.5216742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_uint8 PASSED [0.0094s] [ 61%] 2025-12-04T14:02:33.5216880Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mean_cuda_complex128 PASSED [1.6634s] [ 61%] 2025-12-04T14:02:33.5217002Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mean_cuda_float64 PASSED [0.0187s] [ 61%] 2025-12-04T14:02:33.5217126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_int16 PASSED [1.6346s] [ 61%] 2025-12-04T14:02:33.5217247Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_uint8 PASSED [0.0131s] [ 61%] 
2025-12-04T14:02:33.5217399Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_complex64 PASSED [0.0130s] [ 61%] 2025-12-04T14:02:33.5217561Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_float64 PASSED [0.0124s] [ 61%] 2025-12-04T14:02:33.5217706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_int8 PASSED [0.0123s] [ 61%] 2025-12-04T14:02:33.5217850Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_uint8 PASSED [0.0122s] [ 61%] 2025-12-04T14:02:33.5218000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_bool PASSED [0.0122s] [ 61%] 2025-12-04T14:02:33.5218160Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_float16 PASSED [0.0123s] [ 61%] 2025-12-04T14:02:33.5218311Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_float64 PASSED [0.0121s] [ 61%] 2025-12-04T14:02:33.5218459Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_int32 PASSED [0.0122s] [ 61%] 2025-12-04T14:02:33.5218606Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_int8 PASSED [0.0121s] [ 61%] 2025-12-04T14:02:33.5218735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_bfloat16 PASSED [0.0132s] [ 61%] 2025-12-04T14:02:33.5218860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_bool PASSED [0.0094s] [ 61%] 2025-12-04T14:02:33.5218989Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_float16 PASSED [0.0129s] [ 61%] 2025-12-04T14:02:33.5219115Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_int16 PASSED [0.0096s] [ 61%] 2025-12-04T14:02:33.5219241Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_uint8 PASSED [0.0095s] [ 61%] 2025-12-04T14:02:33.5219380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_bool PASSED [0.0032s] [ 61%] 2025-12-04T14:02:33.5219523Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_float16 PASSED [1.6415s] [ 61%] 2025-12-04T14:02:33.5219663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_float32 PASSED [0.0050s] [ 61%] 2025-12-04T14:02:33.5219809Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_float64 PASSED [0.0034s] [ 61%] 2025-12-04T14:02:33.5219947Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_int16 PASSED [1.6360s] [ 61%] 2025-12-04T14:02:33.5220213Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_bfloat16 PASSED [0.0060s] [ 61%] 2025-12-04T14:02:33.5220358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_float16 PASSED [1.6142s] [ 61%] 2025-12-04T14:02:33.5220516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_int32 PASSED [0.0059s] [ 61%] 2025-12-04T14:02:33.5220658Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_int8 PASSED [1.6262s] [ 61%] 2025-12-04T14:02:33.5220798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_uint8 PASSED [0.0057s] [ 61%] 2025-12-04T14:02:33.5220938Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_minimum_cuda_bfloat16 PASSED [0.0143s] [ 61%] 2025-12-04T14:02:33.5221060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_minimum_cuda_int64 PASSED [0.0098s] [ 61%] 2025-12-04T14:02:33.5221182Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_minimum_cuda_uint8 PASSED [0.0096s] [ 61%] 2025-12-04T14:02:33.5221302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mm_cuda_complex64 PASSED [0.0051s] [ 61%] 2025-12-04T14:02:33.5221421Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mm_cuda_float16 PASSED [0.0042s] [ 61%] 2025-12-04T14:02:33.5221551Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mm_cuda_float32 PASSED [1.6534s] [ 61%] 2025-12-04T14:02:33.5221669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mm_cuda_float64 PASSED [0.0066s] [ 61%] 2025-12-04T14:02:33.5221786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mode_cuda_bool PASSED [0.0053s] [ 61%] 2025-12-04T14:02:33.5221907Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mode_cuda_float64 PASSED [0.0051s] [ 61%] 2025-12-04T14:02:33.5222040Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_int32 PASSED [1.6643s] [ 61%] 2025-12-04T14:02:33.5222161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_int8 PASSED [0.0048s] [ 61%] 2025-12-04T14:02:33.5222284Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_uint8 PASSED [1.6080s] [ 61%] 2025-12-04T14:02:33.5222406Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_bfloat16 PASSED [0.0064s] [ 61%] 2025-12-04T14:02:33.5222531Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_float16 PASSED [0.0045s] [ 61%] 2025-12-04T14:02:33.5222650Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_int64 PASSED [1.6429s] [ 61%] 2025-12-04T14:02:33.5222769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_int8 PASSED [0.0065s] [ 61%] 2025-12-04T14:02:33.5222887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_uint8 PASSED [0.0043s] [ 
61%] 2025-12-04T14:02:33.5223003Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_bool PASSED [0.0100s] [ 61%] 2025-12-04T14:02:33.5223121Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_float64 PASSED [0.0097s] [ 61%] 2025-12-04T14:02:33.5223239Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_int32 PASSED [0.0095s] [ 61%] 2025-12-04T14:02:33.5223355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_int64 PASSED [0.0096s] [ 61%] 2025-12-04T14:02:33.5223486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_multinomial_cuda_bfloat16 PASSED [0.0088s] [ 61%] 2025-12-04T14:02:33.5223615Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_multinomial_cuda_float16 PASSED [0.0087s] [ 61%] 2025-12-04T14:02:33.5223735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mv_cuda_bfloat16 PASSED [1.6351s] [ 61%] 2025-12-04T14:02:33.5223855Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mv_cuda_complex64 PASSED [0.0057s] [ 61%] 2025-12-04T14:02:33.5224006Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float16 PASSED [0.0212s] [ 61%] 2025-12-04T14:02:33.5224147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int32 PASSED [0.0188s] [ 61%] 2025-12-04T14:02:33.5224303Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int8 PASSED [0.0186s] [ 61%] 2025-12-04T14:02:33.5224448Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float64 PASSED [0.0171s] [ 61%] 2025-12-04T14:02:33.5224590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float16 PASSED [0.0195s] [ 61%] 2025-12-04T14:02:33.5224745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float32 PASSED 
[1.6386s] [ 61%] 2025-12-04T14:02:33.5224886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int16 PASSED [0.0226s] [ 61%] 2025-12-04T14:02:33.5225026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int64 PASSED [0.0190s] [ 61%] 2025-12-04T14:02:33.5225164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int8 PASSED [0.0186s] [ 61%] 2025-12-04T14:02:33.5225304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_bfloat16 PASSED [1.6522s] [ 61%] 2025-12-04T14:02:33.5225429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_int16 PASSED [0.0062s] [ 61%] 2025-12-04T14:02:33.5225555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_int8 PASSED [0.0041s] [ 61%] 2025-12-04T14:02:33.5225682Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_complex32 PASSED [0.0926s] [ 61%] 2025-12-04T14:02:33.5225820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_complex64 PASSED [0.0816s] [ 61%] 2025-12-04T14:02:33.5225944Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_float16 PASSED [0.0905s] [ 61%] 2025-12-04T14:02:33.5226071Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_bfloat16 PASSED [0.0109s] [ 61%] 2025-12-04T14:02:33.5226197Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_int16 PASSED [0.0106s] [ 61%] 2025-12-04T14:02:33.5226320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_uint8 PASSED [0.0105s] [ 61%] 2025-12-04T14:02:33.5226443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_bfloat16 PASSED [0.0196s] [ 61%] 2025-12-04T14:02:33.5226563Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_bool PASSED [0.0145s] 
[ 61%] 2025-12-04T14:02:33.5226684Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_int16 PASSED [0.0145s] [ 61%] 2025-12-04T14:02:33.5226816Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_complex64 XFAIL [0.0031s] [ 61%] 2025-12-04T14:02:33.5226942Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_int32 XFAIL [1.6468s] [ 61%] 2025-12-04T14:02:33.5227064Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_float32 PASSED [1.6508s] [ 61%] 2025-12-04T14:02:33.5227204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_batch_norm_cuda_float16 PASSED [1.6256s] [ 61%] 2025-12-04T14:02:33.5227341Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_batch_norm_cuda_float32 PASSED [0.0198s] [ 61%] 2025-12-04T14:02:33.5227489Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_dropout_backward_cuda_float32 PASSED [0.0125s] [ 61%] 2025-12-04T14:02:33.5227610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_bfloat16 PASSED [0.0122s] [ 61%] 2025-12-04T14:02:33.5227731Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_complex128 PASSED [0.0098s] [ 61%] 2025-12-04T14:02:33.5227849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_float32 PASSED [0.0096s] [ 61%] 2025-12-04T14:02:33.5227974Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_int32 PASSED [0.0096s] [ 61%] 2025-12-04T14:02:33.5228091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_int8 PASSED [0.0096s] [ 61%] 2025-12-04T14:02:33.5228206Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_uint8 PASSED [0.0095s] [ 61%] 2025-12-04T14:02:33.5228329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_complex128 PASSED [1.6471s] [ 61%] 2025-12-04T14:02:33.5228463Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_complex32 PASSED [0.0056s] [ 61%] 2025-12-04T14:02:33.5228583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_float16 PASSED [0.0037s] [ 61%] 2025-12-04T14:02:33.5228707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_int32 PASSED [0.0066s] [ 61%] 2025-12-04T14:02:33.5228845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_bfloat16 PASSED [0.0066s] [ 61%] 2025-12-04T14:02:33.5228980Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_bool PASSED [0.0065s] [ 61%] 2025-12-04T14:02:33.5229126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_int64 PASSED [0.0064s] [ 61%] 2025-12-04T14:02:33.5229254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_complex64 PASSED [0.0069s] [ 61%] 2025-12-04T14:02:33.5229377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_int16 PASSED [0.0067s] [ 61%] 2025-12-04T14:02:33.5229510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_uint8 PASSED [0.0067s] [ 61%] 2025-12-04T14:02:33.5229639Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_complex128 PASSED [0.0066s] [ 61%] 2025-12-04T14:02:33.5229764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_float32 PASSED [0.0066s] [ 61%] 2025-12-04T14:02:33.5229886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_int32 PASSED [0.0065s] [ 61%] 2025-12-04T14:02:33.5230017Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_complex64 PASSED [0.0066s] [ 61%] 2025-12-04T14:02:33.5230182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_float16 PASSED [0.0065s] [ 61%] 2025-12-04T14:02:33.5230307Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_int16 PASSED [0.0065s] [ 61%] 2025-12-04T14:02:33.5230429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_int32 PASSED [0.0065s] [ 61%] 2025-12-04T14:02:33.5230554Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_int64 PASSED [0.0065s] [ 61%] 2025-12-04T14:02:33.5230681Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nextafter_cuda_bfloat16 PASSED [0.0098s] [ 61%] 2025-12-04T14:02:33.5230811Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nextafter_cuda_float16 PASSED [0.0096s] [ 61%] 2025-12-04T14:02:33.5230941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nextafter_cuda_float64 PASSED [0.0096s] [ 61%] 2025-12-04T14:02:33.5231103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float16 PASSED [0.0176s] [ 61%] 2025-12-04T14:02:33.5231267Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float16 PASSED [0.0103s] [ 61%] 2025-12-04T14:02:33.5231426Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float32 PASSED [0.0093s] [ 61%] 2025-12-04T14:02:33.5231586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_bfloat16 PASSED [0.0118s] [ 61%] 2025-12-04T14:02:33.5231744Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float16 PASSED [0.0117s] [ 61%] 2025-12-04T14:02:33.5231916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [0.0117s] [ 61%] 2025-12-04T14:02:33.5232073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float64 PASSED [0.0175s] [ 61%] 2025-12-04T14:02:33.5232245Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_bfloat16 PASSED [0.0152s] [ 61%] 2025-12-04T14:02:33.5232397Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_alpha_dropout_cuda_float32 PASSED [0.0166s] [ 61%] 2025-12-04T14:02:33.5232549Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_alpha_dropout_cuda_float64 PASSED [0.0162s] [ 61%] 2025-12-04T14:02:33.5232697Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool2d_cuda_float32 PASSED [0.0061s] [ 61%] 2025-12-04T14:02:33.5232844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool2d_cuda_float64 PASSED [0.0058s] [ 61%] 2025-12-04T14:02:33.5233005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool3d_cuda_bfloat16 PASSED [0.0069s] [ 61%] 2025-12-04T14:02:33.5233150Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool3d_cuda_float64 PASSED [0.0069s] [ 61%] 2025-12-04T14:02:33.5233298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_batch_norm_cuda_float16 PASSED [0.0381s] [ 61%] 2025-12-04T14:02:33.5233480Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float16 PASSED [0.0376s] [ 61%] 2025-12-04T14:02:33.5233645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float64 PASSED [0.0301s] [ 61%] 2025-12-04T14:02:33.5233792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_bilinear_cuda_bfloat16 PASSED [0.0960s] [ 61%] 2025-12-04T14:02:33.5233938Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_bilinear_cuda_float16 PASSED [0.0951s] [ 61%] 2025-12-04T14:02:33.5234114Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32 PASSED [0.0591s] [ 61%] 2025-12-04T14:02:33.5234290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float64 PASSED [0.0608s] [ 61%] 2025-12-04T14:02:33.5234430Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_celu_cuda_float64 PASSED [0.0062s] [ 61%] 2025-12-04T14:02:33.5234583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_channel_shuffle_cuda_float16 PASSED [1.6654s] [ 62%] 2025-12-04T14:02:33.5234735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_channel_shuffle_cuda_uint8 PASSED [0.0076s] [ 62%] 2025-12-04T14:02:33.5234882Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_complex128 PASSED [0.0963s] [ 62%] 2025-12-04T14:02:33.5235024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_float32 PASSED [0.1480s] [ 62%] 2025-12-04T14:02:33.5235345Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv2d_cuda_float16 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 1200, provided ptr: 0x72d202000c00 size: 768 2025-12-04T14:02:33.5235530Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 1200, provided ptr: 0x72d202000c00 size: 768 2025-12-04T14:02:33.5235571Z PASSED [0.8521s] [ 62%] 2025-12-04T14:02:33.5235720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_complex128 PASSED [0.1915s] [ 62%] 2025-12-04T14:02:33.5235877Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_complex32 PASSED [1.9232s] [ 62%] 2025-12-04T14:02:33.5236018Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_float16 PASSED [0.0276s] [ 62%] 2025-12-04T14:02:33.5236159Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_float32 PASSED [0.0249s] [ 62%] 2025-12-04T14:02:33.5236308Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_float64 PASSED [1.6862s] [ 62%] 2025-12-04T14:02:33.5236467Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose1d_cuda_complex32 PASSED [0.1502s] [ 62%] 2025-12-04T14:02:33.5236624Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose1d_cuda_float16 PASSED [0.0131s] [ 62%] 2025-12-04T14:02:33.5236780Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose1d_cuda_float64 PASSED [0.0127s] [ 62%] 2025-12-04T14:02:33.5236948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose2d_cuda_float16 PASSED [0.0147s] [ 62%] 2025-12-04T14:02:33.5237100Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose2d_cuda_float32 PASSED [0.0147s] [ 62%] 2025-12-04T14:02:33.5237258Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_complex64 PASSED [0.0977s] [ 62%] 2025-12-04T14:02:33.5237433Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int64 PASSED [0.0606s] [ 62%] 2025-12-04T14:02:33.5237591Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_uint8 PASSED [1.7199s] [ 62%] 2025-12-04T14:02:33.5237746Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_similarity_cuda_float32 PASSED [0.0598s] [ 62%] 2025-12-04T14:02:33.5237899Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cross_entropy_cuda_float16 PASSED [0.0904s] [ 62%] 2025-12-04T14:02:33.5238051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cross_entropy_cuda_float64 PASSED [0.0674s] [ 62%] 2025-12-04T14:02:33.5238196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_ctc_loss_cuda_float32 PASSED [0.0327s] [ 62%] 2025-12-04T14:02:33.5238341Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_ctc_loss_cuda_float64 PASSED [0.0320s] [ 62%] 2025-12-04T14:02:33.5238488Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_bfloat16 PASSED [0.0142s] [ 62%] 2025-12-04T14:02:33.5238634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_float16 PASSED [0.0141s] [ 62%] 2025-12-04T14:02:33.5238778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_float32 PASSED [1.6755s] [ 62%] 2025-12-04T14:02:33.5238924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_float64 PASSED [0.0144s] [ 62%] 2025-12-04T14:02:33.5239068Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout3d_cuda_float64 PASSED [0.0148s] [ 62%] 2025-12-04T14:02:33.5239212Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout_cuda_bfloat16 PASSED [0.0163s] [ 62%] 2025-12-04T14:02:33.5239363Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_bag_cuda_float64 PASSED [0.0740s] [ 62%] 2025-12-04T14:02:33.5239510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_cuda_bfloat16 PASSED [0.0100s] [ 62%] 2025-12-04T14:02:33.5239654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_cuda_float64 PASSED [1.6668s] [ 62%] 
2025-12-04T14:02:33.5239843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32 PASSED [0.0139s] [ 62%] 2025-12-04T14:02:33.5240021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_bool PASSED [1.6743s] [ 62%] 2025-12-04T14:02:33.5240245Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex128 PASSED [0.0060s] [ 62%] 2025-12-04T14:02:33.5240440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [1.6553s] [ 62%] 2025-12-04T14:02:33.5240616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int64 PASSED [0.0059s] [ 62%] 2025-12-04T14:02:33.5240791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int8 PASSED [1.6453s] [ 62%] 2025-12-04T14:02:33.5240968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float64 PASSED [0.0492s] [ 62%] 2025-12-04T14:02:33.5241132Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool3d_cuda_bfloat16 PASSED [0.0558s] [ 62%] 2025-12-04T14:02:33.5241293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool3d_cuda_float16 PASSED [0.0491s] [ 62%] 2025-12-04T14:02:33.5241463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gaussian_nll_loss_cuda_bfloat16 PASSED [2.6025s] [ 62%] 2025-12-04T14:02:33.5241617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float64 PASSED [1.9521s] [ 62%] 2025-12-04T14:02:33.5241757Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_glu_cuda_float16 PASSED [0.0547s] [ 62%] 2025-12-04T14:02:33.5241909Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_group_norm_cuda_bfloat16 PASSED [0.0738s] [ 62%] 2025-12-04T14:02:33.5242056Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_group_norm_cuda_float32 PASSED [0.0564s] [ 62%] 2025-12-04T14:02:33.5242204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardshrink_cuda_float32 PASSED [0.0074s] [ 62%] 2025-12-04T14:02:33.5242351Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_bfloat16 PASSED [0.0103s] [ 62%] 2025-12-04T14:02:33.5242497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_float16 PASSED [1.6383s] [ 62%] 2025-12-04T14:02:33.5242640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_float64 PASSED [0.0122s] [ 62%] 2025-12-04T14:02:33.5242782Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_int64 PASSED [0.0095s] [ 62%] 2025-12-04T14:02:33.5242941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float64 PASSED [0.0500s] [ 62%] 2025-12-04T14:02:33.5243090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_huber_loss_cuda_bfloat16 PASSED [0.0237s] [ 62%] 2025-12-04T14:02:33.5243243Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_instance_norm_cuda_bfloat16 PASSED [0.0871s] [ 62%] 2025-12-04T14:02:33.5243396Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_instance_norm_cuda_float16 PASSED [0.0846s] [ 62%] 2025-12-04T14:02:33.5243547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_instance_norm_cuda_float32 PASSED [1.6753s] [ 
62%] 2025-12-04T14:02:33.5243707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bicubic_cuda_bfloat16 PASSED [0.9912s] [ 62%] 2025-12-04T14:02:33.5243883Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bilinear_cuda_bfloat16 PASSED [0.1913s] [ 62%] 2025-12-04T14:02:33.5244044Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bilinear_cuda_float16 PASSED [0.1910s] [ 62%] 2025-12-04T14:02:33.5244200Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_linear_cuda_float16 PASSED [0.0706s] [ 62%] 2025-12-04T14:02:33.5244379Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16 PASSED [0.0356s] [ 62%] 2025-12-04T14:02:33.5244537Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest_cuda_uint8 PASSED [0.0250s] [ 62%] 2025-12-04T14:02:33.5244702Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_trilinear_cuda_float16 PASSED [0.3300s] [ 62%] 2025-12-04T14:02:33.5244865Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_trilinear_cuda_float32 PASSED [0.3219s] [ 62%] 2025-12-04T14:02:33.5245021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_kl_div_cuda_bfloat16 PASSED [0.0569s] [ 62%] 2025-12-04T14:02:33.5245163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_l1_loss_cuda_bfloat16 PASSED [0.0147s] [ 62%] 2025-12-04T14:02:33.5245310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_l1_loss_cuda_complex128 PASSED [0.0151s] [ 62%] 2025-12-04T14:02:33.5245465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_l1_loss_cuda_complex64 PASSED [0.0152s] [ 62%] 2025-12-04T14:02:33.5245611Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_layer_norm_cuda_float16 PASSED [1.6727s] [ 62%] 2025-12-04T14:02:33.5245756Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_layer_norm_cuda_float64 PASSED [0.0183s] [ 62%] 2025-12-04T14:02:33.5245904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_leaky_relu_cuda_float16 PASSED [0.0148s] [ 62%] 2025-12-04T14:02:33.5246047Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_linear_cuda_bfloat16 PASSED [0.0320s] [ 62%] 2025-12-04T14:02:33.5246190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_linear_cuda_float64 PASSED [0.0273s] [ 62%] 2025-12-04T14:02:33.5246348Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_local_response_norm_cuda_bfloat16 PASSED [0.0634s] [ 62%] 2025-12-04T14:02:33.5246507Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_local_response_norm_cuda_float16 PASSED [0.0627s] [ 62%] 2025-12-04T14:02:33.5246654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_logsigmoid_cuda_float16 PASSED [0.0071s] [ 62%] 2025-12-04T14:02:33.5246813Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_int16 PASSED [0.0737s] [ 62%] 2025-12-04T14:02:33.5246971Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_int64 PASSED [0.0734s] [ 62%] 2025-12-04T14:02:33.5247115Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool1d_cuda_float32 PASSED [0.9686s] [ 62%] 2025-12-04T14:02:33.5247263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool2d_cuda_bfloat16 PASSED [0.8601s] [ 62%] 2025-12-04T14:02:33.5247409Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool2d_cuda_float32 PASSED [0.8647s] [ 62%] 2025-12-04T14:02:33.5247560Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_cuda_bfloat16 PASSED [0.3233s] [ 62%] 2025-12-04T14:02:33.5247723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float16 PASSED [0.0511s] [ 62%] 2025-12-04T14:02:33.5247874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool2d_cuda_float16 PASSED [0.4558s] [ 62%] 2025-12-04T14:02:33.5248027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool2d_grad_cuda_float16 PASSED [0.0687s] [ 62%] 2025-12-04T14:02:33.5248192Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool2d_grad_cuda_float32 PASSED [0.0693s] [ 62%] 2025-12-04T14:02:33.5248347Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool2d_grad_cuda_float64 PASSED [0.0674s] [ 62%] 2025-12-04T14:02:33.5248496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_cuda_bfloat16 PASSED [0.1564s] [ 62%] 2025-12-04T14:02:33.5248645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_cuda_float32 PASSED [0.1571s] [ 62%] 2025-12-04T14:02:33.5248800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_grad_cuda_bfloat16 PASSED [0.0251s] [ 62%] 2025-12-04T14:02:33.5248952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mish_cuda_bfloat16 PASSED [0.0060s] [ 62%] 2025-12-04T14:02:33.5249091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mish_cuda_float32 PASSED [0.0055s] [ 62%] 2025-12-04T14:02:33.5249247Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multi_margin_loss_cuda_float16 PASSED [0.0361s] [ 62%] 2025-12-04T14:02:33.5249419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float16 PASSED [0.0380s] [ 62%] 2025-12-04T14:02:33.5249582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float32 PASSED [0.0268s] [ 62%] 2025-12-04T14:02:33.5249754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16 PASSED [0.0354s] [ 62%] 2025-12-04T14:02:33.5249924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float64 PASSED [0.0258s] [ 62%] 2025-12-04T14:02:33.5250072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_normalize_cuda_bfloat16 PASSED [1.6778s] [ 62%] 2025-12-04T14:02:33.5250254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_normalize_cuda_complex128 PASSED [0.0319s] [ 62%] 2025-12-04T14:02:33.5250408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_normalize_cuda_complex64 PASSED [0.0284s] [ 62%] 2025-12-04T14:02:33.5250555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_normalize_cuda_float16 PASSED [0.0346s] [ 62%] 2025-12-04T14:02:33.5250695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_one_hot_cuda_int64 PASSED [0.0089s] [ 62%] 2025-12-04T14:02:33.5250843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_int32 PASSED [0.0211s] [ 62%] 2025-12-04T14:02:33.5250988Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_int64 PASSED [0.0210s] [ 62%] 2025-12-04T14:02:33.5251133Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_bool PASSED [0.0293s] [ 62%] 2025-12-04T14:02:33.5251281Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_float16 PASSED [0.0294s] [ 62%] 2025-12-04T14:02:33.5251427Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_int16 PASSED [0.0294s] [ 62%] 2025-12-04T14:02:33.5251579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_complex128 PASSED [0.0087s] [ 62%] 2025-12-04T14:02:33.5251742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_float32 PASSED [0.0085s] [ 62%] 2025-12-04T14:02:33.5251887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_int32 PASSED [0.0085s] [ 62%] 2025-12-04T14:02:33.5252032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_int64 PASSED [0.0085s] [ 62%] 2025-12-04T14:02:33.5252191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_cuda_int16 PASSED [1.6205s] [ 62%] 2025-12-04T14:02:33.5252339Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_cuda_uint8 PASSED [0.0110s] [ 62%] 2025-12-04T14:02:33.5252506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_complex128 PASSED [0.0059s] [ 62%] 2025-12-04T14:02:33.5252671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_float64 PASSED [0.0056s] [ 62%] 2025-12-04T14:02:33.5252849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_int32 PASSED [0.0054s] [ 62%] 2025-12-04T14:02:33.5253009Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_uint8 PASSED [0.0054s] [ 62%] 2025-12-04T14:02:33.5253166Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_bfloat16 PASSED [0.0202s] [ 62%] 2025-12-04T14:02:33.5253339Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_complex64 PASSED [0.0164s] [ 62%] 2025-12-04T14:02:33.5253495Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_float32 PASSED [1.6315s] [ 62%] 2025-12-04T14:02:33.5253649Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_float64 PASSED [1.6516s] [ 62%] 2025-12-04T14:02:33.5253802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_uint8 PASSED [1.6469s] [ 62%] 2025-12-04T14:02:33.5253943Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pdist_cuda_float64 PASSED [0.0094s] [ 62%] 2025-12-04T14:02:33.5254091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_bool PASSED [0.0061s] [ 62%] 2025-12-04T14:02:33.5254245Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_complex128 PASSED [0.0056s] [ 62%] 2025-12-04T14:02:33.5254402Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_complex64 PASSED [0.0061s] [ 62%] 2025-12-04T14:02:33.5254554Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_float16 PASSED [0.0060s] [ 62%] 2025-12-04T14:02:33.5254707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_int16 PASSED [0.0059s] [ 62%] 2025-12-04T14:02:33.5254859Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_int64 PASSED [0.0059s] [ 62%] 2025-12-04T14:02:33.5255010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_int8 PASSED [0.0060s] [ 62%] 2025-12-04T14:02:33.5255167Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_bfloat16 PASSED [0.4806s] [ 62%] 2025-12-04T14:02:33.5255320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_float16 PASSED [0.4773s] [ 62%] 2025-12-04T14:02:33.5255472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_uint8 PASSED [0.3772s] [ 62%] 2025-12-04T14:02:33.5255613Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_prelu_cuda_float16 PASSED [0.0278s] [ 62%] 2025-12-04T14:02:33.5255766Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_bfloat16 PASSED [0.0072s] [ 63%] 2025-12-04T14:02:33.5255903Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_int16 PASSED [0.0063s] [ 63%] 2025-12-04T14:02:33.5256040Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_int32 PASSED [0.0062s] [ 63%] 2025-12-04T14:02:33.5256186Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_int64 PASSED [0.0062s] [ 63%] 2025-12-04T14:02:33.5256324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_uint8 PASSED [0.0061s] [ 63%] 2025-12-04T14:02:33.5256462Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_bfloat16 PASSED [0.0064s] [ 63%] 2025-12-04T14:02:33.5256601Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_float16 PASSED [0.0064s] [ 63%] 
2025-12-04T14:02:33.5256760Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rms_norm_cuda_complex128 PASSED [0.0202s] [ 63%] 2025-12-04T14:02:33.5256903Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rms_norm_cuda_float16 PASSED [0.0083s] [ 63%] 2025-12-04T14:02:33.5257043Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rrelu_cuda_float32 PASSED [1.6689s] [ 63%] 2025-12-04T14:02:33.5257227Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_bfloat16 PASSED [0.1678s] [ 63%] 2025-12-04T14:02:33.5257397Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float16 PASSED [0.1497s] [ 63%] 2025-12-04T14:02:33.5257534Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_selu_cuda_float16 PASSED [0.0063s] [ 63%] 2025-12-04T14:02:33.5257689Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_silu_complex_cuda_complex128 PASSED [0.0039s] [ 63%] 2025-12-04T14:02:33.5257828Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_silu_cuda_bfloat16 PASSED [1.6522s] [ 63%] 2025-12-04T14:02:33.5257966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_silu_cuda_float16 PASSED [0.0060s] [ 63%] 2025-12-04T14:02:33.5258116Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_smooth_l1_loss_cuda_float16 PASSED [0.0204s] [ 63%] 2025-12-04T14:02:33.5258268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_smooth_l1_loss_cuda_float64 PASSED [0.0163s] [ 63%] 2025-12-04T14:02:33.5258422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_soft_margin_loss_cuda_float32 PASSED [0.0096s] [ 63%] 2025-12-04T14:02:33.5258565Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_cuda_float16 PASSED [0.0107s] [ 63%] 2025-12-04T14:02:33.5258722Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_float64 PASSED [0.0088s] [ 63%] 2025-12-04T14:02:33.5258874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_int8 PASSED [1.6361s] [ 63%] 2025-12-04T14:02:33.5259029Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_uint8 PASSED [1.6430s] [ 63%] 2025-12-04T14:02:33.5259173Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softplus_cuda_float16 PASSED [0.0083s] [ 63%] 2025-12-04T14:02:33.5259322Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softshrink_cuda_float32 PASSED [1.6602s] [ 63%] 2025-12-04T14:02:33.5259468Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softshrink_cuda_float64 PASSED [0.0114s] [ 63%] 2025-12-04T14:02:33.5259631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_bfloat16 PASSED [0.0086s] [ 63%] 2025-12-04T14:02:33.5259778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_complex64 PASSED [0.0075s] [ 63%] 2025-12-04T14:02:33.5259923Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_float64 PASSED [0.0069s] [ 63%] 2025-12-04T14:02:33.5260074Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_uint8 PASSED [1.6626s] [ 63%] 2025-12-04T14:02:33.5260248Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_int16 PASSED [0.0091s] [ 63%] 2025-12-04T14:02:33.5260393Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_int8 PASSED [1.6485s] [ 
63%] 2025-12-04T14:02:33.5260540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_bfloat16 PASSED [0.0095s] [ 63%] 2025-12-04T14:02:33.5260699Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_float64 PASSED [0.0062s] [ 63%] 2025-12-04T14:02:33.5260840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_int16 PASSED [0.0058s] [ 63%] 2025-12-04T14:02:33.5260984Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_uint8 PASSED [0.0057s] [ 63%] 2025-12-04T14:02:33.5261145Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_complex128 PASSED [0.0490s] [ 63%] 2025-12-04T14:02:33.5261318Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_float16 PASSED [0.0629s] [ 63%] 2025-12-04T14:02:33.5261479Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_float64 PASSED [0.0462s] [ 63%] 2025-12-04T14:02:33.5261636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_int8 PASSED [0.0505s] [ 63%] 2025-12-04T14:02:33.5261816Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16 PASSED [0.0634s] [ 63%] 2025-12-04T14:02:33.5261996Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64 PASSED [0.0479s] [ 63%] 2025-12-04T14:02:33.5262173Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64 PASSED [0.0506s] [ 63%] 2025-12-04T14:02:33.5262344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int8 PASSED [0.0506s] [ 63%] 
2025-12-04T14:02:33.5262483Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_unfold_cuda_bool PASSED [0.3136s] [ 63%] 2025-12-04T14:02:33.5262630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_unfold_cuda_complex128 PASSED [0.3067s] [ 63%] 2025-12-04T14:02:33.5262788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_bilinear_cuda_float32 PASSED [0.0324s] [ 63%] 2025-12-04T14:02:33.5262943Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_bilinear_cuda_float64 PASSED [0.0318s] [ 63%] 2025-12-04T14:02:33.5263098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_nearest_cuda_float16 PASSED [0.0203s] [ 63%] 2025-12-04T14:02:33.5263252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_nearest_cuda_float64 PASSED [0.0176s] [ 63%] 2025-12-04T14:02:33.5263376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_bool PASSED [0.0147s] [ 63%] 2025-12-04T14:02:33.5263520Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_complex128 PASSED [0.0144s] [ 63%] 2025-12-04T14:02:33.5263649Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_complex32 PASSED [0.0143s] [ 63%] 2025-12-04T14:02:33.5263776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_float64 PASSED [0.0143s] [ 63%] 2025-12-04T14:02:33.5263898Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_int32 PASSED [0.0151s] [ 63%] 2025-12-04T14:02:33.5264032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_int8 PASSED [0.0151s] [ 63%] 2025-12-04T14:02:33.5264194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_bfloat16 SKIPPED [0.0007s] (Only runs on cpu) [ 63%] 2025-12-04T14:02:33.5264349Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_bool SKIPPED [0.0006s] (Only runs on cpu) [ 63%] 2025-12-04T14:02:33.5264510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_complex32 SKIPPED [0.0006s] (Only runs on cpu) [ 63%] 2025-12-04T14:02:33.5264680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_float32 SKIPPED [0.0006s] (Only runs on cpu) [ 63%] 2025-12-04T14:02:33.5264838Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_int16 SKIPPED [0.0007s] (Only runs on cpu) [ 63%] 2025-12-04T14:02:33.5264995Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_int32 SKIPPED [0.0005s] (Only runs on cpu) [ 63%] 2025-12-04T14:02:33.5265128Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_bfloat16 PASSED [0.0537s] [ 63%] 2025-12-04T14:02:33.5265250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_float32 PASSED [0.0416s] [ 63%] 2025-12-04T14:02:33.5265371Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_float64 PASSED [1.6955s] [ 63%] 2025-12-04T14:02:33.5265499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_float16 PASSED [0.0090s] [ 63%] 2025-12-04T14:02:33.5265628Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_float64 PASSED [0.0054s] [ 63%] 2025-12-04T14:02:33.5265755Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_bfloat16 PASSED [0.0073s] [ 63%] 2025-12-04T14:02:33.5265883Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_float32 PASSED [0.0051s] [ 63%] 2025-12-04T14:02:33.5266014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_nuc_cuda_complex64 PASSED [0.0060s] [ 63%] 2025-12-04T14:02:33.5266141Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_cuda_bfloat16 PASSED [0.0112s] [ 63%] 2025-12-04T14:02:33.5266265Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_cuda_float16 PASSED [0.0108s] [ 63%] 2025-12-04T14:02:33.5266389Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_cuda_float64 PASSED [0.0092s] [ 63%] 2025-12-04T14:02:33.5266513Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_bfloat16 PASSED [0.0029s] [ 63%] 2025-12-04T14:02:33.5266631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_bool PASSED [1.6331s] [ 63%] 2025-12-04T14:02:33.5266753Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_complex32 PASSED [0.0048s] [ 63%] 2025-12-04T14:02:33.5266876Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_complex64 PASSED [1.6595s] [ 63%] 2025-12-04T14:02:33.5266997Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_float16 PASSED [0.0051s] [ 63%] 2025-12-04T14:02:33.5267125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_bfloat16 PASSED [0.0080s] [ 63%] 2025-12-04T14:02:33.5267248Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_bool PASSED [0.0069s] [ 63%] 2025-12-04T14:02:33.5267387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_complex64 PASSED [0.0067s] [ 63%] 2025-12-04T14:02:33.5267514Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_int16 PASSED [0.0067s] [ 63%] 2025-12-04T14:02:33.5267634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ormqr_cuda_float64 PASSED [0.1155s] [ 63%] 2025-12-04T14:02:33.5267772Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_float16 PASSED [0.0042s] [ 63%] 2025-12-04T14:02:33.5267893Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_int32 PASSED [1.6507s] [ 63%] 2025-12-04T14:02:33.5268014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_int64 PASSED [0.0058s] [ 63%] 2025-12-04T14:02:33.5268135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_int8 PASSED [0.0040s] [ 63%] 2025-12-04T14:02:33.5268254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_uint8 PASSED [1.6587s] [ 63%] 2025-12-04T14:02:33.5268395Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pca_lowrank_cuda_float64 PASSED [0.2565s] [ 63%] 2025-12-04T14:02:33.5268528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_bfloat16 PASSED [0.0049s] [ 63%] 2025-12-04T14:02:33.5268665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_complex32 PASSED [1.6186s] [ 63%] 2025-12-04T14:02:33.5268799Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_complex64 PASSED [0.0073s] [ 63%] 2025-12-04T14:02:33.5268941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_float16 PASSED [0.0050s] [ 63%] 2025-12-04T14:02:33.5269077Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_float64 PASSED [0.0046s] [ 63%] 2025-12-04T14:02:33.5269205Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_int8 PASSED [0.0046s] [ 63%] 2025-12-04T14:02:33.5269327Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_bool PASSED [1.6250s] [ 63%] 2025-12-04T14:02:33.5269456Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_complex32 PASSED [0.0058s] [ 63%] 2025-12-04T14:02:33.5269581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_complex64 PASSED [0.0038s] [ 63%] 2025-12-04T14:02:33.5269708Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_float16 PASSED [1.6527s] [ 63%] 2025-12-04T14:02:33.5269833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_float32 PASSED [0.0055s] [ 63%] 2025-12-04T14:02:33.5269953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_int16 PASSED [0.0037s] [ 63%] 2025-12-04T14:02:33.5270074Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_int32 PASSED [1.6294s] [ 63%] 2025-12-04T14:02:33.5270240Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pinverse_cuda_complex128 PASSED [0.0308s] [ 63%] 2025-12-04T14:02:33.5270365Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polar_cuda_float32 PASSED [0.0178s] [ 63%] 2025-12-04T14:02:33.5270485Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polar_cuda_float64 PASSED [0.0171s] [ 63%] 2025-12-04T14:02:33.5270634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_float32 PASSED [0.0076s] [ 63%] 2025-12-04T14:02:33.5270780Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_int64 PASSED [0.0075s] [ 63%] 2025-12-04T14:02:33.5270924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_bool PASSED [1.6375s] [ 63%] 2025-12-04T14:02:33.5271068Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_int32 PASSED [0.0103s] [ 63%] 2025-12-04T14:02:33.5271226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_int8 PASSED [0.0080s] [ 63%] 2025-12-04T14:02:33.5271375Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_bfloat16 PASSED [0.0078s] [ 63%] 2025-12-04T14:02:33.5271516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_bool PASSED 
[0.0075s] [ 63%] 2025-12-04T14:02:33.5271677Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_float64 PASSED [0.0075s] [ 63%] 2025-12-04T14:02:33.5271821Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_int16 PASSED [1.6336s] [ 63%] 2025-12-04T14:02:33.5271969Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_bfloat16 PASSED [0.0098s] [ 63%] 2025-12-04T14:02:33.5272112Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_int16 PASSED [0.0079s] [ 63%] 2025-12-04T14:02:33.5272269Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_int64 PASSED [0.0077s] [ 63%] 2025-12-04T14:02:33.5272413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_int16 PASSED [0.0075s] [ 63%] 2025-12-04T14:02:33.5272559Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_int8 PASSED [0.0075s] [ 63%] 2025-12-04T14:02:33.5272688Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_float32 PASSED [1.6512s] [ 63%] 2025-12-04T14:02:33.5272824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_int16 PASSED [0.0035s] [ 63%] 2025-12-04T14:02:33.5272947Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_int32 PASSED [1.6436s] [ 63%] 2025-12-04T14:02:33.5273069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_complex64 PASSED [0.0125s] [ 63%] 2025-12-04T14:02:33.5273190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_float32 PASSED [0.0101s] [ 63%] 2025-12-04T14:02:33.5273307Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_int16 PASSED [0.0099s] [ 63%] 2025-12-04T14:02:33.5273424Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_int64 PASSED [0.0097s] [ 63%] 2025-12-04T14:02:33.5273544Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_float64 PASSED [1.6718s] [ 63%] 2025-12-04T14:02:33.5273663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_int32 PASSED [0.0361s] [ 63%] 2025-12-04T14:02:33.5273782Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_uint8 PASSED [0.0341s] [ 63%] 2025-12-04T14:02:33.5273905Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_complex128 PASSED [0.0232s] [ 63%] 2025-12-04T14:02:33.5274024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_float16 PASSED [0.0228s] [ 63%] 2025-12-04T14:02:33.5274141Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_int16 PASSED [0.0227s] [ 63%] 2025-12-04T14:02:33.5274256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_int32 PASSED [0.0224s] [ 63%] 2025-12-04T14:02:33.5274372Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_int8 PASSED [0.0224s] [ 63%] 2025-12-04T14:02:33.5274494Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_qr_cuda_complex64 PASSED [0.0251s] [ 64%] 2025-12-04T14:02:33.5274625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_quantile_cuda_float32 PASSED [0.4343s] [ 64%] 2025-12-04T14:02:33.5274751Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_bfloat16 PASSED [1.6277s] [ 64%] 2025-12-04T14:02:33.5274872Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_bool PASSED [0.0056s] [ 64%] 2025-12-04T14:02:33.5275004Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_float32 PASSED [0.0036s] [ 64%] 2025-12-04T14:02:33.5275132Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rand_like_cuda_float64 PASSED 
[0.0113s] [ 64%] 2025-12-04T14:02:33.5275256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_float64 PASSED [0.0106s] [ 64%] 2025-12-04T14:02:33.5275386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_int16 PASSED [0.0103s] [ 64%] 2025-12-04T14:02:33.5275507Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_int8 PASSED [0.0103s] [ 64%] 2025-12-04T14:02:33.5275637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_like_cuda_float16 PASSED [0.0140s] [ 64%] 2025-12-04T14:02:33.5275766Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_like_cuda_uint8 PASSED [0.0134s] [ 64%] 2025-12-04T14:02:33.5275888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_bfloat16 PASSED [1.6530s] [ 64%] 2025-12-04T14:02:33.5276025Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_complex128 PASSED [0.0062s] [ 64%] 2025-12-04T14:02:33.5276149Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_complex32 PASSED [0.0049s] [ 64%] 2025-12-04T14:02:33.5276274Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_complex64 PASSED [0.0042s] [ 64%] 2025-12-04T14:02:33.5276395Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_float16 PASSED [0.0045s] [ 64%] 2025-12-04T14:02:33.5276539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_complex64 PASSED [0.0111s] [ 64%] 2025-12-04T14:02:33.5276668Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_float32 PASSED [0.0107s] [ 64%] 2025-12-04T14:02:33.5276800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_float64 PASSED [1.6361s] [ 64%] 2025-12-04T14:02:33.5276925Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_complex32 PASSED [0.0062s] [ 64%] 
2025-12-04T14:02:33.5277044Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_float16 PASSED [0.0042s] [ 64%] 2025-12-04T14:02:33.5277164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_int16 PASSED [1.6133s] [ 64%] 2025-12-04T14:02:33.5277284Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_int64 PASSED [0.0063s] [ 64%] 2025-12-04T14:02:33.5277405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_float32 PASSED [1.6352s] [ 64%] 2025-12-04T14:02:33.5277524Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_float64 PASSED [0.0038s] [ 64%] 2025-12-04T14:02:33.5277643Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_int64 PASSED [1.6494s] [ 64%] 2025-12-04T14:02:33.5277768Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_bool PASSED [0.0069s] [ 64%] 2025-12-04T14:02:33.5277904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_complex128 PASSED [0.0044s] [ 64%] 2025-12-04T14:02:33.5278035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_complex64 PASSED [1.6576s] [ 64%] 2025-12-04T14:02:33.5278162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_int16 PASSED [0.0066s] [ 64%] 2025-12-04T14:02:33.5278287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_uint8 PASSED [0.0044s] [ 64%] 2025-12-04T14:02:33.5278414Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_remainder_cuda_float32 PASSED [0.0107s] [ 64%] 2025-12-04T14:02:33.5278535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_remainder_cuda_int8 PASSED [0.0103s] [ 64%] 2025-12-04T14:02:33.5278668Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_bfloat16 PASSED [0.0138s] [ 64%] 2025-12-04T14:02:33.5278799Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_complex128 PASSED [0.0108s] [ 64%] 2025-12-04T14:02:33.5278923Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_float16 PASSED [0.0298s] [ 64%] 2025-12-04T14:02:33.5279059Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_int8 PASSED [0.0291s] [ 64%] 2025-12-04T14:02:33.5279198Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_float64 PASSED [1.6387s] [ 64%] 2025-12-04T14:02:33.5279330Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_complex32 PASSED [0.0061s] [ 64%] 2025-12-04T14:02:33.5279458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_float64 PASSED [0.0042s] [ 64%] 2025-12-04T14:02:33.5279584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_int32 PASSED [1.6218s] [ 64%] 2025-12-04T14:02:33.5279719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_int8 PASSED [0.0060s] [ 64%] 2025-12-04T14:02:33.5279840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_bool PASSED [0.0051s] [ 64%] 2025-12-04T14:02:33.5279966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_complex64 PASSED [1.6558s] [ 64%] 2025-12-04T14:02:33.5280122Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_int32 PASSED [0.0070s] [ 64%] 2025-12-04T14:02:33.5280256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_int8 PASSED [0.0050s] [ 64%] 2025-12-04T14:02:33.5280381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_bfloat16 PASSED [0.0049s] [ 64%] 2025-12-04T14:02:33.5280506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_float16 PASSED [0.0045s] [ 64%] 2025-12-04T14:02:33.5280628Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_float64 PASSED [0.0045s] [ 64%] 2025-12-04T14:02:33.5280750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_uint8 PASSED [0.0044s] [ 64%] 2025-12-04T14:02:33.5280879Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_float32 PASSED [0.0046s] [ 64%] 2025-12-04T14:02:33.5281009Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_float64 PASSED [0.0046s] [ 64%] 2025-12-04T14:02:33.5281134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_int32 PASSED [0.0046s] [ 64%] 2025-12-04T14:02:33.5281259Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_int64 PASSED [0.0045s] [ 64%] 2025-12-04T14:02:33.5281390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_bfloat16 PASSED [1.6226s] [ 64%] 2025-12-04T14:02:33.5281525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_complex64 PASSED [0.0036s] [ 64%] 2025-12-04T14:02:33.5281656Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_float64 PASSED [1.6139s] [ 64%] 2025-12-04T14:02:33.5281781Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_bool PASSED [0.0035s] [ 64%] 2025-12-04T14:02:33.5281914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_complex32 PASSED [1.6444s] [ 64%] 2025-12-04T14:02:33.5282046Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_complex64 PASSED [0.0037s] [ 64%] 2025-12-04T14:02:33.5282165Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_bool PASSED [0.0173s] [ 64%] 2025-12-04T14:02:33.5282288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_complex128 PASSED [0.0158s] [ 64%] 2025-12-04T14:02:33.5282426Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_int16 PASSED [0.0154s] [ 64%] 2025-12-04T14:02:33.5282544Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_int32 PASSED [0.0153s] [ 64%] 2025-12-04T14:02:33.5282669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rot90_cuda_complex64 PASSED [0.0235s] [ 64%] 2025-12-04T14:02:33.5282790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_bfloat16 PASSED [0.0031s] [ 64%] 2025-12-04T14:02:33.5282923Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_float32 PASSED [1.6338s] [ 64%] 2025-12-04T14:02:33.5283045Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_float64 PASSED [0.0049s] [ 64%] 2025-12-04T14:02:33.5283168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_int64 PASSED [1.6204s] [ 64%] 2025-12-04T14:02:33.5283287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_uint8 PASSED [0.0050s] [ 64%] 2025-12-04T14:02:33.5283426Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_0_cuda_bfloat16 PASSED [0.0049s] [ 64%] 2025-12-04T14:02:33.5283702Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_neg_3_cuda_float64 PASSED [1.6306s] [ 64%] 2025-12-04T14:02:33.5283827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_complex32 PASSED [0.0067s] [ 64%] 2025-12-04T14:02:33.5283948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_float64 PASSED [0.0043s] [ 64%] 2025-12-04T14:02:33.5284080Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_int16 PASSED [1.6769s] [ 64%] 2025-12-04T14:02:33.5284199Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_int32 PASSED [0.0065s] [ 64%] 2025-12-04T14:02:33.5284318Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_bfloat16 PASSED [0.0168s] [ 64%] 2025-12-04T14:02:33.5284437Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_int32 PASSED [0.0118s] [ 64%] 2025-12-04T14:02:33.5284553Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_int64 PASSED [0.0117s] [ 64%] 2025-12-04T14:02:33.5284687Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_bfloat16 PASSED [1.6469s] [ 64%] 2025-12-04T14:02:33.5284819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_float16 PASSED [0.0055s] [ 64%] 2025-12-04T14:02:33.5284951Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_float32 PASSED [0.0038s] [ 64%] 2025-12-04T14:02:33.5285082Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_float64 PASSED [1.6401s] [ 64%] 2025-12-04T14:02:33.5285211Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_int16 PASSED [0.0054s] [ 64%] 2025-12-04T14:02:33.5285342Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_uint8 PASSED [0.0035s] [ 64%] 2025-12-04T14:02:33.5285480Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_complex128 PASSED [0.0086s] [ 64%] 2025-12-04T14:02:33.5285609Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_float32 PASSED [0.0081s] [ 64%] 2025-12-04T14:02:33.5285737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_int16 PASSED [1.6537s] [ 64%] 2025-12-04T14:02:33.5285863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_int32 PASSED [0.0107s] [ 64%] 2025-12-04T14:02:33.5285988Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_int64 PASSED [0.0086s] [ 64%] 2025-12-04T14:02:33.5286115Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_int8 PASSED [0.0088s] [ 64%] 2025-12-04T14:02:33.5286240Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_uint8 PASSED [0.0091s] [ 64%] 2025-12-04T14:02:33.5286376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_float64 PASSED [1.6857s] [ 64%] 2025-12-04T14:02:33.5286497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_int32 PASSED [0.0176s] [ 64%] 2025-12-04T14:02:33.5286617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_int8 PASSED [0.0154s] [ 64%] 2025-12-04T14:02:33.5286764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_int32 PASSED [0.0186s] [ 64%] 2025-12-04T14:02:33.5286904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_int64 PASSED [0.0184s] [ 64%] 2025-12-04T14:02:33.5287041Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_int8 PASSED [0.0182s] [ 64%] 2025-12-04T14:02:33.5287178Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_uint8 PASSED [0.0183s] [ 64%] 2025-12-04T14:02:33.5287320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_bfloat16 PASSED [0.0183s] [ 64%] 2025-12-04T14:02:33.5287471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_float16 PASSED [0.0183s] [ 64%] 2025-12-04T14:02:33.5287610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_int16 PASSED [0.0201s] [ 64%] 2025-12-04T14:02:33.5287747Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_int64 PASSED [0.0185s] [ 64%] 2025-12-04T14:02:33.5287899Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_mean_cuda_bfloat16 PASSED 
[0.0200s] [ 64%] 2025-12-04T14:02:33.5288035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_mean_cuda_int64 PASSED [0.0197s] [ 64%] 2025-12-04T14:02:33.5288174Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_int16 PASSED [0.0181s] [ 64%] 2025-12-04T14:02:33.5288312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_int64 PASSED [0.0181s] [ 64%] 2025-12-04T14:02:33.5288450Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_int8 PASSED [0.0181s] [ 64%] 2025-12-04T14:02:33.5288586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_bool PASSED [0.0181s] [ 64%] 2025-12-04T14:02:33.5288725Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_float32 PASSED [0.0181s] [ 64%] 2025-12-04T14:02:33.5288861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_int64 PASSED [0.0181s] [ 64%] 2025-12-04T14:02:33.5288993Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_int16 PASSED [0.2042s] [ 64%] 2025-12-04T14:02:33.5289122Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_int64 PASSED [0.1999s] [ 64%] 2025-12-04T14:02:33.5289253Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_uint8 PASSED [0.2002s] [ 64%] 2025-12-04T14:02:33.5289380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_complex128 PASSED [1.6580s] [ 64%] 2025-12-04T14:02:33.5289505Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_complex32 PASSED [0.0057s] [ 64%] 2025-12-04T14:02:33.5289630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_float16 PASSED [0.0041s] [ 64%] 2025-12-04T14:02:33.5289761Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_bool PASSED [0.0079s] [ 64%] 2025-12-04T14:02:33.5289892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_int16 PASSED [0.0072s] [ 64%] 2025-12-04T14:02:33.5290021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_int8 PASSED [0.0072s] [ 64%] 2025-12-04T14:02:33.5290201Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_complex128 PASSED [0.0037s] [ 64%] 2025-12-04T14:02:33.5290323Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_complex64 PASSED [1.6177s] [ 64%] 2025-12-04T14:02:33.5290444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_float16 PASSED [0.0055s] [ 64%] 2025-12-04T14:02:33.5290572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_uint8 PASSED [0.0034s] [ 64%] 2025-12-04T14:02:33.5290691Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_bool PASSED [1.6564s] [ 64%] 2025-12-04T14:02:33.5290815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_complex64 PASSED [0.0043s] [ 64%] 2025-12-04T14:02:33.5290933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_int8 PASSED [1.6305s] [ 64%] 2025-12-04T14:02:33.5291060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_bfloat16 PASSED [0.0058s] [ 64%] 2025-12-04T14:02:33.5291199Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_complex128 PASSED [0.0057s] [ 64%] 2025-12-04T14:02:33.5291327Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_complex64 PASSED [1.8550s] [ 64%] 2025-12-04T14:02:33.5291450Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_float16 PASSED [0.0063s] [ 64%] 2025-12-04T14:02:33.5291571Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_int16 PASSED [0.0043s] [ 64%] 2025-12-04T14:02:33.5291704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_bfloat16 PASSED [0.0036s] [ 64%] 2025-12-04T14:02:33.5291827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_float16 PASSED [1.6159s] [ 64%] 2025-12-04T14:02:33.5291947Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_float32 PASSED [0.0050s] [ 65%] 2025-12-04T14:02:33.5292066Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_int8 PASSED [1.6575s] [ 65%] 2025-12-04T14:02:33.5292213Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_bartlett_cuda_float32 PASSED [0.0165s] [ 65%] 2025-12-04T14:02:33.5292359Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_blackman_cuda_float32 PASSED [1.6595s] [ 65%] 2025-12-04T14:02:33.5292504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_cosine_cuda_float32 PASSED [0.0129s] [ 65%] 2025-12-04T14:02:33.5292651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_gaussian_cuda_float32 PASSED [0.0199s] [ 65%] 2025-12-04T14:02:33.5292807Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_general_hamming_cuda_float32 PASSED [0.0326s] [ 65%] 2025-12-04T14:02:33.5292961Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_general_hamming_cuda_float64 PASSED [0.0319s] [ 65%] 2025-12-04T14:02:33.5293102Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_hann_cuda_float64 PASSED [0.0317s] [ 65%] 2025-12-04T14:02:33.5293244Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_kaiser_cuda_float64 PASSED [0.0578s] [ 65%] 2025-12-04T14:02:33.5293367Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_int16 PASSED [1.6296s] [ 65%] 2025-12-04T14:02:33.5293486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_int8 PASSED [0.0054s] [ 65%] 2025-12-04T14:02:33.5293609Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_uint8 PASSED [0.0037s] [ 65%] 2025-12-04T14:02:33.5293732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_complex128 PASSED [1.6146s] [ 65%] 2025-12-04T14:02:33.5293852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_complex32 PASSED [0.2175s] [ 65%] 2025-12-04T14:02:33.5293983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_complex64 PASSED [0.0035s] [ 65%] 2025-12-04T14:02:33.5294103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_uint8 PASSED [1.6438s] [ 65%] 2025-12-04T14:02:33.5294225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_complex64 PASSED [0.0081s] [ 65%] 2025-12-04T14:02:33.5294355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_int16 PASSED [0.0060s] [ 65%] 2025-12-04T14:02:33.5294472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_int64 PASSED [0.0056s] [ 65%] 2025-12-04T14:02:33.5294588Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_int8 PASSED [0.0055s] [ 65%] 2025-12-04T14:02:33.5294707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_float32 PASSED [1.6322s] [ 65%] 2025-12-04T14:02:33.5294826Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_float64 PASSED [0.0048s] [ 65%] 2025-12-04T14:02:33.5294972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_complex128 PASSED [1.6345s] [ 65%] 2025-12-04T14:02:33.5295091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_float32 PASSED 
[0.0057s] [ 65%] 2025-12-04T14:02:33.5295211Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_int16 PASSED [1.6798s] [ 65%] 2025-12-04T14:02:33.5295328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_int32 PASSED [0.0059s] [ 65%] 2025-12-04T14:02:33.5295469Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_uint8 PASSED [1.6526s] [ 65%] 2025-12-04T14:02:33.5295602Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_bfloat16 PASSED [0.0228s] [ 65%] 2025-12-04T14:02:33.5295737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_float64 PASSED [0.0190s] [ 65%] 2025-12-04T14:02:33.5295861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_cuda_float64 PASSED [0.0061s] [ 65%] 2025-12-04T14:02:33.5295996Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_bool PASSED [0.0068s] [ 65%] 2025-12-04T14:02:33.5296140Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_complex128 PASSED [0.0068s] [ 65%] 2025-12-04T14:02:33.5296283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_complex64 PASSED [0.0068s] [ 65%] 2025-12-04T14:02:33.5296423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_float32 PASSED [0.0067s] [ 65%] 2025-12-04T14:02:33.5296558Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_int32 PASSED [0.0067s] [ 65%] 2025-12-04T14:02:33.5296680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sort_cuda_float64 PASSED [0.0153s] [ 65%] 2025-12-04T14:02:33.5296798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sort_cuda_uint8 PASSED [0.0150s] [ 65%] 2025-12-04T14:02:33.5296962Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_sampled_addmm_cuda_complex128 SKIPPED [0.0001s] (Skipped!) [ 65%] 2025-12-04T14:02:33.5297120Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_sampled_addmm_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 65%] 2025-12-04T14:02:33.5297251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_airy_ai_cuda_bool PASSED [1.6487s] [ 65%] 2025-12-04T14:02:33.5297386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_airy_ai_cuda_float64 PASSED [0.0079s] [ 65%] 2025-12-04T14:02:33.5297520Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_bool PASSED [0.0048s] [ 65%] 2025-12-04T14:02:33.5297664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_int32 PASSED [1.6274s] [ 65%] 2025-12-04T14:02:33.5297799Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_int64 PASSED [0.0065s] [ 65%] 2025-12-04T14:02:33.5297930Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_int8 PASSED [0.0045s] [ 65%] 2025-12-04T14:02:33.5298074Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j1_cuda_int8 PASSED [1.6289s] [ 65%] 2025-12-04T14:02:33.5298207Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_bool PASSED [0.0079s] [ 65%] 2025-12-04T14:02:33.5298340Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y1_cuda_int8 PASSED [0.0044s] [ 65%] 2025-12-04T14:02:33.5298493Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_bool PASSED [0.0121s] [ 65%] 2025-12-04T14:02:33.5298651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_float32 PASSED [0.0109s] [ 65%] 2025-12-04T14:02:33.5298813Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_int8 PASSED [0.0115s] [ 65%] 2025-12-04T14:02:33.5298969Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_float32 PASSED [0.0105s] [ 65%] 2025-12-04T14:02:33.5299123Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_int64 PASSED [0.0116s] [ 65%] 2025-12-04T14:02:33.5299287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_float64 PASSED [0.0104s] [ 65%] 2025-12-04T14:02:33.5299440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_int32 PASSED [0.0115s] [ 65%] 2025-12-04T14:02:33.5299590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_int8 PASSED [0.0114s] [ 65%] 2025-12-04T14:02:33.5299744Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_uint8 PASSED [0.0114s] [ 65%] 2025-12-04T14:02:33.5299874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_int16 PASSED [0.0051s] [ 65%] 2025-12-04T14:02:33.5300003Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_int32 PASSED [0.0048s] [ 65%] 2025-12-04T14:02:33.5300173Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_int64 PASSED [0.0049s] [ 65%] 2025-12-04T14:02:33.5300303Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_int64 PASSED [0.0041s] [ 65%] 2025-12-04T14:02:33.5300454Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_h_cuda_int64 PASSED [0.0115s] [ 65%] 2025-12-04T14:02:33.5300610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_float64 PASSED [0.2876s] [ 65%] 
2025-12-04T14:02:33.5300763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_uint8 PASSED [0.0125s] [ 65%] 2025-12-04T14:02:33.5300892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_float64 PASSED [1.6553s] [ 65%] 2025-12-04T14:02:33.5301021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_int16 PASSED [0.0058s] [ 65%] 2025-12-04T14:02:33.5301147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_int32 PASSED [0.0039s] [ 65%] 2025-12-04T14:02:33.5301274Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_int64 PASSED [1.6476s] [ 65%] 2025-12-04T14:02:33.5301398Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_int8 PASSED [0.0062s] [ 65%] 2025-12-04T14:02:33.5301544Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1_cuda_bfloat16 PASSED [0.0055s] [ 65%] 2025-12-04T14:02:33.5301675Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_float32 PASSED [1.6585s] [ 65%] 2025-12-04T14:02:33.5301800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_int32 PASSED [0.0060s] [ 65%] 2025-12-04T14:02:33.5301926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_int8 PASSED [0.0040s] [ 65%] 2025-12-04T14:02:33.5302088Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_bool PASSED [0.4007s] [ 65%] 2025-12-04T14:02:33.5302240Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_int16 PASSED [0.0121s] [ 65%] 2025-12-04T14:02:33.5302390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_int64 PASSED [0.0116s] [ 65%] 2025-12-04T14:02:33.5302541Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_uint8 PASSED [0.0115s] [ 65%] 2025-12-04T14:02:33.5302703Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_bool PASSED [0.0128s] [ 65%] 2025-12-04T14:02:33.5302855Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_int32 PASSED [0.0116s] [ 65%] 2025-12-04T14:02:33.5303005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_int8 PASSED [0.0115s] [ 65%] 2025-12-04T14:02:33.5303171Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_uint8 PASSED [0.0114s] [ 65%] 2025-12-04T14:02:33.5303307Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_int32 PASSED [0.0075s] [ 65%] 2025-12-04T14:02:33.5303440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_uint8 PASSED [0.0071s] [ 65%] 2025-12-04T14:02:33.5303591Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i0_cuda_float32 PASSED [1.6598s] [ 65%] 2025-12-04T14:02:33.5303739Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i1_cuda_int32 PASSED [0.0057s] [ 65%] 2025-12-04T14:02:33.5303886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i1_cuda_uint8 PASSED [0.0042s] [ 65%] 2025-12-04T14:02:33.5304034Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k0_cuda_float64 PASSED [1.8143s] [ 65%] 2025-12-04T14:02:33.5304180Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_int8 PASSED [0.0063s] [ 65%] 2025-12-04T14:02:33.5304326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_uint8 PASSED [0.0044s] [ 65%] 
2025-12-04T14:02:33.5304456Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtri_cuda_bool PASSED [0.0042s] [ 65%] 2025-12-04T14:02:33.5304587Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtri_cuda_uint8 PASSED [1.6252s] [ 65%] 2025-12-04T14:02:33.5304755Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int16 PASSED [0.0102s] [ 65%] 2025-12-04T14:02:33.5304925Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int32 PASSED [0.0079s] [ 65%] 2025-12-04T14:02:33.5305087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_float64 PASSED [0.1847s] [ 65%] 2025-12-04T14:02:33.5305242Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_int64 PASSED [1.6694s] [ 65%] 2025-12-04T14:02:33.5305394Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_int8 PASSED [0.0065s] [ 65%] 2025-12-04T14:02:33.5305561Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_uint8 PASSED [0.0043s] [ 65%] 2025-12-04T14:02:33.5305720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k1_cuda_float64 PASSED [1.6429s] [ 65%] 2025-12-04T14:02:33.5305873Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k1_cuda_int32 PASSED [0.0063s] [ 65%] 2025-12-04T14:02:33.5306051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float64 PASSED [0.3647s] [ 65%] 2025-12-04T14:02:33.5306216Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int16 PASSED [0.0124s] [ 65%] 2025-12-04T14:02:33.5306376Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int8 PASSED [0.0116s] [ 65%] 2025-12-04T14:02:33.5306541Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int32 PASSED [0.0128s] [ 65%] 2025-12-04T14:02:33.5306714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int64 PASSED [0.0115s] [ 65%] 2025-12-04T14:02:33.5306874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int8 PASSED [0.0114s] [ 65%] 2025-12-04T14:02:33.5307038Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_uint8 PASSED [0.0114s] [ 65%] 2025-12-04T14:02:33.5307210Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int32 PASSED [0.0115s] [ 65%] 2025-12-04T14:02:33.5307372Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int8 PASSED [0.0114s] [ 65%] 2025-12-04T14:02:33.5307533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_bool PASSED [0.5358s] [ 65%] 2025-12-04T14:02:33.5307699Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float32 PASSED [0.0096s] [ 65%] 2025-12-04T14:02:33.5307850Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_spherical_bessel_j0_cuda_float32 PASSED [0.0041s] [ 65%] 2025-12-04T14:02:33.5308002Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_spherical_bessel_j0_cuda_float64 PASSED [0.0039s] [ 65%] 2025-12-04T14:02:33.5308150Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_spherical_bessel_j0_cuda_int16 PASSED [1.6528s] [ 65%] 2025-12-04T14:02:33.5308299Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_spherical_bessel_j0_cuda_int64 PASSED [0.0068s] [ 65%] 2025-12-04T14:02:33.5308435Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_float32 PASSED [0.0160s] [ 65%] 2025-12-04T14:02:33.5308569Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_int8 PASSED [0.0177s] [ 65%] 2025-12-04T14:02:33.5308703Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_uint8 PASSED [0.0174s] [ 65%] 2025-12-04T14:02:33.5308830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_bool PASSED [0.0120s] [ 65%] 2025-12-04T14:02:33.5308962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_float32 PASSED [0.0095s] [ 65%] 2025-12-04T14:02:33.5309088Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_int8 PASSED [0.0119s] [ 65%] 2025-12-04T14:02:33.5309215Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_complex128 PASSED [1.6315s] [ 65%] 2025-12-04T14:02:33.5309340Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_complex32 PASSED [0.0049s] [ 65%] 2025-12-04T14:02:33.5309475Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_complex64 PASSED [1.6408s] [ 65%] 2025-12-04T14:02:33.5309598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_float32 PASSED [0.0048s] [ 65%] 2025-12-04T14:02:33.5309719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_int8 PASSED [1.6464s] [ 65%] 2025-12-04T14:02:33.5309864Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_bfloat16 PASSED [0.0054s] [ 65%] 2025-12-04T14:02:33.5310003Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_complex64 PASSED [1.6530s] [ 65%] 
2025-12-04T14:02:33.5310179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_float32 PASSED [0.0053s] [ 65%] 2025-12-04T14:02:33.5310310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_int16 PASSED [1.6503s] [ 65%] 2025-12-04T14:02:33.5310443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_int32 PASSED [0.0056s] [ 65%] 2025-12-04T14:02:33.5310596Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_int64 PASSED [1.6273s] [ 65%] 2025-12-04T14:02:33.5310735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_bool PASSED [0.0062s] [ 65%] 2025-12-04T14:02:33.5310881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_complex32 PASSED [0.0044s] [ 65%] 2025-12-04T14:02:33.5311035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_float16 PASSED [1.6333s] [ 66%] 2025-12-04T14:02:33.5311176Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_int16 PASSED [0.0063s] [ 66%] 2025-12-04T14:02:33.5311315Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_int32 PASSED [0.0043s] [ 66%] 2025-12-04T14:02:33.5311453Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_complex64 PASSED [1.6642s] [ 66%] 2025-12-04T14:02:33.5311592Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_float16 PASSED [0.0059s] [ 66%] 2025-12-04T14:02:33.5311729Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_int16 PASSED [0.0040s] [ 66%] 2025-12-04T14:02:33.5311862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_int64 PASSED [1.6773s] [ 66%] 2025-12-04T14:02:33.5311996Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_int8 PASSED [0.0055s] [ 66%] 2025-12-04T14:02:33.5312114Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_bool PASSED [0.0037s] [ 66%] 2025-12-04T14:02:33.5312237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_complex64 PASSED [1.6542s] [ 66%] 2025-12-04T14:02:33.5312357Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_float64 PASSED [0.0050s] [ 66%] 2025-12-04T14:02:33.5312476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_int64 PASSED [0.0039s] [ 66%] 2025-12-04T14:02:33.5312601Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_bfloat16 PASSED [0.0050s] [ 66%] 2025-12-04T14:02:33.5312722Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_bool PASSED [1.6659s] [ 66%] 2025-12-04T14:02:33.5312849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_complex128 PASSED [0.0068s] [ 66%] 2025-12-04T14:02:33.5312973Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_float32 PASSED [0.0047s] [ 66%] 2025-12-04T14:02:33.5313091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_int16 PASSED [0.0044s] [ 66%] 2025-12-04T14:02:33.5313234Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_bfloat16 PASSED [0.0073s] [ 66%] 2025-12-04T14:02:33.5313367Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_float16 PASSED [0.0070s] [ 66%] 2025-12-04T14:02:33.5313498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_float32 PASSED [0.0069s] [ 66%] 2025-12-04T14:02:33.5313640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_int16 PASSED [0.0070s] [ 66%] 2025-12-04T14:02:33.5313767Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_int32 PASSED [0.0069s] [ 66%] 2025-12-04T14:02:33.5313896Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_uint8 PASSED [0.0069s] [ 66%] 2025-12-04T14:02:33.5314022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_complex32 PASSED [0.0047s] [ 66%] 2025-12-04T14:02:33.5314146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_float64 PASSED [0.0046s] [ 66%] 2025-12-04T14:02:33.5314278Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_int16 PASSED [1.6423s] [ 66%] 2025-12-04T14:02:33.5314400Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_uint8 PASSED [0.0073s] [ 66%] 2025-12-04T14:02:33.5314536Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_float32 PASSED [0.0045s] [ 66%] 2025-12-04T14:02:33.5314669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_int64 PASSED [1.6814s] [ 66%] 2025-12-04T14:02:33.5314796Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_bool PASSED [0.0125s] [ 66%] 2025-12-04T14:02:33.5314919Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_float64 PASSED [0.0102s] [ 66%] 2025-12-04T14:02:33.5315038Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_int8 PASSED [0.0101s] [ 66%] 2025-12-04T14:02:33.5315162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_uint8 PASSED [0.0099s] [ 66%] 2025-12-04T14:02:33.5315286Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_cuda_bfloat16 PASSED [0.0163s] [ 66%] 2025-12-04T14:02:33.5315413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_cuda_bfloat16 PASSED [0.0215s] [ 66%] 2025-12-04T14:02:33.5315544Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_cuda_complex128 PASSED [1.6557s] [ 66%] 2025-12-04T14:02:33.5315669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_cuda_float16 PASSED [0.0248s] [ 66%] 2025-12-04T14:02:33.5315793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_cuda_float32 PASSED [1.6773s] [ 66%] 2025-12-04T14:02:33.5315934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_complex128 PASSED [0.0071s] [ 66%] 2025-12-04T14:02:33.5316072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_float32 PASSED [0.0045s] [ 66%] 2025-12-04T14:02:33.5316202Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_unbiased_cuda_float16 PASSED [1.6208s] [ 66%] 2025-12-04T14:02:33.5316335Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_unbiased_cuda_float32 PASSED [0.0063s] [ 66%] 2025-12-04T14:02:33.5316456Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_bfloat16 PASSED [0.0166s] [ 66%] 2025-12-04T14:02:33.5316580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_complex128 PASSED [0.0116s] [ 66%] 2025-12-04T14:02:33.5316699Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_float16 PASSED [0.0155s] [ 66%] 2025-12-04T14:02:33.5316820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_float32 PASSED [0.0113s] [ 66%] 2025-12-04T14:02:33.5316939Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_float64 PASSED [0.0111s] [ 66%] 2025-12-04T14:02:33.5317069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_bfloat16 PASSED [0.0178s] [ 66%] 2025-12-04T14:02:33.5317191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_complex128 PASSED [1.6320s] [ 66%] 2025-12-04T14:02:33.5317312Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_complex64 PASSED [0.0160s] [ 66%] 2025-12-04T14:02:33.5317442Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_float16 PASSED [0.0183s] [ 66%] 2025-12-04T14:02:33.5317561Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_float32 PASSED [0.0136s] [ 66%] 2025-12-04T14:02:33.5317677Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_int32 PASSED [0.0170s] [ 66%] 2025-12-04T14:02:33.5317806Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_bfloat16 PASSED [0.0137s] [ 66%] 2025-12-04T14:02:33.5317932Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_bool PASSED [0.0127s] [ 66%] 2025-12-04T14:02:33.5318069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_float16 PASSED [0.0136s] [ 66%] 2025-12-04T14:02:33.5318197Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_float32 PASSED [0.0103s] [ 66%] 2025-12-04T14:02:33.5318320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_cuda_complex128 PASSED [0.1650s] [ 66%] 2025-12-04T14:02:33.5318440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_cuda_float32 PASSED [0.1537s] [ 66%] 2025-12-04T14:02:33.5318567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_cuda_float64 PASSED [0.1533s] [ 66%] 2025-12-04T14:02:33.5318696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_lowrank_cuda_float64 PASSED [0.3333s] [ 66%] 2025-12-04T14:02:33.5318821Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_bfloat16 PASSED [0.0048s] [ 66%] 2025-12-04T14:02:33.5318940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_bool PASSED [1.6591s] [ 66%] 2025-12-04T14:02:33.5319065Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_complex64 PASSED [0.0062s] [ 66%] 2025-12-04T14:02:33.5319187Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_float32 PASSED [0.0043s] [ 66%] 2025-12-04T14:02:33.5319304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_bfloat16 PASSED [1.6450s] [ 66%] 2025-12-04T14:02:33.5319420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_float16 PASSED [0.0052s] [ 66%] 2025-12-04T14:02:33.5319535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_int64 PASSED [1.6780s] [ 66%] 2025-12-04T14:02:33.5319648Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_int8 PASSED [0.0056s] [ 66%] 2025-12-04T14:02:33.5319787Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_complex64 PASSED [0.0133s] [ 66%] 2025-12-04T14:02:33.5319920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_float32 PASSED [0.0118s] [ 66%] 2025-12-04T14:02:33.5320051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_int32 PASSED [0.0116s] [ 66%] 2025-12-04T14:02:33.5320212Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_int64 PASSED [0.0115s] [ 66%] 2025-12-04T14:02:33.5320343Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_uint8 PASSED [0.0115s] [ 66%] 2025-12-04T14:02:33.5320464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_float16 PASSED [0.0081s] [ 66%] 2025-12-04T14:02:33.5320582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_int32 PASSED [0.0079s] [ 66%] 2025-12-04T14:02:33.5320717Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_complex128 PASSED [1.6864s] [ 66%] 2025-12-04T14:02:33.5320836Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_float16 PASSED [0.0056s] [ 66%] 2025-12-04T14:02:33.5320955Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_float64 PASSED [0.0035s] [ 66%] 2025-12-04T14:02:33.5321083Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_int16 PASSED [1.7022s] [ 66%] 2025-12-04T14:02:33.5321198Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_int32 PASSED [0.0056s] [ 66%] 2025-12-04T14:02:33.5321316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_int64 PASSED [0.0040s] [ 66%] 2025-12-04T14:02:33.5321438Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_complex64 PASSED [1.7043s] [ 66%] 2025-12-04T14:02:33.5321555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_int16 PASSED [0.0054s] [ 66%] 2025-12-04T14:02:33.5321671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_int8 PASSED [0.0036s] [ 66%] 2025-12-04T14:02:33.5321819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_complex128 PASSED [0.0081s] [ 66%] 2025-12-04T14:02:33.5321949Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_float32 PASSED [0.0078s] [ 66%] 2025-12-04T14:02:33.5322078Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_int16 PASSED [1.6973s] [ 66%] 2025-12-04T14:02:33.5322217Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_int32 PASSED [0.0104s] [ 66%] 2025-12-04T14:02:33.5322345Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_int64 PASSED [0.0082s] [ 66%] 2025-12-04T14:02:33.5322470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_int8 PASSED [0.0079s] [ 66%] 2025-12-04T14:02:33.5322598Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_uint8 PASSED [1.6743s] [ 66%] 2025-12-04T14:02:33.5322720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_bfloat16 PASSED [0.0413s] [ 66%] 2025-12-04T14:02:33.5322845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_complex128 PASSED [0.0388s] [ 66%] 2025-12-04T14:02:33.5322967Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_complex64 PASSED [0.0386s] [ 66%] 2025-12-04T14:02:33.5323087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_float64 PASSED [0.0382s] [ 66%] 2025-12-04T14:02:33.5323204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_int32 PASSED [0.0382s] [ 66%] 2025-12-04T14:02:33.5323325Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_complex64 PASSED [0.0059s] [ 66%] 2025-12-04T14:02:33.5323442Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_float64 PASSED [1.6580s] [ 66%] 2025-12-04T14:02:33.5323559Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_int32 PASSED [0.0074s] [ 66%] 2025-12-04T14:02:33.5323673Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_int64 PASSED [1.6938s] [ 66%] 2025-12-04T14:02:33.5323786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_int8 PASSED [0.0074s] [ 66%] 2025-12-04T14:02:33.5323909Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_bool PASSED [1.6784s] [ 66%] 2025-12-04T14:02:33.5324034Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_uint8 PASSED [0.0055s] [ 66%] 2025-12-04T14:02:33.5324151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_topk_cuda_int16 PASSED [0.0076s] [ 66%] 2025-12-04T14:02:33.5324268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_topk_cuda_uint8 PASSED [0.0067s] 
[ 66%] 2025-12-04T14:02:33.5324542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_float16 SKIPPED [0.0006s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 66%] 2025-12-04T14:02:33.5324711Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__flash_attention_forward_cuda_bfloat16 PASSED [1.6728s] [ 66%] 2025-12-04T14:02:33.5324886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_int8 PASSED [0.0130s] [ 66%] 2025-12-04T14:02:33.5325004Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_bool PASSED [1.6788s] [ 66%] 2025-12-04T14:02:33.5325124Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_int16 PASSED [0.0053s] [ 66%] 2025-12-04T14:02:33.5325243Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_int32 PASSED [1.6807s] [ 66%] 2025-12-04T14:02:33.5325361Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_int64 PASSED [0.0052s] [ 66%] 2025-12-04T14:02:33.5325488Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_int8 PASSED [1.6781s] [ 66%] 2025-12-04T14:02:33.5325607Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_uint8 PASSED [0.0054s] [ 66%] 2025-12-04T14:02:33.5325743Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_bfloat16 PASSED [0.0078s] [ 66%] 2025-12-04T14:02:33.5325874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_bool PASSED [0.0069s] [ 66%] 2025-12-04T14:02:33.5326022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_complex128 PASSED [0.0068s] [ 66%] 2025-12-04T14:02:33.5326154Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_int64 PASSED [0.0067s] [ 66%] 
2025-12-04T14:02:33.5326285Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_uint8 PASSED [0.0067s] [ 66%] 2025-12-04T14:02:33.5326415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_bfloat16 PASSED [0.0046s] [ 66%] 2025-12-04T14:02:33.5326543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_float32 PASSED [0.0046s] [ 66%] 2025-12-04T14:02:33.5326666Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_int64 PASSED [1.6868s] [ 66%] 2025-12-04T14:02:33.5326797Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_complex64 PASSED [0.0404s] [ 66%] 2025-12-04T14:02:33.5326922Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_int32 PASSED [0.0373s] [ 66%] 2025-12-04T14:02:33.5327046Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_int64 PASSED [1.7122s] [ 66%] 2025-12-04T14:02:33.5327172Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_complex128 PASSED [0.0375s] [ 66%] 2025-12-04T14:02:33.5327294Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_float16 PASSED [0.0451s] [ 66%] 2025-12-04T14:02:33.5327414Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_float64 PASSED [0.0328s] [ 66%] 2025-12-04T14:02:33.5327534Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_int16 PASSED [0.0353s] [ 66%] 2025-12-04T14:02:33.5327654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_int64 PASSED [0.0335s] [ 66%] 2025-12-04T14:02:33.5327770Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_int8 PASSED [0.0375s] [ 66%] 2025-12-04T14:02:33.5327892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_bfloat16 PASSED [0.0128s] [ 66%] 2025-12-04T14:02:33.5328016Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_complex128 PASSED [1.6526s] [ 66%] 2025-12-04T14:02:33.5328147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_complex32 PASSED [0.0156s] [ 67%] 2025-12-04T14:02:33.5328268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_float16 PASSED [0.0127s] [ 67%] 2025-12-04T14:02:33.5328387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_float64 PASSED [0.0124s] [ 67%] 2025-12-04T14:02:33.5328504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_int32 PASSED [0.0123s] [ 67%] 2025-12-04T14:02:33.5328630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_int8 PASSED [0.0122s] [ 67%] 2025-12-04T14:02:33.5328750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_bfloat16 PASSED [1.6552s] [ 67%] 2025-12-04T14:02:33.5328867Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_bool PASSED [0.0158s] [ 67%] 2025-12-04T14:02:33.5328989Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_complex64 PASSED [0.0128s] [ 67%] 2025-12-04T14:02:33.5329110Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_float16 PASSED [0.0124s] [ 67%] 2025-12-04T14:02:33.5329240Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_float64 PASSED [1.6576s] [ 67%] 2025-12-04T14:02:33.5329368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_indices_cuda_int64 PASSED [0.0164s] [ 67%] 2025-12-04T14:02:33.5329498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_bfloat16 PASSED [0.0141s] [ 67%] 2025-12-04T14:02:33.5329624Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_int16 PASSED [0.0126s] [ 67%] 2025-12-04T14:02:33.5329753Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_float16 PASSED [0.0032s] [ 67%] 2025-12-04T14:02:33.5329873Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_float32 PASSED [1.6387s] [ 67%] 2025-12-04T14:02:33.5329993Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_float64 PASSED [0.0049s] [ 67%] 2025-12-04T14:02:33.5330148Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_int16 PASSED [1.6492s] [ 67%] 2025-12-04T14:02:33.5330267Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_uint8 PASSED [0.0048s] [ 67%] 2025-12-04T14:02:33.5330392Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_bool PASSED [0.0061s] [ 67%] 2025-12-04T14:02:33.5330527Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_complex32 PASSED [0.0055s] [ 67%] 2025-12-04T14:02:33.5330653Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_int64 PASSED [1.6584s] [ 67%] 2025-12-04T14:02:33.5330779Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_complex32 PASSED [0.0078s] [ 67%] 2025-12-04T14:02:33.5330900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_float64 PASSED [0.0053s] [ 67%] 2025-12-04T14:02:33.5331030Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_bfloat16 PASSED [0.0057s] [ 67%] 2025-12-04T14:02:33.5331161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_complex32 PASSED [0.0054s] [ 67%] 2025-12-04T14:02:33.5331283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_int8 PASSED [0.0052s] [ 67%] 2025-12-04T14:02:33.5331419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_complex128 PASSED [0.0148s] [ 67%] 2025-12-04T14:02:33.5331552Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_complex64 PASSED [1.6485s] [ 67%] 2025-12-04T14:02:33.5331678Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_int64 PASSED [0.0167s] [ 67%] 2025-12-04T14:02:33.5331802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_int8 PASSED [1.6438s] [ 67%] 2025-12-04T14:02:33.5331941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_bfloat16 PASSED [0.0109s] [ 67%] 2025-12-04T14:02:33.5332069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_complex128 PASSED [1.6552s] [ 67%] 2025-12-04T14:02:33.5332194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_complex32 PASSED [0.0110s] [ 67%] 2025-12-04T14:02:33.5332326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_float16 PASSED [1.6599s] [ 67%] 2025-12-04T14:02:33.5332446Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_int64 PASSED [0.0107s] [ 67%] 2025-12-04T14:02:33.5332566Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_uint8 PASSED [1.6506s] [ 67%] 2025-12-04T14:02:33.5332690Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_float32 PASSED [0.0060s] [ 67%] 2025-12-04T14:02:33.5332827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_int16 PASSED [0.1005s] [ 67%] 2025-12-04T14:02:33.5332976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_int64 PASSED [0.0998s] [ 67%] 2025-12-04T14:02:33.5333098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_float64 PASSED [0.2082s] [ 67%] 2025-12-04T14:02:33.5333218Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_int64 PASSED [0.2071s] [ 67%] 2025-12-04T14:02:33.5333338Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_int8 PASSED [0.2043s] [ 67%] 2025-12-04T14:02:33.5333470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_uint16 PASSED [0.2222s] [ 67%] 2025-12-04T14:02:33.5333591Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_uint64 PASSED [0.2213s] [ 67%] 2025-12-04T14:02:33.5333720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unravel_index_cuda_int64 PASSED [0.0569s] [ 67%] 2025-12-04T14:02:33.5333851Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unravel_index_cuda_uint8 PASSED [0.0570s] [ 67%] 2025-12-04T14:02:33.5333985Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_complex128 PASSED [0.0035s] [ 67%] 2025-12-04T14:02:33.5334119Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_complex32 PASSED [1.6596s] [ 67%] 2025-12-04T14:02:33.5334249Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_float16 PASSED [0.0057s] [ 67%] 2025-12-04T14:02:33.5334378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_int16 PASSED [1.6518s] [ 67%] 2025-12-04T14:02:33.5334506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_int32 PASSED [0.0055s] [ 67%] 2025-12-04T14:02:33.5334631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_int8 PASSED [1.6332s] [ 67%] 2025-12-04T14:02:33.5334758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_bool PASSED [0.0053s] [ 67%] 2025-12-04T14:02:33.5334889Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_float16 PASSED [1.6244s] [ 67%] 2025-12-04T14:02:33.5335019Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_float64 PASSED [0.0054s] [ 67%] 2025-12-04T14:02:33.5335150Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_bool PASSED [0.0082s] [ 67%] 2025-12-04T14:02:33.5335289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_complex128 PASSED [0.0075s] [ 67%] 2025-12-04T14:02:33.5335425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_complex32 PASSED [0.0074s] [ 67%] 2025-12-04T14:02:33.5335560Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_float32 PASSED [0.0073s] [ 67%] 2025-12-04T14:02:33.5335700Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_int32 PASSED [0.0073s] [ 67%] 2025-12-04T14:02:33.5335831Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_int64 PASSED [0.0072s] [ 67%] 2025-12-04T14:02:33.5335954Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_bool PASSED [0.0048s] [ 67%] 2025-12-04T14:02:33.5336095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_complex128 PASSED [0.0049s] [ 67%] 2025-12-04T14:02:33.5336225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_complex32 PASSED [1.6473s] [ 67%] 2025-12-04T14:02:33.5336349Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_int16 PASSED [0.0073s] [ 67%] 2025-12-04T14:02:33.5336473Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_int32 PASSED [0.0053s] [ 67%] 2025-12-04T14:02:33.5336593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_cuda_bfloat16 PASSED [0.0145s] [ 67%] 2025-12-04T14:02:33.5336724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_cuda_complex64 PASSED [0.0107s] [ 67%] 2025-12-04T14:02:33.5336849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_cuda_bfloat16 PASSED [0.0215s] [ 67%] 2025-12-04T14:02:33.5336975Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_cuda_float64 PASSED [0.0145s] [ 67%] 2025-12-04T14:02:33.5337111Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_unbiased_cuda_float16 PASSED [0.0045s] [ 67%] 2025-12-04T14:02:33.5337256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_unbiased_cuda_float32 PASSED [1.6420s] [ 67%] 2025-12-04T14:02:33.5337388Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_unbiased_cuda_complex64 PASSED [0.0059s] [ 67%] 2025-12-04T14:02:33.5337520Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_unbiased_cuda_float32 PASSED [0.0040s] [ 67%] 2025-12-04T14:02:33.5337650Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_unbiased_cuda_float64 PASSED [1.6247s] [ 67%] 2025-12-04T14:02:33.5337777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vdot_cuda_complex128 PASSED [0.0069s] [ 67%] 2025-12-04T14:02:33.5337897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vdot_cuda_float32 PASSED [0.0034s] [ 67%] 2025-12-04T14:02:33.5338031Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_complex_cuda_float16 PASSED [1.6663s] [ 67%] 2025-12-04T14:02:33.5338157Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_bfloat16 PASSED [0.0059s] [ 67%] 2025-12-04T14:02:33.5338280Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_float32 PASSED [0.0040s] [ 67%] 2025-12-04T14:02:33.5338401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_int8 PASSED [1.6350s] [ 67%] 2025-12-04T14:02:33.5338529Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_bfloat16 PASSED [0.0094s] [ 67%] 2025-12-04T14:02:33.5338651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_bool PASSED [1.6430s] [ 67%] 2025-12-04T14:02:33.5338776Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_float64 PASSED [0.0088s] [ 67%] 2025-12-04T14:02:33.5338900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_int32 PASSED [1.6241s] [ 67%] 2025-12-04T14:02:33.5339022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_uint8 PASSED [0.0089s] [ 67%] 2025-12-04T14:02:33.5339144Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_complex32 PASSED [1.6537s] [ 67%] 2025-12-04T14:02:33.5339265Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_complex64 PASSED [0.0075s] [ 67%] 2025-12-04T14:02:33.5339400Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_int64 PASSED [0.0052s] [ 67%] 2025-12-04T14:02:33.5339524Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_bfloat16 PASSED [1.6406s] [ 67%] 2025-12-04T14:02:33.5339646Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_float16 PASSED [0.0054s] [ 67%] 2025-12-04T14:02:33.5339778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_uint8 PASSED [0.0038s] [ 67%] 2025-12-04T14:02:33.5339903Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_bfloat16 PASSED [0.0055s] [ 67%] 2025-12-04T14:02:33.5340031Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_complex128 PASSED [0.0051s] [ 67%] 2025-12-04T14:02:33.5340181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_int16 PASSED [0.0051s] [ 67%] 2025-12-04T14:02:33.5340304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_bfloat16 PASSED [0.0085s] [ 67%] 2025-12-04T14:02:33.5340434Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_bool PASSED [0.0089s] [ 67%] 2025-12-04T14:02:33.5340560Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_complex128 PASSED [1.6703s] [ 67%] 2025-12-04T14:02:33.5340683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_complex32 PASSED [0.0114s] [ 67%] 2025-12-04T14:02:33.5340804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_float64 PASSED [0.0087s] [ 67%] 2025-12-04T14:02:33.5340938Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_int32 PASSED [0.0082s] [ 67%] 2025-12-04T14:02:33.5341059Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_float16 PASSED [0.0189s] [ 67%] 2025-12-04T14:02:33.5341179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_bfloat16 PASSED [0.0045s] [ 67%] 2025-12-04T14:02:33.5341300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_float16 PASSED [0.0044s] [ 67%] 2025-12-04T14:02:33.5341420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_float64 PASSED [0.0043s] [ 67%] 2025-12-04T14:02:33.5341542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_int16 PASSED [0.0044s] [ 67%] 2025-12-04T14:02:33.5341662Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_uint8 PASSED [0.0043s] [ 67%] 2025-12-04T14:02:33.5341785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_bfloat16 PASSED [1.6384s] [ 67%] 2025-12-04T14:02:33.5341912Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_complex128 PASSED [0.0049s] [ 67%] 2025-12-04T14:02:33.5342032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_float16 PASSED [1.6506s] [ 67%] 2025-12-04T14:02:33.5342152Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_float64 PASSED [0.0050s] [ 67%] 2025-12-04T14:02:33.5342272Z 
test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_int64 PASSED [1.6459s] [ 67%] 2025-12-04T14:02:33.5342397Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_bool PASSED [0.0091s] [ 67%] 2025-12-04T14:02:33.5342528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_complex32 PASSED [1.6356s] [ 67%] 2025-12-04T14:02:33.5342659Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_complex64 PASSED [0.0093s] [ 67%] 2025-12-04T14:02:33.5342787Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_float16 PASSED [1.6630s] [ 67%] 2025-12-04T14:02:33.5342916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_float32 PASSED [0.0093s] [ 67%] 2025-12-04T14:02:33.5343051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_int32 PASSED [1.6706s] [ 67%] 2025-12-04T14:02:33.5343153Z test_meta.py::TestMetaCUDA::test_embedding_bag_byte_prepack_cuda PASSED [0.0033s] [ 67%] 2025-12-04T14:02:33.5343264Z test_meta.py::TestMetaCUDA::test_embedding_bag_dense_backward_mode_1_cuda PASSED [0.0107s] [ 67%] 2025-12-04T14:02:33.5343361Z test_meta.py::TestMetaCUDA::test_fill__alias_relationship_cuda PASSED [0.0012s] [ 67%] 2025-12-04T14:02:33.5343727Z test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask3_cuda SKIPPED [0.0007s] (Only runs on cpu) [ 67%] 2025-12-04T14:02:33.5343819Z test_meta.py::TestMetaCUDA::test_huber_loss_backward_cuda PASSED [0.0015s] [ 67%] 2025-12-04T14:02:33.5343950Z test_meta.py::TestMetaCUDA::test_layer_norm_backward_output_mask1_cuda SKIPPED [0.0005s] (Only runs on cpu) [ 67%] 2025-12-04T14:02:33.5344078Z test_meta.py::TestMetaCUDA::test_layer_norm_backward_output_mask4_cuda SKIPPED [0.0005s] (Only runs on cpu) [ 67%] 2025-12-04T14:02:33.5344207Z test_meta.py::TestMetaCUDA::test_layer_norm_backward_output_mask5_cuda SKIPPED [0.0007s] (Only runs on cpu) [ 
67%] 2025-12-04T14:02:33.5344314Z test_meta.py::TestMetaCUDA::test_map_location_deserialize_cuda PASSED [0.0022s] [ 67%] 2025-12-04T14:02:33.5344431Z test_meta.py::TestMetaCUDA::test_meta_autograd_no_error_cuda SKIPPED [0.0005s] (Only runs on cpu) [ 67%] 2025-12-04T14:02:33.5344568Z test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 67%] 2025-12-04T14:02:33.5344701Z test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T14:02:33.5344845Z test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T14:02:33.5344979Z test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T14:02:33.5345114Z test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T14:02:33.5345246Z test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T14:02:33.5345386Z test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5345520Z test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5345652Z test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5345781Z test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5345914Z test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5346064Z test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_bfloat16 SKIPPED 
[0.0010s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5346217Z test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5346365Z test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5346514Z test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5346659Z test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5346804Z test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5346944Z test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5347091Z test_meta.py::TestMetaCUDA::test_meta_inplace___rand___cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5347231Z test_meta.py::TestMetaCUDA::test_meta_inplace___rand___cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5347374Z test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5347534Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmatmul___cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5347681Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmatmul___cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5347823Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5347964Z 
test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5348113Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5348250Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5348398Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5348540Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5348687Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5348826Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5348962Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5349098Z test_meta.py::TestMetaCUDA::test_meta_inplace___ror___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5349234Z test_meta.py::TestMetaCUDA::test_meta_inplace___ror___cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5349370Z test_meta.py::TestMetaCUDA::test_meta_inplace___ror___cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5349516Z test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5349654Z test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 
2025-12-04T14:02:33.5349796Z test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5349944Z test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5350085Z test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5350251Z test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5350389Z test_meta.py::TestMetaCUDA::test_meta_inplace___rxor___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5350527Z test_meta.py::TestMetaCUDA::test_meta_inplace___rxor___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5350694Z test_meta.py::TestMetaCUDA::test_meta_inplace__batch_norm_with_update_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5350848Z test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5350995Z test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T14:02:33.5351104Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_bfloat16 PASSED [0.0152s] [ 68%] 2025-12-04T14:02:33.5351226Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_complex128 XFAIL [0.0055s] [ 68%] 2025-12-04T14:02:33.5351334Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_complex64 XFAIL [0.0055s] [ 68%] 2025-12-04T14:02:33.5351443Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_float32 PASSED [1.6664s] [ 68%] 2025-12-04T14:02:33.5351546Z 
test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_int8 PASSED [0.0148s] [ 68%] 2025-12-04T14:02:33.5351650Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_uint8 PASSED [0.0142s] [ 68%] 2025-12-04T14:02:33.5351762Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_complex128 PASSED [0.0155s] [ 68%] 2025-12-04T14:02:33.5351881Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_float32 PASSED [0.0155s] [ 68%] 2025-12-04T14:02:33.5351986Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_float64 PASSED [0.0147s] [ 68%] 2025-12-04T14:02:33.5352090Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_int32 XFAIL [0.0057s] [ 68%] 2025-12-04T14:02:33.5352196Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_bfloat16 XFAIL [0.0290s] [ 68%] 2025-12-04T14:02:33.5352322Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_float64 XFAIL [1.6933s] [ 68%] 2025-12-04T14:02:33.5352424Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_int16 XFAIL [1.6699s] [ 68%] 2025-12-04T14:02:33.5352524Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_int64 XFAIL [1.6453s] [ 68%] 2025-12-04T14:02:33.5352627Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_int8 XFAIL [1.6656s] [ 68%] 2025-12-04T14:02:33.5352739Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_float32 PASSED [1.8064s] [ 68%] 2025-12-04T14:02:33.5352844Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_int16 XFAIL [0.0075s] [ 68%] 2025-12-04T14:02:33.5352949Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_uint8 XFAIL [1.6434s] [ 68%] 2025-12-04T14:02:33.5353057Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_int32 PASSED [1.7554s] [ 68%] 2025-12-04T14:02:33.5353165Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_bfloat16 PASSED [0.0150s] [ 68%] 2025-12-04T14:02:33.5353270Z 
test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_float32 PASSED [0.0145s] [ 68%] 2025-12-04T14:02:33.5353372Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_int64 XFAIL [0.0055s] [ 68%] 2025-12-04T14:02:33.5353473Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_int8 XFAIL [0.0055s] [ 68%] 2025-12-04T14:02:33.5353579Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_float32 PASSED [1.6954s] [ 68%] 2025-12-04T14:02:33.5353682Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_int16 XFAIL [0.0061s] [ 68%] 2025-12-04T14:02:33.5353786Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_int32 XFAIL [1.6499s] [ 68%] 2025-12-04T14:02:33.5353893Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_bfloat16 PASSED [1.7106s] [ 68%] 2025-12-04T14:02:33.5353995Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_bool XFAIL [0.0059s] [ 68%] 2025-12-04T14:02:33.5354100Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_float64 PASSED [1.6883s] [ 68%] 2025-12-04T14:02:33.5354205Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_int64 PASSED [0.0147s] [ 68%] 2025-12-04T14:02:33.5354319Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_bfloat16 PASSED [0.1500s] [ 68%] 2025-12-04T14:02:33.5354446Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_complex128 XFAIL [0.0066s] [ 68%] 2025-12-04T14:02:33.5354559Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_float32 PASSED [1.7775s] [ 68%] 2025-12-04T14:02:33.5354671Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_float64 PASSED [0.1490s] [ 68%] 2025-12-04T14:02:33.5354791Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_int8 PASSED [0.1015s] [ 68%] 2025-12-04T14:02:33.5354900Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_bool XFAIL [0.0140s] [ 68%] 
2025-12-04T14:02:33.5355012Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_int16 PASSED [1.7320s] [ 68%] 2025-12-04T14:02:33.5355123Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_int64 PASSED [0.1020s] [ 68%] 2025-12-04T14:02:33.5355230Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_bfloat16 PASSED [0.0153s] [ 68%] 2025-12-04T14:02:33.5355344Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_bool PASSED [0.0150s] [ 68%] 2025-12-04T14:02:33.5355449Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_float16 PASSED [0.0152s] [ 68%] 2025-12-04T14:02:33.5355554Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_float64 PASSED [0.0152s] [ 68%] 2025-12-04T14:02:33.5355661Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_int32 PASSED [0.0150s] [ 68%] 2025-12-04T14:02:33.5355765Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_bfloat16 PASSED [0.0145s] [ 68%] 2025-12-04T14:02:33.5355886Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_complex128 PASSED [0.0145s] [ 68%] 2025-12-04T14:02:33.5355987Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_int64 XFAIL [0.0055s] [ 68%] 2025-12-04T14:02:33.5356087Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_uint8 XFAIL [0.0055s] [ 68%] 2025-12-04T14:02:33.5356194Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_bfloat16 PASSED [1.7006s] [ 68%] 2025-12-04T14:02:33.5356297Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_bool XFAIL [0.0060s] [ 68%] 2025-12-04T14:02:33.5356405Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_complex64 PASSED [1.6566s] [ 68%] 2025-12-04T14:02:33.5356512Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_float16 PASSED [0.0150s] [ 68%] 2025-12-04T14:02:33.5356617Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_float64 PASSED 
[0.0145s] [ 68%] 2025-12-04T14:02:33.5356721Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_int16 XFAIL [0.0056s] [ 68%] 2025-12-04T14:02:33.5356823Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_int64 XFAIL [1.6551s] [ 68%] 2025-12-04T14:02:33.5356923Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_int32 XFAIL [1.6310s] [ 68%] 2025-12-04T14:02:33.5357025Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_int64 XFAIL [1.6778s] [ 68%] 2025-12-04T14:02:33.5357127Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_bool XFAIL [1.6492s] [ 68%] 2025-12-04T14:02:33.5357233Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_float16 PASSED [1.6763s] [ 68%] 2025-12-04T14:02:33.5357340Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_float32 PASSED [0.0149s] [ 68%] 2025-12-04T14:02:33.5357441Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_uint8 XFAIL [0.0057s] [ 68%] 2025-12-04T14:02:33.5357547Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_float16 PASSED [0.0146s] [ 68%] 2025-12-04T14:02:33.5357649Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_int64 XFAIL [0.0055s] [ 68%] 2025-12-04T14:02:33.5357754Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_bfloat16 PASSED [0.0145s] [ 68%] 2025-12-04T14:02:33.5357864Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_int16 XFAIL [0.0053s] [ 68%] 2025-12-04T14:02:33.5357965Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_uint8 XFAIL [0.0054s] [ 68%] 2025-12-04T14:02:33.5358077Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_complex128 PASSED [1.6646s] [ 68%] 2025-12-04T14:02:33.5358187Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_complex64 PASSED [0.0151s] [ 68%] 2025-12-04T14:02:33.5358306Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_float16 PASSED [0.0147s] [ 
68%] 2025-12-04T14:02:33.5358408Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_int8 XFAIL [0.0057s] [ 68%] 2025-12-04T14:02:33.5358517Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_bfloat16 PASSED [1.6691s] [ 68%] 2025-12-04T14:02:33.5358626Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_complex128 XFAIL [0.0061s] [ 68%] 2025-12-04T14:02:33.5358736Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_complex64 XFAIL [1.6731s] [ 68%] 2025-12-04T14:02:33.5358852Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_float64 PASSED [1.6948s] [ 68%] 2025-12-04T14:02:33.5358958Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_int8 PASSED [0.0148s] [ 68%] 2025-12-04T14:02:33.5359064Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_complex64 XFAIL [0.0057s] [ 68%] 2025-12-04T14:02:33.5359171Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_float16 PASSED [0.0255s] [ 68%] 2025-12-04T14:02:33.5359286Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_float32 PASSED [0.0250s] [ 68%] 2025-12-04T14:02:33.5359388Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_uint8 XFAIL [0.0054s] [ 68%] 2025-12-04T14:02:33.5359495Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_bfloat16 PASSED [0.1021s] [ 68%] 2025-12-04T14:02:33.5359606Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_complex128 PASSED [0.1332s] [ 68%] 2025-12-04T14:02:33.5359712Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_float16 PASSED [0.1019s] [ 68%] 2025-12-04T14:02:33.5359817Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_float32 PASSED [0.1019s] [ 68%] 2025-12-04T14:02:33.5359919Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_int16 XFAIL [0.0071s] [ 68%] 2025-12-04T14:02:33.5360021Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_int64 
XFAIL [1.6605s] [ 68%] 2025-12-04T14:02:33.5360176Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_complex128 XFAIL [1.6703s] [ 68%] 2025-12-04T14:02:33.5360287Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_complex64 XFAIL [1.6687s] [ 68%] 2025-12-04T14:02:33.5360396Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_float32 PASSED [1.6501s] [ 68%] 2025-12-04T14:02:33.5360579Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_uint8 SKIPPED [0.0002s] (In-place lgamma not supported for integral tensors) [ 68%] 2025-12-04T14:02:33.5360688Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_float32 PASSED [0.0148s] [ 68%] 2025-12-04T14:02:33.5360793Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_float64 PASSED [0.0145s] [ 68%] 2025-12-04T14:02:33.5360899Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_int16 XFAIL [0.0056s] [ 68%] 2025-12-04T14:02:33.5361003Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_int64 XFAIL [1.6584s] [ 68%] 2025-12-04T14:02:33.5361107Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_int8 XFAIL [1.6460s] [ 68%] 2025-12-04T14:02:33.5361212Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_uint8 XFAIL [1.6651s] [ 68%] 2025-12-04T14:02:33.5361320Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_bfloat16 PASSED [1.6445s] [ 69%] 2025-12-04T14:02:33.5361440Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_float16 PASSED [0.0150s] [ 69%] 2025-12-04T14:02:33.5361547Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_float64 PASSED [0.0145s] [ 69%] 2025-12-04T14:02:33.5361651Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_int64 XFAIL [0.0057s] [ 69%] 2025-12-04T14:02:33.5361753Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_int8 XFAIL [1.6476s] [ 69%] 2025-12-04T14:02:33.5361869Z 
test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_uint8 XFAIL [1.6394s] [ 69%] 2025-12-04T14:02:33.5361975Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_bfloat16 PASSED [1.6642s] [ 69%] 2025-12-04T14:02:33.5362081Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_float64 PASSED [0.0149s] [ 69%] 2025-12-04T14:02:33.5362183Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_int64 XFAIL [0.0057s] [ 69%] 2025-12-04T14:02:33.5362284Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_int8 XFAIL [0.0055s] [ 69%] 2025-12-04T14:02:33.5362398Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_uint8 XFAIL [1.6437s] [ 69%] 2025-12-04T14:02:33.5362499Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_bool XFAIL [1.6582s] [ 69%] 2025-12-04T14:02:33.5362605Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_complex64 PASSED [1.6456s] [ 69%] 2025-12-04T14:02:33.5362707Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_int16 XFAIL [0.0060s] [ 69%] 2025-12-04T14:02:33.5362806Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_int32 XFAIL [1.6644s] [ 69%] 2025-12-04T14:02:33.5362919Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_int8 XFAIL [1.6347s] [ 69%] 2025-12-04T14:02:33.5363071Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_float32 SKIPPED [1.6316s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5363183Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_bfloat16 PASSED [0.1521s] [ 69%] 2025-12-04T14:02:33.5363291Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_bool XFAIL [0.0142s] [ 69%] 2025-12-04T14:02:33.5363401Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_complex64 XFAIL [1.6501s] [ 69%] 2025-12-04T14:02:33.5363509Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_int32 PASSED [1.7652s] [ 
69%] 2025-12-04T14:02:33.5363615Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_int8 PASSED [0.1014s] [ 69%] 2025-12-04T14:02:33.5363729Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_complex128 XFAIL [0.0067s] [ 69%] 2025-12-04T14:02:33.5363831Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_int32 PASSED [1.6955s] [ 69%] 2025-12-04T14:02:33.5363933Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_int64 PASSED [0.0581s] [ 69%] 2025-12-04T14:02:33.5364032Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_int8 PASSED [0.0557s] [ 69%] 2025-12-04T14:02:33.5364141Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_bfloat16 PASSED [0.0149s] [ 69%] 2025-12-04T14:02:33.5364241Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_bool XFAIL [0.0055s] [ 69%] 2025-12-04T14:02:33.5364348Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_complex64 PASSED [1.6930s] [ 69%] 2025-12-04T14:02:33.5364451Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_int16 PASSED [0.0151s] [ 69%] 2025-12-04T14:02:33.5364553Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_int32 PASSED [0.0146s] [ 69%] 2025-12-04T14:02:33.5364700Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5364852Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5365012Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5365161Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5365310Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_uint8 SKIPPED 
[0.0011s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5365428Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_bfloat16 PASSED [0.0871s] [ 69%] 2025-12-04T14:02:33.5365535Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_complex64 PASSED [0.1090s] [ 69%] 2025-12-04T14:02:33.5365642Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_float16 PASSED [0.0869s] [ 69%] 2025-12-04T14:02:33.5365748Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_float32 PASSED [0.0823s] [ 69%] 2025-12-04T14:02:33.5365853Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_float64 PASSED [0.0847s] [ 69%] 2025-12-04T14:02:33.5365991Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_complex128 PASSED [0.0151s] [ 69%] 2025-12-04T14:02:33.5366109Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_complex64 PASSED [0.0147s] [ 69%] 2025-12-04T14:02:33.5366225Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_float64 PASSED [0.0143s] [ 69%] 2025-12-04T14:02:33.5366335Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_int16 XFAIL [0.0055s] [ 69%] 2025-12-04T14:02:33.5366454Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_int8 XFAIL [0.0062s] [ 69%] 2025-12-04T14:02:33.5366563Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_complex64 XFAIL [1.6670s] [ 69%] 2025-12-04T14:02:33.5366668Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_int32 PASSED [1.6716s] [ 69%] 2025-12-04T14:02:33.5366773Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_int8 PASSED [0.0146s] [ 69%] 2025-12-04T14:02:33.5366882Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_bfloat16 PASSED [0.0148s] [ 69%] 2025-12-04T14:02:33.5366989Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_float16 PASSED [0.0146s] [ 69%] 2025-12-04T14:02:33.5367094Z 
test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_bool XFAIL [0.0055s] [ 69%] 2025-12-04T14:02:33.5367201Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_int32 XFAIL [0.0054s] [ 69%] 2025-12-04T14:02:33.5367307Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sign_cuda_complex64 XFAIL [1.6524s] [ 69%] 2025-12-04T14:02:33.5367412Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sign_cuda_int32 PASSED [1.6723s] [ 69%] 2025-12-04T14:02:33.5367512Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_bool XFAIL [0.0060s] [ 69%] 2025-12-04T14:02:33.5367613Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_int16 XFAIL [1.6795s] [ 69%] 2025-12-04T14:02:33.5367714Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_int8 XFAIL [1.6551s] [ 69%] 2025-12-04T14:02:33.5367827Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_complex128 PASSED [1.6800s] [ 69%] 2025-12-04T14:02:33.5367929Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_int32 XFAIL [0.0059s] [ 69%] 2025-12-04T14:02:33.5368032Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_int64 XFAIL [1.6633s] [ 69%] 2025-12-04T14:02:33.5368133Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_uint8 XFAIL [1.6598s] [ 69%] 2025-12-04T14:02:33.5368242Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_bfloat16 PASSED [1.6561s] [ 69%] 2025-12-04T14:02:33.5368342Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_bool XFAIL [0.0059s] [ 69%] 2025-12-04T14:02:33.5368441Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_int8 XFAIL [1.6823s] [ 69%] 2025-12-04T14:02:33.5368552Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_bool XFAIL [1.6469s] [ 69%] 2025-12-04T14:02:33.5368659Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_complex64 XFAIL [1.6737s] [ 69%] 2025-12-04T14:02:33.5368763Z 
test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_float16 XFAIL [0.0160s] [ 69%] 2025-12-04T14:02:33.5368876Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_float64 XFAIL [1.6821s] [ 69%] 2025-12-04T14:02:33.5368977Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_int32 XFAIL [0.0132s] [ 69%] 2025-12-04T14:02:33.5369085Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_complex128 PASSED [1.6979s] [ 69%] 2025-12-04T14:02:33.5369191Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_float32 PASSED [0.0148s] [ 69%] 2025-12-04T14:02:33.5369289Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_int8 XFAIL [0.0056s] [ 69%] 2025-12-04T14:02:33.5369399Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_complex64 PASSED [0.0145s] [ 69%] 2025-12-04T14:02:33.5369510Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_int16 XFAIL [0.0054s] [ 69%] 2025-12-04T14:02:33.5369611Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_int32 XFAIL [0.0054s] [ 69%] 2025-12-04T14:02:33.5369712Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_int64 XFAIL [1.6742s] [ 69%] 2025-12-04T14:02:33.5369815Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_bool XFAIL [1.6716s] [ 69%] 2025-12-04T14:02:33.5369934Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_complex128 XFAIL [1.6718s] [ 69%] 2025-12-04T14:02:33.5370042Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_float16 PASSED [1.6812s] [ 69%] 2025-12-04T14:02:33.5370180Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_float32 PASSED [0.0147s] [ 69%] 2025-12-04T14:02:33.5370287Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_uint8 PASSED [0.0142s] [ 69%] 2025-12-04T14:02:33.5370399Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_complex128 PASSED [0.0125s] [ 69%] 
2025-12-04T14:02:33.5370504Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_float32 PASSED [0.0123s] [ 69%] 2025-12-04T14:02:33.5370609Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_int64 PASSED [0.0121s] [ 69%] 2025-12-04T14:02:33.5370713Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_uint8 PASSED [0.0121s] [ 69%] 2025-12-04T14:02:33.5370880Z test_meta.py::TestMetaCUDA::test_meta_inplace__native_batch_norm_legit_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5371045Z test_meta.py::TestMetaCUDA::test_meta_inplace__native_batch_norm_legit_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5371211Z test_meta.py::TestMetaCUDA::test_meta_inplace__segment_reduce_offsets_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5371375Z test_meta.py::TestMetaCUDA::test_meta_inplace__softmax_backward_data_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5371537Z test_meta.py::TestMetaCUDA::test_meta_inplace__softmax_backward_data_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5371698Z test_meta.py::TestMetaCUDA::test_meta_inplace__softmax_backward_data_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5371864Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5372029Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5372225Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5372406Z 
test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5372580Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5372769Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5372935Z test_meta.py::TestMetaCUDA::test_meta_inplace__upsample_bilinear2d_aa_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5373099Z test_meta.py::TestMetaCUDA::test_meta_inplace__upsample_bilinear2d_aa_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5373196Z test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_float64 PASSED [1.6648s] [ 69%] 2025-12-04T14:02:33.5373301Z test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_int8 PASSED [0.0044s] [ 69%] 2025-12-04T14:02:33.5373394Z test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_uint8 PASSED [1.6587s] [ 69%] 2025-12-04T14:02:33.5373586Z test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_int32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 69%] 2025-12-04T14:02:33.5373775Z test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_bool SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 69%] 2025-12-04T14:02:33.5373883Z test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_float32 PASSED [0.0047s] [ 69%] 2025-12-04T14:02:33.5373979Z test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_float64 PASSED [1.6309s] [ 69%] 2025-12-04T14:02:33.5374170Z test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_int32 SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with 
non-float input) [ 69%] 2025-12-04T14:02:33.5374361Z test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_uint8 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 69%] 2025-12-04T14:02:33.5374452Z test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_bool PASSED [0.0071s] [ 69%] 2025-12-04T14:02:33.5374548Z test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_float16 PASSED [0.0060s] [ 69%] 2025-12-04T14:02:33.5374642Z test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_float64 PASSED [0.0058s] [ 69%] 2025-12-04T14:02:33.5374733Z test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_int8 PASSED [0.0058s] [ 69%] 2025-12-04T14:02:33.5374835Z test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_complex128 PASSED [1.6776s] [ 69%] 2025-12-04T14:02:33.5374932Z test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_float32 PASSED [0.0056s] [ 69%] 2025-12-04T14:02:33.5375029Z test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_float64 PASSED [1.6624s] [ 69%] 2025-12-04T14:02:33.5375130Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcdiv_cuda_float32 PASSED [0.0099s] [ 69%] 2025-12-04T14:02:33.5375227Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_float64 PASSED [1.6586s] [ 69%] 2025-12-04T14:02:33.5375323Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_int16 PASSED [0.0098s] [ 69%] 2025-12-04T14:02:33.5375419Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_int64 PASSED [1.6581s] [ 69%] 2025-12-04T14:02:33.5375514Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_int8 PASSED [0.0100s] [ 69%] 2025-12-04T14:02:33.5375617Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_cuda_complex128 PASSED [1.6520s] [ 69%] 2025-12-04T14:02:33.5375729Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_decomposed_cuda_float32 PASSED [0.0075s] [ 69%] 2025-12-04T14:02:33.5375841Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_decomposed_cuda_float64 PASSED 
[0.0054s] [ 69%] 2025-12-04T14:02:33.5375950Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_complex128 PASSED [0.0047s] [ 69%] 2025-12-04T14:02:33.5376051Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_complex64 PASSED [0.0045s] [ 69%] 2025-12-04T14:02:33.5376145Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_float32 PASSED [0.0045s] [ 69%] 2025-12-04T14:02:33.5376251Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_float64 PASSED [0.0044s] [ 69%] 2025-12-04T14:02:33.5376347Z test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_complex64 PASSED [1.6410s] [ 69%] 2025-12-04T14:02:33.5376441Z test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_int8 PASSED [0.0063s] [ 69%] 2025-12-04T14:02:33.5376591Z test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_bfloat16 SKIPPED [0.0012s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5376735Z test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_bool SKIPPED [0.0012s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5376882Z test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5377037Z test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T14:02:33.5377179Z test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5377318Z test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5377465Z test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5377616Z test_meta.py::TestMetaCUDA::test_meta_inplace_allclose_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5377766Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_allclose_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5377901Z test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5378035Z test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5378176Z test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5378311Z test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5378448Z test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5378594Z test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5378737Z test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5378882Z test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5379023Z test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5379163Z test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5379305Z test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5379445Z test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5379590Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5379741Z test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5379885Z test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5380024Z test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5380227Z test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5380361Z test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5380498Z test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5380642Z test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5380798Z test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5380937Z test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5381075Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5381217Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5381369Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5381513Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5381656Z test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5381800Z test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5381940Z test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5382082Z test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5382231Z test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5382380Z test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5382529Z test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5382671Z test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5382814Z test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5382971Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5383128Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5383281Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable 
for this op) [ 70%] 2025-12-04T14:02:33.5383431Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5383549Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_bfloat16 PASSED [1.6610s] [ 70%] 2025-12-04T14:02:33.5383651Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_bool PASSED [0.0059s] [ 70%] 2025-12-04T14:02:33.5383759Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_complex64 PASSED [0.0044s] [ 70%] 2025-12-04T14:02:33.5383862Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_float16 PASSED [0.0040s] [ 70%] 2025-12-04T14:02:33.5383978Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_float32 PASSED [1.6498s] [ 70%] 2025-12-04T14:02:33.5384079Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_float64 PASSED [0.0060s] [ 70%] 2025-12-04T14:02:33.5384180Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_int16 PASSED [0.0042s] [ 70%] 2025-12-04T14:02:33.5384277Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_int8 PASSED [0.0040s] [ 70%] 2025-12-04T14:02:33.5384377Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_uint8 PASSED [1.6731s] [ 70%] 2025-12-04T14:02:33.5384493Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_bool XFAIL [0.0059s] [ 70%] 2025-12-04T14:02:33.5384623Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_float32 XFAIL [1.6750s] [ 70%] 2025-12-04T14:02:33.5384740Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_int64 XFAIL [1.6559s] [ 70%] 2025-12-04T14:02:33.5384857Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_int8 XFAIL [1.6565s] [ 70%] 2025-12-04T14:02:33.5385029Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_complex64 SKIPPED [1.6557s] (No inplace variable for this op) [ 70%] 
2025-12-04T14:02:33.5385186Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_float16 SKIPPED [0.0016s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5385342Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5385499Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5385654Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5385807Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5385959Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5386058Z test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_bfloat16 PASSED [1.6546s] [ 70%] 2025-12-04T14:02:33.5386255Z test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_complex64 SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5386354Z test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_bfloat16 PASSED [1.6490s] [ 70%] 2025-12-04T14:02:33.5386542Z test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_bool SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5386737Z test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_complex64 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5386835Z test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_float16 PASSED [1.6378s] [ 70%] 2025-12-04T14:02:33.5386932Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_float32 PASSED [0.0044s] [ 70%] 2025-12-04T14:02:33.5387125Z test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_int16 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5387324Z test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_int32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5387512Z test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5387611Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_bfloat16 PASSED [0.0060s] [ 70%] 2025-12-04T14:02:33.5387805Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5387902Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_float16 PASSED [0.0055s] [ 70%] 2025-12-04T14:02:33.5387997Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_float32 PASSED [0.0054s] [ 70%] 2025-12-04T14:02:33.5388185Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5388372Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5388480Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_bfloat16 PASSED [1.6424s] [ 70%] 2025-12-04T14:02:33.5388662Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_bool SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5388858Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_complex128 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5389058Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5389155Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_float32 PASSED [1.6781s] [ 70%] 2025-12-04T14:02:33.5389253Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_float64 PASSED [0.0045s] [ 70%] 2025-12-04T14:02:33.5389440Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5389623Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5389809Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5390004Z test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5390240Z test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5390338Z test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_float16 PASSED [1.6383s] [ 70%] 2025-12-04T14:02:33.5390525Z test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_int32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5390712Z test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_int64 SKIPPED [0.0013s] (Op 
promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5390900Z test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T14:02:33.5391054Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5391206Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5391367Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5391514Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5391671Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5391819Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5391961Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5392113Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5392263Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5392421Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5392564Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_int32 SKIPPED [0.0009s] (No inplace variable for 
this op) [ 70%] 2025-12-04T14:02:33.5392707Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5392861Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5393001Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5393108Z test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_complex128 PASSED [0.0075s] [ 70%] 2025-12-04T14:02:33.5393208Z test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_float32 PASSED [0.0040s] [ 70%] 2025-12-04T14:02:33.5393308Z test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_float64 PASSED [1.6527s] [ 70%] 2025-12-04T14:02:33.5393460Z test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_complex128 SKIPPED [0.0016s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5393605Z test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_int16 SKIPPED [0.0013s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5393748Z test_meta.py::TestMetaCUDA::test_meta_inplace_bincount_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5393891Z test_meta.py::TestMetaCUDA::test_meta_inplace_bincount_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5393994Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_and_cuda_bool PASSED [0.0066s] [ 70%] 2025-12-04T14:02:33.5395467Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_and_cuda_int8 PASSED [0.0055s] [ 70%] 2025-12-04T14:02:33.5395587Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_left_shift_cuda_int16 PASSED [0.0054s] [ 70%] 2025-12-04T14:02:33.5395701Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_left_shift_cuda_int32 PASSED [0.0053s] [ 70%] 2025-12-04T14:02:33.5395804Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_not_cuda_int16 PASSED [1.6910s] [ 70%] 2025-12-04T14:02:33.5395907Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_or_cuda_int16 PASSED [0.0076s] [ 70%] 2025-12-04T14:02:33.5396007Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_or_cuda_int8 PASSED [0.0057s] [ 70%] 2025-12-04T14:02:33.5396107Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_xor_cuda_bool PASSED [0.0055s] [ 70%] 2025-12-04T14:02:33.5396249Z test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5396415Z test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5396555Z test_meta.py::TestMetaCUDA::test_meta_inplace_bmm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5396694Z test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5396843Z test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5396980Z test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5397118Z test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5397281Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5397441Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T14:02:33.5397606Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_int16 SKIPPED [0.0009s] (No inplace variable for 
this op) [ 70%] 2025-12-04T14:02:33.5397761Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5397915Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5398076Z test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5398220Z test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5398361Z test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5398497Z test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5398633Z test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5398789Z test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5398940Z test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5399090Z test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5399230Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5399366Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5399508Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_complex32 SKIPPED [0.0010s] 
(No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5399646Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5399784Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5399919Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5400052Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5400224Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5400380Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5400480Z test_meta.py::TestMetaCUDA::test_meta_inplace_cauchy_cuda_float32 PASSED [1.6742s] [ 71%] 2025-12-04T14:02:33.5400628Z test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_bfloat16 SKIPPED [0.0016s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5400789Z test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_complex128 SKIPPED [0.0013s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5400934Z test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5401031Z test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_float16 PASSED [1.6721s] [ 71%] 2025-12-04T14:02:33.5401128Z test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_float32 PASSED [0.0044s] [ 71%] 2025-12-04T14:02:33.5401222Z test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_int16 PASSED [1.6482s] [ 71%] 2025-12-04T14:02:33.5401329Z test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_int8 PASSED [0.0044s] [ 71%] 
2025-12-04T14:02:33.5401469Z test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_bool SKIPPED [0.0012s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5401618Z test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5401761Z test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5401918Z test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5402057Z test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5402193Z test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5402339Z test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5402483Z test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5402626Z test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5402764Z test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5402902Z test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5403044Z test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5403183Z test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 71%] 
2025-12-04T14:02:33.5403321Z test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5403458Z test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5403593Z test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5403728Z test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5403862Z test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5404014Z test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5404173Z test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_cuda_complex64 SKIPPED [0.0008s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5404333Z test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_inverse_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5404499Z test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_inverse_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5404654Z test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_inverse_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5404812Z test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_solve_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5404957Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5405103Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_complex64 SKIPPED [0.0010s] 
(No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5405256Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5405398Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5405536Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5405681Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5405783Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_bfloat16 PASSED [0.0088s] [ 71%] 2025-12-04T14:02:33.5405878Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_int16 PASSED [0.0081s] [ 71%] 2025-12-04T14:02:33.5405979Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_int16 PASSED [0.0063s] [ 71%] 2025-12-04T14:02:33.5406079Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_uint8 PASSED [0.0062s] [ 71%] 2025-12-04T14:02:33.5406186Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_bfloat16 PASSED [0.0063s] [ 71%] 2025-12-04T14:02:33.5406281Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_bool PASSED [0.0062s] [ 71%] 2025-12-04T14:02:33.5406385Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_float16 PASSED [0.0063s] [ 71%] 2025-12-04T14:02:33.5406486Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_float64 PASSED [0.0062s] [ 71%] 2025-12-04T14:02:33.5406585Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_int32 PASSED [0.0062s] [ 71%] 2025-12-04T14:02:33.5406721Z test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5406868Z test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_complex32 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5407009Z test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5407162Z test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5407312Z test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5407461Z test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5407618Z test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5407770Z test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5407928Z test_meta.py::TestMetaCUDA::test_meta_inplace_complex_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5408073Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5408213Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5408357Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5408468Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_bfloat16 PASSED [1.6926s] [ 71%] 2025-12-04T14:02:33.5408580Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_complex32 PASSED [0.0043s] [ 71%] 2025-12-04T14:02:33.5408688Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_float16 PASSED [1.6736s] [ 71%] 
2025-12-04T14:02:33.5408794Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_float32 PASSED [0.0041s] [ 71%] 2025-12-04T14:02:33.5408913Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_int32 PASSED [1.6782s] [ 71%] 2025-12-04T14:02:33.5409066Z test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_float32 SKIPPED [0.0016s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5409218Z test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_float64 SKIPPED [0.0013s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5409386Z test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5409535Z test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_int64 SKIPPED [0.0012s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5409689Z test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5409841Z test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5409992Z test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5410174Z test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5410321Z test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5410516Z test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T14:02:33.5410620Z test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_float32 PASSED 
[0.0081s] [ 71%] 2025-12-04T14:02:33.5410816Z test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T14:02:33.5410966Z test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5411117Z test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5411263Z test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5411406Z test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5411548Z test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5411703Z test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5411845Z test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5412030Z test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T14:02:33.5412138Z test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_float32 PASSED [1.6862s] [ 71%] 2025-12-04T14:02:33.5412234Z test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_float64 PASSED [0.0055s] [ 71%] 2025-12-04T14:02:33.5412421Z test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T14:02:33.5412606Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_bool SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T14:02:33.5412800Z test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T14:02:33.5412997Z test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T14:02:33.5413184Z test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T14:02:33.5413346Z test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5413502Z test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5413653Z test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5413806Z test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5413955Z test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5414097Z test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5414240Z test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5414378Z test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) 
[ 71%] 2025-12-04T14:02:33.5414521Z test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5414662Z test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5414800Z test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5414943Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5415086Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5415225Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5415364Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5415480Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_bfloat16 SKIPPED [0.0010s] (Skipped) [ 71%] 2025-12-04T14:02:33.5415610Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_float16 SKIPPED [0.0009s] (Skipped) [ 71%] 2025-12-04T14:02:33.5415726Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_float32 SKIPPED [0.0009s] (Skipped) [ 71%] 2025-12-04T14:02:33.5415838Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_int32 SKIPPED [0.0010s] (Skipped) [ 71%] 2025-12-04T14:02:33.5415960Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_int64 SKIPPED [0.0009s] (Skipped) [ 71%] 2025-12-04T14:02:33.5416070Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_uint8 SKIPPED [0.0009s] (Skipped) [ 71%] 2025-12-04T14:02:33.5416182Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_int16 SKIPPED [0.0009s] (Skipped) [ 71%] 
2025-12-04T14:02:33.5416345Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5416506Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T14:02:33.5416618Z test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_float32 PASSED [0.0029s] [ 72%] 2025-12-04T14:02:33.5416716Z test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_float64 PASSED [1.6727s] [ 72%] 2025-12-04T14:02:33.5416910Z test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_int64 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5417099Z test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_int8 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5417250Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_bfloat16 SKIPPED [0.0012s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5417394Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5417537Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5417678Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5417817Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5417953Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5418089Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_int32 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5418240Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5418386Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5418529Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5418673Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5418824Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5418973Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5419116Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5419270Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5419427Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5419569Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5419715Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5419869Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5420011Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5420208Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5420362Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5420516Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5420679Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5420816Z test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5420959Z test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5421113Z test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5421250Z test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5421387Z test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5421492Z test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_bfloat16 PASSED [1.6698s] [ 72%] 2025-12-04T14:02:33.5421684Z test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_int32 SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5421877Z test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_uint8 SKIPPED [0.0014s] (Op promotes to float, which is 
impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5422020Z test_meta.py::TestMetaCUDA::test_meta_inplace_dist_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5422159Z test_meta.py::TestMetaCUDA::test_meta_inplace_dist_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5422272Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_float32 PASSED [0.0134s] [ 72%] 2025-12-04T14:02:33.5422384Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_int8 PASSED [0.0079s] [ 72%] 2025-12-04T14:02:33.5422599Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5422813Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_complex32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5423025Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5423141Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_float32 PASSED [0.0055s] [ 72%] 2025-12-04T14:02:33.5423257Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_float64 PASSED [0.0054s] [ 72%] 2025-12-04T14:02:33.5423473Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5423680Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5423805Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_div_trunc_rounding_cuda_float32 PASSED [1.6730s] [ 72%] 2025-12-04T14:02:33.5423945Z test_meta.py::TestMetaCUDA::test_meta_inplace_dot_cuda_float16 SKIPPED [0.0017s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5424081Z test_meta.py::TestMetaCUDA::test_meta_inplace_dot_cuda_float64 SKIPPED [0.0013s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5424219Z test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5424364Z test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5424527Z test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5424666Z test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5424805Z test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5424952Z test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5425091Z test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5425236Z test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5425375Z test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5425517Z test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5425659Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5425798Z test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5425942Z test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5426084Z test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5426224Z test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5426367Z test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5426503Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5426647Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_complex32 SKIPPED [0.0008s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5426790Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5426927Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5427065Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5427215Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5427365Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5427508Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_int32 SKIPPED [0.0013s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5427659Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5427818Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5427971Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5428126Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5428285Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5428437Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5428592Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5428752Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5428900Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5429047Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_int64 SKIPPED [0.0013s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5429195Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5429291Z test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_bool PASSED [0.0069s] [ 
72%] 2025-12-04T14:02:33.5429389Z test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_complex64 PASSED [0.0060s] [ 72%] 2025-12-04T14:02:33.5429485Z test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_float16 PASSED [0.0059s] [ 72%] 2025-12-04T14:02:33.5429578Z test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_float32 PASSED [0.0058s] [ 72%] 2025-12-04T14:02:33.5429671Z test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_int16 PASSED [0.0058s] [ 72%] 2025-12-04T14:02:33.5429761Z test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_int32 PASSED [0.0058s] [ 72%] 2025-12-04T14:02:33.5429905Z test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5430042Z test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5430269Z test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5430365Z test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_float16 PASSED [1.6443s] [ 72%] 2025-12-04T14:02:33.5430550Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5430647Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_float64 PASSED [1.6636s] [ 72%] 2025-12-04T14:02:33.5430831Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_int8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5431038Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_int32 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5431226Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_int8 SKIPPED [0.0010s] (Op 
promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5431325Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_bfloat16 PASSED [1.6770s] [ 72%] 2025-12-04T14:02:33.5431433Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_float64 PASSED [0.0069s] [ 72%] 2025-12-04T14:02:33.5431620Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_int64 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5431808Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_uint8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5431903Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_bfloat16 PASSED [1.6820s] [ 72%] 2025-12-04T14:02:33.5432086Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5432291Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_complex128 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5432387Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_float16 PASSED [0.0049s] [ 72%] 2025-12-04T14:02:33.5432480Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_float32 PASSED [1.6685s] [ 72%] 2025-12-04T14:02:33.5432642Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_bfloat16 SKIPPED [0.0017s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5432789Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_float32 SKIPPED [0.0014s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5432934Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5433086Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_bfloat16 SKIPPED [0.0012s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5433229Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5433382Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5433525Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5433666Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5433856Z test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T14:02:33.5433969Z test_meta.py::TestMetaCUDA::test_meta_inplace_exponential_cuda_float16 PASSED [1.6605s] [ 72%] 2025-12-04T14:02:33.5434077Z test_meta.py::TestMetaCUDA::test_meta_inplace_exponential_cuda_float32 PASSED [0.0062s] [ 72%] 2025-12-04T14:02:33.5434215Z test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5434352Z test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5434499Z test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float8_e4m3fn SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5434649Z test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float8_e5m2fnuz SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5434784Z test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5434941Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5435083Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5435223Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5435371Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5435512Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5435656Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5435802Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5435953Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5436093Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T14:02:33.5436234Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5436381Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5436520Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5436668Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 
2025-12-04T14:02:33.5436814Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5436956Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5437098Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5437248Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5437399Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5437547Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5437694Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5437833Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5437979Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5438125Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5438274Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5438417Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5438556Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_int32 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5438704Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5438855Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5439006Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5439170Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5439323Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5439467Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5439612Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5439764Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5439904Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5440044Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5440238Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5440384Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5440527Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5440667Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5440808Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5440956Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5441105Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5441251Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5441393Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5441547Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5441695Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5441846Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5441996Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5442139Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5442283Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_uint8 SKIPPED [0.0009s] (No inplace variable 
for this op) [ 73%] 2025-12-04T14:02:33.5442427Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5442583Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5442725Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5442868Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5443030Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5443172Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5443322Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5443473Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5443631Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5443773Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5443926Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5444076Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5444230Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_uint8 
SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5444379Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5444525Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5444670Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5444811Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5444956Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5445098Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5445242Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5445381Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5445530Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5445679Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5445821Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5445965Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5446068Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_complex128 PASSED [1.6797s] [ 73%] 2025-12-04T14:02:33.5446167Z test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_float16 PASSED [0.0052s] [ 73%] 2025-12-04T14:02:33.5446313Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_bfloat16 SKIPPED [0.0012s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5446473Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5446621Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5446770Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5446925Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5447069Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5447213Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5447352Z test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5447503Z test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5447638Z test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5447781Z test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5447925Z test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_float64 
SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5448086Z test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5448232Z test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5448378Z test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5448519Z test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5448661Z test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5448798Z test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5448943Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5449081Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5449217Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5449356Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5449466Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_bfloat16 XFAIL [0.0040s] [ 73%] 2025-12-04T14:02:33.5449572Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_float64 PASSED [1.6881s] [ 73%] 2025-12-04T14:02:33.5449672Z test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_float32 PASSED [0.0029s] [ 73%] 2025-12-04T14:02:33.5449768Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_int16 PASSED [1.6617s] [ 73%] 2025-12-04T14:02:33.5449863Z test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_uint8 PASSED [0.0043s] [ 73%] 2025-12-04T14:02:33.5449974Z test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_bfloat16 PASSED [0.0126s] [ 73%] 2025-12-04T14:02:33.5450082Z test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_float16 PASSED [0.0116s] [ 73%] 2025-12-04T14:02:33.5450239Z test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_float32 PASSED [0.0115s] [ 73%] 2025-12-04T14:02:33.5450344Z test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_int8 PASSED [0.0077s] [ 73%] 2025-12-04T14:02:33.5450485Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5450637Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5450775Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5450913Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5451051Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5451189Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5451298Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_float64 PASSED [1.6768s] [ 73%] 2025-12-04T14:02:33.5451395Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_int16 PASSED [0.0082s] [ 73%] 2025-12-04T14:02:33.5451489Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_int64 PASSED [0.0061s] [ 73%] 2025-12-04T14:02:33.5451584Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_int8 PASSED [0.0059s] [ 73%] 2025-12-04T14:02:33.5451689Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_uint8 PASSED [0.0059s] [ 73%] 2025-12-04T14:02:33.5451836Z test_meta.py::TestMetaCUDA::test_meta_inplace_frexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5451983Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5452129Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5452269Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5452418Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5452559Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5452700Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5452839Z test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5452975Z test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5453117Z test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5453209Z test_meta.py::TestMetaCUDA::test_meta_inplace_gcd_cuda_int8 PASSED [0.0054s] [ 73%] 2025-12-04T14:02:33.5453301Z test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_bool PASSED [0.0053s] [ 73%] 2025-12-04T14:02:33.5453397Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_float32 PASSED [0.0054s] [ 73%] 2025-12-04T14:02:33.5453490Z test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_int32 PASSED [0.0053s] [ 73%] 2025-12-04T14:02:33.5453581Z test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_int64 PASSED [0.0053s] [ 73%] 2025-12-04T14:02:33.5453673Z test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_uint8 PASSED [0.0052s] [ 73%] 2025-12-04T14:02:33.5453772Z test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_int16 PASSED [1.6446s] [ 73%] 2025-12-04T14:02:33.5453890Z test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_int64 PASSED [0.0062s] [ 73%] 2025-12-04T14:02:33.5453989Z test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_int8 PASSED [0.0045s] [ 73%] 2025-12-04T14:02:33.5454141Z test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5454298Z test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5454442Z test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5454597Z test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T14:02:33.5454724Z test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_3d_cuda_float16 SKIPPED [0.0001s] (Skipped!) 
[ 74%] 2025-12-04T14:02:33.5454824Z test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_bfloat16 PASSED [1.6671s] [ 74%] 2025-12-04T14:02:33.5454925Z test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_bool PASSED [0.0079s] [ 74%] 2025-12-04T14:02:33.5455019Z test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_float32 PASSED [0.0058s] [ 74%] 2025-12-04T14:02:33.5455111Z test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_float64 PASSED [0.0055s] [ 74%] 2025-12-04T14:02:33.5455203Z test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_int16 PASSED [0.0054s] [ 74%] 2025-12-04T14:02:33.5455340Z test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5455493Z test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5455641Z test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5455789Z test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5455889Z test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_int16 PASSED [0.0073s] [ 74%] 2025-12-04T14:02:33.5455990Z test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_int8 PASSED [0.0070s] [ 74%] 2025-12-04T14:02:33.5456130Z test_meta.py::TestMetaCUDA::test_meta_inplace_histc_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5456270Z test_meta.py::TestMetaCUDA::test_meta_inplace_histc_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5456418Z test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5456559Z test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_float64 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5456702Z test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5456845Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5456983Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5457131Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5457276Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5457415Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5457516Z test_meta.py::TestMetaCUDA::test_meta_inplace_hypot_cuda_float32 PASSED [0.0055s] [ 74%] 2025-12-04T14:02:33.5457621Z test_meta.py::TestMetaCUDA::test_meta_inplace_hypot_cuda_float64 PASSED [0.0054s] [ 74%] 2025-12-04T14:02:33.5457721Z test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_bfloat16 PASSED [1.6924s] [ 74%] 2025-12-04T14:02:33.5457813Z test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_float16 PASSED [0.0049s] [ 74%] 2025-12-04T14:02:33.5458004Z test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 74%] 2025-12-04T14:02:33.5458203Z test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_uint8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 74%] 2025-12-04T14:02:33.5458303Z test_meta.py::TestMetaCUDA::test_meta_inplace_igamma_cuda_float32 PASSED [0.0065s] [ 74%] 2025-12-04T14:02:33.5458403Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_igamma_cuda_float64 PASSED [0.0055s] [ 74%] 2025-12-04T14:02:33.5458502Z test_meta.py::TestMetaCUDA::test_meta_inplace_igammac_cuda_float64 PASSED [0.0055s] [ 74%] 2025-12-04T14:02:33.5458609Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_complex32 PASSED [0.0078s] [ 74%] 2025-12-04T14:02:33.5458720Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_float16 PASSED [0.0074s] [ 74%] 2025-12-04T14:02:33.5458818Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_int8 PASSED [0.0073s] [ 74%] 2025-12-04T14:02:33.5458917Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_bool PASSED [1.6473s] [ 74%] 2025-12-04T14:02:33.5459024Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_complex32 PASSED [0.0058s] [ 74%] 2025-12-04T14:02:33.5459138Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_float16 PASSED [0.0041s] [ 74%] 2025-12-04T14:02:33.5459245Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_float32 PASSED [0.0038s] [ 74%] 2025-12-04T14:02:33.5459343Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_int8 PASSED [1.6711s] [ 74%] 2025-12-04T14:02:33.5459443Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_bool PASSED [0.0071s] [ 74%] 2025-12-04T14:02:33.5459551Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_complex32 PASSED [1.6689s] [ 74%] 2025-12-04T14:02:33.5459656Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_float16 PASSED [0.0073s] [ 74%] 2025-12-04T14:02:33.5459760Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_float64 PASSED [1.6720s] [ 74%] 2025-12-04T14:02:33.5459861Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_int32 PASSED [0.0072s] [ 74%] 2025-12-04T14:02:33.5459961Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_int8 PASSED [1.6605s] [ 74%] 2025-12-04T14:02:33.5460062Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_uint8 PASSED [0.0073s] [ 74%] 2025-12-04T14:02:33.5460211Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_bfloat16 PASSED [1.6642s] [ 74%] 2025-12-04T14:02:33.5460315Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_float64 PASSED [0.0065s] [ 74%] 2025-12-04T14:02:33.5460418Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_int32 PASSED [0.0047s] [ 74%] 2025-12-04T14:02:33.5460534Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_bfloat16 PASSED [0.0061s] [ 74%] 2025-12-04T14:02:33.5460648Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_int16 PASSED [0.0060s] [ 74%] 2025-12-04T14:02:33.5460761Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_int32 PASSED [0.0059s] [ 74%] 2025-12-04T14:02:33.5460872Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_int8 PASSED [0.0059s] [ 74%] 2025-12-04T14:02:33.5460980Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_int8 PASSED [0.0059s] [ 74%] 2025-12-04T14:02:33.5461091Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_uint8 PASSED [0.0058s] [ 74%] 2025-12-04T14:02:33.5461203Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_float64 PASSED [0.0062s] [ 74%] 2025-12-04T14:02:33.5461327Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_int8 PASSED [0.0061s] [ 74%] 2025-12-04T14:02:33.5461436Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_uint8 PASSED [0.0062s] [ 74%] 2025-12-04T14:02:33.5461593Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5461759Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5461908Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5462056Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5462203Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5462360Z test_meta.py::TestMetaCUDA::test_meta_inplace_inner_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5462502Z test_meta.py::TestMetaCUDA::test_meta_inplace_inner_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5462644Z test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5462786Z test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5463363Z test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5463498Z test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5463641Z test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5463788Z test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5463930Z test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5464078Z test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 
2025-12-04T14:02:33.5464219Z test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5464362Z test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5464500Z test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5464642Z test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5464778Z test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5464916Z test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5465051Z test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5465190Z test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5465327Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5465475Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5465624Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5465768Z test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_bfloat16 SKIPPED [0.0008s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5465912Z test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 
2025-12-04T14:02:33.5466060Z test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5466204Z test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5466346Z test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5466489Z test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5466641Z test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5466791Z test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5466930Z test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5467091Z test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5467233Z test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5467371Z test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5467522Z test_meta.py::TestMetaCUDA::test_meta_inplace_istft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5467667Z test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5467807Z test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 
2025-12-04T14:02:33.5467941Z test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5468080Z test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5468253Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5468431Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5468606Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5468778Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5468949Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5469117Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5469301Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5469484Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5469662Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5469834Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5469997Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5470204Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5470366Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5470526Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5470696Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5470849Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5471002Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5471151Z test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5471296Z test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5471441Z test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5471537Z test_meta.py::TestMetaCUDA::test_meta_inplace_lcm_cuda_int32 PASSED [0.0070s] [ 74%] 2025-12-04T14:02:33.5471737Z test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_complex64 SKIPPED [0.0009s] (Op promotes to 
float, which is impossible for inplace with non-float input) [ 74%] 2025-12-04T14:02:33.5471838Z test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_float64 PASSED [1.6799s] [ 74%] 2025-12-04T14:02:33.5472031Z test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_uint8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 74%] 2025-12-04T14:02:33.5472126Z test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_float16 PASSED [1.6793s] [ 74%] 2025-12-04T14:02:33.5472221Z test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_float64 PASSED [0.0076s] [ 74%] 2025-12-04T14:02:33.5472313Z test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_uint8 PASSED [0.0057s] [ 74%] 2025-12-04T14:02:33.5472410Z test_meta.py::TestMetaCUDA::test_meta_inplace_lerp_cuda_bfloat16 PASSED [0.0073s] [ 74%] 2025-12-04T14:02:33.5472508Z test_meta.py::TestMetaCUDA::test_meta_inplace_lerp_cuda_float16 PASSED [0.0071s] [ 74%] 2025-12-04T14:02:33.5472602Z test_meta.py::TestMetaCUDA::test_meta_inplace_lerp_cuda_float32 PASSED [0.0071s] [ 74%] 2025-12-04T14:02:33.5472796Z test_meta.py::TestMetaCUDA::test_meta_inplace_lgamma_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 74%] 2025-12-04T14:02:33.5472959Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cholesky_ex_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 74%] 2025-12-04T14:02:33.5473119Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cholesky_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5473273Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cond_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5473438Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cond_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5473593Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5473744Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5473905Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5474054Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5474207Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5474360Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_det_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5474524Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_det_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5474682Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5474838Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5474998Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5475148Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5475298Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5475450Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigh_cuda_complex64 
SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5475599Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigh_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5475756Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvals_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5475910Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvals_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5476069Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvalsh_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5476226Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvalsh_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5476444Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_householder_product_cuda_complex128 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 75%] 2025-12-04T14:02:33.5476655Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_householder_product_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 75%] 2025-12-04T14:02:33.5476862Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_householder_product_cuda_float64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 75%] 2025-12-04T14:02:33.5477017Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5477170Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_ex_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5477318Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 
2025-12-04T14:02:33.5477491Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5477652Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5477818Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5477982Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_ex_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5478141Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5478299Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_ex_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5478500Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_solve_cuda_complex64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 75%] 2025-12-04T14:02:33.5478662Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lstsq_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5478815Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lstsq_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5478994Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lstsq_grad_oriented_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5479140Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5479287Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_cuda_float64 SKIPPED [0.0009s] (No inplace variable for 
this op) [ 75%] 2025-12-04T14:02:33.5479441Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5479596Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5479756Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_ex_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5479909Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_solve_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5480072Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5480265Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_norm_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5480425Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_power_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5480586Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_power_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5480744Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5480921Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_hermitian_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5481093Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_hermitian_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5481250Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_multi_dot_cuda_complex64 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5481414Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5481597Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_subgradients_at_zero_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5481776Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5481966Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5482136Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_hermitian_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5482300Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_hermitian_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5482499Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_singular_cuda_complex128 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 75%] 2025-12-04T14:02:33.5482714Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_singular_cuda_complex64 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 75%] 2025-12-04T14:02:33.5482906Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_singular_cuda_float32 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 75%] 2025-12-04T14:02:33.5483071Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_qr_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5483218Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_qr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 
2025-12-04T14:02:33.5483372Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_slogdet_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5483525Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_slogdet_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5483679Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5483829Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5483986Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_ex_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5484154Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_triangular_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5484317Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_triangular_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5484468Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svd_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5484617Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svd_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5484775Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svdvals_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5484929Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svdvals_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5485087Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorinv_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5485250Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorsolve_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5485422Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorsolve_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5485573Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vander_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5485721Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vander_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5485884Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vecdot_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5486036Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vecdot_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5486185Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5486336Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5486492Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5486635Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5486800Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5486972Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5487070Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_float16 PASSED [1.6902s] [ 75%] 2025-12-04T14:02:33.5487167Z test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_float64 PASSED [0.0054s] [ 75%] 2025-12-04T14:02:33.5487361Z test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5487554Z test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5487741Z test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5487939Z test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5488132Z test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5488231Z test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_float32 PASSED [1.6482s] [ 75%] 2025-12-04T14:02:33.5488328Z test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_bfloat16 PASSED [0.0054s] [ 75%] 2025-12-04T14:02:33.5488426Z test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_float32 PASSED [0.0037s] [ 75%] 2025-12-04T14:02:33.5488521Z test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_float64 PASSED [1.6581s] [ 75%] 2025-12-04T14:02:33.5488708Z test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_int16 SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5488898Z test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_uint8 SKIPPED [0.0014s] (Op promotes to float, which is impossible 
for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5489079Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_bool SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5489273Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5489455Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5489565Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_normal_cuda_bfloat16 PASSED [0.0054s] [ 75%] 2025-12-04T14:02:33.5489679Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_normal_cuda_float32 PASSED [0.0043s] [ 75%] 2025-12-04T14:02:33.5489785Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_normal_cuda_float64 PASSED [0.0043s] [ 75%] 2025-12-04T14:02:33.5489951Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5490146Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5490304Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5490474Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5490633Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5490790Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 
2025-12-04T14:02:33.5490955Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp2_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5491104Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5491253Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5491401Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp2_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5491554Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5491705Z test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 75%] 2025-12-04T14:02:33.5491808Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_bool PASSED [1.6896s] [ 75%] 2025-12-04T14:02:33.5491916Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_float64 PASSED [0.0069s] [ 75%] 2025-12-04T14:02:33.5492025Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_complex64 PASSED [0.0036s] [ 75%] 2025-12-04T14:02:33.5492131Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_float32 PASSED [1.6461s] [ 75%] 2025-12-04T14:02:33.5492234Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_int32 PASSED [0.0051s] [ 75%] 2025-12-04T14:02:33.5492337Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_int8 PASSED [0.0035s] [ 75%] 2025-12-04T14:02:33.5492436Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_bool PASSED [0.0050s] [ 75%] 2025-12-04T14:02:33.5492545Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_complex128 PASSED [0.0050s] [ 75%] 2025-12-04T14:02:33.5492648Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_float64 PASSED [0.0049s] [ 75%] 2025-12-04T14:02:33.5492750Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_int16 PASSED [0.0053s] [ 75%] 2025-12-04T14:02:33.5492850Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_int32 PASSED [0.0050s] [ 75%] 2025-12-04T14:02:33.5492949Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_int64 PASSED [0.0049s] [ 75%] 2025-12-04T14:02:33.5493068Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_bfloat16 PASSED [0.0050s] [ 75%] 2025-12-04T14:02:33.5493174Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_float32 PASSED [0.0049s] [ 75%] 2025-12-04T14:02:33.5493275Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_int16 PASSED [0.0049s] [ 75%] 2025-12-04T14:02:33.5493387Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_int8 PASSED [0.0049s] [ 75%] 2025-12-04T14:02:33.5493575Z test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 75%] 2025-12-04T14:02:33.5493672Z test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_float16 PASSED [0.0057s] [ 76%] 2025-12-04T14:02:33.5493863Z test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 76%] 2025-12-04T14:02:33.5494054Z test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 76%] 2025-12-04T14:02:33.5494254Z test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 76%] 2025-12-04T14:02:33.5494403Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 
2025-12-04T14:02:33.5494550Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5494702Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5494845Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5495014Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5495188Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5495354Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5495521Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5495685Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5495847Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5496010Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5496161Z test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5496314Z test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 
2025-12-04T14:02:33.5496463Z test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5496611Z test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5496752Z test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5496897Z test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5497048Z test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_complex128 SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5497191Z test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5497345Z test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5497483Z test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5497620Z test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5497754Z test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5497847Z test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_bool PASSED [1.6621s] [ 76%] 2025-12-04T14:02:33.5497951Z test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_float32 PASSED [0.0075s] [ 76%] 2025-12-04T14:02:33.5498044Z test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_int16 PASSED [0.0056s] [ 76%] 2025-12-04T14:02:33.5498133Z test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_int8 PASSED [0.0054s] [ 76%] 
2025-12-04T14:02:33.5498226Z test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_uint8 PASSED [0.0054s] [ 76%] 2025-12-04T14:02:33.5498370Z test_meta.py::TestMetaCUDA::test_meta_inplace_lu_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5498533Z test_meta.py::TestMetaCUDA::test_meta_inplace_lu_unpack_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5498682Z test_meta.py::TestMetaCUDA::test_meta_inplace_lu_unpack_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5498828Z test_meta.py::TestMetaCUDA::test_meta_inplace_lu_unpack_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5498965Z test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5499099Z test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5499235Z test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5499373Z test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_bfloat16 SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5499505Z test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5499646Z test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5499786Z test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5499922Z test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5500053Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5500235Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5500385Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5500531Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5500679Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5500838Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5500983Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5501142Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5501289Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5501442Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5501589Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5501745Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5501913Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_complex64 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5502065Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5502218Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5502380Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5502530Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5502687Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5502838Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5502986Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5503097Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_bfloat16 PASSED [1.6486s] [ 76%] 2025-12-04T14:02:33.5503199Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_bool PASSED [0.0082s] [ 76%] 2025-12-04T14:02:33.5503306Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_float32 PASSED [1.6485s] [ 76%] 2025-12-04T14:02:33.5503407Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_int32 PASSED [0.0083s] [ 76%] 2025-12-04T14:02:33.5503510Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_int64 PASSED [1.6508s] [ 76%] 2025-12-04T14:02:33.5503610Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_int8 PASSED [0.0083s] [ 76%] 2025-12-04T14:02:33.5503773Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_masked_log_softmax_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5503928Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_log_softmax_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5504089Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logaddexp_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5504254Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5504409Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5504579Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5504733Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5504883Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5505041Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5505190Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_norm_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5505344Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_normalize_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5505491Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5505598Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_bool PASSED [1.6648s] [ 76%] 2025-12-04T14:02:33.5505718Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_float32 PASSED [0.0059s] [ 76%] 2025-12-04T14:02:33.5505825Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_int16 PASSED [0.0042s] [ 76%] 2025-12-04T14:02:33.5505932Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_int32 PASSED [0.0040s] [ 76%] 2025-12-04T14:02:33.5506086Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5506242Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5506396Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5506547Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5506695Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5506842Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5506998Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5507150Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmin_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5507301Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5507448Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5507590Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5507740Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5507884Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5508029Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5508169Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5508318Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5508477Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5508628Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5508773Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5508929Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5509075Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5509220Z test_meta.py::TestMetaCUDA::test_meta_inplace_matmul_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable 
for this op) [ 76%] 2025-12-04T14:02:33.5509370Z test_meta.py::TestMetaCUDA::test_meta_inplace_matmul_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5509524Z test_meta.py::TestMetaCUDA::test_meta_inplace_matmul_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5509666Z test_meta.py::TestMetaCUDA::test_meta_inplace_matmul_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5509812Z test_meta.py::TestMetaCUDA::test_meta_inplace_matrix_exp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5509967Z test_meta.py::TestMetaCUDA::test_meta_inplace_matrix_exp_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5510133Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5510275Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_bool SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5510419Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5510563Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5510705Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5510883Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5511059Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5511218Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5511378Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5511532Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5511692Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5511849Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5512005Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5512162Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5512328Z test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5512469Z test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5512610Z test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5512764Z test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5512904Z test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5513042Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5513179Z test_meta.py::TestMetaCUDA::test_meta_inplace_mean_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 76%] 2025-12-04T14:02:33.5513316Z test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5513498Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5513666Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5513830Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5514006Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5514165Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5514325Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5514499Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5514665Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5514830Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 
2025-12-04T14:02:33.5514993Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5515142Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5515285Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5515429Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5515584Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5515743Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5515898Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5516052Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5516215Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5516377Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5516533Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5516689Z test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5516829Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5516972Z test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5517115Z test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5517259Z test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5517409Z test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5517549Z test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5517690Z test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5517837Z test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5517973Z test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5518109Z test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5518244Z test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5518382Z test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5518531Z test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5518675Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5518819Z test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5518956Z test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5519099Z test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5519236Z test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5519370Z test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5519471Z test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_complex128 PASSED [1.6801s] [ 77%] 2025-12-04T14:02:33.5519565Z test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_int32 PASSED [0.0066s] [ 77%] 2025-12-04T14:02:33.5519657Z test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_int8 PASSED [0.0052s] [ 77%] 2025-12-04T14:02:33.5519809Z test_meta.py::TestMetaCUDA::test_meta_inplace_multinomial_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5519968Z test_meta.py::TestMetaCUDA::test_meta_inplace_multinomial_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5520146Z test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5520288Z test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5520442Z test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this 
op) [ 77%] 2025-12-04T14:02:33.5520577Z test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5520789Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 77%] 2025-12-04T14:02:33.5520998Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 77%] 2025-12-04T14:02:33.5521216Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 77%] 2025-12-04T14:02:33.5521422Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 77%] 2025-12-04T14:02:33.5521628Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 77%] 2025-12-04T14:02:33.5521762Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_bfloat16 PASSED [0.0103s] [ 77%] 2025-12-04T14:02:33.5521882Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float16 PASSED [1.6574s] [ 77%] 2025-12-04T14:02:33.5522002Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float32 PASSED [0.0126s] [ 77%] 2025-12-04T14:02:33.5522212Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 77%] 2025-12-04T14:02:33.5522419Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for 
inplace with non-float input) [ 77%] 2025-12-04T14:02:33.5522523Z test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_bool PASSED [1.6985s] [ 77%] 2025-12-04T14:02:33.5522630Z test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_float32 PASSED [0.0064s] [ 77%] 2025-12-04T14:02:33.5522731Z test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_int16 PASSED [0.0036s] [ 77%] 2025-12-04T14:02:33.5522881Z test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5523026Z test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5523168Z test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5523311Z test_meta.py::TestMetaCUDA::test_meta_inplace_nanmedian_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5523462Z test_meta.py::TestMetaCUDA::test_meta_inplace_nanquantile_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5523612Z test_meta.py::TestMetaCUDA::test_meta_inplace_nanquantile_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5523749Z test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5523888Z test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5524039Z test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5524187Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5524344Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5524491Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5524634Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5524773Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5524911Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5525078Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_batch_norm_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5525233Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_batch_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5525389Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_batch_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5525560Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_batch_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5525727Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_dropout_backward_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5525883Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_layer_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5526038Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_layer_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5526130Z test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_bool 
PASSED [1.6830s] [ 77%] 2025-12-04T14:02:33.5526229Z test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_complex64 PASSED [0.0076s] [ 77%] 2025-12-04T14:02:33.5526319Z test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_int16 PASSED [0.0056s] [ 77%] 2025-12-04T14:02:33.5526417Z test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_bfloat16 PASSED [1.6813s] [ 77%] 2025-12-04T14:02:33.5526509Z test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_int16 PASSED [0.0042s] [ 77%] 2025-12-04T14:02:33.5526601Z test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_int32 PASSED [1.7069s] [ 77%] 2025-12-04T14:02:33.5526690Z test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_int64 PASSED [0.0043s] [ 77%] 2025-12-04T14:02:33.5526836Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5526979Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_int32 SKIPPED [0.0012s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5527122Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5527275Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5527431Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5527585Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5527746Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5527899Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 
2025-12-04T14:02:33.5528049Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5528199Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5528348Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5528496Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5528643Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5528798Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5528937Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5529076Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5529214Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5529369Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5529516Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5529657Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5529765Z test_meta.py::TestMetaCUDA::test_meta_inplace_nextafter_cuda_bfloat16 PASSED [0.0058s] [ 77%] 
2025-12-04T14:02:33.5529867Z test_meta.py::TestMetaCUDA::test_meta_inplace_nextafter_cuda_float16 PASSED [0.0054s] [ 77%] 2025-12-04T14:02:33.5530046Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5530258Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5530437Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5530614Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5530793Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5530960Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5531126Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5531290Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5531453Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5531648Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5531813Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_bilinear_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5531991Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_binary_cross_entropy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5532119Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_celu_cuda_float64 PASSED [1.6570s] [ 77%] 2025-12-04T14:02:33.5532288Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5532460Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_float16 SKIPPED [0.0014s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5532633Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5532806Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_bfloat16 SKIPPED [0.0012s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5532969Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5533131Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5533307Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 77%] 2025-12-04T14:02:33.5533470Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5533632Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 
2025-12-04T14:02:33.5533793Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5533953Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5534115Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5534275Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5534451Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5534628Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5534801Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5534975Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5535151Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose2d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5535329Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose2d_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5535502Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5535690Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose3d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5535864Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose3d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5536045Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5536229Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5536408Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5536585Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5536760Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_similarity_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5536947Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_similarity_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5537118Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cross_entropy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5537287Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_ctc_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5537410Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout2d_cuda_float16 PASSED [0.0114s] [ 78%] 2025-12-04T14:02:33.5537531Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout3d_cuda_float32 PASSED [0.0088s] [ 78%] 2025-12-04T14:02:33.5537645Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_elu_cuda_float16 PASSED [0.0045s] [ 78%] 2025-12-04T14:02:33.5537758Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_elu_cuda_float32 PASSED [0.0042s] [ 78%] 2025-12-04T14:02:33.5537868Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_elu_cuda_float64 PASSED [0.0041s] [ 78%] 2025-12-04T14:02:33.5538033Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5538186Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16 PASSED [0.0069s] [ 78%] 2025-12-04T14:02:33.5538337Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_bool PASSED [0.0080s] [ 78%] 2025-12-04T14:02:33.5538494Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex128 PASSED [0.0080s] [ 78%] 2025-12-04T14:02:33.5538649Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [1.6607s] [ 78%] 2025-12-04T14:02:33.5538802Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float64 PASSED [0.0102s] [ 78%] 2025-12-04T14:02:33.5538953Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_uint8 PASSED [0.0084s] [ 78%] 2025-12-04T14:02:33.5539135Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool2d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5539318Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool3d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 
78%] 2025-12-04T14:02:33.5539497Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5539690Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5539853Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gelu_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5540010Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gelu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5540215Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_glu_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5540383Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_grid_sample_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5540549Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_grid_sample_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5540714Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_group_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5540901Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_group_norm_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5541027Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardsigmoid_cuda_float16 PASSED [0.0053s] [ 78%] 2025-12-04T14:02:33.5541153Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardsigmoid_cuda_float32 PASSED [0.0048s] [ 78%] 2025-12-04T14:02:33.5541332Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardswish_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) 
[ 78%] 2025-12-04T14:02:33.5541497Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardswish_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5541661Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardswish_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5541825Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5541984Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5542164Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5542330Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_huber_loss_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5542492Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_huber_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5542655Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_huber_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5542828Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_area_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5543009Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bicubic_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5543188Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bicubic_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5543364Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_linear_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5543539Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_linear_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5543730Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_nearest_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5543914Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_trilinear_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5544108Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_trilinear_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5544270Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_kl_div_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5544431Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5544597Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_layer_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5544721Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_leaky_relu_cuda_bfloat16 PASSED [0.0076s] [ 78%] 2025-12-04T14:02:33.5544852Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_leaky_relu_cuda_float16 PASSED [0.0074s] [ 78%] 2025-12-04T14:02:33.5545014Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_linear_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5545180Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_linear_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 
78%] 2025-12-04T14:02:33.5545353Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_linear_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5545529Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_local_response_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5545706Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_local_response_norm_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5545872Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_logsigmoid_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5546039Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_logsigmoid_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5546204Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_logsigmoid_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5546384Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5546556Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5546725Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5546891Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5547058Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool1d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5547227Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5547395Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool2d_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5547569Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool2d_grad_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5547752Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5547920Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5548101Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5548216Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mish_cuda_float16 PASSED [1.6909s] [ 78%] 2025-12-04T14:02:33.5548380Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mse_loss_cuda_bfloat16 SKIPPED [0.0016s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5548574Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_head_attention_forward_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5548752Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_margin_loss_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5548934Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_margin_loss_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5549120Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_margin_loss_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5549313Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5549494Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5549655Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_nll_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5549822Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_normalize_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5549985Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5550183Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5550357Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5550522Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5550689Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5550853Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5551030Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5551202Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5551367Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5551532Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5551710Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5551879Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5552048Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5552224Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5552388Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5552551Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5552711Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5552899Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_complex128 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5553086Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5553268Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5553458Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5553634Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5553815Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5553986Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5554146Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pdist_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5554321Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5554495Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5554670Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_complex128 SKIPPED [0.0008s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5554845Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_float64 SKIPPED [0.0009s] (No inplace variable 
for this op) [ 78%] 2025-12-04T14:02:33.5555015Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5555184Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5555347Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_prelu_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5555507Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_prelu_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5555666Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5555836Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 78%] 2025-12-04T14:02:33.5555991Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5556167Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rms_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5556327Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rms_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5556442Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rrelu_cuda_float16 PASSED [0.0062s] [ 79%] 2025-12-04T14:02:33.5556555Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rrelu_cuda_float64 PASSED [0.0053s] [ 79%] 2025-12-04T14:02:33.5556744Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 
2025-12-04T14:02:33.5556868Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_selu_cuda_bfloat16 PASSED [0.0045s] [ 79%] 2025-12-04T14:02:33.5556979Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_selu_cuda_float32 PASSED [0.0041s] [ 79%] 2025-12-04T14:02:33.5557106Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_complex_cuda_complex64 PASSED [1.6685s] [ 79%] 2025-12-04T14:02:33.5557219Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_cuda_bfloat16 PASSED [0.0052s] [ 79%] 2025-12-04T14:02:33.5557338Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_cuda_float64 PASSED [0.0036s] [ 79%] 2025-12-04T14:02:33.5557511Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_smooth_l1_loss_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5557687Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_soft_margin_loss_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5557859Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_soft_margin_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5558029Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_soft_margin_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5558194Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5558373Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5558543Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5558713Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5558884Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5559052Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softplus_cuda_bfloat16 SKIPPED [0.0008s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5559220Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softshrink_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5559386Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softshrink_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5559549Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5559717Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5559887Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5560058Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5560260Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5560418Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5560587Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_bfloat16 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5560758Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5560935Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5561059Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_bfloat16 PASSED [0.0050s] [ 79%] 2025-12-04T14:02:33.5561181Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_float64 PASSED [0.0046s] [ 79%] 2025-12-04T14:02:33.5561309Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_int32 PASSED [0.0046s] [ 79%] 2025-12-04T14:02:33.5561488Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5561672Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5561849Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5562023Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5562222Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5562418Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5562574Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5562739Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5562900Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5563076Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_nearest_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5563252Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_nearest_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5563426Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_nearest_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5563575Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5563737Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_complex32 SKIPPED [0.0008s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5563887Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5564031Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5564185Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5564324Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_complex128 SKIPPED [0.0005s] (Only runs on cpu) [ 79%] 2025-12-04T14:02:33.5564458Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_int16 SKIPPED [0.0005s] (Only runs on cpu) [ 79%] 2025-12-04T14:02:33.5564589Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_uint8 SKIPPED [0.0005s] (Only runs on cpu) [ 79%] 2025-12-04T14:02:33.5564747Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5564887Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5565032Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_bfloat16 SKIPPED [0.0008s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5565181Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5565335Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5565479Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5565623Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_inf_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5565770Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_inf_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5565919Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_nuc_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5566063Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_nuc_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5566205Z test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 
2025-12-04T14:02:33.5566348Z test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5566491Z test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5566606Z test_meta.py::TestMetaCUDA::test_meta_inplace_normal_in_place_cuda_complex128 PASSED [0.0034s] [ 79%] 2025-12-04T14:02:33.5566719Z test_meta.py::TestMetaCUDA::test_meta_inplace_normal_in_place_cuda_float16 PASSED [1.6999s] [ 79%] 2025-12-04T14:02:33.5566855Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5566999Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_complex64 SKIPPED [0.0013s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5567138Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5567272Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5567417Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5567577Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5567726Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5567870Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5568019Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5568162Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5568298Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5568439Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5568589Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5568729Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5568868Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5569025Z test_meta.py::TestMetaCUDA::test_meta_inplace_pca_lowrank_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5569179Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5569331Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5569481Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5569627Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5569774Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5569921Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 
2025-12-04T14:02:33.5570065Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5570246Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5570388Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5570534Z test_meta.py::TestMetaCUDA::test_meta_inplace_pinverse_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5570674Z test_meta.py::TestMetaCUDA::test_meta_inplace_polar_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5570798Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_float16 PASSED [0.0072s] [ 79%] 2025-12-04T14:02:33.5570920Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_float64 PASSED [0.0060s] [ 79%] 2025-12-04T14:02:33.5571136Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 79%] 2025-12-04T14:02:33.5571347Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 79%] 2025-12-04T14:02:33.5571576Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 79%] 2025-12-04T14:02:33.5571699Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_float64 PASSED [0.0060s] [ 79%] 2025-12-04T14:02:33.5571922Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace 
with non-float input) [ 79%] 2025-12-04T14:02:33.5572043Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0060s] [ 79%] 2025-12-04T14:02:33.5572253Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 79%] 2025-12-04T14:02:33.5572376Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_bfloat16 PASSED [1.6975s] [ 79%] 2025-12-04T14:02:33.5572596Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 79%] 2025-12-04T14:02:33.5572717Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_float16 PASSED [0.0074s] [ 79%] 2025-12-04T14:02:33.5572926Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 79%] 2025-12-04T14:02:33.5573149Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 79%] 2025-12-04T14:02:33.5573358Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 79%] 2025-12-04T14:02:33.5573510Z test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5573655Z test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5573755Z test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_complex64 PASSED [0.0057s] [ 79%] 2025-12-04T14:02:33.5573850Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_int32 PASSED [0.0054s] [ 79%] 2025-12-04T14:02:33.5573941Z test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_int64 PASSED [0.0053s] [ 79%] 2025-12-04T14:02:33.5574077Z test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5574221Z test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5574365Z test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5574504Z test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5574640Z test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5574775Z test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5574911Z test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5575009Z test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_bfloat16 PASSED [0.0143s] [ 79%] 2025-12-04T14:02:33.5575109Z test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_complex128 PASSED [0.0139s] [ 79%] 2025-12-04T14:02:33.5575216Z test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_complex64 PASSED [0.0140s] [ 79%] 2025-12-04T14:02:33.5575312Z test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_float32 PASSED [0.0139s] [ 79%] 2025-12-04T14:02:33.5575449Z test_meta.py::TestMetaCUDA::test_meta_inplace_qr_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5575595Z test_meta.py::TestMetaCUDA::test_meta_inplace_quantile_cuda_float32 SKIPPED [0.0010s] (No 
inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5575750Z test_meta.py::TestMetaCUDA::test_meta_inplace_quantile_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 79%] 2025-12-04T14:02:33.5575941Z test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 79%] 2025-12-04T14:02:33.5576043Z test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_float16 PASSED [1.6997s] [ 79%] 2025-12-04T14:02:33.5576234Z test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_int64 SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 79%] 2025-12-04T14:02:33.5576388Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5576530Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5576669Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_int8 SKIPPED [0.0012s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5576831Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5576978Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5577122Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5577265Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5577413Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5577555Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5577699Z test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5577839Z test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5577979Z test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5578118Z test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5578255Z test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5578393Z test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5578527Z test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5578670Z test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5578812Z test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5578951Z test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5579092Z test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5579203Z test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_bfloat16 PASSED [1.6867s] [ 80%] 2025-12-04T14:02:33.5579398Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 80%] 2025-12-04T14:02:33.5579620Z test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_complex128 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 80%] 2025-12-04T14:02:33.5579731Z test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_float16 PASSED [0.0048s] [ 80%] 2025-12-04T14:02:33.5579834Z test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_float16 PASSED [0.0063s] [ 80%] 2025-12-04T14:02:33.5579934Z test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_int16 PASSED [0.0059s] [ 80%] 2025-12-04T14:02:33.5580033Z test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_int64 PASSED [0.0058s] [ 80%] 2025-12-04T14:02:33.5580200Z test_meta.py::TestMetaCUDA::test_meta_inplace_renorm_cuda_bfloat16 PASSED [0.0069s] [ 80%] 2025-12-04T14:02:33.5580345Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5580485Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5580626Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5580781Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5580917Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5581073Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5581235Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5581390Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5581545Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5581693Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5581838Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5581979Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5582127Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5582270Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5582413Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5582554Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5582697Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5582798Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_bfloat16 PASSED [0.0033s] [ 80%] 2025-12-04T14:02:33.5582895Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_bool PASSED [1.6984s] [ 80%] 2025-12-04T14:02:33.5583013Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_complex128 PASSED [0.0050s] [ 80%] 2025-12-04T14:02:33.5583117Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_complex64 PASSED [0.0035s] [ 80%] 2025-12-04T14:02:33.5583215Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_float16 PASSED [1.6802s] [ 80%] 2025-12-04T14:02:33.5583323Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_float32 PASSED [0.0049s] [ 80%] 2025-12-04T14:02:33.5583419Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_int16 PASSED [0.0036s] [ 80%] 2025-12-04T14:02:33.5583514Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_uint8 PASSED [1.6937s] [ 80%] 2025-12-04T14:02:33.5583618Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_float64 PASSED [0.0052s] [ 80%] 2025-12-04T14:02:33.5583714Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_int8 PASSED [0.0037s] [ 80%] 2025-12-04T14:02:33.5583862Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5584023Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5584171Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5584315Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5584478Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5584627Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5584781Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_complex32 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5584933Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5585073Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5585216Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5585351Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5585488Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5585631Z test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5585774Z test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5585915Z test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5586052Z test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5586196Z test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5586294Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_float16 PASSED [1.6892s] [ 80%] 2025-12-04T14:02:33.5586389Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_int16 PASSED [0.0042s] [ 80%] 2025-12-04T14:02:33.5586484Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_int32 PASSED [1.6925s] [ 80%] 2025-12-04T14:02:33.5586577Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_int64 PASSED [0.0043s] [ 80%] 2025-12-04T14:02:33.5586700Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_0_cuda_float32 PASSED [0.0040s] [ 80%] 2025-12-04T14:02:33.5586813Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_3_cuda_float32 PASSED [1.6809s] [ 80%] 2025-12-04T14:02:33.5586923Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_3_cuda_float64 PASSED [0.0056s] [ 80%] 2025-12-04T14:02:33.5587050Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_neg_3_cuda_float16 PASSED [0.0040s] [ 80%] 2025-12-04T14:02:33.5587248Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_complex32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 80%] 2025-12-04T14:02:33.5587345Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_float16 PASSED [1.7009s] [ 80%] 2025-12-04T14:02:33.5587440Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_float32 PASSED [0.0054s] [ 80%] 2025-12-04T14:02:33.5587637Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 80%] 2025-12-04T14:02:33.5587789Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5587933Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5588072Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5588221Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5588375Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 
80%] 2025-12-04T14:02:33.5588528Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5588686Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5588842Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5588996Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5589147Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5589298Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5589445Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5589554Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_bfloat16 PASSED [0.0071s] [ 80%] 2025-12-04T14:02:33.5589665Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_complex128 PASSED [0.0066s] [ 80%] 2025-12-04T14:02:33.5589773Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_complex64 PASSED [0.0066s] [ 80%] 2025-12-04T14:02:33.5589880Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_float16 PASSED [0.0065s] [ 80%] 2025-12-04T14:02:33.5591295Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_float32 PASSED [0.0066s] [ 80%] 2025-12-04T14:02:33.5591404Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_int32 PASSED [0.0066s] [ 80%] 2025-12-04T14:02:33.5591509Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_int8 PASSED [0.0065s] [ 
80%] 2025-12-04T14:02:33.5591611Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_uint8 PASSED [0.0065s] [ 80%] 2025-12-04T14:02:33.5591737Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_complex64 PASSED [0.0112s] [ 80%] 2025-12-04T14:02:33.5591838Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_float16 PASSED [0.0148s] [ 80%] 2025-12-04T14:02:33.5591938Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_float32 PASSED [0.0149s] [ 80%] 2025-12-04T14:02:33.5592034Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_int16 PASSED [0.0111s] [ 80%] 2025-12-04T14:02:33.5592171Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_bfloat16 PASSED [0.0129s] [ 80%] 2025-12-04T14:02:33.5592288Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_float16 PASSED [0.0129s] [ 80%] 2025-12-04T14:02:33.5592400Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_int32 PASSED [0.0128s] [ 80%] 2025-12-04T14:02:33.5592516Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_float32 PASSED [0.0129s] [ 80%] 2025-12-04T14:02:33.5592629Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_uint8 PASSED [0.0128s] [ 80%] 2025-12-04T14:02:33.5592763Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_float64 PASSED [0.0137s] [ 80%] 2025-12-04T14:02:33.5592874Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_int32 PASSED [0.0136s] [ 80%] 2025-12-04T14:02:33.5592984Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_int64 PASSED [0.0136s] [ 80%] 2025-12-04T14:02:33.5593099Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_int32 PASSED [0.0128s] [ 80%] 2025-12-04T14:02:33.5593226Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_int64 PASSED [0.0127s] [ 80%] 2025-12-04T14:02:33.5593341Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_int8 PASSED [0.0128s] [ 80%] 2025-12-04T14:02:33.5593456Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_bfloat16 PASSED [0.0128s] [ 80%] 2025-12-04T14:02:33.5593571Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_float32 PASSED [0.0128s] [ 80%] 2025-12-04T14:02:33.5593683Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_uint8 PASSED [0.0127s] [ 80%] 2025-12-04T14:02:33.5593836Z test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5593990Z test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5594129Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5594280Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5594421Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5594566Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5594706Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5594856Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 80%] 2025-12-04T14:02:33.5594956Z test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_float16 PASSED [0.0026s] [ 80%] 2025-12-04T14:02:33.5595053Z test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_float32 PASSED [1.6985s] [ 80%] 2025-12-04T14:02:33.5595148Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_int32 PASSED [0.0043s] [ 80%] 2025-12-04T14:02:33.5595239Z test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_int64 PASSED [1.6958s] [ 80%] 2025-12-04T14:02:33.5595331Z test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_uint8 PASSED [0.0044s] [ 81%] 2025-12-04T14:02:33.5595489Z test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5595636Z test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5595778Z test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5595926Z test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5596065Z test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5596203Z test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5596339Z test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5596545Z test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_complex128 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 81%] 2025-12-04T14:02:33.5596754Z test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 81%] 2025-12-04T14:02:33.5596947Z test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 81%] 
2025-12-04T14:02:33.5597151Z test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 81%] 2025-12-04T14:02:33.5597251Z test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_float16 PASSED [1.6973s] [ 81%] 2025-12-04T14:02:33.5597346Z test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_int32 PASSED [0.0044s] [ 81%] 2025-12-04T14:02:33.5597441Z test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_uint8 PASSED [1.6834s] [ 81%] 2025-12-04T14:02:33.5597608Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_cosine_cuda_float32 SKIPPED [0.0017s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5597780Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_exponential_cuda_float32 SKIPPED [0.0014s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5597951Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_exponential_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5598124Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_general_cosine_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5598298Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_general_hamming_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5598473Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_general_hamming_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5598638Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_hamming_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5598799Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_kaiser_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5598946Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5599091Z test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5599229Z test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5599337Z test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_bfloat16 PASSED [0.0034s] [ 81%] 2025-12-04T14:02:33.5599535Z test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 81%] 2025-12-04T14:02:33.5599633Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_bfloat16 PASSED [0.1907s] [ 81%] 2025-12-04T14:02:33.5599738Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_float64 PASSED [0.0046s] [ 81%] 2025-12-04T14:02:33.5599928Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_int32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 81%] 2025-12-04T14:02:33.5600236Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 81%] 2025-12-04T14:02:33.5600420Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_bool SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 81%] 2025-12-04T14:02:33.5600640Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 81%] 2025-12-04T14:02:33.5600736Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_float64 PASSED [1.6815s] [ 81%] 2025-12-04T14:02:33.5600924Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_int32 SKIPPED [0.0016s] (Op promotes to 
float, which is impossible for inplace with non-float input) [ 81%] 2025-12-04T14:02:33.5601130Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_uint8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 81%] 2025-12-04T14:02:33.5601271Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5601409Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5601561Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5601710Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5601858Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5602005Z test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5602150Z test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5602312Z test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5602475Z test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5602612Z test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5602746Z test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 
2025-12-04T14:02:33.5602886Z test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_mm_reduce_cuda_float16 SKIPPED [0.0005s] (Only runs on cpu) [ 81%] 2025-12-04T14:02:33.5603021Z test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_mm_reduce_cuda_float64 SKIPPED [0.0005s] (Only runs on cpu) [ 81%] 2025-12-04T14:02:33.5603187Z test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_sampled_addmm_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5603347Z test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5603516Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5603672Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5603839Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5603990Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5604141Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5604297Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5604449Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5604610Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5604762Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5604917Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5605076Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5605231Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5605388Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5605540Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5605713Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5605885Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5606058Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5606232Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5606405Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5606577Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this 
op) [ 81%] 2025-12-04T14:02:33.5606751Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5606922Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5607094Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5607247Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5607404Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5607556Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5607705Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5607866Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5608014Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5608182Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5608353Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5608532Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_int16 SKIPPED [0.0010s] (No inplace variable 
for this op) [ 81%] 2025-12-04T14:02:33.5608701Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5608872Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5609031Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5609178Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5609322Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5609472Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5609619Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5609763Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5609906Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5610049Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5610234Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5610381Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5610524Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5610695Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_laguerre_polynomial_l_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5610863Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5611038Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5611207Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5611392Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5611561Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5611727Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5611904Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5612068Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5612234Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5612398Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_bool SKIPPED [0.0009s] (No 
inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5612577Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5612743Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5612907Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5613084Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5613250Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5613416Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5613582Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5613744Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5613910Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5614067Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_bfloat16 SKIPPED [0.0008s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5614213Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5614365Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5614513Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5614660Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5614810Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5614957Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5615140Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5615324Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5615503Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5615678Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5615865Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5616040Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5616212Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_uint8 SKIPPED [0.0009s] (No inplace 
variable for this op) [ 81%] 2025-12-04T14:02:33.5616399Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5616598Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5616781Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 81%] 2025-12-04T14:02:33.5616976Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5617159Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5617345Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5617525Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5617705Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5617885Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5618068Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5618237Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_float64 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5618404Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5618572Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5618738Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5618896Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5619048Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5619198Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5619353Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5619502Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5619645Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5619802Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5619940Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5620088Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 
2025-12-04T14:02:33.5620284Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5620442Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5620607Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5620755Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5620917Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5621095Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5621257Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5621411Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5621566Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5621720Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5621872Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5622059Z test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float 
input) [ 82%] 2025-12-04T14:02:33.5622254Z test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 82%] 2025-12-04T14:02:33.5622449Z test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 82%] 2025-12-04T14:02:33.5622548Z test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_float64 PASSED [1.6942s] [ 82%] 2025-12-04T14:02:33.5622735Z test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_int16 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 82%] 2025-12-04T14:02:33.5622837Z test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_bfloat16 PASSED [1.7242s] [ 82%] 2025-12-04T14:02:33.5622935Z test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_int16 PASSED [0.0056s] [ 82%] 2025-12-04T14:02:33.5623030Z test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_int32 PASSED [0.0039s] [ 82%] 2025-12-04T14:02:33.5623181Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 82%] 2025-12-04T14:02:33.5623297Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_bool PASSED [1.6920s] [ 82%] 2025-12-04T14:02:33.5623399Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_float16 PASSED [0.0065s] [ 82%] 2025-12-04T14:02:33.5623497Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_float64 PASSED [1.7035s] [ 82%] 2025-12-04T14:02:33.5623605Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_int16 PASSED [0.0066s] [ 82%] 2025-12-04T14:02:33.5623699Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_int8 PASSED [1.7155s] [ 82%] 2025-12-04T14:02:33.5623797Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_uint8 PASSED [0.0068s] [ 82%] 2025-12-04T14:02:33.5623912Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_bfloat16 PASSED [1.7006s] [ 82%] 2025-12-04T14:02:33.5624028Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_complex128 PASSED [0.0062s] [ 82%] 2025-12-04T14:02:33.5624143Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_complex32 PASSED [0.0046s] [ 82%] 2025-12-04T14:02:33.5625608Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_float32 PASSED [0.0043s] [ 82%] 2025-12-04T14:02:33.5625724Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int16 PASSED [0.0042s] [ 82%] 2025-12-04T14:02:33.5625810Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int64 2025-12-04T14:02:33.5625817Z 2025-12-04T14:02:33.5625988Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_meta/test_meta-49e839ce31e1f1d7.xml - 2025-12-04T14:02:33.5626052Z !!!!!!!!!!!!!!!!!!!!!!!!!!!!!! KeyboardInterrupt !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:02:33.5626209Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py:2653: KeyboardInterrupt 2025-12-04T14:02:33.5626289Z (to show a full traceback on KeyboardInterrupt use --full-trace) 2025-12-04T14:02:33.5626369Z ========= 5844 passed, 4873 skipped, 318 xfailed in 1795.91s (0:29:55) ========= 2025-12-04T14:02:33.5626421Z Command took >30min, returning 124 2025-12-04T14:02:33.5626460Z Got exit code 124 2025-12-04T14:02:33.5626513Z Retrying single test... 
2025-12-04T14:02:33.5626637Z Test results will be stored in test-reports/python-pytest/test_meta/test_meta-d2ade7808535d64e.xml 2025-12-04T14:02:33.5626698Z ============================= test session starts ============================== 2025-12-04T14:02:33.5626814Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:02:33.5626855Z cachedir: .pytest_cache 2025-12-04T14:02:33.5627014Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:02:33.5627063Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T14:02:33.5627103Z configfile: pytest.ini 2025-12-04T14:02:33.5627269Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T14:02:33.5627523Z collecting ... /var/lib/jenkins/pytorch/test/test_meta.py:0: PytestCollectionWarning: cannot collect test class 'TestExpect' because it has a __new__ constructor (from: test/test_meta.py) 2025-12-04T14:02:33.5627592Z collected 40725 items / 13395 deselected / 27330 selected 2025-12-04T14:02:33.5627767Z stepcurrent: skipping 11035 already run items. 
Running only test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int64 2025-12-04T14:02:33.5627812Z Running 1 items in this shard 2025-12-04T14:02:33.5627815Z 2025-12-04T14:02:33.5627928Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int64 PASSED [0.0748s] [100%] 2025-12-04T14:02:33.5627930Z 2025-12-04T14:02:33.5628092Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_meta/test_meta-d2ade7808535d64e.xml - 2025-12-04T14:02:33.5628160Z ===================== 1 passed, 13395 deselected in 1.71s ====================== 2025-12-04T14:02:33.5628198Z Got exit code 0 2025-12-04T14:02:33.5628301Z Test succeeded in new process, continuing with the rest of the tests 2025-12-04T14:02:33.5628421Z Test results will be stored in test-reports/python-pytest/test_meta/test_meta-363f58d291636c85.xml 2025-12-04T14:02:33.5628481Z ============================= test session starts ============================== 2025-12-04T14:02:33.5628602Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:02:33.5628643Z cachedir: .pytest_cache 2025-12-04T14:02:33.5628797Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:02:33.5628844Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T14:02:33.5628883Z configfile: pytest.ini 2025-12-04T14:02:33.5629047Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T14:02:33.5629350Z collecting ... /var/lib/jenkins/pytorch/test/test_meta.py:0: PytestCollectionWarning: cannot collect test class 'TestExpect' because it has a __new__ constructor (from: test/test_meta.py) 2025-12-04T14:02:33.5629428Z collected 40725 items / 11036 deselected / 29689 selected 2025-12-04T14:02:33.5629487Z stepcurrent: skipping 11036 already run items. 
2025-12-04T14:02:33.5629534Z Running 2360 items in this shard 2025-12-04T14:02:33.5629536Z 2025-12-04T14:02:33.5629649Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_uint8 PASSED [0.0835s] [ 0%] 2025-12-04T14:02:33.5629798Z test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5629944Z test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5630119Z test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5630264Z test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5630404Z test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5630541Z test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5630680Z test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5630819Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5630961Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5631106Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_float16 SKIPPED [0.0015s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5631265Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_unbiased_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5631420Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_unbiased_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5631573Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_unbiased_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5631716Z test_meta.py::TestMetaCUDA::test_meta_inplace_stft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5631860Z test_meta.py::TestMetaCUDA::test_meta_inplace_stft_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5631960Z test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_complex128 PASSED [1.0941s] [ 0%] 2025-12-04T14:02:33.5632087Z test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_complex32 PASSED [0.0513s] [ 0%] 2025-12-04T14:02:33.5632182Z test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_int16 PASSED [0.0070s] [ 0%] 2025-12-04T14:02:33.5632328Z test_meta.py::TestMetaCUDA::test_meta_inplace_sum_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5632479Z test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5632638Z test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5632781Z test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T14:02:33.5632933Z test_meta.py::TestMetaCUDA::test_meta_inplace_svd_lowrank_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5633072Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5633246Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_int8 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5633342Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_bfloat16 PASSED [0.0047s] [ 1%] 2025-12-04T14:02:33.5633435Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_float64 PASSED [0.8570s] [ 1%] 2025-12-04T14:02:33.5633527Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_int32 PASSED [0.0065s] [ 1%] 2025-12-04T14:02:33.5633679Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5633827Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5633977Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5634127Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5634269Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5634415Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5634553Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5634691Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_float64 SKIPPED [0.0015s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5634826Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5634961Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T14:02:33.5635160Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_complex128 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 1%] 2025-12-04T14:02:33.5635352Z test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 1%] 2025-12-04T14:02:33.5635452Z test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_float16 PASSED [0.0102s] [ 1%] 2025-12-04T14:02:33.5635551Z test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_float64 PASSED [0.8413s] [ 1%] 2025-12-04T14:02:33.5635737Z test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_int16 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 1%] 2025-12-04T14:02:33.5635929Z test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 1%] 2025-12-04T14:02:33.5636029Z test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_bfloat16 PASSED [0.8686s] [ 1%] 2025-12-04T14:02:33.5636225Z test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_complex128 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 1%] 2025-12-04T14:02:33.5636332Z test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_float16 PASSED [0.8554s] [ 2%] 2025-12-04T14:02:33.5636488Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_complex64 SKIPPED [0.0015s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5636638Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5636785Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5636932Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_int64 SKIPPED [0.0010s] 
(No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5637098Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5637246Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensordot_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5637394Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensordot_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5637540Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensordot_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5637681Z test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5637816Z test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5637956Z test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5638089Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5638229Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5638366Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5638501Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5638635Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5638778Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_int32 SKIPPED [0.0009s] (No inplace variable for 
this op) [ 2%] 2025-12-04T14:02:33.5638921Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5639060Z test_meta.py::TestMetaCUDA::test_meta_inplace_topk_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5639198Z test_meta.py::TestMetaCUDA::test_meta_inplace_topk_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5639336Z test_meta.py::TestMetaCUDA::test_meta_inplace_topk_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5639469Z test_meta.py::TestMetaCUDA::test_meta_inplace_topk_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T14:02:33.5639604Z test_meta.py::TestMetaCUDA::test_meta_inplace_topk_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5639771Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch__scaled_mm_cuda_float8_e4m3fn SKIPPED [0.0005s] (Requires CUDA SM >= 8.9) [ 3%] 2025-12-04T14:02:33.5639926Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch__scaled_mm_v2_cuda_float8_e4m3fn SKIPPED [0.0005s] (Requires CUDA SM >= 8.9) [ 3%] 2025-12-04T14:02:33.5640193Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_bfloat16 SKIPPED [0.0005s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 3%] 2025-12-04T14:02:33.5640442Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float16 SKIPPED [0.0006s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 3%] 2025-12-04T14:02:33.5640622Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5640767Z test_meta.py::TestMetaCUDA::test_meta_inplace_trace_cuda_complex32 SKIPPED [0.0009s] (No inplace 
variable for this op) [ 3%] 2025-12-04T14:02:33.5640956Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5641105Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5641263Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5641416Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5641567Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5641669Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_bool PASSED [0.0057s] [ 3%] 2025-12-04T14:02:33.5641777Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_complex64 PASSED [0.0068s] [ 3%] 2025-12-04T14:02:33.5641880Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_int16 PASSED [0.0059s] [ 3%] 2025-12-04T14:02:33.5641979Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_int64 PASSED [0.0063s] [ 3%] 2025-12-04T14:02:33.5642132Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5642283Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5642432Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5642578Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5642722Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5642867Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5643004Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5643165Z test_meta.py::TestMetaCUDA::test_meta_inplace_triangular_solve_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T14:02:33.5643262Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_float16 PASSED [0.0149s] [ 4%] 2025-12-04T14:02:33.5643358Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_float64 PASSED [0.0087s] [ 4%] 2025-12-04T14:02:33.5643451Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_int16 PASSED [0.0083s] [ 4%] 2025-12-04T14:02:33.5643545Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_int64 PASSED [0.0074s] [ 4%] 2025-12-04T14:02:33.5643650Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_uint8 PASSED [0.0087s] [ 4%] 2025-12-04T14:02:33.5643747Z test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_float16 PASSED [0.0083s] [ 4%] 2025-12-04T14:02:33.5643840Z test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_float64 PASSED [0.0087s] [ 4%] 2025-12-04T14:02:33.5643932Z test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_int32 PASSED [0.0082s] [ 4%] 2025-12-04T14:02:33.5644141Z test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.5644348Z test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T14:02:33.5644444Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_float64 PASSED [0.8674s] [ 4%] 2025-12-04T14:02:33.5644540Z test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_int16 PASSED [0.0056s] [ 4%] 2025-12-04T14:02:33.5644643Z test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_int32 PASSED [0.8416s] [ 4%] 2025-12-04T14:02:33.5644755Z test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_int64 PASSED [0.0039s] [ 4%] 2025-12-04T14:02:33.5644911Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.5645061Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.5645210Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.5645354Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.5645498Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.5645642Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.5645783Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.5645922Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.5646066Z test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T14:02:33.5646215Z test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 
2025-12-04T14:02:33.5646360Z test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5646505Z test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_uint8 SKIPPED [0.0012s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5646655Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5646803Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5646951Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5647097Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5647240Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5647398Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5647545Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5647691Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5647846Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5647988Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5648127Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_uint8 SKIPPED [0.0010s] (No inplace 
variable for this op) [ 5%] 2025-12-04T14:02:33.5648231Z test_meta.py::TestMetaCUDA::test_meta_inplace_uniform_cuda_bfloat16 PASSED [0.8450s] [ 5%] 2025-12-04T14:02:33.5648339Z test_meta.py::TestMetaCUDA::test_meta_inplace_uniform_cuda_complex128 PASSED [0.0048s] [ 5%] 2025-12-04T14:02:33.5648450Z test_meta.py::TestMetaCUDA::test_meta_inplace_uniform_cuda_float32 PASSED [0.0036s] [ 5%] 2025-12-04T14:02:33.5648562Z test_meta.py::TestMetaCUDA::test_meta_inplace_uniform_cuda_float64 PASSED [0.8622s] [ 5%] 2025-12-04T14:02:33.5648724Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_bfloat16 SKIPPED [0.0014s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5648883Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5649038Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5649196Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5649352Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5649499Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T14:02:33.5649641Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5649786Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5649925Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5650066Z 
test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_uint32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5650270Z test_meta.py::TestMetaCUDA::test_meta_inplace_unravel_index_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5650418Z test_meta.py::TestMetaCUDA::test_meta_inplace_unravel_index_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5650568Z test_meta.py::TestMetaCUDA::test_meta_inplace_unravel_index_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5650722Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5650876Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5651020Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5651174Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5651340Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5651500Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5651655Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5651819Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5651969Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_int64 SKIPPED 
[0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5652068Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_bool PASSED [0.0060s] [ 6%] 2025-12-04T14:02:33.5652178Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_complex128 PASSED [0.0054s] [ 6%] 2025-12-04T14:02:33.5652286Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_complex64 PASSED [0.0054s] [ 6%] 2025-12-04T14:02:33.5652417Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_float32 PASSED [0.0054s] [ 6%] 2025-12-04T14:02:33.5652515Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_int16 PASSED [0.0053s] [ 6%] 2025-12-04T14:02:33.5652614Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_uint8 PASSED [0.0053s] [ 6%] 2025-12-04T14:02:33.5652755Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5652900Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T14:02:33.5653041Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5653189Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5653334Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5653495Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_unbiased_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5653657Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_unbiased_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5653807Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_bfloat16 SKIPPED [0.0009s] (No 
inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5653959Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5654107Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5654252Z test_meta.py::TestMetaCUDA::test_meta_inplace_vdot_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5654391Z test_meta.py::TestMetaCUDA::test_meta_inplace_vdot_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5654529Z test_meta.py::TestMetaCUDA::test_meta_inplace_vdot_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5654666Z test_meta.py::TestMetaCUDA::test_meta_inplace_vdot_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5654803Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5654950Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5655091Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5655247Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5655397Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5655555Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5655699Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_float32 SKIPPED 
[0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5655840Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5655978Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5656119Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5656278Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T14:02:33.5656420Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5656564Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5656701Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5656840Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5656973Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5657107Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5657256Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5657402Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5657544Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_float32 SKIPPED [0.0009s] (No inplace 
variable for this op) [ 8%] 2025-12-04T14:02:33.5657683Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5657821Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5657967Z test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5658110Z test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5658252Z test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5658389Z test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5658529Z test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5658664Z test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5658803Z test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5658956Z test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T14:02:33.5659058Z test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_bfloat16 PASSED [0.0312s] [ 8%] 2025-12-04T14:02:33.5659156Z test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_float32 PASSED [0.0075s] [ 8%] 2025-12-04T14:02:33.5659251Z test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_float64 PASSED [0.0073s] [ 8%] 2025-12-04T14:02:33.5659453Z test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_int16 SKIPPED 
[0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.5659642Z test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T14:02:33.5659829Z test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 9%] 2025-12-04T14:02:33.5659930Z test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_bfloat16 PASSED [0.8658s] [ 9%] 2025-12-04T14:02:33.5660056Z test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_float16 PASSED [0.0047s] [ 9%] 2025-12-04T14:02:33.5660193Z test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_float32 PASSED [0.0047s] [ 9%] 2025-12-04T14:02:33.5660289Z test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_int32 PASSED [0.8637s] [ 9%] 2025-12-04T14:02:33.5660427Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_bool SKIPPED [0.0015s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.5660571Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.5660717Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.5660858Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.5660997Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.5661137Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.5661280Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_int16 SKIPPED [0.0010s] (No inplace variable for 
this op) [ 9%] 2025-12-04T14:02:33.5661420Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T14:02:33.5661518Z test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_complex64 PASSED [0.0034s] [ 9%] 2025-12-04T14:02:33.5661612Z test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_float16 PASSED [0.8549s] [ 9%] 2025-12-04T14:02:33.5661705Z test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_float64 PASSED [0.0040s] [ 9%] 2025-12-04T14:02:33.5661798Z test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_int64 PASSED [0.8427s] [ 9%] 2025-12-04T14:02:33.5661889Z test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_int8 PASSED [0.0036s] [ 9%] 2025-12-04T14:02:33.5661983Z test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_bfloat16 PASSED [0.8542s] [ 9%] 2025-12-04T14:02:33.5662072Z test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_bool PASSED [0.0057s] [ 9%] 2025-12-04T14:02:33.5662170Z test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_complex128 PASSED [0.8505s] [ 9%] 2025-12-04T14:02:33.5662261Z test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_float32 PASSED [0.0036s] [ 9%] 2025-12-04T14:02:33.5662352Z test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_int64 PASSED [0.8476s] [ 9%] 2025-12-04T14:02:33.5662464Z test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_complex128 PASSED [0.0249s] [ 10%] 2025-12-04T14:02:33.5662573Z test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_float16 PASSED [0.0100s] [ 10%] 2025-12-04T14:02:33.5662702Z test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_float64 PASSED [0.0097s] [ 10%] 2025-12-04T14:02:33.5662802Z test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_bool PASSED [0.0060s] [ 10%] 2025-12-04T14:02:33.5662902Z test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_float16 PASSED [0.0059s] [ 10%] 2025-12-04T14:02:33.5663016Z 
test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_float32 PASSED [0.0058s] [ 10%] 2025-12-04T14:02:33.5663114Z test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_int16 PASSED [0.0057s] [ 10%] 2025-12-04T14:02:33.5663212Z test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_int32 PASSED [0.0058s] [ 10%] 2025-12-04T14:02:33.5663308Z test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_uint8 PASSED [0.0058s] [ 10%] 2025-12-04T14:02:33.5663405Z test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_int16 PASSED [0.0161s] [ 10%] 2025-12-04T14:02:33.5663502Z test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_int32 PASSED [0.0820s] [ 10%] 2025-12-04T14:02:33.5663628Z test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_int64 PASSED [0.0068s] [ 10%] 2025-12-04T14:02:33.5663725Z test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_int8 PASSED [0.0065s] [ 10%] 2025-12-04T14:02:33.5663830Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmatmul___cuda_float32 PASSED [1.8188s] [ 10%] 2025-12-04T14:02:33.5663933Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_float16 PASSED [0.0301s] [ 10%] 2025-12-04T14:02:33.5664029Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_int8 PASSED [0.0075s] [ 10%] 2025-12-04T14:02:33.5664125Z test_meta.py::TestMetaCUDA::test_meta_outplace___ror___cuda_int64 PASSED [0.0059s] [ 10%] 2025-12-04T14:02:33.5664229Z test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_bfloat16 PASSED [0.0283s] [ 10%] 2025-12-04T14:02:33.5664330Z test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_float32 PASSED [0.0061s] [ 10%] 2025-12-04T14:02:33.5664431Z test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_float64 PASSED [0.0058s] [ 10%] 2025-12-04T14:02:33.5664529Z test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_int32 PASSED [0.0058s] [ 10%] 2025-12-04T14:02:33.5664625Z test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_uint8 
PASSED [0.0058s] [ 10%] 2025-12-04T14:02:33.5664728Z test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_bfloat16 PASSED [0.0060s] [ 10%] 2025-12-04T14:02:33.5664827Z test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_float16 PASSED [0.0059s] [ 10%] 2025-12-04T14:02:33.5664927Z test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_float64 PASSED [0.0058s] [ 11%] 2025-12-04T14:02:33.5665021Z test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_int8 PASSED [0.0058s] [ 11%] 2025-12-04T14:02:33.5665118Z test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_uint8 PASSED [0.0058s] [ 11%] 2025-12-04T14:02:33.5665243Z test_meta.py::TestMetaCUDA::test_meta_outplace__batch_norm_with_update_cuda_bfloat16 PASSED [0.0284s] [ 11%] 2025-12-04T14:02:33.5665367Z test_meta.py::TestMetaCUDA::test_meta_outplace__batch_norm_with_update_cuda_float64 PASSED [0.0140s] [ 11%] 2025-12-04T14:02:33.5665473Z test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_float32 PASSED [0.0324s] [ 11%] 2025-12-04T14:02:33.5665579Z test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_float64 PASSED [0.0111s] [ 11%] 2025-12-04T14:02:33.5665682Z test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_int64 PASSED [0.0108s] [ 11%] 2025-12-04T14:02:33.5665792Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_complex128 PASSED [0.0096s] [ 11%] 2025-12-04T14:02:33.5665900Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_float32 PASSED [0.0235s] [ 11%] 2025-12-04T14:02:33.5666003Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_int64 PASSED [0.0079s] [ 11%] 2025-12-04T14:02:33.5666118Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_uint8 PASSED [0.0077s] [ 11%] 2025-12-04T14:02:33.5666231Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_bfloat16 PASSED [0.0080s] [ 11%] 2025-12-04T14:02:33.5666336Z 
test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_bool PASSED [0.0124s] [ 11%] 2025-12-04T14:02:33.5666459Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_complex128 PASSED [0.0078s] [ 11%] 2025-12-04T14:02:33.5666568Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_float64 PASSED [0.0077s] [ 11%] 2025-12-04T14:02:33.5666673Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_int16 PASSED [0.0079s] [ 11%] 2025-12-04T14:02:33.5666774Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_bool XFAIL [0.0232s] [ 11%] 2025-12-04T14:02:33.5666884Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_complex128 XFAIL [0.9670s] [ 11%] 2025-12-04T14:02:33.5666994Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_complex64 XFAIL [0.9179s] [ 11%] 2025-12-04T14:02:33.5667121Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_float16 XFAIL [0.9127s] [ 11%] 2025-12-04T14:02:33.5667228Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_float64 XFAIL [0.9162s] [ 11%] 2025-12-04T14:02:33.5667330Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int64 XFAIL [0.9064s] [ 11%] 2025-12-04T14:02:33.5667438Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_bool XFAIL [0.8954s] [ 11%] 2025-12-04T14:02:33.5667555Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_complex64 PASSED [1.0323s] [ 12%] 2025-12-04T14:02:33.5667665Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_float32 PASSED [0.0954s] [ 12%] 2025-12-04T14:02:33.5667776Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_float64 PASSED [0.0953s] [ 12%] 2025-12-04T14:02:33.5667883Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_bool XFAIL [0.0053s] [ 12%] 2025-12-04T14:02:33.5668001Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_complex128 PASSED [1.0267s] [ 
12%] 2025-12-04T14:02:33.5668110Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_int8 PASSED [0.0876s] [ 12%] 2025-12-04T14:02:33.5668220Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_uint8 PASSED [0.0660s] [ 12%] 2025-12-04T14:02:33.5668331Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_bfloat16 PASSED [0.0081s] [ 12%] 2025-12-04T14:02:33.5668437Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_bool PASSED [0.0126s] [ 12%] 2025-12-04T14:02:33.5668544Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_float32 PASSED [0.0078s] [ 12%] 2025-12-04T14:02:33.5668651Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_int32 PASSED [0.0079s] [ 12%] 2025-12-04T14:02:33.5668761Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_bfloat16 PASSED [0.0079s] [ 12%] 2025-12-04T14:02:33.5668868Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_bool PASSED [0.0108s] [ 12%] 2025-12-04T14:02:33.5668979Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_complex128 PASSED [0.0079s] [ 12%] 2025-12-04T14:02:33.5669089Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_complex64 PASSED [0.0078s] [ 12%] 2025-12-04T14:02:33.5669199Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_float64 PASSED [0.0078s] [ 12%] 2025-12-04T14:02:33.5669303Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_int32 PASSED [0.0079s] [ 12%] 2025-12-04T14:02:33.5669408Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_int64 PASSED [0.0078s] [ 12%] 2025-12-04T14:02:33.5669512Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_uint8 PASSED [0.0079s] [ 12%] 2025-12-04T14:02:33.5669634Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_bfloat16 PASSED [0.0079s] [ 12%] 2025-12-04T14:02:33.5669739Z 
test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_bool XFAIL [0.0035s] [ 12%] 2025-12-04T14:02:33.5669848Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_float64 PASSED [0.9178s] [ 12%] 2025-12-04T14:02:33.5669952Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_int32 PASSED [0.0083s] [ 12%] 2025-12-04T14:02:33.5670070Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_int8 PASSED [0.0080s] [ 13%] 2025-12-04T14:02:33.5670216Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_uint8 PASSED [0.0078s] [ 13%] 2025-12-04T14:02:33.5670334Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_bfloat16 PASSED [0.1089s] [ 13%] 2025-12-04T14:02:33.5670452Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_complex128 XFAIL [0.0045s] [ 13%] 2025-12-04T14:02:33.5670570Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_complex64 XFAIL [0.9053s] [ 13%] 2025-12-04T14:02:33.5670713Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_float32 PASSED [0.1086s] [ 13%] 2025-12-04T14:02:33.5670825Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_int8 PASSED [0.1128s] [ 13%] 2025-12-04T14:02:33.5670940Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_bfloat16 PASSED [0.1090s] [ 13%] 2025-12-04T14:02:33.5671051Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_bool XFAIL [0.0152s] [ 13%] 2025-12-04T14:02:33.5671162Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_int64 PASSED [0.0741s] [ 13%] 2025-12-04T14:02:33.5671270Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_float64 PASSED [0.0088s] [ 13%] 2025-12-04T14:02:33.5671374Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_int8 PASSED [0.0087s] [ 13%] 2025-12-04T14:02:33.5671483Z 
test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_complex64 PASSED [0.0079s] [ 13%] 2025-12-04T14:02:33.5671592Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_float16 PASSED [0.0079s] [ 13%] 2025-12-04T14:02:33.5671698Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_float32 PASSED [0.0077s] [ 13%] 2025-12-04T14:02:33.5671806Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_float64 PASSED [0.0077s] [ 13%] 2025-12-04T14:02:33.5671913Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_int16 PASSED [0.0117s] [ 13%] 2025-12-04T14:02:33.5672018Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_uint8 PASSED [0.0079s] [ 13%] 2025-12-04T14:02:33.5672126Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_bfloat16 PASSED [0.0079s] [ 13%] 2025-12-04T14:02:33.5672231Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_int16 PASSED [0.0117s] [ 13%] 2025-12-04T14:02:33.5672336Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_int8 PASSED [0.0079s] [ 13%] 2025-12-04T14:02:33.5672447Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_bfloat16 PASSED [0.0580s] [ 13%] 2025-12-04T14:02:33.5672558Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_complex128 PASSED [0.0685s] [ 13%] 2025-12-04T14:02:33.5672663Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_int32 PASSED [0.0514s] [ 13%] 2025-12-04T14:02:33.5672767Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_int8 PASSED [0.0416s] [ 14%] 2025-12-04T14:02:33.5672871Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_uint8 PASSED [0.0414s] [ 14%] 2025-12-04T14:02:33.5672979Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_bfloat16 PASSED [0.0080s] [ 14%] 2025-12-04T14:02:33.5673083Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_bool PASSED [0.0148s] [ 14%] 
2025-12-04T14:02:33.5673191Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_complex64 XFAIL [0.0036s] [ 14%] 2025-12-04T14:02:33.5673310Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_float16 PASSED [0.9138s] [ 14%] 2025-12-04T14:02:33.5673415Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_int32 PASSED [0.0084s] [ 14%] 2025-12-04T14:02:33.5673518Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_int64 PASSED [0.0080s] [ 14%] 2025-12-04T14:02:33.5673643Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_int8 PASSED [0.0079s] [ 14%] 2025-12-04T14:02:33.5673746Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_uint8 PASSED [0.0079s] [ 14%] 2025-12-04T14:02:33.5673854Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_float64 PASSED [0.0078s] [ 14%] 2025-12-04T14:02:33.5673961Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_int64 PASSED [0.0093s] [ 14%] 2025-12-04T14:02:33.5674067Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_float32 PASSED [0.0078s] [ 14%] 2025-12-04T14:02:33.5674170Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_int8 PASSED [0.0266s] [ 14%] 2025-12-04T14:02:33.5674300Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_bool PASSED [0.0082s] [ 14%] 2025-12-04T14:02:33.5674414Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_complex128 PASSED [0.0079s] [ 14%] 2025-12-04T14:02:33.5674526Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_float16 PASSED [0.0080s] [ 14%] 2025-12-04T14:02:33.5674636Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_float32 PASSED [0.0077s] [ 14%] 2025-12-04T14:02:33.5674741Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_int16 PASSED [0.0079s] [ 14%] 2025-12-04T14:02:33.5674847Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_int64 PASSED 
[0.0079s] [ 14%] 2025-12-04T14:02:33.5674950Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_bool XFAIL [0.0035s] [ 14%] 2025-12-04T14:02:33.5675060Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_float16 PASSED [0.9189s] [ 14%] 2025-12-04T14:02:33.5675169Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_float32 PASSED [0.0081s] [ 14%] 2025-12-04T14:02:33.5675278Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_float64 PASSED [0.0079s] [ 15%] 2025-12-04T14:02:33.5675384Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_int16 PASSED [0.0079s] [ 15%] 2025-12-04T14:02:33.5675489Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_int32 PASSED [0.0078s] [ 15%] 2025-12-04T14:02:33.5675596Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_float64 PASSED [0.0185s] [ 15%] 2025-12-04T14:02:33.5675699Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_int64 XFAIL [0.0035s] [ 15%] 2025-12-04T14:02:33.5675809Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_complex64 PASSED [0.9991s] [ 15%] 2025-12-04T14:02:33.5675917Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_float64 PASSED [0.0588s] [ 15%] 2025-12-04T14:02:33.5676022Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_int16 XFAIL [0.0054s] [ 15%] 2025-12-04T14:02:33.5676125Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_int64 XFAIL [0.9048s] [ 15%] 2025-12-04T14:02:33.5676239Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_bfloat16 PASSED [0.9020s] [ 15%] 2025-12-04T14:02:33.5676346Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_bool PASSED [0.0096s] [ 15%] 2025-12-04T14:02:33.5676459Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_complex64 XFAIL [0.0036s] [ 15%] 2025-12-04T14:02:33.5676570Z 
test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_float64 PASSED [0.9037s] [ 15%] 2025-12-04T14:02:33.5676677Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_int64 PASSED [0.0083s] [ 15%] 2025-12-04T14:02:33.5676793Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_int8 PASSED [0.0080s] [ 15%] 2025-12-04T14:02:33.5676902Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_int8 PASSED [0.0150s] [ 15%] 2025-12-04T14:02:33.5677008Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_uint8 PASSED [0.0080s] [ 15%] 2025-12-04T14:02:33.5677126Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_bool PASSED [0.0079s] [ 15%] 2025-12-04T14:02:33.5677237Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_complex64 PASSED [0.0078s] [ 15%] 2025-12-04T14:02:33.5677346Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_float16 PASSED [0.0079s] [ 15%] 2025-12-04T14:02:33.5677453Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_float32 PASSED [0.0077s] [ 15%] 2025-12-04T14:02:33.5677562Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_float64 PASSED [0.0076s] [ 15%] 2025-12-04T14:02:33.5677669Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_float32 PASSED [0.0078s] [ 15%] 2025-12-04T14:02:33.5677800Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_float64 PASSED [0.0077s] [ 15%] 2025-12-04T14:02:33.5677906Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_int32 PASSED [0.0079s] [ 16%] 2025-12-04T14:02:33.5678011Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_bool PASSED [0.0078s] [ 16%] 2025-12-04T14:02:33.5678115Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_float32 PASSED [0.0077s] [ 16%] 2025-12-04T14:02:33.5678220Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_int16 PASSED [0.0079s] [ 16%] 
2025-12-04T14:02:33.5678324Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_int32 PASSED [0.0079s] [ 16%] 2025-12-04T14:02:33.5678428Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_uint8 PASSED [0.0079s] [ 16%] 2025-12-04T14:02:33.5678535Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_float64 PASSED [0.0102s] [ 16%] 2025-12-04T14:02:33.5678640Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_int16 PASSED [0.0048s] [ 16%] 2025-12-04T14:02:33.5678744Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_int32 PASSED [0.0048s] [ 16%] 2025-12-04T14:02:33.5678846Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_int64 PASSED [0.0048s] [ 16%] 2025-12-04T14:02:33.5678962Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_complex128 XFAIL [0.0044s] [ 16%] 2025-12-04T14:02:33.5679074Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_complex64 XFAIL [0.9162s] [ 16%] 2025-12-04T14:02:33.5679186Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_float64 PASSED [0.1084s] [ 16%] 2025-12-04T14:02:33.5679293Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_int16 PASSED [0.0742s] [ 16%] 2025-12-04T14:02:33.5679402Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_int32 PASSED [0.0747s] [ 16%] 2025-12-04T14:02:33.5679511Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_int64 PASSED [0.0742s] [ 16%] 2025-12-04T14:02:33.5679618Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_int32 PASSED [0.0784s] [ 16%] 2025-12-04T14:02:33.5679722Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_bool PASSED [0.0401s] [ 16%] 2025-12-04T14:02:33.5679833Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_complex128 PASSED [0.0660s] [ 16%] 2025-12-04T14:02:33.5679936Z 
test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_int32 PASSED [0.0395s] [ 16%] 2025-12-04T14:02:33.5680039Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_int8 PASSED [0.0397s] [ 16%] 2025-12-04T14:02:33.5680211Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_uint8 PASSED [0.0394s] [ 16%] 2025-12-04T14:02:33.5680336Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_complex128 PASSED [0.0079s] [ 16%] 2025-12-04T14:02:33.5680447Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_complex64 PASSED [0.0077s] [ 16%] 2025-12-04T14:02:33.5680553Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_float32 PASSED [0.0078s] [ 17%] 2025-12-04T14:02:33.5680658Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_uint8 PASSED [0.0076s] [ 17%] 2025-12-04T14:02:33.5680779Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_complex64 PASSED [0.1687s] [ 17%] 2025-12-04T14:02:33.5680888Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_float16 PASSED [0.0747s] [ 17%] 2025-12-04T14:02:33.5680992Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_int16 XFAIL [0.0036s] [ 17%] 2025-12-04T14:02:33.5681097Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_int32 XFAIL [0.9085s] [ 17%] 2025-12-04T14:02:33.5681201Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_int64 XFAIL [0.9074s] [ 17%] 2025-12-04T14:02:33.5681325Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_bfloat16 PASSED [0.9514s] [ 17%] 2025-12-04T14:02:33.5681446Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_complex64 PASSED [0.0578s] [ 17%] 2025-12-04T14:02:33.5681555Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_float16 PASSED [0.0456s] [ 17%] 2025-12-04T14:02:33.5681660Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_int16 PASSED [0.0320s] [ 17%] 
2025-12-04T14:02:33.5681767Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_int64 PASSED [0.0318s] [ 17%] 2025-12-04T14:02:33.5681870Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_uint8 PASSED [0.0317s] [ 17%] 2025-12-04T14:02:33.5681990Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_bfloat16 PASSED [0.0079s] [ 17%] 2025-12-04T14:02:33.5682114Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_complex128 PASSED [0.0079s] [ 17%] 2025-12-04T14:02:33.5682233Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_float16 PASSED [0.0078s] [ 17%] 2025-12-04T14:02:33.5682351Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_float64 PASSED [0.0077s] [ 17%] 2025-12-04T14:02:33.5682465Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_int32 PASSED [0.0079s] [ 17%] 2025-12-04T14:02:33.5682582Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_int64 PASSED [0.0079s] [ 17%] 2025-12-04T14:02:33.5682695Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_uint8 PASSED [0.0079s] [ 17%] 2025-12-04T14:02:33.5682807Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_float16 PASSED [0.0078s] [ 17%] 2025-12-04T14:02:33.5682915Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_int8 PASSED [0.0078s] [ 17%] 2025-12-04T14:02:33.5683031Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_complex128 PASSED [0.0080s] [ 17%] 2025-12-04T14:02:33.5683139Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_int16 PASSED [0.0079s] [ 18%] 2025-12-04T14:02:33.5683249Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_int32 PASSED [0.0080s] [ 18%] 2025-12-04T14:02:33.5683358Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_bool PASSED [0.0069s] [ 18%] 2025-12-04T14:02:33.5683468Z 
test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_int32 PASSED [0.0203s] [ 18%] 2025-12-04T14:02:33.5683580Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_complex128 PASSED [0.0080s] [ 18%] 2025-12-04T14:02:33.5683690Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_float16 PASSED [0.0080s] [ 18%] 2025-12-04T14:02:33.5683794Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_int32 PASSED [0.0115s] [ 18%] 2025-12-04T14:02:33.5683899Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_int64 PASSED [0.0080s] [ 18%] 2025-12-04T14:02:33.5684016Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_int8 PASSED [0.0079s] [ 18%] 2025-12-04T14:02:33.5684125Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_bfloat16 PASSED [0.0080s] [ 18%] 2025-12-04T14:02:33.5684238Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_complex64 PASSED [0.0078s] [ 18%] 2025-12-04T14:02:33.5684355Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_float64 PASSED [0.0077s] [ 18%] 2025-12-04T14:02:33.5684460Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_int32 PASSED [0.0110s] [ 18%] 2025-12-04T14:02:33.5684567Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_uint8 PASSED [0.0079s] [ 18%] 2025-12-04T14:02:33.5684681Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sqrt_cuda_bfloat16 PASSED [0.0079s] [ 18%] 2025-12-04T14:02:33.5684791Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_complex128 XFAIL [0.0107s] [ 18%] 2025-12-04T14:02:33.5684905Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_int8 XFAIL [0.9155s] [ 18%] 2025-12-04T14:02:33.5685018Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_uint8 XFAIL [0.9195s] [ 18%] 2025-12-04T14:02:33.5685123Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_int32 PASSED [0.9127s] [ 18%] 
2025-12-04T14:02:33.5685226Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_int8 PASSED [0.0084s] [ 18%] 2025-12-04T14:02:33.5685332Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_bool PASSED [0.0080s] [ 18%] 2025-12-04T14:02:33.5685442Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_complex128 PASSED [0.0080s] [ 18%] 2025-12-04T14:02:33.5685556Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_complex64 PASSED [0.0078s] [ 18%] 2025-12-04T14:02:33.5685668Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_float64 PASSED [0.0079s] [ 18%] 2025-12-04T14:02:33.5685776Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_int64 PASSED [0.0079s] [ 19%] 2025-12-04T14:02:33.5685885Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_bool PASSED [0.0059s] [ 19%] 2025-12-04T14:02:33.5685993Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_float16 PASSED [0.0060s] [ 19%] 2025-12-04T14:02:33.5686102Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_int16 PASSED [0.0059s] [ 19%] 2025-12-04T14:02:33.5686210Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_int32 PASSED [0.0059s] [ 19%] 2025-12-04T14:02:33.5686315Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_int64 PASSED [0.0059s] [ 19%] 2025-12-04T14:02:33.5686419Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_int8 PASSED [0.0059s] [ 19%] 2025-12-04T14:02:33.5686546Z test_meta.py::TestMetaCUDA::test_meta_outplace__native_batch_norm_legit_cuda_bfloat16 PASSED [0.0133s] [ 19%] 2025-12-04T14:02:33.5686671Z test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_lengths_cuda_bfloat16 PASSED [0.0813s] [ 19%] 2025-12-04T14:02:33.5686802Z test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_offsets_cuda_float16 PASSED [0.0358s] [ 19%] 2025-12-04T14:02:33.5686925Z 
test_meta.py::TestMetaCUDA::test_meta_outplace__softmax_backward_data_cuda_bfloat16 PASSED [0.9287s] [ 19%] 2025-12-04T14:02:33.5687071Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_complex64 PASSED [0.1113s] [ 19%] 2025-12-04T14:02:33.5687208Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float32 PASSED [0.0145s] [ 19%] 2025-12-04T14:02:33.5687345Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float64 PASSED [0.0142s] [ 19%] 2025-12-04T14:02:33.5687480Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int64 PASSED [0.0141s] [ 19%] 2025-12-04T14:02:33.5687618Z test_meta.py::TestMetaCUDA::test_meta_outplace__upsample_bilinear2d_aa_cuda_float16 PASSED [0.9203s] [ 19%] 2025-12-04T14:02:33.5687723Z test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_complex128 PASSED [0.0043s] [ 19%] 2025-12-04T14:02:33.5687819Z test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_float64 PASSED [0.9062s] [ 19%] 2025-12-04T14:02:33.5687928Z test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_int16 PASSED [0.0055s] [ 19%] 2025-12-04T14:02:33.5688022Z test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_int64 PASSED [0.0039s] [ 19%] 2025-12-04T14:02:33.5688118Z test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_bool PASSED [0.9176s] [ 19%] 2025-12-04T14:02:33.5688220Z test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_complex32 PASSED [0.0054s] [ 19%] 2025-12-04T14:02:33.5688318Z test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_int16 PASSED [0.0039s] [ 19%] 2025-12-04T14:02:33.5688412Z test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_uint8 PASSED [0.9156s] [ 20%] 2025-12-04T14:02:33.5688521Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_bfloat16 PASSED [0.0115s] [ 20%] 2025-12-04T14:02:33.5688632Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_float16 PASSED 
[0.0099s] [ 20%] 2025-12-04T14:02:33.5688726Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_float32 PASSED [0.0095s] [ 20%] 2025-12-04T14:02:33.5688821Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_int16 PASSED [0.9095s] [ 20%] 2025-12-04T14:02:33.5688915Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_int64 PASSED [0.0113s] [ 20%] 2025-12-04T14:02:33.5689007Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_uint8 PASSED [0.0097s] [ 20%] 2025-12-04T14:02:33.5689114Z test_meta.py::TestMetaCUDA::test_meta_outplace_addbmm_cuda_complex128 PASSED [0.0768s] [ 20%] 2025-12-04T14:02:33.5689217Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_float32 PASSED [0.0124s] [ 20%] 2025-12-04T14:02:33.5689322Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_float64 PASSED [0.0119s] [ 20%] 2025-12-04T14:02:33.5689432Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_complex128 PASSED [0.3535s] [ 20%] 2025-12-04T14:02:33.5689531Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_float64 PASSED [0.0123s] [ 20%] 2025-12-04T14:02:33.5689632Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_int16 PASSED [0.0119s] [ 20%] 2025-12-04T14:02:33.5689729Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_int8 PASSED [0.0118s] [ 20%] 2025-12-04T14:02:33.5689828Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_bfloat16 PASSED [0.3258s] [ 20%] 2025-12-04T14:02:33.5689925Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_float64 PASSED [0.0768s] [ 20%] 2025-12-04T14:02:33.5690045Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_decomposed_cuda_bfloat16 PASSED [0.0094s] [ 20%] 2025-12-04T14:02:33.5690196Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_decomposed_cuda_float64 PASSED [0.0088s] [ 20%] 2025-12-04T14:02:33.5690305Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmv_cuda_complex128 PASSED [0.0096s] [ 20%] 2025-12-04T14:02:33.5690407Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_addmv_cuda_complex64 PASSED [0.0095s] [ 20%] 2025-12-04T14:02:33.5690503Z test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_bool PASSED [0.0152s] [ 20%] 2025-12-04T14:02:33.5690600Z test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_float16 PASSED [0.0088s] [ 20%] 2025-12-04T14:02:33.5690695Z test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_float64 PASSED [0.0085s] [ 20%] 2025-12-04T14:02:33.5690788Z test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_int32 PASSED [0.9135s] [ 20%] 2025-12-04T14:02:33.5690881Z test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_int64 PASSED [0.0085s] [ 21%] 2025-12-04T14:02:33.5690989Z test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_bfloat16 PASSED [0.9093s] [ 21%] 2025-12-04T14:02:33.5691108Z test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_bool PASSED [0.0045s] [ 21%] 2025-12-04T14:02:33.5691222Z test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_complex128 PASSED [0.0033s] [ 21%] 2025-12-04T14:02:33.5691328Z test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_int32 PASSED [0.9070s] [ 21%] 2025-12-04T14:02:33.5691433Z test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_int64 PASSED [0.0046s] [ 21%] 2025-12-04T14:02:33.5691546Z test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_int8 PASSED [0.0033s] [ 21%] 2025-12-04T14:02:33.5691651Z test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_uint8 PASSED [0.9051s] [ 21%] 2025-12-04T14:02:33.5691744Z test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_bool PASSED [0.0185s] [ 21%] 2025-12-04T14:02:33.5691842Z test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_float16 PASSED [0.0181s] [ 21%] 2025-12-04T14:02:33.5691937Z test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_float32 PASSED [0.0178s] [ 21%] 2025-12-04T14:02:33.5692030Z test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_int8 PASSED [0.0178s] [ 21%] 
2025-12-04T14:02:33.5692156Z test_meta.py::TestMetaCUDA::test_meta_outplace_allclose_cuda_float64 PASSED [0.0277s] [ 21%] 2025-12-04T14:02:33.5692255Z test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_float64 PASSED [0.9161s] [ 21%] 2025-12-04T14:02:33.5692348Z test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_int8 PASSED [0.0130s] [ 21%] 2025-12-04T14:02:33.5692445Z test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_uint8 PASSED [0.0113s] [ 21%] 2025-12-04T14:02:33.5692538Z test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_bool PASSED [0.0112s] [ 21%] 2025-12-04T14:02:33.5692637Z test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_float16 PASSED [0.0123s] [ 21%] 2025-12-04T14:02:33.5692729Z test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_int64 PASSED [0.0111s] [ 21%] 2025-12-04T14:02:33.5692823Z test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_uint8 PASSED [0.0111s] [ 21%] 2025-12-04T14:02:33.5692926Z test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_bfloat16 PASSED [0.0094s] [ 21%] 2025-12-04T14:02:33.5693029Z test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_float16 PASSED [0.0049s] [ 21%] 2025-12-04T14:02:33.5693125Z test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_int16 PASSED [0.0042s] [ 21%] 2025-12-04T14:02:33.5693226Z test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_uint8 PASSED [0.0044s] [ 21%] 2025-12-04T14:02:33.5693323Z test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_bool PASSED [0.9056s] [ 22%] 2025-12-04T14:02:33.5693425Z test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_complex128 PASSED [0.1279s] [ 22%] 2025-12-04T14:02:33.5693528Z test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_complex128 PASSED [0.0149s] [ 22%] 2025-12-04T14:02:33.5693622Z test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_float32 PASSED [0.0136s] [ 22%] 2025-12-04T14:02:33.5693718Z test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_int32 PASSED [0.0136s] 
[ 22%] 2025-12-04T14:02:33.5693811Z test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_int8 PASSED [0.0135s] [ 22%] 2025-12-04T14:02:33.5693904Z test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_uint8 PASSED [0.0153s] [ 22%] 2025-12-04T14:02:33.5694000Z test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_int8 PASSED [0.0096s] [ 22%] 2025-12-04T14:02:33.5694096Z test_meta.py::TestMetaCUDA::test_meta_outplace_argmax_cuda_int8 PASSED [0.9257s] [ 22%] 2025-12-04T14:02:33.5694191Z test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_int16 PASSED [0.0139s] [ 22%] 2025-12-04T14:02:33.5694289Z test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_int32 PASSED [0.0075s] [ 22%] 2025-12-04T14:02:33.5694386Z test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_int64 PASSED [0.0073s] [ 22%] 2025-12-04T14:02:33.5694484Z test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_int8 PASSED [0.9131s] [ 22%] 2025-12-04T14:02:33.5694594Z test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_int32 PASSED [0.1583s] [ 22%] 2025-12-04T14:02:33.5694704Z test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_bfloat16 PASSED [0.0813s] [ 22%] 2025-12-04T14:02:33.5694801Z test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_bool PASSED [0.9149s] [ 22%] 2025-12-04T14:02:33.5694922Z test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_complex128 PASSED [0.0059s] [ 22%] 2025-12-04T14:02:33.5695025Z test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_float32 PASSED [0.0042s] [ 22%] 2025-12-04T14:02:33.5695124Z test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_uint8 PASSED [0.9018s] [ 22%] 2025-12-04T14:02:33.5695237Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_bfloat16 PASSED [0.0062s] [ 22%] 2025-12-04T14:02:33.5695344Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_bool PASSED [0.0048s] [ 22%] 2025-12-04T14:02:33.5695461Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_complex64 PASSED [0.0046s] [ 22%] 2025-12-04T14:02:33.5695591Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_float16 PASSED [0.0046s] [ 22%] 2025-12-04T14:02:33.5695705Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_float32 PASSED [0.0045s] [ 23%] 2025-12-04T14:02:33.5695816Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_float64 PASSED [0.0045s] [ 23%] 2025-12-04T14:02:33.5695927Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_int32 PASSED [0.0045s] [ 23%] 2025-12-04T14:02:33.5696031Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_float32 PASSED [0.9185s] [ 23%] 2025-12-04T14:02:33.5696133Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_int32 PASSED [0.0047s] [ 23%] 2025-12-04T14:02:33.5696232Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_uint8 PASSED [0.0035s] [ 23%] 2025-12-04T14:02:33.5696358Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_bfloat16 PASSED [0.9118s] [ 23%] 2025-12-04T14:02:33.5696480Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_float16 PASSED [0.0050s] [ 23%] 2025-12-04T14:02:33.5696604Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_int64 PASSED [0.0035s] [ 23%] 2025-12-04T14:02:33.5696725Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_int8 PASSED [0.9105s] [ 23%] 2025-12-04T14:02:33.5696847Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_complex128 PASSED [0.0060s] [ 23%] 2025-12-04T14:02:33.5696961Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_float16 PASSED [0.0045s] [ 23%] 2025-12-04T14:02:33.5697078Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_float64 PASSED [0.0044s] [ 23%] 2025-12-04T14:02:33.5697191Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_int64 PASSED [0.9117s] [ 23%] 2025-12-04T14:02:33.5697303Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_uint8 PASSED [0.0060s] [ 23%] 2025-12-04T14:02:33.5697399Z test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_bool PASSED [0.0030s] [ 23%] 2025-12-04T14:02:33.5697497Z test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_float16 PASSED [0.9011s] [ 23%] 2025-12-04T14:02:33.5697594Z test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_float64 PASSED [0.0041s] [ 23%] 2025-12-04T14:02:33.5697688Z test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_int64 PASSED [0.9025s] [ 23%] 2025-12-04T14:02:33.5697783Z test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_uint8 PASSED [0.0042s] [ 23%] 2025-12-04T14:02:33.5697882Z test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_bfloat16 PASSED [0.9204s] [ 23%] 2025-12-04T14:02:33.5697989Z test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_complex128 PASSED [0.0042s] [ 23%] 2025-12-04T14:02:33.5698099Z test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_complex64 PASSED [0.9135s] [ 23%] 2025-12-04T14:02:33.5698199Z test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_float16 PASSED [0.0041s] [ 23%] 2025-12-04T14:02:33.5698292Z test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_int8 PASSED [0.9109s] [ 24%] 2025-12-04T14:02:33.5698392Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_float16 PASSED [0.0224s] [ 24%] 2025-12-04T14:02:33.5698497Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_float32 PASSED [0.0083s] [ 24%] 2025-12-04T14:02:33.5698593Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_float64 PASSED [0.0081s] [ 24%] 2025-12-04T14:02:33.5698689Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_bfloat16 PASSED [0.9202s] [ 24%] 2025-12-04T14:02:33.5698790Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_complex128 PASSED 
[0.0059s] [ 24%] 2025-12-04T14:02:33.5698887Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_complex64 PASSED [0.8990s] [ 24%] 2025-12-04T14:02:33.5698984Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_float16 PASSED [0.0041s] [ 24%] 2025-12-04T14:02:33.5699106Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_int64 PASSED [0.9192s] [ 24%] 2025-12-04T14:02:33.5699197Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_int8 PASSED [0.0042s] [ 24%] 2025-12-04T14:02:33.5699291Z test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_bool PASSED [0.9048s] [ 24%] 2025-12-04T14:02:33.5699393Z test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_complex128 PASSED [0.0059s] [ 24%] 2025-12-04T14:02:33.5699494Z test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_complex32 PASSED [0.9064s] [ 24%] 2025-12-04T14:02:33.5699593Z test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_complex64 PASSED [0.1707s] [ 24%] 2025-12-04T14:02:33.5699690Z test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_float16 PASSED [0.9084s] [ 24%] 2025-12-04T14:02:33.5699785Z test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_float64 PASSED [0.0041s] [ 24%] 2025-12-04T14:02:33.5699885Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_bool PASSED [0.0041s] [ 24%] 2025-12-04T14:02:33.5699994Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_complex32 PASSED [0.9022s] [ 24%] 2025-12-04T14:02:33.5700133Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_complex64 PASSED [0.0055s] [ 24%] 2025-12-04T14:02:33.5700239Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_float16 PASSED [0.0041s] [ 24%] 2025-12-04T14:02:33.5700343Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_float64 PASSED [0.9127s] [ 24%] 2025-12-04T14:02:33.5700444Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_int16 PASSED [0.0054s] [ 24%] 2025-12-04T14:02:33.5700545Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_int64 PASSED [0.0041s] [ 24%] 2025-12-04T14:02:33.5700644Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_uint8 PASSED [0.9127s] [ 25%] 2025-12-04T14:02:33.5700752Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_bfloat16 PASSED [0.0041s] [ 25%] 2025-12-04T14:02:33.5700855Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_int32 PASSED [0.9148s] [ 25%] 2025-12-04T14:02:33.5700954Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_int8 PASSED [0.0039s] [ 25%] 2025-12-04T14:02:33.5701055Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_uint8 PASSED [0.8982s] [ 25%] 2025-12-04T14:02:33.5701163Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_bfloat16 PASSED [0.0040s] [ 25%] 2025-12-04T14:02:33.5701273Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_complex32 PASSED [0.9110s] [ 25%] 2025-12-04T14:02:33.5701377Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_float32 PASSED [0.0039s] [ 25%] 2025-12-04T14:02:33.5701477Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_int32 PASSED [0.9072s] [ 25%] 2025-12-04T14:02:33.5701594Z test_meta.py::TestMetaCUDA::test_meta_outplace_bernoulli_cuda_float16 PASSED [0.0192s] [ 25%] 2025-12-04T14:02:33.5701699Z test_meta.py::TestMetaCUDA::test_meta_outplace_bernoulli_cuda_float32 PASSED [0.9234s] [ 25%] 2025-12-04T14:02:33.5701803Z test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_complex64 PASSED [0.0050s] [ 25%] 2025-12-04T14:02:33.5701904Z test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_float32 PASSED [0.0036s] [ 25%] 2025-12-04T14:02:33.5702017Z test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_float64 PASSED [0.9103s] [ 25%] 2025-12-04T14:02:33.5702115Z test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_int16 PASSED [0.0049s] [ 25%] 2025-12-04T14:02:33.5702213Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_uint8 PASSED [0.0036s] [ 25%] 2025-12-04T14:02:33.5702311Z test_meta.py::TestMetaCUDA::test_meta_outplace_bincount_cuda_int32 PASSED [0.0134s] [ 25%] 2025-12-04T14:02:33.5702407Z test_meta.py::TestMetaCUDA::test_meta_outplace_bincount_cuda_int8 PASSED [0.0071s] [ 25%] 2025-12-04T14:02:33.5702511Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_and_cuda_int16 PASSED [0.0084s] [ 25%] 2025-12-04T14:02:33.5702641Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_and_cuda_int32 PASSED [0.0081s] [ 25%] 2025-12-04T14:02:33.5702743Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_and_cuda_int8 PASSED [0.0080s] [ 25%] 2025-12-04T14:02:33.5702847Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_and_cuda_uint8 PASSED [0.0080s] [ 25%] 2025-12-04T14:02:33.5702958Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_int16 PASSED [0.0195s] [ 25%] 2025-12-04T14:02:33.5703069Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_int8 PASSED [0.0081s] [ 25%] 2025-12-04T14:02:33.5703172Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_not_cuda_int32 PASSED [0.9090s] [ 26%] 2025-12-04T14:02:33.5703275Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_not_cuda_int64 PASSED [0.0052s] [ 26%] 2025-12-04T14:02:33.5703376Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_not_cuda_int8 PASSED [0.0038s] [ 26%] 2025-12-04T14:02:33.5703479Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_bool PASSED [0.0082s] [ 26%] 2025-12-04T14:02:33.5703579Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_int16 PASSED [0.0081s] [ 26%] 2025-12-04T14:02:33.5703680Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_int32 PASSED [0.0082s] [ 26%] 2025-12-04T14:02:33.5703780Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_uint8 PASSED [0.0080s] [ 26%] 2025-12-04T14:02:33.5703894Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_int32 PASSED [0.0081s] [ 26%] 2025-12-04T14:02:33.5704007Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_int64 PASSED [0.0081s] [ 26%] 2025-12-04T14:02:33.5704119Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_int8 PASSED [0.0081s] [ 26%] 2025-12-04T14:02:33.5704231Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_uint8 PASSED [0.0080s] [ 26%] 2025-12-04T14:02:33.5704335Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_bool PASSED [0.0081s] [ 26%] 2025-12-04T14:02:33.5704437Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_int64 PASSED [0.0080s] [ 26%] 2025-12-04T14:02:33.5704538Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_int8 PASSED [0.0080s] [ 26%] 2025-12-04T14:02:33.5704642Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_uint8 PASSED [0.0080s] [ 26%] 2025-12-04T14:02:33.5704741Z test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_bool PASSED [0.0046s] [ 26%] 2025-12-04T14:02:33.5704850Z test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_complex128 PASSED [0.0046s] [ 26%] 2025-12-04T14:02:33.5704956Z test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_complex64 PASSED [0.0070s] [ 26%] 2025-12-04T14:02:33.5705071Z test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_float16 PASSED [0.0046s] [ 26%] 2025-12-04T14:02:33.5705172Z test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_int8 PASSED [0.9116s] [ 26%] 2025-12-04T14:02:33.5705270Z test_meta.py::TestMetaCUDA::test_meta_outplace_bmm_cuda_bfloat16 PASSED [0.0046s] [ 26%] 2025-12-04T14:02:33.5705366Z test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_bfloat16 PASSED [0.0037s] [ 26%] 2025-12-04T14:02:33.5705474Z test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_complex32 PASSED [0.9009s] [ 26%] 2025-12-04T14:02:33.5705569Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_float32 PASSED [0.0049s] [ 26%] 2025-12-04T14:02:33.5705661Z test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_int8 PASSED [0.0036s] [ 27%] 2025-12-04T14:02:33.5705779Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_complex64 PASSED [0.9096s] [ 27%] 2025-12-04T14:02:33.5705893Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_float16 PASSED [0.0038s] [ 27%] 2025-12-04T14:02:33.5706006Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_int16 PASSED [0.8964s] [ 27%] 2025-12-04T14:02:33.5706980Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_int32 PASSED [0.0038s] [ 27%] 2025-12-04T14:02:33.5707091Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_bfloat16 PASSED [0.0042s] [ 27%] 2025-12-04T14:02:33.5707197Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_bool PASSED [0.9169s] [ 27%] 2025-12-04T14:02:33.5707308Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_complex128 PASSED [0.0053s] [ 27%] 2025-12-04T14:02:33.5707415Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_float16 PASSED [0.0041s] [ 27%] 2025-12-04T14:02:33.5707522Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_float32 PASSED [0.9254s] [ 27%] 2025-12-04T14:02:33.5707627Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_float64 PASSED [0.0058s] [ 27%] 2025-12-04T14:02:33.5707733Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_uint8 PASSED [0.0041s] [ 27%] 2025-12-04T14:02:33.5707835Z test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_uint8 PASSED [0.0190s] [ 27%] 2025-12-04T14:02:33.5707928Z test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_bool PASSED [0.9078s] [ 27%] 2025-12-04T14:02:33.5708030Z test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_complex128 PASSED [0.0048s] [ 27%] 2025-12-04T14:02:33.5708126Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_float32 PASSED [0.0036s] [ 27%] 2025-12-04T14:02:33.5708221Z test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_float64 PASSED [0.9107s] [ 27%] 2025-12-04T14:02:33.5708333Z test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_bfloat16 PASSED [0.0038s] [ 27%] 2025-12-04T14:02:33.5708443Z test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_float16 PASSED [0.9101s] [ 27%] 2025-12-04T14:02:33.5708554Z test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_float64 PASSED [0.0038s] [ 27%] 2025-12-04T14:02:33.5708662Z test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_int32 PASSED [0.9016s] [ 27%] 2025-12-04T14:02:33.5708769Z test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_int8 PASSED [0.0037s] [ 27%] 2025-12-04T14:02:33.5708864Z test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_bfloat16 PASSED [0.0087s] [ 27%] 2025-12-04T14:02:33.5708967Z test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_complex128 PASSED [0.9139s] [ 28%] 2025-12-04T14:02:33.5709063Z test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_complex64 PASSED [0.0097s] [ 28%] 2025-12-04T14:02:33.5709158Z test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_float64 PASSED [0.0079s] [ 28%] 2025-12-04T14:02:33.5709250Z test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_int64 PASSED [0.0077s] [ 28%] 2025-12-04T14:02:33.5709347Z test_meta.py::TestMetaCUDA::test_meta_outplace_cauchy_cuda_float64 PASSED [0.9257s] [ 28%] 2025-12-04T14:02:33.5709457Z test_meta.py::TestMetaCUDA::test_meta_outplace_cdist_cuda_float64 PASSED [0.1630s] [ 28%] 2025-12-04T14:02:33.5709564Z test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_complex128 PASSED [0.0037s] [ 28%] 2025-12-04T14:02:33.5709663Z test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_float32 PASSED [0.9180s] [ 28%] 2025-12-04T14:02:33.5709774Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_float64 PASSED [0.0049s] [ 28%] 2025-12-04T14:02:33.5709869Z test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_float16 PASSED [0.9084s] [ 28%] 2025-12-04T14:02:33.5709963Z test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_float32 PASSED [0.0042s] [ 28%] 2025-12-04T14:02:33.5710057Z test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_float64 PASSED [0.9186s] [ 28%] 2025-12-04T14:02:33.5710183Z test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_int32 PASSED [0.0041s] [ 28%] 2025-12-04T14:02:33.5710279Z test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_bool PASSED [0.8987s] [ 28%] 2025-12-04T14:02:33.5710405Z test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_complex32 PASSED [0.0050s] [ 28%] 2025-12-04T14:02:33.5710523Z test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_complex64 PASSED [0.0036s] [ 28%] 2025-12-04T14:02:33.5710620Z test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_float64 PASSED [0.9075s] [ 28%] 2025-12-04T14:02:33.5710715Z test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_int32 PASSED [0.0048s] [ 28%] 2025-12-04T14:02:33.5710811Z test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_bfloat16 PASSED [0.0036s] [ 28%] 2025-12-04T14:02:33.5710905Z test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_bool PASSED [0.9014s] [ 28%] 2025-12-04T14:02:33.5711005Z test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_complex128 PASSED [0.0050s] [ 28%] 2025-12-04T14:02:33.5711100Z test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_int16 PASSED [0.0036s] [ 28%] 2025-12-04T14:02:33.5711195Z test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_int8 PASSED [0.9220s] [ 28%] 2025-12-04T14:02:33.5711292Z test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_bfloat16 PASSED [0.0049s] [ 28%] 2025-12-04T14:02:33.5711385Z test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_bool PASSED [0.0036s] [ 29%] 
2025-12-04T14:02:33.5711486Z test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_complex128 PASSED [0.9172s] [ 29%] 2025-12-04T14:02:33.5711581Z test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_float16 PASSED [0.0049s] [ 29%] 2025-12-04T14:02:33.5711673Z test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_int16 PASSED [0.0036s] [ 29%] 2025-12-04T14:02:33.5711766Z test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_uint8 PASSED [0.9022s] [ 29%] 2025-12-04T14:02:33.5711867Z test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_cuda_float64 PASSED [0.0250s] [ 29%] 2025-12-04T14:02:33.5711986Z test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_inverse_cuda_complex128 PASSED [0.0955s] [ 29%] 2025-12-04T14:02:33.5712100Z test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_inverse_cuda_float32 PASSED [2.7881s] [ 29%] 2025-12-04T14:02:33.5712212Z test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_inverse_cuda_float64 PASSED [0.0584s] [ 29%] 2025-12-04T14:02:33.5712309Z test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_float16 PASSED [0.0033s] [ 29%] 2025-12-04T14:02:33.5712407Z test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_float32 PASSED [0.9147s] [ 29%] 2025-12-04T14:02:33.5712501Z test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_int16 PASSED [0.0043s] [ 29%] 2025-12-04T14:02:33.5712595Z test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_int64 PASSED [0.9240s] [ 29%] 2025-12-04T14:02:33.5712692Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_bfloat16 PASSED [0.0135s] [ 29%] 2025-12-04T14:02:33.5712786Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_int32 PASSED [0.9283s] [ 29%] 2025-12-04T14:02:33.5712894Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_bool PASSED [0.0129s] [ 29%] 2025-12-04T14:02:33.5713000Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_float16 PASSED [0.0115s] [ 29%] 2025-12-04T14:02:33.5713101Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_int16 PASSED [0.0111s] [ 29%] 2025-12-04T14:02:33.5713213Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_int64 PASSED [0.0111s] [ 29%] 2025-12-04T14:02:33.5713312Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_uint8 PASSED [0.0110s] [ 29%] 2025-12-04T14:02:33.5713415Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_float32 PASSED [0.0112s] [ 29%] 2025-12-04T14:02:33.5713513Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_int16 PASSED [0.0111s] [ 29%] 2025-12-04T14:02:33.5713611Z test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_complex64 PASSED [0.9048s] [ 29%] 2025-12-04T14:02:33.5713708Z test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_float32 PASSED [0.0040s] [ 30%] 2025-12-04T14:02:33.5713819Z test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_int32 PASSED [0.9108s] [ 30%] 2025-12-04T14:02:33.5713924Z test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_int8 PASSED [0.0040s] [ 30%] 2025-12-04T14:02:33.5714032Z test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_bfloat16 PASSED [0.0047s] [ 30%] 2025-12-04T14:02:33.5714140Z test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_float32 PASSED [0.0042s] [ 30%] 2025-12-04T14:02:33.5714245Z test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_int32 PASSED [0.9043s] [ 30%] 2025-12-04T14:02:33.5714350Z test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_int64 PASSED [0.0058s] [ 30%] 2025-12-04T14:02:33.5714454Z test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_uint8 PASSED [0.0043s] [ 30%] 2025-12-04T14:02:33.5714564Z test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_bfloat16 PASSED [0.0134s] [ 30%] 2025-12-04T14:02:33.5714669Z test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_bool PASSED [0.0127s] [ 30%] 2025-12-04T14:02:33.5714782Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_complex128 PASSED [0.9180s] [ 30%] 2025-12-04T14:02:33.5714891Z test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_float64 PASSED [0.0147s] [ 30%] 2025-12-04T14:02:33.5714990Z test_meta.py::TestMetaCUDA::test_meta_outplace_complex_cuda_float16 PASSED [0.0126s] [ 30%] 2025-12-04T14:02:33.5715090Z test_meta.py::TestMetaCUDA::test_meta_outplace_complex_cuda_float32 PASSED [0.0079s] [ 30%] 2025-12-04T14:02:33.5715188Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_complex128 PASSED [0.9077s] [ 30%] 2025-12-04T14:02:33.5715284Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_float16 PASSED [0.0042s] [ 30%] 2025-12-04T14:02:33.5715379Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_float32 PASSED [0.9174s] [ 30%] 2025-12-04T14:02:33.5715475Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_float64 PASSED [0.0042s] [ 30%] 2025-12-04T14:02:33.5715588Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_bfloat16 PASSED [0.9171s] [ 30%] 2025-12-04T14:02:33.5715700Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_complex32 PASSED [0.0057s] [ 30%] 2025-12-04T14:02:33.5715807Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_float64 PASSED [0.9182s] [ 30%] 2025-12-04T14:02:33.5715914Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_int16 PASSED [0.0040s] [ 30%] 2025-12-04T14:02:33.5716025Z test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_bfloat16 PASSED [0.0157s] [ 30%] 2025-12-04T14:02:33.5716133Z test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_bool PASSED [0.0149s] [ 30%] 2025-12-04T14:02:33.5716241Z test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_int32 PASSED [0.0149s] [ 31%] 2025-12-04T14:02:33.5716360Z test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_int8 PASSED [0.0147s] [ 31%] 2025-12-04T14:02:33.5716470Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_uint8 PASSED [0.9312s] [ 31%] 2025-12-04T14:02:33.5716571Z test_meta.py::TestMetaCUDA::test_meta_outplace_contiguous_cuda_bool PASSED [0.0039s] [ 31%] 2025-12-04T14:02:33.5716672Z test_meta.py::TestMetaCUDA::test_meta_outplace_contiguous_cuda_int16 PASSED [0.9045s] [ 31%] 2025-12-04T14:02:33.5716780Z test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_bool PASSED [0.0198s] [ 31%] 2025-12-04T14:02:33.5716884Z test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_float64 PASSED [0.9225s] [ 31%] 2025-12-04T14:02:33.5716982Z test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_int32 PASSED [0.0126s] [ 31%] 2025-12-04T14:02:33.5717080Z test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_int8 PASSED [0.0110s] [ 31%] 2025-12-04T14:02:33.5717177Z test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_uint8 PASSED [0.0109s] [ 31%] 2025-12-04T14:02:33.5717290Z test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_bfloat16 PASSED [1.2128s] [ 31%] 2025-12-04T14:02:33.5717408Z test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_complex128 PASSED [0.9132s] [ 31%] 2025-12-04T14:02:33.5717510Z test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_float16 PASSED [0.6773s] [ 31%] 2025-12-04T14:02:33.5717606Z test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_int8 PASSED [0.0060s] [ 31%] 2025-12-04T14:02:33.5717699Z test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_int64 PASSED [0.9097s] [ 31%] 2025-12-04T14:02:33.5717791Z test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_bool PASSED [0.0052s] [ 31%] 2025-12-04T14:02:33.5717890Z test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_complex64 PASSED [0.0055s] [ 31%] 2025-12-04T14:02:33.5717984Z test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_float16 PASSED [0.9136s] [ 31%] 2025-12-04T14:02:33.5718080Z test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_float64 PASSED 
[0.0051s] [ 31%] 2025-12-04T14:02:33.5718176Z test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_int64 PASSED [0.0038s] [ 31%] 2025-12-04T14:02:33.5718291Z test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_bfloat16 PASSED [0.0095s] [ 31%] 2025-12-04T14:02:33.5718403Z test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_complex64 PASSED [0.0092s] [ 31%] 2025-12-04T14:02:33.5718512Z test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_float32 PASSED [0.0090s] [ 31%] 2025-12-04T14:02:33.5718620Z test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_float64 PASSED [0.9069s] [ 31%] 2025-12-04T14:02:33.5718725Z test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_int32 PASSED [0.0111s] [ 32%] 2025-12-04T14:02:33.5718830Z test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_int8 PASSED [0.0092s] [ 32%] 2025-12-04T14:02:33.5718925Z test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_float16 PASSED [0.0283s] [ 32%] 2025-12-04T14:02:33.5719018Z test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_int32 PASSED [0.0269s] [ 32%] 2025-12-04T14:02:33.5719110Z test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_int8 PASSED [0.0292s] [ 32%] 2025-12-04T14:02:33.5719206Z test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_float64 PASSED [0.0075s] [ 32%] 2025-12-04T14:02:33.5719302Z test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_int16 PASSED [0.0038s] [ 32%] 2025-12-04T14:02:33.5719395Z test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_int8 PASSED [0.9195s] [ 32%] 2025-12-04T14:02:33.5719494Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_bfloat16 PASSED [0.0091s] [ 32%] 2025-12-04T14:02:33.5719589Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_int32 PASSED [0.9055s] [ 32%] 2025-12-04T14:02:33.5719683Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_uint8 PASSED [0.0045s] [ 32%] 2025-12-04T14:02:33.5719792Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_bfloat16 PASSED [0.9153s] [ 32%] 2025-12-04T14:02:33.5719889Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_bool PASSED [0.0045s] [ 32%] 2025-12-04T14:02:33.5719985Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_int64 PASSED [0.9116s] [ 32%] 2025-12-04T14:02:33.5720078Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_uint8 PASSED [0.0046s] [ 32%] 2025-12-04T14:02:33.5720224Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_complex64 PASSED [0.0229s] [ 32%] 2025-12-04T14:02:33.5720323Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_float16 PASSED [0.9297s] [ 32%] 2025-12-04T14:02:33.5720421Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_int16 PASSED [0.0123s] [ 32%] 2025-12-04T14:02:33.5720517Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_int64 PASSED [0.0101s] [ 32%] 2025-12-04T14:02:33.5720613Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_uint8 PASSED [0.9326s] [ 32%] 2025-12-04T14:02:33.5720713Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_bfloat16 PASSED [0.0076s] [ 32%] 2025-12-04T14:02:33.5720836Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_uint8 PASSED [0.9149s] [ 32%] 2025-12-04T14:02:33.5720959Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_bfloat16 PASSED [0.0134s] [ 32%] 2025-12-04T14:02:33.5721083Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_complex64 PASSED [0.0091s] [ 33%] 2025-12-04T14:02:33.5721202Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_float64 PASSED [0.0087s] [ 33%] 2025-12-04T14:02:33.5721316Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_int16 PASSED [0.9181s] [ 33%] 2025-12-04T14:02:33.5721431Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_int64 PASSED [0.0107s] [ 33%] 2025-12-04T14:02:33.5721545Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_uint8 PASSED [0.0089s] [ 33%] 2025-12-04T14:02:33.5721648Z test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_float32 PASSED [0.0028s] [ 33%] 2025-12-04T14:02:33.5721746Z test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_int16 PASSED [0.9189s] [ 33%] 2025-12-04T14:02:33.5721842Z test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_int32 PASSED [0.0043s] [ 33%] 2025-12-04T14:02:33.5721938Z test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_uint8 PASSED [0.9154s] [ 33%] 2025-12-04T14:02:33.5722037Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_complex128 PASSED [0.0116s] [ 33%] 2025-12-04T14:02:33.5722134Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_complex32 PASSED [0.0096s] [ 33%] 2025-12-04T14:02:33.5722231Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_complex64 PASSED [0.0093s] [ 33%] 2025-12-04T14:02:33.5722325Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_int32 PASSED [0.9269s] [ 33%] 2025-12-04T14:02:33.5722419Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_int64 PASSED [0.0110s] [ 33%] 2025-12-04T14:02:33.5722527Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_complex32 PASSED [0.0106s] [ 33%] 2025-12-04T14:02:33.5722631Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_float16 PASSED [0.0101s] [ 33%] 2025-12-04T14:02:33.5722733Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_int16 PASSED [0.9250s] [ 33%] 2025-12-04T14:02:33.5722835Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_int32 PASSED [0.0124s] [ 33%] 2025-12-04T14:02:33.5722935Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_int8 PASSED [0.0103s] [ 33%] 2025-12-04T14:02:33.5723039Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_bfloat16 PASSED [0.0048s] [ 33%] 2025-12-04T14:02:33.5723147Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_complex128 PASSED [0.0047s] [ 33%] 2025-12-04T14:02:33.5723247Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_float16 PASSED [0.9176s] [ 33%] 2025-12-04T14:02:33.5723363Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_float64 PASSED [0.0070s] [ 33%] 2025-12-04T14:02:33.5723462Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_int16 PASSED [0.0050s] [ 33%] 2025-12-04T14:02:33.5723559Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_int8 PASSED [0.0048s] [ 34%] 2025-12-04T14:02:33.5723667Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_uint8 PASSED [0.8706s] [ 34%] 2025-12-04T14:02:33.5723782Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_complex128 PASSED [0.0101s] [ 34%] 2025-12-04T14:02:33.5723894Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_complex32 PASSED [0.0089s] [ 34%] 2025-12-04T14:02:33.5724006Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_complex64 PASSED [0.0087s] [ 34%] 2025-12-04T14:02:33.5724101Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_bool PASSED [0.0057s] [ 34%] 2025-12-04T14:02:33.5724208Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_complex32 PASSED [0.0057s] [ 34%] 2025-12-04T14:02:33.5724331Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_float16 PASSED [0.0057s] [ 34%] 2025-12-04T14:02:33.5724432Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_float32 PASSED [0.0057s] [ 34%] 2025-12-04T14:02:33.5724544Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_int16 PASSED [0.0070s] [ 34%] 2025-12-04T14:02:33.5724653Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_int64 PASSED [0.0069s] [ 34%] 2025-12-04T14:02:33.5724762Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_uint8 PASSED [0.0070s] [ 34%] 2025-12-04T14:02:33.5724859Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_complex64 PASSED [0.0611s] [ 34%] 2025-12-04T14:02:33.5724955Z test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_float32 PASSED [0.0602s] [ 34%] 2025-12-04T14:02:33.5725048Z test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_int16 PASSED [0.9095s] [ 34%] 2025-12-04T14:02:33.5725150Z test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_float32 PASSED [0.0063s] [ 34%] 2025-12-04T14:02:33.5725245Z test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_int32 PASSED [0.0048s] [ 34%] 2025-12-04T14:02:33.5725341Z test_meta.py::TestMetaCUDA::test_meta_outplace_dist_cuda_bfloat16 PASSED [0.0299s] [ 34%] 2025-12-04T14:02:33.5725457Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_bfloat16 PASSED [0.0440s] [ 34%] 2025-12-04T14:02:33.5725571Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_float32 PASSED [0.0301s] [ 34%] 2025-12-04T14:02:33.5725682Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_uint8 PASSED [0.0091s] [ 34%] 2025-12-04T14:02:33.5725794Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_bool PASSED [0.0086s] [ 34%] 2025-12-04T14:02:33.5725913Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_complex128 PASSED [0.0086s] [ 34%] 2025-12-04T14:02:33.5726034Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_complex64 PASSED [0.0086s] [ 35%] 2025-12-04T14:02:33.5726149Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_float16 PASSED [0.0087s] [ 35%] 2025-12-04T14:02:33.5726264Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_float32 PASSED [0.0085s] [ 35%] 2025-12-04T14:02:33.5726377Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_int32 PASSED [0.0086s] [ 35%] 2025-12-04T14:02:33.5726490Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_int64 PASSED 
[0.0086s] [ 35%] 2025-12-04T14:02:33.5726604Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_float32 PASSED [0.0093s] [ 35%] 2025-12-04T14:02:33.5726714Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_int32 PASSED [0.0086s] [ 35%] 2025-12-04T14:02:33.5726835Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_int8 PASSED [0.0086s] [ 35%] 2025-12-04T14:02:33.5726943Z test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_complex128 PASSED [0.8551s] [ 35%] 2025-12-04T14:02:33.5727042Z test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_int32 PASSED [0.0046s] [ 35%] 2025-12-04T14:02:33.5727155Z test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_int8 PASSED [0.0035s] [ 35%] 2025-12-04T14:02:33.5727251Z test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_bool PASSED [0.8514s] [ 35%] 2025-12-04T14:02:33.5727353Z test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_complex64 PASSED [0.0039s] [ 35%] 2025-12-04T14:02:33.5727454Z test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_float16 PASSED [0.8531s] [ 35%] 2025-12-04T14:02:33.5727551Z test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_float64 PASSED [0.0039s] [ 35%] 2025-12-04T14:02:33.5727647Z test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_int8 PASSED [0.8536s] [ 35%] 2025-12-04T14:02:33.5727757Z test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_bfloat16 PASSED [0.0051s] [ 35%] 2025-12-04T14:02:33.5727862Z test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_bool PASSED [0.0038s] [ 35%] 2025-12-04T14:02:33.5727967Z test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_complex128 PASSED [0.0037s] [ 35%] 2025-12-04T14:02:33.5728065Z test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_float32 PASSED [0.8542s] [ 35%] 2025-12-04T14:02:33.5728160Z test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_int16 PASSED [0.0052s] [ 35%] 2025-12-04T14:02:33.5728254Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_int64 PASSED [0.0038s] [ 35%] 2025-12-04T14:02:33.5728351Z test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_float64 PASSED [0.9089s] [ 35%] 2025-12-04T14:02:33.5728445Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_bool PASSED [0.0037s] [ 35%] 2025-12-04T14:02:33.5728548Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_complex128 PASSED [0.0030s] [ 36%] 2025-12-04T14:02:33.5728645Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_float64 PASSED [0.0027s] [ 36%] 2025-12-04T14:02:33.5728740Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_int32 PASSED [0.0028s] [ 36%] 2025-12-04T14:02:33.5728844Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_float16 PASSED [0.0057s] [ 36%] 2025-12-04T14:02:33.5728947Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_int16 PASSED [0.0051s] [ 36%] 2025-12-04T14:02:33.5729060Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_complex32 PASSED [0.0073s] [ 36%] 2025-12-04T14:02:33.5729168Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_int16 PASSED [0.0073s] [ 36%] 2025-12-04T14:02:33.5729273Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_int32 PASSED [0.0073s] [ 36%] 2025-12-04T14:02:33.5729380Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_int8 PASSED [0.0073s] [ 36%] 2025-12-04T14:02:33.5729492Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_complex64 PASSED [0.0036s] [ 36%] 2025-12-04T14:02:33.5729600Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_uint8 PASSED [0.8776s] [ 36%] 2025-12-04T14:02:33.5729694Z test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_bfloat16 PASSED [0.0107s] [ 36%] 2025-12-04T14:02:33.5729787Z test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_bool PASSED [0.0089s] [ 36%] 2025-12-04T14:02:33.5729883Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_complex64 PASSED [0.0088s] [ 36%] 2025-12-04T14:02:33.5729976Z test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_float32 PASSED [0.8795s] [ 36%] 2025-12-04T14:02:33.5730068Z test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_int32 PASSED [0.0105s] [ 36%] 2025-12-04T14:02:33.5730188Z test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_int8 PASSED [0.0089s] [ 36%] 2025-12-04T14:02:33.5730291Z test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_uint8 PASSED [0.0087s] [ 36%] 2025-12-04T14:02:33.5730389Z test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_float32 PASSED [0.8622s] [ 36%] 2025-12-04T14:02:33.5730485Z test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_int32 PASSED [0.0045s] [ 36%] 2025-12-04T14:02:33.5730577Z test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_int8 PASSED [0.8606s] [ 36%] 2025-12-04T14:02:33.5730686Z test_meta.py::TestMetaCUDA::test_meta_outplace_erf_cuda_int8 PASSED [0.0041s] [ 36%] 2025-12-04T14:02:33.5730782Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_bfloat16 PASSED [0.0052s] [ 36%] 2025-12-04T14:02:33.5730878Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_float64 PASSED [0.8649s] [ 36%] 2025-12-04T14:02:33.5730970Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_int16 PASSED [0.0052s] [ 37%] 2025-12-04T14:02:33.5731062Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_int32 PASSED [0.0037s] [ 37%] 2025-12-04T14:02:33.5731154Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_int64 PASSED [0.8642s] [ 37%] 2025-12-04T14:02:33.5731278Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_float16 PASSED [0.0056s] [ 37%] 2025-12-04T14:02:33.5731375Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_float32 PASSED [0.8624s] [ 37%] 2025-12-04T14:02:33.5731472Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_int16 PASSED [0.0061s] [ 37%] 2025-12-04T14:02:33.5731567Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_bfloat16 PASSED [0.0053s] [ 37%] 2025-12-04T14:02:33.5731661Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_bool PASSED [0.8598s] [ 37%] 2025-12-04T14:02:33.5731758Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_float32 PASSED [0.0059s] [ 37%] 2025-12-04T14:02:33.5731850Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_int32 PASSED [0.0037s] [ 37%] 2025-12-04T14:02:33.5731943Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_int8 PASSED [0.8648s] [ 37%] 2025-12-04T14:02:33.5732040Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_complex32 PASSED [0.0069s] [ 37%] 2025-12-04T14:02:33.5732135Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_float32 PASSED [0.0037s] [ 37%] 2025-12-04T14:02:33.5732227Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_int16 PASSED [0.8615s] [ 37%] 2025-12-04T14:02:33.5732320Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_uint8 PASSED [0.0052s] [ 37%] 2025-12-04T14:02:33.5732424Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_float32 PASSED [0.0033s] [ 37%] 2025-12-04T14:02:33.5732529Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_float64 PASSED [0.8601s] [ 37%] 2025-12-04T14:02:33.5732629Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_int16 PASSED [0.0045s] [ 37%] 2025-12-04T14:02:33.5732729Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_int32 PASSED [0.8607s] [ 37%] 2025-12-04T14:02:33.5732826Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_int8 PASSED [0.0044s] [ 37%] 2025-12-04T14:02:33.5732931Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_int32 PASSED [0.0065s] [ 37%] 2025-12-04T14:02:33.5733034Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_int64 PASSED [0.0062s] [ 37%] 2025-12-04T14:02:33.5733129Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_int32 PASSED [0.0043s] [ 37%] 
2025-12-04T14:02:33.5733224Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_uint8 PASSED [0.0043s] [ 38%] 2025-12-04T14:02:33.5733326Z test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_bfloat16 PASSED [0.8615s] [ 38%] 2025-12-04T14:02:33.5733427Z test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_complex128 PASSED [0.0041s] [ 38%] 2025-12-04T14:02:33.5733524Z test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_float16 PASSED [0.8623s] [ 38%] 2025-12-04T14:02:33.5733619Z test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_float32 PASSED [0.0041s] [ 38%] 2025-12-04T14:02:33.5733724Z test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_float64 PASSED [0.8582s] [ 38%] 2025-12-04T14:02:33.5733834Z test_meta.py::TestMetaCUDA::test_meta_outplace_exponential_cuda_bfloat16 PASSED [0.0105s] [ 38%] 2025-12-04T14:02:33.5733942Z test_meta.py::TestMetaCUDA::test_meta_outplace_exponential_cuda_float16 PASSED [0.0048s] [ 38%] 2025-12-04T14:02:33.5734060Z test_meta.py::TestMetaCUDA::test_meta_outplace_exponential_cuda_float64 PASSED [0.0045s] [ 38%] 2025-12-04T14:02:33.5734154Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float64 PASSED [0.0377s] [ 38%] 2025-12-04T14:02:33.5734257Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float8_e4m3fn PASSED [0.0375s] [ 38%] 2025-12-04T14:02:33.5734357Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float8_e5m2 PASSED [0.0373s] [ 38%] 2025-12-04T14:02:33.5734449Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_int16 PASSED [0.8958s] [ 38%] 2025-12-04T14:02:33.5734545Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_int32 PASSED [0.0390s] [ 38%] 2025-12-04T14:02:33.5734663Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_bool PASSED [3.2546s] [ 38%] 2025-12-04T14:02:33.5734767Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_complex32 PASSED [2.8504s] [ 38%] 2025-12-04T14:02:33.5734871Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_complex64 PASSED [0.0065s] [ 38%] 2025-12-04T14:02:33.5734972Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_float32 PASSED [0.0056s] [ 38%] 2025-12-04T14:02:33.5735070Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_int32 PASSED [0.0054s] [ 38%] 2025-12-04T14:02:33.5735165Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_int8 PASSED [0.0054s] [ 38%] 2025-12-04T14:02:33.5735261Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_int16 PASSED [1.1606s] [ 38%] 2025-12-04T14:02:33.5735367Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_complex128 PASSED [3.8867s] [ 38%] 2025-12-04T14:02:33.5735473Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_complex32 PASSED [1.2087s] [ 38%] 2025-12-04T14:02:33.5735575Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_float16 PASSED [0.0070s] [ 38%] 2025-12-04T14:02:33.5735671Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_int32 PASSED [2.0501s] [ 39%] 2025-12-04T14:02:33.5735769Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_uint8 PASSED [0.0073s] [ 39%] 2025-12-04T14:02:33.5735881Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_complex128 PASSED [0.0091s] [ 39%] 2025-12-04T14:02:33.5735984Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_uint8 PASSED [0.0045s] [ 39%] 2025-12-04T14:02:33.5736080Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_bool PASSED [1.4536s] [ 39%] 2025-12-04T14:02:33.5736187Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_complex128 PASSED [1.7607s] [ 39%] 2025-12-04T14:02:33.5736293Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_complex64 PASSED [0.0076s] [ 39%] 2025-12-04T14:02:33.5736395Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_int16 PASSED [0.0064s] [ 39%] 2025-12-04T14:02:33.5736494Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_int64 PASSED [0.0061s] [ 39%] 2025-12-04T14:02:33.5736593Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_uint8 PASSED [0.0061s] [ 39%] 2025-12-04T14:02:33.5736698Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_complex128 PASSED [0.4711s] [ 39%] 2025-12-04T14:02:33.5736796Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_int16 PASSED [0.4680s] [ 39%] 2025-12-04T14:02:33.5736902Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_complex128 PASSED [0.0075s] [ 39%] 2025-12-04T14:02:33.5737007Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_float32 PASSED [0.0072s] [ 39%] 2025-12-04T14:02:33.5737114Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_uint8 PASSED [0.0071s] [ 39%] 2025-12-04T14:02:33.5737222Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_complex64 PASSED [0.0055s] [ 39%] 2025-12-04T14:02:33.5737325Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_float32 PASSED [0.0055s] [ 39%] 2025-12-04T14:02:33.5737424Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_uint8 PASSED [0.0054s] [ 39%] 2025-12-04T14:02:33.5737539Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_complex128 PASSED [0.6079s] [ 39%] 2025-12-04T14:02:33.5737640Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_float16 PASSED [1.3272s] [ 39%] 2025-12-04T14:02:33.5737738Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_int8 PASSED [0.0084s] [ 39%] 2025-12-04T14:02:33.5737835Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_bool PASSED [0.0068s] [ 39%] 2025-12-04T14:02:33.5737937Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_float32 PASSED [0.0065s] [ 39%] 2025-12-04T14:02:33.5738046Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_int32 PASSED [0.0065s] [ 40%] 2025-12-04T14:02:33.5738155Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_int64 PASSED [0.0064s] [ 40%] 2025-12-04T14:02:33.5738254Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_uint8 PASSED [0.0065s] [ 40%] 2025-12-04T14:02:33.5738359Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_bool PASSED [0.0047s] [ 40%] 2025-12-04T14:02:33.5738470Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_complex32 PASSED [0.0045s] [ 40%] 2025-12-04T14:02:33.5738583Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_complex64 PASSED [1.5228s] [ 40%] 2025-12-04T14:02:33.5738688Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_int64 PASSED [0.0065s] [ 40%] 2025-12-04T14:02:33.5738792Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_float64 PASSED [0.7707s] [ 40%] 2025-12-04T14:02:33.5738892Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_int32 PASSED [0.3152s] [ 40%] 2025-12-04T14:02:33.5738992Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_bool PASSED [0.0077s] [ 40%] 2025-12-04T14:02:33.5739094Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_float64 PASSED [0.3068s] [ 40%] 2025-12-04T14:02:33.5739199Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_float32 PASSED [0.0086s] [ 40%] 2025-12-04T14:02:33.5739302Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_float64 PASSED [0.0080s] [ 40%] 2025-12-04T14:02:33.5739403Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_int32 PASSED [0.0080s] [ 40%] 2025-12-04T14:02:33.5739505Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_float16 PASSED [1.5425s] [ 40%] 2025-12-04T14:02:33.5739606Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_float64 PASSED [0.4291s] [ 40%] 2025-12-04T14:02:33.5739709Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_float32 PASSED [0.0068s] [ 40%] 2025-12-04T14:02:33.5739809Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_int16 PASSED [0.0062s] [ 40%] 2025-12-04T14:02:33.5739913Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_int64 PASSED [0.0061s] [ 40%] 2025-12-04T14:02:33.5740012Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_bool PASSED [0.2961s] [ 40%] 2025-12-04T14:02:33.5740149Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_complex128 PASSED [0.0075s] [ 40%] 2025-12-04T14:02:33.5740256Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_complex32 PASSED [0.6017s] [ 40%] 2025-12-04T14:02:33.5740359Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_float16 PASSED [0.0069s] [ 40%] 2025-12-04T14:02:33.5740458Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_int64 PASSED [0.4348s] [ 40%] 2025-12-04T14:02:33.5740578Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_uint8 PASSED [0.0060s] [ 41%] 2025-12-04T14:02:33.5740683Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_float16 PASSED [0.4652s] [ 41%] 2025-12-04T14:02:33.5740782Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_int16 PASSED [0.0075s] [ 41%] 2025-12-04T14:02:33.5740878Z test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_float32 PASSED [1.4914s] [ 41%] 2025-12-04T14:02:33.5740986Z test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_float64 PASSED [0.0048s] [ 41%] 2025-12-04T14:02:33.5741081Z test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_int64 PASSED [1.4867s] [ 41%] 2025-12-04T14:02:33.5741173Z test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_int8 PASSED [0.0047s] [ 41%] 2025-12-04T14:02:33.5741278Z test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_complex128 PASSED [0.0041s] [ 41%] 2025-12-04T14:02:33.5741381Z test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_complex32 PASSED [1.5098s] [ 41%] 2025-12-04T14:02:33.5741485Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_complex64 PASSED [0.0054s] [ 41%] 2025-12-04T14:02:33.5741608Z test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_int32 PASSED [0.0040s] [ 41%] 2025-12-04T14:02:33.5741709Z test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_complex128 PASSED [0.0053s] [ 41%] 2025-12-04T14:02:33.5741804Z test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_int16 PASSED [0.0049s] [ 41%] 2025-12-04T14:02:33.5741897Z test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_uint8 PASSED [0.0048s] [ 41%] 2025-12-04T14:02:33.5741997Z test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_bfloat16 PASSED [0.0026s] [ 41%] 2025-12-04T14:02:33.5742101Z test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_complex128 PASSED [0.0027s] [ 41%] 2025-12-04T14:02:33.5742202Z test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_complex64 PASSED [0.0026s] [ 41%] 2025-12-04T14:02:33.5742298Z test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_int16 PASSED [0.0027s] [ 41%] 2025-12-04T14:02:33.5742395Z test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_int8 PASSED [0.0025s] [ 41%] 2025-12-04T14:02:33.5742500Z test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_complex128 PASSED [0.0026s] [ 41%] 2025-12-04T14:02:33.5742597Z test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_float16 PASSED [0.0027s] [ 41%] 2025-12-04T14:02:33.5742696Z test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_float32 PASSED [0.0025s] [ 41%] 2025-12-04T14:02:33.5742791Z test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_int16 PASSED [0.0027s] [ 41%] 2025-12-04T14:02:33.5742887Z test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_int64 PASSED [0.0025s] [ 41%] 2025-12-04T14:02:33.5742982Z test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_int8 PASSED [0.0025s] [ 42%] 2025-12-04T14:02:33.5743081Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_complex32 PASSED [1.5037s] [ 
42%] 2025-12-04T14:02:33.5743181Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_complex64 PASSED [0.0052s] [ 42%] 2025-12-04T14:02:33.5743279Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_float64 PASSED [0.0036s] [ 42%] 2025-12-04T14:02:33.5743374Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_int16 PASSED [1.4928s] [ 42%] 2025-12-04T14:02:33.5743467Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_int8 PASSED [0.0052s] [ 42%] 2025-12-04T14:02:33.5743563Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_uint8 PASSED [0.0038s] [ 42%] 2025-12-04T14:02:33.5743670Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_bfloat16 PASSED [0.0090s] [ 42%] 2025-12-04T14:02:33.5743780Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_complex128 PASSED [0.0082s] [ 42%] 2025-12-04T14:02:33.5743886Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_float32 PASSED [0.0083s] [ 42%] 2025-12-04T14:02:33.5743989Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_int16 PASSED [0.0083s] [ 42%] 2025-12-04T14:02:33.5744102Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_int64 PASSED [0.0083s] [ 42%] 2025-12-04T14:02:33.5744206Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_uint8 PASSED [0.0082s] [ 42%] 2025-12-04T14:02:33.5744304Z test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_bfloat16 PASSED [1.5121s] [ 42%] 2025-12-04T14:02:33.5744411Z test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_float16 PASSED [0.0045s] [ 42%] 2025-12-04T14:02:33.5744507Z test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_float64 PASSED [1.4522s] [ 42%] 2025-12-04T14:02:33.5744602Z test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_int64 PASSED [0.0046s] [ 42%] 2025-12-04T14:02:33.5744708Z test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_float64 PASSED [0.0307s] [ 42%] 2025-12-04T14:02:33.5744811Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_int8 PASSED [0.0156s] [ 42%] 2025-12-04T14:02:33.5744904Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_bool PASSED [0.0081s] [ 42%] 2025-12-04T14:02:33.5745024Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_float16 PASSED [0.0082s] [ 42%] 2025-12-04T14:02:33.5745120Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_float32 PASSED [0.0081s] [ 42%] 2025-12-04T14:02:33.5745213Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_int16 PASSED [0.0080s] [ 42%] 2025-12-04T14:02:33.5745309Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_float32 PASSED [0.0081s] [ 43%] 2025-12-04T14:02:33.5745400Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_int8 PASSED [0.0085s] [ 43%] 2025-12-04T14:02:33.5745495Z test_meta.py::TestMetaCUDA::test_meta_outplace_frac_cuda_float32 PASSED [0.0031s] [ 43%] 2025-12-04T14:02:33.5745593Z test_meta.py::TestMetaCUDA::test_meta_outplace_frexp_cuda_bfloat16 PASSED [1.4573s] [ 43%] 2025-12-04T14:02:33.5745690Z test_meta.py::TestMetaCUDA::test_meta_outplace_frexp_cuda_float16 PASSED [0.0048s] [ 43%] 2025-12-04T14:02:33.5745786Z test_meta.py::TestMetaCUDA::test_meta_outplace_frexp_cuda_float64 PASSED [1.4646s] [ 43%] 2025-12-04T14:02:33.5745888Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_complex128 PASSED [0.0057s] [ 43%] 2025-12-04T14:02:33.5745986Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_complex32 PASSED [0.0040s] [ 43%] 2025-12-04T14:02:33.5746091Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_bfloat16 PASSED [0.0065s] [ 43%] 2025-12-04T14:02:33.5746193Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_float16 PASSED [1.4926s] [ 43%] 2025-12-04T14:02:33.5746291Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_int8 PASSED [0.0076s] [ 43%] 2025-12-04T14:02:33.5746393Z test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_complex128 PASSED [0.0152s] [ 
43%] 2025-12-04T14:02:33.5746496Z test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_complex64 PASSED [0.0065s] [ 43%] 2025-12-04T14:02:33.5746591Z test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_int16 PASSED [0.0064s] [ 43%] 2025-12-04T14:02:33.5746689Z test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_int64 PASSED [0.0063s] [ 43%] 2025-12-04T14:02:33.5746784Z test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_bfloat16 PASSED [0.0083s] [ 43%] 2025-12-04T14:02:33.5746876Z test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_float32 PASSED [0.0080s] [ 43%] 2025-12-04T14:02:33.5746971Z test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_float64 PASSED [0.0081s] [ 43%] 2025-12-04T14:02:33.5747061Z test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_int16 PASSED [0.0079s] [ 43%] 2025-12-04T14:02:33.5747153Z test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_int32 PASSED [0.0080s] [ 43%] 2025-12-04T14:02:33.5747243Z test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_int8 PASSED [0.0080s] [ 43%] 2025-12-04T14:02:33.5747334Z test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_uint8 PASSED [0.0080s] [ 43%] 2025-12-04T14:02:33.5747450Z test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_float16 PASSED [1.4607s] [ 43%] 2025-12-04T14:02:33.5747552Z test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_int16 PASSED [0.0067s] [ 43%] 2025-12-04T14:02:33.5747652Z test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_int32 PASSED [0.0048s] [ 44%] 2025-12-04T14:02:33.5747761Z test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_int64 PASSED [0.0046s] [ 44%] 2025-12-04T14:02:33.5747859Z test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_uint8 PASSED [1.4713s] [ 44%] 2025-12-04T14:02:33.5747961Z test_meta.py::TestMetaCUDA::test_meta_outplace_geqrf_cuda_complex128 PASSED [0.0413s] [ 44%] 2025-12-04T14:02:33.5748060Z test_meta.py::TestMetaCUDA::test_meta_outplace_geqrf_cuda_complex64 PASSED 
[0.1465s] [ 44%] 2025-12-04T14:02:33.5748156Z test_meta.py::TestMetaCUDA::test_meta_outplace_geqrf_cuda_float32 PASSED [0.0610s] [ 44%] 2025-12-04T14:02:33.5748264Z test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_complex128 PASSED [0.0209s] [ 44%] 2025-12-04T14:02:33.5748380Z test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_complex64 PASSED [0.1305s] [ 44%] 2025-12-04T14:02:33.5748517Z test_meta.py::TestMetaCUDA::test_meta_outplace_grid_sampler_3d_cuda_float16 SKIPPED [0.0002s] (Skipped!) [ 44%] 2025-12-04T14:02:33.5748645Z test_meta.py::TestMetaCUDA::test_meta_outplace_grid_sampler_3d_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 44%] 2025-12-04T14:02:33.5748743Z test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_bfloat16 PASSED [0.0091s] [ 44%] 2025-12-04T14:02:33.5748837Z test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_float32 PASSED [0.0082s] [ 44%] 2025-12-04T14:02:33.5748927Z test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_int16 PASSED [0.0081s] [ 44%] 2025-12-04T14:02:33.5749016Z test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_int64 PASSED [0.0080s] [ 44%] 2025-12-04T14:02:33.5749109Z test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_bool PASSED [0.0034s] [ 44%] 2025-12-04T14:02:33.5749209Z test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_complex128 PASSED [1.4792s] [ 44%] 2025-12-04T14:02:33.5749308Z test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_complex64 PASSED [0.0052s] [ 44%] 2025-12-04T14:02:33.5749403Z test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_float16 PASSED [0.0037s] [ 44%] 2025-12-04T14:02:33.5749507Z test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_int16 PASSED [0.0101s] [ 44%] 2025-12-04T14:02:33.5749607Z test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_int8 PASSED [0.0098s] [ 44%] 2025-12-04T14:02:33.5749707Z test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_int64 PASSED [0.0330s] [ 44%] 2025-12-04T14:02:33.5749804Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_int8 PASSED [0.0143s] [ 44%] 2025-12-04T14:02:33.5749900Z test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_float32 PASSED [0.0453s] [ 44%] 2025-12-04T14:02:33.5749996Z test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_float64 PASSED [0.0455s] [ 45%] 2025-12-04T14:02:33.5750149Z test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_int64 PASSED [0.0450s] [ 45%] 2025-12-04T14:02:33.5750243Z test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_int8 PASSED [0.0449s] [ 45%] 2025-12-04T14:02:33.5750339Z test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_uint8 PASSED [0.0446s] [ 45%] 2025-12-04T14:02:33.5750439Z test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_bfloat16 PASSED [0.0027s] [ 45%] 2025-12-04T14:02:33.5750542Z test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_complex64 PASSED [1.4863s] [ 45%] 2025-12-04T14:02:33.5750638Z test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_int64 PASSED [0.0043s] [ 45%] 2025-12-04T14:02:33.5750732Z test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_int8 PASSED [1.3675s] [ 45%] 2025-12-04T14:02:33.5750827Z test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_bool PASSED [0.0055s] [ 45%] 2025-12-04T14:02:33.5750942Z test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_float64 PASSED [0.0039s] [ 45%] 2025-12-04T14:02:33.5751042Z test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_int16 PASSED [0.0037s] [ 45%] 2025-12-04T14:02:33.5752795Z test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_int32 PASSED [0.9002s] [ 45%] 2025-12-04T14:02:33.5752898Z test_meta.py::TestMetaCUDA::test_meta_outplace_hypot_cuda_float32 PASSED [0.0094s] [ 45%] 2025-12-04T14:02:33.5753019Z test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_float16 PASSED [0.8722s] [ 45%] 2025-12-04T14:02:33.5753112Z test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_float32 PASSED [0.0057s] [ 45%] 
2025-12-04T14:02:33.5753220Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_complex128 PASSED [0.0101s] [ 45%] 2025-12-04T14:02:33.5753323Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_float64 PASSED [0.0095s] [ 45%] 2025-12-04T14:02:33.5753424Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_uint8 PASSED [0.8757s] [ 45%] 2025-12-04T14:02:33.5753550Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_float32 PASSED [0.0058s] [ 45%] 2025-12-04T14:02:33.5753673Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_uint8 PASSED [0.0044s] [ 45%] 2025-12-04T14:02:33.5753780Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_complex32 PASSED [0.8721s] [ 45%] 2025-12-04T14:02:33.5753886Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_float32 PASSED [0.0060s] [ 45%] 2025-12-04T14:02:33.5753987Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_int32 PASSED [0.0046s] [ 45%] 2025-12-04T14:02:33.5754088Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_int64 PASSED [0.0046s] [ 45%] 2025-12-04T14:02:33.5754189Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_uint8 PASSED [0.8611s] [ 46%] 2025-12-04T14:02:33.5754290Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_bool PASSED [0.0059s] [ 46%] 2025-12-04T14:02:33.5754396Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_float32 PASSED [0.0043s] [ 46%] 2025-12-04T14:02:33.5754497Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_int16 PASSED [0.8760s] [ 46%] 2025-12-04T14:02:33.5754613Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_float32 PASSED [0.0089s] [ 46%] 2025-12-04T14:02:33.5754728Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_float64 PASSED [0.0072s] [ 46%] 2025-12-04T14:02:33.5754841Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_int64 PASSED [0.0070s] [ 46%] 
2025-12-04T14:02:33.5754952Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_int8 PASSED [0.8741s] [ 46%] 2025-12-04T14:02:33.5755065Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_float32 PASSED [0.0087s] [ 46%] 2025-12-04T14:02:33.5755175Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_int16 PASSED [0.0072s] [ 46%] 2025-12-04T14:02:33.5755287Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_uint8 PASSED [0.0072s] [ 46%] 2025-12-04T14:02:33.5755405Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_float32 PASSED [0.8676s] [ 46%] 2025-12-04T14:02:33.5755514Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_int64 PASSED [0.0095s] [ 46%] 2025-12-04T14:02:33.5755623Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_uint8 PASSED [0.0078s] [ 46%] 2025-12-04T14:02:33.5755737Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_float64 PASSED [0.0072s] [ 46%] 2025-12-04T14:02:33.5755842Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_bool PASSED [0.8696s] [ 46%] 2025-12-04T14:02:33.5755950Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_float16 PASSED [0.0053s] [ 46%] 2025-12-04T14:02:33.5756056Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_float64 PASSED [0.0040s] [ 46%] 2025-12-04T14:02:33.5756174Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_int32 PASSED [0.0039s] [ 46%] 2025-12-04T14:02:33.5756284Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_int64 PASSED [0.8640s] [ 46%] 2025-12-04T14:02:33.5756388Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_int8 PASSED [0.0052s] [ 46%] 2025-12-04T14:02:33.5756503Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_uint8 PASSED [0.0040s] [ 46%] 2025-12-04T14:02:33.5756601Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_inner_cuda_float16 PASSED [0.8708s] [ 46%] 2025-12-04T14:02:33.5756697Z test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_float16 PASSED [0.0049s] [ 46%] 2025-12-04T14:02:33.5756792Z test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_bool PASSED [0.0190s] [ 47%] 2025-12-04T14:02:33.5756892Z test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_float16 PASSED [0.0194s] [ 47%] 2025-12-04T14:02:33.5756997Z test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_complex32 PASSED [0.8680s] [ 47%] 2025-12-04T14:02:33.5757108Z test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_float16 PASSED [0.0051s] [ 47%] 2025-12-04T14:02:33.5757215Z test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_int8 PASSED [0.8752s] [ 47%] 2025-12-04T14:02:33.5757313Z test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_bfloat16 PASSED [0.0435s] [ 47%] 2025-12-04T14:02:33.5757408Z test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_float64 PASSED [0.0042s] [ 47%] 2025-12-04T14:02:33.5757502Z test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_int16 PASSED [0.8781s] [ 47%] 2025-12-04T14:02:33.5757596Z test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_bool PASSED [0.0036s] [ 47%] 2025-12-04T14:02:33.5757698Z test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_complex128 PASSED [0.8640s] [ 47%] 2025-12-04T14:02:33.5757798Z test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_complex32 PASSED [0.0042s] [ 47%] 2025-12-04T14:02:33.5757898Z test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_complex64 PASSED [0.8583s] [ 47%] 2025-12-04T14:02:33.5757998Z test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_bfloat16 PASSED [0.0037s] [ 47%] 2025-12-04T14:02:33.5758095Z test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_float16 PASSED [0.8618s] [ 47%] 2025-12-04T14:02:33.5758192Z test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_int32 PASSED [0.0037s] [ 47%] 
2025-12-04T14:02:33.5758285Z test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_int8 PASSED [0.8532s] [ 47%] 2025-12-04T14:02:33.5758389Z test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_bfloat16 PASSED [0.0042s] [ 47%] 2025-12-04T14:02:33.5758492Z test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_float64 PASSED [0.8624s] [ 47%] 2025-12-04T14:02:33.5758595Z test_meta.py::TestMetaCUDA::test_meta_outplace_isposinf_cuda_bfloat16 PASSED [0.0040s] [ 47%] 2025-12-04T14:02:33.5758694Z test_meta.py::TestMetaCUDA::test_meta_outplace_isposinf_cuda_int8 PASSED [0.8616s] [ 47%] 2025-12-04T14:02:33.5758793Z test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_float64 PASSED [0.0046s] [ 47%] 2025-12-04T14:02:33.5758888Z test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_uint8 PASSED [0.8676s] [ 47%] 2025-12-04T14:02:33.5758991Z test_meta.py::TestMetaCUDA::test_meta_outplace_istft_cuda_complex128 PASSED [0.7505s] [ 47%] 2025-12-04T14:02:33.5759091Z test_meta.py::TestMetaCUDA::test_meta_outplace_istft_cuda_complex64 PASSED [1.2026s] [ 48%] 2025-12-04T14:02:33.5759188Z test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_bfloat16 PASSED [0.0041s] [ 48%] 2025-12-04T14:02:33.5759287Z test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_complex128 PASSED [0.8696s] [ 48%] 2025-12-04T14:02:33.5759385Z test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_complex32 PASSED [0.0050s] [ 48%] 2025-12-04T14:02:33.5759480Z test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_float64 PASSED [0.8710s] [ 48%] 2025-12-04T14:02:33.5759585Z test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_int16 PASSED [0.0051s] [ 48%] 2025-12-04T14:02:33.5759681Z test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_int32 PASSED [0.8762s] [ 48%] 2025-12-04T14:02:33.5759810Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_float32 PASSED [0.2515s] [ 48%] 2025-12-04T14:02:33.5759945Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_int32 PASSED [0.2532s] [ 48%] 2025-12-04T14:02:33.5760067Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_int64 PASSED [0.0053s] [ 48%] 2025-12-04T14:02:33.5760240Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_bfloat16 PASSED [0.0049s] [ 48%] 2025-12-04T14:02:33.5760376Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float16 PASSED [0.0046s] [ 48%] 2025-12-04T14:02:33.5760513Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float32 PASSED [0.0047s] [ 48%] 2025-12-04T14:02:33.5760672Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float64 PASSED [0.0046s] [ 48%] 2025-12-04T14:02:33.5760806Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_uint8 PASSED [0.0046s] [ 48%] 2025-12-04T14:02:33.5760923Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_bfloat16 PASSED [0.3094s] [ 48%] 2025-12-04T14:02:33.5761035Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_uint8 PASSED [0.0051s] [ 48%] 2025-12-04T14:02:33.5761165Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_float64 PASSED [0.0046s] [ 48%] 2025-12-04T14:02:33.5761295Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_int64 PASSED [0.0043s] [ 48%] 2025-12-04T14:02:33.5761421Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_int8 PASSED [0.0044s] [ 48%] 2025-12-04T14:02:33.5761538Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_complex128 PASSED [0.8656s] [ 48%] 2025-12-04T14:02:33.5761651Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_complex64 PASSED [0.0050s] [ 48%] 2025-12-04T14:02:33.5761763Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_float32 PASSED [0.8832s] [ 48%] 2025-12-04T14:02:33.5761876Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_float64 PASSED [0.0046s] [ 48%] 2025-12-04T14:02:33.5761986Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_int64 PASSED [0.9769s] [ 49%] 2025-12-04T14:02:33.5762083Z test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_bfloat16 PASSED [0.0046s] [ 49%] 2025-12-04T14:02:33.5762184Z test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_complex128 PASSED [0.8688s] [ 49%] 2025-12-04T14:02:33.5762283Z test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_complex64 PASSED [0.0043s] [ 49%] 2025-12-04T14:02:33.5762379Z test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_float16 PASSED [0.8671s] [ 49%] 2025-12-04T14:02:33.5762476Z test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_float32 PASSED [0.0044s] [ 49%] 2025-12-04T14:02:33.5762570Z test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_int32 PASSED [0.8593s] [ 49%] 2025-12-04T14:02:33.5762664Z test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_int8 PASSED [0.0045s] [ 49%] 2025-12-04T14:02:33.5762758Z test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_uint8 PASSED [0.8787s] [ 49%] 2025-12-04T14:02:33.5762855Z test_meta.py::TestMetaCUDA::test_meta_outplace_kthvalue_cuda_int64 PASSED [0.0174s] [ 49%] 2025-12-04T14:02:33.5762951Z test_meta.py::TestMetaCUDA::test_meta_outplace_kthvalue_cuda_int8 PASSED [0.0052s] [ 49%] 2025-12-04T14:02:33.5763044Z test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_bool PASSED [0.0099s] [ 49%] 2025-12-04T14:02:33.5763140Z test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_float16 PASSED [0.0095s] [ 49%] 2025-12-04T14:02:33.5763250Z test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_int16 PASSED [0.0094s] [ 49%] 2025-12-04T14:02:33.5763345Z test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_float16 PASSED [0.0082s] 
[ 49%] 2025-12-04T14:02:33.5763439Z test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_float32 PASSED [0.0081s] [ 49%] 2025-12-04T14:02:33.5763543Z test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_int64 PASSED [0.0080s] [ 49%] 2025-12-04T14:02:33.5763633Z test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_int8 PASSED [0.0081s] [ 49%] 2025-12-04T14:02:33.5763724Z test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_uint8 PASSED [0.0080s] [ 49%] 2025-12-04T14:02:33.5763825Z test_meta.py::TestMetaCUDA::test_meta_outplace_lerp_cuda_complex128 PASSED [0.0206s] [ 49%] 2025-12-04T14:02:33.5763924Z test_meta.py::TestMetaCUDA::test_meta_outplace_lerp_cuda_complex32 PASSED [0.0201s] [ 49%] 2025-12-04T14:02:33.5764020Z test_meta.py::TestMetaCUDA::test_meta_outplace_lerp_cuda_float16 PASSED [0.0194s] [ 49%] 2025-12-04T14:02:33.5764127Z test_meta.py::TestMetaCUDA::test_meta_outplace_lerp_cuda_float64 PASSED [0.0115s] [ 49%] 2025-12-04T14:02:33.5764230Z test_meta.py::TestMetaCUDA::test_meta_outplace_lgamma_cuda_int32 PASSED [0.0037s] [ 50%] 2025-12-04T14:02:33.5764343Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_cuda_float32 PASSED [0.0117s] [ 50%] 2025-12-04T14:02:33.5764455Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_cuda_float64 PASSED [0.0113s] [ 50%] 2025-12-04T14:02:33.5764575Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_ex_cuda_complex64 PASSED [0.0147s] [ 50%] 2025-12-04T14:02:33.5764690Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_ex_cuda_float32 PASSED [0.0075s] [ 50%] 2025-12-04T14:02:33.5764805Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_ex_cuda_float64 PASSED [0.0074s] [ 50%] 2025-12-04T14:02:33.5764916Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_complex64 PASSED [0.8742s] [ 50%] 2025-12-04T14:02:33.5765025Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_float16 PASSED [0.0053s] [ 50%] 2025-12-04T14:02:33.5765133Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_float32 PASSED [0.0040s] [ 50%] 2025-12-04T14:02:33.5765239Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_uint8 PASSED [0.0039s] [ 50%] 2025-12-04T14:02:33.5765345Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_det_cuda_float64 PASSED [0.0583s] [ 50%] 2025-12-04T14:02:33.5765457Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_float16 PASSED [0.0062s] [ 50%] 2025-12-04T14:02:33.5765568Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_float32 PASSED [0.0057s] [ 50%] 2025-12-04T14:02:33.5765677Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_float64 PASSED [0.0057s] [ 50%] 2025-12-04T14:02:33.5765785Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_int8 PASSED [0.0057s] [ 50%] 2025-12-04T14:02:33.5765893Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eig_cuda_complex64 PASSED [0.0424s] [ 50%] 2025-12-04T14:02:33.5766002Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigh_cuda_float64 PASSED [0.0569s] [ 50%] 2025-12-04T14:02:33.5766114Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvals_cuda_complex64 PASSED [0.0105s] [ 50%] 2025-12-04T14:02:33.5766226Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvals_cuda_float64 PASSED [0.0101s] [ 50%] 2025-12-04T14:02:33.5766340Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvalsh_cuda_complex64 PASSED [0.0090s] [ 50%] 2025-12-04T14:02:33.5766450Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvalsh_cuda_float32 PASSED [0.0088s] [ 50%] 2025-12-04T14:02:33.5766668Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_householder_product_cuda_complex128 SKIPPED [0.0006s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 50%] 2025-12-04T14:02:33.5766893Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_householder_product_cuda_complex64 SKIPPED [0.0005s] (skipCUDAIfRocm: test 
doesn't currently work on the ROCm stack) [ 50%] 2025-12-04T14:02:33.5767104Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_householder_product_cuda_float32 SKIPPED [0.0006s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 50%] 2025-12-04T14:02:33.5767223Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_cuda_complex64 PASSED [0.0498s] [ 51%] 2025-12-04T14:02:33.5767330Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_cuda_float64 PASSED [0.0089s] [ 51%] 2025-12-04T14:02:33.5767441Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_ex_cuda_complex128 PASSED [1.5416s] [ 51%] 2025-12-04T14:02:33.5767558Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_factor_cuda_complex64 PASSED [0.1482s] [ 51%] 2025-12-04T14:02:33.5767688Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lstsq_grad_oriented_cuda_complex128 PASSED [0.1083s] [ 51%] 2025-12-04T14:02:33.5767835Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lstsq_grad_oriented_cuda_complex64 PASSED [1.5412s] [ 51%] 2025-12-04T14:02:33.5767961Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_ex_cuda_float32 PASSED [0.0501s] [ 51%] 2025-12-04T14:02:33.5768076Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_solve_cuda_complex128 PASSED [0.0790s] [ 51%] 2025-12-04T14:02:33.5768186Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_solve_cuda_float32 PASSED [0.0786s] [ 51%] 2025-12-04T14:02:33.5768304Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_complex64 PASSED [0.0205s] [ 51%] 2025-12-04T14:02:33.5768418Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_float32 PASSED [1.4762s] [ 51%] 2025-12-04T14:02:33.5768532Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_float64 PASSED [0.0269s] [ 51%] 2025-12-04T14:02:33.5768653Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_power_cuda_complex64 PASSED [0.0752s] [ 51%] 
2025-12-04T14:02:33.5768768Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_rank_cuda_float32 PASSED [1.5011s] [ 51%] 2025-12-04T14:02:33.5768884Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_multi_dot_cuda_complex64 PASSED [0.0095s] [ 51%] 2025-12-04T14:02:33.5768996Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_multi_dot_cuda_float32 PASSED [0.0081s] [ 51%] 2025-12-04T14:02:33.5769108Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_multi_dot_cuda_float64 PASSED [0.0073s] [ 51%] 2025-12-04T14:02:33.5769217Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_cuda_float32 PASSED [0.0885s] [ 51%] 2025-12-04T14:02:33.5769328Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_cuda_complex128 PASSED [1.5003s] [ 51%] 2025-12-04T14:02:33.5769437Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_cuda_complex64 PASSED [0.0390s] [ 51%] 2025-12-04T14:02:33.5769557Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_hermitian_cuda_float32 PASSED [0.0198s] [ 51%] 2025-12-04T14:02:33.5769677Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_hermitian_cuda_float64 PASSED [0.0141s] [ 51%] 2025-12-04T14:02:33.5769874Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_singular_cuda_complex64 SKIPPED [0.0007s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 51%] 2025-12-04T14:02:33.5769990Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_slogdet_cuda_complex128 PASSED [0.0118s] [ 51%] 2025-12-04T14:02:33.5770133Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_slogdet_cuda_float32 PASSED [0.0085s] [ 52%] 2025-12-04T14:02:33.5770259Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_solve_triangular_cuda_complex128 PASSED [0.1188s] [ 52%] 2025-12-04T14:02:33.5770384Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_solve_triangular_cuda_complex64 PASSED [0.0864s] [ 52%] 2025-12-04T14:02:33.5770524Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_solve_triangular_cuda_float32 PASSED [0.0834s] [ 52%] 2025-12-04T14:02:33.5770636Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svd_cuda_float32 PASSED [1.5524s] [ 52%] 2025-12-04T14:02:33.5770743Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svd_cuda_float64 PASSED [0.0738s] [ 52%] 2025-12-04T14:02:33.5770855Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svdvals_cuda_complex64 PASSED [0.0237s] [ 52%] 2025-12-04T14:02:33.5770981Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svdvals_cuda_float32 PASSED [0.0220s] [ 52%] 2025-12-04T14:02:33.5771096Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorinv_cuda_complex64 PASSED [1.4702s] [ 52%] 2025-12-04T14:02:33.5771215Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorsolve_cuda_complex64 PASSED [0.0139s] [ 52%] 2025-12-04T14:02:33.5771329Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorsolve_cuda_float64 PASSED [0.0061s] [ 52%] 2025-12-04T14:02:33.5771440Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_float64 PASSED [0.0086s] [ 52%] 2025-12-04T14:02:33.5771575Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vecdot_cuda_complex64 PASSED [0.0331s] [ 52%] 2025-12-04T14:02:33.5771681Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_bfloat16 PASSED [0.0184s] [ 52%] 2025-12-04T14:02:33.5771785Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_float32 PASSED [0.0185s] [ 52%] 2025-12-04T14:02:33.5771888Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_float64 PASSED [0.0184s] [ 52%] 2025-12-04T14:02:33.5771987Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int32 PASSED [0.0184s] [ 52%] 2025-12-04T14:02:33.5772086Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int64 PASSED [0.0183s] [ 52%] 2025-12-04T14:02:33.5772183Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int8 PASSED [0.0185s] [ 52%] 
2025-12-04T14:02:33.5772309Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_bfloat16 PASSED [0.0962s] [ 52%] 2025-12-04T14:02:33.5772434Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_float16 PASSED [0.0959s] [ 52%] 2025-12-04T14:02:33.5772556Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_float32 PASSED [0.0968s] [ 52%] 2025-12-04T14:02:33.5772678Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_int16 PASSED [0.0960s] [ 52%] 2025-12-04T14:02:33.5772777Z test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_float16 PASSED [0.0037s] [ 53%] 2025-12-04T14:02:33.5772873Z test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_int32 PASSED [1.4673s] [ 53%] 2025-12-04T14:02:33.5772969Z test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_int64 PASSED [0.0058s] [ 53%] 2025-12-04T14:02:33.5773073Z test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_complex128 PASSED [0.0032s] [ 53%] 2025-12-04T14:02:33.5773166Z test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_int8 PASSED [1.4714s] [ 53%] 2025-12-04T14:02:33.5773264Z test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_uint8 PASSED [0.0045s] [ 53%] 2025-12-04T14:02:33.5773367Z test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_complex128 PASSED [0.0059s] [ 53%] 2025-12-04T14:02:33.5773465Z test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_complex64 PASSED [1.6081s] [ 53%] 2025-12-04T14:02:33.5773560Z test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_int32 PASSED [0.0057s] [ 53%] 2025-12-04T14:02:33.5773656Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_complex64 PASSED [0.0056s] [ 53%] 2025-12-04T14:02:33.5773749Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_int32 PASSED [1.4903s] [ 53%] 2025-12-04T14:02:33.5773841Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_int64 PASSED [0.0055s] [ 53%] 2025-12-04T14:02:33.5773945Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_log_normal_cuda_float16 PASSED [0.0112s] [ 53%] 2025-12-04T14:02:33.5774064Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_cuda_float16 PASSED [0.0088s] [ 53%] 2025-12-04T14:02:33.5774173Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_cuda_float32 PASSED [0.0085s] [ 53%] 2025-12-04T14:02:33.5774278Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_cuda_float64 PASSED [0.0085s] [ 53%] 2025-12-04T14:02:33.5774400Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_bfloat16 PASSED [0.0085s] [ 53%] 2025-12-04T14:02:33.5774531Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_float32 PASSED [1.4980s] [ 53%] 2025-12-04T14:02:33.5774651Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_float64 PASSED [0.0108s] [ 53%] 2025-12-04T14:02:33.5774767Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_int16 PASSED [0.0089s] [ 53%] 2025-12-04T14:02:33.5774884Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_int32 PASSED [0.0086s] [ 53%] 2025-12-04T14:02:33.5775000Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_int8 PASSED [0.0086s] [ 53%] 2025-12-04T14:02:33.5775133Z test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp2_cuda_float16 PASSED [0.0144s] [ 53%] 2025-12-04T14:02:33.5775243Z test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_complex32 PASSED [0.0302s] [ 53%] 2025-12-04T14:02:33.5775356Z test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_complex128 PASSED [0.0154s] [ 54%] 2025-12-04T14:02:33.5775468Z test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_complex64 PASSED [0.0048s] [ 54%] 2025-12-04T14:02:33.5775577Z test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_float16 PASSED [0.0048s] [ 54%] 2025-12-04T14:02:33.5775677Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_logdet_cuda_float32 PASSED [0.0106s] [ 54%] 2025-12-04T14:02:33.5775775Z test_meta.py::TestMetaCUDA::test_meta_outplace_logdet_cuda_float64 PASSED [0.0094s] [ 54%] 2025-12-04T14:02:33.5775886Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_complex128 PASSED [0.0128s] [ 54%] 2025-12-04T14:02:33.5775993Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_float32 PASSED [0.0104s] [ 54%] 2025-12-04T14:02:33.5776100Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_float64 PASSED [0.0104s] [ 54%] 2025-12-04T14:02:33.5776209Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_complex64 PASSED [0.0038s] [ 54%] 2025-12-04T14:02:33.5776316Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_int16 PASSED [0.0038s] [ 54%] 2025-12-04T14:02:33.5776419Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_int32 PASSED [1.4941s] [ 54%] 2025-12-04T14:02:33.5776523Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_int64 PASSED [0.0061s] [ 54%] 2025-12-04T14:02:33.5776624Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_int8 PASSED [0.0043s] [ 54%] 2025-12-04T14:02:33.5776728Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_uint8 PASSED [0.0040s] [ 54%] 2025-12-04T14:02:33.5776836Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_complex64 PASSED [0.3904s] [ 54%] 2025-12-04T14:02:33.5776938Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_int16 PASSED [0.0109s] [ 54%] 2025-12-04T14:02:33.5777049Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_complex64 PASSED [0.0131s] [ 54%] 2025-12-04T14:02:33.5777158Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_float32 PASSED [0.0105s] [ 54%] 2025-12-04T14:02:33.5777264Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_float64 PASSED [0.0104s] [ 54%] 2025-12-04T14:02:33.5777367Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_int32 PASSED [0.0104s] [ 54%] 2025-12-04T14:02:33.5777462Z test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_int64 PASSED [0.0070s] [ 54%] 2025-12-04T14:02:33.5777567Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_complex128 PASSED [0.1076s] [ 54%] 2025-12-04T14:02:33.5777679Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_int32 PASSED [0.1022s] [ 54%] 2025-12-04T14:02:33.5777777Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_int64 PASSED [0.1019s] [ 55%] 2025-12-04T14:02:33.5777875Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_uint8 PASSED [0.0343s] [ 55%] 2025-12-04T14:02:33.5778009Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_float16 PASSED [0.6372s] [ 55%] 2025-12-04T14:02:33.5778133Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_float32 PASSED [0.6373s] [ 55%] 2025-12-04T14:02:33.5778256Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_float64 PASSED [0.6400s] [ 55%] 2025-12-04T14:02:33.5778376Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_int16 PASSED [0.5972s] [ 55%] 2025-12-04T14:02:33.5778497Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_int32 PASSED [0.5964s] [ 55%] 2025-12-04T14:02:33.5778626Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_int64 PASSED [0.5915s] [ 55%] 2025-12-04T14:02:33.5778742Z test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_bfloat16 PASSED [1.5116s] [ 55%] 2025-12-04T14:02:33.5778852Z test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_complex128 PASSED [0.0272s] [ 55%] 2025-12-04T14:02:33.5778961Z test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_complex64 PASSED [0.0235s] [ 55%] 2025-12-04T14:02:33.5779064Z test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_float16 PASSED 
[0.0089s] [ 55%] 2025-12-04T14:02:33.5779168Z test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_float64 PASSED [1.4930s] [ 55%] 2025-12-04T14:02:33.5779269Z test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_complex128 PASSED [0.0063s] [ 55%] 2025-12-04T14:02:33.5779366Z test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_float16 PASSED [0.0047s] [ 55%] 2025-12-04T14:02:33.5779461Z test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_int64 PASSED [1.4825s] [ 55%] 2025-12-04T14:02:33.5779555Z test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_int8 PASSED [0.0102s] [ 55%] 2025-12-04T14:02:33.5779651Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_cuda_complex64 PASSED [0.0599s] [ 55%] 2025-12-04T14:02:33.5779746Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_cuda_float32 PASSED [0.0271s] [ 55%] 2025-12-04T14:02:33.5779850Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_solve_cuda_complex128 PASSED [0.0273s] [ 55%] 2025-12-04T14:02:33.5779957Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_unpack_cuda_complex128 PASSED [0.0635s] [ 55%] 2025-12-04T14:02:33.5780062Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_unpack_cuda_complex64 PASSED [0.0199s] [ 55%] 2025-12-04T14:02:33.5780203Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_unpack_cuda_float32 PASSED [0.0189s] [ 55%] 2025-12-04T14:02:33.5780299Z test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_bfloat16 PASSED [0.0033s] [ 55%] 2025-12-04T14:02:33.5780395Z test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_float16 PASSED [1.4723s] [ 56%] 2025-12-04T14:02:33.5780485Z test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_int8 PASSED [0.0051s] [ 56%] 2025-12-04T14:02:33.5780577Z test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_uint8 PASSED [0.0037s] [ 56%] 2025-12-04T14:02:33.5780673Z test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_complex64 PASSED [1.4795s] [ 56%] 2025-12-04T14:02:33.5780767Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_float64 PASSED [0.0052s] [ 56%] 2025-12-04T14:02:33.5780857Z test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_int16 PASSED [0.0037s] [ 56%] 2025-12-04T14:02:33.5780947Z test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_int32 PASSED [1.4701s] [ 56%] 2025-12-04T14:02:33.5781037Z test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_int64 PASSED [0.0051s] [ 56%] 2025-12-04T14:02:33.5781161Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_bfloat16 PASSED [0.1951s] [ 56%] 2025-12-04T14:02:33.5781269Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_int32 PASSED [0.1616s] [ 56%] 2025-12-04T14:02:33.5781375Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_float64 PASSED [0.1863s] [ 56%] 2025-12-04T14:02:33.5781504Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_int32 PASSED [0.1623s] [ 56%] 2025-12-04T14:02:33.5781607Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_int64 PASSED [0.1668s] [ 56%] 2025-12-04T14:02:33.5781708Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_int8 PASSED [0.1627s] [ 56%] 2025-12-04T14:02:33.5781819Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_bfloat16 PASSED [0.1071s] [ 56%] 2025-12-04T14:02:33.5781927Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_float32 PASSED [0.1077s] [ 56%] 2025-12-04T14:02:33.5782032Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_int64 PASSED [0.0927s] [ 56%] 2025-12-04T14:02:33.5782154Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_float64 PASSED [0.1072s] [ 56%] 2025-12-04T14:02:33.5782272Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_int32 PASSED [0.0472s] [ 56%] 2025-12-04T14:02:33.5782379Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_uint8 PASSED [0.0467s] [ 56%] 2025-12-04T14:02:33.5782485Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_int64 PASSED [0.0470s] [ 56%] 2025-12-04T14:02:33.5782591Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_int8 PASSED [0.0469s] [ 56%] 2025-12-04T14:02:33.5782696Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_uint8 PASSED [0.0470s] [ 56%] 2025-12-04T14:02:33.5782802Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_bfloat16 PASSED [0.0064s] [ 56%] 2025-12-04T14:02:33.5782911Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_complex32 PASSED [0.0063s] [ 57%] 2025-12-04T14:02:33.5783017Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_float32 PASSED [0.0062s] [ 57%] 2025-12-04T14:02:33.5783122Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_int32 PASSED [0.0062s] [ 57%] 2025-12-04T14:02:33.5783224Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_int64 PASSED [0.0062s] [ 57%] 2025-12-04T14:02:33.5783327Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_int8 PASSED [0.0061s] [ 57%] 2025-12-04T14:02:33.5783443Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_log_softmax_cuda_bfloat16 PASSED [0.0532s] [ 57%] 2025-12-04T14:02:33.5783558Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_log_softmax_cuda_float16 PASSED [0.0557s] [ 57%] 2025-12-04T14:02:33.5783670Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_log_softmax_cuda_float64 PASSED [0.0536s] [ 57%] 2025-12-04T14:02:33.5783783Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logaddexp_cuda_float16 PASSED [0.0501s] [ 57%] 2025-12-04T14:02:33.5783898Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_complex64 PASSED [0.2012s] [ 57%] 2025-12-04T14:02:33.5784010Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_int32 PASSED [0.2046s] [ 57%] 2025-12-04T14:02:33.5784118Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_mean_cuda_complex64 PASSED [0.2359s] [ 
57%] 2025-12-04T14:02:33.5784229Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_median_cuda_bfloat16 PASSED [0.0337s] [ 57%] 2025-12-04T14:02:33.5784336Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_median_cuda_float32 PASSED [0.0337s] [ 57%] 2025-12-04T14:02:33.5784441Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_norm_cuda_float16 PASSED [0.9520s] [ 57%] 2025-12-04T14:02:33.5784557Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_normalize_cuda_complex64 PASSED [0.0455s] [ 57%] 2025-12-04T14:02:33.5784681Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_normalize_cuda_float16 PASSED [0.0461s] [ 57%] 2025-12-04T14:02:33.5784794Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_normalize_cuda_float32 PASSED [0.0437s] [ 57%] 2025-12-04T14:02:33.5784903Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_bfloat16 PASSED [0.2173s] [ 57%] 2025-12-04T14:02:33.5785011Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_complex64 PASSED [2.9454s] [ 57%] 2025-12-04T14:02:33.5785127Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_int32 PASSED [0.1756s] [ 57%] 2025-12-04T14:02:33.5785239Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_complex64 PASSED [0.0042s] [ 57%] 2025-12-04T14:02:33.5785345Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_int16 PASSED [1.4888s] [ 57%] 2025-12-04T14:02:33.5785452Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_int32 PASSED [0.0059s] [ 58%] 2025-12-04T14:02:33.5785557Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_uint8 PASSED [0.0043s] [ 58%] 2025-12-04T14:02:33.5785674Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_int64 PASSED [0.0068s] [ 58%] 2025-12-04T14:02:33.5785792Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_int8 PASSED [0.0065s] [ 58%] 2025-12-04T14:02:33.5785904Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmax_cuda_bfloat16 PASSED [0.0468s] [ 58%] 2025-12-04T14:02:33.5786014Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmin_cuda_float16 PASSED [0.0481s] [ 58%] 2025-12-04T14:02:33.5786124Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmin_cuda_float32 PASSED [0.0479s] [ 58%] 2025-12-04T14:02:33.5786232Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmin_cuda_float64 PASSED [0.0481s] [ 58%] 2025-12-04T14:02:33.5786342Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_complex128 PASSED [0.3827s] [ 58%] 2025-12-04T14:02:33.5786449Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_complex64 PASSED [0.3825s] [ 58%] 2025-12-04T14:02:33.5786555Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_float32 PASSED [0.3837s] [ 58%] 2025-12-04T14:02:33.5786662Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_float64 PASSED [0.3843s] [ 58%] 2025-12-04T14:02:33.5786762Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_int32 PASSED [0.3805s] [ 58%] 2025-12-04T14:02:33.5786865Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_int64 PASSED [0.3787s] [ 58%] 2025-12-04T14:02:33.5786970Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_bfloat16 PASSED [0.1931s] [ 58%] 2025-12-04T14:02:33.5787070Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_bool PASSED [0.1591s] [ 58%] 2025-12-04T14:02:33.5787178Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_complex128 PASSED [0.1582s] [ 58%] 2025-12-04T14:02:33.5787283Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_float64 PASSED [0.1855s] [ 58%] 2025-12-04T14:02:33.5787383Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_int32 PASSED [0.1608s] [ 58%] 2025-12-04T14:02:33.5787487Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_int8 PASSED [0.1586s] [ 58%] 2025-12-04T14:02:33.5787587Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_uint8 PASSED [0.1597s] [ 58%] 2025-12-04T14:02:33.5787698Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_complex128 PASSED [0.3794s] [ 58%] 2025-12-04T14:02:33.5787802Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_float16 PASSED [1.8693s] [ 58%] 2025-12-04T14:02:33.5787907Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_float32 PASSED [0.3752s] [ 58%] 2025-12-04T14:02:33.5788007Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_int8 PASSED [0.3701s] [ 59%] 2025-12-04T14:02:33.5788111Z test_meta.py::TestMetaCUDA::test_meta_outplace_matmul_cuda_complex128 PASSED [0.0053s] [ 59%] 2025-12-04T14:02:33.5788228Z test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_complex128 PASSED [1.4831s] [ 59%] 2025-12-04T14:02:33.5788337Z test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_complex64 PASSED [0.0063s] [ 59%] 2025-12-04T14:02:33.5788442Z test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_float16 PASSED [1.4769s] [ 59%] 2025-12-04T14:02:33.5788542Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_bool PASSED [0.0102s] [ 59%] 2025-12-04T14:02:33.5788652Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_int64 PASSED [0.0084s] [ 59%] 2025-12-04T14:02:33.5788788Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_pool2d_with_indices_backward_cuda_bfloat16 PASSED [1.0074s] [ 59%] 2025-12-04T14:02:33.5788907Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_float32 PASSED [1.4952s] [ 59%] 2025-12-04T14:02:33.5789019Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_uint8 PASSED [0.0047s] [ 59%] 2025-12-04T14:02:33.5789137Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_int32 PASSED [0.0038s] [ 59%] 2025-12-04T14:02:33.5789271Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_int64 PASSED 
[1.4739s] [ 59%] 2025-12-04T14:02:33.5789373Z test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_complex128 PASSED [0.0235s] [ 59%] 2025-12-04T14:02:33.5789476Z test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_float16 PASSED [1.4876s] [ 59%] 2025-12-04T14:02:33.5789573Z test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_float32 PASSED [0.0154s] [ 59%] 2025-12-04T14:02:33.5789668Z test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_float64 PASSED [1.4985s] [ 59%] 2025-12-04T14:02:33.5789767Z test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_float32 PASSED [0.0085s] [ 59%] 2025-12-04T14:02:33.5789868Z test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_float64 PASSED [0.0087s] [ 59%] 2025-12-04T14:02:33.5789963Z test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_int16 PASSED [0.0063s] [ 59%] 2025-12-04T14:02:33.5790062Z test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_int64 PASSED [0.0059s] [ 59%] 2025-12-04T14:02:33.5790234Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_complex128 PASSED [0.0056s] [ 59%] 2025-12-04T14:02:33.5790359Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_float32 PASSED [0.0053s] [ 59%] 2025-12-04T14:02:33.5790481Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_int64 PASSED [0.0053s] [ 59%] 2025-12-04T14:02:33.5790602Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_int8 PASSED [0.0053s] [ 60%] 2025-12-04T14:02:33.5790722Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_uint8 PASSED [0.0053s] [ 60%] 2025-12-04T14:02:33.5790844Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_bool PASSED [0.0053s] [ 60%] 2025-12-04T14:02:33.5790974Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_complex64 PASSED [0.0055s] [ 60%] 2025-12-04T14:02:33.5791099Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_int16 PASSED [0.0054s] [ 60%] 2025-12-04T14:02:33.5791220Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_uint8 PASSED [0.0053s] [ 60%] 2025-12-04T14:02:33.5791331Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_bfloat16 PASSED [0.0085s] [ 60%] 2025-12-04T14:02:33.5791433Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_bool PASSED [0.0080s] [ 60%] 2025-12-04T14:02:33.5791534Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_int32 PASSED [0.0080s] [ 60%] 2025-12-04T14:02:33.5791650Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_bool PASSED [0.0030s] [ 60%] 2025-12-04T14:02:33.5791765Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_float32 PASSED [1.4853s] [ 60%] 2025-12-04T14:02:33.5791901Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_float64 PASSED [0.0047s] [ 60%] 2025-12-04T14:02:33.5792018Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_bool PASSED [0.0037s] [ 60%] 2025-12-04T14:02:33.5792135Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_int64 PASSED [1.4945s] [ 60%] 2025-12-04T14:02:33.5792264Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_uint8 PASSED [0.0052s] [ 60%] 2025-12-04T14:02:33.5792365Z test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_float32 PASSED [0.0090s] [ 60%] 2025-12-04T14:02:33.5792465Z test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_float64 PASSED [0.0083s] [ 60%] 2025-12-04T14:02:33.5792565Z test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_int64 PASSED [0.0082s] [ 60%] 2025-12-04T14:02:33.5792663Z test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_uint8 PASSED [0.0081s] [ 60%] 2025-12-04T14:02:33.5792760Z test_meta.py::TestMetaCUDA::test_meta_outplace_mm_cuda_float64 PASSED 
[1.4765s] [ 60%] 2025-12-04T14:02:33.5792880Z test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_float32 PASSED [0.1299s] [ 60%] 2025-12-04T14:02:33.5792976Z test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_int32 PASSED [0.0054s] [ 60%] 2025-12-04T14:02:33.5793070Z test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_int64 PASSED [1.4797s] [ 60%] 2025-12-04T14:02:33.5793167Z test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_uint8 PASSED [0.0065s] [ 60%] 2025-12-04T14:02:33.5793266Z test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_bool PASSED [1.4806s] [ 61%] 2025-12-04T14:02:33.5793370Z test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_complex64 PASSED [0.0041s] [ 61%] 2025-12-04T14:02:33.5793466Z test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_int16 PASSED [1.4729s] [ 61%] 2025-12-04T14:02:33.5793563Z test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_int8 PASSED [0.0044s] [ 61%] 2025-12-04T14:02:33.5793665Z test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_bfloat16 PASSED [1.4876s] [ 61%] 2025-12-04T14:02:33.5793763Z test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_float16 PASSED [0.0055s] [ 61%] 2025-12-04T14:02:33.5793858Z test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_int16 PASSED [0.0038s] [ 61%] 2025-12-04T14:02:33.5793953Z test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_int32 PASSED [1.4764s] [ 61%] 2025-12-04T14:02:33.5794051Z test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_complex32 PASSED [0.0132s] [ 61%] 2025-12-04T14:02:33.5794144Z test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_float64 PASSED [0.0083s] [ 61%] 2025-12-04T14:02:33.5794238Z test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_bfloat16 PASSED [1.4919s] [ 61%] 2025-12-04T14:02:33.5794331Z test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_float32 PASSED [0.0050s] [ 61%] 2025-12-04T14:02:33.5794424Z test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_float64 
PASSED [0.0034s] [ 61%] 2025-12-04T14:02:33.5794547Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16 PASSED [0.0150s] [ 61%] 2025-12-04T14:02:33.5794664Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int32 PASSED [0.0141s] [ 61%] 2025-12-04T14:02:33.5794780Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int64 PASSED [0.0133s] [ 61%] 2025-12-04T14:02:33.5794897Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int32 PASSED [1.5078s] [ 61%] 2025-12-04T14:02:33.5795017Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_bfloat16 PASSED [0.0160s] [ 61%] 2025-12-04T14:02:33.5795131Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int32 PASSED [0.0138s] [ 61%] 2025-12-04T14:02:33.5795238Z test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_float32 PASSED [1.5020s] [ 61%] 2025-12-04T14:02:33.5795359Z test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_int16 PASSED [0.0051s] [ 61%] 2025-12-04T14:02:33.5795463Z test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_int32 PASSED [0.0037s] [ 61%] 2025-12-04T14:02:33.5795564Z test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_int64 PASSED [1.4751s] [ 61%] 2025-12-04T14:02:33.5795669Z test_meta.py::TestMetaCUDA::test_meta_outplace_nanmean_cuda_complex32 PASSED [0.0150s] [ 61%] 2025-12-04T14:02:33.5795785Z test_meta.py::TestMetaCUDA::test_meta_outplace_nanmean_cuda_complex64 PASSED [1.4921s] [ 62%] 2025-12-04T14:02:33.5795890Z test_meta.py::TestMetaCUDA::test_meta_outplace_nanmean_cuda_float16 PASSED [1.4953s] [ 62%] 2025-12-04T14:02:33.5795995Z test_meta.py::TestMetaCUDA::test_meta_outplace_nanmedian_cuda_float64 PASSED [0.0082s] [ 62%] 2025-12-04T14:02:33.5796097Z test_meta.py::TestMetaCUDA::test_meta_outplace_nanmedian_cuda_int16 PASSED [0.0063s] [ 62%] 2025-12-04T14:02:33.5796195Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nanmedian_cuda_int32 PASSED [0.0061s] [ 62%] 2025-12-04T14:02:33.5796304Z test_meta.py::TestMetaCUDA::test_meta_outplace_nanquantile_cuda_float64 PASSED [1.5157s] [ 62%] 2025-12-04T14:02:33.5796428Z test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_complex128 PASSED [0.0234s] [ 62%] 2025-12-04T14:02:33.5796531Z test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_complex64 PASSED [0.0171s] [ 62%] 2025-12-04T14:02:33.5796632Z test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_float16 PASSED [0.0168s] [ 62%] 2025-12-04T14:02:33.5796743Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_complex128 XFAIL [0.0030s] [ 62%] 2025-12-04T14:02:33.5796849Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_complex32 XFAIL [1.4819s] [ 62%] 2025-12-04T14:02:33.5796958Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_float64 XFAIL [1.4850s] [ 62%] 2025-12-04T14:02:33.5797057Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_bfloat16 PASSED [2.9729s] [ 62%] 2025-12-04T14:02:33.5797161Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_complex128 PASSED [0.0047s] [ 62%] 2025-12-04T14:02:33.5797264Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_complex32 PASSED [1.4760s] [ 62%] 2025-12-04T14:02:33.5797363Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_float32 PASSED [0.0045s] [ 62%] 2025-12-04T14:02:33.5797461Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_float64 PASSED [1.4956s] [ 62%] 2025-12-04T14:02:33.5797555Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_int32 PASSED [0.0044s] [ 62%] 2025-12-04T14:02:33.5797669Z test_meta.py::TestMetaCUDA::test_meta_outplace_native_batch_norm_cuda_float32 PASSED [0.0139s] [ 62%] 2025-12-04T14:02:33.5797793Z test_meta.py::TestMetaCUDA::test_meta_outplace_native_dropout_backward_cuda_float32 PASSED [0.0172s] [ 62%] 2025-12-04T14:02:33.5797917Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_native_dropout_backward_cuda_float64 PASSED [0.0063s] [ 62%] 2025-12-04T14:02:33.5798030Z test_meta.py::TestMetaCUDA::test_meta_outplace_native_layer_norm_cuda_float64 PASSED [0.0225s] [ 62%] 2025-12-04T14:02:33.5798130Z test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_bfloat16 PASSED [0.0082s] [ 62%] 2025-12-04T14:02:33.5798228Z test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_complex128 PASSED [0.0080s] [ 63%] 2025-12-04T14:02:33.5798325Z test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_complex64 PASSED [0.0080s] [ 63%] 2025-12-04T14:02:33.5798420Z test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_float16 PASSED [0.0080s] [ 63%] 2025-12-04T14:02:33.5798512Z test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_int8 PASSED [0.0080s] [ 63%] 2025-12-04T14:02:33.5798607Z test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_float32 PASSED [1.4795s] [ 63%] 2025-12-04T14:02:33.5798704Z test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_float64 PASSED [0.0044s] [ 63%] 2025-12-04T14:02:33.5798811Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_complex32 PASSED [0.0046s] [ 63%] 2025-12-04T14:02:33.5798926Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_complex64 PASSED [1.4703s] [ 63%] 2025-12-04T14:02:33.5799031Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_int64 PASSED [0.0061s] [ 63%] 2025-12-04T14:02:33.5799131Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_int8 PASSED [0.0044s] [ 63%] 2025-12-04T14:02:33.5799255Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_bool PASSED [0.0043s] [ 63%] 2025-12-04T14:02:33.5799373Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_complex128 PASSED [1.4686s] [ 63%] 2025-12-04T14:02:33.5799490Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_complex64 PASSED [0.0060s] [ 63%] 2025-12-04T14:02:33.5799602Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_float32 PASSED [0.0044s] [ 63%] 2025-12-04T14:02:33.5799715Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_int8 PASSED [0.0043s] [ 63%] 2025-12-04T14:02:33.5799826Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_uint8 PASSED [1.4608s] [ 63%] 2025-12-04T14:02:33.5799953Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_bfloat16 PASSED [0.0063s] [ 63%] 2025-12-04T14:02:33.5800058Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_complex128 PASSED [0.0047s] [ 63%] 2025-12-04T14:02:33.5800204Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_complex32 PASSED [0.0045s] [ 63%] 2025-12-04T14:02:33.5800306Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_float16 PASSED [1.4911s] [ 63%] 2025-12-04T14:02:33.5800408Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_float64 PASSED [0.0062s] [ 63%] 2025-12-04T14:02:33.5800512Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_complex128 PASSED [0.0046s] [ 63%] 2025-12-04T14:02:33.5800615Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_float64 PASSED [0.0043s] [ 63%] 2025-12-04T14:02:33.5800715Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_int32 PASSED [1.4701s] [ 63%] 2025-12-04T14:02:33.5800820Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_float32 PASSED [0.0062s] [ 64%] 2025-12-04T14:02:33.5800922Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_float64 PASSED [0.0046s] [ 64%] 2025-12-04T14:02:33.5801020Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_int8 PASSED [0.0043s] [ 64%] 2025-12-04T14:02:33.5801127Z test_meta.py::TestMetaCUDA::test_meta_outplace_nextafter_cuda_bfloat16 PASSED [0.0084s] [ 64%] 2025-12-04T14:02:33.5801265Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float64 PASSED [0.0098s] [ 64%] 
2025-12-04T14:02:33.5801400Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float16 PASSED [0.0048s] [ 64%] 2025-12-04T14:02:33.5801535Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float32 PASSED [1.4727s] [ 64%] 2025-12-04T14:02:33.5801669Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float16 PASSED [0.0138s] [ 64%] 2025-12-04T14:02:33.5801805Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float64 PASSED [0.0060s] [ 64%] 2025-12-04T14:02:33.5801942Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float16 PASSED [0.0092s] [ 64%] 2025-12-04T14:02:33.5802076Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float64 PASSED [1.4928s] [ 64%] 2025-12-04T14:02:33.5802210Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float32 PASSED [0.0155s] [ 64%] 2025-12-04T14:02:33.5802343Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float64 PASSED [1.4717s] [ 64%] 2025-12-04T14:02:33.5802468Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool1d_cuda_bfloat16 PASSED [0.0125s] [ 64%] 2025-12-04T14:02:33.5802604Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool1d_cuda_float16 PASSED [0.0050s] [ 64%] 2025-12-04T14:02:33.5802726Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool1d_cuda_float64 PASSED [0.0048s] [ 64%] 2025-12-04T14:02:33.5802849Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool2d_cuda_bfloat16 PASSED [0.0052s] [ 64%] 2025-12-04T14:02:33.5802982Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool2d_cuda_float16 PASSED [0.0052s] [ 64%] 2025-12-04T14:02:33.5803105Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_cuda_float64 PASSED [1.4967s] 
[ 64%] 2025-12-04T14:02:33.5803246Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float16 PASSED [0.0157s] [ 64%] 2025-12-04T14:02:33.5803402Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float16 PASSED [0.0390s] [ 64%] 2025-12-04T14:02:33.5803554Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32 PASSED [0.0275s] [ 64%] 2025-12-04T14:02:33.5803712Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_bool PASSED [1.4850s] [ 64%] 2025-12-04T14:02:33.5803848Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_complex128 PASSED [0.0050s] [ 65%] 2025-12-04T14:02:33.5803987Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_complex64 PASSED [1.4734s] [ 65%] 2025-12-04T14:02:33.5804117Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_float64 PASSED [0.0049s] [ 65%] 2025-12-04T14:02:33.5804244Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_int16 PASSED [1.4744s] [ 65%] 2025-12-04T14:02:33.5804371Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_int64 PASSED [0.0049s] [ 65%] 2025-12-04T14:02:33.5804492Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_complex128 PASSED [0.0257s] [ 65%] 2025-12-04T14:02:33.5804614Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_complex32 PASSED [0.1308s] [ 65%] 2025-12-04T14:02:33.5804731Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_float16 PASSED [1.4762s] [ 65%] 2025-12-04T14:02:33.5805037Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_bfloat16 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 1200, provided ptr: 0x78e591200c00 size: 768 
2025-12-04T14:02:33.5805219Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 1200, provided ptr: 0x78e591200c00 size: 768 2025-12-04T14:02:33.5805263Z PASSED [0.1638s] [ 65%] 2025-12-04T14:02:33.5805387Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_complex128 PASSED [1.5231s] [ 65%] 2025-12-04T14:02:33.5805687Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_complex64 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x78e668200e00 size: 1024 2025-12-04T14:02:33.5805869Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x78e668200e00 size: 1024 2025-12-04T14:02:33.5805910Z PASSED [0.0759s] [ 65%] 2025-12-04T14:02:33.5806200Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_float16 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 1200, provided ptr: 0x78e570400c00 size: 768 2025-12-04T14:02:33.5806380Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 1200, provided ptr: 0x78e570400c00 size: 768 2025-12-04T14:02:33.5806419Z PASSED [0.0431s] [ 65%] 2025-12-04T14:02:33.5806546Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_float32 PASSED [0.0149s] [ 65%] 2025-12-04T14:02:33.5806667Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_float64 PASSED [0.0142s] [ 65%] 2025-12-04T14:02:33.5806787Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_complex128 PASSED [0.0292s] [ 65%] 2025-12-04T14:02:33.5807098Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_complex32 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 26400, provided ptr: 0x78e529c03200 size: 5888 2025-12-04T14:02:33.5807280Z MIOpen(HIP): 
Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 26400, provided ptr: 0x78e529c03200 size: 5888 2025-12-04T14:02:33.5807475Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 168960, provided ptr: 0x78e529c03400 size: 6656 2025-12-04T14:02:33.5807657Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 168960, provided ptr: 0x78e529c03400 size: 6656 2025-12-04T14:02:33.5807727Z PASSED [0.0459s] [ 65%] 2025-12-04T14:02:33.5808029Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_complex64 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x78e54b405a00 size: 11008 2025-12-04T14:02:33.5808211Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x78e54b405a00 size: 11008 2025-12-04T14:02:33.5808407Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x78e54b406000 size: 12544 2025-12-04T14:02:33.5808588Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x78e54b406000 size: 12544 2025-12-04T14:02:33.5808627Z PASSED [0.0430s] [ 65%] 2025-12-04T14:02:33.5808747Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_float16 PASSED [0.0104s] [ 65%] 2025-12-04T14:02:33.5808865Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_float32 PASSED [0.0104s] [ 65%] 2025-12-04T14:02:33.5809000Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_complex128 PASSED [1.4765s] [ 65%] 2025-12-04T14:02:33.5809137Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_complex32 PASSED [0.3357s] [ 65%] 2025-12-04T14:02:33.5809269Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_complex64 PASSED [0.0147s] [ 65%] 2025-12-04T14:02:33.5809400Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_float16 PASSED [0.0067s] [ 65%] 2025-12-04T14:02:33.5809531Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_float32 PASSED [0.0065s] [ 65%] 2025-12-04T14:02:33.5809665Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_complex32 PASSED [0.0184s] [ 66%] 2025-12-04T14:02:33.5809803Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_complex64 PASSED [0.0191s] [ 66%] 2025-12-04T14:02:33.5809934Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_float64 PASSED [1.5036s] [ 66%] 2025-12-04T14:02:33.5810070Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_complex32 PASSED [0.0417s] [ 66%] 2025-12-04T14:02:33.5810226Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_complex64 PASSED [0.0171s] [ 66%] 2025-12-04T14:02:33.5810367Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float16 PASSED [0.0112s] [ 66%] 2025-12-04T14:02:33.5810506Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float32 PASSED [0.0102s] [ 66%] 2025-12-04T14:02:33.5810657Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int16 PASSED [0.0103s] [ 66%] 2025-12-04T14:02:33.5810794Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int64 PASSED [0.0100s] [ 66%] 2025-12-04T14:02:33.5810930Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_similarity_cuda_float16 PASSED [0.0156s] [ 66%] 2025-12-04T14:02:33.5811074Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_similarity_cuda_float64 PASSED 
[0.0146s] [ 66%] 2025-12-04T14:02:33.5811194Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_ctc_loss_cuda_float64 PASSED [0.0145s] [ 66%] 2025-12-04T14:02:33.5811319Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout2d_cuda_float16 PASSED [1.4948s] [ 66%] 2025-12-04T14:02:33.5811441Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout2d_cuda_float32 PASSED [0.0115s] [ 66%] 2025-12-04T14:02:33.5811564Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout3d_cuda_float64 PASSED [0.0080s] [ 66%] 2025-12-04T14:02:33.5811709Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout_cuda_bfloat16 PASSED [0.0120s] [ 66%] 2025-12-04T14:02:33.5811825Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_elu_cuda_float64 PASSED [1.5097s] [ 66%] 2025-12-04T14:02:33.5811952Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_bag_cuda_float64 PASSED [0.0351s] [ 66%] 2025-12-04T14:02:33.5812075Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_cuda_float32 PASSED [1.5018s] [ 66%] 2025-12-04T14:02:33.5812194Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_cuda_float64 PASSED [0.0083s] [ 66%] 2025-12-04T14:02:33.5812350Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_bool PASSED [0.0072s] [ 66%] 2025-12-04T14:02:33.5812508Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float16 PASSED [0.0070s] [ 66%] 2025-12-04T14:02:33.5812669Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float64 PASSED [0.0069s] [ 66%] 2025-12-04T14:02:33.5812824Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32 PASSED [0.0069s] [ 66%] 2025-12-04T14:02:33.5812977Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int8 PASSED [0.0068s] [ 67%] 2025-12-04T14:02:33.5813119Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float16 PASSED [0.0251s] [ 67%] 2025-12-04T14:02:33.5813258Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_fractional_max_pool3d_cuda_float16 PASSED [0.0283s] [ 67%] 2025-12-04T14:02:33.5813393Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float64 PASSED [0.1940s] [ 67%] 2025-12-04T14:02:33.5813509Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_glu_cuda_bfloat16 PASSED [0.0290s] [ 67%] 2025-12-04T14:02:33.5813626Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_glu_cuda_float32 PASSED [1.5185s] [ 67%] 2025-12-04T14:02:33.5813750Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_grid_sample_cuda_float64 PASSED [0.3065s] [ 67%] 2025-12-04T14:02:33.5813878Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_group_norm_cuda_bfloat16 PASSED [0.0476s] [ 67%] 2025-12-04T14:02:33.5814001Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_group_norm_cuda_float64 PASSED [0.0213s] [ 67%] 2025-12-04T14:02:33.5814131Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardsigmoid_cuda_bfloat16 PASSED [0.0118s] [ 67%] 2025-12-04T14:02:33.5814251Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardswish_cuda_float32 PASSED [0.0126s] [ 67%] 2025-12-04T14:02:33.5814380Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_int32 PASSED [0.0057s] [ 67%] 2025-12-04T14:02:33.5814521Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float64 PASSED [0.0122s] [ 67%] 2025-12-04T14:02:33.5814649Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_instance_norm_cuda_float16 PASSED [0.0151s] [ 67%] 2025-12-04T14:02:33.5814785Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_instance_norm_cuda_float64 PASSED [0.0145s] [ 67%] 2025-12-04T14:02:33.5814916Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_area_cuda_bfloat16 PASSED [0.0089s] [ 67%] 2025-12-04T14:02:33.5815049Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_area_cuda_float16 PASSED [0.0059s] [ 67%] 2025-12-04T14:02:33.5815181Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_area_cuda_float32 PASSED [0.0085s] [ 67%] 2025-12-04T14:02:33.5815320Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_bicubic_cuda_float32 PASSED [0.4176s] [ 67%] 2025-12-04T14:02:33.5815474Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_bicubic_cuda_float64 PASSED [0.4137s] [ 67%] 2025-12-04T14:02:33.5815612Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_bilinear_cuda_float16 PASSED [0.0055s] [ 67%] 2025-12-04T14:02:33.5815748Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_linear_cuda_bfloat16 PASSED [0.0326s] [ 67%] 2025-12-04T14:02:33.5815884Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_linear_cuda_float16 PASSED [1.4953s] [ 67%] 2025-12-04T14:02:33.5816016Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_linear_cuda_float64 PASSED [0.0307s] [ 68%] 2025-12-04T14:02:33.5816165Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_float64 PASSED [0.0299s] [ 68%] 2025-12-04T14:02:33.5816307Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_bfloat16 PASSED [0.1250s] [ 68%] 2025-12-04T14:02:33.5816447Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_float16 PASSED [1.4803s] [ 68%] 2025-12-04T14:02:33.5816586Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_float32 PASSED [0.1218s] [ 68%] 2025-12-04T14:02:33.5816710Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_complex128 PASSED [0.0069s] [ 68%] 2025-12-04T14:02:33.5816830Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_float64 PASSED [0.0052s] [ 68%] 2025-12-04T14:02:33.5816952Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_layer_norm_cuda_float16 PASSED [0.0071s] [ 68%] 2025-12-04T14:02:33.5817075Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_layer_norm_cuda_float64 PASSED [0.0070s] [ 68%] 2025-12-04T14:02:33.5817197Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_leaky_relu_cuda_float64 PASSED [0.0137s] [ 68%] 2025-12-04T14:02:33.5817319Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_linear_cuda_bfloat16 PASSED [0.0173s] [ 68%] 2025-12-04T14:02:33.5817441Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_linear_cuda_complex128 PASSED [0.0606s] [ 68%] 2025-12-04T14:02:33.5817578Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_local_response_norm_cuda_float32 PASSED [0.0045s] [ 68%] 2025-12-04T14:02:33.5817701Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_logsigmoid_cuda_float32 PASSED [1.4718s] [ 68%] 2025-12-04T14:02:33.5817839Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_bfloat16 PASSED [0.0251s] [ 68%] 2025-12-04T14:02:33.5817971Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_int8 PASSED [0.0220s] [ 68%] 2025-12-04T14:02:33.5818096Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool1d_cuda_bfloat16 PASSED [0.2531s] [ 68%] 2025-12-04T14:02:33.5818226Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool1d_cuda_float16 PASSED [0.2507s] [ 68%] 2025-12-04T14:02:33.5818351Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool2d_cuda_float16 PASSED [0.4154s] [ 68%] 2025-12-04T14:02:33.5818475Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool2d_cuda_float32 PASSED [0.4169s] [ 68%] 2025-12-04T14:02:33.5818606Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool2d_cuda_float64 PASSED [0.4155s] [ 68%] 2025-12-04T14:02:33.5818728Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool3d_cuda_float16 PASSED [0.1801s] [ 68%] 2025-12-04T14:02:33.5818853Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_cuda_float32 PASSED [0.1102s] [ 68%] 2025-12-04T14:02:33.5818987Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float32 PASSED [0.0257s] [ 68%] 2025-12-04T14:02:33.5819119Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float64 PASSED [0.0313s] [ 69%] 2025-12-04T14:02:33.5819262Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_cuda_float32 PASSED [0.1865s] [ 69%] 2025-12-04T14:02:33.5819387Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_cuda_float64 PASSED [0.0631s] [ 69%] 2025-12-04T14:02:33.5819508Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mish_cuda_bfloat16 PASSED [0.0130s] [ 69%] 2025-12-04T14:02:33.5819629Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mse_loss_cuda_float32 PASSED [0.0058s] [ 69%] 2025-12-04T14:02:33.5819765Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_margin_loss_cuda_bfloat16 PASSED [0.0171s] [ 69%] 2025-12-04T14:02:33.5819906Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float16 PASSED [0.0145s] [ 69%] 2025-12-04T14:02:33.5820047Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float32 PASSED [0.0106s] [ 69%] 2025-12-04T14:02:33.5820239Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16 PASSED [0.0076s] [ 69%] 2025-12-04T14:02:33.5820384Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.0074s] [ 69%] 2025-12-04T14:02:33.5820536Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float64 PASSED [0.0074s] [ 69%] 2025-12-04T14:02:33.5820656Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_nll_loss_cuda_bfloat16 PASSED [1.5024s] [ 69%] 2025-12-04T14:02:33.5820779Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_nll_loss_cuda_float64 PASSED [0.0133s] [ 69%] 2025-12-04T14:02:33.5820902Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_bfloat16 PASSED [0.0165s] [ 69%] 2025-12-04T14:02:33.5821027Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_float64 PASSED [0.0148s] [ 69%] 2025-12-04T14:02:33.5821144Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_one_hot_cuda_int64 PASSED [1.4990s] [ 69%] 2025-12-04T14:02:33.5821267Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_bool PASSED [0.0043s] [ 69%] 2025-12-04T14:02:33.5821397Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_complex64 PASSED [0.0047s] [ 69%] 2025-12-04T14:02:33.5821525Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_float16 PASSED [1.4821s] [ 69%] 2025-12-04T14:02:33.5821649Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_float32 PASSED [0.0063s] [ 69%] 2025-12-04T14:02:33.5821773Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_int8 PASSED [0.0048s] [ 69%] 2025-12-04T14:02:33.5821898Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_uint8 PASSED [0.0045s] [ 69%] 2025-12-04T14:02:33.5822030Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_bool PASSED [0.0152s] [ 69%] 2025-12-04T14:02:33.5822163Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_complex128 PASSED [0.0153s] [ 70%] 2025-12-04T14:02:33.5822288Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_float16 PASSED [0.0151s] [ 70%] 2025-12-04T14:02:33.5822424Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_bfloat16 PASSED [1.4904s] [ 70%] 2025-12-04T14:02:33.5822548Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_float16 PASSED [0.0073s] [ 70%] 2025-12-04T14:02:33.5822671Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_int32 PASSED [0.0056s] [ 70%] 2025-12-04T14:02:33.5822791Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_int64 PASSED [0.0054s] [ 70%] 2025-12-04T14:02:33.5822914Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_int8 PASSED [0.0053s] [ 70%] 2025-12-04T14:02:33.5823063Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_uint8 PASSED [1.5027s] [ 70%] 2025-12-04T14:02:33.5823197Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_bfloat16 PASSED [0.0143s] [ 70%] 2025-12-04T14:02:33.5823326Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_complex64 PASSED [0.0056s] [ 70%] 2025-12-04T14:02:33.5823452Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_int64 PASSED [0.0054s] [ 70%] 2025-12-04T14:02:33.5823576Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_uint8 PASSED [0.0053s] [ 70%] 2025-12-04T14:02:33.5823721Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_bfloat16 PASSED [1.4694s] [ 70%] 2025-12-04T14:02:33.5823865Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_float16 PASSED [0.0056s] [ 70%] 2025-12-04T14:02:33.5824006Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_float64 PASSED [0.0041s] [ 70%] 2025-12-04T14:02:33.5824145Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_int32 PASSED [1.4915s] [ 70%] 2025-12-04T14:02:33.5824281Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_int8 PASSED [0.0053s] [ 70%] 2025-12-04T14:02:33.5824420Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_complex128 PASSED [0.0073s] [ 70%] 2025-12-04T14:02:33.5824549Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_int32 PASSED [0.0064s] [ 70%] 2025-12-04T14:02:33.5824676Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_bfloat16 PASSED [0.0036s] [ 70%] 2025-12-04T14:02:33.5824799Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_bool PASSED [0.0033s] [ 70%] 2025-12-04T14:02:33.5824925Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_int16 PASSED [0.0035s] [ 70%] 2025-12-04T14:02:33.5825055Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_float32 PASSED [0.0036s] [ 70%] 2025-12-04T14:02:33.5825182Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_int16 PASSED [0.0034s] [ 70%] 2025-12-04T14:02:33.5825309Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_int32 PASSED [0.0035s] [ 71%] 2025-12-04T14:02:33.5825438Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_int64 PASSED [0.0035s] [ 71%] 2025-12-04T14:02:33.5825571Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_float64 PASSED [0.0582s] [ 71%] 
2025-12-04T14:02:33.5825696Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_int32 PASSED [0.0586s] [ 71%] 2025-12-04T14:02:33.5825837Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_uint8 PASSED [0.0584s] [ 71%] 2025-12-04T14:02:33.5825955Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_bfloat16 PASSED [0.0042s] [ 71%] 2025-12-04T14:02:33.5826073Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_float16 PASSED [1.4841s] [ 71%] 2025-12-04T14:02:33.5826199Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_float32 PASSED [0.0062s] [ 71%] 2025-12-04T14:02:33.5826317Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_float64 PASSED [0.0044s] [ 71%] 2025-12-04T14:02:33.5826427Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_int32 PASSED [0.0042s] [ 71%] 2025-12-04T14:02:33.5826543Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_int8 PASSED [0.0041s] [ 71%] 2025-12-04T14:02:33.5826657Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_float32 PASSED [1.4921s] [ 71%] 2025-12-04T14:02:33.5826791Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_int16 PASSED [0.0059s] [ 71%] 2025-12-04T14:02:33.5826902Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_int32 PASSED [0.0040s] [ 71%] 2025-12-04T14:02:33.5827035Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_complex_cuda_complex128 PASSED [1.4912s] [ 71%] 2025-12-04T14:02:33.5827151Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_cuda_bfloat16 PASSED [0.0046s] [ 71%] 2025-12-04T14:02:33.5827263Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_cuda_float16 PASSED [1.5075s] [ 71%] 2025-12-04T14:02:33.5827375Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_cuda_float32 PASSED 
[0.0048s] [ 71%] 2025-12-04T14:02:33.5827488Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_cuda_float64 PASSED [1.4949s] [ 71%] 2025-12-04T14:02:33.5827619Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_smooth_l1_loss_cuda_float32 PASSED [0.0109s] [ 71%] 2025-12-04T14:02:33.5827751Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_soft_margin_loss_cuda_float16 PASSED [0.0060s] [ 71%] 2025-12-04T14:02:33.5827874Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_cuda_bfloat16 PASSED [0.0046s] [ 71%] 2025-12-04T14:02:33.5827994Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_cuda_float16 PASSED [0.0044s] [ 71%] 2025-12-04T14:02:33.5828117Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_cuda_float64 PASSED [1.4943s] [ 71%] 2025-12-04T14:02:33.5828254Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_bfloat16 PASSED [0.0066s] [ 72%] 2025-12-04T14:02:33.5828395Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex128 PASSED [0.0047s] [ 72%] 2025-12-04T14:02:33.5828527Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_int32 PASSED [0.0045s] [ 72%] 2025-12-04T14:02:33.5828661Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_int64 PASSED [1.4827s] [ 72%] 2025-12-04T14:02:33.5828783Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softplus_cuda_bfloat16 PASSED [0.0154s] [ 72%] 2025-12-04T14:02:33.5828906Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softplus_cuda_float32 PASSED [1.4863s] [ 72%] 2025-12-04T14:02:33.5829031Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softshrink_cuda_float64 PASSED [0.0172s] [ 72%] 2025-12-04T14:02:33.5829146Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_bool PASSED [1.4991s] [ 72%] 2025-12-04T14:02:33.5829271Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_float16 PASSED [0.0052s] [ 72%] 2025-12-04T14:02:33.5829394Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_float64 PASSED [1.5009s] [ 72%] 2025-12-04T14:02:33.5829526Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_float32 PASSED [0.0201s] [ 72%] 2025-12-04T14:02:33.5829645Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_int8 PASSED [0.0044s] [ 72%] 2025-12-04T14:02:33.5829788Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_loss_cuda_complex128 PASSED [0.0123s] [ 72%] 2025-12-04T14:02:33.5829929Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_loss_cuda_uint8 PASSED [0.0113s] [ 72%] 2025-12-04T14:02:33.5830126Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16 PASSED [0.0118s] [ 72%] 2025-12-04T14:02:33.5830280Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16 PASSED [0.0114s] [ 72%] 2025-12-04T14:02:33.5830436Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64 PASSED [0.0115s] [ 72%] 2025-12-04T14:02:33.5830573Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_complex128 PASSED [0.0958s] [ 72%] 2025-12-04T14:02:33.5830710Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_complex64 PASSED [1.5668s] [ 72%] 2025-12-04T14:02:33.5830846Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_upsample_nearest_cuda_bfloat16 PASSED [0.0074s] [ 72%] 2025-12-04T14:02:33.5830947Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_bool PASSED [0.0113s] [ 72%] 2025-12-04T14:02:33.5831056Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_complex32 PASSED [0.0108s] [ 72%] 2025-12-04T14:02:33.5831162Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_complex64 PASSED [0.0107s] [ 72%] 2025-12-04T14:02:33.5831261Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int16 PASSED [0.0106s] [ 73%] 2025-12-04T14:02:33.5831360Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int64 PASSED [0.0106s] [ 73%] 2025-12-04T14:02:33.5831459Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_uint8 PASSED [0.0104s] [ 73%] 2025-12-04T14:02:33.5831598Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_bfloat16 SKIPPED [0.0006s] (Only runs on cpu) [ 73%] 2025-12-04T14:02:33.5831730Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_bool SKIPPED [0.0006s] (Only runs on cpu) [ 73%] 2025-12-04T14:02:33.5831868Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_complex64 SKIPPED [0.0005s] (Only runs on cpu) [ 73%] 2025-12-04T14:02:33.5832003Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_float32 SKIPPED [0.0006s] (Only runs on cpu) [ 73%] 2025-12-04T14:02:33.5832105Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_complex128 PASSED [0.0360s] [ 73%] 2025-12-04T14:02:33.5832204Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_complex64 PASSED [0.0353s] [ 73%] 2025-12-04T14:02:33.5832308Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_bfloat16 PASSED [0.0047s] [ 73%] 2025-12-04T14:02:33.5832417Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_complex128 PASSED [0.0046s] [ 73%] 2025-12-04T14:02:33.5832521Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_float32 PASSED [0.0045s] [ 73%] 2025-12-04T14:02:33.5832622Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_nuc_cuda_float64 PASSED [1.4806s] [ 73%] 2025-12-04T14:02:33.5832724Z test_meta.py::TestMetaCUDA::test_meta_outplace_normal_cuda_bfloat16 PASSED [1.5114s] [ 73%] 2025-12-04T14:02:33.5832837Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_normal_in_place_cuda_complex64 PASSED [0.0055s] [ 73%] 2025-12-04T14:02:33.5832949Z test_meta.py::TestMetaCUDA::test_meta_outplace_normal_in_place_cuda_float16 PASSED [0.0038s] [ 73%] 2025-12-04T14:02:33.5833063Z test_meta.py::TestMetaCUDA::test_meta_outplace_normal_number_mean_cuda_float32 PASSED [0.0055s] [ 73%] 2025-12-04T14:02:33.5833177Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_complex128 PASSED [1.4798s] [ 73%] 2025-12-04T14:02:33.5833273Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_float16 PASSED [0.0043s] [ 73%] 2025-12-04T14:02:33.5833373Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_bool PASSED [0.0062s] [ 73%] 2025-12-04T14:02:33.5833480Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_complex128 PASSED [1.5033s] [ 73%] 2025-12-04T14:02:33.5833596Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_float32 PASSED [0.0081s] [ 73%] 2025-12-04T14:02:33.5833696Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_int16 PASSED [0.0059s] [ 73%] 2025-12-04T14:02:33.5833794Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_int8 PASSED [0.0056s] [ 73%] 2025-12-04T14:02:33.5833890Z test_meta.py::TestMetaCUDA::test_meta_outplace_ormqr_cuda_float32 PASSED [0.1024s] [ 74%] 2025-12-04T14:02:33.5833986Z test_meta.py::TestMetaCUDA::test_meta_outplace_ormqr_cuda_float64 PASSED [0.1020s] [ 74%] 2025-12-04T14:02:33.5834084Z test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_bfloat16 PASSED [1.5037s] [ 74%] 2025-12-04T14:02:33.5834205Z test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_complex128 PASSED [0.0041s] [ 74%] 2025-12-04T14:02:33.5834299Z test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_int32 PASSED [1.4961s] [ 74%] 2025-12-04T14:02:33.5834407Z test_meta.py::TestMetaCUDA::test_meta_outplace_pca_lowrank_cuda_float32 PASSED [0.3891s] [ 74%] 2025-12-04T14:02:33.5834519Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_complex32 PASSED [1.5037s] [ 74%] 2025-12-04T14:02:33.5834630Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_complex64 PASSED [0.0059s] [ 74%] 2025-12-04T14:02:33.5834738Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_float32 PASSED [0.0043s] [ 74%] 2025-12-04T14:02:33.5834844Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_float64 PASSED [0.0041s] [ 74%] 2025-12-04T14:02:33.5834951Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_int16 PASSED [0.0040s] [ 74%] 2025-12-04T14:02:33.5835057Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_int32 PASSED [1.4741s] [ 74%] 2025-12-04T14:02:33.5835162Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_int8 PASSED [0.0059s] [ 74%] 2025-12-04T14:02:33.5835267Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_complex64 PASSED [0.0035s] [ 74%] 2025-12-04T14:02:33.5835367Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_float32 PASSED [1.4731s] [ 74%] 2025-12-04T14:02:33.5835464Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_int64 PASSED [0.0046s] [ 74%] 2025-12-04T14:02:33.5835572Z test_meta.py::TestMetaCUDA::test_meta_outplace_pinverse_cuda_complex128 PASSED [0.0113s] [ 74%] 2025-12-04T14:02:33.5835668Z test_meta.py::TestMetaCUDA::test_meta_outplace_polar_cuda_float32 PASSED [0.0114s] [ 74%] 2025-12-04T14:02:33.5835791Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_int16 PASSED [0.0090s] [ 74%] 2025-12-04T14:02:33.5835912Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_int8 PASSED [0.0066s] [ 74%] 2025-12-04T14:02:33.5836031Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_uint8 PASSED [0.0065s] [ 74%] 2025-12-04T14:02:33.5836148Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_bool PASSED [1.4949s] [ 74%] 
2025-12-04T14:02:33.5836268Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_int64 PASSED [0.0086s] [ 74%] 2025-12-04T14:02:33.5836385Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_int8 PASSED [0.0069s] [ 75%] 2025-12-04T14:02:33.5836502Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_bool PASSED [0.0067s] [ 75%] 2025-12-04T14:02:33.5836629Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_float32 PASSED [0.0083s] [ 75%] 2025-12-04T14:02:33.5836756Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_int16 PASSED [0.0065s] [ 75%] 2025-12-04T14:02:33.5836878Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_int32 PASSED [0.0065s] [ 75%] 2025-12-04T14:02:33.5836994Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_int8 PASSED [1.4864s] [ 75%] 2025-12-04T14:02:33.5837130Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_uint8 PASSED [0.0083s] [ 75%] 2025-12-04T14:02:33.5837253Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_bfloat16 PASSED [0.0088s] [ 75%] 2025-12-04T14:02:33.5837372Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_int32 PASSED [0.0066s] [ 75%] 2025-12-04T14:02:33.5837488Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_int64 PASSED [0.0065s] [ 75%] 2025-12-04T14:02:33.5837607Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_uint8 PASSED [0.0065s] [ 75%] 2025-12-04T14:02:33.5837740Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_bfloat16 PASSED [0.0065s] [ 75%] 2025-12-04T14:02:33.5837871Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_float16 PASSED [1.4996s] [ 75%] 2025-12-04T14:02:33.5837988Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_int8 PASSED [0.0085s] [ 75%] 2025-12-04T14:02:33.5838098Z test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_complex128 PASSED [1.4868s] [ 75%] 2025-12-04T14:02:33.5838206Z test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_complex32 PASSED [0.0038s] [ 75%] 2025-12-04T14:02:33.5838311Z test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_complex64 PASSED [1.4756s] [ 75%] 2025-12-04T14:02:33.5838413Z test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_float16 PASSED [0.0039s] [ 75%] 2025-12-04T14:02:33.5838510Z test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_float16 PASSED [0.0094s] [ 75%] 2025-12-04T14:02:33.5838607Z test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_float32 PASSED [0.0082s] [ 75%] 2025-12-04T14:02:33.5838703Z test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_float64 PASSED [0.0081s] [ 75%] 2025-12-04T14:02:33.5838796Z test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_int32 PASSED [0.0081s] [ 75%] 2025-12-04T14:02:33.5838889Z test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_int8 PASSED [0.0081s] [ 75%] 2025-12-04T14:02:33.5838989Z test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_complex32 PASSED [0.5902s] [ 75%] 2025-12-04T14:02:33.5839084Z test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_float32 PASSED [0.0133s] [ 76%] 2025-12-04T14:02:33.5839178Z test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_int64 PASSED [0.0128s] [ 76%] 2025-12-04T14:02:33.5839271Z test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_uint8 PASSED [0.0142s] [ 76%] 2025-12-04T14:02:33.5839368Z test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_bfloat16 PASSED [0.0114s] [ 76%] 2025-12-04T14:02:33.5839459Z test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_bool PASSED [0.0113s] [ 76%] 2025-12-04T14:02:33.5839557Z test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_complex64 PASSED [0.0113s] [ 
76%] 2025-12-04T14:02:33.5839652Z test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_float32 PASSED [0.0113s] [ 76%] 2025-12-04T14:02:33.5839745Z test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_int64 PASSED [0.0113s] [ 76%] 2025-12-04T14:02:33.5839835Z test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_int8 PASSED [0.0113s] [ 76%] 2025-12-04T14:02:33.5839927Z test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_uint8 PASSED [0.0112s] [ 76%] 2025-12-04T14:02:33.5840024Z test_meta.py::TestMetaCUDA::test_meta_outplace_qr_cuda_complex64 PASSED [0.0228s] [ 76%] 2025-12-04T14:02:33.5840158Z test_meta.py::TestMetaCUDA::test_meta_outplace_qr_cuda_float64 PASSED [0.0203s] [ 76%] 2025-12-04T14:02:33.5840270Z test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_bool PASSED [1.4904s] [ 76%] 2025-12-04T14:02:33.5840383Z test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_complex128 PASSED [0.0125s] [ 76%] 2025-12-04T14:02:33.5840490Z test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_complex32 PASSED [1.5091s] [ 76%] 2025-12-04T14:02:33.5840607Z test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_float64 PASSED [0.0125s] [ 76%] 2025-12-04T14:02:33.5840706Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_cuda_int32 PASSED [1.5090s] [ 76%] 2025-12-04T14:02:33.5840802Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_cuda_int64 PASSED [0.0104s] [ 76%] 2025-12-04T14:02:33.5840910Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_float32 PASSED [0.0156s] [ 76%] 2025-12-04T14:02:33.5841015Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_int32 PASSED [0.0143s] [ 76%] 2025-12-04T14:02:33.5841119Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_int8 PASSED [1.5118s] [ 76%] 2025-12-04T14:02:33.5841232Z test_meta.py::TestMetaCUDA::test_meta_outplace_randn_cuda_complex128 PASSED [0.0062s] [ 76%] 2025-12-04T14:02:33.5841340Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_randn_cuda_float16 PASSED [0.0043s] [ 76%] 2025-12-04T14:02:33.5841436Z test_meta.py::TestMetaCUDA::test_meta_outplace_randn_cuda_float64 PASSED [0.0040s] [ 76%] 2025-12-04T14:02:33.5841542Z test_meta.py::TestMetaCUDA::test_meta_outplace_randn_like_cuda_float16 PASSED [0.0103s] [ 77%] 2025-12-04T14:02:33.5841638Z test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_bfloat16 PASSED [1.4921s] [ 77%] 2025-12-04T14:02:33.5841740Z test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_complex128 PASSED [0.0048s] [ 77%] 2025-12-04T14:02:33.5841839Z test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_complex32 PASSED [1.5021s] [ 77%] 2025-12-04T14:02:33.5841937Z test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_float64 PASSED [0.0048s] [ 77%] 2025-12-04T14:02:33.5842034Z test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_int16 PASSED [1.4928s] [ 77%] 2025-12-04T14:02:33.5842129Z test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_int32 PASSED [0.0048s] [ 77%] 2025-12-04T14:02:33.5842224Z test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_int64 PASSED [1.4967s] [ 77%] 2025-12-04T14:02:33.5842318Z test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_int16 PASSED [0.0045s] [ 77%] 2025-12-04T14:02:33.5842411Z test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_uint8 PASSED [1.4917s] [ 77%] 2025-12-04T14:02:33.5842518Z test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_bfloat16 PASSED [0.0056s] [ 77%] 2025-12-04T14:02:33.5842619Z test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_bool PASSED [0.0039s] [ 77%] 2025-12-04T14:02:33.5842729Z test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_complex128 PASSED [1.4856s] [ 77%] 2025-12-04T14:02:33.5842836Z test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_float64 PASSED [0.0054s] [ 77%] 2025-12-04T14:02:33.5842938Z test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_int16 PASSED 
[0.0039s] [ 77%] 2025-12-04T14:02:33.5843039Z test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_int32 PASSED [1.4781s] [ 77%] 2025-12-04T14:02:33.5843142Z test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_float16 PASSED [0.0109s] [ 77%] 2025-12-04T14:02:33.5843243Z test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_int32 PASSED [0.0089s] [ 77%] 2025-12-04T14:02:33.5843342Z test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_int64 PASSED [0.0086s] [ 77%] 2025-12-04T14:02:33.5843440Z test_meta.py::TestMetaCUDA::test_meta_outplace_renorm_cuda_float64 PASSED [0.0139s] [ 77%] 2025-12-04T14:02:33.5843537Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_float16 PASSED [0.0128s] [ 77%] 2025-12-04T14:02:33.5843636Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_float32 PASSED [0.0126s] [ 77%] 2025-12-04T14:02:33.5843743Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_int64 PASSED [0.0126s] [ 77%] 2025-12-04T14:02:33.5843840Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_int8 PASSED [0.0126s] [ 78%] 2025-12-04T14:02:33.5843935Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_uint8 PASSED [0.0125s] [ 78%] 2025-12-04T14:02:33.5844063Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_complex32 PASSED [1.4875s] [ 78%] 2025-12-04T14:02:33.5844181Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_complex64 PASSED [0.0057s] [ 78%] 2025-12-04T14:02:33.5844295Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_float16 PASSED [0.0041s] [ 78%] 2025-12-04T14:02:33.5844407Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_int32 PASSED [1.5026s] [ 78%] 2025-12-04T14:02:33.5844518Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_int64 PASSED [0.0056s] [ 78%] 2025-12-04T14:02:33.5844630Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_uint8 PASSED [0.0041s] [ 
78%] 2025-12-04T14:02:33.5844763Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_bfloat16 PASSED [1.4701s] [ 78%] 2025-12-04T14:02:33.5844873Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_complex64 PASSED [0.0050s] [ 78%] 2025-12-04T14:02:33.5844979Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_float16 PASSED [0.0037s] [ 78%] 2025-12-04T14:02:33.5845080Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_int32 PASSED [1.4873s] [ 78%] 2025-12-04T14:02:33.5845179Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_int8 PASSED [0.0049s] [ 78%] 2025-12-04T14:02:33.5845284Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_complex64 PASSED [0.0042s] [ 78%] 2025-12-04T14:02:33.5845383Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_float64 PASSED [1.4905s] [ 78%] 2025-12-04T14:02:33.5845481Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_int64 PASSED [0.0054s] [ 78%] 2025-12-04T14:02:33.5845582Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_bfloat16 PASSED [0.0035s] [ 78%] 2025-12-04T14:02:33.5845688Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_complex128 PASSED [1.4996s] [ 78%] 2025-12-04T14:02:33.5845791Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_complex64 PASSED [0.0049s] [ 78%] 2025-12-04T14:02:33.5845890Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_float32 PASSED [0.0034s] [ 78%] 2025-12-04T14:02:33.5845986Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_int64 PASSED [1.4817s] [ 78%] 2025-12-04T14:02:33.5846093Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_complex64 PASSED [0.0049s] [ 78%] 2025-12-04T14:02:33.5846194Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_uint8 PASSED [0.0036s] [ 78%] 2025-12-04T14:02:33.5846304Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_complex128 PASSED [1.4881s] [ 78%] 
2025-12-04T14:02:33.5846413Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_float16 PASSED [0.0041s] [ 79%] 2025-12-04T14:02:33.5846520Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_float32 PASSED [1.4914s] [ 79%] 2025-12-04T14:02:33.5846627Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_float64 PASSED [0.0041s] [ 79%] 2025-12-04T14:02:33.5846732Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_int16 PASSED [1.4814s] [ 79%] 2025-12-04T14:02:33.5846837Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_uint8 PASSED [0.0041s] [ 79%] 2025-12-04T14:02:33.5846946Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_complex128 PASSED [1.4904s] [ 79%] 2025-12-04T14:02:33.5847056Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_complex32 PASSED [0.0041s] [ 79%] 2025-12-04T14:02:33.5847163Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_complex64 PASSED [1.4882s] [ 79%] 2025-12-04T14:02:33.5847277Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_int32 PASSED [0.0042s] [ 79%] 2025-12-04T14:02:33.5847380Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_int8 PASSED [1.4923s] [ 79%] 2025-12-04T14:02:33.5847484Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_uint8 PASSED [0.0041s] [ 79%] 2025-12-04T14:02:33.5847587Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_bool PASSED [0.0090s] [ 79%] 2025-12-04T14:02:33.5847686Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_complex64 PASSED [0.0083s] [ 79%] 2025-12-04T14:02:33.5847778Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_int8 PASSED [0.0081s] [ 79%] 2025-12-04T14:02:33.5847874Z test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_bfloat16 PASSED [0.0118s] [ 79%] 2025-12-04T14:02:33.5847974Z test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_complex64 PASSED [0.0118s] [ 79%] 2025-12-04T14:02:33.5848068Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_int16 PASSED [0.0115s] [ 79%] 2025-12-04T14:02:33.5848174Z test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_int64 PASSED [0.0116s] [ 79%] 2025-12-04T14:02:33.5848281Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_bfloat16 PASSED [1.4738s] [ 79%] 2025-12-04T14:02:33.5848378Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_float16 PASSED [0.0044s] [ 79%] 2025-12-04T14:02:33.5848473Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_int64 PASSED [1.4877s] [ 79%] 2025-12-04T14:02:33.5848586Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_0_cuda_float64 PASSED [0.0055s] [ 79%] 2025-12-04T14:02:33.5848697Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_3_cuda_float32 PASSED [0.0039s] [ 79%] 2025-12-04T14:02:33.5848797Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_complex64 PASSED [0.0051s] [ 80%] 2025-12-04T14:02:33.5848891Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_int16 PASSED [1.4972s] [ 80%] 2025-12-04T14:02:33.5848987Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_int32 PASSED [0.0054s] [ 80%] 2025-12-04T14:02:33.5849082Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_int8 PASSED [0.0039s] [ 80%] 2025-12-04T14:02:33.5849176Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_uint8 PASSED [1.4881s] [ 80%] 2025-12-04T14:02:33.5849272Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_bfloat16 PASSED [0.0090s] [ 80%] 2025-12-04T14:02:33.5849372Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_complex128 PASSED [1.4842s] [ 80%] 2025-12-04T14:02:33.5849467Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_float32 PASSED [0.0091s] [ 80%] 2025-12-04T14:02:33.5849560Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_int32 PASSED [1.4912s] [ 80%] 2025-12-04T14:02:33.5849653Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_int64 PASSED [0.0084s] [ 80%] 
2025-12-04T14:02:33.5849745Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_uint8 PASSED [1.4768s] [ 80%] 2025-12-04T14:02:33.5849857Z test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_bfloat16 PASSED [0.0036s] [ 80%] 2025-12-04T14:02:33.5849965Z test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_float64 PASSED [1.4695s] [ 80%] 2025-12-04T14:02:33.5850072Z test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_int64 PASSED [0.0036s] [ 80%] 2025-12-04T14:02:33.5850218Z test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_uint8 PASSED [1.4663s] [ 80%] 2025-12-04T14:02:33.5850328Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_complex64 PASSED [0.0097s] [ 80%] 2025-12-04T14:02:33.5850433Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_float16 PASSED [0.0078s] [ 80%] 2025-12-04T14:02:33.5850538Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_float64 PASSED [0.0076s] [ 80%] 2025-12-04T14:02:33.5850639Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_int8 PASSED [0.0075s] [ 80%] 2025-12-04T14:02:33.5850766Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_complex64 PASSED [1.4923s] [ 80%] 2025-12-04T14:02:33.5850867Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_float16 PASSED [0.0211s] [ 80%] 2025-12-04T14:02:33.5850965Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_int16 PASSED [0.0140s] [ 80%] 2025-12-04T14:02:33.5851072Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_int32 PASSED [0.0139s] [ 80%] 2025-12-04T14:02:33.5851167Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_int8 PASSED [0.0144s] [ 80%] 2025-12-04T14:02:33.5851285Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amax_cuda_float16 PASSED [0.0176s] [ 81%] 2025-12-04T14:02:33.5851400Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amax_cuda_int16 PASSED [0.0167s] [ 81%] 
2025-12-04T14:02:33.5851513Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amax_cuda_uint8 PASSED [0.0164s] [ 81%] 2025-12-04T14:02:33.5851630Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_bfloat16 PASSED [0.0165s] [ 81%] 2025-12-04T14:02:33.5851773Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_float16 PASSED [0.0164s] [ 81%] 2025-12-04T14:02:33.5851888Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_float32 PASSED [1.5059s] [ 81%] 2025-12-04T14:02:33.5852003Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_float64 PASSED [0.0185s] [ 81%] 2025-12-04T14:02:33.5852116Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_int16 PASSED [1.5069s] [ 81%] 2025-12-04T14:02:33.5852229Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_int16 PASSED [0.0201s] [ 81%] 2025-12-04T14:02:33.5852339Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_int8 PASSED [1.5454s] [ 81%] 2025-12-04T14:02:33.5852450Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_uint8 PASSED [0.0208s] [ 81%] 2025-12-04T14:02:33.5852567Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_prod_cuda_float64 PASSED [1.5235s] [ 81%] 2025-12-04T14:02:33.5852683Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_bfloat16 PASSED [0.0191s] [ 81%] 2025-12-04T14:02:33.5852795Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_float16 PASSED [1.4955s] [ 81%] 2025-12-04T14:02:33.5852908Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_int16 PASSED [0.0186s] [ 81%] 2025-12-04T14:02:33.5853019Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_int32 PASSED [1.5269s] [ 81%] 2025-12-04T14:02:33.5853129Z test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_float32 PASSED [0.1816s] [ 81%] 2025-12-04T14:02:33.5853230Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_bfloat16 PASSED [1.4696s] [ 81%] 2025-12-04T14:02:33.5853332Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_complex32 PASSED [0.0054s] [ 81%] 2025-12-04T14:02:33.5853436Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_complex64 PASSED [0.0039s] [ 81%] 2025-12-04T14:02:33.5853535Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_float16 PASSED [1.4909s] [ 81%] 2025-12-04T14:02:33.5853633Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_float32 PASSED [0.0055s] [ 81%] 2025-12-04T14:02:33.5853728Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_int16 PASSED [0.0040s] [ 81%] 2025-12-04T14:02:33.5853824Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_uint8 PASSED [1.4744s] [ 81%] 2025-12-04T14:02:33.5853931Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_bool PASSED [0.0055s] [ 82%] 2025-12-04T14:02:33.5854038Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_int16 PASSED [0.0040s] [ 82%] 2025-12-04T14:02:33.5854144Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_uint8 PASSED [1.4944s] [ 82%] 2025-12-04T14:02:33.5854250Z test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_bfloat16 PASSED [0.0045s] [ 82%] 2025-12-04T14:02:33.5854350Z test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_complex128 PASSED [0.0035s] [ 82%] 2025-12-04T14:02:33.5854442Z test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_int16 PASSED [1.4659s] [ 82%] 2025-12-04T14:02:33.5854534Z test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_int64 PASSED [0.0044s] [ 82%] 2025-12-04T14:02:33.5854643Z test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_bfloat16 PASSED [1.4990s] [ 82%] 2025-12-04T14:02:33.5854744Z test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_complex128 PASSED [0.0052s] [ 82%] 2025-12-04T14:02:33.5854839Z test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_int16 PASSED 
[0.0036s] [ 82%] 2025-12-04T14:02:33.5854933Z test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_int32 PASSED [1.4681s] [ 82%] 2025-12-04T14:02:33.5855035Z test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_bfloat16 PASSED [0.0053s] [ 82%] 2025-12-04T14:02:33.5855139Z test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_int8 PASSED [0.0030s] [ 82%] 2025-12-04T14:02:33.5855276Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_exponential_cuda_float32 PASSED [0.0081s] [ 82%] 2025-12-04T14:02:33.5855399Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_gaussian_cuda_float32 PASSED [1.4713s] [ 82%] 2025-12-04T14:02:33.5855529Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_general_cosine_cuda_float32 PASSED [0.0179s] [ 82%] 2025-12-04T14:02:33.5855658Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_general_cosine_cuda_float64 PASSED [0.0160s] [ 82%] 2025-12-04T14:02:33.5855787Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_general_hamming_cuda_float64 PASSED [0.0157s] [ 82%] 2025-12-04T14:02:33.5855906Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_hamming_cuda_float64 PASSED [0.0158s] [ 82%] 2025-12-04T14:02:33.5856026Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_nuttall_cuda_float64 PASSED [0.0156s] [ 82%] 2025-12-04T14:02:33.5856130Z test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_bfloat16 PASSED [0.0027s] [ 82%] 2025-12-04T14:02:33.5856230Z test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_float16 PASSED [1.4753s] [ 82%] 2025-12-04T14:02:33.5856330Z test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_float64 PASSED [0.0043s] [ 82%] 2025-12-04T14:02:33.5856428Z test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_int32 PASSED [1.4673s] [ 83%] 2025-12-04T14:02:33.5856525Z test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_int64 PASSED [0.0044s] [ 83%] 2025-12-04T14:02:33.5856616Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_bool PASSED [1.4748s] [ 83%] 2025-12-04T14:02:33.5856712Z test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_float32 PASSED [0.0044s] [ 83%] 2025-12-04T14:02:33.5856806Z test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_float64 PASSED [1.4761s] [ 83%] 2025-12-04T14:02:33.5856898Z test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_int16 PASSED [0.0044s] [ 83%] 2025-12-04T14:02:33.5856992Z test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_int32 PASSED [1.4797s] [ 83%] 2025-12-04T14:02:33.5857083Z test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_uint8 PASSED [0.0044s] [ 83%] 2025-12-04T14:02:33.5857179Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_float64 PASSED [0.0065s] [ 83%] 2025-12-04T14:02:33.5857272Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_int16 PASSED [0.0058s] [ 83%] 2025-12-04T14:02:33.5857364Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_uint8 PASSED [0.0048s] [ 83%] 2025-12-04T14:02:33.5857460Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_bfloat16 PASSED [1.4820s] [ 83%] 2025-12-04T14:02:33.5857558Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_complex64 PASSED [0.2084s] [ 83%] 2025-12-04T14:02:33.5857665Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_bfloat16 PASSED [1.4728s] [ 83%] 2025-12-04T14:02:33.5857767Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_complex32 PASSED [0.0048s] [ 83%] 2025-12-04T14:02:33.5857863Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_float16 PASSED [1.4881s] [ 83%] 2025-12-04T14:02:33.5857960Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_float32 PASSED [0.0051s] [ 83%] 2025-12-04T14:02:33.5858068Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_float64 PASSED [1.4766s] [ 83%] 2025-12-04T14:02:33.5858179Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_bfloat16 PASSED [0.0169s] [ 83%] 
2025-12-04T14:02:33.5858286Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_float16 PASSED [0.0146s] [ 83%] 2025-12-04T14:02:33.5858394Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_float32 PASSED [0.0144s] [ 83%] 2025-12-04T14:02:33.5858500Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_float64 PASSED [0.0143s] [ 83%] 2025-12-04T14:02:33.5858607Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_int8 PASSED [0.0143s] [ 83%] 2025-12-04T14:02:33.5858726Z test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_cuda_float16 PASSED [0.0052s] [ 83%] 2025-12-04T14:02:33.5858825Z test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_cuda_float32 PASSED [0.0051s] [ 84%] 2025-12-04T14:02:33.5858925Z test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_cuda_float64 PASSED [0.0051s] [ 84%] 2025-12-04T14:02:33.5859037Z test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_bool PASSED [0.0052s] [ 84%] 2025-12-04T14:02:33.5859152Z test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_float16 PASSED [0.0053s] [ 84%] 2025-12-04T14:02:33.5859244Z test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_bool PASSED [0.0053s] [ 84%] 2025-12-04T14:02:33.5859340Z test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_float64 PASSED [0.0130s] [ 84%] 2025-12-04T14:02:33.5859434Z test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_uint8 PASSED [0.0130s] [ 84%] 2025-12-04T14:02:33.5859544Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_int64 PASSED [1.4972s] [ 84%] 2025-12-04T14:02:33.5859650Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_uint8 PASSED [0.0054s] [ 84%] 2025-12-04T14:02:33.5859764Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_float64 PASSED [0.1838s] [ 84%] 2025-12-04T14:02:33.5859875Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_uint8 PASSED [1.4763s] [ 84%] 
2025-12-04T14:02:33.5859987Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_float64 PASSED [0.1288s] [ 84%] 2025-12-04T14:02:33.5860116Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_float64 PASSED [0.1386s] [ 84%] 2025-12-04T14:02:33.5860227Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_int16 PASSED [1.5063s] [ 84%] 2025-12-04T14:02:33.5860337Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_int64 PASSED [0.0055s] [ 84%] 2025-12-04T14:02:33.5860448Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y1_cuda_bool PASSED [0.0053s] [ 84%] 2025-12-04T14:02:33.5860558Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y1_cuda_int16 PASSED [1.4728s] [ 84%] 2025-12-04T14:02:33.5860667Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y1_cuda_int64 PASSED [0.0054s] [ 84%] 2025-12-04T14:02:33.5860796Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_t_cuda_int32 PASSED [0.0107s] [ 84%] 2025-12-04T14:02:33.5860925Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_u_cuda_int16 PASSED [0.0099s] [ 84%] 2025-12-04T14:02:33.5861056Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_u_cuda_int32 PASSED [0.0077s] [ 84%] 2025-12-04T14:02:33.5861183Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_u_cuda_int64 PASSED [0.0077s] [ 84%] 2025-12-04T14:02:33.5861332Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_float32 PASSED [0.0089s] [ 84%] 2025-12-04T14:02:33.5861461Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_int16 PASSED [0.0094s] [ 85%] 2025-12-04T14:02:33.5861588Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_int32 PASSED [0.0076s] [ 85%] 2025-12-04T14:02:33.5861728Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_int8 PASSED [0.0077s] [ 85%] 2025-12-04T14:02:33.5861859Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_float32 PASSED [0.0089s] [ 85%] 2025-12-04T14:02:33.5861988Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_float64 PASSED [0.0088s] [ 85%] 2025-12-04T14:02:33.5862116Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_int32 PASSED [0.0094s] [ 85%] 2025-12-04T14:02:33.5862244Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_int64 PASSED [0.0077s] [ 85%] 2025-12-04T14:02:33.5862390Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_bool PASSED [0.0054s] [ 85%] 2025-12-04T14:02:33.5862500Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_float16 PASSED [0.0049s] [ 85%] 2025-12-04T14:02:33.5862609Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_float32 PASSED [0.0048s] [ 85%] 2025-12-04T14:02:33.5862717Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_float64 PASSED [0.0048s] [ 85%] 2025-12-04T14:02:33.5862823Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_int16 PASSED [0.0043s] [ 85%] 2025-12-04T14:02:33.5862928Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_int64 PASSED [0.0044s] [ 85%] 2025-12-04T14:02:33.5863032Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_int8 PASSED [1.4788s] [ 85%] 2025-12-04T14:02:33.5863165Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_float64 PASSED [0.0116s] [ 85%] 2025-12-04T14:02:33.5863295Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_int16 PASSED [0.0099s] [ 85%] 2025-12-04T14:02:33.5863423Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_int32 PASSED [0.0079s] [ 85%] 
2025-12-04T14:02:33.5863530Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_bfloat16 PASSED [1.5221s] [ 85%] 2025-12-04T14:02:33.5863635Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_int32 PASSED [0.0065s] [ 85%] 2025-12-04T14:02:33.5863738Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_int64 PASSED [0.0034s] [ 85%] 2025-12-04T14:02:33.5863840Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_int8 PASSED [1.5088s] [ 85%] 2025-12-04T14:02:33.5863939Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_bool PASSED [0.0065s] [ 85%] 2025-12-04T14:02:33.5864044Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_float32 PASSED [0.0043s] [ 85%] 2025-12-04T14:02:33.5864146Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_int16 PASSED [1.5061s] [ 85%] 2025-12-04T14:02:33.5864246Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_int32 PASSED [0.0050s] [ 86%] 2025-12-04T14:02:33.5864348Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_int64 PASSED [0.0035s] [ 86%] 2025-12-04T14:02:33.5864455Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_float16 PASSED [1.6191s] [ 86%] 2025-12-04T14:02:33.5864557Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_int32 PASSED [0.0066s] [ 86%] 2025-12-04T14:02:33.5864656Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_int8 PASSED [0.0036s] [ 86%] 2025-12-04T14:02:33.5864759Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_uint8 PASSED [1.4962s] [ 86%] 2025-12-04T14:02:33.5864895Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_bool PASSED [0.0124s] [ 86%] 2025-12-04T14:02:33.5865027Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_float32 PASSED [0.2857s] [ 86%] 2025-12-04T14:02:33.5865153Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_int64 
PASSED [0.0082s] [ 86%] 2025-12-04T14:02:33.5865290Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_int32 PASSED [0.0098s] [ 86%] 2025-12-04T14:02:33.5865416Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_uint8 PASSED [0.0077s] [ 86%] 2025-12-04T14:02:33.5865528Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_float64 PASSED [1.5076s] [ 86%] 2025-12-04T14:02:33.5865638Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_int16 PASSED [0.0099s] [ 86%] 2025-12-04T14:02:33.5865748Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_int32 PASSED [0.0066s] [ 86%] 2025-12-04T14:02:33.5865867Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_int64 PASSED [0.0063s] [ 86%] 2025-12-04T14:02:33.5865985Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_int8 PASSED [0.0062s] [ 86%] 2025-12-04T14:02:33.5866094Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_uint8 PASSED [0.0062s] [ 86%] 2025-12-04T14:02:33.5866220Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_float32 PASSED [1.5007s] [ 86%] 2025-12-04T14:02:33.5866343Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_int32 PASSED [0.0067s] [ 86%] 2025-12-04T14:02:33.5866464Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_int8 PASSED [0.0038s] [ 86%] 2025-12-04T14:02:33.5866585Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_int32 PASSED [0.0046s] [ 86%] 2025-12-04T14:02:33.5866706Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_int64 PASSED [1.4951s] [ 86%] 2025-12-04T14:02:33.5866830Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_uint8 PASSED [0.0054s] [ 86%] 2025-12-04T14:02:33.5866953Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_float32 PASSED [0.0051s] [ 86%] 2025-12-04T14:02:33.5867078Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_float64 PASSED [1.4931s] [ 87%] 2025-12-04T14:02:33.5867198Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_int64 PASSED [0.0070s] [ 87%] 2025-12-04T14:02:33.5867318Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_int8 PASSED [0.0038s] [ 87%] 2025-12-04T14:02:33.5867439Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_uint8 PASSED [1.4637s] [ 87%] 2025-12-04T14:02:33.5867564Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_float64 PASSED [0.1860s] [ 87%] 2025-12-04T14:02:33.5867676Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_float16 PASSED [0.0052s] [ 87%] 2025-12-04T14:02:33.5867784Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_float64 PASSED [0.0045s] [ 87%] 2025-12-04T14:02:33.5867891Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_int32 PASSED [0.0045s] [ 87%] 2025-12-04T14:02:33.5868000Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_int16 PASSED [1.5021s] [ 87%] 2025-12-04T14:02:33.5868108Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_uint8 PASSED [0.0056s] [ 87%] 2025-12-04T14:02:33.5868254Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_float16 PASSED [0.0071s] [ 87%] 2025-12-04T14:02:33.5868396Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int16 PASSED [0.0066s] [ 87%] 2025-12-04T14:02:33.5868547Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int64 PASSED [1.4879s] [ 87%] 2025-12-04T14:02:33.5868685Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_float32 PASSED [0.1536s] [ 87%] 2025-12-04T14:02:33.5868819Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_float64 PASSED [0.0051s] [ 87%] 2025-12-04T14:02:33.5868960Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_int32 PASSED [1.5278s] [ 87%] 2025-12-04T14:02:33.5869093Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k1_cuda_float64 PASSED [0.0068s] [ 87%] 2025-12-04T14:02:33.5869225Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k1_cuda_int32 PASSED [0.0050s] [ 87%] 2025-12-04T14:02:33.5869356Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k1_cuda_int64 PASSED [1.4838s] [ 87%] 2025-12-04T14:02:33.5869489Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k1_cuda_uint8 PASSED [0.0055s] [ 87%] 2025-12-04T14:02:33.5869654Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_bool PASSED [0.0107s] [ 87%] 2025-12-04T14:02:33.5869799Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float32 PASSED [0.3654s] [ 87%] 2025-12-04T14:02:33.5869941Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int16 PASSED [0.0083s] [ 87%] 2025-12-04T14:02:33.5870079Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int32 PASSED [0.0079s] [ 88%] 2025-12-04T14:02:33.5870248Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int64 PASSED [0.0078s] [ 88%] 2025-12-04T14:02:33.5870385Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int8 PASSED [0.0078s] [ 88%] 2025-12-04T14:02:33.5870525Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_uint8 PASSED [0.0077s] [ 88%] 2025-12-04T14:02:33.5870668Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float64 PASSED [0.3806s] [ 88%] 2025-12-04T14:02:33.5870805Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int16 PASSED [0.0104s] [ 88%] 2025-12-04T14:02:33.5870943Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_uint8 PASSED [0.0079s] [ 88%] 2025-12-04T14:02:33.5871080Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_bool PASSED [0.0095s] [ 88%] 2025-12-04T14:02:33.5871223Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_float64 PASSED [0.0089s] [ 88%] 2025-12-04T14:02:33.5871365Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float32 PASSED [0.0089s] [ 88%] 2025-12-04T14:02:33.5871503Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int64 PASSED [0.0095s] [ 88%] 2025-12-04T14:02:33.5871631Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_float32 PASSED [0.0042s] [ 88%] 2025-12-04T14:02:33.5871759Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_float64 PASSED [1.5280s] [ 88%] 2025-12-04T14:02:33.5871885Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_int32 PASSED [0.0072s] [ 88%] 2025-12-04T14:02:33.5871999Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_float32 PASSED [0.0136s] [ 88%] 2025-12-04T14:02:33.5872112Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_float64 PASSED [0.0128s] [ 88%] 2025-12-04T14:02:33.5872221Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_int16 PASSED [0.0128s] [ 88%] 
2025-12-04T14:02:33.5872341Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_int8 PASSED [0.0127s] [ 88%] 2025-12-04T14:02:33.5872451Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_uint8 PASSED [0.0128s] [ 88%] 2025-12-04T14:02:33.5872557Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_float32 PASSED [0.0098s] [ 88%] 2025-12-04T14:02:33.5872664Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_float64 PASSED [0.0093s] [ 88%] 2025-12-04T14:02:33.5872785Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_int64 PASSED [0.0099s] [ 88%] 2025-12-04T14:02:33.5872880Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_bool PASSED [1.4765s] [ 88%] 2025-12-04T14:02:33.5872984Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_complex32 PASSED [0.0042s] [ 88%] 2025-12-04T14:02:33.5873084Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_complex64 PASSED [1.4772s] [ 89%] 2025-12-04T14:02:33.5873182Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_float16 PASSED [0.0041s] [ 89%] 2025-12-04T14:02:33.5873278Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_int32 PASSED [1.4887s] [ 89%] 2025-12-04T14:02:33.5873398Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_int64 PASSED [0.0042s] [ 89%] 2025-12-04T14:02:33.5873511Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_bfloat16 PASSED [1.4922s] [ 89%] 2025-12-04T14:02:33.5873620Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_bool PASSED [0.0045s] [ 89%] 2025-12-04T14:02:33.5873730Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_float16 PASSED [1.5024s] [ 89%] 2025-12-04T14:02:33.5873841Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_float32 PASSED [0.0047s] [ 89%] 2025-12-04T14:02:33.5873950Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_float64 PASSED [1.5076s] [ 89%] 
2025-12-04T14:02:33.5874072Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_bfloat16 PASSED [0.0052s] [ 89%] 2025-12-04T14:02:33.5874191Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_float16 PASSED [0.0036s] [ 89%] 2025-12-04T14:02:33.5874310Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_float32 PASSED [1.5054s] [ 89%] 2025-12-04T14:02:33.5874427Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_float64 PASSED [0.0052s] [ 89%] 2025-12-04T14:02:33.5874544Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_int16 PASSED [0.0038s] [ 89%] 2025-12-04T14:02:33.5874658Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_int32 PASSED [1.5021s] [ 89%] 2025-12-04T14:02:33.5874773Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_int64 PASSED [0.0053s] [ 89%] 2025-12-04T14:02:33.5874886Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_bfloat16 PASSED [0.0035s] [ 89%] 2025-12-04T14:02:33.5874997Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_float64 PASSED [1.4933s] [ 89%] 2025-12-04T14:02:33.5875108Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_int8 PASSED [0.0047s] [ 89%] 2025-12-04T14:02:33.5875208Z test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_bfloat16 PASSED [1.4993s] [ 89%] 2025-12-04T14:02:33.5875310Z test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_complex128 PASSED [0.0043s] [ 89%] 2025-12-04T14:02:33.5875406Z test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_float16 PASSED [1.5034s] [ 89%] 2025-12-04T14:02:33.5875502Z test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_float32 PASSED [0.0044s] [ 89%] 2025-12-04T14:02:33.5875596Z test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_float64 PASSED [1.5192s] [ 90%] 2025-12-04T14:02:33.5875691Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_int32 PASSED [0.0044s] [ 90%] 2025-12-04T14:02:33.5875794Z test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_complex64 PASSED [0.0046s] [ 90%] 2025-12-04T14:02:33.5875902Z test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_float32 PASSED [0.0040s] [ 90%] 2025-12-04T14:02:33.5875998Z test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_int32 PASSED [1.5046s] [ 90%] 2025-12-04T14:02:33.5876093Z test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_int8 PASSED [0.0059s] [ 90%] 2025-12-04T14:02:33.5876197Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_bool PASSED [0.0061s] [ 90%] 2025-12-04T14:02:33.5876318Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_complex32 PASSED [0.0059s] [ 90%] 2025-12-04T14:02:33.5876425Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_float16 PASSED [0.0057s] [ 90%] 2025-12-04T14:02:33.5876530Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_int16 PASSED [0.0057s] [ 90%] 2025-12-04T14:02:33.5876633Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_int32 PASSED [0.0057s] [ 90%] 2025-12-04T14:02:33.5876739Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_int64 PASSED [0.0057s] [ 90%] 2025-12-04T14:02:33.5876859Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_uint8 PASSED [0.0057s] [ 90%] 2025-12-04T14:02:33.5876972Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_complex64 PASSED [0.0041s] [ 90%] 2025-12-04T14:02:33.5877073Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_float16 PASSED [1.5052s] [ 90%] 2025-12-04T14:02:33.5877172Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_float32 PASSED [0.0058s] [ 90%] 2025-12-04T14:02:33.5877269Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_int16 PASSED [0.0043s] [ 90%] 2025-12-04T14:02:33.5877365Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_int32 PASSED [0.0041s] [ 90%] 2025-12-04T14:02:33.5877460Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_int64 PASSED [1.5080s] [ 90%] 2025-12-04T14:02:33.5877554Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_int8 PASSED [0.0056s] [ 90%] 2025-12-04T14:02:33.5877651Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_uint8 PASSED [0.0043s] [ 90%] 2025-12-04T14:02:33.5877770Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_complex32 PASSED [1.4817s] [ 90%] 2025-12-04T14:02:33.5877888Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_complex64 PASSED [0.0054s] [ 90%] 2025-12-04T14:02:33.5878001Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_float16 PASSED [0.0040s] [ 90%] 2025-12-04T14:02:33.5878113Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_float64 PASSED [1.4950s] [ 91%] 2025-12-04T14:02:33.5878223Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_int32 PASSED [0.0053s] [ 91%] 2025-12-04T14:02:33.5878332Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_int8 PASSED [0.0039s] [ 91%] 2025-12-04T14:02:33.5878431Z test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_bfloat16 PASSED [0.0086s] [ 91%] 2025-12-04T14:02:33.5878534Z test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_complex32 PASSED [0.0083s] [ 91%] 2025-12-04T14:02:33.5878633Z test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_float32 PASSED [0.0081s] [ 91%] 2025-12-04T14:02:33.5878728Z test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_int64 PASSED [0.0080s] [ 91%] 2025-12-04T14:02:33.5878826Z test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_uint8 PASSED [0.0081s] [ 91%] 2025-12-04T14:02:33.5878921Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_bfloat16 PASSED [0.0102s] [ 91%] 2025-12-04T14:02:33.5879019Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_complex128 PASSED [0.0107s] [ 91%] 2025-12-04T14:02:33.5879114Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_complex64 PASSED [0.0106s] [ 91%] 2025-12-04T14:02:33.5879208Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_float64 PASSED [1.5223s] [ 91%] 2025-12-04T14:02:33.5879314Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_cuda_complex128 PASSED [0.0114s] [ 91%] 2025-12-04T14:02:33.5879429Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_cuda_complex64 PASSED [1.4766s] [ 91%] 2025-12-04T14:02:33.5879538Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_bfloat16 PASSED [0.0047s] [ 91%] 2025-12-04T14:02:33.5879651Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_complex128 PASSED [1.4983s] [ 91%] 2025-12-04T14:02:33.5879770Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_complex64 PASSED [0.0048s] [ 91%] 2025-12-04T14:02:33.5879879Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_float16 PASSED [1.4834s] [ 91%] 2025-12-04T14:02:33.5879975Z test_meta.py::TestMetaCUDA::test_meta_outplace_stft_cuda_float32 PASSED [0.3348s] [ 91%] 2025-12-04T14:02:33.5880073Z test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_complex32 PASSED [0.0102s] [ 91%] 2025-12-04T14:02:33.5880204Z test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_float64 PASSED [0.0095s] [ 91%] 2025-12-04T14:02:33.5880298Z test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_int64 PASSED [0.0093s] [ 91%] 2025-12-04T14:02:33.5880414Z test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_int8 PASSED [1.4833s] [ 91%] 2025-12-04T14:02:33.5880510Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_bfloat16 PASSED [0.0106s] [ 91%] 2025-12-04T14:02:33.5880605Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_int32 PASSED [0.0080s] [ 92%] 2025-12-04T14:02:33.5880706Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_bool 
PASSED [0.0068s] [ 92%] 2025-12-04T14:02:33.5880811Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_int16 PASSED [0.0067s] [ 92%] 2025-12-04T14:02:33.5880913Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_uint8 PASSED [0.0066s] [ 92%] 2025-12-04T14:02:33.5881008Z test_meta.py::TestMetaCUDA::test_meta_outplace_svd_cuda_float32 PASSED [1.5652s] [ 92%] 2025-12-04T14:02:33.5881117Z test_meta.py::TestMetaCUDA::test_meta_outplace_svd_lowrank_cuda_complex64 PASSED [0.0974s] [ 92%] 2025-12-04T14:02:33.5881217Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_bfloat16 PASSED [1.4847s] [ 92%] 2025-12-04T14:02:33.5881311Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_bool PASSED [0.0051s] [ 92%] 2025-12-04T14:02:33.5881413Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_complex64 PASSED [0.0037s] [ 92%] 2025-12-04T14:02:33.5881511Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_float32 PASSED [1.5151s] [ 92%] 2025-12-04T14:02:33.5881605Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_complex64 PASSED [0.0044s] [ 92%] 2025-12-04T14:02:33.5881696Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_float16 PASSED [1.5042s] [ 92%] 2025-12-04T14:02:33.5881787Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_float64 PASSED [0.0046s] [ 92%] 2025-12-04T14:02:33.5881899Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_bfloat16 PASSED [1.5177s] [ 92%] 2025-12-04T14:02:33.5882008Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_bool PASSED [0.0050s] [ 92%] 2025-12-04T14:02:33.5882121Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_complex64 PASSED [1.4808s] [ 92%] 2025-12-04T14:02:33.5882231Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_float64 PASSED [0.0051s] [ 92%] 2025-12-04T14:02:33.5882333Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_complex128 PASSED [0.0082s] [ 92%] 
2025-12-04T14:02:33.5882428Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_float64 PASSED [1.5058s] [ 92%] 2025-12-04T14:02:33.5882522Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_int16 PASSED [0.0093s] [ 92%] 2025-12-04T14:02:33.5882615Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_int32 PASSED [0.0076s] [ 92%] 2025-12-04T14:02:33.5882707Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_int64 PASSED [0.0073s] [ 92%] 2025-12-04T14:02:33.5882797Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_int32 PASSED [1.5008s] [ 92%] 2025-12-04T14:02:33.5882901Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_int8 PASSED [0.0045s] [ 93%] 2025-12-04T14:02:33.5883000Z test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_complex32 PASSED [1.5028s] [ 93%] 2025-12-04T14:02:33.5883100Z test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_complex64 PASSED [0.0045s] [ 93%] 2025-12-04T14:02:33.5883208Z test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_float16 PASSED [1.4845s] [ 93%] 2025-12-04T14:02:33.5883303Z test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_float32 PASSED [0.0044s] [ 93%] 2025-12-04T14:02:33.5883395Z test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_int16 PASSED [1.5124s] [ 93%] 2025-12-04T14:02:33.5883488Z test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_int32 PASSED [0.0046s] [ 93%] 2025-12-04T14:02:33.5883579Z test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_int8 PASSED [1.4826s] [ 93%] 2025-12-04T14:02:33.5883690Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_bfloat16 PASSED [0.0072s] [ 93%] 2025-12-04T14:02:33.5883817Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_float16 PASSED [0.0054s] [ 93%] 2025-12-04T14:02:33.5883926Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_float32 PASSED [0.0051s] [ 93%] 2025-12-04T14:02:33.5884031Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_int64 
PASSED [0.0050s] [ 93%] 2025-12-04T14:02:33.5884137Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_int8 PASSED [1.4973s] [ 93%] 2025-12-04T14:02:33.5884243Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_bfloat16 PASSED [0.0081s] [ 93%] 2025-12-04T14:02:33.5884350Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_complex64 PASSED [1.5134s] [ 93%] 2025-12-04T14:02:33.5884453Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_float16 PASSED [0.0083s] [ 93%] 2025-12-04T14:02:33.5884556Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_float32 PASSED [0.0054s] [ 93%] 2025-12-04T14:02:33.5884660Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_float64 PASSED [0.0051s] [ 93%] 2025-12-04T14:02:33.5884752Z test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_bool PASSED [0.0164s] [ 93%] 2025-12-04T14:02:33.5884853Z test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_complex128 PASSED [0.0165s] [ 93%] 2025-12-04T14:02:33.5884949Z test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_float32 PASSED [0.0163s] [ 93%] 2025-12-04T14:02:33.5885042Z test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_int32 PASSED [0.0162s] [ 93%] 2025-12-04T14:02:33.5885154Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_bfloat16 SKIPPED [0.0001s] (Skipped!) [ 93%] 2025-12-04T14:02:33.5885267Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_complex128 SKIPPED [0.0001s] (Skipped!) [ 93%] 2025-12-04T14:02:33.5885377Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_complex64 SKIPPED [0.0001s] (Skipped!) [ 94%] 2025-12-04T14:02:33.5885484Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_int32 SKIPPED [0.0001s] (Skipped!) [ 94%] 2025-12-04T14:02:33.5885590Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_int64 SKIPPED [0.0001s] (Skipped!) [ 94%] 2025-12-04T14:02:33.5885696Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_int8 SKIPPED [0.0001s] (Skipped!) 
[ 94%] 2025-12-04T14:02:33.5885803Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_complex64 PASSED [1.5534s] [ 94%] 2025-12-04T14:02:33.5885904Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_int16 PASSED [0.0051s] [ 94%] 2025-12-04T14:02:33.5886004Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_int64 PASSED [1.5183s] [ 94%] 2025-12-04T14:02:33.5886103Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_uint8 PASSED [0.0052s] [ 94%] 2025-12-04T14:02:33.5886197Z test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_int32 PASSED [0.0204s] [ 94%] 2025-12-04T14:02:33.5886299Z test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_int8 PASSED [0.0062s] [ 94%] 2025-12-04T14:02:33.5886396Z test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_uint8 PASSED [0.0060s] [ 94%] 2025-12-04T14:02:33.5886540Z test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__flash_attention_forward_cuda_bfloat16 PASSED [0.0178s] [ 94%] 2025-12-04T14:02:33.5886685Z test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_int32 PASSED [0.0082s] [ 94%] 2025-12-04T14:02:33.5886785Z test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_complex32 PASSED [1.4905s] [ 94%] 2025-12-04T14:02:33.5886882Z test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_float32 PASSED [0.0039s] [ 94%] 2025-12-04T14:02:33.5886976Z test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_int16 PASSED [1.5013s] [ 94%] 2025-12-04T14:02:33.5887069Z test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_int8 PASSED [0.0041s] [ 94%] 2025-12-04T14:02:33.5887163Z test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_uint8 PASSED [1.4964s] [ 94%] 2025-12-04T14:02:33.5887300Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_float16 PASSED [0.0111s] [ 94%] 2025-12-04T14:02:33.5887410Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_float32 PASSED [1.4935s] [ 94%] 
2025-12-04T14:02:33.5887520Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_float64 PASSED [0.0077s] [ 94%] 2025-12-04T14:02:33.5887627Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_complex32 PASSED [1.4951s] [ 94%] 2025-12-04T14:02:33.5887732Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_float16 PASSED [0.0059s] [ 94%] 2025-12-04T14:02:33.5887838Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_complex128 PASSED [0.0082s] [ 95%] 2025-12-04T14:02:33.5887945Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_complex64 PASSED [0.0073s] [ 95%] 2025-12-04T14:02:33.5888050Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_float64 PASSED [0.0073s] [ 95%] 2025-12-04T14:02:33.5888150Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_int8 PASSED [0.0074s] [ 95%] 2025-12-04T14:02:33.5888249Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_bfloat16 PASSED [1.4928s] [ 95%] 2025-12-04T14:02:33.5888345Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_float32 PASSED [0.0095s] [ 95%] 2025-12-04T14:02:33.5888441Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_int16 PASSED [1.5010s] [ 95%] 2025-12-04T14:02:33.5888559Z test_meta.py::TestMetaCUDA::test_meta_outplace_triangular_solve_cuda_complex128 PASSED [0.0144s] [ 95%] 2025-12-04T14:02:33.5888676Z test_meta.py::TestMetaCUDA::test_meta_outplace_triangular_solve_cuda_complex64 PASSED [0.0118s] [ 95%] 2025-12-04T14:02:33.5888776Z test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_complex128 PASSED [0.0101s] [ 95%] 2025-12-04T14:02:33.5888874Z test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_complex64 PASSED [0.0096s] [ 95%] 2025-12-04T14:02:33.5888967Z test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_int16 PASSED [0.0095s] [ 95%] 2025-12-04T14:02:33.5889074Z test_meta.py::TestMetaCUDA::test_meta_outplace_tril_indices_cuda_int32 PASSED [0.0082s] [ 95%] 
2025-12-04T14:02:33.5889166Z test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_bool PASSED [0.0099s] [ 95%] 2025-12-04T14:02:33.5889262Z test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_float64 PASSED [1.5199s] [ 95%] 2025-12-04T14:02:33.5889354Z test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_int16 PASSED [0.0123s] [ 95%] 2025-12-04T14:02:33.5889446Z test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_int8 PASSED [0.0099s] [ 95%] 2025-12-04T14:02:33.5889553Z test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_bfloat16 PASSED [0.0091s] [ 95%] 2025-12-04T14:02:33.5889659Z test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_float32 PASSED [0.0086s] [ 95%] 2025-12-04T14:02:33.5889775Z test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_int64 PASSED [0.0086s] [ 95%] 2025-12-04T14:02:33.5889870Z test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_int8 PASSED [1.4983s] [ 95%] 2025-12-04T14:02:33.5889981Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_complex128 PASSED [0.0064s] [ 95%] 2025-12-04T14:02:33.5890126Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_complex32 PASSED [0.0045s] [ 95%] 2025-12-04T14:02:33.5890260Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_float16 PASSED [1.4924s] [ 95%] 2025-12-04T14:02:33.5890365Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_float32 PASSED [0.0063s] [ 96%] 2025-12-04T14:02:33.5890467Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_int8 PASSED [0.0045s] [ 96%] 2025-12-04T14:02:33.5890569Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_complex128 PASSED [1.5044s] [ 96%] 2025-12-04T14:02:33.5890665Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_int16 PASSED [0.0059s] [ 96%] 2025-12-04T14:02:33.5890760Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_int8 PASSED [0.0041s] [ 96%] 2025-12-04T14:02:33.5890882Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_uint8 PASSED [1.4796s] [ 96%] 2025-12-04T14:02:33.5890989Z test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_complex32 PASSED [0.0063s] [ 96%] 2025-12-04T14:02:33.5891090Z test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_int16 PASSED [0.0048s] [ 96%] 2025-12-04T14:02:33.5891188Z test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_int8 PASSED [0.0046s] [ 96%] 2025-12-04T14:02:33.5891295Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_bfloat16 PASSED [0.0120s] [ 96%] 2025-12-04T14:02:33.5891398Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_int16 PASSED [0.0117s] [ 96%] 2025-12-04T14:02:33.5891500Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_int8 PASSED [0.0116s] [ 96%] 2025-12-04T14:02:33.5891604Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_uint8 PASSED [0.0115s] [ 96%] 2025-12-04T14:02:33.5891706Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_bfloat16 PASSED [1.5162s] [ 96%] 2025-12-04T14:02:33.5891810Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_complex128 PASSED [0.0089s] [ 96%] 2025-12-04T14:02:33.5891913Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_complex64 PASSED [1.4964s] [ 96%] 2025-12-04T14:02:33.5892012Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_float16 PASSED [0.0089s] [ 96%] 2025-12-04T14:02:33.5892108Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_float32 PASSED [1.5058s] [ 96%] 2025-12-04T14:02:33.5892205Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_int32 PASSED [0.0089s] [ 96%] 2025-12-04T14:02:33.5892298Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_int8 PASSED [1.4903s] [ 96%] 2025-12-04T14:02:33.5892402Z test_meta.py::TestMetaCUDA::test_meta_outplace_uniform_cuda_complex64 PASSED [0.0056s] [ 96%] 2025-12-04T14:02:33.5892503Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_uniform_cuda_float32 PASSED [0.0040s] [ 96%] 2025-12-04T14:02:33.5892621Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_float32 PASSED [0.0934s] [ 96%] 2025-12-04T14:02:33.5892718Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_float32 PASSED [0.1924s] [ 96%] 2025-12-04T14:02:33.5892815Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_int16 PASSED [0.1907s] [ 97%] 2025-12-04T14:02:33.5892910Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_int32 PASSED [0.1903s] [ 97%] 2025-12-04T14:02:33.5893005Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_int8 PASSED [0.1895s] [ 97%] 2025-12-04T14:02:33.5893101Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_uint32 PASSED [0.2028s] [ 97%] 2025-12-04T14:02:33.5893197Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_uint64 PASSED [0.2023s] [ 97%] 2025-12-04T14:02:33.5893321Z test_meta.py::TestMetaCUDA::test_meta_outplace_unravel_index_cuda_int8 PASSED [0.0166s] [ 97%] 2025-12-04T14:02:33.5893427Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_bool PASSED [0.0031s] [ 97%] 2025-12-04T14:02:33.5893537Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_complex32 PASSED [1.5221s] [ 97%] 2025-12-04T14:02:33.5893654Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_float16 PASSED [0.0047s] [ 97%] 2025-12-04T14:02:33.5893762Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_float32 PASSED [1.5086s] [ 97%] 2025-12-04T14:02:33.5893867Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_int32 PASSED [0.0046s] [ 97%] 2025-12-04T14:02:33.5893972Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_int32 PASSED [1.5119s] [ 97%] 2025-12-04T14:02:33.5894078Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_int64 PASSED [0.0044s] [ 97%] 2025-12-04T14:02:33.5894183Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_int8 PASSED [1.5014s] [ 97%] 2025-12-04T14:02:33.5894316Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_complex32 PASSED [0.0080s] [ 97%] 2025-12-04T14:02:33.5894427Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_float16 PASSED [0.0063s] [ 97%] 2025-12-04T14:02:33.5894534Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_int8 PASSED [0.0061s] [ 97%] 2025-12-04T14:02:33.5894643Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_complex32 PASSED [0.0043s] [ 97%] 2025-12-04T14:02:33.5894748Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_complex64 PASSED [1.4855s] [ 97%] 2025-12-04T14:02:33.5894851Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_float64 PASSED [0.0061s] [ 97%] 2025-12-04T14:02:33.5894951Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_int32 PASSED [0.0045s] [ 97%] 2025-12-04T14:02:33.5895050Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_int8 PASSED [0.0043s] [ 97%] 2025-12-04T14:02:33.5895151Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_uint8 PASSED [1.4876s] [ 97%] 2025-12-04T14:02:33.5895249Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_complex64 PASSED [0.0114s] [ 98%] 2025-12-04T14:02:33.5895343Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_float64 PASSED [0.0090s] [ 98%] 2025-12-04T14:02:33.5895449Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_complex64 PASSED [0.0081s] [ 98%] 2025-12-04T14:02:33.5895562Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_unbiased_cuda_float64 PASSED [0.0029s] [ 98%] 2025-12-04T14:02:33.5895662Z test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_complex128 PASSED [1.5233s] [ 98%] 2025-12-04T14:02:33.5895761Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_float16 PASSED [0.0050s] [ 98%] 2025-12-04T14:02:33.5895858Z 
test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_int32 PASSED [0.0037s] [ 98%] 2025-12-04T14:02:33.5895956Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_uint8 PASSED [1.4926s] [ 98%] 2025-12-04T14:02:33.5896063Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_complex64 PASSED [0.0075s] [ 98%] 2025-12-04T14:02:33.5896167Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_float64 PASSED [1.5342s] [ 98%] 2025-12-04T14:02:33.5896266Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_int16 PASSED [0.0071s] [ 98%] 2025-12-04T14:02:33.5896365Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_int32 PASSED [1.5224s] [ 98%] 2025-12-04T14:02:33.5896461Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_int8 PASSED [0.0072s] [ 98%] 2025-12-04T14:02:33.5896560Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_uint8 PASSED [1.5048s] [ 98%] 2025-12-04T14:02:33.5896657Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_complex32 PASSED [0.0056s] [ 98%] 2025-12-04T14:02:33.5896764Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_float16 PASSED [0.0042s] [ 98%] 2025-12-04T14:02:33.5896858Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_int16 PASSED [1.5165s] [ 98%] 2025-12-04T14:02:33.5896952Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_int32 PASSED [0.0060s] [ 98%] 2025-12-04T14:02:33.5897044Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_int8 PASSED [0.0042s] [ 98%] 2025-12-04T14:02:33.5897152Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_bfloat16 PASSED [1.4848s] [ 98%] 2025-12-04T14:02:33.5897247Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_bool PASSED [0.0044s] [ 98%] 2025-12-04T14:02:33.5897350Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_complex64 PASSED [1.4832s] [ 98%] 2025-12-04T14:02:33.5897448Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_float64 PASSED [0.0044s] [ 
98%] 2025-12-04T14:02:33.5897543Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_int32 PASSED [1.4892s] [ 98%] 2025-12-04T14:02:33.5897638Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_int8 PASSED [0.0043s] [ 99%] 2025-12-04T14:02:33.5897754Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_uint8 PASSED [1.4907s] [ 99%] 2025-12-04T14:02:33.5897852Z test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_float32 PASSED [0.0037s] [ 99%] 2025-12-04T14:02:33.5897947Z test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_int64 PASSED [1.4796s] [ 99%] 2025-12-04T14:02:33.5898043Z test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_uint8 PASSED [0.0035s] [ 99%] 2025-12-04T14:02:33.5898139Z test_meta.py::TestMetaCUDA::test_meta_outplace_where_cuda_bfloat16 PASSED [0.0079s] [ 99%] 2025-12-04T14:02:33.5898241Z test_meta.py::TestMetaCUDA::test_meta_outplace_where_cuda_complex64 PASSED [0.0070s] [ 99%] 2025-12-04T14:02:33.5898336Z test_meta.py::TestMetaCUDA::test_meta_outplace_where_cuda_float32 PASSED [1.5066s] [ 99%] 2025-12-04T14:02:33.5898433Z test_meta.py::TestMetaCUDA::test_meta_outplace_where_cuda_float64 PASSED [0.0090s] [ 99%] 2025-12-04T14:02:33.5898528Z test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_bool PASSED [0.0131s] [ 99%] 2025-12-04T14:02:33.5898622Z test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_int8 PASSED [0.0128s] [ 99%] 2025-12-04T14:02:33.5898715Z test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_uint8 PASSED [0.0127s] [ 99%] 2025-12-04T14:02:33.5898812Z test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_float16 PASSED [1.5042s] [ 99%] 2025-12-04T14:02:33.5898905Z test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_int32 PASSED [0.0050s] [ 99%] 2025-12-04T14:02:33.5898998Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_int32 PASSED [0.0031s] [ 99%] 2025-12-04T14:02:33.5899091Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_uint8 PASSED 
[1.4929s] [ 99%] 2025-12-04T14:02:33.5899192Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_bool PASSED [0.0078s] [ 99%] 2025-12-04T14:02:33.5899301Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_complex32 PASSED [0.0064s] [ 99%] 2025-12-04T14:02:33.5899403Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_int64 PASSED [0.0056s] [ 99%] 2025-12-04T14:02:33.5899503Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_int8 PASSED [1.4897s] [ 99%] 2025-12-04T14:02:33.5899742Z test_meta.py::TestMetaCUDA::test_mixed_dtype_for_native_layer_norm_backward_float16_float32_cuda SKIPPED [0.0013s] (not supported input dtype is torch.float16 and bias dtype is torch.float32) [ 99%] 2025-12-04T14:02:33.5899889Z test_meta.py::TestMetaCUDA::test_mixed_dtype_for_native_layer_norm_backward_float32_bias_dtype2_cuda PASSED [0.0033s] [ 99%] 2025-12-04T14:02:33.5900024Z test_meta.py::TestMetaCUDA::test_mixed_dtype_for_native_layer_norm_backward_float32_float32_cuda PASSED [0.0023s] [ 99%] 2025-12-04T14:02:33.5900148Z test_meta.py::TestMetaCUDA::test_quantized_embedding_bag_cuda PASSED [0.0094s] [100%] 2025-12-04T14:02:33.5900151Z 2025-12-04T14:02:33.5900333Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_meta/test_meta-363f58d291636c85.xml - 2025-12-04T14:02:33.5900426Z = 2133 passed, 198 skipped, 11036 deselected, 29 xfailed in 661.19s (0:11:01) == 2025-12-04T14:02:33.5900607Z The following tests failed and then succeeded when run in a new process['test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int64'] 2025-12-04T14:02:33.5900620Z 2025-12-04T14:02:33.5900738Z FINISHED PRINTING LOG FILE of test_meta 2/3 (test/test-reports/test_meta_2.3_cccb03203fa43a3b_.log) 2025-12-04T14:02:33.5900740Z 2025-12-04T14:02:33.5900828Z Finished test_meta 2/3 ... 
[2025-12-04 14:02:33.070269][2206415.531239352], took 41.48min 2025-12-04T14:02:33.5901056Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:02:33.5901144Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:02:33.5901239Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T14:02:33.5901301Z Uploading artifacts took 0.00 seconds 2025-12-04T14:02:33.5901400Z Running test_numpy_interop 1/1 ... [2025-12-04 14:02:33.076966][2206415.537951443] 2025-12-04T14:02:33.5901450Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:02:33.5901743Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_numpy_interop.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:02:33.077228] 2025-12-04T14:02:36.0468171Z 2025-12-04T14:02:36.0469014Z test_numpy_interop 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_numpy_interop_1.1_422d8309d3f0f67b_.log 2025-12-04T14:02:36.0480649Z Running 46 items in this shard: test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_bool, test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_complex128, test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_complex64, test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_float16, test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_float32, test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_float64, test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_int16, test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_int32, test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_int64, 
test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_int8, test/test_numpy_interop.py::TestNumPyInteropCUDA::test___eq___cuda_uint8, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_copy_mode_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_ctor_with_invalid_numpy_array_sequence_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_ctor_with_numpy_scalar_ctor_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_empty_tensors_interop_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_from_list_of_ndarray_warning_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_from_numpy_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_from_numpy_no_leak_on_invalid_dtype_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_from_numpy_zero_element_type_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_has_storage_numpy_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_multiplication_numpy_scalar_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_ndarray_astype_object_graph_break_2_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_ndarray_astype_object_graph_break_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_array_interface_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_index_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_index_multi_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_non_writeable_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_bfloat16, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_bool, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_complex128, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_complex64, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_float16, 
test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_float32, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_float64, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_int16, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_int32, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_int64, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_int8, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_scalar_cmp_cuda_uint8, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_numpy_unresizable_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_parse_numpy_int_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_parse_numpy_int_overflow_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_to_numpy_bool_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_to_numpy_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_to_numpy_force_argument_cuda, test/test_numpy_interop.py::TestNumPyInteropCUDA::test_to_numpy_zero_tensor_cuda 2025-12-04T14:02:36.0488432Z 2025-12-04T14:02:36.0488593Z Finished test_numpy_interop 1/1 ... [2025-12-04 14:02:36.046551][2206418.507535166], took 0.05min 2025-12-04T14:02:36.0489163Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:02:36.0535662Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:02:36.0538442Z Running profiler/test_cpp_thread 1/1 ... 
[2025-12-04 14:02:36.053598][2206418.514587252] 2025-12-04T14:02:36.0538947Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:02:36.0539784Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'profiler/test_cpp_thread.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:02:36.053815] 2025-12-04T14:02:47.6400803Z 2025-12-04T14:02:47.6401816Z profiler/test_cpp_thread 1/1 was successful, full logs can be found in artifacts with path test/test-reports/profiler.test_cpp_thread_1.1_d269c5894b72408e_.log 2025-12-04T14:02:47.6404223Z Running 6 items in this shard: test/profiler/test_cpp_thread.py::CppThreadTestCUDA::test_profile_memory_cuda, test/profiler/test_cpp_thread.py::CppThreadTestCUDA::test_with_enable_profiler_in_child_thread_cuda, test/profiler/test_cpp_thread.py::CppThreadTestCUDA::test_without_enable_profiler_in_child_thread_cuda, test/profiler/test_cpp_thread.py::CppThreadTestXPU::test_profile_memory_xpu, test/profiler/test_cpp_thread.py::CppThreadTestXPU::test_with_enable_profiler_in_child_thread_xpu, test/profiler/test_cpp_thread.py::CppThreadTestXPU::test_without_enable_profiler_in_child_thread_xpu 2025-12-04T14:02:47.6406042Z 2025-12-04T14:02:47.6406312Z Finished profiler/test_cpp_thread 1/1 ... [2025-12-04 14:02:47.639778][2206430.100762016], took 0.19min 2025-12-04T14:02:47.6419676Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:02:47.6466481Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:02:47.6468432Z Running test_ops_gradients 1/2 ... 
[2025-12-04 14:02:47.646658][2206430.107647144] 2025-12-04T14:02:47.6468757Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:02:47.6470562Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops_gradients.py', '--shard-id=1', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:02:47.646872] 2025-12-04T14:09:56.1322487Z 2025-12-04T14:09:56.1323075Z test_ops_gradients 1/2 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_gradients_1.2_93375d6a22add345_.log 2025-12-04T14:09:56.1671646Z Running 2681 items in this shard: test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_H_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_H_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyCatCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyCubeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyMulCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyNonzeroCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpySplitCopyWithIntCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_T_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___getitem___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___radd___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rmatmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rmatmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rmod___cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rpow___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__batch_norm_with_update_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__native_batch_norm_legit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__segment_reduce_lengths_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__unsafe_masked_index_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_abs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_acos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addmm_decomposed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addmv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_alias_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_all_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_angle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_angle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_argsort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_argwhere_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_partial_views_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_partial_views_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_asinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atan2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atleast_2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atleast_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atleast_3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atleast_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_baddbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bfloat16_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bfloat16_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bool_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bool_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_broadcast_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_broadcast_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cartesian_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cauchy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cdouble_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ceil_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cfloat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_chalf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cholesky_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_clamp_min_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_column_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_contiguous_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_contiguous_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_corrcoef_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_corrcoef_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_count_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_count_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cov_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cummin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_deg2rad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diag_embed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diag_embed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagflat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagonal_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagonal_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagonal_scatter_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_dist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_div_floor_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_div_no_rounding_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_div_trunc_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_double_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_double_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_einsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_eq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_equal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_equal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_erfc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_erfinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_exp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expand_as_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expand_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expand_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_hfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_hfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ifftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ihfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ihfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_irfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_irfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_irfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_irfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_rfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flip_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fliplr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flipud_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flipud_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_float_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_float_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_float_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_full_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_gather_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ge_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_geqrf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_gradient_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_gradient_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_grid_sampler_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_gt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_half_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_histc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_hstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_igamma_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_imag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_inner_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isinf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isposinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_item_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_binary_return_by_ref_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_unary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_kron_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lerp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cholesky_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigvalsh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigvalsh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_householder_product_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_inv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_inv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_inv_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_ldl_factor_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_ldl_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_ldl_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lstsq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_rank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_rank_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_multi_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_pinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_pinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_pinv_singular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_qr_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_solve_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_svdvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_tensorsolve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_vander_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_vecdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_vector_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log10_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log1p_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logcumsumexp_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_not_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_or_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_xor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_long_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_long_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lu_unpack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mH_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mH_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mT_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mT_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_logaddexp_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_matmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_matmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_matrix_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_max_pool2d_with_indices_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_max_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_max_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_maximum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_meshgrid_variadic_tensors_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_min_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_min_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_minimum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_movedim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_msort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_multinomial_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nanmean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nansum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_narrow_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_narrow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_native_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ne_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_empty_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_channel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv_transpose1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv_transpose2d_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_cosine_similarity_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_dropout3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_embedding_bag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_gelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_group_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_hardswish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_huber_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_interpolate_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_kl_div_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_l1_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_layer_norm_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_linear_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_mse_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_multi_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_circular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_circular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_constant_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_constant_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_reflect_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_reflect_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_replicate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_replicate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pixel_shuffle_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_relu6_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_rms_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_rrelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softsign_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softsign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_tanhshrink_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_upsample_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nonzero_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nonzero_static_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nonzero_static_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_norm_fro_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_norm_nuc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_normal_in_place_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ones_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_outer_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_outer_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_permute_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_permute_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_pinverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_pinverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polygamma_polygamma_n_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polygamma_polygamma_n_2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polygamma_polygamma_n_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polygamma_polygamma_n_4_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_pow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_qr_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_quantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_randn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_randn_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ravel_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ravel_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_renorm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_repeat_interleave_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resize__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resize__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resize_as__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resolve_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rot90_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_round_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_round_decimals_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rsqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rsqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rsub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scalar_tensor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_add_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_reduce_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sgn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sgn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_short_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sigmoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_bartlett_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_general_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_hann_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_nuttall_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sinc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sinh_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_slice_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_slice_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_slice_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sparse_sampled_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sparse_sampled_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_airy_ai_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_bessel_y0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_erfcx_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_hermite_polynomial_he_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_laguerre_polynomial_l_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_legendre_polynomial_p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_ndtri_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_scaled_modified_bessel_k1_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_spherical_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_xlog1py_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_list_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_with_sizes_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_with_sizes_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_with_sizes_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_square_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_squeeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_squeeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_squeeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_squeeze_multiple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_mean_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sum_to_size_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_svd_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_t_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_t_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_take_along_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_take_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_take_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tensordot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tile_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trace_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_transpose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_transpose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_triangular_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tril_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tril_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_triu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trunc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unbind_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unbind_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unfold_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unique_consecutive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsafe_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsafe_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsqueeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsqueeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_mean_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_vdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_view_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_view_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_vsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_vstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_where_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_where_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_zero__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_H_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyCatCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyMulScalarCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyNonzeroCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyTakeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_T_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___getitem___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___radd___cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rdiv___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rdiv___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rmatmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rpow___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rpow___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rsub___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__chunk_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__chunk_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__native_batch_norm_legit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__segment_reduce_offsets_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__softmax_backward_data_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__unsafe_masked_index_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__unsafe_masked_index_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_acosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addbmm_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addcdiv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addcmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addmm_decomposed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addmm_decomposed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_alias_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_allclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_aminmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_angle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_arange_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_argsort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_argwhere_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_argwhere_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_partial_views_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_asin_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atan2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atleast_1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atleast_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atleast_3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_auto_functionalize_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bernoulli_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bfloat16_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_block_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_broadcast_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bucketize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_byte_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cartesian_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cauchy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cdouble_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ceil_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cfloat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_chalf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_char_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_inverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_inverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_clamp_min_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_clone_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_conj_physical_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_constant_pad_nd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_copysign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cov_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cummax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cummin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cumsum_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cumulative_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cumulative_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diag_embed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagflat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagonal_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diff_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diff_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_digamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_double_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_double_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_einsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_einsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_permuted_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_permuted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_strided_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_eq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_eq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_equal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_erfc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_erfinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_exp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expand_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expand_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expand_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expand_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expm1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_eye_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfftn_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ihfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ihfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_irfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_irfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_flatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_flipud_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_float_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_float_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_float_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_floor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_floor_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_frac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_frexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_gather_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_gather_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ge_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_geometric_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_geqrf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_gradient_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_grid_sampler_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_gt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_heaviside_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_hypot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_igamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_igammac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_imag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_inner_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_int_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_int_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_invoke_quant_simple_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isfinite_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isinf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isnan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isneginf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isreal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_istft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_2inputs_2outputs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_binary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_binary_return_by_ref_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_kron_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_kthvalue_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ldexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_le_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lerp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lerp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lgamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cholesky_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cholesky_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cholesky_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cond_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cond_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_det_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_det_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eig_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eigh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eigvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eigvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eigvalsh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_householder_product_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_inv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_ldl_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_ldl_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lstsq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lstsq_grad_oriented_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_rank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_multi_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_multi_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_pinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_pinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_pinv_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_slogdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_ex_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_triangular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_triangular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_svdvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_tensorinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_tensorsolve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_tensorsolve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_vander_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_vector_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log10_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logaddexp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logaddexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logcumsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logcumsumexp_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_not_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_or_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_or_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_xor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mT_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mT_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_map_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_map_triple_nested_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_softmin_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_matmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_max_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_max_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_max_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_maximum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_meshgrid_list_of_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_min_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_min_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nansum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nansum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_narrow_copy_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_narrow_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_narrow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_native_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ne_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv2d_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv_transpose2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv_transpose3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_cosine_similarity_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_ctc_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_dropout2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_dropout3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_gaussian_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_glu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_hardsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_instance_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_area_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_nearest_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_l1_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_leaky_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_local_response_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_unpool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_mish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_multi_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_nll_loss_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_constant_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_replicate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pairwise_distance_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pairwise_distance_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pixel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_relu6_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_silu_complex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_softsign_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_tanhshrink_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_tanhshrink_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_upsample_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_upsample_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nonzero_static_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_fro_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_nuc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_nuc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_normal_in_place_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_normal_number_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ones_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ormqr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_outer_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_outer_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pca_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pca_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_permute_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_permute_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pinverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pinverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_polar_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_polygamma_polygamma_n_4_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_positive_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_quantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rand_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rand_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_randint_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_randint_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_randn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ravel_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_reciprocal_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_renorm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_repeat_interleave_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_reshape_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_reshape_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resize__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resize_as__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resolve_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rot90_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_round_decimals_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rsub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_searchsorted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sgn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sigmoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_general_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_slice_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_slice_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_softmax_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sparse_mm_reduce_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_bessel_j1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_bessel_y0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_bessel_y1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_erfcx_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_hermite_polynomial_h_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_i0e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_i1e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_log_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_modified_bessel_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_modified_bessel_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_modified_bessel_k1_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_ndtri_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_scaled_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_xlog1py_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_zeta_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_list_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_list_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_with_sizes_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_with_sizes_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_square_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_square_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_squeeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_squeeze_multiple_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_squeeze_multiple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_stft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_stft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sum_to_size_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_svd_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_svd_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_t_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_take_along_dim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_take_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tanh_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tensor_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tensor_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tensordot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_trace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_transpose_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tril_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_true_divide_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unbind_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unbind_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unflatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unflatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unfold_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unfold_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unique_consecutive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unique_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsafe_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsafe_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsqueeze_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_var_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_vdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_vdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_as_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_as_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_vsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_vstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_where_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_while_loop_stack_output_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_xlogy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_zero__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_H_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyCubeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyMulCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyNonzeroCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpySplitCopyCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpySplitCopyWithIntCustomOp_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyTakeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyViewCopyCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___getitem___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___radd___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___radd___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rdiv___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rdiv___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rmatmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rmod___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rpow___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rpow___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__chunk_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__chunk_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__unsafe_masked_index_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__unsafe_masked_index_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_acos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_acosh_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_acosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addcdiv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addcmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addcmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addmm_decomposed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addmm_decomposed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_alias_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_any_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_any_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_arange_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_argsort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_argwhere_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_partial_views_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_partial_views_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_asin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atan2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atleast_2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atleast_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atleast_3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atleast_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bernoulli_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bfloat16_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_block_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_broadcast_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_byte_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cartesian_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cdouble_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_chalf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cholesky_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cholesky_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_clone_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cond_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_conj_physical_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_contiguous_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_copysign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_corrcoef_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_corrcoef_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_count_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cov_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cov_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cummax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cummin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diag_embed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagflat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagonal_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagonal_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diff_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dist_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_div_floor_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_div_no_rounding_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_div_trunc_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dsplit_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_einsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_empty_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_equal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_erfc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_exp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_expand_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_expm1_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_eye_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_hfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_hfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_hfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ihfft_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_irfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_irfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_flatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_flatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_flipud_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_float_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_floor_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fmod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_frexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_full_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_full_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_gather_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_geometric_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_geqrf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_gradient_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_gradient_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_grid_sampler_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_grid_sampler_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_gt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_half_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hash_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_histc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hypot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_int_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_int_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_invoke_quant_packed_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_invoke_quant_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isclose_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isfinite_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isinf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isneginf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isreal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_2inputs_2outputs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_unary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_kron_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_kron_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ldexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_le_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cond_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cond_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_det_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_det_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eig_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eigh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eigvalsh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eigvalsh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_householder_product_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_householder_product_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_inv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_inv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_inv_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_inv_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_ldl_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_ldl_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_ldl_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_ldl_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lstsq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lstsq_grad_oriented_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_factor_ex_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_rank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_rank_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_multi_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_norm_subgradients_at_zero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_pinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_pinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_pinv_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_pinv_singular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_slogdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_solve_triangular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_svdvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_svdvals_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_tensorinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_tensorsolve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_vander_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_vander_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_vecdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_vector_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log10_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log10_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log1p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logaddexp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logaddexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logcumsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_and_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_not_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_or_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_or_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_xor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_long_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mT_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_map_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_logaddexp_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_matmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_matrix_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_max_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_max_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_min_binary_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_movedim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_multinomial_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nan_to_num_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nanquantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nansum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_narrow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ne_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ne_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_alpha_dropout_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_celu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_channel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv_transpose1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv_transpose3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_dropout2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_dropout3d_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_gelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_glu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_grid_sample_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_group_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_hardshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_hardsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_area_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_kl_div_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_l1_loss_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_leaky_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_local_response_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_logsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_unpool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_unpool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_mse_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_circular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_circular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_replicate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_replicate_negative_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pairwise_distance_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pairwise_distance_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pixel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_prelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_relu6_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_upsample_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nonzero_static_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_inf_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_nuc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_nuc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_normal_number_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ones_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_outer_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_outer_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_pca_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_permute_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_pinverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_polar_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_polygamma_polygamma_n_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_pow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_pow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_quantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rand_like_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_randint_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_randint_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_randn_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_randn_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ravel_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_real_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_reciprocal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_remainder_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_repeat_interleave_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_reshape_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resize__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resize_as__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resolve_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resolve_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resolve_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_roll_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rot90_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rot90_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_round_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_round_decimals_3_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_round_decimals_neg_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rsqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rsqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rsub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rsub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_reduce_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_searchsorted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sgn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_short_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_short_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sigmoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_gaussian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_general_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_general_hamming_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_hann_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signbit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sinc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_slice_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_slice_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_entr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_erfcx_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_i0e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_scaled_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_shifted_chebyshev_polynomial_u_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_zeta_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_list_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_with_sizes_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_with_sizes_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_with_sizes_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_square_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_square_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_squeeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_squeeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_squeeze_multiple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_unbiased_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_stft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_svd_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_t_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_t_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_take_along_dim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_take_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_take_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tensor_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tensordot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_to_sparse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_to_sparse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_transpose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trapezoid_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trapz_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_triu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_true_divide_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trunc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unbind_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unbind_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unfold_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_uniform_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unique_consecutive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unique_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsafe_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsafe_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsqueeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsqueeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsqueeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_unbiased_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_as_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_as_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_where_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_where_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_while_loop_stack_output_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyCatCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyCubeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyMulScalarCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyNMSCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyNonzeroCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpySplitCopyWithIntCustomOp_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyTakeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_T_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___getitem___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___radd___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___radd___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rsub___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__chunk_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__chunk_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__native_batch_norm_legit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__segment_reduce_lengths_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__softmax_backward_data_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__unsafe_masked_index_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__unsafe_masked_index_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_abs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_acos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addcdiv_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addcmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addmm_decomposed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addmm_decomposed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addmv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_alias_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_all_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_allclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_angle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_any_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_arange_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_argsort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_argwhere_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_partial_views_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_scatter_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atan2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atleast_1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atleast_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_baddbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_bfloat16_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_block_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_bmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_broadcast_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_broadcast_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_broadcast_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_byte_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cartesian_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cauchy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cdouble_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cdouble_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ceil_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cfloat_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_char_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_char_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_inverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_inverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_clamp_min_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_clone_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_clone_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_column_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_column_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_combinations_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_constant_pad_nd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_constant_pad_nd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_contiguous_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_contiguous_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cov_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cov_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cummax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cumulative_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_deg2rad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diag_embed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagflat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagonal_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagonal_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagonal_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_digamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dist_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_div_no_rounding_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_einsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_permuted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_eq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_eq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_equal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_erfinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_exp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expand_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expand_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expand_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expand_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_eye_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fft_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ifftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ihfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ihfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_irfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_irfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_rfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_flatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_flip_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fliplr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_float_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_floor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_frexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_full_like_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_geqrf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_gradient_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_grid_sampler_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_half_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_hash_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_heaviside_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_histc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_hstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_hypot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_igamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_igammac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_imag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_select_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_inner_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_inner_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isnan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isnan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_unary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_kron_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_kthvalue_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lerp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lgamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cholesky_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cond_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_det_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_det_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eig_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eig_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eigh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_householder_product_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_inv_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_ldl_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lstsq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lstsq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lstsq_grad_oriented_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lstsq_grad_oriented_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_factor_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_rank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_rank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_rank_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_norm_subgradients_at_zero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_singular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_singular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_solve_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_solve_ex_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_solve_triangular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_svdvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_tensorinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_vander_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_vecdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_vecdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_vector_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log10_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log1p_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log1p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logaddexp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logcumsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_and_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_and_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_or_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_or_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_xor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_long_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_long_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mH_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mT_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_logaddexp_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_matmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_matmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_max_pool2d_with_indices_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_max_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_meshgrid_variadic_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_meshgrid_variadic_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_min_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_min_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_movedim_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_multinomial_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nanmedian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nansum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nansum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_narrow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_native_dropout_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_alpha_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_avg_pool2d_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv_transpose2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_cosine_similarity_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_dropout3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_fractional_max_pool3d_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_gaussian_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_gelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_glu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_group_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_hardshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_hardsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_kl_div_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_l1_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_linear_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_logsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_pool1d_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_unpool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_unpool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_mish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_multi_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_circular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_circular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_constant_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_reflect_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_reflect_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_replicate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_replicate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pairwise_distance_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pixel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_rms_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_selu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_silu_complex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softplus_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_tanhshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_threshold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nonzero_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nonzero_static_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_fro_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_fro_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_inf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_nuc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_nuc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_normal_in_place_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_normal_in_place_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_normal_number_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ones_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ormqr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_permute_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_permute_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_polygamma_polygamma_n_2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_polygamma_polygamma_n_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_pow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_prod_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_quantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_randint_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_randint_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_randn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_randn_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_real_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_reciprocal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_remainder_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_repeat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_repeat_interleave_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_reshape_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_reshape_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_resize__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_resolve_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_round_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_round_decimals_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_round_decimals_3_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_round_decimals_neg_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_rsqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_reduce_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_searchsorted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_select_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sgn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_short_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sigmoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_general_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_general_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_kaiser_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_nuttall_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_airy_ai_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_bessel_y1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_erfcx_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_hermite_polynomial_h_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_hermite_polynomial_he_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_legendre_polynomial_p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_modified_bessel_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_modified_bessel_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_ndtri_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_scaled_modified_bessel_k0_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_spherical_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_xlog1py_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_zeta_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_list_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_with_sizes_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_with_sizes_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_square_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_squeeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_squeeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_squeeze_multiple_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_squeeze_multiple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_mean_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sum_to_size_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sum_to_size_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_svd_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_t_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_take_along_dim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_take_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tensor_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tensordot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tile_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tile_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_to_sparse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_to_sparse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_transpose_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trapz_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_triangular_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tril_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unbind_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unbind_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unbind_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unbind_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unflatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unflatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_uniform_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_uniform_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsafe_chunk_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsafe_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsqueeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_vdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_as_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_vsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_xlogy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_zero__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_zeros_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_T_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___getitem___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___radd___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rdiv___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rmatmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rmul___cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__chunk_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__native_batch_norm_legit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__segment_reduce_offsets_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__softmax_backward_data_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__unsafe_masked_index_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__unsafe_masked_index_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__upsample_bilinear2d_aa_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_acosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_acosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addcdiv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addcdiv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addcmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addmm_decomposed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addmv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addmv_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_alias_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_allclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_arange_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_argwhere_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_argwhere_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_partial_views_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_asin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_asinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atan2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atleast_1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_baddbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bernoulli_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bfloat16_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_block_diag_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bool_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bool_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_broadcast_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_broadcast_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cartesian_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cauchy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cdouble_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cdouble_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cfloat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cfloat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_chalf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_char_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cholesky_inverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cholesky_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_chunk_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_clamp_max_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_clamp_min_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_clone_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_clone_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_column_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_column_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_combinations_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_conj_physical_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_constant_pad_nd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_contiguous_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_copysign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_corrcoef_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_corrcoef_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_count_nonzero_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_count_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cov_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cummax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cummin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cumulative_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diag_embed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diag_embed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diagonal_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diff_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_div_floor_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_div_no_rounding_mode_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_div_no_rounding_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_div_trunc_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_double_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dsplit_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_einsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_permuted_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_erf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_erfinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expand_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expand_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expand_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expand_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expm1_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_eye_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_eye_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fftn_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_hfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_hfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_hfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_hfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ihfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ihfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_irfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_flip_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fliplr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fliplr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_flipud_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_flipud_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_float_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fmod_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_frexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_gather_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_geometric_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_geqrf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_gradient_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_grid_sampler_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_gt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_half_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_hash_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_histc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_hsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_hstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_igammac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_imag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_reduce_amin_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_inner_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_inner_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_int_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isfinite_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isnan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isnan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isneginf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_item_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_item_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_binary_return_by_ref_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_unary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_kron_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_kron_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ldexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ldexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_le_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lerp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cholesky_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cond_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cond_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eigvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eigvalsh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eigvalsh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_householder_product_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_householder_product_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_ldl_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_ldl_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_ldl_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lstsq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lstsq_grad_oriented_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_matrix_rank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_multi_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_pinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_pinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_pinv_singular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_slogdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_solve_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_solve_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_svdvals_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_tensorsolve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vander_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vecdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vecdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vector_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vector_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log10_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log10_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log1p_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logaddexp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_and_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_and_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_xor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_long_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_long_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lu_unpack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mT_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_prod_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_matrix_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_max_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_max_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_maximum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_meshgrid_list_of_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_meshgrid_variadic_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_meshgrid_variadic_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_min_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_min_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_movedim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mul_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_multinomial_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nanquantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nansum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_narrow_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_narrow_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_narrow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_narrow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_native_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_avg_pool2d_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_celu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_channel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv_transpose1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_cosine_similarity_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_dropout2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_embedding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_gelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_grid_sample_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_group_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_hardshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_hardswish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_huber_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_kl_div_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_leaky_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_linear_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_mish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_circular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_constant_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_constant_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_replicate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_replicate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_replicate_negative_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pairwise_distance_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pairwise_distance_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pixel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_prelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_relu6_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_selu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_silu_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softplus_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softsign_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_tanhshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_upsample_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nonzero_static_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_fro_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_nuc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_nuc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ormqr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ormqr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_outer_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_permute_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_pinverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polar_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polygamma_polygamma_n_2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polygamma_polygamma_n_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polygamma_polygamma_n_4_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_positive_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_positive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_pow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_pow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_prod_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_quantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rad2deg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rand_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_randn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_randn_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ravel_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_real_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_reciprocal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_renorm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_renorm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_repeat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_repeat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_repeat_interleave_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_reshape_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_reshape_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_resize_as__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_resolve_neg_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_roll_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rot90_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_round_decimals_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_round_decimals_neg_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rsub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scalar_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_searchsorted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_select_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sgn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_short_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sigmoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_general_hamming_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_slice_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_bessel_y0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_entr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_hermite_polynomial_he_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_i1e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_log_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_modified_bessel_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_modified_bessel_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_ndtri_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_polygamma_special_polygamma_n_0_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_xlog1py_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_zeta_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_split_list_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_split_with_sizes_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_square_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_squeeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_squeeze_multiple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_stft_cuda_complex128, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_take_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_take_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tensor_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tensor_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tensordot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tile_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_topk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_transpose_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_transpose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_transpose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_triangular_solve_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tril_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_triu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_trunc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unbind_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unbind_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unflatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unfold_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unfold_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unique_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unsqueeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_view_as_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_view_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_vsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_vstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_vstack_cuda_float64, 
test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_where_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_where_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_xlogy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_zero__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_zeros_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_zeros_like_cuda_float64 2025-12-04T14:09:56.2004384Z 2025-12-04T14:09:56.2004504Z Finished test_ops_gradients 1/2 ... [2025-12-04 14:09:56.133803][2206858.594790728], took 7.14min 2025-12-04T14:09:56.2004878Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:09:56.2005231Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:09:56.2005463Z Running distributions/test_constraints 1/1 ... [2025-12-04 14:09:56.140442][2206858.601431517] 2025-12-04T14:09:56.2005658Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:09:56.2006053Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'distributions/test_constraints.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 14:09:56.140630] 2025-12-04T14:09:59.5689333Z 2025-12-04T14:09:59.5690606Z distributions/test_constraints 1/1 was successful, full logs can be found in artifacts with path test/test-reports/distributions.test_constraints_1.1_6f748f1d670f3231_.log 2025-12-04T14:09:59.5713466Z Running 136 items in this shard: test/distributions/test_constraints.py::test_constraint[False-constraint_fn0-False-value0], test/distributions/test_constraints.py::test_constraint[False-constraint_fn1-False-value1], test/distributions/test_constraints.py::test_constraint[False-constraint_fn2-False-value2], test/distributions/test_constraints.py::test_constraint[False-constraint_fn3-True-value3], test/distributions/test_constraints.py::test_constraint[False-constraint_fn4-False-value4], test/distributions/test_constraints.py::test_constraint[False-constraint_fn5-False-value5], test/distributions/test_constraints.py::test_constraint[False-constraint_fn6-True-value6], test/distributions/test_constraints.py::test_constraint[False-constraint_fn7-True-value7], test/distributions/test_constraints.py::test_constraint[False-constraint_fn8-False-value8], test/distributions/test_constraints.py::test_constraint[False-constraint_fn9-True-value9], test/distributions/test_constraints.py::test_constraint[False-constraint_fn10-False-value10], test/distributions/test_constraints.py::test_constraint[False-constraint_fn11-False-value11], test/distributions/test_constraints.py::test_constraint[False-constraint_fn12-True-value12], test/distributions/test_constraints.py::test_constraint[False-constraint_fn13-True-value13], test/distributions/test_constraints.py::test_constraint[False-constraint_fn14-False-value14], test/distributions/test_constraints.py::test_constraint[False-constraint_fn15-True-value15], test/distributions/test_constraints.py::test_constraint[False-constraint_fn16-True-value16], test/distributions/test_constraints.py::test_constraint[False-constraint_fn17-True-value17], 
test/distributions/test_constraints.py::test_constraint[True-constraint_fn0-False-value0], test/distributions/test_constraints.py::test_constraint[True-constraint_fn1-False-value1], test/distributions/test_constraints.py::test_constraint[True-constraint_fn2-False-value2], test/distributions/test_constraints.py::test_constraint[True-constraint_fn3-True-value3], test/distributions/test_constraints.py::test_constraint[True-constraint_fn4-False-value4], test/distributions/test_constraints.py::test_constraint[True-constraint_fn5-False-value5], test/distributions/test_constraints.py::test_constraint[True-constraint_fn6-True-value6], test/distributions/test_constraints.py::test_constraint[True-constraint_fn7-True-value7], test/distributions/test_constraints.py::test_constraint[True-constraint_fn8-False-value8], test/distributions/test_constraints.py::test_constraint[True-constraint_fn9-True-value9], test/distributions/test_constraints.py::test_constraint[True-constraint_fn10-False-value10], test/distributions/test_constraints.py::test_constraint[True-constraint_fn11-False-value11], test/distributions/test_constraints.py::test_constraint[True-constraint_fn12-True-value12], test/distributions/test_constraints.py::test_constraint[True-constraint_fn13-True-value13], test/distributions/test_constraints.py::test_constraint[True-constraint_fn14-False-value14], test/distributions/test_constraints.py::test_constraint[True-constraint_fn15-True-value15], test/distributions/test_constraints.py::test_constraint[True-constraint_fn16-True-value16], test/distributions/test_constraints.py::test_constraint[True-constraint_fn17-True-value17], test/distributions/test_constraints.py::test_biject_to[False-constraint_fn0-args0], test/distributions/test_constraints.py::test_biject_to[False-constraint_fn1-args1], test/distributions/test_constraints.py::test_biject_to[False-constraint_fn2-args2], test/distributions/test_constraints.py::test_biject_to[False-_GreaterThan-args3], 
test/distributions/test_constraints.py::test_biject_to[False-_GreaterThan-args4], test/distributions/test_constraints.py::test_biject_to[False-_GreaterThan-args5], test/distributions/test_constraints.py::test_biject_to[False-_GreaterThan-args6], test/distributions/test_constraints.py::test_biject_to[False-_GreaterThanEq-args7], test/distributions/test_constraints.py::test_biject_to[False-_GreaterThanEq-args8], test/distributions/test_constraints.py::test_biject_to[False-_GreaterThanEq-args9], test/distributions/test_constraints.py::test_biject_to[False-_LessThan-args10], test/distributions/test_constraints.py::test_biject_to[False-_LessThan-args11], test/distributions/test_constraints.py::test_biject_to[False-_LessThan-args12], test/distributions/test_constraints.py::test_biject_to[False-_LessThan-args13], test/distributions/test_constraints.py::test_biject_to[False-constraint_fn14-args14], test/distributions/test_constraints.py::test_biject_to[False-_Interval-args15], test/distributions/test_constraints.py::test_biject_to[False-_Interval-args16], test/distributions/test_constraints.py::test_biject_to[False-_Interval-args17], test/distributions/test_constraints.py::test_biject_to[False-_HalfOpenInterval-args18], test/distributions/test_constraints.py::test_biject_to[False-_HalfOpenInterval-args19], test/distributions/test_constraints.py::test_biject_to[False-_HalfOpenInterval-args20], test/distributions/test_constraints.py::test_biject_to[False-constraint_fn21-args21], test/distributions/test_constraints.py::test_biject_to[False-constraint_fn22-args22], test/distributions/test_constraints.py::test_biject_to[False-constraint_fn23-args23], test/distributions/test_constraints.py::test_biject_to[False-constraint_fn24-args24], test/distributions/test_constraints.py::test_biject_to[True-constraint_fn0-args0], test/distributions/test_constraints.py::test_biject_to[True-constraint_fn1-args1], 
test/distributions/test_constraints.py::test_biject_to[True-constraint_fn2-args2], test/distributions/test_constraints.py::test_biject_to[True-_GreaterThan-args3], test/distributions/test_constraints.py::test_biject_to[True-_GreaterThan-args4], test/distributions/test_constraints.py::test_biject_to[True-_GreaterThan-args5], test/distributions/test_constraints.py::test_biject_to[True-_GreaterThan-args6], test/distributions/test_constraints.py::test_biject_to[True-_GreaterThanEq-args7], test/distributions/test_constraints.py::test_biject_to[True-_GreaterThanEq-args8], test/distributions/test_constraints.py::test_biject_to[True-_GreaterThanEq-args9], test/distributions/test_constraints.py::test_biject_to[True-_LessThan-args10], test/distributions/test_constraints.py::test_biject_to[True-_LessThan-args11], test/distributions/test_constraints.py::test_biject_to[True-_LessThan-args12], test/distributions/test_constraints.py::test_biject_to[True-_LessThan-args13], test/distributions/test_constraints.py::test_biject_to[True-constraint_fn14-args14], test/distributions/test_constraints.py::test_biject_to[True-_Interval-args15], test/distributions/test_constraints.py::test_biject_to[True-_Interval-args16], test/distributions/test_constraints.py::test_biject_to[True-_Interval-args17], test/distributions/test_constraints.py::test_biject_to[True-_HalfOpenInterval-args18], test/distributions/test_constraints.py::test_biject_to[True-_HalfOpenInterval-args19], test/distributions/test_constraints.py::test_biject_to[True-_HalfOpenInterval-args20], test/distributions/test_constraints.py::test_biject_to[True-constraint_fn21-args21], test/distributions/test_constraints.py::test_biject_to[True-constraint_fn22-args22], test/distributions/test_constraints.py::test_biject_to[True-constraint_fn23-args23], test/distributions/test_constraints.py::test_biject_to[True-constraint_fn24-args24], test/distributions/test_constraints.py::test_transform_to[False-constraint_fn0-args0], 
test/distributions/test_constraints.py::test_transform_to[False-constraint_fn1-args1], test/distributions/test_constraints.py::test_transform_to[False-constraint_fn2-args2], test/distributions/test_constraints.py::test_transform_to[False-_GreaterThan-args3], test/distributions/test_constraints.py::test_transform_to[False-_GreaterThan-args4], test/distributions/test_constraints.py::test_transform_to[False-_GreaterThan-args5], test/distributions/test_constraints.py::test_transform_to[False-_GreaterThan-args6], test/distributions/test_constraints.py::test_transform_to[False-_GreaterThanEq-args7], test/distributions/test_constraints.py::test_transform_to[False-_GreaterThanEq-args8], test/distributions/test_constraints.py::test_transform_to[False-_GreaterThanEq-args9], test/distributions/test_constraints.py::test_transform_to[False-_LessThan-args10], test/distributions/test_constraints.py::test_transform_to[False-_LessThan-args11], test/distributions/test_constraints.py::test_transform_to[False-_LessThan-args12], test/distributions/test_constraints.py::test_transform_to[False-_LessThan-args13], test/distributions/test_constraints.py::test_transform_to[False-constraint_fn14-args14], test/distributions/test_constraints.py::test_transform_to[False-_Interval-args15], test/distributions/test_constraints.py::test_transform_to[False-_Interval-args16], test/distributions/test_constraints.py::test_transform_to[False-_Interval-args17], test/distributions/test_constraints.py::test_transform_to[False-_HalfOpenInterval-args18], test/distributions/test_constraints.py::test_transform_to[False-_HalfOpenInterval-args19], test/distributions/test_constraints.py::test_transform_to[False-_HalfOpenInterval-args20], test/distributions/test_constraints.py::test_transform_to[False-constraint_fn21-args21], test/distributions/test_constraints.py::test_transform_to[False-constraint_fn22-args22], test/distributions/test_constraints.py::test_transform_to[False-constraint_fn23-args23], 
test/distributions/test_constraints.py::test_transform_to[False-constraint_fn24-args24], test/distributions/test_constraints.py::test_transform_to[True-constraint_fn0-args0], test/distributions/test_constraints.py::test_transform_to[True-constraint_fn1-args1], test/distributions/test_constraints.py::test_transform_to[True-constraint_fn2-args2], test/distributions/test_constraints.py::test_transform_to[True-_GreaterThan-args3], test/distributions/test_constraints.py::test_transform_to[True-_GreaterThan-args4], test/distributions/test_constraints.py::test_transform_to[True-_GreaterThan-args5], test/distributions/test_constraints.py::test_transform_to[True-_GreaterThan-args6], test/distributions/test_constraints.py::test_transform_to[True-_GreaterThanEq-args7], test/distributions/test_constraints.py::test_transform_to[True-_GreaterThanEq-args8], test/distributions/test_constraints.py::test_transform_to[True-_GreaterThanEq-args9], test/distributions/test_constraints.py::test_transform_to[True-_LessThan-args10], test/distributions/test_constraints.py::test_transform_to[True-_LessThan-args11], test/distributions/test_constraints.py::test_transform_to[True-_LessThan-args12], test/distributions/test_constraints.py::test_transform_to[True-_LessThan-args13], test/distributions/test_constraints.py::test_transform_to[True-constraint_fn14-args14], test/distributions/test_constraints.py::test_transform_to[True-_Interval-args15], test/distributions/test_constraints.py::test_transform_to[True-_Interval-args16], test/distributions/test_constraints.py::test_transform_to[True-_Interval-args17], test/distributions/test_constraints.py::test_transform_to[True-_HalfOpenInterval-args18], test/distributions/test_constraints.py::test_transform_to[True-_HalfOpenInterval-args19], test/distributions/test_constraints.py::test_transform_to[True-_HalfOpenInterval-args20], test/distributions/test_constraints.py::test_transform_to[True-constraint_fn21-args21], 
test/distributions/test_constraints.py::test_transform_to[True-constraint_fn22-args22], test/distributions/test_constraints.py::test_transform_to[True-constraint_fn23-args23], test/distributions/test_constraints.py::test_transform_to[True-constraint_fn24-args24] 2025-12-04T14:09:59.5729751Z 2025-12-04T14:09:59.5729879Z Finished distributions/test_constraints 1/1 ... [2025-12-04 14:09:59.568741][2206862.029728572], took 0.06min 2025-12-04T14:09:59.5730348Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:09:59.5755855Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:09:59.5757263Z Running test_linalg 1/2 ... [2025-12-04 14:09:59.575514][2206862.036503869] 2025-12-04T14:09:59.5757583Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:09:59.5758774Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_linalg.py', '--shard-id=1', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 14:09:59.575700] 2025-12-04T14:21:23.0426496Z 2025-12-04T14:21:23.0427428Z test_linalg 1/2 was successful, full logs can be found in artifacts with path test/test-reports/test_linalg_1.2_4ffb394800b5f220_.log 2025-12-04T14:21:23.0515838Z Running 651 items in this shard: test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_64_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_128_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_64_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_128_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_128_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_32_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_32_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_64_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_64_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_64_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_False_slice_False_cuda, 
test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, 
test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, 
test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, 
test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, 
test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, 
test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, 
test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, 
test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, 
test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_errors_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_False_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_False_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_True_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_True_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_False_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_True_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_True_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_False_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_False_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_True_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_True_use_transpose_b_False_cuda, 
test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_True_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_baddbmm_overflow_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_addmm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_addmm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_5_cuda_float16, 
test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_1_0_cuda_bfloat16, 
test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_5_cuda_bfloat16, 
test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_float64, 
test/test_linalg.py::TestLinalgCUDA::test_addmv_rowmajor_colmajor_incx_incy_lda_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int16, test/test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int32, test/test_linalg.py::TestLinalgCUDA::test_addr_type_promotion_cuda, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_int16, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_int32, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_int64, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_blas_empty_cuda, test/test_linalg.py::TestLinalgCUDA::test_blas_mv_large_input_cuda, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_bfloat16, 
test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_bmm_tunableop_rocm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_broadcast_batched_matmul_cuda, test/test_linalg.py::TestLinalgCUDA::test_call_count_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_chain_matmul_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_backward_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_128_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_128_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_64_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_64_n_4096_cuda, 
test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_64_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_32_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_64_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_64_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test_cond_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cond_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cond_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cross_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cross_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cross_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_cross_with_and_without_dim_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_det_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_det_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_det_logdet_slogdet_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_dot_vs_numpy_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_dot_vs_numpy_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eig_numpy_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eig_numpy_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eig_removed_error_cuda, 
test/test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigh_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigh_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigh_svd_illcondition_matrix_input_should_not_crash_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigvals_numpy_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_einsum_corner_cases_cuda, test/test_linalg.py::TestLinalgCUDA::test_einsum_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_einsum_random_cuda_float64, 
test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_32_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_35_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_36_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_32_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_35_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_32_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_35_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_40_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_35_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_40_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_32_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_36_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_40_cuda, test/test_linalg.py::TestLinalgCUDA::test_gemm_bias_offline_tunableop_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_gemm_bias_tunableop_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_float32, 
test/test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_householder_product_errors_and_warnings_cuda, test/test_linalg.py::TestLinalgCUDA::test_inner_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inner_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inverse_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inverse_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_kron_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_kron_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_kron_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_float64, 
test/test_linalg.py::TestLinalgCUDA::test_large_bmm_backward_cuda, test/test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_cross_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_cross_with_and_without_dim_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_float64, 
test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_batch_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_no_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_utils_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_utils_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lobpcg_basic_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lobpcg_ortho_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lobpcg_torchscript_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_logaddexp_cpu_vs_cuda_complex_cuda, 
test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matmul_45724_cuda, test/test_linalg.py::TestLinalgCUDA::test_matmul_check_entries_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_matmul_mv_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_matmul_out_kernel_errors_with_autograd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_1d_Nd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_tunableop_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_norm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_norm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_power_non_negative_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_complex128, 
test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_rtol_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_removed_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_mm_bmm_non_memory_dense_cuda, test/test_linalg.py::TestLinalgCUDA::test_mm_conjtranspose_cuda, test/test_linalg.py::TestLinalgCUDA::test_mm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_mm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_mm_submatrix_offline_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_multi_dot_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_multi_dot_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_multi_dot_errors_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_complex_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_norm_complex_old_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_complexhalf_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_errors_cuda_float64, 
test/test_linalg.py::TestLinalgCUDA::test_norm_fused_type_promotion_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_old_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_nuclear_norm_axes_small_brute_force_old_cuda, test/test_linalg.py::TestLinalgCUDA::test_numerical_check_accuracy_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_numerical_check_accuracy_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_offline_tuning_append_to_existing_file_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_many_batches_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_complex128, 
test/test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_ger_addr_legacy_tests_cuda, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_float32, 
test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_complex64, 
test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int32, 
test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int16, test/test_linalg.py::TestLinalgCUDA::test_permute_matmul_cuda, test/test_linalg.py::TestLinalgCUDA::test_pinv_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_qr_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_qr_error_cases_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_scaled_gemm_offline_tunableop_cuda_float8_e4m3fnuz, test/test_linalg.py::TestLinalgCUDA::test_scaled_gemm_tunableop_cuda_float8_e4m3fnuz, test/test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_solve_cuda_complex128, 
test/test_linalg.py::TestLinalgCUDA::test_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_solve_removed_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_strided_mm_bmm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_strided_mm_bmm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_svd_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_svd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_svd_lowrank_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensordot_cuda, test/test_linalg.py::TestLinalgCUDA::test_tensordot_out_kernel_errors_with_autograd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensordot_out_kernel_errors_with_autograd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tf32_offline_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tf32_tunableop_cuda_float32, 
test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_bool, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int16, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int32, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int8, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_uint8, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_large_matrix_64bit_cuda, test/test_linalg.py::TestLinalgCUDA::test_validator_tunableop_rocm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_vdot_invalid_args_cuda, test/test_linalg.py::TestLinalgCUDA::test_vdot_vs_numpy_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_complex64, 
test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_dim_tuple_arg_cuda, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_extreme_values_cuda, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_float64 2025-12-04T14:21:23.0588076Z 2025-12-04T14:21:23.0588188Z Finished test_linalg 1/2 ... [2025-12-04 14:21:23.043008][2207545.503992482], took 11.39min 2025-12-04T14:21:23.0588566Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:21:23.0588922Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:21:23.0589124Z Running test_modules 2/2 ... [2025-12-04 14:21:23.049799][2207545.510788663] 2025-12-04T14:21:23.0589290Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:21:23.0589721Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_modules.py', '--shard-id=2', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 14:21:23.050016] 2025-12-04T14:28:49.3382075Z 2025-12-04T14:28:49.3386399Z test_modules 2/2 was successful, full logs can be found in artifacts with path test/test-reports/test_modules_2.2_952521224806fc60_.log 2025-12-04T14:28:49.3616201Z Running 1820 items in this shard: test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_CELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_ELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_ELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_Hardswish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_Hardswish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_Hardtanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_Hardtanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_LeakyReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_LeakyReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_Mish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_ReLU6_cuda_float64, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_ReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_check_inplace_nn_Threshold_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_AdaptiveAvgPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_AdaptiveAvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_AdaptiveAvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_AdaptiveAvgPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_AdaptiveAvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_AvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_BCELoss_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_BCEWithLogitsLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_BatchNorm1d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_BatchNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_BatchNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_BatchNorm2d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_BatchNorm3d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_CTCLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_CircularPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_CircularPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_CircularPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_CircularPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ConstantPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ConstantPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ConstantPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Conv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Conv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Conv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Conv3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Conv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ConvTranspose1d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ConvTranspose2d_cuda_complex128, 
test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ConvTranspose2d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ConvTranspose3d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_CosineEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_CrossEntropyLoss_cuda_float16, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_CrossEntropyLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Embedding_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_FractionalMaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_GELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_GELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_GRUCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_GRU_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_GRU_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_GRU_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_GroupNorm_cuda_bfloat16, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_GroupNorm_cuda_float16, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Hardshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Hardtanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Hardtanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_HingeEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_HuberLoss_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_InstanceNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_InstanceNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_InstanceNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_InstanceNorm2d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_InstanceNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_InstanceNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_InstanceNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_InstanceNorm3d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_L1Loss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LPPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LPPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LPPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LSTMCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LSTM_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LSTM_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LayerNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LayerNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LazyConv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LazyConv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LazyConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LazyConvTranspose2d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LazyConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LazyConvTranspose3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LazyConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LeakyReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LeakyReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LocalResponseNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LocalResponseNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LogSigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LogSoftmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_LogSoftmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_MSELoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_MSELoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_MarginRankingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_MarginRankingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_MaxPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_MaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_MaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Mish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_MultiLabelMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_MultiheadAttention_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_MultiheadAttention_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_NLLLoss_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_PReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_PoissonNLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_RMSNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_RNNCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_RNN_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_RNN_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_RNN_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_RNN_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ReLU6_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ReLU6_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ReflectionPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ReflectionPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ReflectionPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ReflectionPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ReplicationPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ReplicationPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ReplicationPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_SELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_SELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_SiLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Sigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_SoftMarginLoss_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Softmax2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Softmax2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Softmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Softmin_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Softmin_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Softplus_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Tanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Threshold_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Threshold_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_TransformerDecoderLayer_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_TransformerEncoderLayer_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_TransformerEncoderLayer_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_TransformerEncoder_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_TransformerEncoder_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Transformer_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_Transformer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ZeroPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ZeroPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_cpu_gpu_parity_nn_ZeroPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AdaptiveAvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AdaptiveAvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AdaptiveAvgPool3d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AdaptiveMaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AdaptiveMaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AdaptiveMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AdaptiveMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AvgPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AvgPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_AvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_BCEWithLogitsLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_BCEWithLogitsLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_BatchNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_BatchNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_BatchNorm2d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_BatchNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Bilinear_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_CELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_CTCLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_CircularPad3d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConstantPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConstantPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Conv1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Conv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Conv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Conv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConvTranspose1d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConvTranspose1d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConvTranspose1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConvTranspose2d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConvTranspose2d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConvTranspose2d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConvTranspose3d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_CosineEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_CosineEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_CrossEntropyLoss_cuda_float16, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_CrossEntropyLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_FractionalMaxPool2d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_FractionalMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_GELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_GELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_GLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_GRUCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_GRU_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_GaussianNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_GroupNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Hardshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Hardshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Hardswish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Hardtanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_HingeEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_HuberLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_InstanceNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_InstanceNorm2d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_InstanceNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_InstanceNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_InstanceNorm3d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_L1Loss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LPPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LPPool1d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LPPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LPPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LPPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LSTMCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LSTM_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LSTM_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LayerNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LazyConv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LazyConv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LazyConv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LazyConv3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LazyConv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LazyConvTranspose3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LazyConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LeakyReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LeakyReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Linear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LocalResponseNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LogSigmoid_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_LogSoftmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_MarginRankingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_MaxPool1d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_MaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_MaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Mish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Mish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_MultiLabelSoftMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_MultiMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_MultiMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_MultiheadAttention_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_NLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_RMSNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_RNNCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_RNN_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_RNN_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ReLU6_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ReflectionPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ReflectionPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ReflectionPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ReplicationPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ReplicationPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ReplicationPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ReplicationPad3d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_SELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_SiLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Sigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_SoftMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Softmax2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Softmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Softmin_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Softplus_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Tanhshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Threshold_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_Threshold_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_TransformerDecoderLayer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_TransformerEncoderLayer_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_TransformerEncoder_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ZeroPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ZeroPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ZeroPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_device_ctx_init_nn_ZeroPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_BatchNorm1d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_errors_nn_BatchNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_BatchNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_BatchNorm2d_train_mode_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_errors_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_BatchNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_errors_nn_BatchNorm3d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_errors_nn_BatchNorm3d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_CircularPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_errors_nn_CircularPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_CircularPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_GRUCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_errors_nn_GRU_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_errors_nn_GRU_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_errors_nn_GRU_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_GroupNorm_cuda_bfloat16, test/test_modules.py::TestModuleCUDA::test_errors_nn_GroupNorm_cuda_float16, test/test_modules.py::TestModuleCUDA::test_errors_nn_GroupNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_errors_nn_LSTM_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_LSTM_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_RNN_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_errors_nn_RNN_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AdaptiveAvgPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AdaptiveAvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AdaptiveAvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AdaptiveAvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AdaptiveAvgPool3d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AdaptiveAvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AdaptiveMaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AdaptiveMaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AdaptiveMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AvgPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_AvgPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_BCELoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_BatchNorm1d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_BatchNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_BatchNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_BatchNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_BatchNorm2d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_BatchNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Bilinear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CTCLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CTCLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CircularPad1d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CircularPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CircularPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CircularPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConstantPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConstantPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConstantPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConstantPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Conv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Conv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Conv3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Conv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose1d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose1d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose1d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose2d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose2d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose3d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose3d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CosineEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CosineEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CrossEntropyLoss_cuda_float16, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_CrossEntropyLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_FractionalMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_GELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_GRU_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_GRU_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_GRU_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_GroupNorm_cuda_bfloat16, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_GroupNorm_cuda_float16, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_GroupNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Hardswish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Hardtanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Hardtanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_HingeEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_HuberLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_InstanceNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_InstanceNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_InstanceNorm2d_eval_mode_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_InstanceNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_InstanceNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_InstanceNorm3d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_InstanceNorm3d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_KLDivLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_L1Loss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LPPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LPPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LPPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LPPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LSTM_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LSTM_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LayerNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LayerNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LazyConv1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LazyConv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LazyConv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LazyConv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LazyConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LazyConvTranspose2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LeakyReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LeakyReLU_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Linear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LogSigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_LogSoftmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MarginRankingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MaxPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Mish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MultiLabelMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MultiLabelSoftMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MultiLabelSoftMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MultiMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MultiMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MultiheadAttention_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_MultiheadAttention_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_PReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_PoissonNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_RNNCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_RNN_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_RNN_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ReLU_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ReflectionPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ReflectionPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ReplicationPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ReplicationPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_SELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_SiLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Sigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Sigmoid_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_SmoothL1Loss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_SmoothL1Loss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_SoftMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Softmax2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Softmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Softsign_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_Tanhshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_TransformerEncoderLayer_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_TransformerEncoderLayer_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_TransformerEncoder_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ZeroPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ZeroPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ZeroPad2d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_factory_kwargs_nn_ZeroPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_AdaptiveAvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_AdaptiveAvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_AdaptiveAvgPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_AdaptiveMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_AdaptiveMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_AvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_BCELoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_BatchNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_BatchNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_BatchNorm2d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_BatchNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Bilinear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_CTCLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_CircularPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_CircularPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_CircularPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_CircularPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_CircularPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ConstantPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ConstantPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ConstantPad3d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_forward_nn_ConstantPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Conv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Conv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Conv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ConvTranspose1d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_forward_nn_ConvTranspose1d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ConvTranspose1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ConvTranspose2d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_forward_nn_ConvTranspose3d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_CosineEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_CosineEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Embedding_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Embedding_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_FractionalMaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_FractionalMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_FractionalMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_GELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_GELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_GLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_GRUCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_GRU_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_GRU_eval_mode_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_forward_nn_GRU_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_GaussianNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_GaussianNLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_GroupNorm_cuda_float16, test/test_modules.py::TestModuleCUDA::test_forward_nn_GroupNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Hardswish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_HingeEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_HingeEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_InstanceNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_InstanceNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_InstanceNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_InstanceNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_InstanceNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_InstanceNorm3d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_InstanceNorm3d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_KLDivLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_KLDivLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_L1Loss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_LPPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_LPPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_LPPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_LSTMCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_LSTMCell_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_forward_nn_LSTM_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_LSTM_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_LSTM_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_LayerNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_LazyConv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_LazyConv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_LazyConv3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_LazyConvTranspose1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_LazyConvTranspose2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_LazyConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_LeakyReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Linear_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Linear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_LocalResponseNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_LogSigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_LogSigmoid_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_MSELoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_MSELoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_MarginRankingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_MaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_MaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Mish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Mish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_MultiLabelMarginLoss_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_forward_nn_MultiLabelMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_MultiLabelSoftMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_MultiMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_MultiMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_MultiheadAttention_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_MultiheadAttention_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_NLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_PReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_PoissonNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_PoissonNLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReLU6_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReflectionPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReflectionPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReflectionPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReflectionPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReflectionPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReplicationPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReplicationPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReplicationPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ReplicationPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Sigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_SmoothL1Loss_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_forward_nn_Softmax2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Softmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Softplus_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Softplus_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Softshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Softsign_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Tanhshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Tanhshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Threshold_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Threshold_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_TransformerDecoderLayer_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_TransformerEncoderLayer_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_TransformerEncoderLayer_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_TransformerEncoderLayer_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_TransformerEncoderLayer_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_TransformerEncoder_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_TransformerEncoder_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_Transformer_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_Transformer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ZeroPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ZeroPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_forward_nn_ZeroPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_forward_nn_ZeroPad2d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_forward_nn_ZeroPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_grad_nn_AdaptiveAvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_AdaptiveMaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_AvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_BCELoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_BCEWithLogitsLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_BatchNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_BatchNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_BatchNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_BatchNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_BatchNorm3d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_CELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_CTCLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_CircularPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_CircularPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_ConstantPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_Conv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_ConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_Embedding_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_FractionalMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_FractionalMaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_GELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_GRUCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_GRU_eval_mode_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_grad_nn_GroupNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_InstanceNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_InstanceNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_KLDivLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_L1Loss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_LPPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_LPPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_LPPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_LSTM_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_LayerNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_LazyConv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_LazyConv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_LazyConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_LogSoftmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_MaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_MaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_Mish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_MultiMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_MultiheadAttention_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_PReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_RNNCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_RNN_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_ReLU6_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_ReplicationPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_Sigmoid_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_grad_nn_SmoothL1Loss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_SoftMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_Softmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_Softplus_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_Tanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_Tanhshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_TransformerEncoder_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_TransformerEncoder_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_Transformer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_ZeroPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_ZeroPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_grad_nn_ZeroPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_AdaptiveAvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_AdaptiveAvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_AdaptiveMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_AvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_BCELoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_BCEWithLogitsLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_BatchNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_CircularPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_CircularPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_ConstantPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_ConvTranspose2d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_CosineEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_ELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_Embedding_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_FractionalMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_FractionalMaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_GaussianNLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_Hardshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_HingeEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_InstanceNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_InstanceNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_KLDivLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_LPPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_LPPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_LSTM_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_LayerNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_LazyConv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_LazyConv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_LazyConv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_Linear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_LogSoftmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_MSELoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_MultiLabelSoftMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_MultiheadAttention_eval_mode_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_MultiheadAttention_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_RNN_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_RNN_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_ReLU6_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_ReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_ReplicationPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_SELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_Sigmoid_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_SmoothL1Loss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_Softmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_Softmin_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_Softsign_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_Tanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_TransformerDecoderLayer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_gradgrad_nn_TransformerEncoderLayer_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_AdaptiveAvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_AdaptiveAvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_AdaptiveMaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_AdaptiveMaxPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_AvgPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_AvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_BCELoss_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_BCEWithLogitsLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_BatchNorm1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_BatchNorm2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_BatchNorm3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Bilinear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_CircularPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_CircularPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_CircularPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_CircularPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ConstantPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ConstantPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ConstantPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Conv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Conv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Conv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ConvTranspose1d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ConvTranspose1d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ConvTranspose1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ConvTranspose2d_cuda_complex32, 
test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ConvTranspose2d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ConvTranspose3d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_CosineEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Embedding_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GRUCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GRU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GRU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GaussianNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GaussianNLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GroupNorm_cuda_bfloat16, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GroupNorm_cuda_float16, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_GroupNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_HingeEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_HingeEmbeddingLoss_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_HuberLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_InstanceNorm1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_InstanceNorm1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_InstanceNorm2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_InstanceNorm3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_L1Loss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_L1Loss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LPPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LPPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LPPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LSTMCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LSTM_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LazyConv3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LazyConv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LazyConvTranspose1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LazyConvTranspose2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LazyConvTranspose3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LazyConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Linear_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_LogSoftmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_MarginRankingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_MaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_MaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Mish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_MultiLabelMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_MultiLabelMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_MultiLabelSoftMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_MultiMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_MultiMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_MultiheadAttention_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_NLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_PoissonNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_RMSNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_RNNCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_RNN_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_RNN_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ReLU6_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ReLU6_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ReflectionPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ReflectionPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ReplicationPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ReplicationPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ReplicationPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ReplicationPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_SiLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Sigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Sigmoid_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_SmoothL1Loss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_SoftMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Softmax2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Softmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Softmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Softplus_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Softplus_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Softshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Softsign_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Softsign_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Tanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Tanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_Threshold_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_TransformerEncoderLayer_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_TransformerEncoderLayer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_TransformerEncoder_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ZeroPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ZeroPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ZeroPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ZeroPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_if_train_and_eval_modes_differ_nn_ZeroPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AdaptiveAvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AdaptiveAvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AdaptiveAvgPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AdaptiveMaxPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AdaptiveMaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AdaptiveMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AdaptiveMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AvgPool1d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_AvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_BCELoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_BCELoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_BatchNorm2d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_BatchNorm2d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_BatchNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_BatchNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_BatchNorm3d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_BatchNorm3d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Bilinear_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_CELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_CELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_CTCLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_CTCLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_CircularPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_CircularPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_CircularPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_CircularPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_CircularPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConstantPad1d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConstantPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConstantPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConstantPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Conv1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Conv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Conv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Conv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConvTranspose1d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConvTranspose1d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConvTranspose1d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConvTranspose2d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConvTranspose2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConvTranspose3d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConvTranspose3d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ConvTranspose3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_FractionalMaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_FractionalMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_FractionalMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_GELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_GELU_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_memory_format_nn_GRUCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_GRUCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_GRU_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_GroupNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Hardshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Hardswish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Hardswish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Hardtanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Hardtanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_HingeEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_HuberLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_HuberLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_InstanceNorm1d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_InstanceNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_InstanceNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_InstanceNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_InstanceNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_InstanceNorm3d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_KLDivLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_L1Loss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LPPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LPPool2d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LPPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LSTM_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LazyConv1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LazyConv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LazyConv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LazyConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LazyConvTranspose3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LazyConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LeakyReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LeakyReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LocalResponseNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LogSigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LogSigmoid_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_LogSoftmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MSELoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MarginRankingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Mish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Mish_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MultiLabelMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MultiMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MultiMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MultiheadAttention_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MultiheadAttention_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_MultiheadAttention_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_NLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_PReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_PReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_PoissonNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_RMSNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_RNNCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_RNN_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ReLU6_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ReLU6_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ReflectionPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ReplicationPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ReplicationPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ReplicationPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_SELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_SiLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_SoftMarginLoss_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_memory_format_nn_SoftMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Softmin_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Softmin_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Softplus_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Softshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Softshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Tanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Tanhshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_TransformerDecoderLayer_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_TransformerEncoderLayer_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_TransformerEncoderLayer_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_TransformerEncoderLayer_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_TransformerEncoder_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Transformer_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_Transformer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ZeroPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_memory_format_nn_ZeroPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_AdaptiveAvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_AdaptiveAvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_AdaptiveAvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_AdaptiveMaxPool1d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_AdaptiveMaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_AdaptiveMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_AdaptiveMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_AdaptiveMaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_AvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_AvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_BCEWithLogitsLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_BatchNorm1d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_BatchNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_BatchNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_BatchNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_BatchNorm2d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_BatchNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_BatchNorm3d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Bilinear_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_CELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_CTCLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_CircularPad2d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_CircularPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ConstantPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ConstantPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Conv1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Conv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Conv3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ConvTranspose1d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ConvTranspose1d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ConvTranspose1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ConvTranspose2d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ConvTranspose3d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ConvTranspose3d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_CosineEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_CrossEntropyLoss_cuda_float16, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Embedding_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Embedding_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_GLU_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_GRUCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_GRUCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_GRU_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_GroupNorm_cuda_bfloat16, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_GroupNorm_cuda_float16, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_GroupNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_GroupNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Hardshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Hardtanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_HuberLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_HuberLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_InstanceNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_InstanceNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_InstanceNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_InstanceNorm3d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_InstanceNorm3d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_KLDivLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_L1Loss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LPPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LPPool3d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LSTMCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LSTM_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LSTM_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LayerNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LazyConv1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LazyConv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LazyConv3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LazyConv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LazyConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LazyConvTranspose2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LazyConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LazyConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LeakyReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Linear_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LocalResponseNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LocalResponseNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LogSigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LogSigmoid_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_LogSoftmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MSELoss_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MarginRankingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Mish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MultiLabelMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MultiLabelMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MultiLabelSoftMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MultiLabelSoftMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MultiMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MultiheadAttention_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MultiheadAttention_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MultiheadAttention_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_MultiheadAttention_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_PReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_RMSNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_RNNCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_RNN_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ReLU6_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ReflectionPad1d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ReflectionPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ReflectionPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ReflectionPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ReflectionPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ReplicationPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ReplicationPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_SELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_SiLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_SoftMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_SoftMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Softmax2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Softmax2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Softmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Softmax_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Softplus_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Softshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Softsign_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Tanhshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_Threshold_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_TransformerDecoderLayer_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_TransformerEncoderLayer_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_TransformerEncoder_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_TransformerEncoder_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ZeroPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_multiple_device_transfer_nn_ZeroPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AdaptiveAvgPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AdaptiveAvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AdaptiveAvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AdaptiveAvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AdaptiveMaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AdaptiveMaxPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AdaptiveMaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AdaptiveMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AdaptiveMaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AvgPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_AvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_BCELoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_BCEWithLogitsLoss_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_BCEWithLogitsLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_BatchNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_BatchNorm2d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_BatchNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_CELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_CELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_CTCLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_CircularPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_CircularPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_CircularPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConstantPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConstantPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Conv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Conv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Conv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Conv3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConvTranspose1d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConvTranspose1d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConvTranspose1d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConvTranspose2d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConvTranspose3d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConvTranspose3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_CosineEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_CosineEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_CrossEntropyLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Embedding_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_FractionalMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GRUCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GRU_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GRU_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GRU_train_mode_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GRU_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GaussianNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GroupNorm_cuda_float16, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GroupNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_GroupNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Hardshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Hardswish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Hardswish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Hardtanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_HingeEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_HingeEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_HuberLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_HuberLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_InstanceNorm1d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_InstanceNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_InstanceNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_InstanceNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_InstanceNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_InstanceNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_KLDivLoss_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_KLDivLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_L1Loss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_L1Loss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_LPPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_LPPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_LazyConv1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_LazyConv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_LazyConv3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_LazyConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_LazyConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_LeakyReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Linear_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Linear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_LogSigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_LogSoftmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MSELoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MSELoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MarginRankingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MaxPool3d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Mish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MultiLabelMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MultiLabelMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MultiLabelSoftMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MultiMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MultiMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MultiheadAttention_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MultiheadAttention_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_MultiheadAttention_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_NLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_PoissonNLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_RNN_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ReLU6_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ReflectionPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ReflectionPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ReflectionPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ReplicationPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ReplicationPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ReplicationPad2d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ReplicationPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_SELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_SELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_SiLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Sigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_SoftMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Softshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Softsign_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Softsign_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Tanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Tanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Tanhshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_Tanhshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_TransformerDecoderLayer_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_TransformerDecoderLayer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_TransformerEncoderLayer_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_TransformerEncoderLayer_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_TransformerEncoder_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_TransformerEncoder_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ZeroPad1d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ZeroPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_non_contiguous_tensors_nn_ZeroPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_AdaptiveAvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_AdaptiveAvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_AdaptiveMaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_AdaptiveMaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_AdaptiveMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_AdaptiveMaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_AvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_BCELoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_BatchNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_BatchNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_BatchNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_BatchNorm2d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_BatchNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_BatchNorm3d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Bilinear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_CELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_CELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_CTCLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_CircularPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_CircularPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConstantPad1d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_repr_nn_ConstantPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConstantPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConstantPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Conv1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Conv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Conv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Conv2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Conv3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Conv3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConvTranspose1d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConvTranspose1d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConvTranspose2d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConvTranspose2d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConvTranspose2d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConvTranspose2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_CosineEmbeddingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_CosineEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_CrossEntropyLoss_cuda_float16, test/test_modules.py::TestModuleCUDA::test_repr_nn_CrossEntropyLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_ELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Embedding_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_FractionalMaxPool2d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_repr_nn_FractionalMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_FractionalMaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_GELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_GELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_GLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_GRUCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_GRU_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_GRU_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_GRU_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_GaussianNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_GroupNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Hardshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Hardtanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Hardtanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_HuberLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_InstanceNorm1d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_InstanceNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_InstanceNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_InstanceNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_InstanceNorm2d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_InstanceNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_InstanceNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_InstanceNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_InstanceNorm3d_train_mode_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_repr_nn_KLDivLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_KLDivLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_L1Loss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_LPPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_LPPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_LSTMCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_LSTMCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_LSTM_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_LazyConv1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_LazyConv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_LazyConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_LazyConvTranspose2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_LazyConvTranspose3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_LeakyReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_LeakyReLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Linear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_LocalResponseNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_LocalResponseNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_MSELoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_MarginRankingLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_MarginRankingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_MaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_MaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Mish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_MultiLabelMarginLoss_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_repr_nn_MultiMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_MultiheadAttention_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_MultiheadAttention_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_NLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_NLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_PoissonNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_RNNCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_RNN_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_RNN_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_RNN_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_RNN_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ReLU6_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ReflectionPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_ReflectionPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_ReplicationPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ReplicationPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ReplicationPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_SELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_SELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_SiLU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Sigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Sigmoid_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Softmax2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Softmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Softmin_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_repr_nn_Softplus_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Softsign_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Tanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Tanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Tanhshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Tanhshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_Threshold_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Threshold_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_TransformerDecoderLayer_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_TransformerDecoderLayer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_TransformerEncoderLayer_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_TransformerEncoder_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_Transformer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ZeroPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_repr_nn_ZeroPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ZeroPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_repr_nn_ZeroPad3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_AdaptiveAvgPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_AdaptiveAvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_AdaptiveAvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_AdaptiveMaxPool1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_AdaptiveMaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_AdaptiveMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_AdaptiveMaxPool3d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_save_load_nn_AvgPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_AvgPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_AvgPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_BCELoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_BCEWithLogitsLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_BCEWithLogitsLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_BatchNorm1d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_BatchNorm1d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_BatchNorm1d_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_BatchNorm1d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_BatchNorm2d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_BatchNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_BatchNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Bilinear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_CTCLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_CTCLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_CircularPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_CircularPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConstantPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConstantPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConstantPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConstantPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Conv3d_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConvTranspose1d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConvTranspose1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConvTranspose2d_cuda_complex128, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConvTranspose2d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConvTranspose2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConvTranspose3d_cuda_complex32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConvTranspose3d_cuda_complex64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ConvTranspose3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_CrossEntropyLoss_cuda_float16, test/test_modules.py::TestModuleCUDA::test_save_load_nn_CrossEntropyLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_FractionalMaxPool2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_FractionalMaxPool3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_FractionalMaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_GELU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_GELU_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_GLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_GRUCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_GRU_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_GRU_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_GaussianNLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_GroupNorm_cuda_bfloat16, test/test_modules.py::TestModuleCUDA::test_save_load_nn_GroupNorm_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_save_load_nn_Hardshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Hardswish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Hardswish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Hardtanh_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_HingeEmbeddingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_HuberLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_HuberLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_InstanceNorm1d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_InstanceNorm2d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_InstanceNorm2d_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_InstanceNorm3d_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_InstanceNorm3d_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_KLDivLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LPPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LSTMCell_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LSTM_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LSTM_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LSTM_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LayerNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LazyConv1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LazyConv1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LazyConv2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LazyConv2d_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_save_load_nn_LazyConvTranspose2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LazyConvTranspose3d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Linear_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Linear_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LocalResponseNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LocalResponseNorm_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LogSigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_LogSoftmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_MSELoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_MarginRankingLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_MaxPool1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_MaxPool2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_MaxPool3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Mish_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Mish_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_MultiLabelMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_MultiLabelSoftMarginLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_MultiMarginLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_MultiheadAttention_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_MultiheadAttention_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_NLLLoss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_PoissonNLLLoss_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_PoissonNLLLoss_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_save_load_nn_RMSNorm_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_RNNCell_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_RNN_eval_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_RNN_eval_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_RNN_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ReLU6_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ReLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ReflectionPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ReflectionPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ReflectionPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ReplicationPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ReplicationPad2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ReplicationPad3d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_SiLU_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Sigmoid_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Sigmoid_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_SmoothL1Loss_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Softmax2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Softmax2d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Softmax_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Softplus_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Softplus_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Softshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Softsign_cuda_float64, 
test/test_modules.py::TestModuleCUDA::test_save_load_nn_Tanh_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Tanhshrink_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Tanhshrink_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Threshold_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Threshold_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_TransformerDecoderLayer_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_TransformerEncoderLayer_train_mode_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_TransformerEncoder_train_mode_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_Transformer_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ZeroPad1d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ZeroPad1d_cuda_float64, test/test_modules.py::TestModuleCUDA::test_save_load_nn_ZeroPad2d_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_AdaptiveAvgPool1d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_AdaptiveAvgPool2d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_AdaptiveMaxPool1d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_AdaptiveMaxPool2d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_AvgPool1d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_AvgPool2d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_AvgPool2d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_AvgPool3d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_BCELoss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_BCELoss_swap_True_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_empty_nn_BCEWithLogitsLoss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_BCEWithLogitsLoss_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_BatchNorm1d_train_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_BatchNorm2d_train_mode_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_BatchNorm3d_eval_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_BatchNorm3d_train_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Bilinear_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_CTCLoss_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_CircularPad1d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_CircularPad1d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_CircularPad2d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_CircularPad3d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ConstantPad1d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ConstantPad1d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ConstantPad2d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Conv1d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Conv3d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Conv3d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ConvTranspose1d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ConvTranspose3d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_CosineEmbeddingLoss_swap_False_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_empty_nn_CrossEntropyLoss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ELU_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Embedding_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Embedding_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_GRUCell_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_GRU_eval_mode_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_GRU_eval_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_GRU_train_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_GaussianNLLLoss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_GaussianNLLLoss_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_GroupNorm_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Hardswish_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Hardtanh_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_HingeEmbeddingLoss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_HuberLoss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_HuberLoss_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_InstanceNorm1d_train_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_InstanceNorm2d_eval_mode_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_InstanceNorm2d_train_mode_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_InstanceNorm2d_train_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_InstanceNorm3d_eval_mode_swap_True_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_empty_nn_KLDivLoss_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_L1Loss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_L1Loss_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_LPPool1d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_LPPool2d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_LPPool2d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_LSTMCell_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_LSTM_eval_mode_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_LSTM_train_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_LayerNorm_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_LayerNorm_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Linear_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_LocalResponseNorm_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_LogSigmoid_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_MarginRankingLoss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_MaxPool1d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_MaxPool3d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_MaxPool3d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Mish_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_MultiLabelSoftMarginLoss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_MultiLabelSoftMarginLoss_swap_True_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_empty_nn_MultiMarginLoss_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_MultiheadAttention_eval_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_MultiheadAttention_train_mode_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_PReLU_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_PReLU_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_PoissonNLLLoss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_PoissonNLLLoss_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_RMSNorm_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_RNNCell_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_RNNCell_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_RNN_eval_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_RNN_train_mode_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ReLU6_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ReLU6_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ReLU_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ReLU_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ReflectionPad1d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ReflectionPad2d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ReflectionPad3d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ReplicationPad2d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_SELU_swap_False_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_empty_nn_SELU_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_SiLU_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_SoftMarginLoss_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Softmax2d_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Softmax_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Softmin_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Softmin_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Softplus_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Softshrink_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Tanh_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Tanhshrink_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Threshold_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_TransformerDecoderLayer_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_TransformerDecoderLayer_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_TransformerEncoderLayer_train_mode_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_TransformerEncoderLayer_train_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_TransformerEncoder_eval_mode_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Transformer_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_Transformer_swap_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ZeroPad2d_swap_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_empty_nn_ZeroPad3d_swap_False_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_AdaptiveAvgPool1d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AdaptiveAvgPool1d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AdaptiveAvgPool2d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AdaptiveAvgPool2d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AdaptiveAvgPool2d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AdaptiveAvgPool3d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AdaptiveMaxPool1d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AdaptiveMaxPool2d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AdaptiveMaxPool3d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AvgPool1d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AvgPool1d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AvgPool2d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_AvgPool3d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BCELoss_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BCELoss_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BCELoss_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BCEWithLogitsLoss_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BatchNorm1d_train_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BatchNorm2d_eval_mode_swap_False_set_grad_True_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_BatchNorm2d_train_mode_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BatchNorm2d_train_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BatchNorm3d_eval_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BatchNorm3d_eval_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BatchNorm3d_train_mode_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BatchNorm3d_train_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BatchNorm3d_train_mode_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_BatchNorm3d_train_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Bilinear_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Bilinear_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CELU_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CELU_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CELU_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CTCLoss_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CTCLoss_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CircularPad1d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CircularPad1d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CircularPad1d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CircularPad2d_swap_True_set_grad_False_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_CircularPad3d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CircularPad3d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CircularPad3d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConstantPad1d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConstantPad1d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConstantPad1d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConstantPad2d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConstantPad2d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConstantPad2d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConstantPad3d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConstantPad3d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConstantPad3d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConstantPad3d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Conv1d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Conv2d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Conv2d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Conv3d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConvTranspose1d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConvTranspose1d_swap_True_set_grad_False_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_ConvTranspose2d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ConvTranspose2d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CosineEmbeddingLoss_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CrossEntropyLoss_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_CrossEntropyLoss_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Embedding_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Embedding_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_FractionalMaxPool2d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_FractionalMaxPool2d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_FractionalMaxPool2d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_FractionalMaxPool3d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_GELU_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_GELU_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_GRUCell_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_GRU_eval_mode_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_GRU_eval_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_GRU_train_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_GRU_train_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_GaussianNLLLoss_swap_False_set_grad_True_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_GroupNorm_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Hardswish_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Hardswish_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Hardtanh_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Hardtanh_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_HingeEmbeddingLoss_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_HuberLoss_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm1d_eval_mode_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm1d_eval_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm1d_eval_mode_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm1d_eval_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm1d_train_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm2d_eval_mode_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm2d_eval_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm2d_eval_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm2d_train_mode_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm3d_eval_mode_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm3d_train_mode_swap_False_set_grad_False_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_InstanceNorm3d_train_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_KLDivLoss_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_KLDivLoss_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_KLDivLoss_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_L1Loss_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_L1Loss_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LPPool1d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LPPool2d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LPPool2d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LPPool3d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LPPool3d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LSTMCell_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LSTMCell_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LSTMCell_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LSTM_eval_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LSTM_eval_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LSTM_train_mode_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LSTM_train_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LayerNorm_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LayerNorm_swap_False_set_grad_True_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_LayerNorm_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LeakyReLU_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LeakyReLU_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LeakyReLU_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Linear_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Linear_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Linear_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LocalResponseNorm_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LocalResponseNorm_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LogSigmoid_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LogSigmoid_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LogSoftmax_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_LogSoftmax_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MSELoss_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MarginRankingLoss_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MarginRankingLoss_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MarginRankingLoss_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MaxPool1d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MaxPool1d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MaxPool1d_swap_True_set_grad_True_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_MaxPool2d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MaxPool2d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MaxPool2d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MaxPool3d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MaxPool3d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MaxPool3d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Mish_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MultiLabelMarginLoss_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MultiMarginLoss_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MultiMarginLoss_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MultiheadAttention_eval_mode_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_MultiheadAttention_train_mode_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_NLLLoss_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_NLLLoss_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_NLLLoss_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_PReLU_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_PReLU_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_PoissonNLLLoss_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_PoissonNLLLoss_swap_True_set_grad_True_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_RMSNorm_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_RMSNorm_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_RNNCell_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_RNNCell_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_RNN_eval_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_RNN_eval_mode_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_RNN_eval_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_RNN_train_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_RNN_train_mode_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReLU6_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReLU6_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReLU6_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReLU_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReLU_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReflectionPad1d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReflectionPad1d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReflectionPad2d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReflectionPad3d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReflectionPad3d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReplicationPad1d_swap_False_set_grad_True_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_ReplicationPad1d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReplicationPad2d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReplicationPad2d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReplicationPad3d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ReplicationPad3d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_SELU_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_SiLU_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_SiLU_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_SiLU_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_SiLU_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Sigmoid_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_SmoothL1Loss_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_SmoothL1Loss_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_SoftMarginLoss_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_SoftMarginLoss_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softmax2d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softmax2d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softmax_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softmax_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softmin_swap_True_set_grad_False_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_Softmin_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softplus_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softplus_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softplus_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softshrink_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softshrink_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softshrink_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softsign_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Softsign_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Tanh_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Tanh_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Tanh_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Tanhshrink_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Tanhshrink_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Threshold_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Threshold_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Threshold_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_TransformerDecoderLayer_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_TransformerEncoderLayer_eval_mode_swap_False_set_grad_True_cuda_float32, 
test/test_modules.py::TestModuleCUDA::test_to_nn_TransformerEncoderLayer_eval_mode_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_TransformerEncoderLayer_eval_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_TransformerEncoderLayer_train_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_TransformerEncoder_eval_mode_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_TransformerEncoder_train_mode_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_TransformerEncoder_train_mode_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_TransformerEncoder_train_mode_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Transformer_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_Transformer_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ZeroPad1d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ZeroPad1d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ZeroPad2d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ZeroPad2d_swap_False_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ZeroPad2d_swap_True_set_grad_True_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ZeroPad3d_swap_False_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ZeroPad3d_swap_True_set_grad_False_cuda_float32, test/test_modules.py::TestModuleCUDA::test_to_nn_ZeroPad3d_swap_True_set_grad_True_cuda_float32 2025-12-04T14:28:49.3821955Z 2025-12-04T14:28:49.3822069Z Finished test_modules 2/2 ... 
[2025-12-04 14:28:49.339349][2207991.800335194], took 7.44min
2025-12-04T14:28:49.3822485Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T14:28:49.3822842Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T14:28:49.3823061Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading
2025-12-04T14:28:49.3823242Z Uploading artifacts took 0.00 seconds
2025-12-04T14:28:49.3823414Z Running optim/test_swa_utils 1/1 ... [2025-12-04 14:28:49.346310][2207991.807298451]
2025-12-04T14:28:49.3823595Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T14:28:49.3823979Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'optim/test_swa_utils.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:28:49.346501]
2025-12-04T14:28:51.0898637Z
2025-12-04T14:28:51.0900379Z optim/test_swa_utils 1/1 was successful, full logs can be found in artifacts with path test/test-reports/optim.test_swa_utils_1.1_1240f2b497c29502_.log
2025-12-04T14:28:51.0901062Z
2025-12-04T14:28:51.0901380Z Finished optim/test_swa_utils 1/1 ... [2025-12-04 14:28:51.089530][2207993.550516367], took 0.03min
2025-12-04T14:28:51.0917342Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T14:28:51.0970645Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T14:28:51.0972269Z Running cpp_extensions/python_agnostic_extension/test/test_python_agnostic 1/1 ... [2025-12-04 14:28:51.096936][2207993.557924428]
2025-12-04T14:28:51.0972854Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T14:28:51.0973842Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'cpp_extensions/python_agnostic_extension/test/test_python_agnostic.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:28:51.097136]
2025-12-04T14:29:10.8939607Z
2025-12-04T14:29:10.8940680Z cpp_extensions/python_agnostic_extension/test/test_python_agnostic 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp_extensions.python_agnostic_extension.test.test_python_agnostic_1.1_cf07ca064046190f_.log
2025-12-04T14:29:10.8941402Z Running 1 items in this shard: test/cpp_extensions/python_agnostic_extension/test/test_python_agnostic.py::TestPythonAgnosticCUDA::test_extension_is_python_agnostic_cuda
2025-12-04T14:29:10.8941679Z
2025-12-04T14:29:10.8941860Z Finished cpp_extensions/python_agnostic_extension/test/test_python_agnostic 1/1 ... [2025-12-04 14:29:10.893741][2208013.354728366], took 0.33min
2025-12-04T14:29:10.8953802Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T14:29:10.9005060Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T14:29:10.9007267Z Running functorch/test_memory_efficient_fusion 1/1 ... [2025-12-04 14:29:10.900628][2208013.361615243]
2025-12-04T14:29:10.9007477Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T14:29:10.9009163Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'functorch/test_memory_efficient_fusion.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:29:10.900826]
2025-12-04T14:29:18.3281116Z
2025-12-04T14:29:18.3281808Z functorch/test_memory_efficient_fusion 1/1 was successful, full logs can be found in artifacts with path test/test-reports/functorch.test_memory_efficient_fusion_1.1_35fa83dcec97be01_.log
2025-12-04T14:29:18.3285510Z Running 22 items in this shard: test/functorch/test_memory_efficient_fusion.py::TestMemoryEfficientOpAuthoring::test_gelu_bias, test/functorch/test_memory_efficient_fusion.py::TestMemoryEfficientOpAuthoring::test_hard_sigmoid, test/functorch/test_memory_efficient_fusion.py::TestMemoryEfficientOpAuthoring::test_hard_swish, test/functorch/test_memory_efficient_fusion.py::TestMemoryEfficientOpAuthoring::test_layer_norm, test/functorch/test_memory_efficient_fusion.py::TestMemoryEfficientOpAuthoring::test_mish, test/functorch/test_memory_efficient_fusion.py::TestMemoryEfficientOpAuthoring::test_rmsnorm, test/functorch/test_memory_efficient_fusion.py::TestMemoryEfficientOpAuthoring::test_swish, test/functorch/test_memory_efficient_fusion.py::NoChangeTestCase::test_empty, test/functorch/test_memory_efficient_fusion.py::NoChangeTestCase::test_hash_with_numbers, test/functorch/test_memory_efficient_fusion.py::NoChangeTestCase::test_nochange, test/functorch/test_memory_efficient_fusion.py::NoChangeTestCase::test_rand_like, test/functorch/test_memory_efficient_fusion.py::NoChangeTestCase::test_rand_n, test/functorch/test_memory_efficient_fusion.py::ReduceTestCase::test_immutable_list_multiple_entries, test/functorch/test_memory_efficient_fusion.py::ReduceTestCase::test_immutable_list_type, test/functorch/test_memory_efficient_fusion.py::ReduceTestCase::test_kwarg, test/functorch/test_memory_efficient_fusion.py::ReduceTestCase::test_nested_immutable_list_type, test/functorch/test_memory_efficient_fusion.py::ReduceTestCase::test_simple, test/functorch/test_memory_efficient_fusion.py::ReduceTestCase::test_simple_2, test/functorch/test_memory_efficient_fusion.py::ReduceTestCase::test_simple_multiple_same_ops, test/functorch/test_memory_efficient_fusion.py::ReduceTestCase::test_two_args, test/functorch/test_memory_efficient_fusion.py::ReduceTestCase::test_two_args_default, test/functorch/test_memory_efficient_fusion.py::RandomOpTestCase::test_random
2025-12-04T14:29:18.3288412Z
2025-12-04T14:29:18.3288564Z Finished functorch/test_memory_efficient_fusion 1/1 ... [2025-12-04 14:29:18.327706][2208020.788692864], took 0.12min
2025-12-04T14:29:18.3295233Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T14:29:18.3345938Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T14:29:18.3348361Z Running torch_np/numpy_tests/lib/test_histograms 1/1 ... [2025-12-04 14:29:18.334703][2208020.79569035]
2025-12-04T14:29:18.3348592Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T14:29:18.3350530Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/numpy_tests/lib/test_histograms.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 14:29:18.334914] 2025-12-04T14:29:22.4088357Z 2025-12-04T14:29:22.4089366Z torch_np/numpy_tests/lib/test_histograms 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.numpy_tests.lib.test_histograms_1.1_7e0ace14d958afad_.log 2025-12-04T14:29:22.4099870Z Running 60 items in this shard: test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_arr_weights_mismatch, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_big_arrays, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_bin_array_dims, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_bin_edge_cases, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_bool_conversion, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_density, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_empty, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_error_binnum_type, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_exotic_weights, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_f32_rounding, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_finite_range, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_histogram_bin_edges, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_invalid_range, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_last_bin_inclusive_range, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_no_side_effects, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_object_array_of_0d, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_one_bin, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_outliers, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_precision, 
test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_signed_overflow_bounds, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_signed_overflow_bounds_2, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_simple, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_some_nan_values, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_type, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_unsigned_monotonicity_check, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogram::test_weights, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_empty, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_incorrect_methods, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_limited_variance, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_novariance, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_outlier, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_scott_vs_stone, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_signed_integer_data_bins_auto, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_signed_integer_data_bins_doane, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_signed_integer_data_bins_fd, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_signed_integer_data_bins_rice, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_signed_integer_data_bins_scott, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_signed_integer_data_bins_stone, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_signed_integer_data_bins_sturges, 
test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_simple, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_simple_range, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_simple_weighted, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramOptimBinNums::test_small, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_bins_array, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_bins_error_2, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_bins_errors, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_density_non_uniform_1d, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_density_non_uniform_2d, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_edge_dtype, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_empty, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_equal_edges, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_finite_range, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_identical_samples, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_inf_edges, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_large_integers, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_rightmost_binedge, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_shape_3d, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_shape_4d, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_simple, test/torch_np/numpy_tests/lib/test_histograms.py::TestHistogramdd::test_weights 2025-12-04T14:29:22.4108366Z 2025-12-04T14:29:22.4108510Z Finished torch_np/numpy_tests/lib/test_histograms 1/1 ... 
[2025-12-04 14:29:22.408510][2208024.869496849], took 0.07min 2025-12-04T14:29:22.4108943Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:29:22.4154593Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:29:22.4156677Z Running torch_np/test_indexing 1/1 ... [2025-12-04 14:29:22.415431][2208024.876418015] 2025-12-04T14:29:22.4157082Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:29:22.4158044Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/test_indexing.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:29:22.415637] 2025-12-04T14:29:27.5000275Z 2025-12-04T14:29:27.5001067Z torch_np/test_indexing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.test_indexing_1.1_917337c0008f3a5e_.log 2025-12-04T14:29:27.5002513Z Running 5 items in this shard: test/torch_np/test_indexing.py::TestAdvancedIndexing::test_advanced_separation_patterns, test/torch_np/test_indexing.py::TestAdvancedIndexing::test_broadcast_and_numpy_compatibility, test/torch_np/test_indexing.py::TestAdvancedIndexing::test_comprehensive_indexing, test/torch_np/test_indexing.py::TestAdvancedIndexing::test_ellipsis, test/torch_np/test_indexing.py::TestAdvancedIndexing::test_special_index_types 2025-12-04T14:29:27.5003190Z 2025-12-04T14:29:27.5003313Z Finished torch_np/test_indexing 1/1 ... 
[2025-12-04 14:29:27.499746][2208029.960733613], took 0.08min 2025-12-04T14:29:27.5014781Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:29:27.5066459Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:29:27.5066831Z Running test_tensorboard 1/1 ... [2025-12-04 14:29:27.506589][2208029.967578341] 2025-12-04T14:29:27.5067037Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:29:27.5069440Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_tensorboard.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:29:27.506781] 2025-12-04T14:31:18.5848745Z 2025-12-04T14:31:18.5849827Z test_tensorboard 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_tensorboard_1.1_364706f7da824ca3_.log 2025-12-04T14:31:18.5860721Z Running 50 items in this shard: test/test_tensorboard.py::TestTensorBoardPyTorchNumpy::test_pytorch_autograd_np, test/test_tensorboard.py::TestTensorBoardPyTorchNumpy::test_pytorch_histogram, test/test_tensorboard.py::TestTensorBoardPyTorchNumpy::test_pytorch_histogram_raw, test/test_tensorboard.py::TestTensorBoardPyTorchNumpy::test_pytorch_np, test/test_tensorboard.py::TestTensorBoardPyTorchNumpy::test_pytorch_write, test/test_tensorboard.py::TestTensorBoardUtils::test_convert_to_HWC_dtype_remains_same, test/test_tensorboard.py::TestTensorBoardUtils::test_numpy_vid_uint8, test/test_tensorboard.py::TestTensorBoardUtils::test_prepare_video, test/test_tensorboard.py::TestTensorBoardUtils::test_to_HWC, test/test_tensorboard.py::TestTensorBoardWriter::test_writer, test/test_tensorboard.py::TestTensorBoardSummaryWriter::test_pathlib, test/test_tensorboard.py::TestTensorBoardSummaryWriter::test_summary_writer_close, 
test/test_tensorboard.py::TestTensorBoardSummaryWriter::test_summary_writer_ctx, test/test_tensorboard.py::TestTensorBoardEmbedding::test_embedding, test/test_tensorboard.py::TestTensorBoardEmbedding::test_embedding_64, test/test_tensorboard.py::TestTensorBoardSummary::test_audio, test/test_tensorboard.py::TestTensorBoardSummary::test_custom_scalars, test/test_tensorboard.py::TestTensorBoardSummary::test_empty_input, test/test_tensorboard.py::TestTensorBoardSummary::test_float32_image, test/test_tensorboard.py::TestTensorBoardSummary::test_histogram_auto, test/test_tensorboard.py::TestTensorBoardSummary::test_histogram_doane, test/test_tensorboard.py::TestTensorBoardSummary::test_histogram_fd, test/test_tensorboard.py::TestTensorBoardSummary::test_image_with_3_channel_batched, test/test_tensorboard.py::TestTensorBoardSummary::test_image_with_boxes, test/test_tensorboard.py::TestTensorBoardSummary::test_image_with_one_channel, test/test_tensorboard.py::TestTensorBoardSummary::test_image_with_one_channel_batched, test/test_tensorboard.py::TestTensorBoardSummary::test_image_without_channel, test/test_tensorboard.py::TestTensorBoardSummary::test_list_input, test/test_tensorboard.py::TestTensorBoardSummary::test_mesh, test/test_tensorboard.py::TestTensorBoardSummary::test_scalar_new_style, test/test_tensorboard.py::TestTensorBoardSummary::test_text, test/test_tensorboard.py::TestTensorBoardSummary::test_uint8_image, test/test_tensorboard.py::TestTensorBoardSummary::test_video, test/test_tensorboard.py::TestTensorBoardPytorchGraph::test_mlp_graph, test/test_tensorboard.py::TestTensorBoardPytorchGraph::test_nested_nn_squential, test/test_tensorboard.py::TestTensorBoardPytorchGraph::test_pytorch_graph, test/test_tensorboard.py::TestTensorBoardPytorchGraph::test_pytorch_graph_dict_input, test/test_tensorboard.py::TestTensorBoardPytorchGraph::test_torchvision_smoke, test/test_tensorboard.py::TestTensorBoardPytorchGraph::test_wrong_input_size, 
test/test_tensorboard.py::TestTensorBoardFigure::test_figure, test/test_tensorboard.py::TestTensorBoardFigure::test_figure_list, test/test_tensorboard.py::TestTensorBoardNumpy::test_pytorch_np_expect_fail, test/test_tensorboard.py::TestTensorBoardNumpy::test_scalar, test/test_tensorboard.py::TestTensorProtoSummary::test_complex_tensor_proto, test/test_tensorboard.py::TestTensorProtoSummary::test_empty_tensor_proto, test/test_tensorboard.py::TestTensorProtoSummary::test_float_tensor_proto, test/test_tensorboard.py::TestTensorProtoSummary::test_half_tensor_proto_bfloat16_proto_type_14, test/test_tensorboard.py::TestTensorProtoSummary::test_half_tensor_proto_float16_proto_type_19, test/test_tensorboard.py::TestTensorProtoSummary::test_int_tensor_proto, test/test_tensorboard.py::TestTensorProtoSummary::test_scalar_tensor_proto 2025-12-04T14:31:18.5867789Z 2025-12-04T14:31:18.5867914Z Finished test_tensorboard 1/1 ... [2025-12-04 14:31:18.584374][2208141.045361197], took 1.85min 2025-12-04T14:31:18.5868392Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:31:18.5911434Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:31:18.5913390Z Running test_numba_integration 1/1 ... [2025-12-04 14:31:18.591181][2208141.052169825] 2025-12-04T14:31:18.5913602Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:31:18.5915945Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_numba_integration.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 14:31:18.591369] 2025-12-04T14:31:21.0096801Z 2025-12-04T14:31:21.0097624Z test_numba_integration 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_numba_integration_1.1_b71b4848a3d29d6d_.log 2025-12-04T14:31:21.0099413Z Running 8 items in this shard: test/test_numba_integration.py::TestNumbaIntegration::test_active_device, test/test_numba_integration.py::TestNumbaIntegration::test_array_adaptor, test/test_numba_integration.py::TestNumbaIntegration::test_conversion_errors, test/test_numba_integration.py::TestNumbaIntegration::test_cuda_array_interface, test/test_numba_integration.py::TestNumbaIntegration::test_from_cuda_array_interface, test/test_numba_integration.py::TestNumbaIntegration::test_from_cuda_array_interface_active_device, test/test_numba_integration.py::TestNumbaIntegration::test_from_cuda_array_interface_inferred_strides, test/test_numba_integration.py::TestNumbaIntegration::test_from_cuda_array_interface_lifetime 2025-12-04T14:31:21.0100601Z 2025-12-04T14:31:21.0100722Z Finished test_numba_integration 1/1 ... [2025-12-04 14:31:21.009284][2208143.470270077], took 0.04min 2025-12-04T14:31:21.0117203Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:31:21.0168085Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:31:21.0170082Z Running test_functional_optim 1/1 ... [2025-12-04 14:31:21.016853][2208143.477842175] 2025-12-04T14:31:21.0170320Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:31:21.0171796Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_functional_optim.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 14:31:21.017046] 2025-12-04T14:31:23.3856329Z 2025-12-04T14:31:23.3856657Z test_functional_optim 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_functional_optim_1.1_e8f46939e7bbb614_.log 2025-12-04T14:31:23.3857493Z Running 4 items in this shard: test/test_functional_optim.py::TestFunctionalOptimParity::test_functional_optim_parity_adam, test/test_functional_optim.py::TestFunctionalOptimParity::test_functional_optim_parity_adam_w, test/test_functional_optim.py::TestFunctionalOptimParity::test_functional_optim_parity_sgd, test/test_functional_optim.py::TestFunctionalOptimParity::test_functional_optim_registration 2025-12-04T14:31:23.3858078Z 2025-12-04T14:31:23.3858209Z Finished test_functional_optim 1/1 ... [2025-12-04 14:31:23.385300][2208145.846285877], took 0.04min 2025-12-04T14:31:23.3878467Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:31:23.3930705Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:31:23.3930961Z Running test_maskedtensor 1/1 ... [2025-12-04 14:31:23.392998][2208145.853987133] 2025-12-04T14:31:23.3931153Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:31:23.3933681Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_maskedtensor.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 14:31:23.393191] 2025-12-04T14:33:03.8636337Z 2025-12-04T14:33:03.8637161Z test_maskedtensor 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_maskedtensor_1.1_94e10b59046405b7_.log 2025-12-04T14:33:03.8749525Z Running 958 items in this shard: test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn0, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn1, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn10, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn11, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn12, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn13, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn14, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn15, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn16, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn17, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn18, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn19, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn2, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn20, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn21, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn22, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn23, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn24, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn25, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn26, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn27, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn28, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn29, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn3, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn30, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn31, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn32, 
test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn33, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn34, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn35, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn36, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn37, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn38, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn39, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn4, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn40, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn41, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn42, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn43, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn44, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn45, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn46, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn47, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn48, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn49, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn5, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn50, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn51, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn52, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn53, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn54, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn55, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn56, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn57, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn6, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn7, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn8, test/test_maskedtensor.py::TestUnary::test_inplace_unary_fn9, 
test/test_maskedtensor.py::TestUnary::test_unary_fn0, test/test_maskedtensor.py::TestUnary::test_unary_fn1, test/test_maskedtensor.py::TestUnary::test_unary_fn10, test/test_maskedtensor.py::TestUnary::test_unary_fn11, test/test_maskedtensor.py::TestUnary::test_unary_fn12, test/test_maskedtensor.py::TestUnary::test_unary_fn13, test/test_maskedtensor.py::TestUnary::test_unary_fn14, test/test_maskedtensor.py::TestUnary::test_unary_fn15, test/test_maskedtensor.py::TestUnary::test_unary_fn16, test/test_maskedtensor.py::TestUnary::test_unary_fn17, test/test_maskedtensor.py::TestUnary::test_unary_fn18, test/test_maskedtensor.py::TestUnary::test_unary_fn19, test/test_maskedtensor.py::TestUnary::test_unary_fn2, test/test_maskedtensor.py::TestUnary::test_unary_fn20, test/test_maskedtensor.py::TestUnary::test_unary_fn21, test/test_maskedtensor.py::TestUnary::test_unary_fn22, test/test_maskedtensor.py::TestUnary::test_unary_fn23, test/test_maskedtensor.py::TestUnary::test_unary_fn24, test/test_maskedtensor.py::TestUnary::test_unary_fn25, test/test_maskedtensor.py::TestUnary::test_unary_fn26, test/test_maskedtensor.py::TestUnary::test_unary_fn27, test/test_maskedtensor.py::TestUnary::test_unary_fn28, test/test_maskedtensor.py::TestUnary::test_unary_fn29, test/test_maskedtensor.py::TestUnary::test_unary_fn3, test/test_maskedtensor.py::TestUnary::test_unary_fn30, test/test_maskedtensor.py::TestUnary::test_unary_fn31, test/test_maskedtensor.py::TestUnary::test_unary_fn32, test/test_maskedtensor.py::TestUnary::test_unary_fn33, test/test_maskedtensor.py::TestUnary::test_unary_fn34, test/test_maskedtensor.py::TestUnary::test_unary_fn35, test/test_maskedtensor.py::TestUnary::test_unary_fn36, test/test_maskedtensor.py::TestUnary::test_unary_fn37, test/test_maskedtensor.py::TestUnary::test_unary_fn38, test/test_maskedtensor.py::TestUnary::test_unary_fn39, test/test_maskedtensor.py::TestUnary::test_unary_fn4, test/test_maskedtensor.py::TestUnary::test_unary_fn40, 
test/test_maskedtensor.py::TestUnary::test_unary_fn41, test/test_maskedtensor.py::TestUnary::test_unary_fn42, test/test_maskedtensor.py::TestUnary::test_unary_fn43, test/test_maskedtensor.py::TestUnary::test_unary_fn44, test/test_maskedtensor.py::TestUnary::test_unary_fn45, test/test_maskedtensor.py::TestUnary::test_unary_fn46, test/test_maskedtensor.py::TestUnary::test_unary_fn47, test/test_maskedtensor.py::TestUnary::test_unary_fn48, test/test_maskedtensor.py::TestUnary::test_unary_fn49, test/test_maskedtensor.py::TestUnary::test_unary_fn5, test/test_maskedtensor.py::TestUnary::test_unary_fn50, test/test_maskedtensor.py::TestUnary::test_unary_fn51, test/test_maskedtensor.py::TestUnary::test_unary_fn52, test/test_maskedtensor.py::TestUnary::test_unary_fn53, test/test_maskedtensor.py::TestUnary::test_unary_fn54, test/test_maskedtensor.py::TestUnary::test_unary_fn55, test/test_maskedtensor.py::TestUnary::test_unary_fn56, test/test_maskedtensor.py::TestUnary::test_unary_fn57, test/test_maskedtensor.py::TestUnary::test_unary_fn58, test/test_maskedtensor.py::TestUnary::test_unary_fn59, test/test_maskedtensor.py::TestUnary::test_unary_fn6, test/test_maskedtensor.py::TestUnary::test_unary_fn60, test/test_maskedtensor.py::TestUnary::test_unary_fn61, test/test_maskedtensor.py::TestUnary::test_unary_fn7, test/test_maskedtensor.py::TestUnary::test_unary_fn8, test/test_maskedtensor.py::TestUnary::test_unary_fn9, test/test_maskedtensor.py::TestBinary::test_binary_fn0, test/test_maskedtensor.py::TestBinary::test_binary_fn1, test/test_maskedtensor.py::TestBinary::test_binary_fn10, test/test_maskedtensor.py::TestBinary::test_binary_fn11, test/test_maskedtensor.py::TestBinary::test_binary_fn12, test/test_maskedtensor.py::TestBinary::test_binary_fn13, test/test_maskedtensor.py::TestBinary::test_binary_fn14, test/test_maskedtensor.py::TestBinary::test_binary_fn15, test/test_maskedtensor.py::TestBinary::test_binary_fn16, test/test_maskedtensor.py::TestBinary::test_binary_fn17, 
test/test_maskedtensor.py::TestBinary::test_binary_fn18, test/test_maskedtensor.py::TestBinary::test_binary_fn19, test/test_maskedtensor.py::TestBinary::test_binary_fn2, test/test_maskedtensor.py::TestBinary::test_binary_fn20, test/test_maskedtensor.py::TestBinary::test_binary_fn21, test/test_maskedtensor.py::TestBinary::test_binary_fn22, test/test_maskedtensor.py::TestBinary::test_binary_fn23, test/test_maskedtensor.py::TestBinary::test_binary_fn24, test/test_maskedtensor.py::TestBinary::test_binary_fn25, test/test_maskedtensor.py::TestBinary::test_binary_fn26, test/test_maskedtensor.py::TestBinary::test_binary_fn27, test/test_maskedtensor.py::TestBinary::test_binary_fn28, test/test_maskedtensor.py::TestBinary::test_binary_fn29, test/test_maskedtensor.py::TestBinary::test_binary_fn3, test/test_maskedtensor.py::TestBinary::test_binary_fn30, test/test_maskedtensor.py::TestBinary::test_binary_fn31, test/test_maskedtensor.py::TestBinary::test_binary_fn32, test/test_maskedtensor.py::TestBinary::test_binary_fn33, test/test_maskedtensor.py::TestBinary::test_binary_fn34, test/test_maskedtensor.py::TestBinary::test_binary_fn35, test/test_maskedtensor.py::TestBinary::test_binary_fn4, test/test_maskedtensor.py::TestBinary::test_binary_fn5, test/test_maskedtensor.py::TestBinary::test_binary_fn6, test/test_maskedtensor.py::TestBinary::test_binary_fn7, test/test_maskedtensor.py::TestBinary::test_binary_fn8, test/test_maskedtensor.py::TestBinary::test_binary_fn9, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn0, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn1, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn10, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn11, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn12, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn13, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn14, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn15, 
test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn16, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn17, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn18, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn19, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn2, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn20, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn21, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn22, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn23, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn24, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn25, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn26, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn27, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn28, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn29, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn3, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn4, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn5, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn6, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn7, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn8, test/test_maskedtensor.py::TestBinary::test_inplace_binary_fn9, test/test_maskedtensor.py::TestBinary::test_masks_match_fn_name_add, test/test_maskedtensor.py::TestBinary::test_masks_match_fn_name_add_, test/test_maskedtensor.py::TestReductions::test__is_any_true, test/test_maskedtensor.py::TestReductions::test__is_any_true_false, test/test_maskedtensor.py::TestReductions::test_all, test/test_maskedtensor.py::TestReductions::test_amax, test/test_maskedtensor.py::TestReductions::test_amax_grad, test/test_maskedtensor.py::TestReductions::test_amin, test/test_maskedtensor.py::TestReductions::test_amin_grad, 
test/test_maskedtensor.py::TestReductions::test_any_true_dtype, test/test_maskedtensor.py::TestReductions::test_backward, test/test_maskedtensor.py::TestReductions::test_grad_dtype, test/test_maskedtensor.py::TestReductions::test_max_not_implemented, test/test_maskedtensor.py::TestReductions::test_mean, test/test_maskedtensor.py::TestReductions::test_mean_dim_grad, test/test_maskedtensor.py::TestReductions::test_mean_grad_case_1a, test/test_maskedtensor.py::TestReductions::test_mean_grad_case_1b, test/test_maskedtensor.py::TestReductions::test_mean_grad_case_1c, test/test_maskedtensor.py::TestReductions::test_mean_grad_case_1d, test/test_maskedtensor.py::TestReductions::test_mean_grad_case_1e, test/test_maskedtensor.py::TestReductions::test_mean_grad_case_1f, test/test_maskedtensor.py::TestReductions::test_prod, test/test_maskedtensor.py::TestReductions::test_prod_grad, test/test_maskedtensor.py::TestReductions::test_sum, test/test_maskedtensor.py::TestReductions::test_sum_grad, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_add_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_add_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_add_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_add_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_add_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_add_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_add_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_add_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_add_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_atan2_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_atan2_layout0_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_atan2_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_atan2_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_atan2_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_atan2_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_atan2_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_atan2_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_atan2_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_floor_rounding_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_floor_rounding_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_floor_rounding_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_floor_rounding_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_floor_rounding_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_floor_rounding_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_floor_rounding_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_floor_rounding_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_floor_rounding_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_no_rounding_mode_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_no_rounding_mode_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_no_rounding_mode_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_no_rounding_mode_layout1_cuda_float16, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_no_rounding_mode_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_no_rounding_mode_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_no_rounding_mode_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_no_rounding_mode_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_no_rounding_mode_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_trunc_rounding_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_trunc_rounding_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_trunc_rounding_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_trunc_rounding_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_trunc_rounding_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_trunc_rounding_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_trunc_rounding_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_trunc_rounding_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_div_trunc_rounding_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_eq_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_eq_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_eq_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_eq_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_eq_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_eq_layout1_cuda_float64, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_eq_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_eq_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_eq_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_floor_divide_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_floor_divide_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_floor_divide_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_floor_divide_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_floor_divide_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_floor_divide_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_floor_divide_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_floor_divide_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_floor_divide_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmax_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmax_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmax_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmax_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmax_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmax_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmax_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmax_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmax_layout2_cuda_float64, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmin_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmin_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmin_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmin_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmin_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmin_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmin_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmin_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmin_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmod_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmod_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmod_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmod_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmod_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmod_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmod_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmod_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_fmod_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ge_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ge_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ge_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ge_layout1_cuda_float16, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ge_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ge_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ge_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ge_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ge_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_gt_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_gt_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_gt_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_gt_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_gt_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_gt_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_gt_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_gt_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_gt_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_le_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_le_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_le_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_le_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_le_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_le_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_le_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_le_layout2_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_le_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_logaddexp_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_logaddexp_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_logaddexp_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_logaddexp_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_logaddexp_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_logaddexp_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_logaddexp_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_logaddexp_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_logaddexp_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_lt_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_lt_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_lt_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_lt_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_lt_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_lt_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_lt_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_lt_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_lt_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_maximum_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_maximum_layout0_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_maximum_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_maximum_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_maximum_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_maximum_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_maximum_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_maximum_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_maximum_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_minimum_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_minimum_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_minimum_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_minimum_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_minimum_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_minimum_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_minimum_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_minimum_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_minimum_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_mul_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_mul_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_mul_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_mul_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_mul_layout1_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_mul_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_mul_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_mul_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_mul_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ne_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ne_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ne_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ne_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ne_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ne_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ne_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ne_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_ne_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_nextafter_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_nextafter_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_nextafter_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_nextafter_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_nextafter_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_nextafter_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_nextafter_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_nextafter_layout2_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_nextafter_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_remainder_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_remainder_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_remainder_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_remainder_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_remainder_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_remainder_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_remainder_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_remainder_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_remainder_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_sub_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_sub_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_sub_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_sub_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_sub_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_sub_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_sub_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_sub_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_sub_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_true_divide_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_true_divide_layout0_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_true_divide_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_true_divide_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_true_divide_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_true_divide_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_true_divide_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_true_divide_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_binary_core_true_divide_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amax_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amax_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amax_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amax_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amax_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amax_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amax_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amax_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amax_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amin_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amin_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amin_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amin_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amin_layout1_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amin_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amin_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amin_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_amin_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmax_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmax_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmax_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmax_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmax_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmax_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmax_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmax_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmax_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmin_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmin_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmin_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmin_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmin_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmin_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmin_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmin_layout2_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_argmin_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_prod_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_prod_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_prod_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_prod_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_prod_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_prod_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_prod_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_prod_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_prod_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_sum_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_sum_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_sum_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_sum_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_sum_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_sum_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_sum_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_sum_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_reduction_all_sum_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_abs_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_abs_layout0_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_abs_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_abs_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_abs_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_abs_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_abs_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_abs_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_abs_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acos_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acos_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acos_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acos_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acos_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acos_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acos_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acos_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acos_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acosh_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acosh_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acosh_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acosh_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acosh_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acosh_layout1_cuda_float64, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acosh_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acosh_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_acosh_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_angle_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_angle_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_angle_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_angle_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_angle_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_angle_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asin_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asin_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asin_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asin_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asin_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asin_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asin_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asin_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asin_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asinh_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asinh_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asinh_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asinh_layout1_cuda_float16, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asinh_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asinh_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asinh_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asinh_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_asinh_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atan_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atan_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atan_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atan_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atan_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atan_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atan_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atan_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atan_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atanh_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atanh_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atanh_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atanh_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atanh_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atanh_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atanh_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atanh_layout2_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_atanh_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_ceil_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_ceil_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_ceil_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_ceil_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_ceil_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_ceil_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_ceil_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_ceil_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_ceil_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_conj_physical_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_conj_physical_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_conj_physical_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_conj_physical_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_conj_physical_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_conj_physical_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_conj_physical_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_conj_physical_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_conj_physical_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cos_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cos_layout0_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cos_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cos_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cos_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cos_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cos_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cos_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cos_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cosh_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cosh_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cosh_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cosh_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cosh_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cosh_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cosh_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cosh_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_cosh_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_deg2rad_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_deg2rad_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_deg2rad_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_deg2rad_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_deg2rad_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_deg2rad_layout1_cuda_float64, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_deg2rad_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_deg2rad_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_deg2rad_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_digamma_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_digamma_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_digamma_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_digamma_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_digamma_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_digamma_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_digamma_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_digamma_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_digamma_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erf_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erf_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erf_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erf_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erf_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erf_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erf_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erf_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erf_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfc_layout0_cuda_float16, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfc_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfc_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfc_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfc_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfc_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfc_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfc_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfc_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfinv_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfinv_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfinv_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfinv_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfinv_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfinv_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfinv_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfinv_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_erfinv_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp2_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp2_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp2_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp2_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp2_layout1_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp2_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp2_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp2_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp2_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_exp_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_expm1_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_expm1_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_expm1_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_expm1_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_expm1_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_expm1_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_expm1_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_expm1_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_expm1_layout2_cuda_float64, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_floor_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_floor_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_floor_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_floor_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_floor_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_floor_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_floor_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_floor_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_floor_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_frac_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_frac_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_frac_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_frac_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_frac_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_frac_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_frac_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_frac_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_frac_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_i0_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_i0_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_i0_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_i0_layout1_cuda_float16, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_i0_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_i0_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_i0_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_i0_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_i0_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_isnan_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_isnan_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_isnan_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_isnan_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_isnan_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_isnan_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_isnan_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_isnan_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_isnan_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_lgamma_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_lgamma_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_lgamma_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_lgamma_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_lgamma_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_lgamma_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_lgamma_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_lgamma_layout2_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_lgamma_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log10_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log10_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log10_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log10_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log10_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log10_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log10_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log10_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log10_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log1p_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log1p_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log1p_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log1p_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log1p_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log1p_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log1p_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log1p_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log1p_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log2_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log2_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log2_layout0_cuda_float64, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log2_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log2_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log2_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log2_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log2_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log2_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_log_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_logit_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_logit_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_logit_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_logit_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_logit_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_logit_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_logit_layout2_cuda_float16, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_logit_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_logit_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_nan_to_num_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_nan_to_num_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_nan_to_num_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_nan_to_num_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_nan_to_num_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_nan_to_num_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_nan_to_num_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_nan_to_num_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_nan_to_num_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_neg_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_neg_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_neg_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_neg_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_neg_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_neg_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_neg_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_neg_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_neg_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_positive_layout0_cuda_float16, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_positive_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_positive_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_positive_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_positive_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_positive_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_positive_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_positive_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_positive_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rad2deg_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rad2deg_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rad2deg_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rad2deg_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rad2deg_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rad2deg_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rad2deg_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rad2deg_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rad2deg_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_reciprocal_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_reciprocal_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_reciprocal_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_reciprocal_layout1_cuda_float16, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_reciprocal_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_reciprocal_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_reciprocal_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_reciprocal_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_reciprocal_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_0_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_0_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_0_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_0_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_0_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_0_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_0_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_0_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_0_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_3_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_3_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_3_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_3_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_3_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_3_layout1_cuda_float64, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_3_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_3_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_3_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_neg_3_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_neg_3_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_neg_3_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_neg_3_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_neg_3_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_neg_3_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_neg_3_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_neg_3_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_decimals_neg_3_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_layout2_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_round_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rsqrt_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rsqrt_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rsqrt_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rsqrt_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rsqrt_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rsqrt_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rsqrt_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rsqrt_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_rsqrt_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sgn_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sgn_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sgn_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sgn_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sgn_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sgn_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sgn_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sgn_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sgn_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sigmoid_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sigmoid_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sigmoid_layout0_cuda_float64, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sigmoid_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sigmoid_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sigmoid_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sigmoid_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sigmoid_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sigmoid_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sign_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sign_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sign_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sign_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sign_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sign_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sign_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sign_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sign_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_signbit_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_signbit_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_signbit_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_signbit_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_signbit_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_signbit_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_signbit_layout2_cuda_float16, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_signbit_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_signbit_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sin_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sin_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sin_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sin_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sin_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sin_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sin_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sin_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sin_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinc_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinc_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinc_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinc_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinc_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinc_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinc_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinc_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinc_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinh_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinh_layout0_cuda_float32, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinh_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinh_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinh_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinh_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinh_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinh_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sinh_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sqrt_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sqrt_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sqrt_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sqrt_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sqrt_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sqrt_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sqrt_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sqrt_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_sqrt_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_square_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_square_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_square_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_square_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_square_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_square_layout1_cuda_float64, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_square_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_square_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_square_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tan_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tan_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tan_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tan_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tan_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tan_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tan_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tan_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tan_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tanh_layout0_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tanh_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tanh_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tanh_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tanh_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tanh_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tanh_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tanh_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_tanh_layout2_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_trunc_layout0_cuda_float16, 
test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_trunc_layout0_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_trunc_layout0_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_trunc_layout1_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_trunc_layout1_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_trunc_layout1_cuda_float64, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_trunc_layout2_cuda_float16, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_trunc_layout2_cuda_float32, test/test_maskedtensor.py::TestOperatorsCUDA::test_unary_core_trunc_layout2_cuda_float64, test/test_maskedtensor.py::TestBasicsCUDA::test_add_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_contiguous_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_diff_dim_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_diff_layouts_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_diff_sizes_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_grad_warning_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_invalid_sparse_coo_values_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_invalid_sparse_csr_values_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_invalid_sparse_layout_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_invalid_tensor_inputs_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_nn_unfold_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_softmax_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_stack_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_to_dense_and_sparse_coo_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_to_dense_and_sparse_csr_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_to_dense_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_to_device_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_to_dtype_cuda, 
test/test_maskedtensor.py::TestBasicsCUDA::test_to_sparse_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_unfold_cuda, test/test_maskedtensor.py::TestBasicsCUDA::test_where_cuda 2025-12-04T14:33:03.8851493Z 2025-12-04T14:33:03.8851613Z Finished test_maskedtensor 1/1 ... [2025-12-04 14:33:03.864004][2208246.324990376], took 1.67min 2025-12-04T14:33:03.8851997Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T14:33:03.8852352Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T14:33:03.8852553Z Running test_ops 2/5 ... [2025-12-04 14:33:03.870840][2208246.331829003] 2025-12-04T14:33:03.8852743Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T14:33:03.8853108Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '--shard-id=2', '--num-shards=5', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 14:33:03.871032] 2025-12-04T15:22:22.6023520Z 2025-12-04T15:22:22.6024512Z PRINTING LOG FILE of test_ops 2/5 (test/test-reports/test_ops_2.5_5d2d9f84f109f206_.log) 2025-12-04T15:22:22.6025314Z Test results will be stored in test-reports/python-pytest/test_ops/test_ops-c76d91645d4bc776.xml 2025-12-04T15:22:22.6025854Z ============================= test session starts ============================== 2025-12-04T15:22:22.6026390Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T15:22:22.6026870Z cachedir: .pytest_cache 2025-12-04T15:22:22.6027503Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T15:22:22.6028173Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T15:22:22.6028485Z configfile: pytest.ini 2025-12-04T15:22:22.6029055Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T15:22:22.6029658Z collecting ... 
collected 33666 items 2025-12-04T15:22:22.6030010Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T15:22:22.6775054Z Running 6691 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___rsub___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_ones_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumsum_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_lu_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_softshrink_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unique_consecutive_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_block_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_div_no_rounding_mode_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_permuted_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_imag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_item_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_lerp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_empty_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv_transpose3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_randn_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_rsqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sgn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sigmoid_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_split_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_squeeze_multiple_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unbind_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unfold_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rand___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___rxor___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_T_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_short_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_acosh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_addcdiv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_all_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_asinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atleast_1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_and_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_block_diag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_digamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_equal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_meshgrid_list_of_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_minimum_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ne_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_empty_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_channel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_glu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hardshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_mse_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_selu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_smooth_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_tanhshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_permute_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_positive_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_signbit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sinc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_i0e_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_multigammaln_mvlgamma_p_1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_multigammaln_mvlgamma_p_3_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtri_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_split_with_sizes_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sqrt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_std_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_var_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addbmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addcdiv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addmm_decomposed_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_aminmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_any_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_argsort_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atleast_2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bernoulli_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_to_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cfloat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cholesky_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_conj_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_corrcoef_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_count_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_double_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_permuted_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_erfc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_exp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_expand_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_expand_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_expm1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_flip_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_float_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_full_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_gather_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_geqrf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_gradient_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_grid_sampler_2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_histogramdd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_i0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_imag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_reduce_amin_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isnan_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_2inputs_2outputs_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_unary_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_kthvalue_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ldexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_le_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_cond_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eigvals_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_inv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lstsq_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log10_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logdet_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_argmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_std_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_var_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_binary_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nanmedian_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nansum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_native_batch_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_native_layer_norm_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_neg_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_batch_norm_without_cudnn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_celu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_channel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv_transpose1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_elu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_embedding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_area_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_linear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_logsigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool2d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_mish_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_mse_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multi_head_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_circular_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_constant_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_replicate_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pdist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_scaled_dot_product_attention_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_unfold_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_upsample_nearest_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_permute_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_pinverse_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_3_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_4_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_positive_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_put_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_qr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randint_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ravel_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_resize__cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_amax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_blackman_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signbit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sinc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_slice_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sparse_mm_reduce_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_airy_ai_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_i1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_scaled_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_squeeze_multiple_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_svd_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch__scaled_mm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch__scaled_mm_v2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__efficient_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__flash_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trapz_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unique_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_where_cuda, test/test_ops.py::TestCommonCUDA::test_errors___radd___cuda, test/test_ops.py::TestCommonCUDA::test_errors_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_errors_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_errors_ge_cuda, test/test_ops.py::TestCommonCUDA::test_errors_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_errors_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_errors_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logcumsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_select_cuda, test/test_ops.py::TestCommonCUDA::test_errors_mean_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool2d_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_ormqr_cuda, test/test_ops.py::TestCommonCUDA::test_errors_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_errors_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_bartlett_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout0_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout1_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_errors_uniform_cuda, test/test_ops.py::TestCommonCUDA::test_errors_view_cuda, test/test_ops.py::TestCommonCUDA::test_errors_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_angle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_diagonal_copy_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_tensorinv_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_pow_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_view_copy_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_H_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices___radd___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__unsafe_masked_index_put_accumulate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_aminmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argwhere_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_max_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cov_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfftn_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_2inputs_2outputs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_binary_return_by_ref_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_kron_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vander_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amin_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_msort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_ones_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cosine_embedding_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_nll_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_interleave_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize__cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize__cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rsub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scalar_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scalar_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_searchsorted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_erfcx_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trapezoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unflatten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_acosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_amin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_aminmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_3d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clamp_max_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_contiguous_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_copysign_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_equal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expand_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fmin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_full_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ge_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_gt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isposinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_unary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_linalg_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_maximum_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nan_to_num_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pixel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ones_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_reshape_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_scatter_reduce_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_w_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_entr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_hermite_polynomial_h_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_k0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_scaled_modified_bessel_k0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_spherical_bessel_j0_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_with_sizes_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unsqueeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_view_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___radd___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___ror___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_put_accumulate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_angle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_any_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bincount_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cfloat_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_combinations_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_corrcoef_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumulative_trapezoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_no_rounding_mode_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flatten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gradient_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_histc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isinf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_unary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_svd_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_var_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_pool2d_with_indices_backward_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_median_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmedian_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nansum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool3d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_channel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_linear_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool2d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_4_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_put_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rand_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_conj_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scalar_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_sampled_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_legendre_polynomial_p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_log_ndtr_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_to_size_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tan_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_uniform_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_argwhere_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diff_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_2inputs_2outputs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_2inputs_2outputs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_ravel_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_searchsorted_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_cosine_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_gaussian_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_multiple_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tile_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_where_cuda_float64, test/test_ops.py::TestCommonCUDA::test_out_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_out___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfft_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_normal__in_place_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argwhere_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_count_nonzero_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_integral_dtype__refs_prod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_out_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logical_not_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mT_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_cross_entropy_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_positive_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cosh_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_gather_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_kron_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cond_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eig_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_ex_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_msort_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_outer_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sgn_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_select_scatter_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_y1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_torch_ops_aten__safe_softmax_default_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unravel_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning_H_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___getitem___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_byte_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_float_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_long_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_polar_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_acos_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addcdiv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_all_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_allclose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_any_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atanh_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atleast_1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_conj_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_constant_pad_nd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_dot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_frac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_gt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_i0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_vecdot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_not_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_maximum_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_positive_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rad2deg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_roll_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_log_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_logit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_t_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unsqueeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__segment_reduce_offsets_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cartesian_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cat_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clamp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cov_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cumulative_trapezoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_double_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_einsum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_empty_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_empty_permuted_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_eq_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_flatten_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ge_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_reduce_amax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isnan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_binary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_binary_return_by_ref_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_kron_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_cholesky_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_solve_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_hermitian_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_singular_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log10_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_select_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_var_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_binary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_min_reduction_no_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nanquantile_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nansum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ne_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool2d_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv_transpose1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_dropout2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_nearest_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool3d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_rms_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_selu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_smooth_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_triplet_margin_with_distance_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_inf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_permute_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_4_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randint_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randn_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_resolve_conj_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_amax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_sum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_general_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_slice_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sort_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sparse_sampled_addmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_erfcx_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_i0e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_laguerre_polynomial_l_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_legendre_polynomial_p_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_spherical_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_square_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_stft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_torch__scaled_mm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_torch__scaled_mm_v2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_torch_ops_aten__efficient_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_torch_ops_aten__safe_softmax_default_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_triangular_solve_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unique_consecutive_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unravel_index_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unsafe_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unsafe_split_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_zeros_like_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_left_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mish_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_prelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs__conversions_polar_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_eq_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfftn_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_ge_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_roll_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_trace_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_unbind_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_float8_e5m2fnuz, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_igammac_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hinge_embedding_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softplus_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_indices_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_indices_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_indices_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float8_e4m3fn, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float8_e5m2, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_istft_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_polar_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float8_e5m2, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_istft_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_min_reduction_no_dim_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_std_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_var_mean_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_var_mean_unbiased_cuda, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rpow___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_decomposed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_byte_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_einsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftn_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gather_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_geqrf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_inner_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isnan_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eig_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_hermitian_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanmean_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool3d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_static_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_fro_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_outer_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_permute_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pinverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_repeat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_prod_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_airy_ai_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_svd_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_sparse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapz_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_like_cuda_complex64, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fliplr_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cond_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hinge_embedding_loss_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_threshold_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_permute_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_positive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_to_sparse_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsafe_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_byte_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_float_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hash_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_int_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isfinite_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_kron_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_tensorsolve_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_avg_pool1d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_poisson_nll_loss_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_outer_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_select_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_v_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rdiv___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_alias_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_copy_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_asin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_byte_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_eye_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ge_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_geqrf_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lgamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lt_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardswish_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rad2deg_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_reshape_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_airy_ai_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_j1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_y0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_t_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tensordot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_uniform_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unsafe_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___radd___cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_erfinv_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_igamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_power_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_xor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_matmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_binary_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_instance_norm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ones_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_outer_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_permute_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randn_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resolve_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_searchsorted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_hann_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_y0_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_topk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_transpose_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unbind_copy_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unique_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_as_strided_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_equal_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isreal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lgamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_cond_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_norm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_xor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_reduction_with_dim_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool1d_grad_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_permute_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_short_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signbit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_bessel_j1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_zeta_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_square_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unsqueeze_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vdot_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view_T_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rmul___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rsub___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_T_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_float_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_abs_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_acos_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addcmul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_all_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_chunk_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_clone_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cos_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cosh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diagonal_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_eq_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_flip_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_imag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_select_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isnan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_lerp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_cross_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_and_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_not_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logsumexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_narrow_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_neg_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_empty_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_full_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_permute_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_positive_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_prod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sgn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sinh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_split_with_sizes_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sqrt_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_square_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_squeeze_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sum_to_size_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_t_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tril_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unbind_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_var_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_vsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_acos_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addcdiv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_byte_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_column_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_count_nonzero_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cross_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_dsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_dstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_equal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_exp2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_exp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_eye_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fill_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_flip_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_full_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_item_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_binary_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ldexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_factor_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_pinv_singular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vander_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vecdot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_log2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logcumsumexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logdet_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lu_unpack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mH_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_prod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nanmean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_narrow_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv1d_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_circular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_replicate_negative_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_pow_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ravel_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_roll_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_rsub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sinh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sum_to_size_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_to_sparse_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_trapz_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_triangular_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tril_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_triu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_var_unbiased_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_view_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_vsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_vstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_T_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rmatmul___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rpow___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_bfloat16_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_byte_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_cdouble_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_half_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_long_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_add_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_alias_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_cat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_chunk_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_column_stack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_conj_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_conj_physical_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_exp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_imag_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_add_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isfinite_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_lerp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logaddexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_not_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_mul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_narrow_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_empty_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_positive_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_repeat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_reshape_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_special_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_stft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_t_copy_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_tensor_split_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_true_divide_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unsqueeze_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_var_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_vdot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addcdiv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_allclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_any_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_1d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cdouble_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cfloat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_clone_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_conj_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_count_nonzero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumprod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumulative_trapezoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diag_embed_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dist_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dsplit_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_permuted_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_eq_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_fftshift_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_hfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_irfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_flip_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fliplr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_float_power_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_hsplit_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_put_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_int_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isinf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_item_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cholesky_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_householder_product_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lstsq_grad_oriented_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_factor_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_power_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_pinv_hermitian_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_pinv_singular_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_slogdet_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_svdvals_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_tensorsolve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linspace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logaddexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_xor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logspace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_unpack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mT_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_cumprod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_logsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_narrow_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv3d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_constant_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_silu_complex_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_inf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_nuc_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_pow_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_prod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_renorm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_slice_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sqrt_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_square_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_svd_lowrank_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_t_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tanh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tile_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_transpose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unflatten_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unsqueeze_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unsqueeze_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_mean_unbiased_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_unbiased_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view_H_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_T_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view___radd___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rsub___cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_T_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_byte_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_abs_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_acos_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_alias_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_all_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atleast_1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_broadcast_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_copysign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cumprod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_dstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_eq_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expand_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expm1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_irfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_flatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fliplr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_igamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_igammac_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isfinite_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isinf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_lgamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log10_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logaddexp2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nan_to_num_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_neg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_empty_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nextafter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_huber_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_relu6_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_normal__in_place_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_permute_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_positive_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ravel_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_reciprocal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_select_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_bessel_j1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_erfcx_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_i1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_ndtri_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_squeeze_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_stft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_t_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_take_along_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_where_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_xlogy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_zeros_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addbmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_allclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_any_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_argsort_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_as_strided_partial_views_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_asin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atan2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bool_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_broadcast_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cartesian_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cauchy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cdouble_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_max_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_conj_physical_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cummax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cumprod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_empty_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_empty_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_expand_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_expm1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_exponential_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_rfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_flatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_flip_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_fmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_full_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_gather_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ge_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_gt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_half_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_histc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_igammac_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isfinite_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lerp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigvals_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_householder_product_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_ldl_factor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lstsq_grad_oriented_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_power_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_rank_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_pinv_hermitian_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_slogdet_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_log1p_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_softmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logaddexp2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logcumsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_not_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_long_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lu_unpack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_cumprod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_softmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_msort_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nan_to_num_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nanmean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nanquantile_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nansum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_native_batch_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ne_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_empty_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_zeros_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool2d_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_embedding_bag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardswish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_reflect_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_relu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_soft_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_tanhshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_upsample_bilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ones_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ormqr_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_permute_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_pinverse_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_pow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_randint_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ravel_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_reciprocal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_round_decimals_0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scalar_tensor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_cosine_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_general_cosine_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_general_hamming_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_slice_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_chebyshev_polynomial_v_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_hermite_polynomial_h_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_ndtri_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_scaled_modified_bessel_k1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_zeta_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tensor_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_trapz_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tril_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_triu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unfold_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unfold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unique_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unsqueeze_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_var_mean_unbiased_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_aminmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argwhere_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rmod___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rxor___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_add_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_left_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bucketize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagflat_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_empty_permuted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_expand_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_gcd_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_histc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_int_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isneginf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_istft_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_unary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_le_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eigh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logical_xor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nansum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_avg_pool1d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multilabel_soft_margin_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nonzero_static_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polar_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randint_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_repeat_interleave_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_cosine_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_y1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_transpose_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_zeros_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__batch_norm_with_update_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_baddbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cfloat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagonal_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_irfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_flip_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_gradient_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eigvals_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_msort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nanmedian_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_glu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_circular_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_amax_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_squeeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_t_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_take_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unbind_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unflatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsqueeze_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__unsafe_masked_index_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_alias_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cfloat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_chalf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_corrcoef_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagflat_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_digamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_flipud_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_gather_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lu_solve_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_tensorinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nanmean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_dropout3d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_repeat_interleave_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_squeeze_multiple_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsqueeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_equal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_expand_as_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_eye_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_gather_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_grid_sampler_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_imag_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isposinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lu_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_msort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_narrow_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ne_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_cosine_similarity_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_glu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_upsample_nearest_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randint_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randint_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_resize_as__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signbit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_airy_ai_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_i1_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_ops.py::TestFakeTensorCUDA::test_fake_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_transpose_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unique_consecutive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unravel_index_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unsqueeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_as_real_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmod___cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_allclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_aminmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_argsort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_not_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_or_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bucketize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_count_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_no_rounding_mode_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_full_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_geqrf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_imag_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isposinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_unary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lcm_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ldexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_msort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanquantile_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ne_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nextafter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softmin_with_dtype_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pca_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resize_as__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scalar_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_airy_ai_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_v_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_transpose_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_int8, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float64, test/test_ops.py::TestTagsCUDA::test_tags___rxor___cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_double_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_int_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_long_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_acosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_addcdiv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_alias_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_amax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_arange_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_asin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atan2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atleast_3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_block_diag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ceil_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_column_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cumsum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_deg2rad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_digamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_empty_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_eq_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_erf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_erfinv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_exp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expand_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expand_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expm1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_exponential_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_float_power_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fmod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_frexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ge_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_gt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_hsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isreal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_item_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_normal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logaddexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logical_xor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_narrow_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_mish_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_threshold_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_normal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_permute_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_renorm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_repeat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_roll_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sigmoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_entr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_i1e_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_ndtr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_triu_indices_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_unflatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_var_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_var_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_vdot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_where_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__upsample_bilinear2d_aa_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_argwhere_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atan2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_left_shift_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_not_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_cartesian_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cholesky_inverse_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_count_nonzero_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cross_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diagflat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diagonal_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_div_trunc_rounding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_empty_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_exp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expand_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expand_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expm1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_irfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_flatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fliplr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_floor_divide_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_geqrf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_hash_tensor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isnan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isneginf_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_jiterator_binary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cross_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_power_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_multi_dot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_tensorinv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_not_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_or_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lu_unpack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mH_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_logsumexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_median_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_softmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_var_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_matmul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_binary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_median_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mode_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_movedim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nan_to_num_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_native_dropout_backward_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_empty_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_embedding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_prelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softplus_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_threshold_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nonzero_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_normal_in_place_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ormqr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_prod_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_randn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_real_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_repeat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_rsqrt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sgn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_exponential_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signbit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_slice_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_j0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_entr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_i1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_ndtr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_mean_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_stft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unsqueeze_copy_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_view_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_xlogy_cuda_float32, test/test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_add_cuda_float32 2025-12-04T15:22:22.7502142Z 2025-12-04T15:22:22.7502343Z test_ops.py::TestCommonCUDA::test_compare_cpu___rsub___cuda_float32 SKIPPED [0.0975s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7502748Z test_ops.py::TestCommonCUDA::test_compare_cpu__chunk_cat_cuda_float32 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7503155Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_int_cuda_float32 SKIPPED [0.0001s] (Overflow when downcasting signed type is undefined) [ 0%] 2025-12-04T15:22:22.7503566Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addcmul_cuda_float32 SKIPPED [0.0012s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7503969Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addr_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7504377Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_copy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7504825Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_contiguous_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7505235Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diagonal_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7505639Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_dsplit_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7506043Z 
test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_copy_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7506448Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fliplr_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7506847Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_lerp_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7507295Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_norm_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7507719Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_vector_norm_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7508138Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logaddexp_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7508546Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logspace_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7508971Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logspace_tensor_overload_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7509416Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_meshgrid_list_of_tensors_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7509831Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_mul_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7510278Z 
test_ops.py::TestCommonCUDA::test_compare_cpu__refs_narrow_copy_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7510728Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_full_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7511134Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_ones_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7511604Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_zeros_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7512012Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nextafter_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7512411Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_dropout_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 0%] 2025-12-04T15:22:22.7512814Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_nll_loss_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7513265Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_softmin_with_dtype_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7513692Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_rsub_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7514100Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7514538Z 
test_ops.py::TestCommonCUDA::test_compare_cpu__refs_special_log_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7514961Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_stack_cuda_float32 SKIPPED [0.0011s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7515359Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_sum_to_size_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7515757Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_trace_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7516181Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_transpose_copy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7516592Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_var_mean_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7516985Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vdot_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7517377Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_view_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7517771Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_xlogy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7518179Z test_ops.py::TestCommonCUDA::test_compare_cpu__segment_reduce_lengths_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7518604Z 
test_ops.py::TestCommonCUDA::test_compare_cpu__segment_reduce_offsets_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7519023Z test_ops.py::TestCommonCUDA::test_compare_cpu_addmm_decomposed_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7519450Z test_ops.py::TestCommonCUDA::test_compare_cpu_atleast_3d_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7519835Z test_ops.py::TestCommonCUDA::test_compare_cpu_bmm_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7520273Z test_ops.py::TestCommonCUDA::test_compare_cpu_cholesky_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7520675Z test_ops.py::TestCommonCUDA::test_compare_cpu_cholesky_inverse_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7521071Z test_ops.py::TestCommonCUDA::test_compare_cpu_cumsum_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7521472Z test_ops.py::TestCommonCUDA::test_compare_cpu_cumulative_trapezoid_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7521889Z test_ops.py::TestCommonCUDA::test_compare_cpu_div_floor_rounding_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7522302Z test_ops.py::TestCommonCUDA::test_compare_cpu_div_no_rounding_mode_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7522714Z test_ops.py::TestCommonCUDA::test_compare_cpu_div_trunc_rounding_cuda_float32 SKIPPED [0.0010s] 
(test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7523115Z test_ops.py::TestCommonCUDA::test_compare_cpu_expand_as_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7523497Z test_ops.py::TestCommonCUDA::test_compare_cpu_eye_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7523879Z test_ops.py::TestCommonCUDA::test_compare_cpu_flipud_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7524268Z test_ops.py::TestCommonCUDA::test_compare_cpu_gradient_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7524674Z test_ops.py::TestCommonCUDA::test_compare_cpu_hsplit_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7525059Z test_ops.py::TestCommonCUDA::test_compare_cpu_hstack_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7525444Z test_ops.py::TestCommonCUDA::test_compare_cpu_index_put_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7525826Z test_ops.py::TestCommonCUDA::test_compare_cpu_kron_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7526222Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_cholesky_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7526630Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_diagonal_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7527031Z 
test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eig_cuda_float32 SKIPPED [0.0011s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7527431Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigvalsh_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7527865Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_householder_product_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 0%] 2025-12-04T15:22:22.7528320Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_inv_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7528728Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_ldl_solve_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 0%] 2025-12-04T15:22:22.7529162Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lu_factor_ex_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T15:22:22.7529569Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_svdvals_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7529969Z test_ops.py::TestCommonCUDA::test_compare_cpu_log_softmax_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7530403Z test_ops.py::TestCommonCUDA::test_compare_cpu_logaddexp2_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7530796Z test_ops.py::TestCommonCUDA::test_compare_cpu_logspace_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7531177Z test_ops.py::TestCommonCUDA::test_compare_cpu_lu_cuda_float32 SKIPPED [0.0009s] (test is slow; run 
with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7531569Z test_ops.py::TestCommonCUDA::test_compare_cpu_masked_cumprod_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7531973Z test_ops.py::TestCommonCUDA::test_compare_cpu_masked_median_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7532370Z test_ops.py::TestCommonCUDA::test_compare_cpu_matrix_exp_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7532775Z test_ops.py::TestCommonCUDA::test_compare_cpu_max_reduction_with_dim_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7533195Z test_ops.py::TestCommonCUDA::test_compare_cpu_min_reduction_with_dim_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7533622Z test_ops.py::TestCommonCUDA::test_compare_cpu_mul_cuda_float32 SKIPPED [0.0011s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7534027Z test_ops.py::TestCommonCUDA::test_compare_cpu_native_dropout_backward_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7534471Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool2d_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7534918Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_alpha_dropout_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7535350Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_batch_norm_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 
1%] 2025-12-04T15:22:22.7535773Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_ctc_loss_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7536179Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_fractional_max_pool3d_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 1%] 2025-12-04T15:22:22.7536586Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hardswish_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7537048Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_pool1d_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7537476Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_pool2d_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7537922Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool1d_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7538375Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_multilabel_soft_margin_loss_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7538828Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_circular_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7539268Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pixel_unshuffle_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7539667Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_rrelu_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 1%] 2025-12-04T15:22:22.7540067Z 
test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_softmin_with_dtype_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7540543Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_softshrink_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7540910Z test_ops.py::TestCommonCUDA::test_compare_cpu_normal_number_mean_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 1%] 2025-12-04T15:22:22.7541250Z test_ops.py::TestCommonCUDA::test_compare_cpu_ones_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7541637Z test_ops.py::TestCommonCUDA::test_compare_cpu_pinverse_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7542024Z test_ops.py::TestCommonCUDA::test_compare_cpu_polar_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7542412Z test_ops.py::TestCommonCUDA::test_compare_cpu_qr_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7542794Z test_ops.py::TestCommonCUDA::test_compare_cpu_randint_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7543178Z test_ops.py::TestCommonCUDA::test_compare_cpu_randn_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7543565Z test_ops.py::TestCommonCUDA::test_compare_cpu_reshape_as_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7543957Z test_ops.py::TestCommonCUDA::test_compare_cpu_resize_as__cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 
2025-12-04T15:22:22.7544350Z test_ops.py::TestCommonCUDA::test_compare_cpu_resolve_conj_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7544737Z test_ops.py::TestCommonCUDA::test_compare_cpu_rsub_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7545139Z test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_prod_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7545549Z test_ops.py::TestCommonCUDA::test_compare_cpu_slice_scatter_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7545936Z test_ops.py::TestCommonCUDA::test_compare_cpu_sparse_mm_reduce_cuda_float32 SKIPPED [0.0005s] (Only runs on cpu) [ 1%] 2025-12-04T15:22:22.7546316Z test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_v_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7546755Z test_ops.py::TestCommonCUDA::test_compare_cpu_special_zeta_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7547145Z test_ops.py::TestCommonCUDA::test_compare_cpu_stack_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7547539Z test_ops.py::TestCommonCUDA::test_compare_cpu_std_mean_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7547935Z test_ops.py::TestCommonCUDA::test_compare_cpu_std_unbiased_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7548339Z test_ops.py::TestCommonCUDA::test_compare_cpu_tensordot_cuda_float32 SKIPPED [0.0010s] (test is slow; run with 
PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7548765Z test_ops.py::TestCommonCUDA::test_compare_cpu_torch_ops_aten__safe_softmax_default_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7549187Z test_ops.py::TestCommonCUDA::test_compare_cpu_triu_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7549578Z test_ops.py::TestCommonCUDA::test_compare_cpu_unfold_copy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7549969Z test_ops.py::TestCommonCUDA::test_compare_cpu_unfold_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7550409Z test_ops.py::TestCommonCUDA::test_compare_cpu_unique_consecutive_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7550798Z test_ops.py::TestCommonCUDA::test_compare_cpu_unique_cuda_float32 SKIPPED [0.0001s] (Output order is undefined when sorted=False) [ 1%] 2025-12-04T15:22:22.7551195Z test_ops.py::TestCommonCUDA::test_compare_cpu_unsqueeze_copy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7551594Z test_ops.py::TestCommonCUDA::test_compare_cpu_view_as_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7551987Z test_ops.py::TestCommonCUDA::test_compare_cpu_zeros_like_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T15:22:22.7552336Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_acosh_cuda_complex32 PASSED [0.1429s] [ 1%] 2025-12-04T15:22:22.7552698Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_copy_cuda_complex32 SKIPPED 
[0.0002s] (Errors when storage_offset is included) [ 1%] 2025-12-04T15:22:22.7553071Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_scatter_cuda_complex32 PASSED [0.9268s] [ 1%] 2025-12-04T15:22:22.7553380Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_asinh_cuda_complex32 PASSED [0.7726s] [ 1%] 2025-12-04T15:22:22.7553679Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_block_diag_cuda_complex32 PASSED [0.7669s] [ 1%] 2025-12-04T15:22:22.7553977Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_chunk_cuda_complex32 PASSED [0.7412s] [ 1%] 2025-12-04T15:22:22.7554281Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_contiguous_cuda_complex32 PASSED [0.7367s] [ 1%] 2025-12-04T15:22:22.7554632Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_div_no_rounding_mode_cuda_complex32 PASSED [0.3959s] [ 1%] 2025-12-04T15:22:22.7555015Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_permuted_cuda_complex32 SKIPPED [0.0002s] (Expected: empty_permuted is not comparable) [ 1%] 2025-12-04T15:22:22.7555393Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_exp_cuda_complex32 PASSED [0.7490s] [ 1%] 2025-12-04T15:22:22.7555692Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fft_cuda_complex32 PASSED [4.3311s] [ 1%] 2025-12-04T15:22:22.7555985Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_imag_cuda_complex32 PASSED [0.7385s] [ 2%] 2025-12-04T15:22:22.7556273Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_item_cuda_complex32 PASSED [0.7526s] [ 2%] 2025-12-04T15:22:22.7556562Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_lerp_cuda_complex32 PASSED [0.7760s] [ 2%] 2025-12-04T15:22:22.7556868Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_linalg_diagonal_cuda_complex32 PASSED [0.7521s] [ 2%] 2025-12-04T15:22:22.7557196Z 
test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_log_softmax_with_dtype_cuda_complex32 PASSED [0.7950s] [ 2%] 2025-12-04T15:22:22.7557567Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_empty_cuda_complex32 SKIPPED [0.0002s] (Expected: new_empty is not comparable) [ 2%] 2025-12-04T15:22:22.7557940Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv1d_cuda_complex32 PASSED [2.1194s] [ 2%] 2025-12-04T15:22:22.7558313Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv_transpose3d_cuda_complex32 SKIPPED [0.0002s] (Skipped for ROCm!) [ 2%] 2025-12-04T15:22:22.7558741Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_randn_like_cuda_complex32 SKIPPED [0.0001s] (Expected: randn_like is not comparable between dtypes) [ 2%] 2025-12-04T15:22:22.7559115Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_reshape_cuda_complex32 PASSED [0.7679s] [ 2%] 2025-12-04T15:22:22.7559411Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_rsqrt_cuda_complex32 PASSED [0.7649s] [ 2%] 2025-12-04T15:22:22.7559699Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sgn_cuda_complex32 PASSED [0.7479s] [ 2%] 2025-12-04T15:22:22.7560004Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sigmoid_cuda_complex32 PASSED [0.7703s] [ 2%] 2025-12-04T15:22:22.7560329Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sin_cuda_complex32 PASSED [0.7683s] [ 2%] 2025-12-04T15:22:22.7560618Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_split_cuda_complex32 PASSED [0.7515s] [ 2%] 2025-12-04T15:22:22.7560925Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_squeeze_multiple_cuda_complex32 PASSED [0.7709s] [ 2%] 2025-12-04T15:22:22.7561241Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unbind_copy_cuda_complex32 PASSED [0.7489s] [ 2%] 2025-12-04T15:22:22.7561552Z 
test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unfold_copy_cuda_complex32 PASSED [0.0126s] [ 2%] 2025-12-04T15:22:22.7561822Z test_ops.py::TestCommonCUDA::test_dtypes___rand___cuda PASSED [0.8030s] [ 2%] 2025-12-04T15:22:22.7562040Z test_ops.py::TestCommonCUDA::test_dtypes___rxor___cuda PASSED [0.7880s] [ 2%] 2025-12-04T15:22:22.7562255Z test_ops.py::TestCommonCUDA::test_dtypes__refs_T_cuda PASSED [0.7779s] [ 2%] 2025-12-04T15:22:22.7562492Z test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_complex_cuda PASSED [0.7785s] [ 2%] 2025-12-04T15:22:22.7562750Z test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_short_cuda PASSED [0.7654s] [ 2%] 2025-12-04T15:22:22.7562992Z test_ops.py::TestCommonCUDA::test_dtypes__refs_acosh_cuda PASSED [0.7479s] [ 2%] 2025-12-04T15:22:22.7563215Z test_ops.py::TestCommonCUDA::test_dtypes__refs_addcdiv_cuda PASSED [0.8456s] [ 2%] 2025-12-04T15:22:22.7563465Z test_ops.py::TestCommonCUDA::test_dtypes__refs_all_cuda PASSED [0.7887s] [ 2%] 2025-12-04T15:22:22.7563682Z test_ops.py::TestCommonCUDA::test_dtypes__refs_amin_cuda PASSED [0.8237s] [ 2%] 2025-12-04T15:22:22.7563932Z test_ops.py::TestCommonCUDA::test_dtypes__refs_asinh_cuda PASSED [0.7708s] [ 2%] 2025-12-04T15:22:22.7564158Z test_ops.py::TestCommonCUDA::test_dtypes__refs_atleast_1d_cuda PASSED [0.7711s] [ 2%] 2025-12-04T15:22:22.7564393Z test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_and_cuda PASSED [0.7977s] [ 2%] 2025-12-04T15:22:22.7564628Z test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_not_cuda PASSED [0.8135s] [ 2%] 2025-12-04T15:22:22.7564860Z test_ops.py::TestCommonCUDA::test_dtypes__refs_block_diag_cuda PASSED [0.7996s] [ 2%] 2025-12-04T15:22:22.7565082Z test_ops.py::TestCommonCUDA::test_dtypes__refs_cat_cuda PASSED [0.8064s] [ 2%] 2025-12-04T15:22:22.7565301Z test_ops.py::TestCommonCUDA::test_dtypes__refs_cauchy_cuda PASSED [0.7580s] [ 2%] 2025-12-04T15:22:22.7565531Z 
test_ops.py::TestCommonCUDA::test_dtypes__refs_column_stack_cuda PASSED [0.7704s] [ 2%] 2025-12-04T15:22:22.7565762Z test_ops.py::TestCommonCUDA::test_dtypes__refs_copysign_cuda PASSED [0.8528s] [ 2%] 2025-12-04T15:22:22.7565998Z test_ops.py::TestCommonCUDA::test_dtypes__refs_diagonal_copy_cuda PASSED [0.8283s] [ 2%] 2025-12-04T15:22:22.7566231Z test_ops.py::TestCommonCUDA::test_dtypes__refs_digamma_cuda PASSED [0.7917s] [ 2%] 2025-12-04T15:22:22.7566471Z test_ops.py::TestCommonCUDA::test_dtypes__refs_div_trunc_rounding_cuda PASSED [0.8456s] [ 2%] 2025-12-04T15:22:22.7566706Z test_ops.py::TestCommonCUDA::test_dtypes__refs_equal_cuda PASSED [0.7926s] [ 2%] 2025-12-04T15:22:22.7566930Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfft2_cuda PASSED [11.0147s] [ 2%] 2025-12-04T15:22:22.7567159Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfftn_cuda PASSED [1.7204s] [ 2%] 2025-12-04T15:22:22.7567385Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ifft2_cuda PASSED [5.9655s] [ 2%] 2025-12-04T15:22:22.7567612Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ihfft_cuda PASSED [3.0621s] [ 2%] 2025-12-04T15:22:22.7567837Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_irfft_cuda PASSED [2.0016s] [ 2%] 2025-12-04T15:22:22.7568065Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfft2_cuda PASSED [2.9099s] [ 2%] 2025-12-04T15:22:22.7568295Z test_ops.py::TestCommonCUDA::test_dtypes__refs_float_power_cuda PASSED [1.3127s] [ 2%] 2025-12-04T15:22:22.7568535Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fmax_cuda PASSED [1.3020s] [ 2%] 2025-12-04T15:22:22.7568754Z test_ops.py::TestCommonCUDA::test_dtypes__refs_hypot_cuda PASSED [1.2873s] [ 2%] 2025-12-04T15:22:22.7568976Z test_ops.py::TestCommonCUDA::test_dtypes__refs_igamma_cuda PASSED [1.2822s] [ 2%] 2025-12-04T15:22:22.7569198Z test_ops.py::TestCommonCUDA::test_dtypes__refs_isneginf_cuda PASSED [1.2424s] [ 2%] 2025-12-04T15:22:22.7569418Z test_ops.py::TestCommonCUDA::test_dtypes__refs_lcm_cuda PASSED 
[1.2737s] [ 2%] 2025-12-04T15:22:22.7569639Z test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_svd_cuda PASSED [4.0901s] [ 2%] 2025-12-04T15:22:22.7569869Z test_ops.py::TestCommonCUDA::test_dtypes__refs_logaddexp2_cuda PASSED [1.3152s] [ 2%] 2025-12-04T15:22:22.7570140Z test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_or_cuda PASSED [1.2659s] [ 2%] 2025-12-04T15:22:22.7570373Z test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_xor_cuda PASSED [1.2818s] [ 2%] 2025-12-04T15:22:22.7570598Z test_ops.py::TestCommonCUDA::test_dtypes__refs_lt_cuda PASSED [1.2690s] [ 2%] 2025-12-04T15:22:22.7570842Z test_ops.py::TestCommonCUDA::test_dtypes__refs_meshgrid_list_of_tensors_cuda PASSED [1.3039s] [ 2%] 2025-12-04T15:22:22.7571089Z test_ops.py::TestCommonCUDA::test_dtypes__refs_minimum_cuda PASSED [1.2490s] [ 2%] 2025-12-04T15:22:22.7571312Z test_ops.py::TestCommonCUDA::test_dtypes__refs_movedim_cuda PASSED [1.2470s] [ 2%] 2025-12-04T15:22:22.7571530Z test_ops.py::TestCommonCUDA::test_dtypes__refs_ne_cuda PASSED [1.2715s] [ 2%] 2025-12-04T15:22:22.7571776Z test_ops.py::TestCommonCUDA::test_dtypes__refs_new_empty_cuda PASSED [1.2974s] [ 2%] 2025-12-04T15:22:22.7572029Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_channel_shuffle_cuda PASSED [1.2660s] [ 2%] 2025-12-04T15:22:22.7572308Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_glu_cuda PASSED [2.6731s] [ 2%] 2025-12-04T15:22:22.7572562Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hardshrink_cuda PASSED [1.2043s] [ 2%] 2025-12-04T15:22:22.7572837Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hinge_embedding_loss_cuda PASSED [1.2270s] [ 2%] 2025-12-04T15:22:22.7573111Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_leaky_relu_cuda PASSED [1.2492s] [ 2%] 2025-12-04T15:22:22.7573383Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_margin_ranking_loss_cuda PASSED [1.2959s] [ 3%] 2025-12-04T15:22:22.7573659Z 
test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_mse_loss_cuda PASSED [1.2266s] [ 3%] 2025-12-04T15:22:22.7573933Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pairwise_distance_cuda PASSED [1.2841s] [ 3%] 2025-12-04T15:22:22.7574199Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_selu_cuda PASSED [1.2227s] [ 3%] 2025-12-04T15:22:22.7574460Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_smooth_l1_loss_cuda PASSED [1.2654s] [ 3%] 2025-12-04T15:22:22.7574723Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softshrink_cuda PASSED [1.2647s] [ 3%] 2025-12-04T15:22:22.7574981Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_tanhshrink_cuda PASSED [1.2657s] [ 3%] 2025-12-04T15:22:22.7575222Z test_ops.py::TestCommonCUDA::test_dtypes__refs_normal_cuda PASSED [1.2370s] [ 3%] 2025-12-04T15:22:22.7575449Z test_ops.py::TestCommonCUDA::test_dtypes__refs_permute_copy_cuda PASSED [1.2367s] [ 3%] 2025-12-04T15:22:22.7575680Z test_ops.py::TestCommonCUDA::test_dtypes__refs_positive_cuda PASSED [1.2236s] [ 3%] 2025-12-04T15:22:22.7575907Z test_ops.py::TestCommonCUDA::test_dtypes__refs_reciprocal_cuda PASSED [1.2240s] [ 3%] 2025-12-04T15:22:22.7576136Z test_ops.py::TestCommonCUDA::test_dtypes__refs_reshape_as_cuda PASSED [1.2678s] [ 3%] 2025-12-04T15:22:22.7576363Z test_ops.py::TestCommonCUDA::test_dtypes__refs_sigmoid_cuda PASSED [1.2645s] [ 3%] 2025-12-04T15:22:22.7576581Z test_ops.py::TestCommonCUDA::test_dtypes__refs_sign_cuda PASSED [1.2578s] [ 3%] 2025-12-04T15:22:22.7576812Z test_ops.py::TestCommonCUDA::test_dtypes__refs_signbit_cuda PASSED [1.2157s] [ 3%] 2025-12-04T15:22:22.7577031Z test_ops.py::TestCommonCUDA::test_dtypes__refs_sinc_cuda PASSED [1.3013s] [ 3%] 2025-12-04T15:22:22.7577262Z test_ops.py::TestCommonCUDA::test_dtypes__refs_softmax_with_dtype_cuda PASSED [1.3174s] [ 3%] 2025-12-04T15:22:22.7577503Z test_ops.py::TestCommonCUDA::test_dtypes__refs_special_i0e_cuda PASSED [1.2507s] [ 3%] 
2025-12-04T15:22:22.7577764Z test_ops.py::TestCommonCUDA::test_dtypes__refs_special_multigammaln_mvlgamma_p_1_cuda PASSED [1.2778s] [ 3%] 2025-12-04T15:22:22.7578056Z test_ops.py::TestCommonCUDA::test_dtypes__refs_special_multigammaln_mvlgamma_p_3_cuda PASSED [1.3142s] [ 3%] 2025-12-04T15:22:22.7578320Z test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtri_cuda PASSED [1.2328s] [ 3%] 2025-12-04T15:22:22.7578575Z test_ops.py::TestCommonCUDA::test_dtypes__refs_special_softmax_with_dtype_cuda PASSED [1.2909s] [ 3%] 2025-12-04T15:22:22.7578830Z test_ops.py::TestCommonCUDA::test_dtypes__refs_special_zeta_cuda PASSED [1.2761s] [ 3%] 2025-12-04T15:22:22.7579070Z test_ops.py::TestCommonCUDA::test_dtypes__refs_split_with_sizes_cuda PASSED [1.3122s] [ 3%] 2025-12-04T15:22:22.7579300Z test_ops.py::TestCommonCUDA::test_dtypes__refs_sqrt_cuda PASSED [1.3074s] [ 3%] 2025-12-04T15:22:22.7579518Z test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_cuda PASSED [1.2544s] [ 3%] 2025-12-04T15:22:22.7579734Z test_ops.py::TestCommonCUDA::test_dtypes__refs_std_cuda PASSED [1.2971s] [ 3%] 2025-12-04T15:22:22.7579957Z test_ops.py::TestCommonCUDA::test_dtypes__refs_var_cuda PASSED [1.3056s] [ 3%] 2025-12-04T15:22:22.7580225Z test_ops.py::TestCommonCUDA::test_dtypes__refs_vsplit_cuda PASSED [1.2496s] [ 3%] 2025-12-04T15:22:22.7580459Z test_ops.py::TestCommonCUDA::test_dtypes__refs_xlogy_cuda PASSED [1.3233s] [ 3%] 2025-12-04T15:22:22.7580673Z test_ops.py::TestCommonCUDA::test_dtypes_addbmm_cuda PASSED [2.8020s] [ 3%] 2025-12-04T15:22:22.7580885Z test_ops.py::TestCommonCUDA::test_dtypes_addcdiv_cuda PASSED [1.3589s] [ 3%] 2025-12-04T15:22:22.7581105Z test_ops.py::TestCommonCUDA::test_dtypes_addmm_decomposed_cuda PASSED [1.3008s] [ 3%] 2025-12-04T15:22:22.7581324Z test_ops.py::TestCommonCUDA::test_dtypes_amin_cuda PASSED [1.2729s] [ 3%] 2025-12-04T15:22:22.7581532Z test_ops.py::TestCommonCUDA::test_dtypes_aminmax_cuda PASSED [1.3033s] [ 3%] 2025-12-04T15:22:22.7581740Z 
test_ops.py::TestCommonCUDA::test_dtypes_any_cuda PASSED [1.2832s] [ 3%] 2025-12-04T15:22:22.7581947Z test_ops.py::TestCommonCUDA::test_dtypes_argmax_cuda PASSED [1.3047s] [ 3%] 2025-12-04T15:22:22.7582156Z test_ops.py::TestCommonCUDA::test_dtypes_argsort_cuda PASSED [1.5583s] [ 3%] 2025-12-04T15:22:22.7582362Z test_ops.py::TestCommonCUDA::test_dtypes_atan2_cuda PASSED [1.2742s] [ 3%] 2025-12-04T15:22:22.7582570Z test_ops.py::TestCommonCUDA::test_dtypes_atanh_cuda PASSED [1.5418s] [ 3%] 2025-12-04T15:22:22.7582779Z test_ops.py::TestCommonCUDA::test_dtypes_atleast_2d_cuda PASSED [1.2585s] [ 3%] 2025-12-04T15:22:22.7582990Z test_ops.py::TestCommonCUDA::test_dtypes_bernoulli_cuda PASSED [1.2971s] [ 3%] 2025-12-04T15:22:22.7583202Z test_ops.py::TestCommonCUDA::test_dtypes_bitwise_and_cuda PASSED [1.2684s] [ 3%] 2025-12-04T15:22:22.7583428Z test_ops.py::TestCommonCUDA::test_dtypes_bitwise_left_shift_cuda PASSED [1.3339s] [ 3%] 2025-12-04T15:22:22.7583654Z test_ops.py::TestCommonCUDA::test_dtypes_bitwise_not_cuda PASSED [1.2847s] [ 3%] 2025-12-04T15:22:22.7583865Z test_ops.py::TestCommonCUDA::test_dtypes_bmm_cuda PASSED [1.2574s] [ 3%] 2025-12-04T15:22:22.7584103Z test_ops.py::TestCommonCUDA::test_dtypes_broadcast_shapes_cuda SKIPPED [0.0003s] (Skipped!) 
[ 3%] 2025-12-04T15:22:22.7584344Z test_ops.py::TestCommonCUDA::test_dtypes_broadcast_to_cuda PASSED [1.3001s] [ 3%] 2025-12-04T15:22:22.7584559Z test_ops.py::TestCommonCUDA::test_dtypes_ceil_cuda PASSED [1.2731s] [ 3%] 2025-12-04T15:22:22.7584766Z test_ops.py::TestCommonCUDA::test_dtypes_cfloat_cuda PASSED [1.2912s] [ 3%] 2025-12-04T15:22:22.7584990Z test_ops.py::TestCommonCUDA::test_dtypes_cholesky_cuda PASSED [1.3439s] [ 3%] 2025-12-04T15:22:22.7585199Z test_ops.py::TestCommonCUDA::test_dtypes_conj_cuda PASSED [1.2449s] [ 3%] 2025-12-04T15:22:22.7585407Z test_ops.py::TestCommonCUDA::test_dtypes_corrcoef_cuda PASSED [1.3393s] [ 3%] 2025-12-04T15:22:22.7585622Z test_ops.py::TestCommonCUDA::test_dtypes_count_nonzero_cuda PASSED [1.2978s] [ 3%] 2025-12-04T15:22:22.7585838Z test_ops.py::TestCommonCUDA::test_dtypes_double_cuda PASSED [1.2964s] [ 3%] 2025-12-04T15:22:22.7586049Z test_ops.py::TestCommonCUDA::test_dtypes_dsplit_cuda PASSED [1.2736s] [ 3%] 2025-12-04T15:22:22.7586266Z test_ops.py::TestCommonCUDA::test_dtypes_empty_permuted_cuda PASSED [0.0646s] [ 3%] 2025-12-04T15:22:22.7586484Z test_ops.py::TestCommonCUDA::test_dtypes_erfc_cuda PASSED [1.2754s] [ 3%] 2025-12-04T15:22:22.7586690Z test_ops.py::TestCommonCUDA::test_dtypes_exp_cuda PASSED [1.3168s] [ 3%] 2025-12-04T15:22:22.7586905Z test_ops.py::TestCommonCUDA::test_dtypes_expand_copy_cuda PASSED [1.2812s] [ 3%] 2025-12-04T15:22:22.7587117Z test_ops.py::TestCommonCUDA::test_dtypes_expand_cuda PASSED [1.3165s] [ 3%] 2025-12-04T15:22:22.7587323Z test_ops.py::TestCommonCUDA::test_dtypes_expm1_cuda PASSED [1.2630s] [ 3%] 2025-12-04T15:22:22.7587537Z test_ops.py::TestCommonCUDA::test_dtypes_fft_fftshift_cuda PASSED [1.2797s] [ 3%] 2025-12-04T15:22:22.7587751Z test_ops.py::TestCommonCUDA::test_dtypes_fft_hfft_cuda PASSED [2.2247s] [ 3%] 2025-12-04T15:22:22.7587982Z test_ops.py::TestCommonCUDA::test_dtypes_fft_ifftshift_cuda PASSED [1.2887s] [ 3%] 2025-12-04T15:22:22.7588208Z 
test_ops.py::TestCommonCUDA::test_dtypes_flip_cuda PASSED [1.3351s] [ 3%] 2025-12-04T15:22:22.7588429Z test_ops.py::TestCommonCUDA::test_dtypes_float_cuda PASSED [1.2703s] [ 3%] 2025-12-04T15:22:22.7588641Z test_ops.py::TestCommonCUDA::test_dtypes_floor_divide_cuda PASSED [1.3036s] [ 4%] 2025-12-04T15:22:22.7588856Z test_ops.py::TestCommonCUDA::test_dtypes_full_cuda PASSED [1.2882s] [ 4%] 2025-12-04T15:22:22.7589063Z test_ops.py::TestCommonCUDA::test_dtypes_gather_cuda PASSED [1.3324s] [ 4%] 2025-12-04T15:22:22.7589269Z test_ops.py::TestCommonCUDA::test_dtypes_geqrf_cuda XFAIL [0.0791s] [ 4%] 2025-12-04T15:22:22.7589477Z test_ops.py::TestCommonCUDA::test_dtypes_gradient_cuda PASSED [2.5528s] [ 4%] 2025-12-04T15:22:22.7589695Z test_ops.py::TestCommonCUDA::test_dtypes_grid_sampler_2d_cuda PASSED [1.3058s] [ 4%] 2025-12-04T15:22:22.7589920Z test_ops.py::TestCommonCUDA::test_dtypes_histogramdd_cuda PASSED [1.4420s] [ 4%] 2025-12-04T15:22:22.7590168Z test_ops.py::TestCommonCUDA::test_dtypes_hstack_cuda PASSED [1.2232s] [ 4%] 2025-12-04T15:22:22.7590377Z test_ops.py::TestCommonCUDA::test_dtypes_hypot_cuda PASSED [1.3037s] [ 4%] 2025-12-04T15:22:22.7590586Z test_ops.py::TestCommonCUDA::test_dtypes_i0_cuda PASSED [1.3051s] [ 4%] 2025-12-04T15:22:22.7590791Z test_ops.py::TestCommonCUDA::test_dtypes_imag_cuda PASSED [1.2876s] [ 4%] 2025-12-04T15:22:22.7591005Z test_ops.py::TestCommonCUDA::test_dtypes_index_copy_cuda PASSED [1.2872s] [ 4%] 2025-12-04T15:22:22.7608216Z test_ops.py::TestCommonCUDA::test_dtypes_index_reduce_amin_cuda PASSED [1.2527s] [ 4%] 2025-12-04T15:22:22.7608495Z test_ops.py::TestCommonCUDA::test_dtypes_index_select_cuda PASSED [1.2961s] [ 4%] 2025-12-04T15:22:22.7608733Z test_ops.py::TestCommonCUDA::test_dtypes_isnan_cuda PASSED [1.2830s] [ 4%] 2025-12-04T15:22:22.7608955Z test_ops.py::TestCommonCUDA::test_dtypes_isneginf_cuda PASSED [1.2879s] [ 4%] 2025-12-04T15:22:22.7609212Z test_ops.py::TestCommonCUDA::test_dtypes_jiterator_2inputs_2outputs_cuda PASSED 
[1.8702s] [ 4%] 2025-12-04T15:22:22.7609475Z test_ops.py::TestCommonCUDA::test_dtypes_jiterator_unary_cuda PASSED [1.7413s] [ 4%] 2025-12-04T15:22:22.7609709Z test_ops.py::TestCommonCUDA::test_dtypes_kthvalue_cuda PASSED [1.3051s] [ 4%] 2025-12-04T15:22:22.7609929Z test_ops.py::TestCommonCUDA::test_dtypes_ldexp_cuda PASSED [1.2620s] [ 4%] 2025-12-04T15:22:22.7610242Z test_ops.py::TestCommonCUDA::test_dtypes_le_cuda PASSED [1.3174s] [ 4%] 2025-12-04T15:22:22.7610468Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_cond_cuda PASSED [1.2764s] [ 4%] 2025-12-04T15:22:22.7610704Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_diagonal_cuda PASSED [1.2481s] [ 4%] 2025-12-04T15:22:22.7610943Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_eigvals_cuda PASSED [1.4352s] [ 4%] 2025-12-04T15:22:22.7611173Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_inv_cuda PASSED [1.2939s] [ 4%] 2025-12-04T15:22:22.7611406Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_lstsq_cuda PASSED [2.0052s] [ 4%] 2025-12-04T15:22:22.7611653Z test_ops.py::TestCommonCUDA::test_dtypes_linspace_tensor_overload_cuda PASSED [1.5531s] [ 4%] 2025-12-04T15:22:22.7611898Z test_ops.py::TestCommonCUDA::test_dtypes_log10_cuda PASSED [1.3388s] [ 4%] 2025-12-04T15:22:22.7612119Z test_ops.py::TestCommonCUDA::test_dtypes_log_normal_cuda PASSED [1.2515s] [ 4%] 2025-12-04T15:22:22.7612343Z test_ops.py::TestCommonCUDA::test_dtypes_logdet_cuda PASSED [1.2927s] [ 4%] 2025-12-04T15:22:22.7612568Z test_ops.py::TestCommonCUDA::test_dtypes_masked_argmin_cuda PASSED [1.2881s] [ 4%] 2025-12-04T15:22:22.7612800Z test_ops.py::TestCommonCUDA::test_dtypes_masked_cumprod_cuda PASSED [1.3415s] [ 4%] 2025-12-04T15:22:22.7613030Z test_ops.py::TestCommonCUDA::test_dtypes_masked_prod_cuda PASSED [1.4220s] [ 4%] 2025-12-04T15:22:22.7613260Z test_ops.py::TestCommonCUDA::test_dtypes_masked_scatter_cuda PASSED [1.3225s] [ 4%] 2025-12-04T15:22:22.7613521Z test_ops.py::TestCommonCUDA::test_dtypes_masked_std_cuda PASSED [1.5649s] [ 4%] 
2025-12-04T15:22:22.7613744Z test_ops.py::TestCommonCUDA::test_dtypes_masked_var_cuda PASSED [1.5276s] [ 4%] 2025-12-04T15:22:22.7613985Z test_ops.py::TestCommonCUDA::test_dtypes_max_binary_cuda PASSED [1.2448s] [ 4%] 2025-12-04T15:22:22.7614225Z test_ops.py::TestCommonCUDA::test_dtypes_max_reduction_with_dim_cuda PASSED [1.2208s] [ 4%] 2025-12-04T15:22:22.7614493Z test_ops.py::TestCommonCUDA::test_dtypes_meshgrid_variadic_tensors_cuda PASSED [1.2717s] [ 4%] 2025-12-04T15:22:22.7614738Z test_ops.py::TestCommonCUDA::test_dtypes_nanmedian_cuda PASSED [1.2370s] [ 4%] 2025-12-04T15:22:22.7614956Z test_ops.py::TestCommonCUDA::test_dtypes_nansum_cuda PASSED [1.7103s] [ 4%] 2025-12-04T15:22:22.7615186Z test_ops.py::TestCommonCUDA::test_dtypes_native_batch_norm_cuda PASSED [1.3004s] [ 4%] 2025-12-04T15:22:22.7615428Z test_ops.py::TestCommonCUDA::test_dtypes_native_layer_norm_cuda PASSED [1.2986s] [ 4%] 2025-12-04T15:22:22.7615665Z test_ops.py::TestCommonCUDA::test_dtypes_neg_cuda PASSED [1.2895s] [ 4%] 2025-12-04T15:22:22.7615925Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_batch_norm_without_cudnn_cuda PASSED [1.2626s] [ 4%] 2025-12-04T15:22:22.7616197Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_celu_cuda PASSED [1.2499s] [ 4%] 2025-12-04T15:22:22.7616457Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_channel_shuffle_cuda PASSED [1.2536s] [ 4%] 2025-12-04T15:22:22.7616733Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv_transpose1d_cuda PASSED [2.8136s] [ 4%] 2025-12-04T15:22:22.7616993Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_elu_cuda PASSED [1.2559s] [ 4%] 2025-12-04T15:22:22.7617246Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_embedding_cuda PASSED [1.2628s] [ 4%] 2025-12-04T15:22:22.7617518Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hinge_embedding_loss_cuda PASSED [1.2605s] [ 4%] 2025-12-04T15:22:22.7617806Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_area_cuda 
PASSED [1.2955s] [ 4%] 2025-12-04T15:22:22.7618085Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_linear_cuda PASSED [1.2943s] [ 4%] 2025-12-04T15:22:22.7618355Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_logsigmoid_cuda PASSED [1.2761s] [ 4%] 2025-12-04T15:22:22.7618614Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_pool1d_cuda PASSED [2.7974s] [ 4%] 2025-12-04T15:22:22.7618893Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool2d_grad_cuda PASSED [1.4405s] [ 4%] 2025-12-04T15:22:22.7619153Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_mish_cuda PASSED [1.2774s] [ 4%] 2025-12-04T15:22:22.7619402Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_mse_loss_cuda PASSED [1.3023s] [ 4%] 2025-12-04T15:22:22.7619682Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multi_head_attention_forward_cuda PASSED [2.1490s] [ 4%] 2025-12-04T15:22:22.7619971Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_circular_cuda PASSED [1.2819s] [ 4%] 2025-12-04T15:22:22.7620276Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_constant_cuda PASSED [1.3132s] [ 4%] 2025-12-04T15:22:22.7620544Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_replicate_cuda PASSED [1.2552s] [ 4%] 2025-12-04T15:22:22.7620803Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pdist_cuda PASSED [1.2402s] [ 4%] 2025-12-04T15:22:22.7621065Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_poisson_nll_loss_cuda PASSED [1.4345s] [ 4%] 2025-12-04T15:22:22.7621325Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_prelu_cuda PASSED [1.2464s] [ 4%] 2025-12-04T15:22:22.7621602Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_scaled_dot_product_attention_cuda PASSED [1.4779s] [ 4%] 2025-12-04T15:22:22.7621894Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_softmin_with_dtype_cuda PASSED [1.2624s] [ 4%] 2025-12-04T15:22:22.7622190Z 
test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_unfold_cuda PASSED [1.5045s] [ 5%] 2025-12-04T15:22:22.7622455Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_upsample_nearest_cuda PASSED [1.3051s] [ 5%] 2025-12-04T15:22:22.7622719Z test_ops.py::TestCommonCUDA::test_dtypes_norm_cuda PASSED [1.3270s] [ 5%] 2025-12-04T15:22:22.7622939Z test_ops.py::TestCommonCUDA::test_dtypes_permute_cuda PASSED [1.2546s] [ 5%] 2025-12-04T15:22:22.7623162Z test_ops.py::TestCommonCUDA::test_dtypes_pinverse_cuda PASSED [1.2670s] [ 5%] 2025-12-04T15:22:22.7623405Z test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_0_cuda PASSED [1.2934s] [ 5%] 2025-12-04T15:22:22.7623685Z test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_3_cuda SKIPPED [0.0003s] (Skipped!) [ 5%] 2025-12-04T15:22:22.7623976Z test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_4_cuda SKIPPED [0.0001s] (Skipped!) [ 5%] 2025-12-04T15:22:22.7624237Z test_ops.py::TestCommonCUDA::test_dtypes_positive_cuda PASSED [1.2368s] [ 5%] 2025-12-04T15:22:22.7624458Z test_ops.py::TestCommonCUDA::test_dtypes_prod_cuda PASSED [1.6279s] [ 5%] 2025-12-04T15:22:22.7624676Z test_ops.py::TestCommonCUDA::test_dtypes_put_cuda PASSED [1.2849s] [ 5%] 2025-12-04T15:22:22.7624894Z test_ops.py::TestCommonCUDA::test_dtypes_qr_cuda PASSED [1.3076s] [ 5%] 2025-12-04T15:22:22.7625116Z test_ops.py::TestCommonCUDA::test_dtypes_randint_like_cuda PASSED [1.2908s] [ 5%] 2025-12-04T15:22:22.7625344Z test_ops.py::TestCommonCUDA::test_dtypes_ravel_cuda PASSED [1.2706s] [ 5%] 2025-12-04T15:22:22.7625563Z test_ops.py::TestCommonCUDA::test_dtypes_remainder_cuda PASSED [1.2530s] [ 5%] 2025-12-04T15:22:22.7625780Z test_ops.py::TestCommonCUDA::test_dtypes_resize__cuda XFAIL [0.0189s] [ 5%] 2025-12-04T15:22:22.7626015Z test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_amax_cuda PASSED [2.5183s] [ 5%] 2025-12-04T15:22:22.7626269Z test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_blackman_cuda PASSED [1.2568s] 
[ 5%] 2025-12-04T15:22:22.7626510Z test_ops.py::TestCommonCUDA::test_dtypes_signbit_cuda PASSED [1.2445s] [ 5%] 2025-12-04T15:22:22.7626723Z test_ops.py::TestCommonCUDA::test_dtypes_sinc_cuda PASSED [1.2127s] [ 5%] 2025-12-04T15:22:22.7626935Z test_ops.py::TestCommonCUDA::test_dtypes_sinh_cuda PASSED [1.4622s] [ 5%] 2025-12-04T15:22:22.7627156Z test_ops.py::TestCommonCUDA::test_dtypes_slice_scatter_cuda PASSED [1.2722s] [ 5%] 2025-12-04T15:22:22.7627434Z test_ops.py::TestCommonCUDA::test_dtypes_sparse_mm_reduce_cuda SKIPPED [0.0012s] (Only runs on cpu) [ 5%] 2025-12-04T15:22:22.7627694Z test_ops.py::TestCommonCUDA::test_dtypes_special_airy_ai_cuda PASSED [1.2617s] [ 5%] 2025-12-04T15:22:22.7627926Z test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_j0_cuda PASSED [1.4299s] [ 5%] 2025-12-04T15:22:22.7628178Z test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_t_cuda PASSED [1.2666s] [ 5%] 2025-12-04T15:22:22.7628425Z test_ops.py::TestCommonCUDA::test_dtypes_special_i1_cuda PASSED [1.2144s] [ 5%] 2025-12-04T15:22:22.7628642Z test_ops.py::TestCommonCUDA::test_dtypes_special_i1e_cuda PASSED [1.2717s] [ 5%] 2025-12-04T15:22:22.7628886Z test_ops.py::TestCommonCUDA::test_dtypes_special_scaled_modified_bessel_k0_cuda PASSED [1.2661s] [ 5%] 2025-12-04T15:22:22.7629141Z test_ops.py::TestCommonCUDA::test_dtypes_special_xlog1py_cuda PASSED [1.2270s] [ 5%] 2025-12-04T15:22:22.7629375Z test_ops.py::TestCommonCUDA::test_dtypes_squeeze_multiple_cuda PASSED [1.2442s] [ 5%] 2025-12-04T15:22:22.7629596Z test_ops.py::TestCommonCUDA::test_dtypes_std_mean_cuda PASSED [1.2971s] [ 5%] 2025-12-04T15:22:22.7629809Z test_ops.py::TestCommonCUDA::test_dtypes_svd_lowrank_cuda PASSED [1.5804s] [ 5%] 2025-12-04T15:22:22.7630049Z test_ops.py::TestCommonCUDA::test_dtypes_torch__scaled_mm_cuda SKIPPED [0.0003s] (Skipped!) [ 5%] 2025-12-04T15:22:22.7630347Z test_ops.py::TestCommonCUDA::test_dtypes_torch__scaled_mm_v2_cuda SKIPPED [0.0002s] (Skipped!) 
[ 5%] 2025-12-04T15:22:22.7630752Z test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__efficient_attention_forward_cuda SKIPPED [0.0010s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 5%] 2025-12-04T15:22:22.7631148Z test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__flash_attention_forward_cuda PASSED [1.2845s] [ 5%] 2025-12-04T15:22:22.7631399Z test_ops.py::TestCommonCUDA::test_dtypes_trapz_cuda PASSED [1.2736s] [ 5%] 2025-12-04T15:22:22.7631618Z test_ops.py::TestCommonCUDA::test_dtypes_tril_indices_cuda PASSED [0.0267s] [ 5%] 2025-12-04T15:22:22.7631833Z test_ops.py::TestCommonCUDA::test_dtypes_unbind_cuda PASSED [1.2254s] [ 5%] 2025-12-04T15:22:22.7632043Z test_ops.py::TestCommonCUDA::test_dtypes_unique_cuda PASSED [2.6256s] [ 5%] 2025-12-04T15:22:22.7632252Z test_ops.py::TestCommonCUDA::test_dtypes_var_mean_cuda PASSED [1.2599s] [ 5%] 2025-12-04T15:22:22.7632465Z test_ops.py::TestCommonCUDA::test_dtypes_var_unbiased_cuda PASSED [1.2740s] [ 5%] 2025-12-04T15:22:22.7632679Z test_ops.py::TestCommonCUDA::test_dtypes_vdot_cuda PASSED [1.2357s] [ 5%] 2025-12-04T15:22:22.7632887Z test_ops.py::TestCommonCUDA::test_dtypes_where_cuda PASSED [1.2560s] [ 5%] 2025-12-04T15:22:22.7633096Z test_ops.py::TestCommonCUDA::test_errors___radd___cuda PASSED [1.2325s] [ 5%] 2025-12-04T15:22:22.7633303Z test_ops.py::TestCommonCUDA::test_errors_clamp_min_cuda XFAIL [0.0037s] [ 5%] 2025-12-04T15:22:22.7633516Z test_ops.py::TestCommonCUDA::test_errors_exponential_cuda PASSED [2.4154s] [ 5%] 2025-12-04T15:22:22.7633728Z test_ops.py::TestCommonCUDA::test_errors_fft_fftn_cuda PASSED [1.2410s] [ 5%] 2025-12-04T15:22:22.7633938Z test_ops.py::TestCommonCUDA::test_errors_fft_ihfft_cuda PASSED [1.2430s] [ 5%] 2025-12-04T15:22:22.7634148Z test_ops.py::TestCommonCUDA::test_errors_fft_irfftn_cuda PASSED [1.2237s] [ 5%] 2025-12-04T15:22:22.7634358Z test_ops.py::TestCommonCUDA::test_errors_gcd_cuda PASSED [1.1988s] [ 5%] 2025-12-04T15:22:22.7634565Z 
test_ops.py::TestCommonCUDA::test_errors_ge_cuda PASSED [1.1887s] [ 5%] 2025-12-04T15:22:22.7634774Z test_ops.py::TestCommonCUDA::test_errors_hsplit_cuda PASSED [1.2265s] [ 5%] 2025-12-04T15:22:22.7634988Z test_ops.py::TestCommonCUDA::test_errors_index_select_cuda PASSED [0.0036s] [ 5%] 2025-12-04T15:22:22.7635203Z test_ops.py::TestCommonCUDA::test_errors_isclose_cuda PASSED [1.2342s] [ 5%] 2025-12-04T15:22:22.7635417Z test_ops.py::TestCommonCUDA::test_errors_logcumsumexp_cuda PASSED [1.2349s] [ 5%] 2025-12-04T15:22:22.7635655Z test_ops.py::TestCommonCUDA::test_errors_masked_fill_cuda PASSED [1.2222s] [ 5%] 2025-12-04T15:22:22.7635875Z test_ops.py::TestCommonCUDA::test_errors_masked_select_cuda PASSED [1.2123s] [ 5%] 2025-12-04T15:22:22.7636091Z test_ops.py::TestCommonCUDA::test_errors_mean_cuda PASSED [1.2327s] [ 5%] 2025-12-04T15:22:22.7636333Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool2d_cuda PASSED [1.2168s] [ 5%] 2025-12-04T15:22:22.7636600Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_avg_pool2d_cuda PASSED [0.0058s] [ 5%] 2025-12-04T15:22:22.7636851Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_hardtanh_cuda PASSED [1.2037s] [ 5%] 2025-12-04T15:22:22.7637108Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_poisson_nll_loss_cuda PASSED [1.2382s] [ 5%] 2025-12-04T15:22:22.7637346Z test_ops.py::TestCommonCUDA::test_errors_ormqr_cuda PASSED [1.2225s] [ 5%] 2025-12-04T15:22:22.7637561Z test_ops.py::TestCommonCUDA::test_errors_reshape_as_cuda PASSED [1.2011s] [ 5%] 2025-12-04T15:22:22.7637775Z test_ops.py::TestCommonCUDA::test_errors_reshape_cuda PASSED [0.0079s] [ 5%] 2025-12-04T15:22:22.7638007Z test_ops.py::TestCommonCUDA::test_errors_signal_windows_bartlett_cuda PASSED [0.0044s] [ 6%] 2025-12-04T15:22:22.7638248Z test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout0_cuda PASSED [0.0335s] [ 6%] 2025-12-04T15:22:22.7638493Z test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout2_cuda 
PASSED [0.0024s] [ 6%] 2025-12-04T15:22:22.7638757Z test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout1_cuda PASSED [0.0015s] [ 6%] 2025-12-04T15:22:22.7639011Z test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout2_cuda PASSED [0.0015s] [ 6%] 2025-12-04T15:22:22.7639281Z test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout3_cuda PASSED [0.0014s] [ 6%] 2025-12-04T15:22:22.7639541Z test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_u_cuda PASSED [0.0020s] [ 6%] 2025-12-04T15:22:22.7639783Z test_ops.py::TestCommonCUDA::test_errors_uniform_cuda PASSED [1.2344s] [ 6%] 2025-12-04T15:22:22.7639994Z test_ops.py::TestCommonCUDA::test_errors_view_cuda PASSED [0.0075s] [ 6%] 2025-12-04T15:22:22.7640240Z test_ops.py::TestCommonCUDA::test_errors_vsplit_cuda PASSED [0.0038s] [ 6%] 2025-12-04T15:22:22.7640546Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch__native_batch_norm_legit_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7640920Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addmm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7641264Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addr_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7641610Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_all_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7641956Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_angle_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7642309Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_clamp_min_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7642661Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_complex_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 
2025-12-04T15:22:22.7643020Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_diagonal_copy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7643380Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_digamma_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7643745Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_div_floor_rounding_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7644125Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_empty_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7644471Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_exp_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7644811Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_eye_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7645155Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_hfftn_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7645511Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ifftn_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7645864Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_irfftn_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7646222Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_floor_divide_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7646574Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fmax_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7646918Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_gather_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7647263Z 
test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_hstack_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7647653Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_prod_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7648017Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_isposinf_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7648380Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_kron_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7648721Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_le_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7649068Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7649457Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_norm_subgradients_at_zero_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7649850Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_solve_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7650259Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_tensorinv_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7650637Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_tensorsolve_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7651018Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_vector_norm_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7651384Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logical_and_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 6%] 
2025-12-04T15:22:22.7651731Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lt_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7652076Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_maximum_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7652418Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mul_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7652781Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mvlgamma_mvlgamma_p_1_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7653184Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mvlgamma_mvlgamma_p_3_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7653555Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nanmean_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7653909Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nansum_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7654273Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_native_batch_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7654631Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ne_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7654992Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_linear_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7655384Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_softplus_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7655754Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nonzero_cuda_float32 SKIPPED [0.0009s] (Only runs on 
cpu) [ 6%] 2025-12-04T15:22:22.7656105Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_fro_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7656501Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_permute_copy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7656874Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polygamma_polygamma_n_2_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 6%] 2025-12-04T15:22:22.7657246Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_pow_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7657602Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_remainder_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7657952Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_round_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7658296Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_rsqrt_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7658644Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7659008Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_amin_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7659391Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_mean_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7659772Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_prod_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7660180Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_softmax_cuda_float32 SKIPPED [0.0008s] (Only runs on 
cpu) [ 6%] 2025-12-04T15:22:22.7660561Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_laguerre_polynomial_l_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7660969Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_modified_bessel_i1_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T15:22:22.7661369Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T15:22:22.7661774Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_spherical_bessel_j0_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 7%] 2025-12-04T15:22:22.7662166Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_square_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T15:22:22.7662512Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_svd_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T15:22:22.7662867Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_take_along_dim_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 7%] 2025-12-04T15:22:22.7663236Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_transpose_copy_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 7%] 2025-12-04T15:22:22.7663616Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_triangular_solve_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 7%] 2025-12-04T15:22:22.7663975Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_trunc_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T15:22:22.7664326Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_view_copy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T15:22:22.7664665Z test_ops.py::TestCommonCUDA::test_multiple_devices_H_cuda_int64 
SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7664991Z test_ops.py::TestCommonCUDA::test_multiple_devices___radd___cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7665360Z test_ops.py::TestCommonCUDA::test_multiple_devices__batch_norm_with_update_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7665734Z test_ops.py::TestCommonCUDA::test_multiple_devices__chunk_cat_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7666117Z test_ops.py::TestCommonCUDA::test_multiple_devices__softmax_backward_data_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7666507Z test_ops.py::TestCommonCUDA::test_multiple_devices__unsafe_masked_index_put_accumulate_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7666871Z test_ops.py::TestCommonCUDA::test_multiple_devices_acos_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7667204Z test_ops.py::TestCommonCUDA::test_multiple_devices_addcmul_cuda_float32 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7667539Z test_ops.py::TestCommonCUDA::test_multiple_devices_addmv_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7667868Z test_ops.py::TestCommonCUDA::test_multiple_devices_addr_cuda_int64 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7668196Z test_ops.py::TestCommonCUDA::test_multiple_devices_aminmax_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7668530Z test_ops.py::TestCommonCUDA::test_multiple_devices_argmax_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7668862Z test_ops.py::TestCommonCUDA::test_multiple_devices_argmax_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 
2025-12-04T15:22:22.7669198Z test_ops.py::TestCommonCUDA::test_multiple_devices_argsort_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7669539Z test_ops.py::TestCommonCUDA::test_multiple_devices_argwhere_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7669874Z test_ops.py::TestCommonCUDA::test_multiple_devices_asinh_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7670257Z test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_and_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7670611Z test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_right_shift_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7671000Z test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_shapes_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7671353Z test_ops.py::TestCommonCUDA::test_multiple_devices_cauchy_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7671685Z test_ops.py::TestCommonCUDA::test_multiple_devices_cdist_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7672016Z test_ops.py::TestCommonCUDA::test_multiple_devices_cfloat_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7672366Z test_ops.py::TestCommonCUDA::test_multiple_devices_cholesky_inverse_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7672714Z test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_max_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7673047Z test_ops.py::TestCommonCUDA::test_multiple_devices_clone_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7673384Z 
test_ops.py::TestCommonCUDA::test_multiple_devices_column_stack_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7673718Z test_ops.py::TestCommonCUDA::test_multiple_devices_cov_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7674045Z test_ops.py::TestCommonCUDA::test_multiple_devices_cummax_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7674410Z test_ops.py::TestCommonCUDA::test_multiple_devices_cummin_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7674748Z test_ops.py::TestCommonCUDA::test_multiple_devices_deg2rad_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7675097Z test_ops.py::TestCommonCUDA::test_multiple_devices_deg2rad_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7675436Z test_ops.py::TestCommonCUDA::test_multiple_devices_diagflat_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7675779Z test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_copy_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7676131Z test_ops.py::TestCommonCUDA::test_multiple_devices_div_trunc_rounding_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7676476Z test_ops.py::TestCommonCUDA::test_multiple_devices_dot_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7676806Z test_ops.py::TestCommonCUDA::test_multiple_devices_double_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7677139Z test_ops.py::TestCommonCUDA::test_multiple_devices_empty_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7677469Z test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices 
detected) [ 7%] 2025-12-04T15:22:22.7677792Z test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7678116Z test_ops.py::TestCommonCUDA::test_multiple_devices_erfinv_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7678444Z test_ops.py::TestCommonCUDA::test_multiple_devices_exp2_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7678777Z test_ops.py::TestCommonCUDA::test_multiple_devices_expand_as_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7679106Z test_ops.py::TestCommonCUDA::test_multiple_devices_expm1_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7679431Z test_ops.py::TestCommonCUDA::test_multiple_devices_eye_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7679783Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftshift_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7680163Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7680503Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfftn_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7680845Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft2_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7681185Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfftn_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7681519Z test_ops.py::TestCommonCUDA::test_multiple_devices_fill_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7681851Z test_ops.py::TestCommonCUDA::test_multiple_devices_fliplr_cuda_float32 SKIPPED 
[0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7682185Z test_ops.py::TestCommonCUDA::test_multiple_devices_flipud_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7682517Z test_ops.py::TestCommonCUDA::test_multiple_devices_fmod_cuda_float32 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7682848Z test_ops.py::TestCommonCUDA::test_multiple_devices_gather_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7683221Z test_ops.py::TestCommonCUDA::test_multiple_devices_geometric_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7683568Z test_ops.py::TestCommonCUDA::test_multiple_devices_grid_sampler_2d_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7683924Z test_ops.py::TestCommonCUDA::test_multiple_devices_gt_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7684249Z test_ops.py::TestCommonCUDA::test_multiple_devices_half_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 7%] 2025-12-04T15:22:22.7684586Z test_ops.py::TestCommonCUDA::test_multiple_devices_heaviside_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7684925Z test_ops.py::TestCommonCUDA::test_multiple_devices_igamma_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7685270Z test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_amin_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7685626Z test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_mean_cuda_int64 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7685981Z test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_prod_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7686320Z 
test_ops.py::TestCommonCUDA::test_multiple_devices_int_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7686650Z test_ops.py::TestCommonCUDA::test_multiple_devices_isclose_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7686985Z test_ops.py::TestCommonCUDA::test_multiple_devices_isfinite_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7687319Z test_ops.py::TestCommonCUDA::test_multiple_devices_isinf_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7687653Z test_ops.py::TestCommonCUDA::test_multiple_devices_isposinf_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7687986Z test_ops.py::TestCommonCUDA::test_multiple_devices_isreal_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7688318Z test_ops.py::TestCommonCUDA::test_multiple_devices_item_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7688690Z test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_2inputs_2outputs_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7689088Z test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_4inputs_with_extra_args_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7689487Z test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_binary_return_by_ref_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7689851Z test_ops.py::TestCommonCUDA::test_multiple_devices_kron_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7690216Z test_ops.py::TestCommonCUDA::test_multiple_devices_lcm_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7690542Z test_ops.py::TestCommonCUDA::test_multiple_devices_lerp_cuda_float32 SKIPPED 
[0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7690876Z test_ops.py::TestCommonCUDA::test_multiple_devices_lgamma_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7691223Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cholesky_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7691586Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cholesky_ex_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7691945Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cond_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7692318Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cross_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7692687Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_diagonal_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7693039Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7693388Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_solve_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7693759Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_rank_hermitian_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7694174Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_pinv_singular_cuda_float32 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 8%] 2025-12-04T15:22:22.7694571Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vander_cuda_int64 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7694913Z test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_cuda_int64 SKIPPED 
[0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7695249Z test_ops.py::TestCommonCUDA::test_multiple_devices_log10_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7695578Z test_ops.py::TestCommonCUDA::test_multiple_devices_log2_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7695903Z test_ops.py::TestCommonCUDA::test_multiple_devices_log_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7696236Z test_ops.py::TestCommonCUDA::test_multiple_devices_log_normal_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7696588Z test_ops.py::TestCommonCUDA::test_multiple_devices_logcumsumexp_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7696933Z test_ops.py::TestCommonCUDA::test_multiple_devices_logical_or_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7697265Z test_ops.py::TestCommonCUDA::test_multiple_devices_long_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7697602Z test_ops.py::TestCommonCUDA::test_multiple_devices_lt_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7697936Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amin_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7698279Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amin_cuda_int64 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7698625Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumsum_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7698977Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_logsumexp_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7699331Z 
test_ops.py::TestCommonCUDA::test_multiple_devices_masked_norm_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7699681Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_scatter_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7700035Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_scatter_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7700432Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_softmax_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7700781Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_sum_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7701151Z test_ops.py::TestCommonCUDA::test_multiple_devices_matmul_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7701511Z test_ops.py::TestCommonCUDA::test_multiple_devices_max_binary_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7701850Z test_ops.py::TestCommonCUDA::test_multiple_devices_median_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7702209Z test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_variadic_tensors_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7702571Z test_ops.py::TestCommonCUDA::test_multiple_devices_min_binary_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7702923Z test_ops.py::TestCommonCUDA::test_multiple_devices_min_reduction_with_dim_cuda_int64 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7703278Z test_ops.py::TestCommonCUDA::test_multiple_devices_minimum_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7703609Z 
test_ops.py::TestCommonCUDA::test_multiple_devices_msort_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7703938Z test_ops.py::TestCommonCUDA::test_multiple_devices_msort_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7704265Z test_ops.py::TestCommonCUDA::test_multiple_devices_mv_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7704594Z test_ops.py::TestCommonCUDA::test_multiple_devices_nanmean_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7704934Z test_ops.py::TestCommonCUDA::test_multiple_devices_nanmedian_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7705283Z test_ops.py::TestCommonCUDA::test_multiple_devices_nanquantile_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7705621Z test_ops.py::TestCommonCUDA::test_multiple_devices_ne_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7705961Z test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_strided_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7706325Z test_ops.py::TestCommonCUDA::test_multiple_devices_new_ones_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7706666Z test_ops.py::TestCommonCUDA::test_multiple_devices_nextafter_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7707040Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_avg_pool2d_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7707446Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_max_pool1d_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7707840Z 
test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_batch_norm_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T15:22:22.7708226Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_channel_shuffle_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7708613Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv1d_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7709002Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cosine_embedding_loss_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7709397Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_embedding_bag_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7709805Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_embedding_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7710249Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_feature_alpha_dropout_without_train_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7710671Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_glu_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7711044Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_group_norm_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7711432Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_area_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7711815Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_kl_div_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7712184Z 
test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_l1_loss_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7712557Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_leaky_relu_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7712947Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_local_response_norm_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7713348Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool3d_grad_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7713744Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multi_margin_loss_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7714130Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_nll_loss_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7714511Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_constant_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7714904Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_unshuffle_cuda_float32 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7715299Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu6_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7715665Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu6_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7716035Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_rms_norm_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7716405Z 
test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_selu_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7716777Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_threshold_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7717149Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_unfold_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7717505Z test_ops.py::TestCommonCUDA::test_multiple_devices_nonzero_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7717840Z test_ops.py::TestCommonCUDA::test_multiple_devices_ormqr_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7718178Z test_ops.py::TestCommonCUDA::test_multiple_devices_permute_copy_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7718519Z test_ops.py::TestCommonCUDA::test_multiple_devices_permute_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7718885Z test_ops.py::TestCommonCUDA::test_multiple_devices_permute_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7719224Z test_ops.py::TestCommonCUDA::test_multiple_devices_pinverse_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7719575Z test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_4_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 9%] 2025-12-04T15:22:22.7719903Z test_ops.py::TestCommonCUDA::test_multiple_devices_prod_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7720242Z test_ops.py::TestCommonCUDA::test_multiple_devices_randint_cuda_float32 SKIPPED [0.0001s] (Skipped!) 
[ 9%] 2025-12-04T15:22:22.7720559Z test_ops.py::TestCommonCUDA::test_multiple_devices_randint_like_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7720903Z test_ops.py::TestCommonCUDA::test_multiple_devices_randint_like_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7721248Z test_ops.py::TestCommonCUDA::test_multiple_devices_reciprocal_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7721600Z test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_interleave_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7721949Z test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7722286Z test_ops.py::TestCommonCUDA::test_multiple_devices_resize__cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7722619Z test_ops.py::TestCommonCUDA::test_multiple_devices_resize__cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7722952Z test_ops.py::TestCommonCUDA::test_multiple_devices_resize_as__cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7723287Z test_ops.py::TestCommonCUDA::test_multiple_devices_round_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7723605Z test_ops.py::TestCommonCUDA::test_multiple_devices_round_decimals_3_cuda_float32 SKIPPED [0.0001s] (Skipped!) 
[ 9%] 2025-12-04T15:22:22.7723920Z test_ops.py::TestCommonCUDA::test_multiple_devices_rsub_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7724271Z test_ops.py::TestCommonCUDA::test_multiple_devices_scalar_tensor_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7724624Z test_ops.py::TestCommonCUDA::test_multiple_devices_scalar_tensor_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7724981Z test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amax_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7725339Z test_ops.py::TestCommonCUDA::test_multiple_devices_searchsorted_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7725691Z test_ops.py::TestCommonCUDA::test_multiple_devices_searchsorted_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7726028Z test_ops.py::TestCommonCUDA::test_multiple_devices_sgn_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7726357Z test_ops.py::TestCommonCUDA::test_multiple_devices_short_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7726710Z test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_bartlett_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7727059Z test_ops.py::TestCommonCUDA::test_multiple_devices_sin_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7727385Z test_ops.py::TestCommonCUDA::test_multiple_devices_sinc_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7727741Z test_ops.py::TestCommonCUDA::test_multiple_devices_sinh_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7728075Z 
test_ops.py::TestCommonCUDA::test_multiple_devices_softmax_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7728442Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j0_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7728806Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j1_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7729167Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j1_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7729518Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y1_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7729894Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_w_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7730296Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_entr_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7730646Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_erfcx_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7730994Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_erfcx_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7731337Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_i1e_cuda_int64 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7731696Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i1_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7732061Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtri_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T15:22:22.7732444Z 
test_ops.py::TestCommonCUDA::test_multiple_devices_special_polygamma_special_polygamma_n_0_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7732850Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k0_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7733269Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k1_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7733681Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7734065Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_zeta_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7734404Z test_ops.py::TestCommonCUDA::test_multiple_devices_split_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7734747Z test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7735087Z test_ops.py::TestCommonCUDA::test_multiple_devices_sqrt_cuda_int64 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7735421Z test_ops.py::TestCommonCUDA::test_multiple_devices_square_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7735752Z test_ops.py::TestCommonCUDA::test_multiple_devices_stack_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7736082Z test_ops.py::TestCommonCUDA::test_multiple_devices_sum_to_size_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7736411Z test_ops.py::TestCommonCUDA::test_multiple_devices_t_copy_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 
2025-12-04T15:22:22.7736765Z test_ops.py::TestCommonCUDA::test_multiple_devices_take_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7737101Z test_ops.py::TestCommonCUDA::test_multiple_devices_tan_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7737425Z test_ops.py::TestCommonCUDA::test_multiple_devices_tanh_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7737754Z test_ops.py::TestCommonCUDA::test_multiple_devices_trace_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7738086Z test_ops.py::TestCommonCUDA::test_multiple_devices_trapezoid_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7738435Z test_ops.py::TestCommonCUDA::test_multiple_devices_triangular_solve_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7738779Z test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_int64 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7739113Z test_ops.py::TestCommonCUDA::test_multiple_devices_unflatten_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7739453Z test_ops.py::TestCommonCUDA::test_multiple_devices_unflatten_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7739788Z test_ops.py::TestCommonCUDA::test_multiple_devices_unique_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7740169Z test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_split_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7740513Z test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7740857Z test_ops.py::TestCommonCUDA::test_multiple_devices_var_unbiased_cuda_float32 
SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7741210Z test_ops.py::TestCommonCUDA::test_multiple_devices_view_as_complex_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7741551Z test_ops.py::TestCommonCUDA::test_multiple_devices_view_as_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7741900Z test_ops.py::TestCommonCUDA::test_multiple_devices_view_copy_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7742231Z test_ops.py::TestCommonCUDA::test_multiple_devices_view_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7742561Z test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_cuda_float32 SKIPPED [0.0008s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7742887Z test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7743219Z test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_like_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T15:22:22.7743521Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_acos_cuda_bool PASSED [1.2442s] [ 10%] 2025-12-04T15:22:22.7743782Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_acosh_cuda_bool PASSED [0.0047s] [ 10%] 2025-12-04T15:22:22.7744041Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_add_cuda_bool PASSED [1.2406s] [ 10%] 2025-12-04T15:22:22.7744295Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_amin_cuda_bool PASSED [0.0088s] [ 10%] 2025-12-04T15:22:22.7744557Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_aminmax_cuda_bool PASSED [0.0045s] [ 10%] 2025-12-04T15:22:22.7744823Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_3d_cuda_bool PASSED [1.2208s] [ 10%] 2025-12-04T15:22:22.7745092Z 
test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_or_cuda_bool PASSED [0.0067s] [ 10%] 2025-12-04T15:22:22.7745398Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_broadcast_tensors_cuda_bool PASSED [1.2502s] [ 10%] 2025-12-04T15:22:22.7745679Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clamp_max_cuda_bool PASSED [0.0069s] [ 10%] 2025-12-04T15:22:22.7745964Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_column_stack_cuda_bool PASSED [1.2423s] [ 10%] 2025-12-04T15:22:22.7746241Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_contiguous_cuda_bool PASSED [0.0040s] [ 10%] 2025-12-04T15:22:22.7746509Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_copysign_cuda_bool PASSED [0.0162s] [ 10%] 2025-12-04T15:22:22.7746787Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_cuda_bool SKIPPED [0.0002s] (Skipped!) [ 10%] 2025-12-04T15:22:22.7747061Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_equal_cuda_bool PASSED [1.2241s] [ 10%] 2025-12-04T15:22:22.7747321Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_erfinv_cuda_bool PASSED [0.0055s] [ 10%] 2025-12-04T15:22:22.7747583Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expand_as_cuda_bool PASSED [1.2205s] [ 10%] 2025-12-04T15:22:22.7747847Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expand_cuda_bool PASSED [0.0059s] [ 10%] 2025-12-04T15:22:22.7748107Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fft_cuda_bool PASSED [1.3166s] [ 10%] 2025-12-04T15:22:22.7748372Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fftn_cuda_bool PASSED [2.1081s] [ 10%] 2025-12-04T15:22:22.7748634Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ihfft_cuda_bool PASSED [1.2391s] [ 10%] 2025-12-04T15:22:22.7748896Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfft_cuda_bool PASSED [0.0068s] [ 10%] 2025-12-04T15:22:22.7749154Z 
test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fill_cuda_bool PASSED [1.2226s] [ 10%] 2025-12-04T15:22:22.7749412Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flipud_cuda_bool PASSED [0.0041s] [ 10%] 2025-12-04T15:22:22.7749669Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fmin_cuda_bool PASSED [0.0056s] [ 10%] 2025-12-04T15:22:22.7749929Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_full_like_cuda_bool PASSED [1.2454s] [ 10%] 2025-12-04T15:22:22.7750230Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ge_cuda_bool PASSED [0.0068s] [ 10%] 2025-12-04T15:22:22.7750497Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_gt_cuda_bool PASSED [0.0053s] [ 10%] 2025-12-04T15:22:22.7750749Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_i0_cuda_bool PASSED [1.2353s] [ 10%] 2025-12-04T15:22:22.7751003Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isinf_cuda_bool PASSED [0.0038s] [ 10%] 2025-12-04T15:22:22.7751265Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isposinf_cuda_bool PASSED [1.2328s] [ 10%] 2025-12-04T15:22:22.7751541Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_unary_cuda_bool PASSED [0.0044s] [ 10%] 2025-12-04T15:22:22.7751829Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_linalg_diagonal_cuda_bool PASSED [1.2669s] [ 10%] 2025-12-04T15:22:22.7752108Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_or_cuda_bool PASSED [0.0067s] [ 10%] 2025-12-04T15:22:22.7752377Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_xor_cuda_bool PASSED [0.0053s] [ 10%] 2025-12-04T15:22:22.7752646Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_maximum_cuda_bool PASSED [0.0049s] [ 10%] 2025-12-04T15:22:22.7752930Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_meshgrid_list_of_tensors_cuda_bool PASSED [1.2477s] [ 11%] 2025-12-04T15:22:22.7753213Z 
test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mode_cuda_bool PASSED [0.1355s] [ 11%] 2025-12-04T15:22:22.7753467Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mul_cuda_bool PASSED [0.0056s] [ 11%] 2025-12-04T15:22:22.7753756Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nan_to_num_cuda_bool PASSED [1.2515s] [ 11%] 2025-12-04T15:22:22.7754050Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pixel_shuffle_cuda_bool PASSED [0.0049s] [ 11%] 2025-12-04T15:22:22.7754358Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ones_like_cuda_bool PASSED [1.2344s] [ 11%] 2025-12-04T15:22:22.7754626Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_reshape_as_cuda_bool PASSED [0.0050s] [ 11%] 2025-12-04T15:22:22.7754890Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_rsqrt_cuda_bool PASSED [1.2479s] [ 11%] 2025-12-04T15:22:22.7755181Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_scatter_reduce_sum_cuda_bool SKIPPED [0.0002s] (Skipped!) 
[ 11%] 2025-12-04T15:22:22.7755474Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_select_cuda_bool PASSED [1.2554s] [ 11%] 2025-12-04T15:22:22.7755734Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sgn_cuda_bool PASSED [0.0038s] [ 11%] 2025-12-04T15:22:22.7755993Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sigmoid_cuda_bool PASSED [1.1856s] [ 11%] 2025-12-04T15:22:22.7756289Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_t_cuda_bool PASSED [0.0078s] [ 11%] 2025-12-04T15:22:22.7756615Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_u_cuda_bool PASSED [0.0084s] [ 11%] 2025-12-04T15:22:22.7756940Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_w_cuda_bool PASSED [0.0073s] [ 11%] 2025-12-04T15:22:22.7757240Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_entr_cuda_bool PASSED [1.2280s] [ 11%] 2025-12-04T15:22:22.7757539Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_hermite_polynomial_h_cuda_bool PASSED [0.0108s] [ 11%] 2025-12-04T15:22:22.7757855Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_k0_cuda_bool PASSED [1.2429s] [ 11%] 2025-12-04T15:22:22.7758151Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_ndtr_cuda_bool PASSED [0.0049s] [ 11%] 2025-12-04T15:22:22.7758455Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_scaled_modified_bessel_k0_cuda_bool PASSED [1.2331s] [ 11%] 2025-12-04T15:22:22.7758775Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_spherical_bessel_j0_cuda_bool PASSED [0.0063s] [ 11%] 2025-12-04T15:22:22.7759095Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_with_sizes_copy_cuda_bool PASSED [1.2162s] [ 11%] 2025-12-04T15:22:22.7759390Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_triu_cuda_bool SKIPPED [0.0002s] 
(Skipped!) [ 11%] 2025-12-04T15:22:22.7759675Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unsqueeze_copy_cuda_bool PASSED [1.2343s] [ 11%] 2025-12-04T15:22:22.7759948Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_view_copy_cuda_bool PASSED [0.0054s] [ 11%] 2025-12-04T15:22:22.7760258Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_vsplit_cuda_bool PASSED [1.2533s] [ 11%] 2025-12-04T15:22:22.7760518Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_where_cuda_bool PASSED [0.0061s] [ 11%] 2025-12-04T15:22:22.7760776Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_xlogy_cuda_bool PASSED [0.0061s] [ 11%] 2025-12-04T15:22:22.7761035Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_complex64 PASSED [1.2724s] [ 11%] 2025-12-04T15:22:22.7761292Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_float32 PASSED [1.2207s] [ 11%] 2025-12-04T15:22:22.7761545Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_float32 PASSED [1.2500s] [ 11%] 2025-12-04T15:22:22.7761806Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___radd___cuda_int64 PASSED [0.0059s] [ 11%] 2025-12-04T15:22:22.7762070Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_float32 PASSED [0.0107s] [ 11%] 2025-12-04T15:22:22.7762365Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_int64 PASSED [0.0053s] [ 11%] 2025-12-04T15:22:22.7762637Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmatmul___cuda_float32 PASSED [1.2739s] [ 11%] 2025-12-04T15:22:22.7762921Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___ror___cuda_int64 PASSED [0.0064s] [ 11%] 2025-12-04T15:22:22.7763207Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples__softmax_backward_data_cuda_float32 PASSED [1.2560s] [ 11%] 2025-12-04T15:22:22.7763517Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_cuda_complex64 PASSED [0.0123s] [ 11%] 
2025-12-04T15:22:22.7763840Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_put_accumulate_cuda_int64 PASSED [1.2290s] [ 11%] 2025-12-04T15:22:22.7764138Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acos_cuda_complex64 PASSED [0.1607s] [ 11%] 2025-12-04T15:22:22.7764404Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcdiv_cuda_float32 PASSED [1.2331s] [ 11%] 2025-12-04T15:22:22.7764670Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_complex64 PASSED [0.5220s] [ 11%] 2025-12-04T15:22:22.7764939Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_int64 PASSED [1.2266s] [ 11%] 2025-12-04T15:22:22.7765199Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_cuda_float32 PASSED [0.0116s] [ 11%] 2025-12-04T15:22:22.7765476Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_decomposed_cuda_float32 PASSED [1.2498s] [ 11%] 2025-12-04T15:22:22.7765753Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addr_cuda_float32 PASSED [0.0239s] [ 11%] 2025-12-04T15:22:22.7766020Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_alias_copy_cuda_float32 PASSED [1.2350s] [ 11%] 2025-12-04T15:22:22.7766286Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_angle_cuda_int64 PASSED [0.0041s] [ 11%] 2025-12-04T15:22:22.7766555Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_any_cuda_complex64 PASSED [1.2525s] [ 11%] 2025-12-04T15:22:22.7766825Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argsort_cuda_float32 PASSED [0.0130s] [ 11%] 2025-12-04T15:22:22.7767113Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_copy_cuda_complex64 XFAIL [0.0038s] [ 11%] 2025-12-04T15:22:22.7767413Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_cuda_int64 XFAIL [1.2274s] [ 11%] 2025-12-04T15:22:22.7767711Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_partial_views_cuda_complex64 XFAIL [0.0041s] [ 
11%] 2025-12-04T15:22:22.7768010Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_float32 PASSED [2.4481s] [ 11%] 2025-12-04T15:22:22.7768277Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_float32 PASSED [1.2190s] [ 11%] 2025-12-04T15:22:22.7768543Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_int64 PASSED [1.2199s] [ 11%] 2025-12-04T15:22:22.7768817Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_3d_cuda_float32 PASSED [0.0093s] [ 11%] 2025-12-04T15:22:22.7769093Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bfloat16_cuda_int64 PASSED [1.2351s] [ 11%] 2025-12-04T15:22:22.7769367Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bincount_cuda_int64 PASSED [0.0191s] [ 11%] 2025-12-04T15:22:22.7769641Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_not_cuda_int64 PASSED [1.2397s] [ 11%] 2025-12-04T15:22:22.7769928Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_int64 PASSED [0.0042s] [ 11%] 2025-12-04T15:22:22.7770259Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_to_cuda_int64 PASSED [1.2321s] [ 11%] 2025-12-04T15:22:22.7770549Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_complex64 PASSED [0.0073s] [ 11%] 2025-12-04T15:22:22.7770866Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_int64 PASSED [1.2398s] [ 11%] 2025-12-04T15:22:22.7771159Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cat_cuda_complex64 PASSED [0.0112s] [ 11%] 2025-12-04T15:22:22.7771444Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cdist_cuda_float32 PASSED [0.2046s] [ 11%] 2025-12-04T15:22:22.7771712Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cfloat_cuda_int64 PASSED [1.2618s] [ 12%] 2025-12-04T15:22:22.7771980Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chalf_cuda_float32 PASSED [0.0081s] [ 12%] 2025-12-04T15:22:22.7772249Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_complex64 PASSED [1.2834s] [ 12%] 2025-12-04T15:22:22.7772517Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_float32 PASSED [0.0051s] [ 12%] 2025-12-04T15:22:22.7772787Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_cuda_float32 PASSED [0.0151s] [ 12%] 2025-12-04T15:22:22.7773063Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_max_cuda_float32 PASSED [0.0097s] [ 12%] 2025-12-04T15:22:22.7773340Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_min_cuda_float32 PASSED [0.0093s] [ 12%] 2025-12-04T15:22:22.7773616Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_complex64 PASSED [1.2906s] [ 12%] 2025-12-04T15:22:22.7773887Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_float32 PASSED [0.0055s] [ 12%] 2025-12-04T15:22:22.7774167Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_column_stack_cuda_complex64 PASSED [1.2932s] [ 12%] 2025-12-04T15:22:22.7774461Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_combinations_cuda_complex64 PASSED [0.0248s] [ 12%] 2025-12-04T15:22:22.7774753Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_float32 PASSED [1.2849s] [ 12%] 2025-12-04T15:22:22.7775039Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_corrcoef_cuda_complex64 PASSED [0.0110s] [ 12%] 2025-12-04T15:22:22.7775316Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_float32 PASSED [1.2781s] [ 12%] 2025-12-04T15:22:22.7775582Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_int64 PASSED [0.0050s] [ 12%] 2025-12-04T15:22:22.7775854Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_complex64 PASSED [1.2485s] [ 12%] 2025-12-04T15:22:22.7776128Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumsum_cuda_float32 PASSED [0.0073s] [ 12%] 2025-12-04T15:22:22.7776443Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumulative_trapezoid_cuda_float32 PASSED [1.2695s] [ 12%] 2025-12-04T15:22:22.7776753Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumulative_trapezoid_cuda_int64 PASSED [0.0079s] [ 12%] 2025-12-04T15:22:22.7777052Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_copy_cuda_int64 PASSED [1.2538s] [ 12%] 2025-12-04T15:22:22.7777334Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_cuda_int64 PASSED [0.0071s] [ 12%] 2025-12-04T15:22:22.7777618Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_scatter_cuda_int64 PASSED [1.2808s] [ 12%] 2025-12-04T15:22:22.7777916Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_floor_rounding_cuda_float32 PASSED [0.0128s] [ 12%] 2025-12-04T15:22:22.7778221Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_no_rounding_mode_cuda_float32 PASSED [0.0102s] [ 12%] 2025-12-04T15:22:22.7778516Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_double_cuda_complex64 PASSED [1.2785s] [ 12%] 2025-12-04T15:22:22.7778792Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dsplit_cuda_complex64 PASSED [0.0067s] [ 12%] 2025-12-04T15:22:22.7779065Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dsplit_cuda_float32 PASSED [1.2843s] [ 12%] 2025-12-04T15:22:22.7779337Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dstack_cuda_complex64 PASSED [0.0068s] [ 12%] 2025-12-04T15:22:22.7779621Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dstack_cuda_int64 PASSED [1.2682s] [ 12%] 2025-12-04T15:22:22.7779929Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_strided_cuda_int64 SKIPPED [0.0003s] (Skipped!) 
[ 12%] 2025-12-04T15:22:22.7780273Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_equal_cuda_float32 PASSED [1.2549s] [ 12%] 2025-12-04T15:22:22.7780547Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_copy_cuda_int64 PASSED [0.0060s] [ 12%] 2025-12-04T15:22:22.7780830Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 12%] 2025-12-04T15:22:22.7781105Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_int64 PASSED [1.2430s] [ 12%] 2025-12-04T15:22:22.7781376Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft_cuda_float32 PASSED [1.8898s] [ 12%] 2025-12-04T15:22:22.7781644Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft_cuda_int64 PASSED [1.2792s] [ 12%] 2025-12-04T15:22:22.7781916Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_complex64 PASSED [1.3223s] [ 12%] 2025-12-04T15:22:22.7782185Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_float32 PASSED [1.2499s] [ 12%] 2025-12-04T15:22:22.7782450Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft_cuda_float32 PASSED [1.2691s] [ 12%] 2025-12-04T15:22:22.7782722Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfftn_cuda_complex64 PASSED [1.2730s] [ 12%] 2025-12-04T15:22:22.7783003Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft2_cuda_complex64 PASSED [1.2759s] [ 12%] 2025-12-04T15:22:22.7783279Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_float32 PASSED [1.2918s] [ 12%] 2025-12-04T15:22:22.7783552Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftn_cuda_complex64 PASSED [1.2895s] [ 12%] 2025-12-04T15:22:22.7783828Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft2_cuda_float32 PASSED [1.3039s] [ 12%] 2025-12-04T15:22:22.7784110Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfftn_cuda_complex64 PASSED [1.6811s] [ 12%] 
2025-12-04T15:22:22.7784386Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft_cuda_int64 PASSED [1.2578s] [ 12%] 2025-12-04T15:22:22.7784655Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flatten_cuda_float32 PASSED [0.0079s] [ 12%] 2025-12-04T15:22:22.7784937Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flip_cuda_complex64 PASSED [0.0076s] [ 12%] 2025-12-04T15:22:22.7785205Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_cuda_float32 PASSED [1.2788s] [ 12%] 2025-12-04T15:22:22.7785469Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_cuda_float32 PASSED [0.0049s] [ 12%] 2025-12-04T15:22:22.7785741Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_divide_cuda_float32 PASSED [0.0064s] [ 12%] 2025-12-04T15:22:22.7786015Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmax_cuda_int64 PASSED [0.0046s] [ 12%] 2025-12-04T15:22:22.7786280Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_cuda_float32 PASSED [1.2756s] [ 12%] 2025-12-04T15:22:22.7786554Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gradient_cuda_complex64 PASSED [0.0189s] [ 12%] 2025-12-04T15:22:22.7786826Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gt_cuda_float32 PASSED [0.0050s] [ 12%] 2025-12-04T15:22:22.7787088Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_half_cuda_float32 PASSED [1.2885s] [ 12%] 2025-12-04T15:22:22.7787350Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_histc_cuda_float32 PASSED [0.0345s] [ 12%] 2025-12-04T15:22:22.7787613Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_histc_cuda_int64 PASSED [1.3078s] [ 12%] 2025-12-04T15:22:22.7787878Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_float32 PASSED [0.0064s] [ 12%] 2025-12-04T15:22:22.7788142Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hypot_cuda_float32 PASSED [0.0094s] [ 12%] 2025-12-04T15:22:22.7788437Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_float32 PASSED [1.2898s] [ 12%] 2025-12-04T15:22:22.7788714Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_complex64 PASSED [0.0092s] [ 12%] 2025-12-04T15:22:22.7789019Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_mean_cuda_float32 PASSED [1.2773s] [ 12%] 2025-12-04T15:22:22.7789312Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_mean_cuda_int64 PASSED [0.0072s] [ 12%] 2025-12-04T15:22:22.7789605Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_prod_cuda_float32 PASSED [1.2776s] [ 12%] 2025-12-04T15:22:22.7789882Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_int_cuda_int64 PASSED [0.0062s] [ 12%] 2025-12-04T15:22:22.7790181Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_int64 PASSED [1.2692s] [ 12%] 2025-12-04T15:22:22.7790443Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isfinite_cuda_int64 PASSED [0.0045s] [ 13%] 2025-12-04T15:22:22.7790709Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isinf_cuda_complex64 PASSED [1.2133s] [ 13%] 2025-12-04T15:22:22.7790980Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_complex64 PASSED [0.0039s] [ 13%] 2025-12-04T15:22:22.7791242Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_int64 PASSED [1.2736s] [ 13%] 2025-12-04T15:22:22.7791503Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isneginf_cuda_int64 PASSED [0.0039s] [ 13%] 2025-12-04T15:22:22.7791768Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isposinf_cuda_float32 PASSED [1.2805s] [ 13%] 2025-12-04T15:22:22.7792044Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_unary_cuda_int64 PASSED [1.5007s] [ 13%] 2025-12-04T15:22:22.7792317Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_float32 PASSED [1.2821s] [ 13%] 2025-12-04T15:22:22.7792597Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_cuda_complex64 PASSED [0.0164s] [ 13%] 2025-12-04T15:22:22.7792883Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_cuda_float32 PASSED [0.0112s] [ 13%] 2025-12-04T15:22:22.7793170Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_complex64 PASSED [1.3024s] [ 13%] 2025-12-04T15:22:22.7793464Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_float32 PASSED [0.0128s] [ 13%] 2025-12-04T15:22:22.7793771Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_int64 PASSED [1.2603s] [ 13%] 2025-12-04T15:22:22.7794059Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvalsh_cuda_float32 PASSED [0.0736s] [ 13%] 2025-12-04T15:22:22.7794342Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_cuda_complex64 PASSED [0.0099s] [ 13%] 2025-12-04T15:22:22.7794636Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_factor_ex_cuda_complex64 PASSED [1.3816s] [ 13%] 2025-12-04T15:22:22.7794952Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_grad_oriented_cuda_float32 PASSED [0.2153s] [ 13%] 2025-12-04T15:22:22.7795258Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_solve_cuda_complex64 PASSED [0.2344s] [ 13%] 2025-12-04T15:22:22.7795555Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_norm_cuda_complex64 PASSED [0.0553s] [ 13%] 2025-12-04T15:22:22.7795855Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_norm_cuda_float32 PASSED [1.3363s] [ 13%] 2025-12-04T15:22:22.7796152Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_power_cuda_float32 PASSED [1.3213s] [ 13%] 2025-12-04T15:22:22.7796450Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_cuda_float32 PASSED [1.3045s] [ 13%] 2025-12-04T15:22:22.7796759Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_hermitian_cuda_float32 PASSED [1.2772s] [ 13%] 2025-12-04T15:22:22.7797087Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_cuda_complex64 PASSED [1.3370s] [ 13%] 2025-12-04T15:22:22.7797369Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_svd_cuda_complex64 PASSED [1.5345s] [ 13%] 2025-12-04T15:22:22.7797668Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_svdvals_cuda_complex64 PASSED [0.0283s] [ 13%] 2025-12-04T15:22:22.7797964Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorinv_cuda_complex64 PASSED [1.2919s] [ 13%] 2025-12-04T15:22:22.7798246Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_complex64 PASSED [1.5238s] [ 13%] 2025-12-04T15:22:22.7798506Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_float32 PASSED [1.2416s] [ 13%] 2025-12-04T15:22:22.7798789Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_softmax_with_dtype_cuda_float32 PASSED [1.2452s] [ 13%] 2025-12-04T15:22:22.7799078Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logdet_cuda_complex64 PASSED [0.0155s] [ 13%] 2025-12-04T15:22:22.7799352Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_int64 PASSED [0.0051s] [ 13%] 2025-12-04T15:22:22.7799628Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_float32 PASSED [1.2365s] [ 13%] 2025-12-04T15:22:22.7799895Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logit_cuda_int64 PASSED [0.0052s] [ 13%] 2025-12-04T15:22:22.7800200Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_float32 PASSED [0.0938s] [ 13%] 2025-12-04T15:22:22.7800467Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_int64 PASSED [0.0646s] [ 13%] 2025-12-04T15:22:22.7800729Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lt_cuda_float32 PASSED [0.0045s] [ 13%] 
2025-12-04T15:22:22.7800996Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amax_cuda_float32 PASSED [0.0575s] [ 13%] 2025-12-04T15:22:22.7801277Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmax_cuda_float32 PASSED [0.0237s] [ 13%] 2025-12-04T15:22:22.7801559Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_float32 PASSED [1.2592s] [ 13%] 2025-12-04T15:22:22.7801841Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_median_cuda_float32 PASSED [0.0170s] [ 13%] 2025-12-04T15:22:22.7802120Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_prod_cuda_float32 PASSED [0.0663s] [ 13%] 2025-12-04T15:22:22.7802423Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_select_cuda_int64 PASSED [1.2193s] [ 13%] 2025-12-04T15:22:22.7802696Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_sum_cuda_int64 PASSED [0.0281s] [ 13%] 2025-12-04T15:22:22.7802971Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_var_cuda_complex64 PASSED [0.0799s] [ 13%] 2025-12-04T15:22:22.7803245Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_var_cuda_int64 PASSED [0.0471s] [ 13%] 2025-12-04T15:22:22.7803548Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_pool2d_with_indices_backward_cuda_float32 PASSED [0.8644s] [ 13%] 2025-12-04T15:22:22.7803867Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_with_dim_cuda_int64 PASSED [0.0038s] [ 13%] 2025-12-04T15:22:22.7804150Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mean_cuda_complex64 PASSED [1.2775s] [ 13%] 2025-12-04T15:22:22.7804416Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_median_cuda_int64 PASSED [0.0076s] [ 13%] 2025-12-04T15:22:22.7804702Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_float32 PASSED [0.0130s] [ 13%] 2025-12-04T15:22:22.7805009Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_int64 PASSED [1.2582s] [ 13%] 2025-12-04T15:22:22.7805323Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_variadic_tensors_cuda_complex64 PASSED [0.0161s] [ 13%] 2025-12-04T15:22:22.7805646Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_complex64 PASSED [0.0100s] [ 13%] 2025-12-04T15:22:22.7805910Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mv_cuda_complex64 PASSED [1.2450s] [ 13%] 2025-12-04T15:22:22.7806207Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_3_cuda_int64 PASSED [0.0076s] [ 13%] 2025-12-04T15:22:22.7806495Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmean_cuda_complex64 PASSED [0.0276s] [ 13%] 2025-12-04T15:22:22.7806768Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmedian_cuda_float32 PASSED [0.0110s] [ 13%] 2025-12-04T15:22:22.7807036Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmedian_cuda_int64 PASSED [1.2768s] [ 13%] 2025-12-04T15:22:22.7807303Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nansum_cuda_complex64 PASSED [0.0262s] [ 13%] 2025-12-04T15:22:22.7807594Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_dropout_backward_cuda_float32 PASSED [0.0083s] [ 13%] 2025-12-04T15:22:22.7807880Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_float32 PASSED [0.0045s] [ 13%] 2025-12-04T15:22:22.7808162Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_cuda_float32 SKIPPED [0.0003s] (Skipped!) 
[ 13%] 2025-12-04T15:22:22.7808453Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_full_cuda_complex64 PASSED [1.2491s] [ 13%] 2025-12-04T15:22:22.7808728Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_full_cuda_float32 PASSED [0.0065s] [ 13%] 2025-12-04T15:22:22.7808996Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_int64 PASSED [1.2570s] [ 13%] 2025-12-04T15:22:22.7809265Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nextafter_cuda_float32 PASSED [0.0285s] [ 13%] 2025-12-04T15:22:22.7809569Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool3d_cuda_float32 PASSED [0.0097s] [ 14%] 2025-12-04T15:22:22.7809910Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool3d_cuda_float32 PASSED [1.2986s] [ 14%] 2025-12-04T15:22:22.7810280Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_alpha_dropout_cuda_float32 PASSED [0.0207s] [ 14%] 2025-12-04T15:22:22.7810597Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool1d_cuda_float32 PASSED [0.0149s] [ 14%] 2025-12-04T15:22:22.7810925Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool2d_cuda_float32 PASSED [1.2863s] [ 14%] 2025-12-04T15:22:22.7811245Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_channel_shuffle_cuda_float32 PASSED [0.0064s] [ 14%] 2025-12-04T15:22:22.7811568Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_channel_shuffle_cuda_int64 PASSED [1.2656s] [ 14%] 2025-12-04T15:22:22.7812076Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv3d_cuda_complex64 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x7e1a9f826a00 size: 11008 2025-12-04T15:22:22.7812624Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 
0x7e1a9f826a00 size: 11008 2025-12-04T15:22:22.7813041Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x7e1a9f80f400 size: 11008 2025-12-04T15:22:22.7813457Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x7e1a9f80f400 size: 11008 2025-12-04T15:22:22.7813879Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x7e1a9f80f400 size: 11008 2025-12-04T15:22:22.7814311Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x7e1a9f80f400 size: 11008 2025-12-04T15:22:22.7814761Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x7e1a9f803200 size: 12544 2025-12-04T15:22:22.7815193Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x7e1a9f803200 size: 12544 2025-12-04T15:22:22.7815609Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x7e1a9f80b200 size: 12544 2025-12-04T15:22:22.7820932Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x7e1a9f80b200 size: 12544 2025-12-04T15:22:22.7821382Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x7e1a9f80b200 size: 12544 2025-12-04T15:22:22.7821822Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x7e1a9f80b200 size: 12544 2025-12-04T15:22:22.7822102Z PASSED [0.1629s] [ 14%] 2025-12-04T15:22:22.7822327Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose1d_cuda_complex64 PASSED [0.0216s] [ 14%] 2025-12-04T15:22:22.7822673Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose1d_cuda_float32 PASSED [1.3007s] [ 14%] 2025-12-04T15:22:22.7823009Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_similarity_cuda_float32 PASSED [0.0148s] [ 14%] 2025-12-04T15:22:22.7823335Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_dropout2d_cuda_float32 PASSED [0.0135s] [ 14%] 2025-12-04T15:22:22.7823685Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_without_train_cuda_complex64 PASSED [1.2595s] [ 14%] 2025-12-04T15:22:22.7824031Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardtanh_cuda_int64 PASSED [0.0052s] [ 14%] 2025-12-04T15:22:22.7824356Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [1.2860s] [ 14%] 2025-12-04T15:22:22.7824699Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_bilinear_cuda_float32 PASSED [0.0262s] [ 14%] 2025-12-04T15:22:22.7825031Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_leaky_relu_cuda_float32 PASSED [0.0159s] [ 14%] 2025-12-04T15:22:22.7825378Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_linear_cuda_complex64 PASSED [1.2728s] [ 14%] 2025-12-04T15:22:22.7825695Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_logsigmoid_cuda_float32 PASSED [0.0063s] [ 14%] 2025-12-04T15:22:22.7826024Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool1d_grad_cuda_float32 PASSED [0.0312s] [ 14%] 2025-12-04T15:22:22.7826353Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool2d_cuda_float32 PASSED [0.1945s] [ 14%] 2025-12-04T15:22:22.7826674Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool3d_cuda_float32 PASSED [0.0743s] [ 14%] 2025-12-04T15:22:22.7827038Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multi_head_attention_forward_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 14%] 2025-12-04T15:22:22.7827388Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_one_hot_cuda_int64 PASSED [1.2775s] [ 14%] 2025-12-04T15:22:22.7827697Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_constant_cuda_float32 PASSED [0.0314s] [ 14%] 2025-12-04T15:22:22.7828018Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_complex64 PASSED [1.3065s] [ 14%] 2025-12-04T15:22:22.7828334Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_int64 PASSED [0.0069s] [ 14%] 2025-12-04T15:22:22.7828682Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_shuffle_cuda_float32 PASSED [0.0053s] [ 14%] 2025-12-04T15:22:22.7828996Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_prelu_cuda_float32 PASSED [1.2605s] [ 14%] 2025-12-04T15:22:22.7829315Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu6_cuda_float32 PASSED [0.0156s] [ 14%] 2025-12-04T15:22:22.7829616Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu_cuda_float32 PASSED [1.2682s] [ 14%] 2025-12-04T15:22:22.7829915Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_silu_cuda_float32 PASSED [1.2641s] [ 14%] 2025-12-04T15:22:22.7830287Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_with_dtype_cuda_float32 PASSED [1.2638s] [ 14%] 2025-12-04T15:22:22.7830624Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_with_dtype_cuda_int64 PASSED [1.2159s] [ 14%] 2025-12-04T15:22:22.7830955Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softshrink_cuda_float32 PASSED [0.0163s] [ 14%] 2025-12-04T15:22:22.7831298Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_with_distance_loss_cuda_int64 PASSED [1.2613s] [ 14%] 2025-12-04T15:22:22.7831637Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_unfold_cuda_float32 PASSED [0.0847s] [ 14%] 2025-12-04T15:22:22.7831928Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_inf_cuda_float32 PASSED [1.2738s] [ 14%] 2025-12-04T15:22:22.7832202Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ormqr_cuda_complex64 PASSED [0.2710s] [ 14%] 2025-12-04T15:22:22.7832471Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_complex64 PASSED [1.2628s] [ 14%] 2025-12-04T15:22:22.7832743Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_float32 PASSED [0.0049s] [ 14%] 2025-12-04T15:22:22.7833015Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_complex64 PASSED [1.2769s] [ 14%] 2025-12-04T15:22:22.7833288Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_float32 PASSED [0.0066s] [ 14%] 2025-12-04T15:22:22.7833578Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_0_cuda_float32 PASSED [1.9738s] [ 14%] 2025-12-04T15:22:22.7833911Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_1_cuda_int64 SKIPPED [0.0003s] (Skipped!) [ 14%] 2025-12-04T15:22:22.7834271Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_4_cuda_int64 SKIPPED [0.0002s] (Skipped!) 
[ 14%] 2025-12-04T15:22:22.7834577Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_float32 PASSED [0.0124s] [ 14%] 2025-12-04T15:22:22.7834842Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_int64 PASSED [0.0047s] [ 14%] 2025-12-04T15:22:22.7835108Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_prod_cuda_float32 PASSED [0.0268s] [ 14%] 2025-12-04T15:22:22.7835382Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_put_cuda_complex64 PASSED [0.0228s] [ 14%] 2025-12-04T15:22:22.7835650Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_qr_cuda_complex64 PASSED [0.0374s] [ 14%] 2025-12-04T15:22:22.7835927Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rand_like_cuda_complex64 PASSED [1.2741s] [ 14%] 2025-12-04T15:22:22.7836204Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rand_like_cuda_float32 PASSED [0.0087s] [ 14%] 2025-12-04T15:22:22.7836519Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_cuda_float32 SKIPPED [0.0002s] (Test expects tensor input) [ 14%] 2025-12-04T15:22:22.7836836Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_like_cuda_int64 PASSED [0.0100s] [ 14%] 2025-12-04T15:22:22.7837148Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_cuda_float32 SKIPPED [0.0001s] (Test expects tensor input) [ 14%] 2025-12-04T15:22:22.7837479Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_like_cuda_complex64 PASSED [1.2557s] [ 14%] 2025-12-04T15:22:22.7837774Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_real_cuda_int64 PASSED [0.0043s] [ 14%] 2025-12-04T15:22:22.7838070Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_complex64 PASSED [1.2469s] [ 14%] 2025-12-04T15:22:22.7838351Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_remainder_cuda_int64 PASSED [0.0069s] [ 14%] 2025-12-04T15:22:22.7838623Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_cuda_int64 PASSED [1.2237s] 
[ 14%] 2025-12-04T15:22:22.7838888Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize__cuda_float32 PASSED [0.0051s] [ 14%] 2025-12-04T15:22:22.7839159Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_float32 PASSED [1.2540s] [ 14%] 2025-12-04T15:22:22.7839449Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_conj_cuda_complex64 PASSED [1.2085s] [ 14%] 2025-12-04T15:22:22.7839734Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_int64 PASSED [1.2862s] [ 14%] 2025-12-04T15:22:22.7839852Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_roll_cuda_complex64 PASSED [1.2895s] [ 14%] 2025-12-04T15:22:22.7839966Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_roll_cuda_float32 PASSED [1.2685s] [ 14%] 2025-12-04T15:22:22.7840082Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rot90_cuda_int64 PASSED [0.0115s] [ 15%] 2025-12-04T15:22:22.7840268Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_decimals_3_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 15%] 2025-12-04T15:22:22.7840384Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsub_cuda_complex64 PASSED [0.0115s] [ 15%] 2025-12-04T15:22:22.7840528Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scalar_tensor_cuda_int64 SKIPPED [0.0001s] (Skipped!) [ 15%] 2025-12-04T15:22:22.7840645Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_cuda_complex64 PASSED [0.0103s] [ 15%] 2025-12-04T15:22:22.7840800Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_cosine_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 15%] 2025-12-04T15:22:22.7840957Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_exponential_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 15%] 2025-12-04T15:22:22.7841122Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_general_cosine_cuda_float32 SKIPPED [0.0001s] (Skipped!) 
[ 15%] 2025-12-04T15:22:22.7841289Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_kaiser_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 15%] 2025-12-04T15:22:22.7841420Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_with_dtype_cuda_int64 PASSED [1.2141s] [ 15%] 2025-12-04T15:22:22.7841573Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_sampled_addmm_cuda_complex64 SKIPPED [0.0003s] (Skipped!) [ 15%] 2025-12-04T15:22:22.7841724Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 15%] 2025-12-04T15:22:22.7841858Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y0_cuda_float32 PASSED [1.6628s] [ 15%] 2025-12-04T15:22:22.7841990Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y1_cuda_int64 PASSED [1.5279s] [ 15%] 2025-12-04T15:22:22.7842116Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_entr_cuda_float32 PASSED [1.4708s] [ 15%] 2025-12-04T15:22:22.7842238Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1e_cuda_float32 PASSED [1.5037s] [ 15%] 2025-12-04T15:22:22.7842359Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1e_cuda_int64 PASSED [1.4438s] [ 15%] 2025-12-04T15:22:22.7842502Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_legendre_polynomial_p_cuda_int64 PASSED [0.0107s] [ 15%] 2025-12-04T15:22:22.7842629Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_log_ndtr_cuda_float32 PASSED [1.5722s] [ 15%] 2025-12-04T15:22:22.7842797Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i0_cuda_float32 PASSED [1.5008s] [ 15%] 2025-12-04T15:22:22.7842940Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k1_cuda_float32 PASSED [1.3778s] [ 15%] 2025-12-04T15:22:22.7843107Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k1_cuda_float32 PASSED [1.5201s] [ 15%] 2025-12-04T15:22:22.7843262Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_u_cuda_int64 PASSED [0.0098s] [ 15%] 2025-12-04T15:22:22.7843421Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_v_cuda_int64 PASSED [0.0073s] [ 15%] 2025-12-04T15:22:22.7843575Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_w_cuda_int64 PASSED [0.0067s] [ 15%] 2025-12-04T15:22:22.7843706Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_xlog1py_cuda_float32 PASSED [0.0097s] [ 15%] 2025-12-04T15:22:22.7843822Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_cuda_float32 PASSED [1.2153s] [ 15%] 2025-12-04T15:22:22.7843952Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_cuda_float32 PASSED [0.0078s] [ 15%] 2025-12-04T15:22:22.7844063Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sqrt_cuda_int64 PASSED [1.2142s] [ 15%] 2025-12-04T15:22:22.7844183Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_complex64 PASSED [0.0109s] [ 15%] 2025-12-04T15:22:22.7844296Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_cuda_complex64 PASSED [0.0124s] [ 15%] 2025-12-04T15:22:22.7844429Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_unbiased_cuda_complex64 PASSED [1.2316s] [ 15%] 2025-12-04T15:22:22.7844553Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_unbiased_cuda_complex64 PASSED [0.0059s] [ 15%] 2025-12-04T15:22:22.7844665Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_int64 PASSED [1.2313s] [ 15%] 2025-12-04T15:22:22.7844786Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_to_size_cuda_complex64 PASSED [0.0140s] [ 15%] 2025-12-04T15:22:22.7844897Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_cuda_float32 PASSED [1.2097s] [ 15%] 2025-12-04T15:22:22.7845023Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_float32 PASSED [0.0087s] [ 15%] 2025-12-04T15:22:22.7845160Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_int64 PASSED [1.2050s] [ 15%] 2025-12-04T15:22:22.7845277Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_cuda_complex64 PASSED [0.0108s] [ 15%] 2025-12-04T15:22:22.7845390Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tan_cuda_complex64 PASSED [1.2129s] [ 15%] 2025-12-04T15:22:22.7845503Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tanh_cuda_float32 PASSED [0.0048s] [ 15%] 2025-12-04T15:22:22.7845614Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tanh_cuda_int64 PASSED [1.1978s] [ 15%] 2025-12-04T15:22:22.7845738Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensor_split_cuda_float32 PASSED [0.0139s] [ 15%] 2025-12-04T15:22:22.7845849Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tile_cuda_float32 PASSED [0.0176s] [ 15%] 2025-12-04T15:22:22.7845978Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_copy_cuda_float32 PASSED [1.2479s] [ 15%] 2025-12-04T15:22:22.7846099Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_complex64 PASSED [0.0095s] [ 15%] 2025-12-04T15:22:22.7846221Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_complex64 PASSED [1.2643s] [ 15%] 2025-12-04T15:22:22.7846357Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triu_indices_cuda_int64 SKIPPED [0.0003s] (Skipped!) 
[ 15%] 2025-12-04T15:22:22.7846479Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_true_divide_cuda_float32 PASSED [0.0121s] [ 15%] 2025-12-04T15:22:22.7846617Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_true_divide_cuda_int64 PASSED [0.0060s] [ 15%] 2025-12-04T15:22:22.7846731Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unbind_cuda_int64 PASSED [1.2402s] [ 15%] 2025-12-04T15:22:22.7846859Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unflatten_cuda_int64 PASSED [0.0060s] [ 15%] 2025-12-04T15:22:22.7846975Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_cuda_complex64 PASSED [0.0232s] [ 15%] 2025-12-04T15:22:22.7847092Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_uniform_cuda_complex64 PASSED [1.2596s] [ 15%] 2025-12-04T15:22:22.7847205Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unique_cuda_float32 PASSED [0.2897s] [ 15%] 2025-12-04T15:22:22.7847329Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_copy_cuda_int64 PASSED [0.0045s] [ 15%] 2025-12-04T15:22:22.7847449Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_complex64 PASSED [1.2429s] [ 15%] 2025-12-04T15:22:22.7847563Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_cuda_float32 PASSED [0.0132s] [ 15%] 2025-12-04T15:22:22.7847686Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_unbiased_cuda_complex64 PASSED [1.2318s] [ 15%] 2025-12-04T15:22:22.7847801Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vdot_cuda_complex64 PASSED [0.0064s] [ 15%] 2025-12-04T15:22:22.7847918Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_cuda_complex64 PASSED [1.2631s] [ 15%] 2025-12-04T15:22:22.7848031Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_cuda_int64 PASSED [0.0049s] [ 15%] 2025-12-04T15:22:22.7848143Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_float32 PASSED [1.2289s] [ 15%] 2025-12-04T15:22:22.7848256Z 
test_ops.py::TestCommonCUDA::test_noncontiguous_samples_xlogy_cuda_float32 PASSED [0.0117s] [ 15%] 2025-12-04T15:22:22.7848371Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_float32 PASSED [1.2218s] [ 15%] 2025-12-04T15:22:22.7848489Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_complex64 PASSED [0.0045s] [ 15%] 2025-12-04T15:22:22.7848606Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_like_cuda_int64 PASSED [1.2471s] [ 15%] 2025-12-04T15:22:22.7848709Z test_ops.py::TestCommonCUDA::test_numpy_ref_allclose_cuda_complex128 PASSED [0.0116s] [ 16%] 2025-12-04T15:22:22.7848825Z test_ops.py::TestCommonCUDA::test_numpy_ref_argwhere_cuda_complex128 PASSED [1.2420s] [ 16%] 2025-12-04T15:22:22.7848939Z test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_complex128 PASSED [0.0067s] [ 16%] 2025-12-04T15:22:22.7849045Z test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_to_cuda_float64 PASSED [1.2217s] [ 16%] 2025-12-04T15:22:22.7849139Z test_ops.py::TestCommonCUDA::test_numpy_ref_cat_cuda_float64 PASSED [0.0070s] [ 16%] 2025-12-04T15:22:22.7849236Z test_ops.py::TestCommonCUDA::test_numpy_ref_clamp_cuda_float64 PASSED [0.0102s] [ 16%] 2025-12-04T15:22:22.7849330Z test_ops.py::TestCommonCUDA::test_numpy_ref_clamp_cuda_int64 PASSED [1.2303s] [ 16%] 2025-12-04T15:22:22.7849425Z test_ops.py::TestCommonCUDA::test_numpy_ref_diag_cuda_float64 PASSED [0.0075s] [ 16%] 2025-12-04T15:22:22.7849529Z test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_complex128 PASSED [1.2598s] [ 16%] 2025-12-04T15:22:22.7849630Z test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_float64 PASSED [0.0053s] [ 16%] 2025-12-04T15:22:22.7849726Z test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_int64 PASSED [1.2296s] [ 16%] 2025-12-04T15:22:22.7849820Z test_ops.py::TestCommonCUDA::test_numpy_ref_diff_cuda_float64 PASSED [0.0221s] [ 16%] 2025-12-04T15:22:22.7849944Z 
test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_2inputs_2outputs_cuda_float64 PASSED [0.0055s] [ 16%] 2025-12-04T15:22:22.7850064Z test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_2inputs_2outputs_cuda_int64 PASSED [0.0048s] [ 16%] 2025-12-04T15:22:22.7850247Z test_ops.py::TestCommonCUDA::test_numpy_ref_meshgrid_variadic_tensors_cuda_complex128 PASSED [1.2174s] [ 16%] 2025-12-04T15:22:22.7850381Z test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose3d_cuda_complex128 XFAIL [0.0755s] [ 16%] 2025-12-04T15:22:22.7850495Z test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_float64 PASSED [1.2590s] [ 16%] 2025-12-04T15:22:22.7850590Z test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_int64 PASSED [1.2487s] [ 16%] 2025-12-04T15:22:22.7850687Z test_ops.py::TestCommonCUDA::test_numpy_ref_ravel_cuda_float64 PASSED [0.0047s] [ 16%] 2025-12-04T15:22:22.7850788Z test_ops.py::TestCommonCUDA::test_numpy_ref_repeat_cuda_complex128 PASSED [0.0116s] [ 16%] 2025-12-04T15:22:22.7850893Z test_ops.py::TestCommonCUDA::test_numpy_ref_searchsorted_cuda_float64 PASSED [0.0817s] [ 16%] 2025-12-04T15:22:22.7851008Z test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_cosine_cuda_float64 PASSED [0.0066s] [ 16%] 2025-12-04T15:22:22.7851130Z test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_gaussian_cuda_float64 PASSED [0.0068s] [ 16%] 2025-12-04T15:22:22.7851237Z test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_copy_cuda_complex128 PASSED [1.2400s] [ 16%] 2025-12-04T15:22:22.7851336Z test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_cuda_float64 PASSED [0.0057s] [ 16%] 2025-12-04T15:22:22.7851448Z test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_multiple_cuda_complex128 PASSED [1.2531s] [ 16%] 2025-12-04T15:22:22.7851544Z test_ops.py::TestCommonCUDA::test_numpy_ref_tile_cuda_int64 PASSED [0.0143s] [ 16%] 2025-12-04T15:22:22.7851640Z test_ops.py::TestCommonCUDA::test_numpy_ref_view_copy_cuda_int64 PASSED [1.2298s] [ 16%] 
2025-12-04T15:22:22.7851736Z test_ops.py::TestCommonCUDA::test_numpy_ref_where_cuda_float64 PASSED [0.0099s] [ 16%] 2025-12-04T15:22:22.7851823Z test_ops.py::TestCommonCUDA::test_out_H_cuda_float32 PASSED [1.2593s] [ 16%] 2025-12-04T15:22:22.7851915Z test_ops.py::TestCommonCUDA::test_out___rand___cuda_int64 PASSED [0.0033s] [ 16%] 2025-12-04T15:22:22.7852007Z test_ops.py::TestCommonCUDA::test_out___rpow___cuda_float32 PASSED [1.2355s] [ 16%] 2025-12-04T15:22:22.7852122Z test_ops.py::TestCommonCUDA::test_out__refs__conversions_cdouble_cuda_float32 PASSED [0.0032s] [ 16%] 2025-12-04T15:22:22.7852233Z test_ops.py::TestCommonCUDA::test_out__refs__conversions_float_cuda_float32 PASSED [1.2142s] [ 16%] 2025-12-04T15:22:22.7852326Z test_ops.py::TestCommonCUDA::test_out__refs_acosh_cuda_float32 PASSED [0.0067s] [ 16%] 2025-12-04T15:22:22.7852432Z test_ops.py::TestCommonCUDA::test_out__refs_all_cuda_float32 PASSED [1.2401s] [ 16%] 2025-12-04T15:22:22.7852525Z test_ops.py::TestCommonCUDA::test_out__refs_amax_cuda_float32 PASSED [0.0265s] [ 16%] 2025-12-04T15:22:22.7852634Z test_ops.py::TestCommonCUDA::test_out__refs_as_strided_scatter_cuda_float32 PASSED [1.2482s] [ 16%] 2025-12-04T15:22:22.7852726Z test_ops.py::TestCommonCUDA::test_out__refs_asin_cuda_float32 PASSED [0.0054s] [ 16%] 2025-12-04T15:22:22.7852829Z test_ops.py::TestCommonCUDA::test_out__refs_block_diag_cuda_float32 PASSED [1.2410s] [ 16%] 2025-12-04T15:22:22.7852922Z test_ops.py::TestCommonCUDA::test_out__refs_chunk_cuda_float32 PASSED [0.0032s] [ 16%] 2025-12-04T15:22:22.7853025Z test_ops.py::TestCommonCUDA::test_out__refs_contiguous_cuda_float32 PASSED [1.2091s] [ 16%] 2025-12-04T15:22:22.7853121Z test_ops.py::TestCommonCUDA::test_out__refs_diagonal_cuda_float32 PASSED [0.0033s] [ 16%] 2025-12-04T15:22:22.7853234Z test_ops.py::TestCommonCUDA::test_out__refs_diagonal_scatter_cuda_float32 PASSED [0.0202s] [ 16%] 2025-12-04T15:22:22.7853323Z test_ops.py::TestCommonCUDA::test_out__refs_eq_cuda_float32 PASSED 
[0.0112s] [ 16%] 2025-12-04T15:22:22.7853415Z test_ops.py::TestCommonCUDA::test_out__refs_erfc_cuda_float32 PASSED [0.0050s] [ 16%] 2025-12-04T15:22:22.7853509Z test_ops.py::TestCommonCUDA::test_out__refs_fft_fft2_cuda_float32 PASSED [1.2479s] [ 16%] 2025-12-04T15:22:22.7853605Z test_ops.py::TestCommonCUDA::test_out__refs_fft_hfft_cuda_float32 PASSED [0.0133s] [ 16%] 2025-12-04T15:22:22.7853731Z test_ops.py::TestCommonCUDA::test_out__refs_fft_rfftn_cuda_float32 PASSED [0.0129s] [ 16%] 2025-12-04T15:22:22.7853828Z test_ops.py::TestCommonCUDA::test_out__refs_flipud_cuda_float32 PASSED [1.2901s] [ 16%] 2025-12-04T15:22:22.7853928Z test_ops.py::TestCommonCUDA::test_out__refs_gcd_cuda_int64 PASSED [0.0123s] [ 16%] 2025-12-04T15:22:22.7854019Z test_ops.py::TestCommonCUDA::test_out__refs_ge_cuda_float32 PASSED [0.0104s] [ 16%] 2025-12-04T15:22:22.7854121Z test_ops.py::TestCommonCUDA::test_out__refs_index_copy_cuda_float32 PASSED [0.0048s] [ 16%] 2025-12-04T15:22:22.7854220Z test_ops.py::TestCommonCUDA::test_out__refs_index_fill_cuda_float32 PASSED [1.2624s] [ 16%] 2025-12-04T15:22:22.7854316Z test_ops.py::TestCommonCUDA::test_out__refs_istft_cuda_complex64 PASSED [0.0085s] [ 16%] 2025-12-04T15:22:22.7854426Z test_ops.py::TestCommonCUDA::test_out__refs_linalg_matrix_norm_cuda_float32 PASSED [0.1107s] [ 16%] 2025-12-04T15:22:22.7854528Z test_ops.py::TestCommonCUDA::test_out__refs_linalg_svd_cuda_float32 PASSED [0.4486s] [ 16%] 2025-12-04T15:22:22.7854622Z test_ops.py::TestCommonCUDA::test_out__refs_log10_cuda_float32 PASSED [0.0052s] [ 16%] 2025-12-04T15:22:22.7854716Z test_ops.py::TestCommonCUDA::test_out__refs_log1p_cuda_float32 PASSED [1.2617s] [ 16%] 2025-12-04T15:22:22.7854809Z test_ops.py::TestCommonCUDA::test_out__refs_log2_cuda_float32 PASSED [0.0072s] [ 16%] 2025-12-04T15:22:22.7854931Z test_ops.py::TestCommonCUDA::test_out__refs_logspace_tensor_overload_cuda_float32 PASSED [1.1320s] [ 16%] 2025-12-04T15:22:22.7855050Z 
test_ops.py::TestCommonCUDA::test_out__refs_meshgrid_variadic_tensors_cuda_float32 PASSED [1.2197s] [ 16%] 2025-12-04T15:22:22.7855149Z test_ops.py::TestCommonCUDA::test_out__refs_nan_to_num_cuda_float32 PASSED [0.0069s] [ 16%] 2025-12-04T15:22:22.7855239Z test_ops.py::TestCommonCUDA::test_out__refs_neg_cuda_float32 PASSED [1.2183s] [ 16%] 2025-12-04T15:22:22.7855338Z test_ops.py::TestCommonCUDA::test_out__refs_nextafter_cuda_float32 PASSED [0.0157s] [ 16%] 2025-12-04T15:22:22.7855462Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_alpha_dropout_cuda_float32 PASSED [1.2558s] [ 16%] 2025-12-04T15:22:22.7855629Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_dropout_cuda_float32 SKIPPED [0.0003s] (Expected: dropout is not comparable) [ 16%] 2025-12-04T15:22:22.7855764Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_log_softmax_with_dtype_cuda_float32 PASSED [1.2703s] [ 17%] 2025-12-04T15:22:22.7855901Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_poisson_nll_loss_cuda_float32 PASSED [0.0036s] [ 17%] 2025-12-04T15:22:22.7856010Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_selu_cuda_float32 PASSED [1.2300s] [ 17%] 2025-12-04T15:22:22.7856127Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_threshold_cuda_float32 PASSED [0.0081s] [ 17%] 2025-12-04T15:22:22.7856221Z test_ops.py::TestCommonCUDA::test_out__refs_norm_cuda_float32 PASSED [1.2554s] [ 17%] 2025-12-04T15:22:22.7856379Z test_ops.py::TestCommonCUDA::test_out__refs_normal__in_place_cuda_float32 SKIPPED [0.0002s] (Expected: normal is not comparable) [ 17%] 2025-12-04T15:22:22.7856477Z test_ops.py::TestCommonCUDA::test_out__refs_permute_cuda_float32 PASSED [1.2517s] [ 17%] 2025-12-04T15:22:22.7856577Z test_ops.py::TestCommonCUDA::test_out__refs_remainder_cuda_float32 PASSED [0.0160s] [ 17%] 2025-12-04T15:22:22.7856673Z test_ops.py::TestCommonCUDA::test_out__refs_renorm_cuda_float32 PASSED [1.2380s] [ 17%] 2025-12-04T15:22:22.7856766Z 
test_ops.py::TestCommonCUDA::test_out__refs_roll_cuda_float32 PASSED [0.0029s] [ 17%] 2025-12-04T15:22:22.7856860Z test_ops.py::TestCommonCUDA::test_out__refs_rot90_cuda_float32 PASSED [1.2235s] [ 17%] 2025-12-04T15:22:22.7856950Z test_ops.py::TestCommonCUDA::test_out__refs_sinc_cuda_float32 PASSED [0.0068s] [ 17%] 2025-12-04T15:22:22.7857072Z test_ops.py::TestCommonCUDA::test_out__refs_special_softmax_with_dtype_cuda_float32 PASSED [1.2357s] [ 17%] 2025-12-04T15:22:22.7857174Z test_ops.py::TestCommonCUDA::test_out__refs_sum_cuda_float32 XFAIL [0.0053s] [ 17%] 2025-12-04T15:22:22.7857280Z test_ops.py::TestCommonCUDA::test_out__refs_t_copy_cuda_float32 PASSED [2.4695s] [ 17%] 2025-12-04T15:22:22.7857370Z test_ops.py::TestCommonCUDA::test_out__refs_tan_cuda_float32 PASSED [0.0051s] [ 17%] 2025-12-04T15:22:22.7857483Z test_ops.py::TestCommonCUDA::test_out__refs_triu_indices_cuda_int64 PASSED [0.0023s] [ 17%] 2025-12-04T15:22:22.7857585Z test_ops.py::TestCommonCUDA::test_out__refs_unbind_copy_cuda_float32 PASSED [1.2380s] [ 17%] 2025-12-04T15:22:22.7857684Z test_ops.py::TestCommonCUDA::test_out__refs_unflatten_cuda_float32 PASSED [0.0032s] [ 17%] 2025-12-04T15:22:22.7857773Z test_ops.py::TestCommonCUDA::test_out__refs_var_cuda_float32 PASSED [0.0190s] [ 17%] 2025-12-04T15:22:22.7857871Z test_ops.py::TestCommonCUDA::test_out__refs_view_copy_cuda_float32 PASSED [1.2614s] [ 17%] 2025-12-04T15:22:22.7857959Z test_ops.py::TestCommonCUDA::test_out_acos_cuda_float32 PASSED [0.0062s] [ 17%] 2025-12-04T15:22:22.7858151Z test_ops.py::TestCommonCUDA::test_out_allclose_cuda_float32 SKIPPED [0.0024s] (Skipped! Only supports single tensor or iterable of tensor outputs.) 
[ 17%] 2025-12-04T15:22:22.7858243Z test_ops.py::TestCommonCUDA::test_out_argwhere_cuda_float32 PASSED [2.4937s] [ 17%] 2025-12-04T15:22:22.7858332Z test_ops.py::TestCommonCUDA::test_out_atan2_cuda_float32 PASSED [0.0111s] [ 17%] 2025-12-04T15:22:22.7858424Z test_ops.py::TestCommonCUDA::test_out_baddbmm_cuda_float32 PASSED [1.2288s] [ 17%] 2025-12-04T15:22:22.7858510Z test_ops.py::TestCommonCUDA::test_out_bmm_cuda_float32 PASSED [0.0050s] [ 17%] 2025-12-04T15:22:22.7858596Z test_ops.py::TestCommonCUDA::test_out_bool_cuda_float32 PASSED [1.2419s] [ 17%] 2025-12-04T15:22:22.7858682Z test_ops.py::TestCommonCUDA::test_out_cat_cuda_float32 PASSED [0.0109s] [ 17%] 2025-12-04T15:22:22.7858770Z test_ops.py::TestCommonCUDA::test_out_cauchy_cuda_float32 PASSED [1.2429s] [ 17%] 2025-12-04T15:22:22.7858857Z test_ops.py::TestCommonCUDA::test_out_chalf_cuda_float32 PASSED [0.0033s] [ 17%] 2025-12-04T15:22:22.7858944Z test_ops.py::TestCommonCUDA::test_out_char_cuda_float32 PASSED [1.2078s] [ 17%] 2025-12-04T15:22:22.7859041Z test_ops.py::TestCommonCUDA::test_out_cholesky_solve_cuda_float32 PASSED [0.0229s] [ 17%] 2025-12-04T15:22:22.7859130Z test_ops.py::TestCommonCUDA::test_out_clone_cuda_float32 PASSED [1.2288s] [ 17%] 2025-12-04T15:22:22.7859225Z test_ops.py::TestCommonCUDA::test_out_count_nonzero_cuda_float32 PASSED [0.0034s] [ 17%] 2025-12-04T15:22:22.7859328Z test_ops.py::TestCommonCUDA::test_out_diag_embed_cuda_float32 PASSED [1.2097s] [ 17%] 2025-12-04T15:22:22.7859418Z test_ops.py::TestCommonCUDA::test_out_digamma_cuda_float32 PASSED [0.1817s] [ 17%] 2025-12-04T15:22:22.7859504Z test_ops.py::TestCommonCUDA::test_out_dist_cuda_float32 PASSED [1.2414s] [ 17%] 2025-12-04T15:22:22.7859591Z test_ops.py::TestCommonCUDA::test_out_einsum_cuda_float32 PASSED [0.0034s] [ 17%] 2025-12-04T15:22:22.7859684Z test_ops.py::TestCommonCUDA::test_out_empty_like_cuda_float32 PASSED [1.1914s] [ 17%] 2025-12-04T15:22:22.7859770Z test_ops.py::TestCommonCUDA::test_out_eq_cuda_float32 PASSED 
[0.0088s] [ 17%] 2025-12-04T15:22:22.7859862Z test_ops.py::TestCommonCUDA::test_out_fft_ihfftn_cuda_float32 XFAIL [0.0061s] [ 17%] 2025-12-04T15:22:22.7859948Z test_ops.py::TestCommonCUDA::test_out_fill_cuda_float32 PASSED [1.2451s] [ 17%] 2025-12-04T15:22:22.7860037Z test_ops.py::TestCommonCUDA::test_out_flatten_cuda_float32 PASSED [0.0032s] [ 17%] 2025-12-04T15:22:22.7860156Z test_ops.py::TestCommonCUDA::test_out_fmod_cuda_float32 PASSED [0.0106s] [ 17%] 2025-12-04T15:22:22.7860245Z test_ops.py::TestCommonCUDA::test_out_igammac_cuda_float32 PASSED [0.0088s] [ 17%] 2025-12-04T15:22:22.7860359Z test_ops.py::TestCommonCUDA::test_out_integral_dtype__refs_prod_cuda_int16 PASSED [0.0242s] [ 17%] 2025-12-04T15:22:22.7860539Z test_ops.py::TestCommonCUDA::test_out_item_cuda_float32 SKIPPED [0.0021s] (Skipped! Only supports single tensor or iterable of tensor outputs.) [ 17%] 2025-12-04T15:22:22.7860683Z test_ops.py::TestCommonCUDA::test_out_jiterator_4inputs_with_extra_args_cuda_float32 PASSED [2.4847s] [ 17%] 2025-12-04T15:22:22.7860799Z test_ops.py::TestCommonCUDA::test_out_jiterator_binary_cuda_float32 PASSED [1.2020s] [ 17%] 2025-12-04T15:22:22.7860919Z test_ops.py::TestCommonCUDA::test_out_linalg_cholesky_ex_cuda_float32 PASSED [1.2519s] [ 17%] 2025-12-04T15:22:22.7861014Z test_ops.py::TestCommonCUDA::test_out_linalg_cross_cuda_float32 PASSED [0.0135s] [ 17%] 2025-12-04T15:22:22.7861115Z test_ops.py::TestCommonCUDA::test_out_linalg_diagonal_cuda_float32 PASSED [1.2337s] [ 17%] 2025-12-04T15:22:22.7861214Z test_ops.py::TestCommonCUDA::test_out_linalg_multi_dot_cuda_float32 PASSED [0.0109s] [ 17%] 2025-12-04T15:22:22.7861309Z test_ops.py::TestCommonCUDA::test_out_linalg_norm_cuda_float32 PASSED [1.3559s] [ 17%] 2025-12-04T15:22:22.7861432Z test_ops.py::TestCommonCUDA::test_out_linalg_norm_subgradients_at_zero_cuda_float32 PASSED [0.0767s] [ 17%] 2025-12-04T15:22:22.7861617Z test_ops.py::TestCommonCUDA::test_out_linalg_pinv_singular_cuda_float32 SKIPPED [0.0009s] (test 
is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 17%] 2025-12-04T15:22:22.7861711Z test_ops.py::TestCommonCUDA::test_out_linalg_svd_cuda_float32 PASSED [1.6496s] [ 17%] 2025-12-04T15:22:22.7861808Z test_ops.py::TestCommonCUDA::test_out_linalg_vecdot_cuda_float32 PASSED [0.0331s] [ 17%] 2025-12-04T15:22:22.7861921Z test_ops.py::TestCommonCUDA::test_out_linspace_tensor_overload_cuda_float32 PASSED [0.1393s] [ 17%] 2025-12-04T15:22:22.7862009Z test_ops.py::TestCommonCUDA::test_out_log10_cuda_float32 PASSED [1.2908s] [ 17%] 2025-12-04T15:22:22.7862119Z test_ops.py::TestCommonCUDA::test_out_log_softmax_with_dtype_cuda_float32 PASSED [0.0097s] [ 17%] 2025-12-04T15:22:22.7862213Z test_ops.py::TestCommonCUDA::test_out_logical_not_cuda_float32 PASSED [1.2209s] [ 17%] 2025-12-04T15:22:22.7862300Z test_ops.py::TestCommonCUDA::test_out_long_cuda_float32 PASSED [0.0033s] [ 17%] 2025-12-04T15:22:22.7862385Z test_ops.py::TestCommonCUDA::test_out_lt_cuda_float32 PASSED [0.0074s] [ 17%] 2025-12-04T15:22:22.7862473Z test_ops.py::TestCommonCUDA::test_out_mT_cuda_float32 PASSED [1.2122s] [ 17%] 2025-12-04T15:22:22.7862566Z test_ops.py::TestCommonCUDA::test_out_masked_fill_cuda_float32 PASSED [0.0033s] [ 17%] 2025-12-04T15:22:22.7862661Z test_ops.py::TestCommonCUDA::test_out_masked_mean_cuda_float32 PASSED [1.2397s] [ 18%] 2025-12-04T15:22:22.7862757Z test_ops.py::TestCommonCUDA::test_out_masked_softmax_cuda_float32 PASSED [0.0032s] [ 18%] 2025-12-04T15:22:22.7862865Z test_ops.py::TestCommonCUDA::test_out_masked_var_cuda_float32 PASSED [1.2186s] [ 18%] 2025-12-04T15:22:22.7862956Z test_ops.py::TestCommonCUDA::test_out_max_binary_cuda_float32 PASSED [0.0111s] [ 18%] 2025-12-04T15:22:22.7863065Z test_ops.py::TestCommonCUDA::test_out_min_reduction_no_dim_cuda_float32 PASSED [1.2155s] [ 18%] 2025-12-04T15:22:22.7863149Z test_ops.py::TestCommonCUDA::test_out_mm_cuda_float32 PASSED [0.0065s] [ 18%] 2025-12-04T15:22:22.7863240Z 
test_ops.py::TestCommonCUDA::test_out_movedim_cuda_float32 PASSED [1.2378s] [ 18%] 2025-12-04T15:22:22.7863354Z test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_3_cuda_float32 PASSED [0.0127s] [ 18%] 2025-12-04T15:22:22.7863446Z test_ops.py::TestCommonCUDA::test_out_nan_to_num_cuda_float32 PASSED [1.2673s] [ 18%] 2025-12-04T15:22:22.7863537Z test_ops.py::TestCommonCUDA::test_out_nansum_cuda_float32 PASSED [0.0292s] [ 18%] 2025-12-04T15:22:22.7863649Z test_ops.py::TestCommonCUDA::test_out_native_dropout_backward_cuda_float32 PASSED [1.2339s] [ 18%] 2025-12-04T15:22:22.7863738Z test_ops.py::TestCommonCUDA::test_out_ne_cuda_float32 PASSED [0.0084s] [ 18%] 2025-12-04T15:22:22.7863829Z test_ops.py::TestCommonCUDA::test_out_new_ones_cuda_float32 PASSED [1.2610s] [ 18%] 2025-12-04T15:22:22.7863955Z test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool3d_cuda_float32 PASSED [0.0029s] [ 18%] 2025-12-04T15:22:22.7864078Z test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [1.2579s] [ 18%] 2025-12-04T15:22:22.7864211Z test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool1d_cuda_float32 PASSED [1.2223s] [ 18%] 2025-12-04T15:22:22.7864321Z test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool2d_cuda_float32 XFAIL [0.0048s] [ 18%] 2025-12-04T15:22:22.7864444Z test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool3d_cuda_float32 PASSED [1.2228s] [ 18%] 2025-12-04T15:22:22.7864562Z test_ops.py::TestCommonCUDA::test_out_nn_functional_channel_shuffle_cuda_float32 PASSED [1.2713s] [ 18%] 2025-12-04T15:22:22.7864684Z test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose1d_cuda_float32 PASSED [0.0036s] [ 18%] 2025-12-04T15:22:22.7864803Z test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose3d_cuda_float32 PASSED [1.2361s] [ 18%] 2025-12-04T15:22:22.7864926Z test_ops.py::TestCommonCUDA::test_out_nn_functional_cross_entropy_cuda_float32 PASSED [0.0175s] [ 18%] 
2025-12-04T15:22:22.7865035Z test_ops.py::TestCommonCUDA::test_out_nn_functional_ctc_loss_cuda_float32 PASSED [1.2666s] [ 18%] 2025-12-04T15:22:22.7865147Z test_ops.py::TestCommonCUDA::test_out_nn_functional_dropout3d_cuda_float32 PASSED [0.0036s] [ 18%] 2025-12-04T15:22:22.7865252Z test_ops.py::TestCommonCUDA::test_out_nn_functional_elu_cuda_float32 PASSED [1.2422s] [ 18%] 2025-12-04T15:22:22.7865380Z test_ops.py::TestCommonCUDA::test_out_nn_functional_fractional_max_pool3d_cuda_float32 PASSED [0.0103s] [ 18%] 2025-12-04T15:22:22.7865503Z test_ops.py::TestCommonCUDA::test_out_nn_functional_gaussian_nll_loss_cuda_float32 PASSED [1.2268s] [ 18%] 2025-12-04T15:22:22.7865621Z test_ops.py::TestCommonCUDA::test_out_nn_functional_gelu_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 18%] 2025-12-04T15:22:22.7865725Z test_ops.py::TestCommonCUDA::test_out_nn_functional_glu_cuda_float32 PASSED [1.2303s] [ 18%] 2025-12-04T15:22:22.7865835Z test_ops.py::TestCommonCUDA::test_out_nn_functional_grid_sample_cuda_float32 PASSED [1.2591s] [ 18%] 2025-12-04T15:22:22.7865945Z test_ops.py::TestCommonCUDA::test_out_nn_functional_hardtanh_cuda_float32 PASSED [0.0031s] [ 18%] 2025-12-04T15:22:22.7866062Z test_ops.py::TestCommonCUDA::test_out_nn_functional_instance_norm_cuda_float32 PASSED [1.3135s] [ 18%] 2025-12-04T15:22:22.7866185Z test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_area_cuda_float32 PASSED [0.0032s] [ 18%] 2025-12-04T15:22:22.7866307Z test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_linear_cuda_float32 PASSED [1.2040s] [ 18%] 2025-12-04T15:22:22.7866428Z test_ops.py::TestCommonCUDA::test_out_nn_functional_kl_div_cuda_float32 PASSED [0.0034s] [ 18%] 2025-12-04T15:22:22.7866539Z test_ops.py::TestCommonCUDA::test_out_nn_functional_layer_norm_cuda_float32 PASSED [1.2297s] [ 18%] 2025-12-04T15:22:22.7866646Z test_ops.py::TestCommonCUDA::test_out_nn_functional_linear_cuda_float32 XFAIL [0.0052s] [ 18%] 2025-12-04T15:22:22.7866755Z 
test_ops.py::TestCommonCUDA::test_out_nn_functional_max_pool1d_cuda_float32 PASSED [2.4883s] [ 18%] 2025-12-04T15:22:22.7866875Z test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool1d_grad_cuda_float32 PASSED [0.0042s] [ 18%] 2025-12-04T15:22:22.7866991Z test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool3d_cuda_float32 PASSED [1.2394s] [ 18%] 2025-12-04T15:22:22.7867104Z test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_replicate_cuda_float32 PASSED [0.0033s] [ 18%] 2025-12-04T15:22:22.7867223Z test_ops.py::TestCommonCUDA::test_out_nn_functional_pixel_unshuffle_cuda_float32 PASSED [1.2141s] [ 18%] 2025-12-04T15:22:22.7867330Z test_ops.py::TestCommonCUDA::test_out_nn_functional_prelu_cuda_float32 PASSED [0.0033s] [ 18%] 2025-12-04T15:22:22.7867435Z test_ops.py::TestCommonCUDA::test_out_nn_functional_relu6_cuda_float32 PASSED [1.2444s] [ 18%] 2025-12-04T15:22:22.7867556Z test_ops.py::TestCommonCUDA::test_out_nn_functional_softmin_with_dtype_cuda_float32 PASSED [0.0032s] [ 18%] 2025-12-04T15:22:22.7867649Z test_ops.py::TestCommonCUDA::test_out_positive_cuda_float32 PASSED [1.1971s] [ 18%] 2025-12-04T15:22:22.7867736Z test_ops.py::TestCommonCUDA::test_out_put_cuda_float32 PASSED [0.0034s] [ 18%] 2025-12-04T15:22:22.7867858Z test_ops.py::TestCommonCUDA::test_out_quantile_cuda_float32 PASSED [1.3221s] [ 18%] 2025-12-04T15:22:22.7867951Z test_ops.py::TestCommonCUDA::test_out_rand_like_cuda_float32 PASSED [0.0037s] [ 18%] 2025-12-04T15:22:22.7868059Z test_ops.py::TestCommonCUDA::test_out_randint_like_cuda_float32 PASSED [1.2379s] [ 18%] 2025-12-04T15:22:22.7868147Z test_ops.py::TestCommonCUDA::test_out_randn_cuda_float32 PASSED [0.0075s] [ 18%] 2025-12-04T15:22:22.7868239Z test_ops.py::TestCommonCUDA::test_out_repeat_cuda_float32 PASSED [1.2534s] [ 18%] 2025-12-04T15:22:22.7868377Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error__batch_norm_with_update_cuda_float32 PASSED [0.0165s] [ 18%] 2025-12-04T15:22:22.7868520Z 
test_ops.py::TestCommonCUDA::test_out_requires_grad_error__native_batch_norm_legit_cuda_float32 PASSED [1.2483s] [ 18%] 2025-12-04T15:22:22.7868632Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acos_cuda_float32 PASSED [0.0036s] [ 18%] 2025-12-04T15:22:22.7868750Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_add_cuda_complex64 PASSED [1.2587s] [ 18%] 2025-12-04T15:22:22.7868866Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcdiv_cuda_float32 PASSED [0.0034s] [ 18%] 2025-12-04T15:22:22.7868984Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcmul_cuda_complex64 PASSED [1.2109s] [ 18%] 2025-12-04T15:22:22.7869099Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcmul_cuda_float32 PASSED [0.0036s] [ 18%] 2025-12-04T15:22:22.7869213Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addr_cuda_complex64 PASSED [1.2341s] [ 18%] 2025-12-04T15:22:22.7869325Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_amax_cuda_float32 PASSED [0.0033s] [ 18%] 2025-12-04T15:22:22.7869438Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asin_cuda_complex64 PASSED [1.2106s] [ 18%] 2025-12-04T15:22:22.7869553Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_complex64 PASSED [0.0032s] [ 18%] 2025-12-04T15:22:22.7869665Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_float32 PASSED [1.2401s] [ 18%] 2025-12-04T15:22:22.7869779Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cat_cuda_complex64 PASSED [0.0032s] [ 18%] 2025-12-04T15:22:22.7869889Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ceil_cuda_float32 PASSED [1.2159s] [ 18%] 2025-12-04T15:22:22.7870019Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_cuda_float32 PASSED [0.0038s] [ 18%] 2025-12-04T15:22:22.7870175Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_solve_cuda_float32 PASSED [1.2704s] [ 19%] 2025-12-04T15:22:22.7870303Z 
test_ops.py::TestCommonCUDA::test_out_requires_grad_error_conj_physical_cuda_float32 PASSED [0.0033s] [ 19%] 2025-12-04T15:22:22.7870416Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cosh_cuda_complex64 PASSED [1.2297s] [ 19%] 2025-12-04T15:22:22.7870531Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cross_cuda_complex64 PASSED [0.0034s] [ 19%] 2025-12-04T15:22:22.7870647Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cross_cuda_float32 PASSED [1.2514s] [ 19%] 2025-12-04T15:22:22.7870760Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cummax_cuda_float32 PASSED [0.0112s] [ 19%] 2025-12-04T15:22:22.7870874Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cummin_cuda_float32 PASSED [1.2545s] [ 19%] 2025-12-04T15:22:22.7871001Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diagonal_copy_cuda_complex64 PASSED [0.0034s] [ 19%] 2025-12-04T15:22:22.7871116Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_digamma_cuda_float32 PASSED [1.2455s] [ 19%] 2025-12-04T15:22:22.7871244Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_floor_rounding_cuda_float32 PASSED [0.0036s] [ 19%] 2025-12-04T15:22:22.7871376Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_no_rounding_mode_cuda_float32 PASSED [1.2396s] [ 19%] 2025-12-04T15:22:22.7871514Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dot_cuda_complex64 PASSED [0.0034s] [ 19%] 2025-12-04T15:22:22.7871635Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft_cuda_complex64 PASSED [1.2618s] [ 19%] 2025-12-04T15:22:22.7871764Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfft_cuda_float32 PASSED [0.0036s] [ 19%] 2025-12-04T15:22:22.7871883Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfftn_cuda_float32 PASSED [1.2459s] [ 19%] 2025-12-04T15:22:22.7871995Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_full_cuda_float32 PASSED [0.0033s] [ 19%] 
2025-12-04T15:22:22.7872114Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_gather_cuda_complex64 PASSED [1.2363s] [ 19%] 2025-12-04T15:22:22.7872233Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_copy_cuda_float32 PASSED [0.0034s] [ 19%] 2025-12-04T15:22:22.7872356Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_select_cuda_float32 PASSED [1.2393s] [ 19%] 2025-12-04T15:22:22.7872471Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_kron_cuda_complex64 PASSED [0.0034s] [ 19%] 2025-12-04T15:22:22.7872582Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lerp_cuda_float32 PASSED [1.2245s] [ 19%] 2025-12-04T15:22:22.7872713Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_cuda_float32 PASSED [0.0037s] [ 19%] 2025-12-04T15:22:22.7872839Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cond_cuda_complex64 PASSED [1.2018s] [ 19%] 2025-12-04T15:22:22.7872963Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eig_cuda_complex64 PASSED [0.0135s] [ 19%] 2025-12-04T15:22:22.7873097Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_ex_cuda_complex64 PASSED [1.3008s] [ 19%] 2025-12-04T15:22:22.7873226Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_solve_cuda_complex64 PASSED [0.0044s] [ 19%] 2025-12-04T15:22:22.7873352Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_solve_cuda_float32 PASSED [1.2393s] [ 19%] 2025-12-04T15:22:22.7873491Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_hermitian_cuda_complex64 PASSED [0.0047s] [ 19%] 2025-12-04T15:22:22.7873624Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_hermitian_cuda_float32 PASSED [1.2328s] [ 19%] 2025-12-04T15:22:22.7873762Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_qr_cuda_complex64 PASSED [0.0046s] [ 19%] 2025-12-04T15:22:22.7873888Z 
test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_ex_cuda_float32 PASSED [1.2471s] [ 19%] 2025-12-04T15:22:22.7874026Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_triangular_cuda_float32 PASSED [0.0035s] [ 19%] 2025-12-04T15:22:22.7874151Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svdvals_cuda_float32 PASSED [1.2658s] [ 19%] 2025-12-04T15:22:22.7874285Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorsolve_cuda_complex64 PASSED [0.0038s] [ 19%] 2025-12-04T15:22:22.7874407Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_cuda_float32 PASSED [0.0024s] [ 19%] 2025-12-04T15:22:22.7874551Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_tensor_overload_cuda_complex64 PASSED [1.2622s] [ 19%] 2025-12-04T15:22:22.7874666Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_cuda_complex64 PASSED [0.0033s] [ 19%] 2025-12-04T15:22:22.7874789Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logcumsumexp_cuda_float32 PASSED [1.2555s] [ 19%] 2025-12-04T15:22:22.7874932Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logspace_tensor_overload_cuda_complex64 PASSED [0.0034s] [ 19%] 2025-12-04T15:22:22.7875054Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_masked_select_cuda_float32 PASSED [1.2496s] [ 19%] 2025-12-04T15:22:22.7875187Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_reduction_no_dim_cuda_float32 PASSED [0.0033s] [ 19%] 2025-12-04T15:22:22.7875349Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_reduction_with_dim_cuda_float32 PASSED [1.2613s] [ 19%] 2025-12-04T15:22:22.7875475Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mean_cuda_complex64 PASSED [0.0033s] [ 19%] 2025-12-04T15:22:22.7875586Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mean_cuda_float32 PASSED [1.2296s] [ 19%] 2025-12-04T15:22:22.7875725Z 
test_ops.py::TestCommonCUDA::test_out_requires_grad_error_min_reduction_with_dim_cuda_float32 PASSED [0.0032s] [ 19%] 2025-12-04T15:22:22.7875836Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_msort_cuda_float32 PASSED [1.2339s] [ 19%] 2025-12-04T15:22:22.7875949Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mul_cuda_complex64 PASSED [0.0033s] [ 19%] 2025-12-04T15:22:22.7876062Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mv_cuda_complex64 PASSED [1.2274s] [ 19%] 2025-12-04T15:22:22.7876199Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0034s] [ 19%] 2025-12-04T15:22:22.7876328Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_native_batch_norm_cuda_float32 PASSED [1.2600s] [ 19%] 2025-12-04T15:22:22.7876467Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_hardshrink_cuda_float32 PASSED [0.0105s] [ 19%] 2025-12-04T15:22:22.7876602Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_linear_cuda_float32 PASSED [1.2659s] [ 19%] 2025-12-04T15:22:22.7876738Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_normalize_cuda_float32 PASSED [0.0034s] [ 19%] 2025-12-04T15:22:22.7876873Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_softplus_cuda_float32 PASSED [1.2397s] [ 19%] 2025-12-04T15:22:22.7876984Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_cuda_float32 PASSED [0.0033s] [ 19%] 2025-12-04T15:22:22.7877104Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_inf_cuda_float32 PASSED [1.2599s] [ 19%] 2025-12-04T15:22:22.7877234Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_normal_number_mean_cuda_float32 PASSED [0.0034s] [ 19%] 2025-12-04T15:22:22.7877350Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ones_cuda_float32 PASSED [1.2665s] [ 19%] 2025-12-04T15:22:22.7877464Z 
test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ormqr_cuda_complex64 PASSED [0.0036s] [ 19%] 2025-12-04T15:22:22.7877592Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_outer_cuda_complex64 PASSED [1.2613s] [ 19%] 2025-12-04T15:22:22.7877706Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polar_cuda_float32 PASSED [0.0034s] [ 19%] 2025-12-04T15:22:22.7877815Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_qr_cuda_float32 PASSED [1.2567s] [ 19%] 2025-12-04T15:22:22.7877932Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_renorm_cuda_complex64 PASSED [0.0100s] [ 19%] 2025-12-04T15:22:22.7878047Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rsqrt_cuda_complex64 PASSED [1.2338s] [ 19%] 2025-12-04T15:22:22.7878160Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rsqrt_cuda_float32 PASSED [0.0033s] [ 19%] 2025-12-04T15:22:22.7878282Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_add_cuda_float32 PASSED [1.2139s] [ 19%] 2025-12-04T15:22:22.7878401Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_cuda_complex64 PASSED [0.0049s] [ 19%] 2025-12-04T15:22:22.7878512Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sgn_cuda_float32 PASSED [1.2329s] [ 20%] 2025-12-04T15:22:22.7878629Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sigmoid_cuda_float32 PASSED [0.0032s] [ 20%] 2025-12-04T15:22:22.7878740Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sinh_cuda_float32 PASSED [1.1834s] [ 20%] 2025-12-04T15:22:22.7878866Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_slice_scatter_cuda_float32 PASSED [0.0034s] [ 20%] 2025-12-04T15:22:22.7879006Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i1_cuda_float32 PASSED [1.2278s] [ 20%] 2025-12-04T15:22:22.7879129Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i1e_cuda_float32 PASSED [0.0033s] [ 20%] 2025-12-04T15:22:22.7879267Z 
test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_log_ndtr_cuda_float32 PASSED [1.2545s] [ 20%] 2025-12-04T15:22:22.7879390Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_ndtr_cuda_float32 PASSED [0.0034s] [ 20%] 2025-12-04T15:22:22.7879518Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_xlog1py_cuda_float32 PASSED [1.2428s] [ 20%] 2025-12-04T15:22:22.7879631Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sqrt_cuda_complex64 PASSED [0.0032s] [ 20%] 2025-12-04T15:22:22.7879743Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_std_cuda_float32 PASSED [1.2377s] [ 20%] 2025-12-04T15:22:22.7879856Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_svd_cuda_float32 PASSED [0.0033s] [ 20%] 2025-12-04T15:22:22.7879969Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_cuda_float32 PASSED [1.2349s] [ 20%] 2025-12-04T15:22:22.7880087Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tensordot_cuda_float32 PASSED [0.0038s] [ 20%] 2025-12-04T15:22:22.7880247Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_transpose_copy_cuda_float32 PASSED [1.2525s] [ 20%] 2025-12-04T15:22:22.7880367Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_true_divide_cuda_float32 PASSED [0.0037s] [ 20%] 2025-12-04T15:22:22.7880479Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_var_cuda_float32 PASSED [1.2557s] [ 20%] 2025-12-04T15:22:22.7880592Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vdot_cuda_complex64 PASSED [0.0034s] [ 20%] 2025-12-04T15:22:22.7880707Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vdot_cuda_float32 PASSED [1.2057s] [ 20%] 2025-12-04T15:22:22.7880804Z test_ops.py::TestCommonCUDA::test_out_resize_as__cuda_float32 PASSED [0.0032s] [ 20%] 2025-12-04T15:22:22.7880910Z test_ops.py::TestCommonCUDA::test_out_round_decimals_0_cuda_float32 PASSED [1.2560s] [ 20%] 2025-12-04T15:22:22.7881026Z 
test_ops.py::TestCommonCUDA::test_out_round_decimals_3_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 20%] 2025-12-04T15:22:22.7881136Z test_ops.py::TestCommonCUDA::test_out_scatter_reduce_amin_cuda_float32 PASSED [1.2611s] [ 20%] 2025-12-04T15:22:22.7881264Z test_ops.py::TestCommonCUDA::test_out_scatter_reduce_sum_cuda_float32 PASSED [0.0220s] [ 20%] 2025-12-04T15:22:22.7881365Z test_ops.py::TestCommonCUDA::test_out_select_scatter_cuda_float32 PASSED [1.2401s] [ 20%] 2025-12-04T15:22:22.7881454Z test_ops.py::TestCommonCUDA::test_out_sign_cuda_float32 PASSED [0.0046s] [ 20%] 2025-12-04T15:22:22.7881559Z test_ops.py::TestCommonCUDA::test_out_signal_windows_hann_cuda_float32 PASSED [0.0022s] [ 20%] 2025-12-04T15:22:22.7881665Z test_ops.py::TestCommonCUDA::test_out_softmax_with_dtype_cuda_float32 PASSED [1.2415s] [ 20%] 2025-12-04T15:22:22.7881789Z test_ops.py::TestCommonCUDA::test_out_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0003s] (Skipped!) [ 20%] 2025-12-04T15:22:22.7881892Z test_ops.py::TestCommonCUDA::test_out_special_bessel_y0_cuda_float32 PASSED [1.2278s] [ 20%] 2025-12-04T15:22:22.7881995Z test_ops.py::TestCommonCUDA::test_out_special_bessel_y1_cuda_float32 PASSED [0.4291s] [ 20%] 2025-12-04T15:22:22.7882120Z test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_t_cuda_float32 PASSED [0.0091s] [ 20%] 2025-12-04T15:22:22.7882240Z test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_v_cuda_float32 PASSED [0.0103s] [ 20%] 2025-12-04T15:22:22.7882360Z test_ops.py::TestCommonCUDA::test_out_special_hermite_polynomial_h_cuda_float32 PASSED [0.0097s] [ 20%] 2025-12-04T15:22:22.7882454Z test_ops.py::TestCommonCUDA::test_out_special_i1_cuda_float32 PASSED [1.4005s] [ 20%] 2025-12-04T15:22:22.7882557Z test_ops.py::TestCommonCUDA::test_out_special_log_ndtr_cuda_float32 PASSED [0.0060s] [ 20%] 2025-12-04T15:22:22.7882698Z test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_i1_cuda_float32 PASSED [1.2385s] [ 20%] 
2025-12-04T15:22:22.7882826Z test_ops.py::TestCommonCUDA::test_out_special_scaled_modified_bessel_k1_cuda_float32 PASSED [1.2150s] [ 20%] 2025-12-04T15:22:22.7882971Z test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_u_cuda_float32 PASSED [0.0132s] [ 20%] 2025-12-04T15:22:22.7883074Z test_ops.py::TestCommonCUDA::test_out_special_xlog1py_cuda_float32 PASSED [0.0088s] [ 20%] 2025-12-04T15:22:22.7883177Z test_ops.py::TestCommonCUDA::test_out_split_with_sizes_cuda_float32 PASSED [1.2376s] [ 20%] 2025-12-04T15:22:22.7883280Z test_ops.py::TestCommonCUDA::test_out_squeeze_multiple_cuda_float32 PASSED [0.0032s] [ 20%] 2025-12-04T15:22:22.7883371Z test_ops.py::TestCommonCUDA::test_out_stack_cuda_float32 PASSED [0.0097s] [ 20%] 2025-12-04T15:22:22.7883471Z test_ops.py::TestCommonCUDA::test_out_std_mean_unbiased_cuda_float32 PASSED [1.2669s] [ 20%] 2025-12-04T15:22:22.7883562Z test_ops.py::TestCommonCUDA::test_out_sum_cuda_float32 PASSED [0.0032s] [ 20%] 2025-12-04T15:22:22.7883656Z test_ops.py::TestCommonCUDA::test_out_svd_lowrank_cuda_float32 PASSED [1.2314s] [ 20%] 2025-12-04T15:22:22.7883746Z test_ops.py::TestCommonCUDA::test_out_tan_cuda_float32 PASSED [0.0046s] [ 20%] 2025-12-04T15:22:22.7883841Z test_ops.py::TestCommonCUDA::test_out_tensor_split_cuda_float32 PASSED [1.1928s] [ 20%] 2025-12-04T15:22:22.7884070Z test_ops.py::TestCommonCUDA::test_out_torch_ops_aten__efficient_attention_forward_cuda_float32 SKIPPED [0.0012s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 20%] 2025-12-04T15:22:22.7884195Z test_ops.py::TestCommonCUDA::test_out_torch_ops_aten__safe_softmax_default_cuda_float32 PASSED [1.2274s] [ 20%] 2025-12-04T15:22:22.7884297Z test_ops.py::TestCommonCUDA::test_out_transpose_copy_cuda_float32 PASSED [0.0102s] [ 20%] 2025-12-04T15:22:22.7884391Z test_ops.py::TestCommonCUDA::test_out_transpose_cuda_float32 PASSED [1.2461s] [ 20%] 2025-12-04T15:22:22.7884488Z 
test_ops.py::TestCommonCUDA::test_out_trapezoid_cuda_float32 PASSED [0.0033s] [ 20%] 2025-12-04T15:22:22.7884582Z test_ops.py::TestCommonCUDA::test_out_true_divide_cuda_float32 PASSED [0.0104s] [ 20%] 2025-12-04T15:22:22.7884675Z test_ops.py::TestCommonCUDA::test_out_unflatten_cuda_float32 PASSED [1.2522s] [ 20%] 2025-12-04T15:22:22.7884769Z test_ops.py::TestCommonCUDA::test_out_unfold_copy_cuda_float32 PASSED [0.0220s] [ 20%] 2025-12-04T15:22:22.7884873Z test_ops.py::TestCommonCUDA::test_out_unique_cuda_float32 PASSED [1.2433s] [ 20%] 2025-12-04T15:22:22.7884967Z test_ops.py::TestCommonCUDA::test_out_unravel_index_cuda_int64 PASSED [0.0034s] [ 20%] 2025-12-04T15:22:22.7885060Z test_ops.py::TestCommonCUDA::test_out_unsqueeze_cuda_float32 PASSED [1.2461s] [ 20%] 2025-12-04T15:22:22.7885147Z test_ops.py::TestCommonCUDA::test_out_warning_H_cuda PASSED [0.0032s] [ 20%] 2025-12-04T15:22:22.7885240Z test_ops.py::TestCommonCUDA::test_out_warning___getitem___cuda PASSED [1.2674s] [ 20%] 2025-12-04T15:22:22.7885354Z test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_byte_cuda PASSED [0.0032s] [ 20%] 2025-12-04T15:22:22.7885467Z test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_float_cuda PASSED [1.2612s] [ 20%] 2025-12-04T15:22:22.7885579Z test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_long_cuda PASSED [0.0034s] [ 20%] 2025-12-04T15:22:22.7885689Z test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_polar_cuda PASSED [0.0362s] [ 20%] 2025-12-04T15:22:22.7885784Z test_ops.py::TestCommonCUDA::test_out_warning__refs_acos_cuda PASSED [1.2153s] [ 20%] 2025-12-04T15:22:22.7885880Z test_ops.py::TestCommonCUDA::test_out_warning__refs_addcdiv_cuda PASSED [0.0457s] [ 20%] 2025-12-04T15:22:22.7885978Z test_ops.py::TestCommonCUDA::test_out_warning__refs_addcmul_cuda PASSED [0.0239s] [ 21%] 2025-12-04T15:22:22.7886070Z test_ops.py::TestCommonCUDA::test_out_warning__refs_addr_cuda PASSED [1.2408s] [ 21%] 
2025-12-04T15:22:22.7886185Z test_ops.py::TestCommonCUDA::test_out_warning__refs_all_cuda PASSED [0.0463s] [ 21%] 2025-12-04T15:22:22.7886377Z test_ops.py::TestCommonCUDA::test_out_warning__refs_allclose_cuda SKIPPED [0.0025s] (Skipped! Only supports single tensor or iterable of tensor outputs.) [ 21%] 2025-12-04T15:22:22.7886485Z test_ops.py::TestCommonCUDA::test_out_warning__refs_any_cuda PASSED [2.5126s] [ 21%] 2025-12-04T15:22:22.7886588Z test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_cuda PASSED [0.0037s] [ 21%] 2025-12-04T15:22:22.7886708Z test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_partial_views_cuda PASSED [1.2199s] [ 21%] 2025-12-04T15:22:22.7886817Z test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_scatter_cuda PASSED [0.0033s] [ 21%] 2025-12-04T15:22:22.7886914Z test_ops.py::TestCommonCUDA::test_out_warning__refs_atan2_cuda PASSED [0.0343s] [ 21%] 2025-12-04T15:22:22.7887010Z test_ops.py::TestCommonCUDA::test_out_warning__refs_atanh_cuda PASSED [1.2742s] [ 21%] 2025-12-04T15:22:22.7887112Z test_ops.py::TestCommonCUDA::test_out_warning__refs_atleast_1d_cuda PASSED [0.0032s] [ 21%] 2025-12-04T15:22:22.7887214Z test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_or_cuda PASSED [0.0316s] [ 21%] 2025-12-04T15:22:22.7887316Z test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_xor_cuda PASSED [0.0169s] [ 21%] 2025-12-04T15:22:22.7887416Z test_ops.py::TestCommonCUDA::test_out_warning__refs_bucketize_cuda PASSED [1.3024s] [ 21%] 2025-12-04T15:22:22.7887564Z test_ops.py::TestCommonCUDA::test_out_warning__refs_cauchy_cuda SKIPPED [0.0003s] (Expected: cauchy is not comparable) [ 21%] 2025-12-04T15:22:22.7887665Z test_ops.py::TestCommonCUDA::test_out_warning__refs_clamp_max_cuda PASSED [0.0364s] [ 21%] 2025-12-04T15:22:22.7887757Z test_ops.py::TestCommonCUDA::test_out_warning__refs_conj_cuda PASSED [1.2389s] [ 21%] 2025-12-04T15:22:22.7887868Z 
test_ops.py::TestCommonCUDA::test_out_warning__refs_constant_pad_nd_cuda PASSED [0.0035s] [ 21%] 2025-12-04T15:22:22.7887960Z test_ops.py::TestCommonCUDA::test_out_warning__refs_dot_cuda PASSED [1.2397s] [ 21%] 2025-12-04T15:22:22.7888106Z test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_cuda SKIPPED [0.0002s] (Expected: empty is not comparable) [ 21%] 2025-12-04T15:22:22.7888204Z test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fft2_cuda PASSED [1.2887s] [ 21%] 2025-12-04T15:22:22.7888310Z test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fftshift_cuda PASSED [0.0031s] [ 21%] 2025-12-04T15:22:22.7888418Z test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ihfft_cuda PASSED [1.2925s] [ 21%] 2025-12-04T15:22:22.7888520Z test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ihfftn_cuda PASSED [0.0356s] [ 21%] 2025-12-04T15:22:22.7888622Z test_ops.py::TestCommonCUDA::test_out_warning__refs_float_power_cuda PASSED [0.0190s] [ 21%] 2025-12-04T15:22:22.7888717Z test_ops.py::TestCommonCUDA::test_out_warning__refs_frac_cuda PASSED [1.2644s] [ 21%] 2025-12-04T15:22:22.7888812Z test_ops.py::TestCommonCUDA::test_out_warning__refs_gt_cuda PASSED [0.0267s] [ 21%] 2025-12-04T15:22:22.7888909Z test_ops.py::TestCommonCUDA::test_out_warning__refs_hypot_cuda PASSED [0.0184s] [ 21%] 2025-12-04T15:22:22.7889002Z test_ops.py::TestCommonCUDA::test_out_warning__refs_i0_cuda PASSED [1.2586s] [ 21%] 2025-12-04T15:22:22.7889104Z test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_svd_cuda PASSED [0.4329s] [ 21%] 2025-12-04T15:22:22.7889212Z test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_vecdot_cuda PASSED [0.0752s] [ 21%] 2025-12-04T15:22:22.7889314Z test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_not_cuda PASSED [1.2362s] [ 21%] 2025-12-04T15:22:22.7889413Z test_ops.py::TestCommonCUDA::test_out_warning__refs_logspace_cuda PASSED [0.6618s] [ 21%] 2025-12-04T15:22:22.7889532Z 
test_ops.py::TestCommonCUDA::test_out_warning__refs_logspace_tensor_overload_cuda PASSED [2.1125s] [ 21%] 2025-12-04T15:22:22.7889632Z test_ops.py::TestCommonCUDA::test_out_warning__refs_logsumexp_cuda PASSED [1.2875s] [ 21%] 2025-12-04T15:22:22.7889744Z test_ops.py::TestCommonCUDA::test_out_warning__refs_lt_cuda PASSED [0.0352s] [ 21%] 2025-12-04T15:22:22.7889842Z test_ops.py::TestCommonCUDA::test_out_warning__refs_maximum_cuda PASSED [0.0465s] [ 21%] 2025-12-04T15:22:22.7889948Z test_ops.py::TestCommonCUDA::test_out_warning__refs_minimum_cuda PASSED [0.0179s] [ 21%] 2025-12-04T15:22:22.7890064Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_l1_loss_cuda PASSED [1.2282s] [ 21%] 2025-12-04T15:22:22.7890254Z test_ops.py::TestCommonCUDA::test_out_warning__refs_normal_number_mean_cuda SKIPPED [0.0002s] (Expected: normal is not comparable) [ 21%] 2025-12-04T15:22:22.7890354Z test_ops.py::TestCommonCUDA::test_out_warning__refs_positive_cuda PASSED [1.2422s] [ 21%] 2025-12-04T15:22:22.7890449Z test_ops.py::TestCommonCUDA::test_out_warning__refs_rad2deg_cuda PASSED [0.0110s] [ 21%] 2025-12-04T15:22:22.7890550Z test_ops.py::TestCommonCUDA::test_out_warning__refs_remainder_cuda PASSED [0.0282s] [ 21%] 2025-12-04T15:22:22.7890645Z test_ops.py::TestCommonCUDA::test_out_warning__refs_roll_cuda PASSED [0.0020s] [ 21%] 2025-12-04T15:22:22.7890755Z test_ops.py::TestCommonCUDA::test_out_warning__refs_special_log_ndtr_cuda PASSED [1.2528s] [ 21%] 2025-12-04T15:22:22.7890864Z test_ops.py::TestCommonCUDA::test_out_warning__refs_special_logit_cuda PASSED [0.0278s] [ 21%] 2025-12-04T15:22:22.7890960Z test_ops.py::TestCommonCUDA::test_out_warning__refs_squeeze_cuda PASSED [1.2458s] [ 21%] 2025-12-04T15:22:22.7891060Z test_ops.py::TestCommonCUDA::test_out_warning__refs_stack_cuda PASSED [0.0328s] [ 21%] 2025-12-04T15:22:22.7891156Z test_ops.py::TestCommonCUDA::test_out_warning__refs_t_copy_cuda PASSED [1.2716s] [ 21%] 2025-12-04T15:22:22.7891249Z 
test_ops.py::TestCommonCUDA::test_out_warning__refs_tanh_cuda PASSED [0.0114s] [ 21%] 2025-12-04T15:22:22.7891340Z test_ops.py::TestCommonCUDA::test_out_warning__refs_to_cuda PASSED [1.2389s] [ 21%] 2025-12-04T15:22:22.7891445Z test_ops.py::TestCommonCUDA::test_out_warning__refs_triu_indices_cuda PASSED [0.0032s] [ 21%] 2025-12-04T15:22:22.7891547Z test_ops.py::TestCommonCUDA::test_out_warning__refs_true_divide_cuda PASSED [0.0380s] [ 21%] 2025-12-04T15:22:22.7891655Z test_ops.py::TestCommonCUDA::test_out_warning__refs_unsqueeze_copy_cuda PASSED [1.2258s] [ 21%] 2025-12-04T15:22:22.7891750Z test_ops.py::TestCommonCUDA::test_out_warning__refs_zeros_cuda PASSED [0.0158s] [ 21%] 2025-12-04T15:22:22.7891862Z test_ops.py::TestCommonCUDA::test_out_warning__segment_reduce_offsets_cuda PASSED [1.2877s] [ 21%] 2025-12-04T15:22:22.7891962Z test_ops.py::TestCommonCUDA::test_out_warning_add_cuda PASSED [0.0360s] [ 21%] 2025-12-04T15:22:22.7892049Z test_ops.py::TestCommonCUDA::test_out_warning_addr_cuda PASSED [1.2750s] [ 21%] 2025-12-04T15:22:22.7892158Z test_ops.py::TestCommonCUDA::test_out_warning_as_strided_partial_views_cuda PASSED [0.0033s] [ 21%] 2025-12-04T15:22:22.7892251Z test_ops.py::TestCommonCUDA::test_out_warning_bfloat16_cuda PASSED [1.2403s] [ 21%] 2025-12-04T15:22:22.7892354Z test_ops.py::TestCommonCUDA::test_out_warning_broadcast_shapes_cuda PASSED [0.0034s] [ 21%] 2025-12-04T15:22:22.7892451Z test_ops.py::TestCommonCUDA::test_out_warning_broadcast_to_cuda PASSED [1.2510s] [ 21%] 2025-12-04T15:22:22.7892548Z test_ops.py::TestCommonCUDA::test_out_warning_cartesian_prod_cuda PASSED [0.0033s] [ 21%] 2025-12-04T15:22:22.7892635Z test_ops.py::TestCommonCUDA::test_out_warning_cat_cuda PASSED [1.2885s] [ 21%] 2025-12-04T15:22:22.7892724Z test_ops.py::TestCommonCUDA::test_out_warning_ceil_cuda PASSED [0.0103s] [ 21%] 2025-12-04T15:22:22.7892811Z test_ops.py::TestCommonCUDA::test_out_warning_chalf_cuda PASSED [1.2297s] [ 21%] 2025-12-04T15:22:22.7892899Z 
test_ops.py::TestCommonCUDA::test_out_warning_clamp_cuda PASSED [0.0325s] [ 21%] 2025-12-04T15:22:22.7892983Z test_ops.py::TestCommonCUDA::test_out_warning_cov_cuda PASSED [1.2531s] [ 22%] 2025-12-04T15:22:22.7893092Z test_ops.py::TestCommonCUDA::test_out_warning_cumulative_trapezoid_cuda PASSED [0.0035s] [ 22%] 2025-12-04T15:22:22.7893197Z test_ops.py::TestCommonCUDA::test_out_warning_diagonal_cuda PASSED [1.2180s] [ 22%] 2025-12-04T15:22:22.7893315Z test_ops.py::TestCommonCUDA::test_out_warning_div_trunc_rounding_cuda PASSED [0.0345s] [ 22%] 2025-12-04T15:22:22.7893420Z test_ops.py::TestCommonCUDA::test_out_warning_dot_cuda PASSED [1.2646s] [ 22%] 2025-12-04T15:22:22.7893510Z test_ops.py::TestCommonCUDA::test_out_warning_double_cuda PASSED [0.0033s] [ 22%] 2025-12-04T15:22:22.7893598Z test_ops.py::TestCommonCUDA::test_out_warning_einsum_cuda PASSED [1.2397s] [ 22%] 2025-12-04T15:22:22.7893734Z test_ops.py::TestCommonCUDA::test_out_warning_empty_cuda SKIPPED [0.0002s] (Expected: empty is not comparable) [ 22%] 2025-12-04T15:22:22.7893896Z test_ops.py::TestCommonCUDA::test_out_warning_empty_permuted_cuda SKIPPED [0.0002s] (Expected: empty_permuted is not comparable) [ 22%] 2025-12-04T15:22:22.7893982Z test_ops.py::TestCommonCUDA::test_out_warning_eq_cuda PASSED [1.2860s] [ 22%] 2025-12-04T15:22:22.7894072Z test_ops.py::TestCommonCUDA::test_out_warning_fft_hfftn_cuda PASSED [0.0373s] [ 22%] 2025-12-04T15:22:22.7894167Z test_ops.py::TestCommonCUDA::test_out_warning_fft_ihfft_cuda PASSED [1.2484s] [ 22%] 2025-12-04T15:22:22.7894260Z test_ops.py::TestCommonCUDA::test_out_warning_fft_irfft_cuda PASSED [0.0432s] [ 22%] 2025-12-04T15:22:22.7894355Z test_ops.py::TestCommonCUDA::test_out_warning_flatten_cuda PASSED [1.2550s] [ 22%] 2025-12-04T15:22:22.7894441Z test_ops.py::TestCommonCUDA::test_out_warning_ge_cuda PASSED [0.0361s] [ 22%] 2025-12-04T15:22:22.7894530Z test_ops.py::TestCommonCUDA::test_out_warning_hstack_cuda PASSED [1.2840s] [ 22%] 2025-12-04T15:22:22.7894621Z 
test_ops.py::TestCommonCUDA::test_out_warning_index_fill_cuda PASSED [0.0036s] [ 22%] 2025-12-04T15:22:22.7894724Z test_ops.py::TestCommonCUDA::test_out_warning_index_reduce_amax_cuda PASSED [1.2851s] [ 22%] 2025-12-04T15:22:22.7894812Z test_ops.py::TestCommonCUDA::test_out_warning_isinf_cuda PASSED [0.0034s] [ 22%] 2025-12-04T15:22:22.7894899Z test_ops.py::TestCommonCUDA::test_out_warning_isnan_cuda PASSED [1.2626s] [ 22%] 2025-12-04T15:22:22.7894994Z test_ops.py::TestCommonCUDA::test_out_warning_isneginf_cuda PASSED [0.0107s] [ 22%] 2025-12-04T15:22:22.7895097Z test_ops.py::TestCommonCUDA::test_out_warning_jiterator_binary_cuda PASSED [1.2262s] [ 22%] 2025-12-04T15:22:22.7895222Z test_ops.py::TestCommonCUDA::test_out_warning_jiterator_binary_return_by_ref_cuda PASSED [0.0045s] [ 22%] 2025-12-04T15:22:22.7895306Z test_ops.py::TestCommonCUDA::test_out_warning_kron_cuda PASSED [1.2847s] [ 22%] 2025-12-04T15:22:22.7895426Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_cholesky_ex_cuda PASSED [0.0391s] [ 22%] 2025-12-04T15:22:22.7895526Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_solve_cuda PASSED [0.0995s] [ 22%] 2025-12-04T15:22:22.7895631Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_norm_cuda PASSED [1.3354s] [ 22%] 2025-12-04T15:22:22.7895738Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_power_cuda PASSED [0.0431s] [ 22%] 2025-12-04T15:22:22.7895852Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_hermitian_cuda PASSED [0.0126s] [ 22%] 2025-12-04T15:22:22.7896040Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_singular_cuda SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 22%] 2025-12-04T15:22:22.7896133Z test_ops.py::TestCommonCUDA::test_out_warning_log10_cuda PASSED [1.2422s] [ 22%] 2025-12-04T15:22:22.7896227Z test_ops.py::TestCommonCUDA::test_out_warning_log_normal_cuda PASSED [0.0037s] [ 22%] 2025-12-04T15:22:22.7896325Z 
test_ops.py::TestCommonCUDA::test_out_warning_logical_or_cuda PASSED [0.0387s] [ 22%] 2025-12-04T15:22:22.7896413Z test_ops.py::TestCommonCUDA::test_out_warning_logit_cuda PASSED [1.2452s] [ 22%] 2025-12-04T15:22:22.7896503Z test_ops.py::TestCommonCUDA::test_out_warning_lu_cuda XFAIL [0.0119s] [ 22%] 2025-12-04T15:22:22.7896606Z test_ops.py::TestCommonCUDA::test_out_warning_masked_logaddexp_cuda PASSED [2.4727s] [ 22%] 2025-12-04T15:22:22.7896719Z test_ops.py::TestCommonCUDA::test_out_warning_masked_select_cuda PASSED [1.2812s] [ 22%] 2025-12-04T15:22:22.7896826Z test_ops.py::TestCommonCUDA::test_out_warning_masked_var_cuda PASSED [0.0036s] [ 22%] 2025-12-04T15:22:22.7896920Z test_ops.py::TestCommonCUDA::test_out_warning_max_binary_cuda PASSED [0.0298s] [ 22%] 2025-12-04T15:22:22.7897044Z test_ops.py::TestCommonCUDA::test_out_warning_min_reduction_no_dim_cuda PASSED [1.2539s] [ 22%] 2025-12-04T15:22:22.7897137Z test_ops.py::TestCommonCUDA::test_out_warning_movedim_cuda PASSED [0.0033s] [ 22%] 2025-12-04T15:22:22.7897227Z test_ops.py::TestCommonCUDA::test_out_warning_mv_cuda PASSED [1.2712s] [ 22%] 2025-12-04T15:22:22.7897337Z test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_3_cuda PASSED [0.0422s] [ 22%] 2025-12-04T15:22:22.7897450Z test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_5_cuda PASSED [1.2550s] [ 22%] 2025-12-04T15:22:22.7897547Z test_ops.py::TestCommonCUDA::test_out_warning_nanquantile_cuda PASSED [0.1369s] [ 22%] 2025-12-04T15:22:22.7897641Z test_ops.py::TestCommonCUDA::test_out_warning_nansum_cuda PASSED [1.2823s] [ 22%] 2025-12-04T15:22:22.7897730Z test_ops.py::TestCommonCUDA::test_out_warning_narrow_cuda PASSED [0.0033s] [ 22%] 2025-12-04T15:22:22.7897838Z test_ops.py::TestCommonCUDA::test_out_warning_native_layer_norm_cuda PASSED [1.2394s] [ 22%] 2025-12-04T15:22:22.7897923Z test_ops.py::TestCommonCUDA::test_out_warning_ne_cuda PASSED [0.0350s] [ 22%] 2025-12-04T15:22:22.7898019Z 
test_ops.py::TestCommonCUDA::test_out_warning_new_full_cuda PASSED [1.2349s] [ 22%] 2025-12-04T15:22:22.7898147Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_avg_pool1d_cuda PASSED [0.0030s] [ 22%] 2025-12-04T15:22:22.7898275Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_max_pool1d_cuda PASSED [1.2078s] [ 22%] 2025-12-04T15:22:22.7898388Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool2d_cuda PASSED [0.0306s] [ 22%] 2025-12-04T15:22:22.7898514Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv_transpose1d_cuda PASSED [1.2566s] [ 22%] 2025-12-04T15:22:22.7898631Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_dropout2d_cuda PASSED [0.0036s] [ 22%] 2025-12-04T15:22:22.7898758Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_bilinear_cuda PASSED [1.2340s] [ 22%] 2025-12-04T15:22:22.7898885Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_nearest_cuda PASSED [0.0032s] [ 22%] 2025-12-04T15:22:22.7899009Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_leaky_relu_cuda PASSED [1.2208s] [ 22%] 2025-12-04T15:22:22.7899127Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool1d_cuda PASSED [0.0034s] [ 22%] 2025-12-04T15:22:22.7899240Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_cuda PASSED [1.2127s] [ 22%] 2025-12-04T15:22:22.7899356Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool3d_cuda PASSED [0.0034s] [ 22%] 2025-12-04T15:22:22.7899478Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool3d_grad_cuda PASSED [1.2834s] [ 22%] 2025-12-04T15:22:22.7899592Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_normalize_cuda PASSED [0.0338s] [ 22%] 2025-12-04T15:22:22.7899703Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_rms_norm_cuda PASSED [1.2206s] [ 22%] 2025-12-04T15:22:22.7899813Z 
test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_selu_cuda PASSED [0.0032s] [ 22%] 2025-12-04T15:22:22.7899929Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_smooth_l1_loss_cuda PASSED [1.2566s] [ 22%] 2025-12-04T15:22:22.7900041Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softmin_cuda PASSED [0.0032s] [ 23%] 2025-12-04T15:22:22.7900187Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softplus_cuda PASSED [1.2326s] [ 23%] 2025-12-04T15:22:22.7900303Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softshrink_cuda PASSED [0.0252s] [ 23%] 2025-12-04T15:22:22.7900475Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_triplet_margin_with_distance_loss_cuda PASSED [1.2557s] [ 23%] 2025-12-04T15:22:22.7900572Z test_ops.py::TestCommonCUDA::test_out_warning_norm_inf_cuda PASSED [0.0241s] [ 23%] 2025-12-04T15:22:22.7900676Z test_ops.py::TestCommonCUDA::test_out_warning_ones_cuda XFAIL [0.0058s] [ 23%] 2025-12-04T15:22:22.7900776Z test_ops.py::TestCommonCUDA::test_out_warning_permute_copy_cuda PASSED [1.2381s] [ 23%] 2025-12-04T15:22:22.7900910Z test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_3_cuda SKIPPED [0.0002s] (Skipped!) [ 23%] 2025-12-04T15:22:22.7901037Z test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_4_cuda SKIPPED [0.0002s] (Skipped!) 
[ 23%] 2025-12-04T15:22:22.7901126Z test_ops.py::TestCommonCUDA::test_out_warning_prod_cuda PASSED [1.2177s] [ 23%] 2025-12-04T15:22:22.7901215Z test_ops.py::TestCommonCUDA::test_out_warning_randint_cuda XFAIL [0.0041s] [ 23%] 2025-12-04T15:22:22.7901303Z test_ops.py::TestCommonCUDA::test_out_warning_randn_cuda XFAIL [1.2214s] [ 23%] 2025-12-04T15:22:22.7901397Z test_ops.py::TestCommonCUDA::test_out_warning_remainder_cuda PASSED [0.0307s] [ 23%] 2025-12-04T15:22:22.7901497Z test_ops.py::TestCommonCUDA::test_out_warning_resolve_conj_cuda PASSED [1.2218s] [ 23%] 2025-12-04T15:22:22.7901604Z test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_amax_cuda PASSED [0.0582s] [ 23%] 2025-12-04T15:22:22.7901711Z test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_sum_cuda PASSED [1.2832s] [ 23%] 2025-12-04T15:22:22.7901830Z test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_general_cosine_cuda PASSED [0.0029s] [ 23%] 2025-12-04T15:22:22.7901920Z test_ops.py::TestCommonCUDA::test_out_warning_slice_cuda PASSED [1.2328s] [ 23%] 2025-12-04T15:22:22.7902006Z test_ops.py::TestCommonCUDA::test_out_warning_sort_cuda PASSED [0.0787s] [ 23%] 2025-12-04T15:22:22.7902131Z test_ops.py::TestCommonCUDA::test_out_warning_sparse_sampled_addmm_cuda SKIPPED [0.0002s] (Skipped!) 
[ 23%] 2025-12-04T15:22:22.7902230Z test_ops.py::TestCommonCUDA::test_out_warning_special_erfcx_cuda PASSED [0.0057s] [ 23%] 2025-12-04T15:22:22.7902327Z test_ops.py::TestCommonCUDA::test_out_warning_special_i0e_cuda PASSED [1.2284s] [ 23%] 2025-12-04T15:22:22.7902422Z test_ops.py::TestCommonCUDA::test_out_warning_special_i1e_cuda PASSED [0.0160s] [ 23%] 2025-12-04T15:22:22.7902543Z test_ops.py::TestCommonCUDA::test_out_warning_special_laguerre_polynomial_l_cuda PASSED [0.0261s] [ 23%] 2025-12-04T15:22:22.7902679Z test_ops.py::TestCommonCUDA::test_out_warning_special_legendre_polynomial_p_cuda PASSED [0.0162s] [ 23%] 2025-12-04T15:22:22.7902793Z test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_k1_cuda PASSED [0.0056s] [ 23%] 2025-12-04T15:22:22.7902910Z test_ops.py::TestCommonCUDA::test_out_warning_special_spherical_bessel_j0_cuda PASSED [1.2334s] [ 23%] 2025-12-04T15:22:22.7902998Z test_ops.py::TestCommonCUDA::test_out_warning_square_cuda PASSED [0.0173s] [ 23%] 2025-12-04T15:22:22.7903090Z test_ops.py::TestCommonCUDA::test_out_warning_stack_cuda PASSED [0.0246s] [ 23%] 2025-12-04T15:22:22.7903193Z test_ops.py::TestCommonCUDA::test_out_warning_std_mean_unbiased_cuda PASSED [1.2398s] [ 23%] 2025-12-04T15:22:22.7903293Z test_ops.py::TestCommonCUDA::test_out_warning_std_unbiased_cuda PASSED [0.0033s] [ 23%] 2025-12-04T15:22:22.7903379Z test_ops.py::TestCommonCUDA::test_out_warning_stft_cuda PASSED [1.2194s] [ 23%] 2025-12-04T15:22:22.7903519Z test_ops.py::TestCommonCUDA::test_out_warning_torch__scaled_mm_cuda SKIPPED [0.0012s] (Requires CUDA SM >= 8.9) [ 23%] 2025-12-04T15:22:22.7903655Z test_ops.py::TestCommonCUDA::test_out_warning_torch__scaled_mm_v2_cuda SKIPPED [0.0008s] (Requires CUDA SM >= 8.9) [ 23%] 2025-12-04T15:22:22.7903884Z test_ops.py::TestCommonCUDA::test_out_warning_torch_ops_aten__efficient_attention_forward_cuda SKIPPED [0.0007s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 23%] 2025-12-04T15:22:22.7904011Z 
test_ops.py::TestCommonCUDA::test_out_warning_torch_ops_aten__safe_softmax_default_cuda PASSED [1.2129s] [ 23%] 2025-12-04T15:22:22.7904122Z test_ops.py::TestCommonCUDA::test_out_warning_trace_cuda PASSED [0.0032s] [ 23%] 2025-12-04T15:22:22.7904226Z test_ops.py::TestCommonCUDA::test_out_warning_triangular_solve_cuda PASSED [0.0381s] [ 23%] 2025-12-04T15:22:22.7904336Z test_ops.py::TestCommonCUDA::test_out_warning_tril_indices_cuda PASSED [0.0022s] [ 23%] 2025-12-04T15:22:22.7904433Z test_ops.py::TestCommonCUDA::test_out_warning_triu_indices_cuda PASSED [0.0020s] [ 23%] 2025-12-04T15:22:22.7904522Z test_ops.py::TestCommonCUDA::test_out_warning_trunc_cuda PASSED [1.2621s] [ 23%] 2025-12-04T15:22:22.7904630Z test_ops.py::TestCommonCUDA::test_out_warning_unique_consecutive_cuda PASSED [0.0034s] [ 23%] 2025-12-04T15:22:22.7904725Z test_ops.py::TestCommonCUDA::test_out_warning_unravel_index_cuda PASSED [1.2300s] [ 23%] 2025-12-04T15:22:22.7904821Z test_ops.py::TestCommonCUDA::test_out_warning_unsafe_chunk_cuda PASSED [0.0031s] [ 23%] 2025-12-04T15:22:22.7904917Z test_ops.py::TestCommonCUDA::test_out_warning_unsafe_split_cuda PASSED [1.2435s] [ 23%] 2025-12-04T15:22:22.7905005Z test_ops.py::TestCommonCUDA::test_out_warning_xlogy_cuda PASSED [0.0397s] [ 23%] 2025-12-04T15:22:22.7905098Z test_ops.py::TestCommonCUDA::test_out_warning_zeros_like_cuda PASSED [1.2666s] [ 23%] 2025-12-04T15:22:22.7905210Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int64 PASSED [0.0035s] [ 23%] 2025-12-04T15:22:22.7905315Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int8 PASSED [1.2306s] [ 23%] 2025-12-04T15:22:22.7905425Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int8 PASSED [0.0036s] [ 23%] 2025-12-04T15:22:22.7905532Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_uint8 PASSED [1.2435s] [ 23%] 2025-12-04T15:22:22.7905639Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int32 PASSED 
[0.0032s] [ 23%] 2025-12-04T15:22:22.7905744Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int8 PASSED [1.2334s] [ 23%] 2025-12-04T15:22:22.7905854Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int16 PASSED [0.0033s] [ 23%] 2025-12-04T15:22:22.7905958Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_uint8 PASSED [1.2191s] [ 23%] 2025-12-04T15:22:22.7906067Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int16 PASSED [0.0045s] [ 23%] 2025-12-04T15:22:22.7906183Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int8 PASSED [0.0035s] [ 23%] 2025-12-04T15:22:22.7906291Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int8 PASSED [1.2644s] [ 23%] 2025-12-04T15:22:22.7906398Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_bool PASSED [0.0032s] [ 23%] 2025-12-04T15:22:22.7906501Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_int16 PASSED [1.1971s] [ 23%] 2025-12-04T15:22:22.7906615Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int16 PASSED [0.0032s] [ 23%] 2025-12-04T15:22:22.7906727Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int8 PASSED [1.2305s] [ 23%] 2025-12-04T15:22:22.7906838Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_int64 PASSED [0.0050s] [ 23%] 2025-12-04T15:22:22.7906964Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int64 PASSED [0.0041s] [ 23%] 2025-12-04T15:22:22.7907074Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_int32 PASSED [1.2590s] [ 23%] 2025-12-04T15:22:22.7907180Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int32 PASSED [0.0036s] [ 23%] 2025-12-04T15:22:22.7907287Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int64 PASSED [1.2365s] [ 23%] 2025-12-04T15:22:22.7907392Z 
test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_uint8 PASSED [0.0033s] [ 24%] 2025-12-04T15:22:22.7907507Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int64 PASSED [0.0039s] [ 24%] 2025-12-04T15:22:22.7907638Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int16 PASSED [1.2367s] [ 24%] 2025-12-04T15:22:22.7907745Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int32 PASSED [0.0035s] [ 24%] 2025-12-04T15:22:22.7907859Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int8 PASSED [1.2133s] [ 24%] 2025-12-04T15:22:22.7907968Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_uint8 PASSED [0.0036s] [ 24%] 2025-12-04T15:22:22.7908074Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int16 PASSED [1.2317s] [ 24%] 2025-12-04T15:22:22.7908182Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int8 PASSED [0.0032s] [ 24%] 2025-12-04T15:22:22.7908288Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_bool PASSED [1.2409s] [ 24%] 2025-12-04T15:22:22.7908392Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_int64 PASSED [0.0036s] [ 24%] 2025-12-04T15:22:22.7908503Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int64 PASSED [1.2064s] [ 24%] 2025-12-04T15:22:22.7908607Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int8 PASSED [0.0037s] [ 24%] 2025-12-04T15:22:22.7908722Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int8 PASSED [0.0264s] [ 24%] 2025-12-04T15:22:22.7908850Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_int8 PASSED [1.2477s] [ 24%] 2025-12-04T15:22:22.7908983Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int8 PASSED [0.0044s] [ 24%] 2025-12-04T15:22:22.7909132Z 
test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_int16 SKIPPED [0.0002s] (Skipped!) [ 24%] 2025-12-04T15:22:22.7909246Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int16 PASSED [1.2339s] [ 24%] 2025-12-04T15:22:22.7909359Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int16 PASSED [0.0036s] [ 24%] 2025-12-04T15:22:22.7909475Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int64 PASSED [1.2198s] [ 24%] 2025-12-04T15:22:22.7909586Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_uint8 PASSED [0.0036s] [ 24%] 2025-12-04T15:22:22.7909696Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_bool PASSED [1.2377s] [ 24%] 2025-12-04T15:22:22.7909812Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_int32 PASSED [0.0035s] [ 24%] 2025-12-04T15:22:22.7909924Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_uint8 PASSED [1.2404s] [ 24%] 2025-12-04T15:22:22.7910031Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int8 PASSED [0.0031s] [ 24%] 2025-12-04T15:22:22.7910180Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_uint8 PASSED [1.2581s] [ 24%] 2025-12-04T15:22:22.7910288Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_uint8 PASSED [0.0036s] [ 24%] 2025-12-04T15:22:22.7910431Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_int32 PASSED [0.0037s] [ 24%] 2025-12-04T15:22:22.7910572Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int16 PASSED [0.0035s] [ 24%] 2025-12-04T15:22:22.7910712Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int32 PASSED [0.0058s] [ 24%] 2025-12-04T15:22:22.7910854Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_uint8 PASSED [0.0032s] [ 
24%] 2025-12-04T15:22:22.7910991Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_int8 PASSED [0.0033s] [ 24%] 2025-12-04T15:22:22.7911132Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_bool PASSED [0.0049s] [ 24%] 2025-12-04T15:22:22.7911268Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_int8 PASSED [0.0033s] [ 24%] 2025-12-04T15:22:22.7911438Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int32 PASSED [0.0048s] [ 24%] 2025-12-04T15:22:22.7911591Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_int32 PASSED [0.0033s] [ 24%] 2025-12-04T15:22:22.7911743Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_bool PASSED [0.0047s] [ 24%] 2025-12-04T15:22:22.7911895Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_bool PASSED [0.0031s] [ 24%] 2025-12-04T15:22:22.7912046Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_uint8 PASSED [0.0033s] [ 24%] 2025-12-04T15:22:22.7912195Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_bool PASSED [0.0031s] [ 24%] 2025-12-04T15:22:22.7912303Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int32 PASSED [1.2342s] [ 24%] 2025-12-04T15:22:22.7912408Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int64 PASSED [0.0033s] [ 24%] 2025-12-04T15:22:22.7912514Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int8 PASSED [1.2594s] [ 24%] 2025-12-04T15:22:22.7912629Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_uint8 PASSED [0.0051s] [ 24%] 2025-12-04T15:22:22.7912738Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int16 PASSED 
[0.0035s] [ 24%] 2025-12-04T15:22:22.7912836Z test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int8 PASSED [1.2647s] [ 24%] 2025-12-04T15:22:22.7912933Z test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_uint8 PASSED [0.0044s] [ 24%] 2025-12-04T15:22:22.7913060Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_float32 PASSED [0.0216s] [ 24%] 2025-12-04T15:22:22.7913184Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_complex64 PASSED [0.0291s] [ 24%] 2025-12-04T15:22:22.7913301Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int32 PASSED [1.2425s] [ 24%] 2025-12-04T15:22:22.7913414Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int64 PASSED [0.0173s] [ 24%] 2025-12-04T15:22:22.7913538Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_complex128 PASSED [0.0300s] [ 24%] 2025-12-04T15:22:22.7913669Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_float32 PASSED [0.0164s] [ 24%] 2025-12-04T15:22:22.7913796Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_complex128 PASSED [1.2526s] [ 24%] 2025-12-04T15:22:22.7913919Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_float32 PASSED [0.0242s] [ 24%] 2025-12-04T15:22:22.7914039Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_uint8 PASSED [0.0183s] [ 24%] 2025-12-04T15:22:22.7914162Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float16 PASSED [0.0206s] [ 24%] 2025-12-04T15:22:22.7914282Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float32 PASSED [1.2880s] [ 24%] 2025-12-04T15:22:22.7914400Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int64 PASSED [0.0217s] [ 24%] 2025-12-04T15:22:22.7914518Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_uint8 
PASSED [0.0180s] [ 24%] 2025-12-04T15:22:22.7914633Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_bool PASSED [0.0214s] [ 24%] 2025-12-04T15:22:22.7914751Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float32 PASSED [0.0206s] [ 24%] 2025-12-04T15:22:22.7914869Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_bfloat16 PASSED [0.0161s] [ 24%] 2025-12-04T15:22:22.7914989Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex128 PASSED [0.0290s] [ 24%] 2025-12-04T15:22:22.7915132Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex64 PASSED [0.0289s] [ 24%] 2025-12-04T15:22:22.7915258Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float16 PASSED [0.0161s] [ 24%] 2025-12-04T15:22:22.7915375Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float32 PASSED [1.2742s] [ 24%] 2025-12-04T15:22:22.7915496Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float64 PASSED [0.0191s] [ 24%] 2025-12-04T15:22:22.7915611Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_int64 PASSED [0.0154s] [ 25%] 2025-12-04T15:22:22.7915736Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_complex_cuda_float16 PASSED [0.0467s] [ 25%] 2025-12-04T15:22:22.7915856Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_complex_cuda_float64 PASSED [0.0465s] [ 25%] 2025-12-04T15:22:22.7915977Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float32 PASSED [0.0199s] [ 25%] 2025-12-04T15:22:22.7916096Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float64 PASSED [1.2431s] [ 25%] 2025-12-04T15:22:22.7916215Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int32 PASSED [0.0207s] [ 25%] 2025-12-04T15:22:22.7916331Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int8 PASSED [0.0174s] [ 25%] 2025-12-04T15:22:22.7916454Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_complex64 PASSED [0.0367s] [ 25%] 2025-12-04T15:22:22.7916572Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_float64 PASSED [0.0208s] [ 25%] 2025-12-04T15:22:22.7916689Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int16 PASSED [0.0180s] [ 25%] 2025-12-04T15:22:22.7916802Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int8 PASSED [1.2471s] [ 25%] 2025-12-04T15:22:22.7916923Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float32 PASSED [0.0228s] [ 25%] 2025-12-04T15:22:22.7917038Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int8 PASSED [0.0176s] [ 25%] 2025-12-04T15:22:22.7917158Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_complex32 PASSED [0.0296s] [ 25%] 2025-12-04T15:22:22.7917287Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_float64 PASSED [1.2684s] [ 25%] 2025-12-04T15:22:22.7917411Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_complex128 PASSED [0.0321s] [ 25%] 2025-12-04T15:22:22.7917531Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_float16 PASSED [0.0167s] [ 25%] 2025-12-04T15:22:22.7917648Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int16 PASSED [1.2707s] [ 25%] 2025-12-04T15:22:22.7917767Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int32 PASSED [0.0175s] [ 25%] 2025-12-04T15:22:22.7917869Z test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_bfloat16 PASSED [0.0222s] [ 25%] 2025-12-04T15:22:22.7917976Z test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_complex64 PASSED [1.3120s] [ 25%] 
2025-12-04T15:22:22.7918078Z test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_float64 PASSED [0.0178s] [ 25%] 2025-12-04T15:22:22.7918178Z test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int32 PASSED [0.0115s] [ 25%] 2025-12-04T15:22:22.7918281Z test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_bfloat16 PASSED [0.0233s] [ 25%] 2025-12-04T15:22:22.7918380Z test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_bool PASSED [0.0211s] [ 25%] 2025-12-04T15:22:22.7918485Z test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_complex32 PASSED [0.0369s] [ 25%] 2025-12-04T15:22:22.7918605Z test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_complex64 PASSED [0.0301s] [ 25%] 2025-12-04T15:22:22.7918717Z test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_complex64 PASSED [0.0759s] [ 25%] 2025-12-04T15:22:22.7918819Z test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_float32 PASSED [1.3174s] [ 25%] 2025-12-04T15:22:22.7918933Z test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_float64 PASSED [0.0647s] [ 25%] 2025-12-04T15:22:22.7919031Z test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int16 PASSED [0.0484s] [ 25%] 2025-12-04T15:22:22.7919129Z test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int64 PASSED [0.0478s] [ 25%] 2025-12-04T15:22:22.7919237Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_complex64 PASSED [0.5964s] [ 25%] 2025-12-04T15:22:22.7919342Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float64 PASSED [0.0502s] [ 25%] 2025-12-04T15:22:22.7919443Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_uint8 PASSED [0.3158s] [ 25%] 2025-12-04T15:22:22.7919548Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_bfloat16 XFAIL [0.0070s] [ 25%] 2025-12-04T15:22:22.7919644Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_bool XFAIL [1.2609s] [ 25%] 2025-12-04T15:22:22.7919751Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_complex128 XFAIL [1.2618s] [ 25%] 2025-12-04T15:22:22.7919850Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float64 XFAIL [1.2839s] [ 25%] 2025-12-04T15:22:22.7919949Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_int32 XFAIL [1.2314s] [ 25%] 2025-12-04T15:22:22.7920045Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_uint8 XFAIL [1.2280s] [ 25%] 2025-12-04T15:22:22.7920217Z test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_bfloat16 PASSED [2.4827s] [ 25%] 2025-12-04T15:22:22.7920328Z test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_complex32 PASSED [0.0051s] [ 25%] 2025-12-04T15:22:22.7920442Z test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_complex128 PASSED [0.0394s] [ 25%] 2025-12-04T15:22:22.7920553Z test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_complex64 PASSED [0.0373s] [ 25%] 2025-12-04T15:22:22.7920658Z test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_bfloat16 PASSED [1.2612s] [ 25%] 2025-12-04T15:22:22.7920760Z test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_float32 PASSED [0.0115s] [ 25%] 2025-12-04T15:22:22.7920875Z test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int16 PASSED [0.0079s] [ 25%] 2025-12-04T15:22:22.7920975Z test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int64 PASSED [0.0076s] [ 25%] 2025-12-04T15:22:22.7921073Z test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int32 PASSED [1.2419s] [ 25%] 2025-12-04T15:22:22.7921173Z test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_float32 PASSED [0.0162s] [ 25%] 2025-12-04T15:22:22.7921271Z test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int32 PASSED [0.0136s] [ 25%] 2025-12-04T15:22:22.7921370Z test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_uint8 PASSED [0.0150s] [ 25%] 2025-12-04T15:22:22.7921473Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_float16 PASSED [0.0172s] [ 25%] 2025-12-04T15:22:22.7921577Z test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int64 PASSED [0.0081s] [ 25%] 2025-12-04T15:22:22.7921692Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_bfloat16 XFAIL [0.0024s] [ 25%] 2025-12-04T15:22:22.7921807Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_float16 XFAIL [1.2759s] [ 25%] 2025-12-04T15:22:22.7921914Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float16 PASSED [1.2119s] [ 25%] 2025-12-04T15:22:22.7922019Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int8 PASSED [0.0055s] [ 25%] 2025-12-04T15:22:22.7922124Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_uint8 PASSED [0.0039s] [ 25%] 2025-12-04T15:22:22.7922288Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float32 PASSED [1.2557s] [ 25%] 2025-12-04T15:22:22.7922411Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_int16 PASSED [0.0049s] [ 25%] 2025-12-04T15:22:22.7922539Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_bool PASSED [1.2443s] [ 25%] 2025-12-04T15:22:22.7922645Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_complex64 PASSED [0.0314s] [ 25%] 2025-12-04T15:22:22.7922746Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_float32 PASSED [0.0154s] [ 25%] 2025-12-04T15:22:22.7922846Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_uint8 PASSED [0.0160s] [ 25%] 2025-12-04T15:22:22.7922949Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_bfloat16 PASSED [1.2473s] [ 25%] 2025-12-04T15:22:22.7923053Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float16 PASSED [0.0245s] [ 26%] 2025-12-04T15:22:22.7923155Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int16 PASSED 
[0.0173s] [ 26%] 2025-12-04T15:22:22.7923255Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int32 PASSED [1.3023s] [ 26%] 2025-12-04T15:22:22.7923354Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int8 PASSED [0.0190s] [ 26%] 2025-12-04T15:22:22.7923455Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_uint8 PASSED [0.0161s] [ 26%] 2025-12-04T15:22:22.7923552Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_uint8 PASSED [0.0706s] [ 26%] 2025-12-04T15:22:22.7923651Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_bool PASSED [1.2550s] [ 26%] 2025-12-04T15:22:22.7923755Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_complex64 PASSED [0.2210s] [ 26%] 2025-12-04T15:22:22.7923857Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_float64 PASSED [0.0154s] [ 26%] 2025-12-04T15:22:22.7923966Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex128 PASSED [0.2268s] [ 26%] 2025-12-04T15:22:22.7924072Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex32 PASSED [0.2212s] [ 26%] 2025-12-04T15:22:22.7924171Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int16 PASSED [0.0172s] [ 26%] 2025-12-04T15:22:22.7924270Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_uint8 PASSED [0.0157s] [ 26%] 2025-12-04T15:22:22.7924388Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float16 PASSED [0.0045s] [ 26%] 2025-12-04T15:22:22.7924499Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float32 PASSED [1.2619s] [ 26%] 2025-12-04T15:22:22.7924604Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_uint8 PASSED [0.0052s] [ 26%] 2025-12-04T15:22:22.7924715Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_complex64 PASSED [0.0056s] [ 26%] 2025-12-04T15:22:22.7924821Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_uint8 PASSED [1.2320s] [ 26%] 
2025-12-04T15:22:22.7924933Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex64 PASSED [0.0074s] [ 26%] 2025-12-04T15:22:22.7925041Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int16 PASSED [1.2629s] [ 26%] 2025-12-04T15:22:22.7925155Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_left_shift_cuda_int8 PASSED [0.0483s] [ 26%] 2025-12-04T15:22:22.7925262Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int32 PASSED [0.0123s] [ 26%] 2025-12-04T15:22:22.7925367Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_bool PASSED [0.0435s] [ 26%] 2025-12-04T15:22:22.7925486Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int16 PASSED [0.0458s] [ 26%] 2025-12-04T15:22:22.7925592Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int16 PASSED [0.0456s] [ 26%] 2025-12-04T15:22:22.7925720Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_uint8 PASSED [0.0448s] [ 26%] 2025-12-04T15:22:22.7925831Z test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_bfloat16 PASSED [1.2279s] [ 26%] 2025-12-04T15:22:22.7925955Z test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_complex128 PASSED [0.0110s] [ 26%] 2025-12-04T15:22:22.7926060Z test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int32 PASSED [0.0083s] [ 26%] 2025-12-04T15:22:22.7926165Z test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int8 PASSED [0.0081s] [ 26%] 2025-12-04T15:22:22.7926279Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_uint8 PASSED [0.0059s] [ 26%] 2025-12-04T15:22:22.7926394Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_complex128 PASSED [0.0050s] [ 26%] 2025-12-04T15:22:22.7926506Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float64 PASSED [1.2436s] [ 26%] 2025-12-04T15:22:22.7926613Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_float32 PASSED [0.3278s] [ 26%] 2025-12-04T15:22:22.7926719Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int32 PASSED [0.3220s] [ 26%] 2025-12-04T15:22:22.7926823Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex64 PASSED [0.0086s] [ 26%] 2025-12-04T15:22:22.7926924Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float16 PASSED [0.0079s] [ 26%] 2025-12-04T15:22:22.7927023Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float32 PASSED [0.0079s] [ 26%] 2025-12-04T15:22:22.7927123Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_uint8 PASSED [1.2548s] [ 26%] 2025-12-04T15:22:22.7927221Z test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int16 PASSED [0.0157s] [ 26%] 2025-12-04T15:22:22.7927320Z test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int32 PASSED [1.2343s] [ 26%] 2025-12-04T15:22:22.7927418Z test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int64 PASSED [0.0151s] [ 26%] 2025-12-04T15:22:22.7927523Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float16 PASSED [0.0491s] [ 26%] 2025-12-04T15:22:22.7927626Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float32 PASSED [0.0366s] [ 26%] 2025-12-04T15:22:22.7927728Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_uint8 PASSED [0.0352s] [ 26%] 2025-12-04T15:22:22.7927845Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_float32 PASSED [0.0892s] [ 26%] 2025-12-04T15:22:22.7927951Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_bool PASSED [0.0705s] [ 26%] 2025-12-04T15:22:22.7928056Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex128 PASSED [0.0266s] [ 26%] 2025-12-04T15:22:22.7928162Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex32 PASSED [1.2794s] [ 26%] 2025-12-04T15:22:22.7928265Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_float16 PASSED [0.0283s] [ 26%] 2025-12-04T15:22:22.7928369Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_float32 PASSED [0.0259s] [ 26%] 2025-12-04T15:22:22.7928469Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int32 PASSED [0.0197s] [ 26%] 2025-12-04T15:22:22.7928579Z test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_float32 PASSED [0.0042s] [ 26%] 2025-12-04T15:22:22.7928688Z test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int16 PASSED [1.2369s] [ 26%] 2025-12-04T15:22:22.7928796Z test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int64 PASSED [0.0056s] [ 26%] 2025-12-04T15:22:22.7928902Z test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int8 PASSED [0.0039s] [ 26%] 2025-12-04T15:22:22.7929006Z test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_complex32 PASSED [0.1492s] [ 26%] 2025-12-04T15:22:22.7929105Z test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int8 PASSED [1.2523s] [ 26%] 2025-12-04T15:22:22.7929243Z test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_complex128 PASSED [0.0310s] [ 26%] 2025-12-04T15:22:22.7929354Z test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_int32 PASSED [1.2737s] [ 26%] 2025-12-04T15:22:22.7929472Z test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_int8 PASSED [0.0104s] [ 26%] 2025-12-04T15:22:22.7929591Z test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_bfloat16 PASSED [0.0393s] [ 26%] 2025-12-04T15:22:22.7929703Z test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_bool PASSED [0.0331s] [ 26%] 2025-12-04T15:22:22.7929818Z test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_float16 PASSED [0.0378s] [ 26%] 2025-12-04T15:22:22.7929928Z test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_bfloat16 PASSED 
[0.0220s] [ 26%] 2025-12-04T15:22:22.7930038Z test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_float64 PASSED [1.2754s] [ 26%] 2025-12-04T15:22:22.7930187Z test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_float32 PASSED [0.0978s] [ 26%] 2025-12-04T15:22:22.7930291Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_complex32 PASSED [0.4298s] [ 27%] 2025-12-04T15:22:22.7930390Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_uint8 PASSED [0.0180s] [ 27%] 2025-12-04T15:22:22.7930493Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_bfloat16 PASSED [0.0231s] [ 27%] 2025-12-04T15:22:22.7930600Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_complex128 PASSED [0.3865s] [ 27%] 2025-12-04T15:22:22.7930713Z test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_bfloat16 PASSED [0.0140s] [ 27%] 2025-12-04T15:22:22.7930825Z test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_float64 PASSED [0.0115s] [ 27%] 2025-12-04T15:22:22.7930933Z test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int16 PASSED [0.0113s] [ 27%] 2025-12-04T15:22:22.7931044Z test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int32 PASSED [1.2836s] [ 27%] 2025-12-04T15:22:22.7931147Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int16 PASSED [0.0164s] [ 27%] 2025-12-04T15:22:22.7931251Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int32 PASSED [1.2827s] [ 27%] 2025-12-04T15:22:22.7931357Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_complex128 PASSED [0.0092s] [ 27%] 2025-12-04T15:22:22.7931476Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int64 PASSED [0.0063s] [ 27%] 2025-12-04T15:22:22.7931577Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int8 PASSED [0.0067s] [ 27%] 2025-12-04T15:22:22.7931680Z test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int32 
PASSED [1.2552s] [ 27%] 2025-12-04T15:22:22.7931781Z test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_uint8 PASSED [0.0208s] [ 27%] 2025-12-04T15:22:22.7931888Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_complex32 PASSED [0.0108s] [ 27%] 2025-12-04T15:22:22.7931989Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_float32 PASSED [1.2942s] [ 27%] 2025-12-04T15:22:22.7932087Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_int8 PASSED [0.0110s] [ 27%] 2025-12-04T15:22:22.7932196Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float32 PASSED [0.0379s] [ 27%] 2025-12-04T15:22:22.7932304Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int16 PASSED [0.0340s] [ 27%] 2025-12-04T15:22:22.7932410Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_uint8 PASSED [1.2537s] [ 27%] 2025-12-04T15:22:22.7932518Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_bool PASSED [0.0125s] [ 27%] 2025-12-04T15:22:22.7932623Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_uint8 PASSED [0.0099s] [ 27%] 2025-12-04T15:22:22.7932752Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_float32 PASSED [0.0123s] [ 27%] 2025-12-04T15:22:22.7932879Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int16 PASSED [1.2786s] [ 27%] 2025-12-04T15:22:22.7933011Z test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_bfloat16 PASSED [0.3473s] [ 27%] 2025-12-04T15:22:22.7933133Z test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_float16 PASSED [0.0975s] [ 27%] 2025-12-04T15:22:22.7933250Z test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float16 PASSED [0.1072s] [ 27%] 2025-12-04T15:22:22.7933365Z test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_int64 PASSED [0.0526s] [ 27%] 2025-12-04T15:22:22.7933477Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_uint8 PASSED [0.0514s] [ 27%] 2025-12-04T15:22:22.7933579Z test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_bfloat16 PASSED [1.2664s] [ 27%] 2025-12-04T15:22:22.7933683Z test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_uint8 PASSED [0.0051s] [ 27%] 2025-12-04T15:22:22.7933784Z test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_bool PASSED [0.0044s] [ 27%] 2025-12-04T15:22:22.7933886Z test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int32 PASSED [0.0039s] [ 27%] 2025-12-04T15:22:22.7934042Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex64 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 27%] 2025-12-04T15:22:22.7934191Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_uint8 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 27%] 2025-12-04T15:22:22.7934342Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int32 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 27%] 2025-12-04T15:22:22.7934493Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int64 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 27%] 2025-12-04T15:22:22.7934644Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_uint8 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 27%] 2025-12-04T15:22:22.7934813Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_float16 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 27%] 2025-12-04T15:22:22.7934916Z test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_bfloat16 PASSED [0.0708s] [ 27%] 2025-12-04T15:22:22.7935042Z test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_complex128 PASSED [1.3267s] [ 27%] 2025-12-04T15:22:22.7935143Z test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float16 PASSED [0.0730s] [ 27%] 2025-12-04T15:22:22.7935242Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float64 PASSED [0.0482s] [ 27%] 2025-12-04T15:22:22.7935337Z test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int16 PASSED [0.0460s] [ 27%] 2025-12-04T15:22:22.7935440Z test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float16 PASSED [0.0065s] [ 27%] 2025-12-04T15:22:22.7935542Z test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int8 PASSED [0.0057s] [ 27%] 2025-12-04T15:22:22.7935638Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int32 PASSED [0.0169s] [ 27%] 2025-12-04T15:22:22.7935736Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int8 PASSED [1.2558s] [ 27%] 2025-12-04T15:22:22.7935836Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_float16 PASSED [0.3316s] [ 27%] 2025-12-04T15:22:22.7935935Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_int64 PASSED [0.2099s] [ 27%] 2025-12-04T15:22:22.7936032Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_uint8 PASSED [0.0175s] [ 27%] 2025-12-04T15:22:22.7936132Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_bool PASSED [1.5009s] [ 27%] 2025-12-04T15:22:22.7936235Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float16 PASSED [0.2696s] [ 27%] 2025-12-04T15:22:22.7936350Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float64 PASSED [0.1985s] [ 27%] 2025-12-04T15:22:22.7936461Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int8 PASSED [0.0163s] [ 27%] 2025-12-04T15:22:22.7936579Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_complex128 PASSED [1.8058s] [ 27%] 2025-12-04T15:22:22.7936679Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float16 PASSED [0.3179s] [ 27%] 2025-12-04T15:22:22.7936777Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_int8 PASSED [0.2049s] [ 27%] 2025-12-04T15:22:22.7936873Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_bool PASSED [0.0236s] [ 27%] 2025-12-04T15:22:22.7936973Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_float16 PASSED [0.0231s] [ 27%] 2025-12-04T15:22:22.7937069Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int32 PASSED [0.0180s] [ 27%] 2025-12-04T15:22:22.7937176Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int16 PASSED [1.2734s] [ 27%] 2025-12-04T15:22:22.7937280Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int8 PASSED [0.0049s] [ 27%] 2025-12-04T15:22:22.7937393Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_complex64 PASSED [0.0072s] [ 27%] 2025-12-04T15:22:22.7937502Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_int64 PASSED [0.0055s] [ 27%] 2025-12-04T15:22:22.7937603Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int16 PASSED [0.0048s] [ 27%] 2025-12-04T15:22:22.7937703Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_bool PASSED [0.0204s] [ 28%] 2025-12-04T15:22:22.7937806Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float16 PASSED [1.3007s] [ 28%] 2025-12-04T15:22:22.7937906Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int16 PASSED [0.0198s] [ 28%] 2025-12-04T15:22:22.7938003Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int8 PASSED [0.0162s] [ 28%] 2025-12-04T15:22:22.7938194Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_bfloat16 SKIPPED [0.0002s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 28%] 2025-12-04T15:22:22.7938382Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float64 SKIPPED [0.0002s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 28%] 2025-12-04T15:22:22.7938483Z test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int8 PASSED [0.0665s] [ 28%] 2025-12-04T15:22:22.7938604Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_complex64 PASSED [1.2902s] [ 28%] 2025-12-04T15:22:22.7938712Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int32 PASSED [0.0106s] [ 28%] 2025-12-04T15:22:22.7938819Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_complex32 PASSED [0.0066s] [ 28%] 2025-12-04T15:22:22.7938924Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int8 PASSED [0.0076s] [ 28%] 2025-12-04T15:22:22.7939030Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_uint8 PASSED [0.0074s] [ 28%] 2025-12-04T15:22:22.7939145Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_complex64 PASSED [0.0061s] [ 28%] 2025-12-04T15:22:22.7939257Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int16 PASSED [1.2708s] [ 28%] 2025-12-04T15:22:22.7939362Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float32 PASSED [0.0104s] [ 28%] 2025-12-04T15:22:22.7939471Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float64 PASSED [1.2809s] [ 28%] 2025-12-04T15:22:22.7939576Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_bool PASSED [0.0105s] [ 28%] 2025-12-04T15:22:22.7939687Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int16 PASSED [1.2694s] [ 28%] 2025-12-04T15:22:22.7939792Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int32 PASSED [0.0111s] [ 28%] 2025-12-04T15:22:22.7939910Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int64 PASSED [1.3057s] [ 28%] 2025-12-04T15:22:22.7940025Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_uint8 PASSED [0.0114s] [ 28%] 2025-12-04T15:22:22.7940179Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_bool PASSED [1.2688s] [ 28%] 2025-12-04T15:22:22.7940285Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_float16 PASSED [0.0124s] [ 28%] 
2025-12-04T15:22:22.7940391Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_uint8 PASSED [0.0095s] [ 28%] 2025-12-04T15:22:22.7940493Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_int64 PASSED [1.2781s] [ 28%] 2025-12-04T15:22:22.7940611Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_complex64 PASSED [0.0086s] [ 28%] 2025-12-04T15:22:22.7940721Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int32 PASSED [0.0057s] [ 28%] 2025-12-04T15:22:22.7940835Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_uint8 PASSED [1.2275s] [ 28%] 2025-12-04T15:22:22.7940943Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int8 PASSED [0.0119s] [ 28%] 2025-12-04T15:22:22.7941048Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_uint8 PASSED [0.0091s] [ 28%] 2025-12-04T15:22:22.7941156Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int16 PASSED [0.0105s] [ 28%] 2025-12-04T15:22:22.7941268Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_complex128 PASSED [1.7229s] [ 28%] 2025-12-04T15:22:22.7941381Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_complex32 PASSED [0.8031s] [ 28%] 2025-12-04T15:22:22.7941490Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_complex64 PASSED [0.0070s] [ 28%] 2025-12-04T15:22:22.7941600Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_float16 PASSED [0.0087s] [ 28%] 2025-12-04T15:22:22.7941704Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_bool PASSED [1.2340s] [ 28%] 2025-12-04T15:22:22.7941819Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_complex128 PASSED [0.0098s] [ 28%] 2025-12-04T15:22:22.7941926Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_uint8 PASSED [0.0090s] [ 28%] 2025-12-04T15:22:22.7942033Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_float64 PASSED [0.0071s] [ 28%] 2025-12-04T15:22:22.7942150Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int16 PASSED [0.0078s] [ 28%] 2025-12-04T15:22:22.7942256Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int16 PASSED [1.2769s] [ 28%] 2025-12-04T15:22:22.7942360Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int8 PASSED [0.0115s] [ 28%] 2025-12-04T15:22:22.7942466Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex32 PASSED [0.0311s] [ 28%] 2025-12-04T15:22:22.7942567Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float16 PASSED [0.0168s] [ 28%] 2025-12-04T15:22:22.7942668Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int16 PASSED [1.2804s] [ 28%] 2025-12-04T15:22:22.7942768Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int32 PASSED [0.0143s] [ 28%] 2025-12-04T15:22:22.7942872Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int16 PASSED [0.0149s] [ 28%] 2025-12-04T15:22:22.7942978Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_uint8 PASSED [0.0143s] [ 28%] 2025-12-04T15:22:22.7943076Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_int8 PASSED [0.0046s] [ 28%] 2025-12-04T15:22:22.7943177Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_uint8 PASSED [0.0043s] [ 28%] 2025-12-04T15:22:22.7943284Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_complex128 PASSED [0.0029s] [ 28%] 2025-12-04T15:22:22.7943392Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_complex64 PASSED [0.0028s] [ 28%] 2025-12-04T15:22:22.7943520Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_float64 PASSED [0.0026s] [ 28%] 2025-12-04T15:22:22.7943623Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_int64 PASSED [0.0026s] [ 28%] 2025-12-04T15:22:22.7943750Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_int16 PASSED [0.0025s] [ 28%] 2025-12-04T15:22:22.7943863Z test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_float32 PASSED [0.0859s] [ 28%] 2025-12-04T15:22:22.7943970Z test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int16 PASSED [0.0816s] [ 28%] 2025-12-04T15:22:22.7944078Z test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int64 PASSED [0.0816s] [ 28%] 2025-12-04T15:22:22.7944185Z test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_uint8 PASSED [0.0797s] [ 28%] 2025-12-04T15:22:22.7944285Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int8 PASSED [0.0105s] [ 28%] 2025-12-04T15:22:22.7944398Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_float16 PASSED [0.3381s] [ 28%] 2025-12-04T15:22:22.7944509Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int32 PASSED [0.1343s] [ 28%] 2025-12-04T15:22:22.7944622Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int64 PASSED [0.1362s] [ 28%] 2025-12-04T15:22:22.7944726Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_bfloat16 PASSED [0.0832s] [ 28%] 2025-12-04T15:22:22.7944829Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_float16 PASSED [0.0817s] [ 28%] 2025-12-04T15:22:22.7944928Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int8 PASSED [0.0411s] [ 28%] 2025-12-04T15:22:22.7945028Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_int32 PASSED [1.3301s] [ 28%] 2025-12-04T15:22:22.7945127Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int16 PASSED [0.0554s] [ 29%] 2025-12-04T15:22:22.7945231Z test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float32 PASSED [0.0264s] [ 29%] 2025-12-04T15:22:22.7945334Z test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_float32 PASSED [1.2837s] [ 29%] 2025-12-04T15:22:22.7945439Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_float64 PASSED [0.0197s] [ 29%] 2025-12-04T15:22:22.7945538Z test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int32 PASSED [0.1588s] [ 29%] 2025-12-04T15:22:22.7945651Z test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int64 PASSED [0.1644s] [ 29%] 2025-12-04T15:22:22.7945836Z test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_float64 SKIPPED [0.0002s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 29%] 2025-12-04T15:22:22.7945938Z test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_float64 PASSED [0.0474s] [ 29%] 2025-12-04T15:22:22.7946035Z test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_uint8 PASSED [0.0440s] [ 29%] 2025-12-04T15:22:22.7946145Z test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int64 PASSED [1.3571s] [ 29%] 2025-12-04T15:22:22.7946246Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_bool PASSED [0.0050s] [ 29%] 2025-12-04T15:22:22.7946354Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_float64 PASSED [0.0043s] [ 29%] 2025-12-04T15:22:22.7946460Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_bfloat16 PASSED [1.3003s] [ 29%] 2025-12-04T15:22:22.7946566Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float64 PASSED [0.0056s] [ 29%] 2025-12-04T15:22:22.7946669Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int32 PASSED [1.2757s] [ 29%] 2025-12-04T15:22:22.7946770Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int8 PASSED [0.0048s] [ 29%] 2025-12-04T15:22:22.7946874Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_float32 PASSED [0.0550s] [ 29%] 2025-12-04T15:22:22.7946970Z test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_int64 PASSED [0.2203s] [ 29%] 2025-12-04T15:22:22.7947099Z test_ops.py::TestCommonCUDA::test_python_ref__refs_igamma_cuda_float32 PASSED [0.0630s] [ 29%] 
2025-12-04T15:22:22.7947202Z test_ops.py::TestCommonCUDA::test_python_ref__refs_igamma_cuda_float64 PASSED [0.0644s] [ 29%] 2025-12-04T15:22:22.7947330Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_float32 XFAIL [0.0027s] [ 29%] 2025-12-04T15:22:22.7947434Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_uint8 XFAIL [1.2696s] [ 29%] 2025-12-04T15:22:22.7947547Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_complex32 XFAIL [0.0036s] [ 29%] 2025-12-04T15:22:22.7947655Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float64 XFAIL [0.0031s] [ 29%] 2025-12-04T15:22:22.7947761Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int8 XFAIL [1.2925s] [ 29%] 2025-12-04T15:22:22.7947867Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_int16 XFAIL [1.2981s] [ 29%] 2025-12-04T15:22:22.7947982Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_bfloat16 XFAIL [1.2442s] [ 29%] 2025-12-04T15:22:22.7948089Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_float64 PASSED [1.4150s] [ 29%] 2025-12-04T15:22:22.7948195Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int16 PASSED [0.1668s] [ 29%] 2025-12-04T15:22:22.7948297Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int64 PASSED [0.1659s] [ 29%] 2025-12-04T15:22:22.7948402Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int8 PASSED [0.1647s] [ 29%] 2025-12-04T15:22:22.7948515Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex32 PASSED [0.0337s] [ 29%] 2025-12-04T15:22:22.7948624Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex64 PASSED [1.3040s] [ 29%] 2025-12-04T15:22:22.7948729Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int16 PASSED [0.0180s] [ 29%] 2025-12-04T15:22:22.7948834Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_uint8 PASSED [0.0148s] 
[ 29%] 2025-12-04T15:22:22.7948941Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_complex32 PASSED [0.0508s] [ 29%] 2025-12-04T15:22:22.7949045Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_float64 PASSED [0.0169s] [ 29%] 2025-12-04T15:22:22.7949147Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int32 PASSED [0.0139s] [ 29%] 2025-12-04T15:22:22.7949259Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int64 PASSED [0.0140s] [ 29%] 2025-12-04T15:22:22.7949366Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_float64 PASSED [0.0126s] [ 29%] 2025-12-04T15:22:22.7949466Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int64 PASSED [1.2969s] [ 29%] 2025-12-04T15:22:22.7949574Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_float16 PASSED [0.0215s] [ 29%] 2025-12-04T15:22:22.7949682Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_float32 PASSED [0.0150s] [ 29%] 2025-12-04T15:22:22.7949788Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int16 PASSED [0.0140s] [ 29%] 2025-12-04T15:22:22.7949894Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int32 PASSED [0.0139s] [ 29%] 2025-12-04T15:22:22.7950000Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int64 PASSED [0.0139s] [ 29%] 2025-12-04T15:22:22.7950171Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int8 PASSED [0.0132s] [ 29%] 2025-12-04T15:22:22.7950281Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_complex32 PASSED [0.0370s] [ 29%] 2025-12-04T15:22:22.7950386Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float32 PASSED [0.0169s] [ 29%] 2025-12-04T15:22:22.7950486Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_uint8 PASSED [0.0144s] [ 29%] 2025-12-04T15:22:22.7950593Z test_ops.py::TestCommonCUDA::test_python_ref__refs_istft_cuda_complex128 XFAIL [0.0102s] [ 29%] 
2025-12-04T15:22:22.7950721Z test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bool PASSED [2.5356s] [ 29%] 2025-12-04T15:22:22.7950830Z test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_complex128 PASSED [0.0050s] [ 29%] 2025-12-04T15:22:22.7950946Z test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_complex64 PASSED [1.2657s] [ 29%] 2025-12-04T15:22:22.7951048Z test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int64 PASSED [0.0050s] [ 29%] 2025-12-04T15:22:22.7951145Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int8 PASSED [0.2573s] [ 29%] 2025-12-04T15:22:22.7951248Z test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_bfloat16 PASSED [0.0684s] [ 29%] 2025-12-04T15:22:22.7951346Z test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_float32 PASSED [0.0483s] [ 29%] 2025-12-04T15:22:22.7951445Z test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int16 PASSED [0.0458s] [ 29%] 2025-12-04T15:22:22.7951542Z test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int8 PASSED [0.0447s] [ 29%] 2025-12-04T15:22:22.7951646Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_complex32 PASSED [0.0453s] [ 29%] 2025-12-04T15:22:22.7951749Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_float32 PASSED [0.0237s] [ 29%] 2025-12-04T15:22:22.7951864Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_bfloat16 PASSED [0.0090s] [ 29%] 2025-12-04T15:22:22.7951977Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_float16 PASSED [0.0086s] [ 29%] 2025-12-04T15:22:22.7952101Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex32 PASSED [0.1587s] [ 29%] 2025-12-04T15:22:22.7952219Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float16 PASSED [0.0083s] [ 29%] 2025-12-04T15:22:22.7952333Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float32 PASSED [0.0078s] [ 29%] 
2025-12-04T15:22:22.7952449Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float64 PASSED [0.0077s] [ 30%] 2025-12-04T15:22:22.7952560Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int32 PASSED [0.0063s] [ 30%] 2025-12-04T15:22:22.7952673Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_uint8 PASSED [0.0063s] [ 30%] 2025-12-04T15:22:22.7952783Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_bfloat16 PASSED [0.0722s] [ 30%] 2025-12-04T15:22:22.7952912Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_complex128 PASSED [0.0916s] [ 30%] 2025-12-04T15:22:22.7953025Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_complex64 PASSED [1.4271s] [ 30%] 2025-12-04T15:22:22.7953138Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_float64 PASSED [0.1600s] [ 30%] 2025-12-04T15:22:22.7953249Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_float64 PASSED [0.0178s] [ 30%] 2025-12-04T15:22:22.7953364Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_bfloat16 PASSED [0.0314s] [ 30%] 2025-12-04T15:22:22.7953486Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_bfloat16 PASSED [0.1369s] [ 30%] 2025-12-04T15:22:22.7953596Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float64 PASSED [0.0386s] [ 30%] 2025-12-04T15:22:22.7953725Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_bfloat16 XFAIL [0.0043s] [ 30%] 2025-12-04T15:22:22.7953851Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int16 XFAIL [1.2860s] [ 30%] 2025-12-04T15:22:22.7953974Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int32 XFAIL [1.2731s] [ 30%] 2025-12-04T15:22:22.7954098Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int64 XFAIL [1.2646s] [ 
30%] 2025-12-04T15:22:22.7954232Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_uint8 XFAIL [1.2661s] [ 30%] 2025-12-04T15:22:22.7954344Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_float16 PASSED [1.2782s] [ 30%] 2025-12-04T15:22:22.7954460Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int64 PASSED [0.0190s] [ 30%] 2025-12-04T15:22:22.7954561Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_bool PASSED [0.0198s] [ 30%] 2025-12-04T15:22:22.7954670Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_complex128 PASSED [1.3045s] [ 30%] 2025-12-04T15:22:22.7954770Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_bool PASSED [0.0243s] [ 30%] 2025-12-04T15:22:22.7954877Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_complex128 PASSED [0.3403s] [ 30%] 2025-12-04T15:22:22.7954978Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float16 PASSED [0.0253s] [ 30%] 2025-12-04T15:22:22.7955080Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float32 PASSED [0.0174s] [ 30%] 2025-12-04T15:22:22.7955182Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_bfloat16 PASSED [0.0232s] [ 30%] 2025-12-04T15:22:22.7955287Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex64 PASSED [0.0304s] [ 30%] 2025-12-04T15:22:22.7955387Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_float32 PASSED [0.0166s] [ 30%] 2025-12-04T15:22:22.7955488Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_float64 PASSED [0.0167s] [ 30%] 2025-12-04T15:22:22.7955588Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int64 PASSED [0.0184s] [ 30%] 2025-12-04T15:22:22.7955772Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_float16 SKIPPED [0.0001s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 30%] 2025-12-04T15:22:22.7955903Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_complex128 PASSED [1.2857s] [ 30%] 2025-12-04T15:22:22.7956027Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_float32 PASSED [1.3088s] [ 30%] 2025-12-04T15:22:22.7956149Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_int32 PASSED [1.3042s] [ 30%] 2025-12-04T15:22:22.7956261Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp2_cuda_bfloat16 PASSED [0.0069s] [ 30%] 2025-12-04T15:22:22.7956373Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp2_cuda_float32 PASSED [0.0043s] [ 30%] 2025-12-04T15:22:22.7956498Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_complex32 XFAIL [0.0591s] [ 30%] 2025-12-04T15:22:22.7956607Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_complex64 XFAIL [1.9356s] [ 30%] 2025-12-04T15:22:22.7956714Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_float16 PASSED [1.4502s] [ 30%] 2025-12-04T15:22:22.7956826Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_bfloat16 PASSED [0.0874s] [ 30%] 2025-12-04T15:22:22.7956939Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_complex128 PASSED [0.0806s] [ 30%] 2025-12-04T15:22:22.7957051Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_complex64 PASSED [0.0799s] [ 30%] 2025-12-04T15:22:22.7957159Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int64 PASSED [0.0144s] [ 30%] 2025-12-04T15:22:22.7957267Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_uint8 PASSED [1.3045s] [ 30%] 2025-12-04T15:22:22.7957378Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_bfloat16 PASSED [0.0902s] [ 30%] 2025-12-04T15:22:22.7957488Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_float16 PASSED [0.0866s] [ 30%] 2025-12-04T15:22:22.7957595Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int8 PASSED [0.0619s] [ 30%] 2025-12-04T15:22:22.7957698Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_int16 XFAIL [0.0400s] [ 30%] 2025-12-04T15:22:22.7957845Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int64 XFAIL [0.0110s] [ 30%] 2025-12-04T15:22:22.7957956Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_complex64 PASSED [1.3097s] [ 30%] 2025-12-04T15:22:22.7958077Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_float32 PASSED [0.0240s] [ 30%] 2025-12-04T15:22:22.7958178Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float16 PASSED [0.0686s] [ 30%] 2025-12-04T15:22:22.7958280Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float64 PASSED [0.0471s] [ 30%] 2025-12-04T15:22:22.7958376Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int8 PASSED [0.0445s] [ 30%] 2025-12-04T15:22:22.7958488Z test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_complex32 PASSED [1.3163s] [ 30%] 2025-12-04T15:22:22.7958595Z test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_uint8 PASSED [0.0092s] [ 30%] 2025-12-04T15:22:22.7958698Z test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_bool PASSED [0.0402s] [ 30%] 2025-12-04T15:22:22.7958800Z test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int16 PASSED [0.0417s] [ 30%] 2025-12-04T15:22:22.7958905Z test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int32 PASSED [0.0416s] [ 30%] 2025-12-04T15:22:22.7959006Z test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_uint8 PASSED [0.0404s] [ 30%] 2025-12-04T15:22:22.7959139Z test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_bfloat16 PASSED [0.0088s] [ 30%] 2025-12-04T15:22:22.7959265Z test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int64 PASSED [0.0068s] 
[ 30%] 2025-12-04T15:22:22.7959372Z test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_float16 PASSED [0.0818s] [ 30%] 2025-12-04T15:22:22.7959479Z test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_bfloat16 PASSED [0.0060s] [ 30%] 2025-12-04T15:22:22.7959588Z test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex128 PASSED [0.0062s] [ 30%] 2025-12-04T15:22:22.7959697Z test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex64 PASSED [0.0061s] [ 30%] 2025-12-04T15:22:22.7959800Z test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int16 PASSED [1.3066s] [ 30%] 2025-12-04T15:22:22.7959904Z test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int64 PASSED [0.0070s] [ 30%] 2025-12-04T15:22:22.7960015Z test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_bfloat16 PASSED [0.0916s] [ 31%] 2025-12-04T15:22:22.7960160Z test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex64 PASSED [0.0745s] [ 31%] 2025-12-04T15:22:22.7960263Z test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float16 PASSED [0.0901s] [ 31%] 2025-12-04T15:22:22.7960366Z test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float32 PASSED [0.0587s] [ 31%] 2025-12-04T15:22:22.7960464Z test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float64 PASSED [1.3528s] [ 31%] 2025-12-04T15:22:22.7960565Z test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_uint8 PASSED [0.0491s] [ 31%] 2025-12-04T15:22:22.7960671Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int16 PASSED [0.0131s] [ 31%] 2025-12-04T15:22:22.7960780Z test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float16 XFAIL [0.0026s] [ 31%] 2025-12-04T15:22:22.7960881Z test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_uint8 PASSED [1.2802s] [ 31%] 2025-12-04T15:22:22.7961000Z test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_float16 PASSED [0.0396s] [ 31%] 
2025-12-04T15:22:22.7961099Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_bfloat16 PASSED [0.0686s] [ 31%] 2025-12-04T15:22:22.7961198Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_float32 PASSED [0.0471s] [ 31%] 2025-12-04T15:22:22.7961295Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int64 PASSED [0.0451s] [ 31%] 2025-12-04T15:22:22.7961431Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_uint8 PASSED [0.0440s] [ 31%] 2025-12-04T15:22:22.7961535Z test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_complex64 PASSED [0.0284s] [ 31%] 2025-12-04T15:22:22.7961647Z test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float16 PASSED [0.0213s] [ 31%] 2025-12-04T15:22:22.7961746Z test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float64 PASSED [1.2912s] [ 31%] 2025-12-04T15:22:22.7961843Z test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_int8 PASSED [0.0127s] [ 31%] 2025-12-04T15:22:22.7961997Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_int64 SKIPPED [0.0002s] (Expected: empty is not comparable) [ 31%] 2025-12-04T15:22:22.7962172Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float32 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 31%] 2025-12-04T15:22:22.7962346Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float64 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 31%] 2025-12-04T15:22:22.7962451Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_bool PASSED [1.2663s] [ 31%] 2025-12-04T15:22:22.7962562Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex128 PASSED [0.0086s] [ 31%] 2025-12-04T15:22:22.7962671Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex32 PASSED [1.2728s] [ 31%] 2025-12-04T15:22:22.7962780Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex64 PASSED [0.0080s] [ 31%] 
2025-12-04T15:22:22.7962884Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float64 PASSED [1.3082s] [ 31%] 2025-12-04T15:22:22.7962987Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int8 PASSED [0.0076s] [ 31%] 2025-12-04T15:22:22.7963093Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_uint8 PASSED [1.3242s] [ 31%] 2025-12-04T15:22:22.7963203Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex128 PASSED [0.0086s] [ 31%] 2025-12-04T15:22:22.7963312Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex32 PASSED [1.2790s] [ 31%] 2025-12-04T15:22:22.7963418Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex64 PASSED [0.0079s] [ 31%] 2025-12-04T15:22:22.7963523Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int32 PASSED [1.2832s] [ 31%] 2025-12-04T15:22:22.7963646Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int64 PASSED [0.0073s] [ 31%] 2025-12-04T15:22:22.7963831Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_bfloat16 SKIPPED [0.0002s] (Expected: dropout is not comparable) [ 31%] 2025-12-04T15:22:22.7964010Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float64 SKIPPED [0.0001s] (Expected: dropout is not comparable) [ 31%] 2025-12-04T15:22:22.7964134Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_bfloat16 PASSED [0.0394s] [ 31%] 2025-12-04T15:22:22.7964269Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_float32 PASSED [1.3239s] [ 31%] 2025-12-04T15:22:22.7964390Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float64 PASSED [0.0377s] [ 31%] 2025-12-04T15:22:22.7964513Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float32 PASSED [0.0467s] [ 31%] 2025-12-04T15:22:22.7964637Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float64 PASSED [0.0461s] [ 31%] 2025-12-04T15:22:22.7964777Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [0.0293s] [ 31%] 2025-12-04T15:22:22.7964899Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_float16 PASSED [1.3259s] [ 31%] 2025-12-04T15:22:22.7965038Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_bool PASSED [1.3144s] [ 31%] 2025-12-04T15:22:22.7965200Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_float32 PASSED [1.3050s] [ 31%] 2025-12-04T15:22:22.7965350Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int16 PASSED [1.2794s] [ 31%] 2025-12-04T15:22:22.7965487Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int64 PASSED [1.3101s] [ 31%] 2025-12-04T15:22:22.7965625Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_float16 PASSED [0.0715s] [ 31%] 2025-12-04T15:22:22.7965762Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_uint8 PASSED [0.0362s] [ 31%] 2025-12-04T15:22:22.7965881Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mish_cuda_float16 PASSED [1.3283s] [ 31%] 2025-12-04T15:22:22.7966003Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_float32 PASSED [0.0089s] [ 31%] 2025-12-04T15:22:22.7966130Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_bfloat16 PASSED [0.1179s] [ 31%] 2025-12-04T15:22:22.7966252Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_float64 PASSED [0.0905s] [ 31%] 2025-12-04T15:22:22.7966371Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pdist_cuda_float32 XFAIL [0.0027s] [ 
31%] 2025-12-04T15:22:22.7966502Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_float16 PASSED [1.2746s] [ 31%] 2025-12-04T15:22:22.7966631Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_int32 PASSED [0.0055s] [ 31%] 2025-12-04T15:22:22.7966770Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_complex128 PASSED [0.0055s] [ 31%] 2025-12-04T15:22:22.7966907Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_complex64 PASSED [0.0054s] [ 31%] 2025-12-04T15:22:22.7967044Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_float32 PASSED [0.0862s] [ 31%] 2025-12-04T15:22:22.7967176Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int64 PASSED [0.1008s] [ 31%] 2025-12-04T15:22:22.7967298Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_prelu_cuda_bfloat16 PASSED [0.1099s] [ 31%] 2025-12-04T15:22:22.7967425Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float32 PASSED [1.3238s] [ 31%] 2025-12-04T15:22:22.7967543Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int16 PASSED [0.0218s] [ 31%] 2025-12-04T15:22:22.7967658Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int32 PASSED [0.0194s] [ 31%] 2025-12-04T15:22:22.7967776Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_float64 PASSED [0.0321s] [ 31%] 2025-12-04T15:22:22.7967909Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_bfloat16 PASSED [0.0133s] [ 31%] 2025-12-04T15:22:22.7968039Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float16 PASSED [1.2955s] [ 31%] 2025-12-04T15:22:22.7968177Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_bfloat16 PASSED 
[1.2809s] [ 31%] 2025-12-04T15:22:22.7968316Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_float32 PASSED [1.2886s] [ 32%] 2025-12-04T15:22:22.7968455Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_float64 PASSED [1.3021s] [ 32%] 2025-12-04T15:22:22.7968587Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int8 PASSED [1.2851s] [ 32%] 2025-12-04T15:22:22.7968722Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_float16 PASSED [1.2794s] [ 32%] 2025-12-04T15:22:22.7968877Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_float32 PASSED [1.2730s] [ 32%] 2025-12-04T15:22:22.7969013Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int16 PASSED [1.2700s] [ 32%] 2025-12-04T15:22:22.7969155Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int8 PASSED [1.3021s] [ 32%] 2025-12-04T15:22:22.7969285Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_bfloat16 PASSED [0.0775s] [ 32%] 2025-12-04T15:22:22.7969411Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_float32 PASSED [0.0226s] [ 32%] 2025-12-04T15:22:22.7969534Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int16 PASSED [0.0229s] [ 32%] 2025-12-04T15:22:22.7969667Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int8 PASSED [0.0187s] [ 32%] 2025-12-04T15:22:22.7969863Z test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_complex64 SKIPPED [0.0001s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 32%] 2025-12-04T15:22:22.7970052Z test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_float32 SKIPPED [0.0001s] (TODO: RuntimeError: no _refs support for 
torch.rand_like) [ 32%] 2025-12-04T15:22:22.7970270Z test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_bfloat16 SKIPPED [0.0001s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 32%] 2025-12-04T15:22:22.7970374Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_float32 PASSED [0.0031s] [ 32%] 2025-12-04T15:22:22.7970475Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int16 PASSED [1.2910s] [ 32%] 2025-12-04T15:22:22.7970576Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int64 PASSED [0.0045s] [ 32%] 2025-12-04T15:22:22.7970691Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_complex64 PASSED [0.0301s] [ 32%] 2025-12-04T15:22:22.7970804Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_float32 PASSED [0.0277s] [ 32%] 2025-12-04T15:22:22.7970913Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_int8 PASSED [0.0216s] [ 32%] 2025-12-04T15:22:22.7971023Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex32 PASSED [0.0253s] [ 32%] 2025-12-04T15:22:22.7971151Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex64 PASSED [1.3187s] [ 32%] 2025-12-04T15:22:22.7971260Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float64 PASSED [0.0268s] [ 32%] 2025-12-04T15:22:22.7971361Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_int8 PASSED [0.0186s] [ 32%] 2025-12-04T15:22:22.7971468Z test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float16 PASSED [1.2880s] [ 32%] 2025-12-04T15:22:22.7971570Z test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int8 PASSED [0.0103s] [ 32%] 2025-12-04T15:22:22.7971674Z test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_bfloat16 PASSED [0.0924s] [ 32%] 2025-12-04T15:22:22.7971779Z test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_complex128 PASSED [1.3770s] [ 32%] 2025-12-04T15:22:22.7971880Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float16 PASSED [0.0934s] [ 32%] 2025-12-04T15:22:22.7971981Z test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float32 PASSED [0.0592s] [ 32%] 2025-12-04T15:22:22.7972081Z test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float64 PASSED [0.0587s] [ 32%] 2025-12-04T15:22:22.7972179Z test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int16 PASSED [0.0460s] [ 32%] 2025-12-04T15:22:22.7972275Z test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_int8 PASSED [0.0172s] [ 32%] 2025-12-04T15:22:22.7972382Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_bfloat16 PASSED [0.0237s] [ 32%] 2025-12-04T15:22:22.7972518Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_bfloat16 PASSED [1.3089s] [ 32%] 2025-12-04T15:22:22.7972619Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_bool PASSED [0.0052s] [ 32%] 2025-12-04T15:22:22.7972737Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_float16 PASSED [1.2612s] [ 32%] 2025-12-04T15:22:22.7972838Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_uint8 PASSED [0.0054s] [ 32%] 2025-12-04T15:22:22.7972942Z test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex32 PASSED [0.0314s] [ 32%] 2025-12-04T15:22:22.7973044Z test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float32 PASSED [0.0135s] [ 32%] 2025-12-04T15:22:22.7973142Z test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int64 PASSED [1.2823s] [ 32%] 2025-12-04T15:22:22.7973255Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_complex64 PASSED [0.0324s] [ 32%] 2025-12-04T15:22:22.7973362Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_uint8 PASSED [0.0176s] [ 32%] 2025-12-04T15:22:22.7973471Z test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float32 PASSED [0.0638s] [ 32%] 2025-12-04T15:22:22.7973578Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float64 PASSED [0.0632s] [ 32%] 2025-12-04T15:22:22.7973680Z test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int64 PASSED [0.0218s] [ 32%] 2025-12-04T15:22:22.7973794Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex128 PASSED [0.0159s] [ 32%] 2025-12-04T15:22:22.7973903Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_float64 PASSED [0.0151s] [ 32%] 2025-12-04T15:22:22.7974009Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int16 PASSED [0.0127s] [ 32%] 2025-12-04T15:22:22.7974114Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_uint8 PASSED [0.0127s] [ 32%] 2025-12-04T15:22:22.7974222Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_complex32 PASSED [1.3168s] [ 32%] 2025-12-04T15:22:22.7974330Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_complex64 PASSED [0.0230s] [ 32%] 2025-12-04T15:22:22.7974437Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_float16 PASSED [0.0199s] [ 32%] 2025-12-04T15:22:22.7974541Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int32 PASSED [0.0159s] [ 32%] 2025-12-04T15:22:22.7974663Z test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex64 PASSED [0.0111s] [ 32%] 2025-12-04T15:22:22.7974762Z test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int16 PASSED [0.0091s] [ 32%] 2025-12-04T15:22:22.7974862Z test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_uint8 PASSED [0.0090s] [ 32%] 2025-12-04T15:22:22.7974959Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_bool PASSED [0.0115s] [ 32%] 2025-12-04T15:22:22.7975060Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_int16 PASSED [0.0114s] [ 32%] 2025-12-04T15:22:22.7975161Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_uint8 PASSED [0.0113s] [ 32%] 2025-12-04T15:22:22.7975264Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_float16 PASSED [1.3226s] [ 32%] 2025-12-04T15:22:22.7975368Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_bfloat16 PASSED [0.0254s] [ 32%] 2025-12-04T15:22:22.7975467Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_bool PASSED [0.0212s] [ 32%] 2025-12-04T15:22:22.7975574Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_complex64 PASSED [0.4426s] [ 32%] 2025-12-04T15:22:22.7975675Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_float32 PASSED [0.0169s] [ 32%] 2025-12-04T15:22:22.7975775Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_uint8 PASSED [0.0173s] [ 32%] 2025-12-04T15:22:22.7975873Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int8 PASSED [0.0471s] [ 32%] 2025-12-04T15:22:22.7976005Z test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_bfloat16 PASSED [1.2750s] [ 33%] 2025-12-04T15:22:22.7976125Z test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_bool PASSED [0.0091s] [ 33%] 2025-12-04T15:22:22.7976249Z test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_float64 PASSED [0.0076s] [ 33%] 2025-12-04T15:22:22.7976361Z test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_int16 PASSED [0.0066s] [ 33%] 2025-12-04T15:22:22.7976476Z test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_int64 PASSED [1.2904s] [ 33%] 2025-12-04T15:22:22.7976577Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_float32 PASSED [0.0175s] [ 33%] 2025-12-04T15:22:22.7976680Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int16 PASSED [0.0120s] [ 33%] 2025-12-04T15:22:22.7978803Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int32 PASSED [0.0118s] [ 33%] 2025-12-04T15:22:22.7978907Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_uint8 PASSED [0.0111s] [ 33%] 2025-12-04T15:22:22.7979015Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_int64 PASSED [0.0280s] [ 33%] 2025-12-04T15:22:22.7979123Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_bfloat16 PASSED [0.0213s] [ 33%] 2025-12-04T15:22:22.7979223Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_float64 PASSED [1.2711s] [ 33%] 2025-12-04T15:22:22.7979332Z test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_float16 PASSED [0.0190s] [ 33%] 2025-12-04T15:22:22.7979435Z test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_uint8 PASSED [0.0109s] [ 33%] 2025-12-04T15:22:22.7979532Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_bool PASSED [0.0203s] [ 33%] 2025-12-04T15:22:22.7979638Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_complex64 PASSED [0.5003s] [ 33%] 2025-12-04T15:22:22.7979746Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_complex128 PASSED [0.2253s] [ 33%] 2025-12-04T15:22:22.7979846Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_uint8 PASSED [0.0165s] [ 33%] 2025-12-04T15:22:22.7979970Z test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_bfloat16 PASSED [1.2993s] [ 33%] 2025-12-04T15:22:22.7980087Z test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_bool PASSED [1.2369s] [ 33%] 2025-12-04T15:22:22.7980449Z test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float32 PASSED [1.2299s] [ 33%] 2025-12-04T15:22:22.7980564Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int16 PASSED [0.2722s] [ 33%] 2025-12-04T15:22:22.7980678Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_bool PASSED [0.2417s] [ 33%] 2025-12-04T15:22:22.7980791Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_int64 PASSED [0.0186s] [ 33%] 2025-12-04T15:22:22.7980906Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_uint8 PASSED [0.2607s] 
[ 33%] 2025-12-04T15:22:22.7981017Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_float16 PASSED [1.6652s] [ 33%] 2025-12-04T15:22:22.7981129Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_float64 PASSED [0.3131s] [ 33%] 2025-12-04T15:22:22.7981239Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int16 PASSED [0.2248s] [ 33%] 2025-12-04T15:22:22.7981350Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_float16 PASSED [0.3391s] [ 33%] 2025-12-04T15:22:22.7981464Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_float32 PASSED [1.2592s] [ 33%] 2025-12-04T15:22:22.7981604Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_complex128 PASSED [1.2845s] [ 33%] 2025-12-04T15:22:22.7981740Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int64 PASSED [1.3145s] [ 33%] 2025-12-04T15:22:22.7981880Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_float32 PASSED [0.0450s] [ 33%] 2025-12-04T15:22:22.7981992Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_uint8 PASSED [0.0398s] [ 33%] 2025-12-04T15:22:22.7982146Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_float16 PASSED [0.0765s] [ 33%] 2025-12-04T15:22:22.7982287Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int16 PASSED [0.0643s] [ 33%] 2025-12-04T15:22:22.7982426Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8 PASSED [0.0616s] [ 33%] 2025-12-04T15:22:22.7982570Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16 PASSED [0.0812s] [ 33%] 2025-12-04T15:22:22.7982705Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int32 PASSED [1.3397s] [ 33%] 2025-12-04T15:22:22.7982849Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int8 PASSED [0.0676s] [ 33%] 2025-12-04T15:22:22.7982989Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float32 PASSED [0.0670s] [ 33%] 2025-12-04T15:22:22.7983128Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float64 PASSED [0.0671s] [ 33%] 2025-12-04T15:22:22.7983245Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_bfloat16 PASSED [1.2969s] [ 33%] 2025-12-04T15:22:22.7983356Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_float32 PASSED [0.0304s] [ 33%] 2025-12-04T15:22:22.7983465Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int16 PASSED [1.3071s] [ 33%] 2025-12-04T15:22:22.7983572Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int32 PASSED [0.0312s] [ 33%] 2025-12-04T15:22:22.7983703Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_float64 PASSED [0.0090s] [ 33%] 2025-12-04T15:22:22.7983836Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_float64 PASSED [1.5727s] [ 33%] 2025-12-04T15:22:22.7983967Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_uint8 PASSED [0.2421s] [ 33%] 2025-12-04T15:22:22.7984078Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_bool PASSED [0.1314s] [ 33%] 2025-12-04T15:22:22.7984206Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_int32 PASSED [0.1362s] [ 33%] 2025-12-04T15:22:22.7984315Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_bool PASSED [1.4189s] [ 33%] 2025-12-04T15:22:22.7984427Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_float32 PASSED [0.8160s] [ 33%] 2025-12-04T15:22:22.7984534Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int64 PASSED 
[0.0797s] [ 33%] 2025-12-04T15:22:22.7984653Z test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_bfloat16 PASSED [1.2947s] [ 33%] 2025-12-04T15:22:22.7984773Z test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_complex32 PASSED [0.0091s] [ 33%] 2025-12-04T15:22:22.7984894Z test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_complex64 PASSED [1.2823s] [ 33%] 2025-12-04T15:22:22.7985000Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_bfloat16 PASSED [0.0242s] [ 33%] 2025-12-04T15:22:22.7985103Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float64 PASSED [0.0157s] [ 33%] 2025-12-04T15:22:22.7985206Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int16 PASSED [0.0170s] [ 33%] 2025-12-04T15:22:22.7985305Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int8 PASSED [1.2939s] [ 33%] 2025-12-04T15:22:22.7985406Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_uint8 PASSED [0.0187s] [ 33%] 2025-12-04T15:22:22.7985517Z test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int32 PASSED [0.0147s] [ 33%] 2025-12-04T15:22:22.7985638Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_float64 PASSED [0.0060s] [ 33%] 2025-12-04T15:22:22.7985758Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex32 PASSED [0.0053s] [ 33%] 2025-12-04T15:22:22.7985866Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_float32 PASSED [1.2725s] [ 33%] 2025-12-04T15:22:22.7985988Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_bfloat16 PASSED [0.0064s] [ 34%] 2025-12-04T15:22:22.7986104Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_int64 PASSED [0.0041s] [ 34%] 2025-12-04T15:22:22.7986209Z test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_complex32 PASSED [0.0080s] [ 34%] 2025-12-04T15:22:22.7986312Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float16 PASSED [0.0122s] [ 34%] 2025-12-04T15:22:22.7986413Z test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float32 PASSED [1.2768s] [ 34%] 2025-12-04T15:22:22.7986522Z test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_float32 PASSED [0.0158s] [ 34%] 2025-12-04T15:22:22.7986627Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_complex32 PASSED [0.1158s] [ 34%] 2025-12-04T15:22:22.7986726Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int64 PASSED [1.3049s] [ 34%] 2025-12-04T15:22:22.7986833Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex128 PASSED [0.0121s] [ 34%] 2025-12-04T15:22:22.7986935Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex32 PASSED [0.0155s] [ 34%] 2025-12-04T15:22:22.7987038Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex64 PASSED [0.0098s] [ 34%] 2025-12-04T15:22:22.7987138Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_float32 PASSED [1.2783s] [ 34%] 2025-12-04T15:22:22.7987235Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int8 PASSED [0.0121s] [ 34%] 2025-12-04T15:22:22.7987346Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_bfloat16 PASSED [0.0120s] [ 34%] 2025-12-04T15:22:22.7987461Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_complex128 PASSED [0.0093s] [ 34%] 2025-12-04T15:22:22.7987570Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float32 PASSED [0.0089s] [ 34%] 2025-12-04T15:22:22.7987686Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int64 PASSED [0.0072s] [ 34%] 2025-12-04T15:22:22.7987792Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_uint8 PASSED [0.0090s] [ 34%] 2025-12-04T15:22:22.7987893Z test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_int8 PASSED [1.2725s] [ 34%] 2025-12-04T15:22:22.7987989Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float16 PASSED [0.0055s] [ 34%] 2025-12-04T15:22:22.7988087Z test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float64 PASSED [1.2635s] [ 34%] 2025-12-04T15:22:22.7988187Z test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int16 PASSED [0.0047s] [ 34%] 2025-12-04T15:22:22.7988299Z test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_bool XFAIL [0.0037s] [ 34%] 2025-12-04T15:22:22.7988419Z test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_complex128 XFAIL [1.2954s] [ 34%] 2025-12-04T15:22:22.7988521Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float16 PASSED [1.2856s] [ 34%] 2025-12-04T15:22:22.7988620Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int16 PASSED [0.0184s] [ 34%] 2025-12-04T15:22:22.7988725Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_complex128 PASSED [0.0313s] [ 34%] 2025-12-04T15:22:22.7988828Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float16 PASSED [0.0224s] [ 34%] 2025-12-04T15:22:22.7988936Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_uint8 XFAIL [0.0026s] [ 34%] 2025-12-04T15:22:22.7989049Z test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_bfloat16 PASSED [1.2991s] [ 34%] 2025-12-04T15:22:22.7989163Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_float32 PASSED [1.2533s] [ 34%] 2025-12-04T15:22:22.7989275Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int32 PASSED [0.0049s] [ 34%] 2025-12-04T15:22:22.7989392Z test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_complex32 PASSED [0.0084s] [ 34%] 2025-12-04T15:22:22.7989505Z test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_int16 PASSED [0.0051s] [ 34%] 2025-12-04T15:22:22.7989615Z test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_uint8 PASSED [1.2767s] [ 34%] 2025-12-04T15:22:22.7989724Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int64 PASSED [0.0065s] [ 34%] 2025-12-04T15:22:22.7989830Z test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_uint8 PASSED [0.0048s] [ 34%] 2025-12-04T15:22:22.7989934Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_float16 PASSED [0.0118s] [ 34%] 2025-12-04T15:22:22.7990033Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int16 PASSED [0.0101s] [ 34%] 2025-12-04T15:22:22.7990181Z test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex128 PASSED [0.0112s] [ 34%] 2025-12-04T15:22:22.7990287Z test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex32 PASSED [0.0110s] [ 34%] 2025-12-04T15:22:22.7990388Z test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int32 PASSED [1.3028s] [ 34%] 2025-12-04T15:22:22.7990488Z test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_uint8 PASSED [0.0125s] [ 34%] 2025-12-04T15:22:22.7990594Z test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_int8 PASSED [0.0853s] [ 34%] 2025-12-04T15:22:22.7990696Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int64 PASSED [1.2778s] [ 34%] 2025-12-04T15:22:22.7990795Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_uint8 PASSED [0.0131s] [ 34%] 2025-12-04T15:22:22.7990908Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_bfloat16 PASSED [0.0111s] [ 34%] 2025-12-04T15:22:22.7991015Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_bool PASSED [0.0085s] [ 34%] 2025-12-04T15:22:22.7991131Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_complex128 PASSED [0.0107s] [ 34%] 2025-12-04T15:22:22.7991271Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_float32 PASSED [0.0102s] [ 34%] 2025-12-04T15:22:22.7991384Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_float64 PASSED [0.0100s] [ 34%] 2025-12-04T15:22:22.7991490Z 
test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_int16 PASSED [0.0082s] [ 34%] 2025-12-04T15:22:22.7991596Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float32 PASSED [0.0089s] [ 34%] 2025-12-04T15:22:22.7991698Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_uint8 PASSED [0.0072s] [ 34%] 2025-12-04T15:22:22.7991805Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int32 PASSED [0.0054s] [ 34%] 2025-12-04T15:22:22.7991911Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int16 PASSED [0.0092s] [ 34%] 2025-12-04T15:22:22.7992020Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int32 PASSED [1.2772s] [ 34%] 2025-12-04T15:22:22.7992128Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_uint8 PASSED [0.0116s] [ 34%] 2025-12-04T15:22:22.7992229Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_bool PASSED [0.0085s] [ 34%] 2025-12-04T15:22:22.7992336Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_complex32 PASSED [0.0113s] [ 34%] 2025-12-04T15:22:22.7992452Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_complex32 PASSED [0.0064s] [ 34%] 2025-12-04T15:22:22.7992568Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_complex64 PASSED [0.0063s] [ 34%] 2025-12-04T15:22:22.7992704Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_uint8 PASSED [0.0052s] [ 34%] 2025-12-04T15:22:22.7992810Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int16 PASSED [1.2850s] [ 34%] 2025-12-04T15:22:22.7992927Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int64 PASSED [0.0066s] [ 34%] 2025-12-04T15:22:22.7993033Z test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_complex64 PASSED [0.0089s] [ 34%] 2025-12-04T15:22:22.7993134Z test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_float64 PASSED [1.2572s] [ 34%] 
2025-12-04T15:22:22.7993242Z test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_bfloat16 PASSED [0.0199s] [ 35%] 2025-12-04T15:22:22.7993342Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_float64 PASSED [0.0032s] [ 35%] 2025-12-04T15:22:22.7993456Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_complex_cuda_float32 PASSED [1.2413s] [ 35%] 2025-12-04T15:22:22.7993559Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_bool PASSED [0.0152s] [ 35%] 2025-12-04T15:22:22.7993668Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex32 PASSED [0.0166s] [ 35%] 2025-12-04T15:22:22.7993771Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int16 PASSED [0.0129s] [ 35%] 2025-12-04T15:22:22.7993876Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int64 PASSED [0.0128s] [ 35%] 2025-12-04T15:22:22.7993979Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_uint8 PASSED [0.0128s] [ 35%] 2025-12-04T15:22:22.7994089Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_bfloat16 PASSED [0.0055s] [ 35%] 2025-12-04T15:22:22.7994187Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_bool PASSED [0.0157s] [ 35%] 2025-12-04T15:22:22.7994290Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_complex64 PASSED [0.0202s] [ 35%] 2025-12-04T15:22:22.7994392Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_float16 PASSED [0.0190s] [ 35%] 2025-12-04T15:22:22.7994490Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_int32 PASSED [0.0157s] [ 35%] 2025-12-04T15:22:22.7994589Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_uint8 PASSED [0.0156s] [ 35%] 2025-12-04T15:22:22.7994696Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_bfloat16 PASSED [0.0037s] [ 35%] 2025-12-04T15:22:22.7994812Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_bool PASSED [1.2735s] [ 35%] 
2025-12-04T15:22:22.7994920Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_complex128 PASSED [0.0057s] [ 35%] 2025-12-04T15:22:22.7995027Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_complex64 PASSED [0.0042s] [ 35%] 2025-12-04T15:22:22.7995129Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int32 PASSED [1.2569s] [ 35%] 2025-12-04T15:22:22.7995234Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_bfloat16 PASSED [0.0056s] [ 35%] 2025-12-04T15:22:22.7995342Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex32 PASSED [0.0041s] [ 35%] 2025-12-04T15:22:22.7995442Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int32 PASSED [1.2737s] [ 35%] 2025-12-04T15:22:22.7995546Z test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_complex32 PASSED [0.0162s] [ 35%] 2025-12-04T15:22:22.7995649Z test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_float64 PASSED [0.1181s] [ 35%] 2025-12-04T15:22:22.7995750Z test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int64 PASSED [0.1380s] [ 35%] 2025-12-04T15:22:22.7995854Z test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex128 PASSED [1.2673s] [ 35%] 2025-12-04T15:22:22.7995957Z test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex32 PASSED [0.0051s] [ 35%] 2025-12-04T15:22:22.7996058Z test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int16 PASSED [1.2679s] [ 35%] 2025-12-04T15:22:22.7996179Z test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int64 PASSED [0.0046s] [ 35%] 2025-12-04T15:22:22.7996300Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs__conversions_polar_cuda PASSED [1.2636s] [ 35%] 2025-12-04T15:22:22.7996414Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_cauchy_cuda PASSED [1.2713s] [ 35%] 2025-12-04T15:22:22.7996519Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_clamp_min_cuda XFAIL [0.0057s] [ 35%] 
2025-12-04T15:22:22.7996624Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diagonal_cuda PASSED [2.5123s] [ 35%] 2025-12-04T15:22:22.7996723Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_eq_cuda PASSED [1.2436s] [ 35%] 2025-12-04T15:22:22.7996826Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fft2_cuda PASSED [1.2318s] [ 35%] 2025-12-04T15:22:22.7996932Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fft_cuda PASSED [1.2438s] [ 35%] 2025-12-04T15:22:22.7997042Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_irfftn_cuda PASSED [1.2342s] [ 35%] 2025-12-04T15:22:22.7997147Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfft2_cuda PASSED [1.2620s] [ 35%] 2025-12-04T15:22:22.7997252Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfftn_cuda PASSED [1.3063s] [ 35%] 2025-12-04T15:22:22.7997351Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_ge_cuda PASSED [1.3223s] [ 35%] 2025-12-04T15:22:22.7997455Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_hsplit_cuda PASSED [1.2779s] [ 35%] 2025-12-04T15:22:22.7997555Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_hstack_cuda XFAIL [0.0046s] [ 35%] 2025-12-04T15:22:22.7997861Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logspace_tensor_overload_cuda E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.linspace.Tensor_Tensor 2025-12-04T15:22:22.7998002Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T15:22:22.7998266Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T15:22:22.7998393Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] r 
= func(*args, **kwargs) 2025-12-04T15:22:22.7998621Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T15:22:22.7998758Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T15:22:22.7998990Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T15:22:22.7999122Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(*args, **kwargs) 2025-12-04T15:22:22.7999380Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 128, in meta_linspace_logspace 2025-12-04T15:22:22.7999492Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] torch._check( 2025-12-04T15:22:22.7999706Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T15:22:22.7999903Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T15:22:22.8000206Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T15:22:22.8000348Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] raise error_type(message_evaluated) 2025-12-04T15:22:22.8000553Z E1204 14:52:37.004000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] RuntimeError: 
linspace only supports 0-dimensional start and end tensors 2025-12-04T15:22:22.8000596Z PASSED [2.5087s] [ 35%] 2025-12-04T15:22:22.8000710Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_masked_fill_cuda XFAIL [0.0054s] [ 35%] 2025-12-04T15:22:22.8000821Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_narrow_copy_cuda PASSED [2.5433s] [ 35%] 2025-12-04T15:22:22.8000929Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nextafter_cuda PASSED [1.2788s] [ 35%] 2025-12-04T15:22:22.8001068Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_hinge_embedding_loss_cuda PASSED [1.2543s] [ 35%] 2025-12-04T15:22:22.8001341Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_l1_loss_cuda E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.sub.Tensor 2025-12-04T15:22:22.8001477Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T15:22:22.8001730Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T15:22:22.8001855Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T15:22:22.8002067Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T15:22:22.8002201Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T15:22:22.8002430Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 
2025-12-04T15:22:22.8002574Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(*args, **kwargs) 2025-12-04T15:22:22.8002784Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_compile.py", line 54, in inner 2025-12-04T15:22:22.8002920Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] return disable_fn(*args, **kwargs) 2025-12-04T15:22:22.8003145Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 1154, in _fn 2025-12-04T15:22:22.8003272Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] return fn(*args, **kwargs) 2025-12-04T15:22:22.8003500Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 152, in _fn 2025-12-04T15:22:22.8003634Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(**bound.arguments) 2025-12-04T15:22:22.8003854Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 1841, in sub 2025-12-04T15:22:22.8003985Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] a, b = _maybe_broadcast(a, b) 2025-12-04T15:22:22.8004243Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 470, in _maybe_broadcast 2025-12-04T15:22:22.8004399Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] common_shape = _broadcast_shapes( 2025-12-04T15:22:22.8004637Z E1204 
14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 458, in _broadcast_shapes 2025-12-04T15:22:22.8004747Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] torch._check( 2025-12-04T15:22:22.8004959Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T15:22:22.8005156Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T15:22:22.8005379Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T15:22:22.8005520Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] raise error_type(message_evaluated) 2025-12-04T15:22:22.8005838Z E1204 14:52:43.333000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] RuntimeError: Attempting to broadcast a dimension of length 5 at -1! 
Mismatching argument at index 1 had torch.Size([5]); but expected shape should be broadcastable to [5, 4] 2025-12-04T15:22:22.8005881Z PASSED [1.2770s] [ 35%] 2025-12-04T15:22:22.8005987Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_roll_cuda PASSED [1.2592s] [ 35%] 2025-12-04T15:22:22.8006092Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_trace_cuda PASSED [1.3182s] [ 35%] 2025-12-04T15:22:22.8006351Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_unbind_copy_cuda E1204 14:52:47.191000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.unbind.int 2025-12-04T15:22:22.8006487Z E1204 14:52:47.191000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T15:22:22.8006745Z E1204 14:52:47.191000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T15:22:22.8006870Z E1204 14:52:47.191000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T15:22:22.8007080Z E1204 14:52:47.191000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T15:22:22.8007215Z E1204 14:52:47.191000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T15:22:22.8007391Z E1204 14:52:47.191000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] IndexError: Dimension specified as 0 but tensor has no dimensions 2025-12-04T15:22:22.8007552Z E1204 14:52:47.193000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.unbind.int 2025-12-04T15:22:22.8007686Z E1204 14:52:47.193000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent 
call last): 2025-12-04T15:22:22.8007934Z E1204 14:52:47.193000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T15:22:22.8008058Z E1204 14:52:47.193000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T15:22:22.8008287Z E1204 14:52:47.193000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T15:22:22.8008421Z E1204 14:52:47.193000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T15:22:22.8008626Z E1204 14:52:47.193000 1589956 site-packages/torch/_subclasses/fake_tensor.py:2827] IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 2) 2025-12-04T15:22:22.8008665Z PASSED [1.2931s] [ 35%] 2025-12-04T15:22:22.8008775Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_vsplit_cuda PASSED [1.3114s] [ 35%] 2025-12-04T15:22:22.8008906Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_complex64 PASSED [1.3462s] [ 35%] 2025-12-04T15:22:22.8009035Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float64 PASSED [0.0115s] [ 35%] 2025-12-04T15:22:22.8009162Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int32 PASSED [0.0070s] [ 35%] 2025-12-04T15:22:22.8009317Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int16 PASSED [0.0920s] [ 35%] 2025-12-04T15:22:22.8009469Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_bfloat16 PASSED [0.0934s] [ 35%] 2025-12-04T15:22:22.8009623Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex64 PASSED 
[0.1137s] [ 35%] 2025-12-04T15:22:22.8009771Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_uint8 PASSED [0.0801s] [ 35%] 2025-12-04T15:22:22.8009919Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int32 PASSED [0.0850s] [ 35%] 2025-12-04T15:22:22.8010064Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_uint8 PASSED [0.0505s] [ 35%] 2025-12-04T15:22:22.8010268Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_bool PASSED [0.1034s] [ 35%] 2025-12-04T15:22:22.8010426Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_complex128 PASSED [0.1101s] [ 35%] 2025-12-04T15:22:22.8010578Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_float64 PASSED [0.0974s] [ 35%] 2025-12-04T15:22:22.8010747Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_complex32 PASSED [0.0862s] [ 35%] 2025-12-04T15:22:22.8010897Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_float16 PASSED [0.0986s] [ 35%] 2025-12-04T15:22:22.8011045Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_float32 PASSED [0.0967s] [ 35%] 2025-12-04T15:22:22.8011192Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_bool PASSED [0.0924s] [ 36%] 2025-12-04T15:22:22.8011339Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float16 PASSED [0.0831s] [ 36%] 2025-12-04T15:22:22.8011486Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_uint8 PASSED [0.0714s] [ 36%] 2025-12-04T15:22:22.8011641Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float64 PASSED [0.4895s] [ 36%] 2025-12-04T15:22:22.8011789Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_bool PASSED [0.0950s] [ 36%] 2025-12-04T15:22:22.8011944Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_complex128 PASSED [0.1011s] [ 36%] 2025-12-04T15:22:22.8012095Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_float64 PASSED [0.0627s] [ 36%] 2025-12-04T15:22:22.8012268Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int32 PASSED [0.0786s] [ 36%] 2025-12-04T15:22:22.8012431Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_uint8 PASSED [0.0784s] [ 36%] 2025-12-04T15:22:22.8012579Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_bool PASSED [0.1037s] [ 36%] 2025-12-04T15:22:22.8012733Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_complex64 PASSED [0.1016s] [ 36%] 2025-12-04T15:22:22.8012882Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_float16 PASSED [0.0865s] [ 36%] 2025-12-04T15:22:22.8013028Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_bool PASSED [0.0943s] [ 36%] 2025-12-04T15:22:22.8013175Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int64 PASSED [0.0879s] [ 36%] 2025-12-04T15:22:22.8013326Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_bfloat16 PASSED [0.0848s] [ 36%] 2025-12-04T15:22:22.8013473Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int32 PASSED [0.0750s] [ 36%] 2025-12-04T15:22:22.8013620Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_uint8 PASSED [0.0715s] [ 36%] 2025-12-04T15:22:22.8013770Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_float64 PASSED [0.0850s] [ 36%] 2025-12-04T15:22:22.8013916Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int64 PASSED [0.0747s] [ 36%] 2025-12-04T15:22:22.8014050Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_complex128 PASSED [0.0856s] [ 36%] 2025-12-04T15:22:22.8014182Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_complex32 PASSED [0.2792s] [ 36%] 2025-12-04T15:22:22.8014314Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float16 PASSED [0.1144s] [ 36%] 2025-12-04T15:22:22.8014446Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float32 PASSED [0.0744s] [ 36%] 2025-12-04T15:22:22.8014589Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float64 PASSED [0.0806s] [ 36%] 2025-12-04T15:22:22.8014717Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int8 PASSED [0.0875s] [ 36%] 2025-12-04T15:22:22.8014848Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int32 PASSED [0.0860s] [ 36%] 2025-12-04T15:22:22.8014977Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_uint8 PASSED [0.0817s] [ 36%] 2025-12-04T15:22:22.8015111Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_complex128 PASSED [0.3395s] [ 36%] 2025-12-04T15:22:22.8015236Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int64 PASSED [0.3045s] [ 36%] 2025-12-04T15:22:22.8015377Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_complex64 PASSED [0.2615s] [ 36%] 2025-12-04T15:22:22.8015519Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_complex128 PASSED [0.2674s] [ 36%] 2025-12-04T15:22:22.8015653Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_float32 PASSED [0.2623s] [ 36%] 2025-12-04T15:22:22.8015785Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int16 PASSED [0.2620s] [ 36%] 2025-12-04T15:22:22.8015916Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int32 PASSED [0.2637s] [ 36%] 2025-12-04T15:22:22.8016073Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_complex64 PASSED [0.0221s] [ 36%] 2025-12-04T15:22:22.8016211Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_float16 PASSED [0.0080s] [ 36%] 2025-12-04T15:22:22.8016360Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_float64 PASSED [0.0065s] [ 36%] 2025-12-04T15:22:22.8016490Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_bfloat16 PASSED [0.0941s] [ 36%] 2025-12-04T15:22:22.8016623Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_complex128 PASSED [0.0869s] [ 36%] 2025-12-04T15:22:22.8016753Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_complex64 PASSED [0.0829s] [ 36%] 2025-12-04T15:22:22.8016879Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int16 PASSED [0.0818s] [ 36%] 2025-12-04T15:22:22.8017005Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int32 PASSED [0.0815s] [ 36%] 2025-12-04T15:22:22.8017131Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_uint8 PASSED [0.0911s] [ 36%] 2025-12-04T15:22:22.8017274Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_complex128 PASSED [0.2172s] [ 36%] 2025-12-04T15:22:22.8017411Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_float16 PASSED [0.3717s] [ 36%] 2025-12-04T15:22:22.8017548Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_float64 PASSED [0.2147s] [ 36%] 2025-12-04T15:22:22.8017675Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int16 PASSED [0.0439s] [ 36%] 2025-12-04T15:22:22.8017801Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int8 PASSED [0.0442s] [ 36%] 2025-12-04T15:22:22.8017929Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_int32 PASSED [0.0434s] [ 36%] 2025-12-04T15:22:22.8018059Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_bfloat16 PASSED [0.0710s] [ 36%] 2025-12-04T15:22:22.8018185Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_uint8 PASSED [0.0796s] [ 36%] 2025-12-04T15:22:22.8018345Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_complex64 PASSED [0.0160s] [ 36%] 2025-12-04T15:22:22.8018488Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int32 PASSED [0.0141s] [ 36%] 2025-12-04T15:22:22.8018631Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int64 PASSED [0.0141s] [ 36%] 2025-12-04T15:22:22.8018771Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_bfloat16 PASSED [1.5722s] [ 36%] 2025-12-04T15:22:22.8018909Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_bool PASSED [0.0174s] [ 36%] 2025-12-04T15:22:22.8019044Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int16 PASSED [0.0126s] [ 36%] 2025-12-04T15:22:22.8019178Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int32 PASSED [0.0118s] [ 36%] 2025-12-04T15:22:22.8019312Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int8 PASSED [0.0123s] [ 36%] 2025-12-04T15:22:22.8019471Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_complex64 PASSED [1.3223s] [ 36%] 2025-12-04T15:22:22.8019625Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int16 PASSED [0.0126s] [ 36%] 2025-12-04T15:22:22.8019803Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_bfloat16 PASSED [1.3467s] [ 36%] 2025-12-04T15:22:22.8019955Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_complex128 PASSED [0.0235s] [ 36%] 2025-12-04T15:22:22.8020155Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_float32 PASSED [0.0197s] [ 36%] 2025-12-04T15:22:22.8020303Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_uint8 PASSED [0.0183s] [ 36%] 2025-12-04T15:22:22.8020437Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_complex128 PASSED [0.0806s] [ 36%] 2025-12-04T15:22:22.8020569Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_float64 PASSED [0.0661s] [ 36%] 
2025-12-04T15:22:22.8020699Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int64 PASSED [0.3599s] [ 37%] 2025-12-04T15:22:22.8020828Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int8 PASSED [0.3670s] [ 37%] 2025-12-04T15:22:22.8020963Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_complex32 PASSED [0.3131s] [ 37%] 2025-12-04T15:22:22.8021091Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int32 PASSED [0.0769s] [ 37%] 2025-12-04T15:22:22.8021220Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_uint8 PASSED [0.0712s] [ 37%] 2025-12-04T15:22:22.8021361Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_complex32 PASSED [0.0168s] [ 37%] 2025-12-04T15:22:22.8021500Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_float32 PASSED [0.0153s] [ 37%] 2025-12-04T15:22:22.8021636Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float64 PASSED [0.0189s] [ 37%] 2025-12-04T15:22:22.8021774Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int64 PASSED [0.0173s] [ 37%] 2025-12-04T15:22:22.8021909Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_int8 PASSED [0.2768s] [ 37%] 2025-12-04T15:22:22.8022048Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_uint8 PASSED [0.2823s] [ 37%] 2025-12-04T15:22:22.8022202Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_float16 PASSED [0.0486s] [ 37%] 2025-12-04T15:22:22.8022356Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_complex128 PASSED [0.0317s] [ 37%] 2025-12-04T15:22:22.8022508Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_float64 PASSED [0.0315s] [ 37%] 2025-12-04T15:22:22.8022651Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_complex64 PASSED [1.4108s] [ 37%] 2025-12-04T15:22:22.8022791Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int16 PASSED [1.3713s] [ 37%] 2025-12-04T15:22:22.8022929Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_bfloat16 XFAIL [0.0154s] [ 37%] 2025-12-04T15:22:22.8023070Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_float32 XFAIL [0.0103s] [ 37%] 2025-12-04T15:22:22.8023206Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_int64 XFAIL [1.3757s] [ 37%] 2025-12-04T15:22:22.8023341Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_complex32 PASSED [1.3941s] [ 37%] 2025-12-04T15:22:22.8023473Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_float16 PASSED [0.0375s] [ 37%] 2025-12-04T15:22:22.8023601Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int16 PASSED [0.0405s] [ 37%] 2025-12-04T15:22:22.8023756Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int32 PASSED [0.0360s] [ 37%] 2025-12-04T15:22:22.8023903Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_float16 XFAIL [0.0159s] [ 37%] 2025-12-04T15:22:22.8024039Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_float32 XFAIL [0.0123s] [ 37%] 2025-12-04T15:22:22.8024170Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int64 PASSED [1.4129s] [ 37%] 2025-12-04T15:22:22.8024301Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_bool PASSED [0.0674s] [ 37%] 2025-12-04T15:22:22.8024433Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_float16 PASSED [0.0734s] [ 37%] 2025-12-04T15:22:22.8024567Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_float64 PASSED [0.2076s] [ 37%] 2025-12-04T15:22:22.8024699Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int32 PASSED [0.1932s] [ 37%] 2025-12-04T15:22:22.8024832Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int64 PASSED [0.1950s] [ 37%] 2025-12-04T15:22:22.8024967Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_int64 PASSED [0.4389s] [ 37%] 2025-12-04T15:22:22.8025111Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_float32 PASSED [0.4808s] [ 37%] 2025-12-04T15:22:22.8025251Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_complex32 PASSED [0.1336s] [ 37%] 2025-12-04T15:22:22.8025384Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float16 PASSED [0.1350s] [ 37%] 2025-12-04T15:22:22.8025516Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int16 PASSED [0.1260s] [ 37%] 2025-12-04T15:22:22.8025657Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int8 PASSED [0.0127s] [ 37%] 2025-12-04T15:22:22.8025789Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_bool PASSED [0.0560s] [ 37%] 2025-12-04T15:22:22.8025920Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_float64 PASSED [0.0540s] [ 37%] 2025-12-04T15:22:22.8026081Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_complex64 
PASSED [0.0795s] [ 37%] 2025-12-04T15:22:22.8026225Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_float64 PASSED [0.0491s] [ 37%] 2025-12-04T15:22:22.8026367Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int8 PASSED [0.0386s] [ 37%] 2025-12-04T15:22:22.8026515Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_bfloat16 PASSED [0.2517s] [ 37%] 2025-12-04T15:22:22.8026664Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_complex32 PASSED [0.1068s] [ 37%] 2025-12-04T15:22:22.8026807Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int64 PASSED [0.1050s] [ 37%] 2025-12-04T15:22:22.8026941Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_bool PASSED [0.5757s] [ 37%] 2025-12-04T15:22:22.8027083Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_float64 PASSED [0.5257s] [ 37%] 2025-12-04T15:22:22.8027218Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int16 PASSED [0.6097s] [ 37%] 2025-12-04T15:22:22.8027357Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int32 PASSED [0.6048s] [ 37%] 2025-12-04T15:22:22.8027519Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_uint8 PASSED [0.5844s] [ 37%] 2025-12-04T15:22:22.8027649Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_bool PASSED [0.1041s] [ 37%] 2025-12-04T15:22:22.8027798Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_bool PASSED [0.0749s] [ 37%] 2025-12-04T15:22:22.8027949Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_complex64 PASSED [0.0601s] [ 37%] 
2025-12-04T15:22:22.8028082Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_uint8 PASSED [0.0295s] [ 37%] 2025-12-04T15:22:22.8028219Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float32 PASSED [0.0723s] [ 37%] 2025-12-04T15:22:22.8028352Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int32 PASSED [0.0808s] [ 37%] 2025-12-04T15:22:22.8028489Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_bool PASSED [0.2253s] [ 37%] 2025-12-04T15:22:22.8028633Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_complex64 PASSED [0.2203s] [ 37%] 2025-12-04T15:22:22.8028769Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int16 PASSED [0.2158s] [ 37%] 2025-12-04T15:22:22.8028908Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int64 PASSED [0.2168s] [ 37%] 2025-12-04T15:22:22.8029038Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float16 PASSED [0.0481s] [ 37%] 2025-12-04T15:22:22.8029171Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float64 PASSED [0.0474s] [ 37%] 2025-12-04T15:22:22.8029298Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int8 PASSED [0.0450s] [ 37%] 2025-12-04T15:22:22.8029450Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_complex32 PASSED [0.0649s] [ 37%] 2025-12-04T15:22:22.8029591Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_int64 PASSED [0.0610s] [ 37%] 2025-12-04T15:22:22.8029733Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_complex32 PASSED [0.0500s] [ 37%] 2025-12-04T15:22:22.8029879Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float16 PASSED [1.7451s] [ 37%] 2025-12-04T15:22:22.8030020Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float32 PASSED [0.0562s] [ 38%] 2025-12-04T15:22:22.8030302Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float64 PASSED [0.0502s] [ 38%] 2025-12-04T15:22:22.8030439Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int16 PASSED [0.0468s] [ 38%] 2025-12-04T15:22:22.8030577Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int32 PASSED [0.0479s] [ 38%] 2025-12-04T15:22:22.8030723Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int64 PASSED [0.0620s] [ 38%] 2025-12-04T15:22:22.8030860Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int32 PASSED [0.3148s] [ 38%] 2025-12-04T15:22:22.8031013Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_float16 PASSED [1.9197s] [ 38%] 2025-12-04T15:22:22.8031169Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_bfloat16 PASSED [0.4899s] [ 38%] 2025-12-04T15:22:22.8031321Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_float64 PASSED [0.3135s] [ 38%] 2025-12-04T15:22:22.8031475Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_bfloat16 PASSED [0.5326s] [ 38%] 2025-12-04T15:22:22.8031650Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int8 PASSED [0.2802s] [ 38%] 2025-12-04T15:22:22.8031804Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_float64 PASSED [0.0135s] [ 38%] 2025-12-04T15:22:22.8031986Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_complex32 SKIPPED [0.0001s] (Can't check result for empty) [ 38%] 2025-12-04T15:22:22.8032172Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_bool SKIPPED [0.0001s] (Can't check result for empty_like) [ 38%] 2025-12-04T15:22:22.8032381Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_complex128 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 38%] 2025-12-04T15:22:22.8032577Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int64 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 38%] 2025-12-04T15:22:22.8032708Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_bool PASSED [0.2751s] [ 38%] 2025-12-04T15:22:22.8032841Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_complex128 PASSED [0.2978s] [ 38%] 2025-12-04T15:22:22.8032974Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_bool PASSED [0.0271s] [ 38%] 2025-12-04T15:22:22.8033111Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_complex128 PASSED [0.0264s] [ 38%] 2025-12-04T15:22:22.8033244Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_uint8 PASSED [0.0265s] [ 38%] 2025-12-04T15:22:22.8033373Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float64 PASSED [0.0730s] [ 38%] 2025-12-04T15:22:22.8033502Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int16 PASSED [0.0778s] [ 38%] 2025-12-04T15:22:22.8033629Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int64 PASSED [0.0770s] [ 38%] 2025-12-04T15:22:22.8033760Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_bool PASSED [0.1036s] [ 38%] 2025-12-04T15:22:22.8033907Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_float64 PASSED [0.3528s] [ 38%] 2025-12-04T15:22:22.8034039Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int64 PASSED [0.0858s] [ 38%] 2025-12-04T15:22:22.8034179Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_bfloat16 PASSED [0.3596s] [ 38%] 2025-12-04T15:22:22.8034309Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int16 PASSED [0.0773s] [ 38%] 2025-12-04T15:22:22.8034444Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int8 PASSED [0.0720s] [ 38%] 2025-12-04T15:22:22.8034575Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_uint8 PASSED [0.0723s] [ 38%] 2025-12-04T15:22:22.8034708Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int32 PASSED [0.0853s] [ 38%] 2025-12-04T15:22:22.8034836Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int64 PASSED [0.0850s] [ 38%] 2025-12-04T15:22:22.8034966Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_int16 PASSED [0.0854s] [ 38%] 2025-12-04T15:22:22.8035105Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_bfloat16 PASSED [0.0103s] [ 38%] 2025-12-04T15:22:22.8035248Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_complex64 PASSED [0.0090s] [ 38%] 2025-12-04T15:22:22.8035402Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int64 PASSED [0.0088s] [ 38%] 2025-12-04T15:22:22.8035551Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int8 PASSED 
[0.0091s] [ 38%] 2025-12-04T15:22:22.8035701Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_bool PASSED [0.0245s] [ 38%] 2025-12-04T15:22:22.8035847Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_complex128 PASSED [0.0257s] [ 38%] 2025-12-04T15:22:22.8035995Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_complex64 PASSED [0.0248s] [ 38%] 2025-12-04T15:22:22.8036134Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_int64 PASSED [0.0234s] [ 38%] 2025-12-04T15:22:22.8036272Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_bfloat16 PASSED [0.0218s] [ 38%] 2025-12-04T15:22:22.8036413Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_complex128 PASSED [0.0206s] [ 38%] 2025-12-04T15:22:22.8036546Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_int8 PASSED [0.0198s] [ 38%] 2025-12-04T15:22:22.8036679Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_uint8 PASSED [0.0206s] [ 38%] 2025-12-04T15:22:22.8036812Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_bool PASSED [0.0961s] [ 38%] 2025-12-04T15:22:22.8036945Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_float16 PASSED [0.1144s] [ 38%] 2025-12-04T15:22:22.8037089Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_float32 XFAIL [0.0202s] [ 38%] 2025-12-04T15:22:22.8037229Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_float8_e5m2fnuz PASSED [2.2911s] [ 38%] 2025-12-04T15:22:22.8037361Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int32 PASSED [0.3988s] [ 38%] 2025-12-04T15:22:22.8037491Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int8 PASSED [0.4081s] [ 38%] 2025-12-04T15:22:22.8037624Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int32 PASSED [0.0310s] [ 38%] 2025-12-04T15:22:22.8037768Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int64 PASSED [0.0290s] [ 38%] 2025-12-04T15:22:22.8037905Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_complex32 PASSED [0.0216s] [ 38%] 2025-12-04T15:22:22.8038041Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_float32 PASSED [0.0235s] [ 38%] 2025-12-04T15:22:22.8038175Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_float64 PASSED [0.0488s] [ 38%] 2025-12-04T15:22:22.8038311Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int16 PASSED [0.0268s] [ 38%] 2025-12-04T15:22:22.8038441Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int32 PASSED [0.0258s] [ 38%] 2025-12-04T15:22:22.8038579Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_float16 PASSED [0.0297s] [ 38%] 2025-12-04T15:22:22.8038712Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_int64 PASSED [0.0337s] [ 38%] 2025-12-04T15:22:22.8038858Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_bfloat16 PASSED [0.0276s] [ 38%] 2025-12-04T15:22:22.8038997Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_bool PASSED [0.0267s] [ 38%] 2025-12-04T15:22:22.8039138Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_float64 PASSED [0.0249s] [ 38%] 2025-12-04T15:22:22.8039300Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int8 PASSED [0.0240s] [ 38%] 2025-12-04T15:22:22.8039441Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_complex32 PASSED [0.0313s] [ 38%] 2025-12-04T15:22:22.8039588Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_float32 PASSED [0.0283s] [ 38%] 2025-12-04T15:22:22.8039724Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_uint8 PASSED [1.4259s] [ 39%] 2025-12-04T15:22:22.8039860Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int64 PASSED [0.0317s] [ 39%] 2025-12-04T15:22:22.8039992Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_bool PASSED [0.0309s] [ 39%] 2025-12-04T15:22:22.8040174Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex128 PASSED [0.0277s] [ 39%] 2025-12-04T15:22:22.8040311Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int16 PASSED [0.0301s] [ 39%] 2025-12-04T15:22:22.8040448Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int32 PASSED [0.0303s] [ 39%] 2025-12-04T15:22:22.8040584Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_float16 PASSED [0.0580s] [ 39%] 2025-12-04T15:22:22.8040723Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_float64 PASSED [0.0279s] [ 39%] 2025-12-04T15:22:22.8040858Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int32 PASSED [0.0320s] [ 39%] 2025-12-04T15:22:22.8040995Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_float64 PASSED [1.4967s] [ 39%] 2025-12-04T15:22:22.8041130Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int16 PASSED [0.0375s] [ 39%] 2025-12-04T15:22:22.8041278Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_complex32 PASSED [0.0251s] [ 39%] 2025-12-04T15:22:22.8041431Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_bool SKIPPED [0.0001s] (Skipped!) [ 39%] 2025-12-04T15:22:22.8041588Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 39%] 2025-12-04T15:22:22.8041761Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float16 SKIPPED [0.0001s] (Skipped!) [ 39%] 2025-12-04T15:22:22.8041900Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float32 PASSED [0.0383s] [ 39%] 2025-12-04T15:22:22.8042040Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float64 PASSED [0.0468s] [ 39%] 2025-12-04T15:22:22.8042176Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int32 PASSED [0.0476s] [ 39%] 2025-12-04T15:22:22.8042314Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_uint8 PASSED [0.0424s] [ 39%] 2025-12-04T15:22:22.8042459Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex32 PASSED [1.3913s] [ 39%] 2025-12-04T15:22:22.8042599Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex64 PASSED [0.0256s] [ 39%] 2025-12-04T15:22:22.8042736Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int8 PASSED [0.0258s] [ 39%] 2025-12-04T15:22:22.8042876Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_complex64 PASSED [0.0248s] [ 39%] 2025-12-04T15:22:22.8043014Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int16 PASSED [0.0291s] [ 39%] 2025-12-04T15:22:22.8043176Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int8 PASSED [0.0286s] [ 39%] 2025-12-04T15:22:22.8043315Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float32 PASSED [1.3653s] [ 39%] 2025-12-04T15:22:22.8043460Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_bool PASSED [0.0458s] [ 39%] 2025-12-04T15:22:22.8043598Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_float32 PASSED [0.0296s] [ 39%] 2025-12-04T15:22:22.8043730Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int8 PASSED [0.0333s] [ 39%] 2025-12-04T15:22:22.8043864Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_float64 PASSED [0.0748s] [ 39%] 2025-12-04T15:22:22.8043993Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_int16 PASSED [0.0637s] [ 39%] 2025-12-04T15:22:22.8044134Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_bfloat16 PASSED [0.0935s] [ 39%] 2025-12-04T15:22:22.8044268Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int64 PASSED [0.0886s] [ 39%] 2025-12-04T15:22:22.8044402Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_complex64 PASSED [0.0210s] [ 39%] 2025-12-04T15:22:22.8044537Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_int32 PASSED [0.0057s] [ 39%] 2025-12-04T15:22:22.8044674Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_bfloat16 PASSED [0.0058s] [ 39%] 2025-12-04T15:22:22.8044807Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_bool PASSED [0.0057s] [ 39%] 2025-12-04T15:22:22.8044943Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_complex64 PASSED [0.0061s] [ 39%] 2025-12-04T15:22:22.8045080Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_float32 PASSED [0.0058s] [ 39%] 2025-12-04T15:22:22.8045221Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_float32 PASSED [0.4150s] [ 39%] 2025-12-04T15:22:22.8045360Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int8 PASSED [0.3888s] [ 39%] 2025-12-04T15:22:22.8045502Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_float16 PASSED [0.1034s] [ 39%] 2025-12-04T15:22:22.8045634Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int16 PASSED [0.0559s] [ 39%] 2025-12-04T15:22:22.8045764Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int64 PASSED [0.0567s] [ 39%] 2025-12-04T15:22:22.8045897Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_uint8 PASSED [0.0578s] [ 39%] 2025-12-04T15:22:22.8046035Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_bfloat16 PASSED [0.4246s] [ 39%] 2025-12-04T15:22:22.8046166Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_float32 PASSED [0.2756s] [ 39%] 2025-12-04T15:22:22.8046301Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_bfloat16 PASSED [0.4664s] [ 39%] 2025-12-04T15:22:22.8046432Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float64 PASSED [0.1321s] [ 39%] 2025-12-04T15:22:22.8046567Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_float32 
PASSED [0.1042s] [ 39%] 2025-12-04T15:22:22.8046696Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int32 PASSED [0.2604s] [ 39%] 2025-12-04T15:22:22.8046827Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int8 PASSED [0.3926s] [ 39%] 2025-12-04T15:22:22.8046976Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_float32 PASSED [0.2800s] [ 39%] 2025-12-04T15:22:22.8047104Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int16 PASSED [0.2805s] [ 39%] 2025-12-04T15:22:22.8047308Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_int32 SKIPPED [0.0002s] (Expected: geometric is not comparable) [ 39%] 2025-12-04T15:22:22.8047439Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_float16 PASSED [0.3981s] [ 39%] 2025-12-04T15:22:22.8047564Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int16 PASSED [0.2717s] [ 39%] 2025-12-04T15:22:22.8047690Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int32 PASSED [0.2794s] [ 39%] 2025-12-04T15:22:22.8047817Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int64 PASSED [0.2721s] [ 39%] 2025-12-04T15:22:22.8047958Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_float16 PASSED [0.7469s] [ 39%] 2025-12-04T15:22:22.8048097Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_float16 PASSED [0.0100s] [ 39%] 2025-12-04T15:22:22.8048230Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_uint8 PASSED [0.0086s] [ 39%] 2025-12-04T15:22:22.8048370Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex64 PASSED [0.0086s] [ 39%] 2025-12-04T15:22:22.8048500Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_int8 PASSED [0.0082s] [ 39%] 2025-12-04T15:22:22.8048636Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_bfloat16 PASSED [0.4287s] [ 39%] 2025-12-04T15:22:22.8048761Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int16 PASSED [0.0819s] [ 39%] 2025-12-04T15:22:22.8048902Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_igammac_executor_aten_cuda_float32 PASSED [0.2883s] [ 39%] 2025-12-04T15:22:22.8049046Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex128 PASSED [0.0325s] [ 40%] 2025-12-04T15:22:22.8049189Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex32 PASSED [0.0289s] [ 40%] 2025-12-04T15:22:22.8049342Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float32 PASSED [0.0273s] [ 40%] 2025-12-04T15:22:22.8049477Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int16 PASSED [0.0264s] [ 40%] 2025-12-04T15:22:22.8049613Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int8 PASSED [0.0263s] [ 40%] 2025-12-04T15:22:22.8049748Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_bool PASSED [0.0110s] [ 40%] 2025-12-04T15:22:22.8049886Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_bool PASSED [0.0261s] [ 40%] 2025-12-04T15:22:22.8050025Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float32 PASSED [0.0262s] [ 40%] 2025-12-04T15:22:22.8050200Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int16 PASSED [0.0255s] [ 40%] 2025-12-04T15:22:22.8050337Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_bfloat16 PASSED [3.5714s] [ 40%] 2025-12-04T15:22:22.8050480Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_complex128 PASSED [0.8745s] [ 40%] 2025-12-04T15:22:22.8050612Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int16 PASSED [0.9544s] [ 40%] 2025-12-04T15:22:22.8050747Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int64 PASSED [0.9491s] [ 40%] 2025-12-04T15:22:22.8050905Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_uint8 PASSED [0.9356s] [ 40%] 2025-12-04T15:22:22.8051048Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_complex64 PASSED [0.0922s] [ 40%] 2025-12-04T15:22:22.8051197Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int16 PASSED [0.0835s] [ 40%] 2025-12-04T15:22:22.8051329Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int8 PASSED [0.0703s] [ 40%] 2025-12-04T15:22:22.8051466Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_float64 PASSED [0.0654s] [ 40%] 2025-12-04T15:22:22.8051596Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int16 PASSED [0.0574s] [ 40%] 2025-12-04T15:22:22.8051730Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int32 PASSED [0.0580s] [ 40%] 2025-12-04T15:22:22.8051866Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int32 PASSED [0.0794s] [ 40%] 2025-12-04T15:22:22.8052006Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_float64 PASSED [0.0704s] [ 40%] 2025-12-04T15:22:22.8052141Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int32 PASSED [0.0753s] [ 40%] 2025-12-04T15:22:22.8052278Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int8 PASSED [0.0703s] [ 40%] 2025-12-04T15:22:22.8052412Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_float16 PASSED [0.1147s] [ 40%] 2025-12-04T15:22:22.8052546Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float32 PASSED [1.7508s] [ 40%] 2025-12-04T15:22:22.8052675Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_bfloat16 PASSED [0.3904s] [ 40%] 2025-12-04T15:22:22.8052805Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_bool PASSED [0.2627s] [ 40%] 2025-12-04T15:22:22.8052935Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float64 PASSED [0.2817s] [ 40%] 2025-12-04T15:22:22.8053060Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int64 PASSED [0.2684s] [ 40%] 2025-12-04T15:22:22.8053210Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_bfloat16 PASSED [0.1458s] [ 40%] 2025-12-04T15:22:22.8053345Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex64 PASSED [0.1808s] [ 40%] 2025-12-04T15:22:22.8053476Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int8 PASSED [0.3145s] [ 40%] 2025-12-04T15:22:22.8053628Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_complex128 PASSED [0.0387s] [ 40%] 2025-12-04T15:22:22.8053781Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float16 PASSED [0.0319s] [ 40%] 2025-12-04T15:22:22.8053929Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float64 PASSED [0.0314s] [ 40%] 2025-12-04T15:22:22.8054077Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_uint8 PASSED [0.0309s] [ 40%] 2025-12-04T15:22:22.8054231Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_complex64 PASSED [0.3200s] [ 40%] 2025-12-04T15:22:22.8054383Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_float64 PASSED [0.3170s] [ 40%] 2025-12-04T15:22:22.8054529Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_complex128 PASSED [0.7879s] [ 40%] 2025-12-04T15:22:22.8054668Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_float64 PASSED [0.7687s] [ 40%] 2025-12-04T15:22:22.8054840Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_complex64 PASSED [0.1275s] [ 40%] 2025-12-04T15:22:22.8054995Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_float32 PASSED [0.1360s] [ 40%] 2025-12-04T15:22:22.8055145Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_complex128 PASSED [0.1581s] [ 40%] 2025-12-04T15:22:22.8055289Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_float32 PASSED [0.1228s] [ 40%] 2025-12-04T15:22:22.8055426Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int16 XFAIL [0.0057s] [ 40%] 2025-12-04T15:22:22.8055560Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_uint8 XFAIL [1.5446s] [ 40%] 2025-12-04T15:22:22.8055718Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int64 XFAIL [1.4052s] [ 40%] 
2025-12-04T15:22:22.8055851Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_float32 PASSED [1.4664s] [ 40%] 2025-12-04T15:22:22.8055984Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_int32 PASSED [0.0846s] [ 40%] 2025-12-04T15:22:22.8056118Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_bfloat16 PASSED [0.1018s] [ 40%] 2025-12-04T15:22:22.8056256Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_complex64 PASSED [0.0799s] [ 40%] 2025-12-04T15:22:22.8056390Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_float16 PASSED [0.1023s] [ 40%] 2025-12-04T15:22:22.8056522Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_float64 PASSED [0.0663s] [ 40%] 2025-12-04T15:22:22.8056658Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_uint8 PASSED [0.0706s] [ 40%] 2025-12-04T15:22:22.8056790Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_float16 PASSED [0.1127s] [ 40%] 2025-12-04T15:22:22.8056924Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_float32 PASSED [0.0738s] [ 40%] 2025-12-04T15:22:22.8057064Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int32 PASSED [0.0951s] [ 40%] 2025-12-04T15:22:22.8057199Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex32 PASSED [0.5650s] [ 40%] 2025-12-04T15:22:22.8057330Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex64 PASSED [0.0881s] [ 40%] 2025-12-04T15:22:22.8057463Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_float32 PASSED [0.0755s] [ 40%] 2025-12-04T15:22:22.8057592Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int16 PASSED [0.0857s] [ 40%] 2025-12-04T15:22:22.8057721Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int64 PASSED [0.0851s] [ 40%] 2025-12-04T15:22:22.8057873Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_bool PASSED [0.0595s] [ 40%] 2025-12-04T15:22:22.8058035Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex128 PASSED [0.0581s] [ 40%] 2025-12-04T15:22:22.8058193Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_float64 PASSED [0.0538s] [ 40%] 2025-12-04T15:22:22.8058344Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int64 PASSED [0.0580s] [ 40%] 2025-12-04T15:22:22.8058489Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_complex128 XFAIL [0.0152s] [ 41%] 2025-12-04T15:22:22.8058651Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_complex64 XFAIL [1.4606s] [ 41%] 2025-12-04T15:22:22.8058793Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_float16 PASSED [0.9645s] [ 41%] 2025-12-04T15:22:22.8058944Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_float16 PASSED [0.4676s] [ 41%] 2025-12-04T15:22:22.8059084Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int32 PASSED [0.3490s] [ 41%] 2025-12-04T15:22:22.8059220Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int64 PASSED [0.3501s] [ 41%] 2025-12-04T15:22:22.8059365Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_bfloat16 PASSED [0.0968s] [ 41%] 2025-12-04T15:22:22.8059509Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_complex64 PASSED [0.0885s] [ 41%] 2025-12-04T15:22:22.8059654Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_bfloat16 PASSED [0.4824s] [ 41%] 2025-12-04T15:22:22.8059795Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int64 PASSED [0.3519s] [ 41%] 2025-12-04T15:22:22.8059938Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_complex64 PASSED [0.3993s] [ 41%] 2025-12-04T15:22:22.8060077Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_bfloat16 PASSED [2.0152s] [ 41%] 2025-12-04T15:22:22.8060282Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_complex64 PASSED [12.4665s] [ 41%] 2025-12-04T15:22:22.8060444Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_uint8 PASSED [2.8270s] [ 41%] 2025-12-04T15:22:22.8060582Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_bool PASSED [0.0576s] [ 41%] 2025-12-04T15:22:22.8060728Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_complex128 PASSED [0.1467s] [ 41%] 2025-12-04T15:22:22.8060857Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_bool PASSED [0.2608s] [ 41%] 2025-12-04T15:22:22.8060997Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int8 PASSED [0.2772s] [ 41%] 2025-12-04T15:22:22.8061136Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_bool PASSED [0.0274s] [ 41%] 2025-12-04T15:22:22.8061283Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_complex128 PASSED [0.0286s] [ 41%] 2025-12-04T15:22:22.8061423Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_float16 PASSED [0.0285s] [ 41%] 2025-12-04T15:22:22.8061569Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_float32 PASSED [0.0279s] [ 41%] 2025-12-04T15:22:22.8061710Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_float16 PASSED [2.3643s] [ 41%] 2025-12-04T15:22:22.8061866Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_bool PASSED [0.0362s] [ 41%] 2025-12-04T15:22:22.8062024Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_int32 PASSED [0.0364s] [ 41%] 2025-12-04T15:22:22.8062186Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_bfloat16 PASSED [0.0378s] [ 41%] 2025-12-04T15:22:22.8062345Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_uint8 PASSED [0.0355s] [ 41%] 2025-12-04T15:22:22.8062482Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float16 PASSED [0.0244s] [ 41%] 2025-12-04T15:22:22.8062650Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float64 PASSED [0.0240s] [ 41%] 2025-12-04T15:22:22.8062802Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int16 PASSED [0.0226s] [ 41%] 2025-12-04T15:22:22.8062934Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_bool PASSED [0.2609s] [ 41%] 2025-12-04T15:22:22.8063064Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int32 PASSED [0.2768s] [ 41%] 2025-12-04T15:22:22.8063192Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int8 PASSED [0.2706s] [ 41%] 2025-12-04T15:22:22.8063334Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_bfloat16 PASSED [0.2322s] [ 41%] 2025-12-04T15:22:22.8063480Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_complex128 PASSED [0.0618s] [ 41%] 2025-12-04T15:22:22.8063623Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_uint8 PASSED [0.0587s] [ 41%] 2025-12-04T15:22:22.8063759Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_float32 PASSED [0.1100s] [ 41%] 2025-12-04T15:22:22.8063896Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_float64 PASSED [0.1086s] [ 41%] 2025-12-04T15:22:22.8064029Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int32 PASSED [0.1046s] [ 41%] 2025-12-04T15:22:22.8064164Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int8 PASSED [1.5568s] [ 41%] 2025-12-04T15:22:22.8064292Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_bfloat16 PASSED [0.3874s] [ 41%] 2025-12-04T15:22:22.8064420Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_bool PASSED [0.2549s] [ 41%] 2025-12-04T15:22:22.8064545Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int16 PASSED [0.2656s] [ 41%] 2025-12-04T15:22:22.8064674Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int8 PASSED [0.2693s] [ 41%] 2025-12-04T15:22:22.8064806Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex32 PASSED [0.2801s] [ 41%] 2025-12-04T15:22:22.8064954Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex64 PASSED [0.0808s] [ 41%] 2025-12-04T15:22:22.8065143Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_bfloat16 
SKIPPED [0.0002s] (Can't check result for new_empty) [ 41%] 2025-12-04T15:22:22.8065325Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_float16 SKIPPED [0.0001s] (Can't check result for new_empty) [ 41%] 2025-12-04T15:22:22.8065535Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_bool SKIPPED [0.0002s] (Expected: empty_strided is not comparable) [ 41%] 2025-12-04T15:22:22.8065741Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int16 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 41%] 2025-12-04T15:22:22.8065947Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_uint8 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 41%] 2025-12-04T15:22:22.8066236Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_complex128 PASSED [0.0224s] [ 41%] 2025-12-04T15:22:22.8066374Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int16 PASSED [0.0211s] [ 41%] 2025-12-04T15:22:22.8066508Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int32 PASSED [0.0213s] [ 41%] 2025-12-04T15:22:22.8066666Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_uint8 PASSED [0.0216s] [ 41%] 2025-12-04T15:22:22.8066806Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_bfloat16 PASSED [0.0213s] [ 41%] 2025-12-04T15:22:22.8066958Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_complex64 PASSED [0.0216s] [ 41%] 2025-12-04T15:22:22.8067094Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_int16 PASSED [0.0206s] [ 41%] 2025-12-04T15:22:22.8067236Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_complex128 PASSED [0.0206s] [ 41%] 2025-12-04T15:22:22.8067380Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_complex32 PASSED [0.0219s] [ 41%] 2025-12-04T15:22:22.8067549Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_complex64 PASSED [0.0131s] [ 41%] 2025-12-04T15:22:22.8067715Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_int16 PASSED [0.0130s] [ 41%] 2025-12-04T15:22:22.8067868Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_float64 XFAIL [0.0101s] [ 41%] 2025-12-04T15:22:22.8068023Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_bfloat16 PASSED [1.4961s] [ 41%] 2025-12-04T15:22:22.8068172Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_bfloat16 PASSED [0.3024s] [ 41%] 2025-12-04T15:22:22.8068321Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float16 PASSED [0.2900s] [ 41%] 2025-12-04T15:22:22.8068470Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float32 PASSED [0.2305s] [ 41%] 2025-12-04T15:22:22.8068618Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float64 PASSED [0.2445s] [ 42%] 2025-12-04T15:22:22.8068781Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_float32 PASSED [0.1345s] [ 42%] 2025-12-04T15:22:22.8068936Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float32 PASSED [0.2304s] [ 42%] 2025-12-04T15:22:22.8069105Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int16 PASSED [0.2048s] [ 42%] 2025-12-04T15:22:22.8069275Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hinge_embedding_loss_executor_aten_cuda_float64 PASSED [0.1403s] [ 42%] 2025-12-04T15:22:22.8069433Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float32 PASSED [0.0523s] [ 42%] 2025-12-04T15:22:22.8069592Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_bfloat16 PASSED [0.0407s] [ 42%] 2025-12-04T15:22:22.8069763Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_bool PASSED [0.0579s] [ 42%] 2025-12-04T15:22:22.8069938Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex64 PASSED [0.0576s] [ 42%] 2025-12-04T15:22:22.8070162Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_float64 PASSED [0.0537s] [ 42%] 2025-12-04T15:22:22.8070334Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int32 PASSED [0.0577s] [ 42%] 2025-12-04T15:22:22.8070501Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_float32 PASSED [0.2116s] [ 42%] 2025-12-04T15:22:22.8070693Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int32 PASSED [0.1900s] [ 42%] 2025-12-04T15:22:22.8070856Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int8 PASSED [0.1896s] [ 42%] 2025-12-04T15:22:22.8071019Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_float32 
PASSED [0.1674s] [ 42%] 2025-12-04T15:22:22.8071172Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_bfloat16 PASSED [0.0312s] [ 42%] 2025-12-04T15:22:22.8071324Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_float32 PASSED [0.0233s] [ 42%] 2025-12-04T15:22:22.8071475Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_float16 XFAIL [0.0656s] [ 42%] 2025-12-04T15:22:22.8071644Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_bfloat16 PASSED [1.5862s] [ 42%] 2025-12-04T15:22:22.8071813Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_complex64 PASSED [0.0353s] [ 42%] 2025-12-04T15:22:22.8071976Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int32 PASSED [0.0359s] [ 42%] 2025-12-04T15:22:22.8072135Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int16 PASSED [0.0227s] [ 42%] 2025-12-04T15:22:22.8072292Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int32 PASSED [0.0224s] [ 42%] 2025-12-04T15:22:22.8072453Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_uint8 PASSED [0.0218s] [ 42%] 2025-12-04T15:22:22.8072620Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_complex128 PASSED [0.0222s] [ 42%] 2025-12-04T15:22:22.8072782Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int64 PASSED [0.0219s] [ 42%] 2025-12-04T15:22:22.8072941Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int8 PASSED [0.0218s] [ 42%] 2025-12-04T15:22:22.8073117Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_float32 PASSED [0.4577s] [ 42%] 2025-12-04T15:22:22.8073276Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_uint8 PASSED [0.5387s] [ 42%] 2025-12-04T15:22:22.8073427Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_bfloat16 PASSED [0.5618s] [ 42%] 2025-12-04T15:22:22.8073578Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_float32 PASSED [0.4068s] [ 42%] 2025-12-04T15:22:22.8073726Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float32 PASSED [0.1070s] [ 42%] 2025-12-04T15:22:22.8073875Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float16 PASSED [0.1908s] [ 42%] 2025-12-04T15:22:22.8074035Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_bfloat16 PASSED [0.0614s] [ 42%] 2025-12-04T15:22:22.8074195Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_float16 PASSED [0.0611s] [ 42%] 2025-12-04T15:22:22.8074362Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_bfloat16 PASSED [0.0536s] [ 42%] 2025-12-04T15:22:22.8074528Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_float16 PASSED [0.0607s] [ 42%] 2025-12-04T15:22:22.8074705Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softplus_executor_aten_cuda_bfloat16 PASSED [0.2101s] [ 42%] 2025-12-04T15:22:22.8074876Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_complex64 PASSED [0.1113s] [ 42%] 2025-12-04T15:22:22.8075035Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_float64 PASSED [0.0969s] [ 42%] 2025-12-04T15:22:22.8075187Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_int16 PASSED [0.1044s] [ 42%] 2025-12-04T15:22:22.8075341Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_int32 PASSED [0.1050s] [ 42%] 2025-12-04T15:22:22.8075494Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_uint8 PASSED [0.0988s] [ 42%] 2025-12-04T15:22:22.8075650Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_float64 PASSED [0.1118s] [ 42%] 2025-12-04T15:22:22.8075803Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int64 PASSED [0.1173s] [ 42%] 2025-12-04T15:22:22.8075957Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_uint8 PASSED [0.1050s] [ 42%] 2025-12-04T15:22:22.8076125Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_bfloat16 PASSED [0.1598s] [ 42%] 2025-12-04T15:22:22.8076292Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_float16 PASSED [0.1656s] [ 42%] 2025-12-04T15:22:22.8076455Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int8 PASSED [0.0881s] [ 42%] 2025-12-04T15:22:22.8076658Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float32 SKIPPED [0.0002s] (make_traced() doesn't set seed properly!) 
[ 42%] 2025-12-04T15:22:22.8076862Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float64 SKIPPED [0.0001s] (make_traced() doesn't set seed properly!) [ 42%] 2025-12-04T15:22:22.8077072Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_float16 SKIPPED [0.0001s] (make_traced() doesn't set seed properly!) [ 42%] 2025-12-04T15:22:22.8077203Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_bool PASSED [0.0065s] [ 42%] 2025-12-04T15:22:22.8077334Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float64 PASSED [1.6644s] [ 42%] 2025-12-04T15:22:22.8077463Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int64 PASSED [0.0097s] [ 42%] 2025-12-04T15:22:22.8077602Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_bool PASSED [0.1539s] [ 42%] 2025-12-04T15:22:22.8077747Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_complex32 PASSED [0.1620s] [ 42%] 2025-12-04T15:22:22.8077880Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_bool PASSED [0.1164s] [ 42%] 2025-12-04T15:22:22.8078015Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_uint8 PASSED [0.0379s] [ 42%] 2025-12-04T15:22:22.8078144Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_complex32 XFAIL [0.0839s] [ 42%] 2025-12-04T15:22:22.8078269Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_int8 PASSED [1.7036s] [ 42%] 2025-12-04T15:22:22.8078404Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_complex128 PASSED [0.0829s] [ 42%] 2025-12-04T15:22:22.8078561Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_float32 PASSED 
[0.0810s] [ 42%] 2025-12-04T15:22:22.8078696Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_float16 PASSED [0.1069s] [ 42%] 2025-12-04T15:22:22.8078894Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_float16 SKIPPED [0.0002s] (make_traced() doesn't set seed properly!) [ 42%] 2025-12-04T15:22:22.8079028Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_float64 PASSED [1.4303s] [ 42%] 2025-12-04T15:22:22.8079157Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int32 PASSED [0.0125s] [ 42%] 2025-12-04T15:22:22.8079285Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int8 PASSED [0.0084s] [ 43%] 2025-12-04T15:22:22.8079416Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_complex32 PASSED [0.0913s] [ 43%] 2025-12-04T15:22:22.8079550Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_complex64 PASSED [0.0907s] [ 43%] 2025-12-04T15:22:22.8079679Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_float16 PASSED [0.0529s] [ 43%] 2025-12-04T15:22:22.8079805Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int8 PASSED [0.0419s] [ 43%] 2025-12-04T15:22:22.8079950Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_complex64 PASSED [0.0868s] [ 43%] 2025-12-04T15:22:22.8080087Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int32 PASSED [0.0842s] [ 43%] 2025-12-04T15:22:22.8080266Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int8 PASSED [0.0795s] [ 43%] 2025-12-04T15:22:22.8080399Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int8 PASSED [0.2707s] [ 43%] 2025-12-04T15:22:22.8080533Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_bfloat16 PASSED [0.0441s] [ 43%] 2025-12-04T15:22:22.8080665Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_float32 PASSED [0.1298s] [ 43%] 2025-12-04T15:22:22.8080816Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int16 PASSED [0.1284s] [ 43%] 2025-12-04T15:22:22.8080945Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int8 PASSED [0.1283s] [ 43%] 2025-12-04T15:22:22.8081076Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_uint8 PASSED [0.1280s] [ 43%] 2025-12-04T15:22:22.8081215Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float16 PASSED [0.0906s] [ 43%] 2025-12-04T15:22:22.8081354Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float64 PASSED [0.0899s] [ 43%] 2025-12-04T15:22:22.8081490Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_uint8 PASSED [0.0878s] [ 43%] 2025-12-04T15:22:22.8081625Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_bool PASSED [0.1073s] [ 43%] 2025-12-04T15:22:22.8081762Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex64 PASSED [0.1115s] [ 43%] 2025-12-04T15:22:22.8081894Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int32 PASSED [0.1075s] [ 43%] 2025-12-04T15:22:22.8082025Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int64 PASSED [0.1058s] [ 43%] 2025-12-04T15:22:22.8082156Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_uint8 PASSED [0.0974s] [ 43%] 2025-12-04T15:22:22.8082301Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_complex64 PASSED [0.0525s] [ 43%] 2025-12-04T15:22:22.8082441Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float32 PASSED [0.0513s] [ 43%] 2025-12-04T15:22:22.8082582Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_uint8 PASSED [0.0507s] [ 43%] 2025-12-04T15:22:22.8082717Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_complex64 PASSED [0.0780s] [ 43%] 2025-12-04T15:22:22.8082848Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float16 PASSED [0.1034s] [ 43%] 2025-12-04T15:22:22.8082979Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float32 PASSED [0.0671s] [ 43%] 2025-12-04T15:22:22.8083113Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float64 PASSED [0.0670s] [ 43%] 2025-12-04T15:22:22.8083243Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int16 PASSED [0.0566s] [ 43%] 2025-12-04T15:22:22.8083379Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_bfloat16 PASSED [0.1140s] [ 43%] 2025-12-04T15:22:22.8083513Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_complex64 PASSED [0.0883s] [ 43%] 2025-12-04T15:22:22.8083645Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float64 PASSED [0.0748s] [ 43%] 2025-12-04T15:22:22.8083778Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_complex128 PASSED [0.3180s] [ 43%] 2025-12-04T15:22:22.8083908Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_complex64 PASSED [0.3199s] [ 43%] 2025-12-04T15:22:22.8084040Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float16 PASSED 
[0.4767s] [ 43%] 2025-12-04T15:22:22.8084169Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float32 PASSED [0.3016s] [ 43%] 2025-12-04T15:22:22.8084300Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float64 PASSED [0.3031s] [ 43%] 2025-12-04T15:22:22.8084442Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_int16 PASSED [0.0295s] [ 43%] 2025-12-04T15:22:22.8084581Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int32 PASSED [1.5758s] [ 43%] 2025-12-04T15:22:22.8084706Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int64 PASSED [0.0596s] [ 43%] 2025-12-04T15:22:22.8084831Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int8 PASSED [0.0532s] [ 43%] 2025-12-04T15:22:22.8084967Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex64 PASSED [0.1231s] [ 43%] 2025-12-04T15:22:22.8085104Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float32 PASSED [0.0633s] [ 43%] 2025-12-04T15:22:22.8085241Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float64 PASSED [0.0630s] [ 43%] 2025-12-04T15:22:22.8085374Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int8 PASSED [0.0525s] [ 43%] 2025-12-04T15:22:22.8085499Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_bool PASSED [0.0934s] [ 43%] 2025-12-04T15:22:22.8085630Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_complex32 PASSED [0.3329s] [ 43%] 2025-12-04T15:22:22.8085758Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_float16 PASSED [0.0979s] [ 43%] 2025-12-04T15:22:22.8085882Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int16 PASSED [0.0764s] [ 43%] 2025-12-04T15:22:22.8086007Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_uint8 PASSED [0.0711s] [ 43%] 2025-12-04T15:22:22.8086157Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_bfloat16 PASSED [0.5781s] [ 43%] 2025-12-04T15:22:22.8086297Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_uint8 PASSED [0.3654s] [ 43%] 2025-12-04T15:22:22.8086431Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_complex128 PASSED [0.0812s] [ 43%] 2025-12-04T15:22:22.8086582Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float16 PASSED [0.0440s] [ 43%] 2025-12-04T15:22:22.8086731Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float32 PASSED [0.0433s] [ 43%] 2025-12-04T15:22:22.8086879Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_float64 PASSED [0.4245s] [ 43%] 2025-12-04T15:22:22.8087026Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_bool PASSED [0.1031s] [ 43%] 2025-12-04T15:22:22.8087173Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_float32 PASSED [0.2196s] [ 43%] 2025-12-04T15:22:22.8087315Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_uint8 PASSED [0.3906s] [ 43%] 2025-12-04T15:22:22.8087454Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_int8 PASSED [0.0808s] [ 43%] 2025-12-04T15:22:22.8087593Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_float64 PASSED [0.3590s] [ 43%] 2025-12-04T15:22:22.8087729Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_float16 PASSED [0.1092s] [ 43%] 2025-12-04T15:22:22.8087867Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_uint8 PASSED [0.0766s] [ 43%] 2025-12-04T15:22:22.8088007Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_float32 PASSED [0.0706s] [ 43%] 2025-12-04T15:22:22.8088156Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_float32 PASSED [0.2840s] [ 43%] 2025-12-04T15:22:22.8088299Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_uint8 PASSED [0.4869s] [ 43%] 2025-12-04T15:22:22.8088475Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_float16 PASSED [0.0589s] [ 44%] 2025-12-04T15:22:22.8088638Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int16 PASSED [0.0587s] [ 44%] 2025-12-04T15:22:22.8088796Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int8 PASSED [0.0576s] [ 44%] 2025-12-04T15:22:22.8088942Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_float32 PASSED [0.1993s] [ 44%] 2025-12-04T15:22:22.8089081Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int32 PASSED [0.1986s] [ 44%] 2025-12-04T15:22:22.8089252Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_float64 PASSED [0.3662s] [ 44%] 2025-12-04T15:22:22.8089419Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_uint8 PASSED [0.3471s] [ 44%] 2025-12-04T15:22:22.8089587Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int16 PASSED [0.3932s] [ 44%] 2025-12-04T15:22:22.8089753Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_uint8 PASSED [0.3644s] [ 44%] 2025-12-04T15:22:22.8089896Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float16 PASSED [1.8249s] [ 44%] 2025-12-04T15:22:22.8090059Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float64 PASSED [0.1354s] [ 44%] 2025-12-04T15:22:22.8090243Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int32 PASSED [0.1346s] [ 44%] 2025-12-04T15:22:22.8090383Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int64 PASSED [0.1345s] [ 44%] 2025-12-04T15:22:22.8090521Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int16 PASSED [0.3024s] [ 44%] 2025-12-04T15:22:22.8090684Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_complex64 PASSED [0.0434s] [ 44%] 2025-12-04T15:22:22.8090842Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float16 PASSED [0.0432s] [ 44%] 2025-12-04T15:22:22.8091004Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float64 PASSED [0.0386s] [ 44%] 2025-12-04T15:22:22.8091158Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_int8 PASSED [0.0427s] [ 44%] 2025-12-04T15:22:22.8091315Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_uint8 PASSED [0.0425s] [ 44%] 2025-12-04T15:22:22.8091471Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_bool PASSED [0.1025s] [ 44%] 2025-12-04T15:22:22.8091630Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int16 PASSED [0.0845s] [ 44%] 2025-12-04T15:22:22.8091776Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_bool PASSED [0.6816s] [ 44%] 2025-12-04T15:22:22.8091926Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_float32 PASSED [0.6187s] [ 44%] 2025-12-04T15:22:22.8092064Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int64 PASSED [0.3957s] [ 44%] 2025-12-04T15:22:22.8092202Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_uint8 PASSED [0.3902s] [ 44%] 2025-12-04T15:22:22.8092368Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_bfloat16 PASSED [0.0162s] [ 44%] 2025-12-04T15:22:22.8092514Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_float64 PASSED [0.0166s] [ 44%] 2025-12-04T15:22:22.8092649Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_complex64 PASSED [0.3019s] [ 44%] 2025-12-04T15:22:22.8092776Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int8 PASSED [0.0730s] [ 44%] 2025-12-04T15:22:22.8092908Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int8 PASSED [0.0644s] [ 44%] 2025-12-04T15:22:22.8093051Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_bfloat16 PASSED [0.0248s] [ 44%] 2025-12-04T15:22:22.8093199Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_complex128 PASSED [0.0239s] [ 44%] 2025-12-04T15:22:22.8093343Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_complex64 PASSED [0.0227s] [ 44%] 2025-12-04T15:22:22.8093486Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_float32 PASSED [0.0234s] [ 44%] 2025-12-04T15:22:22.8093623Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int16 PASSED [0.0224s] [ 44%] 2025-12-04T15:22:22.8093760Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int32 PASSED [0.0224s] [ 44%] 2025-12-04T15:22:22.8093926Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_int32 PASSED [0.0161s] [ 44%] 2025-12-04T15:22:22.8094077Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_complex32 PASSED [0.0154s] [ 44%] 2025-12-04T15:22:22.8094237Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_float32 PASSED [0.0138s] [ 44%] 2025-12-04T15:22:22.8094382Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int32 PASSED [0.0131s] [ 44%] 2025-12-04T15:22:22.8094526Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int64 PASSED [0.0139s] [ 44%] 2025-12-04T15:22:22.8094669Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_uint8 PASSED [0.0135s] [ 44%] 2025-12-04T15:22:22.8094805Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex32 PASSED [0.0394s] [ 44%] 2025-12-04T15:22:22.8094939Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex64 PASSED [0.0371s] [ 44%] 2025-12-04T15:22:22.8095072Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_float16 PASSED [0.0360s] [ 44%] 2025-12-04T15:22:22.8095201Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int32 PASSED [0.0348s] [ 44%] 2025-12-04T15:22:22.8095331Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int64 PASSED [0.0349s] [ 44%] 2025-12-04T15:22:22.8095463Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_complex128 PASSED [0.0402s] [ 44%] 2025-12-04T15:22:22.8095598Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float32 PASSED [0.0633s] [ 44%] 2025-12-04T15:22:22.8095731Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_complex128 PASSED [1.3161s] [ 44%] 2025-12-04T15:22:22.8095863Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_float64 PASSED [0.3959s] [ 44%] 2025-12-04T15:22:22.8095993Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float16 PASSED [0.4885s] [ 44%] 2025-12-04T15:22:22.8096118Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int32 PASSED [0.2912s] [ 44%] 2025-12-04T15:22:22.8096254Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_uint8 PASSED [0.2864s] [ 44%] 2025-12-04T15:22:22.8096392Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float16 PASSED [0.0536s] [ 44%] 2025-12-04T15:22:22.8096524Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_bfloat16 PASSED [0.0096s] [ 44%] 2025-12-04T15:22:22.8096653Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_bool PASSED [1.5950s] [ 44%] 2025-12-04T15:22:22.8096787Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_complex64 PASSED [0.0138s] [ 44%] 2025-12-04T15:22:22.8096919Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_float16 PASSED 
[0.0093s] [ 44%] 2025-12-04T15:22:22.8097048Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_int16 PASSED [1.4217s] [ 44%] 2025-12-04T15:22:22.8097176Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_int8 PASSED [0.0134s] [ 44%] 2025-12-04T15:22:22.8097304Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_uint8 PASSED [0.0093s] [ 44%] 2025-12-04T15:22:22.8097429Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float16 PASSED [1.4249s] [ 44%] 2025-12-04T15:22:22.8097559Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex32 PASSED [0.4503s] [ 44%] 2025-12-04T15:22:22.8097704Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int8 PASSED [0.0730s] [ 44%] 2025-12-04T15:22:22.8097841Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_uint8 PASSED [0.0717s] [ 44%] 2025-12-04T15:22:22.8097974Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_complex32 PASSED [0.1182s] [ 45%] 2025-12-04T15:22:22.8098105Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_float32 PASSED [0.0676s] [ 45%] 2025-12-04T15:22:22.8098233Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int32 PASSED [0.0774s] [ 45%] 2025-12-04T15:22:22.8098370Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_bool PASSED [0.0345s] [ 45%] 2025-12-04T15:22:22.8098513Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_float64 PASSED [0.0363s] [ 45%] 2025-12-04T15:22:22.8098653Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_uint8 PASSED [0.0346s] [ 45%] 2025-12-04T15:22:22.8098781Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_float16 PASSED [0.0625s] [ 45%] 2025-12-04T15:22:22.8098906Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_int64 PASSED [0.0619s] [ 45%] 2025-12-04T15:22:22.8099039Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex32 PASSED [0.0059s] [ 45%] 2025-12-04T15:22:22.8099170Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_uint8 PASSED [0.0052s] [ 45%] 2025-12-04T15:22:22.8099311Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_bool PASSED [0.0213s] [ 45%] 2025-12-04T15:22:22.8099453Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_int64 PASSED [0.0206s] [ 45%] 2025-12-04T15:22:22.8099582Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int16 PASSED [0.0510s] [ 45%] 2025-12-04T15:22:22.8099708Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int8 PASSED [0.0569s] [ 45%] 2025-12-04T15:22:22.8099845Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_indices_executor_aten_cuda_int64 PASSED [0.1060s] [ 45%] 2025-12-04T15:22:22.8099987Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_complex64 PASSED [0.0513s] [ 45%] 2025-12-04T15:22:22.8100150Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float16 PASSED [0.0506s] [ 45%] 2025-12-04T15:22:22.8100277Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int8 PASSED [0.0492s] [ 45%] 2025-12-04T15:22:22.8100414Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_indices_executor_aten_cuda_int32 PASSED [0.1025s] [ 45%] 2025-12-04T15:22:22.8100553Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_indices_executor_aten_cuda_int64 
PASSED [0.1013s] [ 45%] 2025-12-04T15:22:22.8100694Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex32 XFAIL [0.0787s] [ 45%] 2025-12-04T15:22:22.8100836Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex64 PASSED [1.8043s] [ 45%] 2025-12-04T15:22:22.8100973Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_uint8 PASSED [0.3980s] [ 45%] 2025-12-04T15:22:22.8101105Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_float16 PASSED [0.1030s] [ 45%] 2025-12-04T15:22:22.8101234Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int16 PASSED [0.0567s] [ 45%] 2025-12-04T15:22:22.8101376Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int8 PASSED [0.0531s] [ 45%] 2025-12-04T15:22:22.8101530Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_bfloat16 PASSED [0.0513s] [ 45%] 2025-12-04T15:22:22.8101678Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_int64 PASSED [0.0496s] [ 45%] 2025-12-04T15:22:22.8101815Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_int8 PASSED [0.0493s] [ 45%] 2025-12-04T15:22:22.8101946Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_bool PASSED [0.0383s] [ 45%] 2025-12-04T15:22:22.8102078Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float16 PASSED [0.0394s] [ 45%] 2025-12-04T15:22:22.8102214Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int32 PASSED [0.0240s] [ 45%] 2025-12-04T15:22:22.8102351Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_bool PASSED [0.0599s] [ 45%] 2025-12-04T15:22:22.8102493Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex64 PASSED [0.0627s] [ 45%] 2025-12-04T15:22:22.8102630Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int16 PASSED [0.0591s] [ 45%] 2025-12-04T15:22:22.8102760Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_bool PASSED [0.0442s] [ 45%] 2025-12-04T15:22:22.8102893Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float32 PASSED [0.0492s] [ 45%] 2025-12-04T15:22:22.8103021Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int8 PASSED [0.0474s] [ 45%] 2025-12-04T15:22:22.8103164Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_float32 PASSED [0.0239s] [ 45%] 2025-12-04T15:22:22.8103310Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_float64 PASSED [0.0246s] [ 45%] 2025-12-04T15:22:22.8103451Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_uint8 PASSED [0.0237s] [ 45%] 2025-12-04T15:22:22.8103592Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_complex32 PASSED [0.0208s] [ 45%] 2025-12-04T15:22:22.8103742Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_uint8 PASSED [0.0182s] [ 45%] 2025-12-04T15:22:22.8103874Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_complex128 PASSED [0.0327s] [ 45%] 2025-12-04T15:22:22.8104003Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_float16 PASSED [0.0479s] [ 45%] 2025-12-04T15:22:22.8104134Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_complex64 PASSED [0.0091s] [ 45%] 2025-12-04T15:22:22.8104272Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_bfloat16 PASSED [0.0896s] [ 45%] 2025-12-04T15:22:22.8104402Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_bool PASSED [0.0789s] [ 45%] 2025-12-04T15:22:22.8104537Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_complex64 PASSED [0.0826s] [ 45%] 2025-12-04T15:22:22.8104668Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int64 PASSED [0.0789s] [ 45%] 2025-12-04T15:22:22.8104805Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_bfloat16 PASSED [0.0196s] [ 45%] 2025-12-04T15:22:22.8104936Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_int64 PASSED [0.0190s] [ 45%] 2025-12-04T15:22:22.8105063Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_bool PASSED [0.0965s] [ 45%] 2025-12-04T15:22:22.8105214Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex32 PASSED [0.1141s] [ 45%] 2025-12-04T15:22:22.8105349Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_bfloat16 PASSED [0.0087s] [ 45%] 2025-12-04T15:22:22.8105492Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int16 PASSED [0.0085s] [ 45%] 2025-12-04T15:22:22.8105625Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float32 PASSED [0.0104s] [ 45%] 2025-12-04T15:22:22.8105755Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_int16 PASSED [1.4931s] [ 45%] 2025-12-04T15:22:22.8105888Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_complex64 PASSED [0.0688s] [ 45%] 2025-12-04T15:22:22.8106017Z 
test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_int64 PASSED [0.0492s] [ 45%] 2025-12-04T15:22:22.8106149Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float16 PASSED [0.7884s] [ 45%] 2025-12-04T15:22:22.8106281Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float64 PASSED [0.6236s] [ 45%] 2025-12-04T15:22:22.8106413Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_float32 PASSED [0.0069s] [ 45%] 2025-12-04T15:22:22.8106545Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_float64 PASSED [0.0063s] [ 45%] 2025-12-04T15:22:22.8106673Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int32 PASSED [0.0067s] [ 45%] 2025-12-04T15:22:22.8106802Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_uint8 PASSED [0.0060s] [ 45%] 2025-12-04T15:22:22.8106930Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_float32 PASSED [0.0436s] [ 45%] 2025-12-04T15:22:22.8107058Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_uint8 PASSED [0.0338s] [ 46%] 2025-12-04T15:22:22.8107182Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float16 PASSED [1.5337s] [ 46%] 2025-12-04T15:22:22.8107306Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float32 PASSED [1.4585s] [ 46%] 2025-12-04T15:22:22.8107442Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int16 PASSED [1.4777s] [ 46%] 2025-12-04T15:22:22.8107563Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int64 PASSED [1.4668s] [ 46%] 2025-12-04T15:22:22.8107683Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_bool PASSED [1.4674s] [ 46%] 2025-12-04T15:22:22.8107809Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_complex64 PASSED [1.4887s] [ 46%] 2025-12-04T15:22:22.8107933Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float16 PASSED [1.4566s] [ 46%] 2025-12-04T15:22:22.8108056Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float64 PASSED [1.4781s] [ 46%] 2025-12-04T15:22:22.8108176Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int8 PASSED [1.5030s] [ 46%] 2025-12-04T15:22:22.8108308Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_complex128 PASSED [1.4920s] [ 46%] 2025-12-04T15:22:22.8108435Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int16 PASSED [1.4772s] [ 46%] 2025-12-04T15:22:22.8108558Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int32 PASSED [1.4918s] [ 46%] 2025-12-04T15:22:22.8108682Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_uint8 PASSED [1.4832s] [ 46%] 2025-12-04T15:22:22.8108810Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_bfloat16 PASSED [1.4763s] [ 46%] 2025-12-04T15:22:22.8108955Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_bool PASSED [1.4738s] [ 46%] 2025-12-04T15:22:22.8109097Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex128 PASSED [1.5172s] [ 46%] 2025-12-04T15:22:22.8109218Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_int16 PASSED [1.4609s] [ 46%] 2025-12-04T15:22:22.8109342Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_int64 PASSED [1.4827s] [ 46%] 2025-12-04T15:22:22.8109462Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int16 PASSED [1.4801s] [ 46%] 2025-12-04T15:22:22.8109584Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int32 PASSED [1.4961s] [ 46%] 2025-12-04T15:22:22.8109704Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_uint8 PASSED [1.4738s] [ 46%] 2025-12-04T15:22:22.8109825Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_bool PASSED [1.4645s] [ 46%] 2025-12-04T15:22:22.8109946Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_float16 PASSED [1.4729s] [ 46%] 2025-12-04T15:22:22.8110070Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_float64 PASSED [1.4827s] [ 46%] 2025-12-04T15:22:22.8110229Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int8 PASSED [1.4726s] [ 46%] 2025-12-04T15:22:22.8110358Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_complex32 PASSED [1.5280s] [ 46%] 2025-12-04T15:22:22.8110484Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float16 PASSED [1.4859s] [ 46%] 2025-12-04T15:22:22.8110607Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float64 PASSED [1.4764s] [ 46%] 2025-12-04T15:22:22.8110729Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_int16 PASSED [1.4754s] [ 46%] 2025-12-04T15:22:22.8110848Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_bool PASSED [1.4858s] [ 46%] 2025-12-04T15:22:22.8110975Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_complex64 PASSED [1.5236s] [ 46%] 2025-12-04T15:22:22.8111109Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float32 PASSED [1.4957s] [ 46%] 2025-12-04T15:22:22.8111230Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int16 PASSED [1.4752s] [ 46%] 2025-12-04T15:22:22.8111348Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int32 PASSED [1.4909s] [ 46%] 2025-12-04T15:22:22.8111468Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_bool PASSED [1.4874s] [ 46%] 2025-12-04T15:22:22.8111585Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int32 PASSED [1.4685s] [ 46%] 2025-12-04T15:22:22.8111713Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_complex32 PASSED [1.5453s] [ 46%] 2025-12-04T15:22:22.8111833Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_uint8 PASSED [1.4779s] [ 46%] 2025-12-04T15:22:22.8111962Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_complex128 PASSED [1.5099s] [ 46%] 2025-12-04T15:22:22.8112090Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_complex64 PASSED [1.5228s] [ 46%] 2025-12-04T15:22:22.8112212Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int16 PASSED [1.4638s] [ 46%] 2025-12-04T15:22:22.8112334Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int64 PASSED [1.4774s] [ 46%] 2025-12-04T15:22:22.8112439Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_bool PASSED [1.4890s] [ 46%] 2025-12-04T15:22:22.8112583Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex128 PASSED [1.4871s] [ 46%] 2025-12-04T15:22:22.8112690Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int16 PASSED [1.5039s] [ 46%] 2025-12-04T15:22:22.8112809Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int8 PASSED [1.4862s] [ 46%] 2025-12-04T15:22:22.8112920Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_complex128 PASSED [1.5030s] [ 46%] 2025-12-04T15:22:22.8113029Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_float16 PASSED [1.5116s] [ 46%] 
2025-12-04T15:22:22.8113133Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int32 PASSED [1.4652s] [ 46%] 2025-12-04T15:22:22.8113239Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_uint8 PASSED [1.4645s] [ 46%] 2025-12-04T15:22:22.8113344Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int64 PASSED [1.4874s] [ 46%] 2025-12-04T15:22:22.8113449Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_bool PASSED [0.1149s] [ 46%] 2025-12-04T15:22:22.8113561Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_bfloat16 PASSED [0.0757s] [ 46%] 2025-12-04T15:22:22.8113672Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_float32 PASSED [0.0446s] [ 46%] 2025-12-04T15:22:22.8113782Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_bfloat16 PASSED [1.4880s] [ 46%] 2025-12-04T15:22:22.8113892Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_float16 PASSED [1.5300s] [ 46%] 2025-12-04T15:22:22.8113998Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int16 PASSED [1.4652s] [ 46%] 2025-12-04T15:22:22.8114101Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int8 PASSED [1.4470s] [ 46%] 2025-12-04T15:22:22.8114213Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_int16 PASSED [1.4138s] [ 46%] 2025-12-04T15:22:22.8114325Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_int32 PASSED [1.4046s] [ 46%] 2025-12-04T15:22:22.8114434Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_bfloat16 PASSED [1.4352s] [ 46%] 2025-12-04T15:22:22.8114539Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int64 PASSED [1.4292s] [ 46%] 2025-12-04T15:22:22.8114654Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_complex64 PASSED [1.4667s] [ 46%] 2025-12-04T15:22:22.8114776Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float32 PASSED [1.4629s] [ 46%] 2025-12-04T15:22:22.8114881Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_uint8 PASSED [1.4326s] [ 46%] 2025-12-04T15:22:22.8114991Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_bfloat16 PASSED [1.4389s] [ 46%] 2025-12-04T15:22:22.8115098Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float32 PASSED [1.4254s] [ 47%] 2025-12-04T15:22:22.8115204Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int32 PASSED [1.4377s] [ 47%] 2025-12-04T15:22:22.8115307Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_bool PASSED [1.4602s] [ 47%] 2025-12-04T15:22:22.8115431Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_complex128 PASSED [1.4611s] [ 47%] 2025-12-04T15:22:22.8115551Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_float16 PASSED [1.4412s] [ 47%] 2025-12-04T15:22:22.8115671Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_float32 PASSED [1.4693s] [ 47%] 2025-12-04T15:22:22.8115781Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_bool PASSED [1.4493s] [ 47%] 2025-12-04T15:22:22.8115900Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_complex64 PASSED [1.4536s] [ 47%] 2025-12-04T15:22:22.8116011Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_float32 PASSED [1.4566s] [ 47%] 2025-12-04T15:22:22.8116143Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int8 PASSED [1.4357s] [ 47%] 2025-12-04T15:22:22.8116272Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int64 PASSED [1.4323s] [ 47%] 2025-12-04T15:22:22.8116409Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_complex64 PASSED [1.4601s] [ 47%] 2025-12-04T15:22:22.8116521Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_complex64 PASSED [1.4889s] [ 47%] 2025-12-04T15:22:22.8116628Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int64 PASSED [1.4769s] [ 47%] 2025-12-04T15:22:22.8116732Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_uint8 PASSED [1.4726s] [ 47%] 2025-12-04T15:22:22.8116842Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_bfloat16 PASSED [1.4871s] [ 47%] 2025-12-04T15:22:22.8116951Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_float64 PASSED [1.4525s] [ 47%] 2025-12-04T15:22:22.8117061Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float32 PASSED [0.1087s] [ 47%] 2025-12-04T15:22:22.8117164Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int8 PASSED [0.1165s] [ 47%] 2025-12-04T15:22:22.8117270Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_bool PASSED [0.0368s] [ 47%] 2025-12-04T15:22:22.8117381Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_complex128 PASSED [0.2468s] [ 47%] 2025-12-04T15:22:22.8117488Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_float64 PASSED [1.4692s] [ 47%] 2025-12-04T15:22:22.8117592Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int8 PASSED [1.4845s] [ 47%] 2025-12-04T15:22:22.8117702Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_bfloat16 PASSED [1.4684s] [ 47%] 2025-12-04T15:22:22.8117813Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_complex128 PASSED [1.4805s] [ 47%] 2025-12-04T15:22:22.8117923Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_bool PASSED [1.4482s] [ 47%] 2025-12-04T15:22:22.8118042Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex128 PASSED [1.4679s] [ 47%] 2025-12-04T15:22:22.8118160Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex32 PASSED [1.4589s] [ 47%] 2025-12-04T15:22:22.8118280Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_int8 PASSED [1.4307s] [ 47%] 2025-12-04T15:22:22.8118390Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int16 PASSED [1.4716s] [ 47%] 2025-12-04T15:22:22.8118500Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int8 PASSED [1.4610s] [ 47%] 2025-12-04T15:22:22.8118610Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int8 PASSED [0.1096s] [ 47%] 2025-12-04T15:22:22.8118724Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_uint8 PASSED [0.1104s] [ 47%] 2025-12-04T15:22:22.8118845Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int16 PASSED [0.1111s] [ 47%] 2025-12-04T15:22:22.8118958Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_uint8 PASSED [0.0244s] [ 47%] 2025-12-04T15:22:22.8119081Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int32 PASSED [0.1126s] [ 47%] 2025-12-04T15:22:22.8119205Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_uint8 PASSED [0.0956s] [ 47%] 2025-12-04T15:22:22.8119322Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_complex64 PASSED [0.0185s] [ 47%] 2025-12-04T15:22:22.8119434Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float16 PASSED [1.4768s] [ 47%] 2025-12-04T15:22:22.8119556Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_float16 PASSED [1.4495s] [ 47%] 2025-12-04T15:22:22.8119695Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int8 PASSED [1.4881s] [ 47%] 2025-12-04T15:22:22.8119817Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_complex128 PASSED [1.4790s] [ 47%] 
2025-12-04T15:22:22.8119955Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_float16 PASSED [1.4642s] [ 47%] 2025-12-04T15:22:22.8120070Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int64 PASSED [1.4408s] [ 47%] 2025-12-04T15:22:22.8120219Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_float64 PASSED [0.1836s] [ 47%] 2025-12-04T15:22:22.8120329Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int16 PASSED [1.6082s] [ 47%] 2025-12-04T15:22:22.8120438Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_complex128 PASSED [0.0164s] [ 47%] 2025-12-04T15:22:22.8120544Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_int64 PASSED [0.0132s] [ 47%] 2025-12-04T15:22:22.8120656Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_float16 PASSED [0.0090s] [ 47%] 2025-12-04T15:22:22.8120760Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int8 PASSED [0.0218s] [ 47%] 2025-12-04T15:22:22.8120867Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_uint8 PASSED [0.0216s] [ 47%] 2025-12-04T15:22:22.8120976Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_bfloat16 PASSED [0.0180s] [ 47%] 2025-12-04T15:22:22.8121081Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int16 PASSED [1.4789s] [ 47%] 2025-12-04T15:22:22.8121187Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int64 PASSED [1.4648s] [ 47%] 2025-12-04T15:22:22.8121292Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int8 PASSED [1.4991s] [ 47%] 2025-12-04T15:22:22.8121401Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_bfloat16 PASSED [1.5209s] [ 47%] 2025-12-04T15:22:22.8121516Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_bfloat16 PASSED [0.1022s] [ 47%] 2025-12-04T15:22:22.8121624Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int8 PASSED [0.0888s] [ 47%] 2025-12-04T15:22:22.8121734Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int32 PASSED [0.0896s] [ 47%] 2025-12-04T15:22:22.8121862Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int64 PASSED [0.0935s] [ 47%] 2025-12-04T15:22:22.8121972Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_uint8 PASSED [0.0875s] [ 47%] 2025-12-04T15:22:22.8122080Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float64 PASSED [1.5076s] [ 47%] 2025-12-04T15:22:22.8122186Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int32 PASSED [1.4984s] [ 47%] 2025-12-04T15:22:22.8122299Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int16 PASSED [1.4714s] [ 47%] 2025-12-04T15:22:22.8122416Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int32 PASSED [1.4664s] [ 47%] 2025-12-04T15:22:22.8122526Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex128 PASSED [1.5174s] [ 47%] 2025-12-04T15:22:22.8122639Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex32 PASSED [1.4925s] [ 47%] 2025-12-04T15:22:22.8122749Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex64 PASSED [1.5039s] [ 48%] 2025-12-04T15:22:22.8122856Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_float16 PASSED [1.4661s] [ 48%] 2025-12-04T15:22:22.8122970Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_bool PASSED [1.4788s] [ 48%] 2025-12-04T15:22:22.8123091Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_complex32 PASSED [1.5327s] [ 48%] 2025-12-04T15:22:22.8123223Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float32 PASSED [1.5010s] [ 48%] 2025-12-04T15:22:22.8123354Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float64 PASSED [1.4806s] [ 48%] 2025-12-04T15:22:22.8123481Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int64 PASSED [1.4947s] [ 48%] 2025-12-04T15:22:22.8123605Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_complex128 PASSED [0.0739s] [ 48%] 2025-12-04T15:22:22.8123729Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_complex64 PASSED [0.0712s] [ 48%] 2025-12-04T15:22:22.8123848Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_float64 PASSED [0.0729s] [ 48%] 2025-12-04T15:22:22.8123965Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int32 PASSED [0.0709s] [ 48%] 2025-12-04T15:22:22.8124080Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_bfloat16 PASSED [0.0323s] [ 48%] 2025-12-04T15:22:22.8124194Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int16 PASSED [0.0319s] [ 48%] 2025-12-04T15:22:22.8124304Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_bool PASSED [0.2005s] [ 48%] 2025-12-04T15:22:22.8124416Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_float32 PASSED [0.1701s] [ 48%] 2025-12-04T15:22:22.8124528Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_float64 PASSED [0.2117s] [ 48%] 2025-12-04T15:22:22.8124637Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_complex64 PASSED [0.4348s] [ 48%] 2025-12-04T15:22:22.8124747Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex32 PASSED [0.0623s] [ 48%] 2025-12-04T15:22:22.8124865Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_bfloat16 PASSED [1.4941s] [ 48%] 2025-12-04T15:22:22.8124980Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_int64 PASSED [1.4728s] [ 48%] 
2025-12-04T15:22:22.8125092Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_bfloat16 PASSED [1.5186s] [ 48%] 2025-12-04T15:22:22.8125200Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_int64 PASSED [1.5139s] [ 48%] 2025-12-04T15:22:22.8125311Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_bfloat16 PASSED [1.5156s] [ 48%] 2025-12-04T15:22:22.8125430Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float16 PASSED [1.4853s] [ 48%] 2025-12-04T15:22:22.8125539Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int16 PASSED [1.4978s] [ 48%] 2025-12-04T15:22:22.8125647Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int64 PASSED [1.4785s] [ 48%] 2025-12-04T15:22:22.8125751Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_bool PASSED [1.4708s] [ 48%] 2025-12-04T15:22:22.8125859Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_float32 PASSED [1.4624s] [ 48%] 2025-12-04T15:22:22.8125965Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_int64 PASSED [1.4673s] [ 48%] 2025-12-04T15:22:22.8126080Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_bfloat16 PASSED [1.4997s] [ 48%] 2025-12-04T15:22:22.8126190Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int8 PASSED [1.5230s] [ 48%] 2025-12-04T15:22:22.8126305Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_bool PASSED [1.4706s] [ 48%] 2025-12-04T15:22:22.8126427Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex128 PASSED [1.4655s] [ 48%] 2025-12-04T15:22:22.8126548Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex64 PASSED [1.4965s] [ 48%] 2025-12-04T15:22:22.8126666Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_float32 PASSED [1.4863s] [ 48%] 2025-12-04T15:22:22.8126789Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float16 PASSED [1.4581s] [ 48%] 2025-12-04T15:22:22.8126908Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_int16 PASSED [1.4752s] [ 48%] 2025-12-04T15:22:22.8127049Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_complex128 PASSED [1.4724s] [ 48%] 2025-12-04T15:22:22.8127161Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_bfloat16 PASSED [1.4946s] [ 48%] 2025-12-04T15:22:22.8127269Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int32 PASSED [1.4825s] [ 48%] 2025-12-04T15:22:22.8127378Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int64 PASSED [1.4801s] [ 48%] 2025-12-04T15:22:22.8127498Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int16 PASSED [0.2462s] [ 48%] 2025-12-04T15:22:22.8127618Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int64 PASSED [0.2383s] [ 48%] 2025-12-04T15:22:22.8127738Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int8 PASSED [0.2158s] [ 48%] 2025-12-04T15:22:22.8127857Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_uint8 PASSED [0.1102s] [ 48%] 2025-12-04T15:22:22.8127982Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_float32 PASSED [0.1084s] [ 48%] 2025-12-04T15:22:22.8128104Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int32 PASSED [1.6211s] [ 48%] 2025-12-04T15:22:22.8128226Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float32 PASSED [0.1199s] [ 48%] 2025-12-04T15:22:22.8128345Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int16 PASSED [1.5834s] [ 48%] 2025-12-04T15:22:22.8128463Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int32 
PASSED [0.1094s] [ 48%] 2025-12-04T15:22:22.8128572Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_bfloat16 PASSED [0.0042s] [ 48%] 2025-12-04T15:22:22.8128683Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_bfloat16 PASSED [1.4837s] [ 48%] 2025-12-04T15:22:22.8128796Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_complex128 PASSED [1.4675s] [ 48%] 2025-12-04T15:22:22.8128906Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_float64 PASSED [1.4717s] [ 48%] 2025-12-04T15:22:22.8129023Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int64 PASSED [1.4668s] [ 48%] 2025-12-04T15:22:22.8129130Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_bool PASSED [1.4746s] [ 48%] 2025-12-04T15:22:22.8129240Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_complex64 PASSED [1.4753s] [ 48%] 2025-12-04T15:22:22.8129349Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float32 PASSED [1.4582s] [ 48%] 2025-12-04T15:22:22.8129456Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_uint8 PASSED [1.4797s] [ 48%] 2025-12-04T15:22:22.8129565Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_bfloat16 PASSED [0.0060s] [ 48%] 2025-12-04T15:22:22.8129675Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_complex128 PASSED [0.0044s] [ 48%] 2025-12-04T15:22:22.8129784Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_complex32 PASSED [0.0042s] [ 48%] 2025-12-04T15:22:22.8129894Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_complex64 PASSED [0.0042s] [ 48%] 2025-12-04T15:22:22.8130001Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_uint8 PASSED [0.0041s] [ 48%] 2025-12-04T15:22:22.8130150Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_bfloat16 PASSED [0.0187s] [ 48%] 2025-12-04T15:22:22.8130260Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int16 PASSED [0.0177s] [ 48%] 2025-12-04T15:22:22.8130397Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int32 PASSED [1.5117s] [ 48%] 2025-12-04T15:22:22.8130506Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int64 PASSED [1.4921s] [ 49%] 2025-12-04T15:22:22.8130635Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int32 PASSED [1.4740s] [ 49%] 2025-12-04T15:22:22.8130749Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int8 PASSED [1.4832s] [ 49%] 2025-12-04T15:22:22.8130858Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_complex64 PASSED [0.1290s] [ 49%] 2025-12-04T15:22:22.8130964Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float16 PASSED [0.1301s] [ 49%] 2025-12-04T15:22:22.8131067Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int16 PASSED [0.0978s] [ 49%] 2025-12-04T15:22:22.8131169Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int32 PASSED [0.0984s] [ 49%] 2025-12-04T15:22:22.8131280Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_bfloat16 XFAIL [0.0039s] [ 49%] 2025-12-04T15:22:22.8131389Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_complex128 XFAIL [1.5147s] [ 49%] 2025-12-04T15:22:22.8131498Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_complex64 XFAIL [1.4919s] [ 49%] 2025-12-04T15:22:22.8131601Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_bool PASSED [1.5321s] [ 49%] 2025-12-04T15:22:22.8131707Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_float32 PASSED [0.0246s] [ 49%] 2025-12-04T15:22:22.8131811Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int16 PASSED [1.5002s] [ 49%] 2025-12-04T15:22:22.8131915Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int32 PASSED 
[1.5009s] [ 49%] 2025-12-04T15:22:22.8132020Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int8 PASSED [1.5088s] [ 49%] 2025-12-04T15:22:22.8132124Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_bool PASSED [1.5298s] [ 49%] 2025-12-04T15:22:22.8132235Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_complex64 PASSED [1.9409s] [ 49%] 2025-12-04T15:22:22.8132341Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_float32 PASSED [0.1456s] [ 49%] 2025-12-04T15:22:22.8132446Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int32 PASSED [0.0229s] [ 49%] 2025-12-04T15:22:22.8132562Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int64 PASSED [0.0218s] [ 49%] 2025-12-04T15:22:22.8132670Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_bfloat16 PASSED [1.5243s] [ 49%] 2025-12-04T15:22:22.8132779Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_complex128 PASSED [1.8726s] [ 49%] 2025-12-04T15:22:22.8132896Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_complex64 PASSED [1.4840s] [ 49%] 2025-12-04T15:22:22.8133005Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int8 PASSED [1.4900s] [ 49%] 2025-12-04T15:22:22.8133119Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_int16 PASSED [1.4860s] [ 49%] 2025-12-04T15:22:22.8133231Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_int32 PASSED [1.4839s] [ 49%] 2025-12-04T15:22:22.8133343Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_int64 PASSED [1.5131s] [ 49%] 2025-12-04T15:22:22.8133450Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_bool PASSED [1.4939s] [ 49%] 2025-12-04T15:22:22.8133563Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_complex128 PASSED [1.4814s] [ 49%] 2025-12-04T15:22:22.8133674Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_complex64 PASSED [1.4836s] [ 49%] 2025-12-04T15:22:22.8133782Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float16 PASSED [1.4839s] [ 49%] 2025-12-04T15:22:22.8133899Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int8 PASSED [1.4924s] [ 49%] 2025-12-04T15:22:22.8134026Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_complex128 PASSED [1.5189s] [ 49%] 2025-12-04T15:22:22.8134146Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float16 PASSED [1.4962s] [ 49%] 2025-12-04T15:22:22.8134253Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float32 PASSED [1.5128s] [ 49%] 2025-12-04T15:22:22.8134373Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_bfloat16 PASSED [1.5079s] [ 49%] 2025-12-04T15:22:22.8134478Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float64 PASSED [0.0904s] [ 49%] 2025-12-04T15:22:22.8134590Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float8_e4m3fn PASSED [0.0706s] [ 49%] 2025-12-04T15:22:22.8134699Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float8_e5m2 PASSED [0.0704s] [ 49%] 2025-12-04T15:22:22.8134806Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int8 PASSED [0.0660s] [ 49%] 2025-12-04T15:22:22.8134910Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_uint8 PASSED [0.0672s] [ 49%] 2025-12-04T15:22:22.8135024Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_complex32 PASSED [0.0072s] [ 49%] 2025-12-04T15:22:22.8135131Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int16 PASSED [0.0072s] [ 49%] 2025-12-04T15:22:22.8135241Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int8 PASSED [0.0053s] [ 49%] 2025-12-04T15:22:22.8135351Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_complex32 PASSED [0.0071s] [ 49%] 2025-12-04T15:22:22.8135463Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_complex64 PASSED [0.0068s] [ 49%] 2025-12-04T15:22:22.8135572Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_float64 PASSED [0.0070s] [ 49%] 2025-12-04T15:22:22.8135681Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int16 PASSED [0.0074s] [ 49%] 2025-12-04T15:22:22.8135788Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int8 PASSED [0.0064s] [ 49%] 2025-12-04T15:22:22.8135896Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_bool PASSED [0.0081s] [ 49%] 2025-12-04T15:22:22.8136006Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float64 PASSED [0.0093s] [ 49%] 2025-12-04T15:22:22.8136124Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int32 PASSED [1.5335s] [ 49%] 2025-12-04T15:22:22.8136243Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_complex64 PASSED [1.5174s] [ 49%] 2025-12-04T15:22:22.8136355Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_int16 PASSED [1.4940s] [ 49%] 2025-12-04T15:22:22.8136472Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_complex128 PASSED [1.5370s] [ 49%] 2025-12-04T15:22:22.8136587Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_complex32 PASSED [1.4908s] [ 49%] 2025-12-04T15:22:22.8136698Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_float32 PASSED [1.5122s] [ 49%] 2025-12-04T15:22:22.8136807Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int16 PASSED [1.5109s] [ 49%] 2025-12-04T15:22:22.8136917Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int32 PASSED [1.5016s] [ 49%] 2025-12-04T15:22:22.8137025Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int64 PASSED [1.5095s] [ 49%] 2025-12-04T15:22:22.8137134Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int8 PASSED [1.4856s] [ 49%] 2025-12-04T15:22:22.8137244Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float64 PASSED [1.4981s] [ 49%] 2025-12-04T15:22:22.8137352Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_bool PASSED [1.4928s] [ 49%] 2025-12-04T15:22:22.8137490Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_complex64 PASSED [1.4859s] [ 49%] 2025-12-04T15:22:22.8137602Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_float16 PASSED [1.5070s] [ 49%] 2025-12-04T15:22:22.8137723Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_int16 PASSED [1.5217s] [ 49%] 2025-12-04T15:22:22.8137837Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex32 PASSED [1.5252s] [ 49%] 2025-12-04T15:22:22.8137952Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex64 PASSED [1.5061s] [ 50%] 2025-12-04T15:22:22.8138060Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_uint8 PASSED [1.4915s] [ 50%] 2025-12-04T15:22:22.8138173Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex128 PASSED [1.5124s] [ 50%] 2025-12-04T15:22:22.8138282Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_uint8 PASSED [0.0115s] [ 50%] 2025-12-04T15:22:22.8138392Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_bool PASSED [0.0093s] [ 50%] 2025-12-04T15:22:22.8138501Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_uint8 PASSED [0.0073s] [ 50%] 2025-12-04T15:22:22.8138615Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_bool PASSED [0.0059s] [ 50%] 2025-12-04T15:22:22.8138736Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_complex64 PASSED [0.0055s] [ 50%] 2025-12-04T15:22:22.8138849Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float64 PASSED [0.0207s] [ 50%] 2025-12-04T15:22:22.8138958Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_int16 PASSED [1.5182s] [ 50%] 2025-12-04T15:22:22.8139065Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_bool PASSED [1.5088s] [ 50%] 2025-12-04T15:22:22.8139177Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_float16 PASSED [1.5262s] [ 50%] 2025-12-04T15:22:22.8139287Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_float64 PASSED [1.5226s] [ 50%] 2025-12-04T15:22:22.8139399Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_float32 PASSED [1.5264s] [ 50%] 2025-12-04T15:22:22.8139514Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_complex32 PASSED [1.4936s] [ 50%] 2025-12-04T15:22:22.8139636Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_uint8 PASSED [1.6578s] [ 50%] 2025-12-04T15:22:22.8139745Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int64 PASSED [0.0116s] [ 50%] 2025-12-04T15:22:22.8139853Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int8 PASSED [0.0069s] [ 50%] 2025-12-04T15:22:22.8139969Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_complex32 PASSED [0.0200s] [ 50%] 2025-12-04T15:22:22.8140081Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int16 PASSED [0.0085s] [ 50%] 2025-12-04T15:22:22.8140235Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int32 PASSED [1.5079s] [ 50%] 2025-12-04T15:22:22.8140346Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_uint8 PASSED [1.5018s] [ 50%] 2025-12-04T15:22:22.8140454Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int16 PASSED [1.5082s] [ 50%] 2025-12-04T15:22:22.8140565Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_float16 PASSED [1.5187s] [ 50%] 2025-12-04T15:22:22.8140676Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_float32 PASSED [1.5038s] [ 50%] 2025-12-04T15:22:22.8140784Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_int32 PASSED [1.5079s] [ 50%] 2025-12-04T15:22:22.8140892Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex64 PASSED [1.5510s] [ 50%] 2025-12-04T15:22:22.8141033Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_complex32 PASSED [0.0380s] [ 50%] 2025-12-04T15:22:22.8141145Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_complex64 PASSED [0.0354s] [ 50%] 2025-12-04T15:22:22.8141268Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int32 PASSED [0.0344s] [ 50%] 2025-12-04T15:22:22.8141377Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int8 PASSED [0.0345s] [ 50%] 2025-12-04T15:22:22.8141485Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_complex64 PASSED [0.0076s] [ 50%] 2025-12-04T15:22:22.8141592Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_uint8 PASSED [0.0072s] [ 50%] 2025-12-04T15:22:22.8141697Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_bool PASSED [0.0034s] [ 50%] 2025-12-04T15:22:22.8141804Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int16 PASSED [0.0034s] [ 50%] 2025-12-04T15:22:22.8141912Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int8 PASSED [0.0033s] [ 50%] 2025-12-04T15:22:22.8142023Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_complex64 PASSED [0.0028s] [ 50%] 2025-12-04T15:22:22.8142129Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_int8 PASSED [0.0032s] [ 50%] 2025-12-04T15:22:22.8142248Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_complex64 PASSED [0.1591s] [ 50%] 2025-12-04T15:22:22.8142359Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int32 PASSED [0.1344s] [ 50%] 2025-12-04T15:22:22.8142471Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_uint8 PASSED [0.1235s] [ 50%] 2025-12-04T15:22:22.8142577Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int32 PASSED [0.0234s] [ 50%] 2025-12-04T15:22:22.8142695Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float64 PASSED [0.6647s] [ 50%] 2025-12-04T15:22:22.8142804Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float16 PASSED [0.1119s] [ 50%] 2025-12-04T15:22:22.8142910Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float32 PASSED [0.0880s] [ 50%] 2025-12-04T15:22:22.8143016Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int64 PASSED [0.0886s] [ 50%] 2025-12-04T15:22:22.8143134Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_uint8 PASSED [0.0867s] [ 50%] 2025-12-04T15:22:22.8143238Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int64 PASSED [1.6133s] [ 50%] 2025-12-04T15:22:22.8143341Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_uint8 PASSED [1.6152s] [ 50%] 2025-12-04T15:22:22.8143450Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_bfloat16 PASSED [1.5813s] [ 50%] 2025-12-04T15:22:22.8143555Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_float16 PASSED [1.5470s] [ 50%] 2025-12-04T15:22:22.8143661Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int64 PASSED [1.6034s] [ 50%] 2025-12-04T15:22:22.8143761Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_bool PASSED 
[1.6120s] [ 50%] 2025-12-04T15:22:22.8143867Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_float32 PASSED [0.1057s] [ 50%] 2025-12-04T15:22:22.8143971Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int64 PASSED [0.0965s] [ 50%] 2025-12-04T15:22:22.8144080Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int8 XFAIL [0.0029s] [ 50%] 2025-12-04T15:22:22.8144189Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_uint8 XFAIL [1.5351s] [ 50%] 2025-12-04T15:22:22.8144293Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_float16 PASSED [1.6450s] [ 50%] 2025-12-04T15:22:22.8144396Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int32 PASSED [1.6228s] [ 50%] 2025-12-04T15:22:22.8144529Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_float16 PASSED [0.1842s] [ 50%] 2025-12-04T15:22:22.8144642Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int16 PASSED [0.1747s] [ 50%] 2025-12-04T15:22:22.8144762Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_float64 PASSED [1.5429s] [ 50%] 2025-12-04T15:22:22.8144876Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_complex128 PASSED [1.5273s] [ 50%] 2025-12-04T15:22:22.8144983Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_float16 PASSED [1.5275s] [ 50%] 2025-12-04T15:22:22.8145090Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int32 PASSED [1.5366s] [ 50%] 2025-12-04T15:22:22.8145198Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_float64 PASSED [1.6131s] [ 50%] 2025-12-04T15:22:22.8145304Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_bfloat16 PASSED [1.5571s] [ 50%] 2025-12-04T15:22:22.8145423Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_complex128 PASSED [1.5297s] [ 51%] 2025-12-04T15:22:22.8145533Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int16 PASSED [1.5418s] [ 51%] 2025-12-04T15:22:22.8145643Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int32 PASSED [1.5339s] [ 51%] 2025-12-04T15:22:22.8145752Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_uint8 PASSED [1.5297s] [ 51%] 2025-12-04T15:22:22.8145868Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_bfloat16 PASSED [1.5154s] [ 51%] 2025-12-04T15:22:22.8145977Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_bool PASSED [1.5291s] [ 51%] 2025-12-04T15:22:22.8146092Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float16 PASSED [1.5489s] [ 51%] 2025-12-04T15:22:22.8146205Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float64 PASSED [1.5233s] [ 51%] 2025-12-04T15:22:22.8146317Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int16 PASSED [1.5373s] [ 51%] 2025-12-04T15:22:22.8146429Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_bool PASSED [1.5348s] [ 51%] 2025-12-04T15:22:22.8146551Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex32 PASSED [1.5311s] [ 51%] 2025-12-04T15:22:22.8146675Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_int16 PASSED [1.5320s] [ 51%] 2025-12-04T15:22:22.8146789Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_uint8 PASSED [1.5472s] [ 51%] 2025-12-04T15:22:22.8146898Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_float16 PASSED [0.3867s] [ 51%] 2025-12-04T15:22:22.8147007Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_float64 PASSED [0.2948s] [ 51%] 2025-12-04T15:22:22.8147116Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int16 PASSED [0.2829s] [ 51%] 2025-12-04T15:22:22.8147226Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int64 PASSED [0.2866s] [ 51%] 2025-12-04T15:22:22.8147335Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_uint8 PASSED [0.2835s] [ 51%] 2025-12-04T15:22:22.8147446Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_float16 PASSED [0.0272s] [ 51%] 2025-12-04T15:22:22.8147557Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_float32 PASSED [0.0266s] [ 51%] 2025-12-04T15:22:22.8147667Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int16 PASSED [0.0246s] [ 51%] 2025-12-04T15:22:22.8147775Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_bfloat16 PASSED [0.0278s] [ 51%] 2025-12-04T15:22:22.8147885Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_complex32 PASSED [0.0810s] [ 51%] 2025-12-04T15:22:22.8148013Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float16 PASSED [0.0241s] [ 51%] 2025-12-04T15:22:22.8148128Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int8 PASSED [0.0214s] [ 51%] 2025-12-04T15:22:22.8148247Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_bfloat16 PASSED [0.0302s] [ 51%] 2025-12-04T15:22:22.8148357Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_complex128 PASSED [0.0398s] [ 51%] 2025-12-04T15:22:22.8148467Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_complex64 PASSED [0.0394s] [ 51%] 2025-12-04T15:22:22.8148575Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int32 PASSED [0.0214s] [ 51%] 2025-12-04T15:22:22.8148684Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_uint8 PASSED [1.5867s] [ 51%] 2025-12-04T15:22:22.8148791Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_bool PASSED [1.5529s] [ 51%] 2025-12-04T15:22:22.8148903Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float64 PASSED [1.5517s] [ 51%] 2025-12-04T15:22:22.8149012Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_bfloat16 PASSED [1.5493s] [ 51%] 2025-12-04T15:22:22.8149125Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_complex128 PASSED [1.5885s] [ 51%] 2025-12-04T15:22:22.8149234Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_istft_cuda_complex128 PASSED [0.6806s] [ 51%] 2025-12-04T15:22:22.8149341Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_float16 XFAIL [0.0034s] [ 51%] 2025-12-04T15:22:22.8149442Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_bool PASSED [0.0926s] [ 51%] 2025-12-04T15:22:22.8149546Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int32 PASSED [0.0962s] [ 51%] 2025-12-04T15:22:22.8149648Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int64 PASSED [0.0960s] [ 51%] 2025-12-04T15:22:22.8149756Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_float16 PASSED [1.5618s] [ 51%] 2025-12-04T15:22:22.8149864Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_uint8 PASSED [1.5886s] [ 51%] 2025-12-04T15:22:22.8149982Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_bfloat16 PASSED [1.5521s] [ 51%] 2025-12-04T15:22:22.8150143Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float64 PASSED [1.5593s] [ 51%] 2025-12-04T15:22:22.8150277Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_bfloat16 PASSED [1.6172s] [ 51%] 2025-12-04T15:22:22.8150392Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_float64 PASSED [1.6830s] [ 51%] 2025-12-04T15:22:22.8150516Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_complex128 PASSED [0.0385s] [ 51%] 2025-12-04T15:22:22.8150641Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_float32 PASSED [0.1045s] [ 51%] 2025-12-04T15:22:22.8150767Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_float64 PASSED [0.1046s] [ 51%] 2025-12-04T15:22:22.8150906Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_complex64 PASSED [0.1222s] [ 51%] 2025-12-04T15:22:22.8151040Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_float32 PASSED [0.1183s] [ 51%] 2025-12-04T15:22:22.8151171Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int16 PASSED [0.1164s] [ 51%] 2025-12-04T15:22:22.8151299Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int8 PASSED [0.1116s] [ 51%] 2025-12-04T15:22:22.8151409Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_bfloat16 PASSED [1.5524s] [ 51%] 2025-12-04T15:22:22.8151518Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_float16 PASSED [1.5670s] [ 51%] 2025-12-04T15:22:22.8151636Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_uint8 PASSED [1.5510s] [ 51%] 2025-12-04T15:22:22.8151757Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_float64 PASSED [1.5670s] [ 51%] 2025-12-04T15:22:22.8151874Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_bool PASSED [1.5765s] [ 51%] 2025-12-04T15:22:22.8151982Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_complex64 PASSED [1.6043s] [ 51%] 2025-12-04T15:22:22.8152087Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float64 PASSED [1.5672s] [ 51%] 2025-12-04T15:22:22.8152190Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int8 PASSED [1.5665s] [ 51%] 2025-12-04T15:22:22.8152304Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_bfloat16 PASSED [1.5486s] [ 51%] 
2025-12-04T15:22:22.8152417Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float32 PASSED [1.5737s] [ 51%] 2025-12-04T15:22:22.8152545Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int32 PASSED [1.5591s] [ 51%] 2025-12-04T15:22:22.8152671Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_uint8 PASSED [1.5335s] [ 51%] 2025-12-04T15:22:22.8152786Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_float64 PASSED [1.5438s] [ 51%] 2025-12-04T15:22:22.8152904Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_complex32 PASSED [0.4529s] [ 51%] 2025-12-04T15:22:22.8153020Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_complex64 PASSED [0.3392s] [ 51%] 2025-12-04T15:22:22.8153139Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_complex128 PASSED [0.1298s] [ 52%] 2025-12-04T15:22:22.8153257Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_complex64 PASSED [0.1194s] [ 52%] 2025-12-04T15:22:22.8153369Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_int64 PASSED [0.0960s] [ 52%] 2025-12-04T15:22:22.8153482Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_uint8 PASSED [0.0985s] [ 52%] 2025-12-04T15:22:22.8153600Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_bfloat16 PASSED [0.0249s] [ 52%] 2025-12-04T15:22:22.8153722Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_complex128 PASSED [0.0413s] [ 52%] 2025-12-04T15:22:22.8153847Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_float32 PASSED [1.5815s] [ 52%] 2025-12-04T15:22:22.8153960Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int8 PASSED [1.5765s] [ 52%] 2025-12-04T15:22:22.8156637Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_float16 PASSED [0.1275s] [ 52%] 2025-12-04T15:22:22.8156762Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_uint8 PASSED [1.6345s] [ 52%] 2025-12-04T15:22:22.8156880Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_bfloat16 PASSED [0.1229s] [ 52%] 2025-12-04T15:22:22.8156994Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_bool PASSED [0.0815s] [ 52%] 2025-12-04T15:22:22.8157111Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_float64 PASSED [1.6694s] [ 52%] 2025-12-04T15:22:22.8157225Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_complex64 PASSED [0.1341s] [ 52%] 2025-12-04T15:22:22.8157370Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_complex128 PASSED [0.5778s] [ 52%] 2025-12-04T15:22:22.8157501Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_int16 PASSED [0.6052s] [ 52%] 2025-12-04T15:22:22.8157631Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_int8 PASSED [0.2489s] [ 52%] 2025-12-04T15:22:22.8157761Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_uint8 PASSED [0.2010s] [ 52%] 2025-12-04T15:22:22.8157909Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_float64 PASSED [0.0221s] [ 52%] 2025-12-04T15:22:22.8158019Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int32 PASSED [0.0156s] [ 52%] 2025-12-04T15:22:22.8158139Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_float64 PASSED [0.1010s] [ 52%] 2025-12-04T15:22:22.8158260Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_complex32 PASSED [0.0086s] [ 52%] 2025-12-04T15:22:22.8158375Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_float16 PASSED 
[0.0082s] [ 52%] 2025-12-04T15:22:22.8158485Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int32 PASSED [0.0082s] [ 52%] 2025-12-04T15:22:22.8158596Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int64 PASSED [0.0079s] [ 52%] 2025-12-04T15:22:22.8158708Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_uint8 PASSED [0.0082s] [ 52%] 2025-12-04T15:22:22.8158817Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_bool PASSED [1.6669s] [ 52%] 2025-12-04T15:22:22.8158928Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float64 PASSED [1.6556s] [ 52%] 2025-12-04T15:22:22.8159037Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_complex64 PASSED [1.5656s] [ 52%] 2025-12-04T15:22:22.8159174Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_float64 PASSED [1.5601s] [ 52%] 2025-12-04T15:22:22.8159307Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int64 PASSED [1.5778s] [ 52%] 2025-12-04T15:22:22.8159422Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_complex32 PASSED [1.5726s] [ 52%] 2025-12-04T15:22:22.8159531Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_float64 PASSED [1.5457s] [ 52%] 2025-12-04T15:22:22.8159645Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex32 PASSED [0.1403s] [ 52%] 2025-12-04T15:22:22.8159749Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int8 PASSED [0.0970s] [ 52%] 2025-12-04T15:22:22.8159861Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_bool PASSED [1.5783s] [ 52%] 2025-12-04T15:22:22.8159973Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_float64 PASSED [1.6297s] [ 52%] 2025-12-04T15:22:22.8160141Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int8 PASSED [1.5887s] [ 52%] 
2025-12-04T15:22:22.8160250Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_bfloat16 PASSED [1.5951s] [ 52%] 2025-12-04T15:22:22.8160357Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_bool PASSED [1.5675s] [ 52%] 2025-12-04T15:22:22.8160465Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_float32 PASSED [1.5798s] [ 52%] 2025-12-04T15:22:22.8160573Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_bfloat16 PASSED [0.1116s] [ 52%] 2025-12-04T15:22:22.8160682Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_complex128 PASSED [0.1127s] [ 52%] 2025-12-04T15:22:22.8160789Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float32 PASSED [0.0959s] [ 52%] 2025-12-04T15:22:22.8160894Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float64 PASSED [0.1017s] [ 52%] 2025-12-04T15:22:22.8161001Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_bfloat16 PASSED [1.5886s] [ 52%] 2025-12-04T15:22:22.8161110Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_complex32 PASSED [1.5898s] [ 52%] 2025-12-04T15:22:22.8161213Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int8 PASSED [1.5798s] [ 52%] 2025-12-04T15:22:22.8161329Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex32 PASSED [1.5516s] [ 52%] 2025-12-04T15:22:22.8161479Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex64 PASSED [1.6028s] [ 52%] 2025-12-04T15:22:22.8161592Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_float16 PASSED [1.5672s] [ 52%] 2025-12-04T15:22:22.8161716Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_float64 PASSED [1.5639s] [ 52%] 2025-12-04T15:22:22.8161836Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int32 PASSED [1.5783s] [ 52%] 2025-12-04T15:22:22.8161957Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int8 PASSED [1.5825s] [ 52%]
2025-12-04T15:22:22.8162065Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_bool PASSED [1.5609s] [ 52%]
2025-12-04T15:22:22.8162177Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_complex32 PASSED [1.5836s] [ 52%]
2025-12-04T15:22:22.8162287Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_float64 PASSED [1.5771s] [ 52%]
2025-12-04T15:22:22.8162380Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_uint8
2025-12-04T15:22:22.8162384Z
2025-12-04T15:22:22.8162553Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_ops/test_ops-c76d91645d4bc776.xml -
2025-12-04T15:22:22.8162618Z !!!!!!!!!!!!!!!!!!!!!!!!!!!!!! KeyboardInterrupt !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
2025-12-04T15:22:22.8162775Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py:2653: KeyboardInterrupt
2025-12-04T15:22:22.8162855Z (to show a full traceback on KeyboardInterrupt use --full-trace)
2025-12-04T15:22:22.8162933Z ========== 2965 passed, 502 skipped, 69 xfailed in 1792.87s (0:29:52) ==========
2025-12-04T15:22:22.8162984Z Command took >30min, returning 124
2025-12-04T15:22:22.8163022Z Got exit code 124
2025-12-04T15:22:22.8163063Z Retrying single test...
2025-12-04T15:22:22.8163185Z Test results will be stored in test-reports/python-pytest/test_ops/test_ops-25e70fbc47c362ea.xml
2025-12-04T15:22:22.8163247Z ============================= test session starts ==============================
2025-12-04T15:22:22.8163361Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python
2025-12-04T15:22:22.8163404Z cachedir: .pytest_cache
2025-12-04T15:22:22.8163564Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow]
2025-12-04T15:22:22.8163636Z rootdir: /var/lib/jenkins/pytorch
2025-12-04T15:22:22.8163678Z configfile: pytest.ini
2025-12-04T15:22:22.8163845Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0
2025-12-04T15:22:22.8163928Z collecting ... collected 33666 items / 6690 deselected / 26976 selected
2025-12-04T15:22:22.8164101Z stepcurrent: skipping 3536 already run items. Running only test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_uint8
2025-12-04T15:22:22.8164148Z Running 1 items in this shard
2025-12-04T15:22:22.8164151Z
2025-12-04T15:22:22.8164265Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_uint8 PASSED [1.0056s] [100%]
2025-12-04T15:22:22.8164268Z
2025-12-04T15:22:22.8164429Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_ops/test_ops-25e70fbc47c362ea.xml -
2025-12-04T15:22:22.8164497Z ====================== 1 passed, 6690 deselected in 3.06s ======================
2025-12-04T15:22:22.8164535Z Got exit code 0
2025-12-04T15:22:22.8164618Z Test succeeded in new process, continuing with the rest of the tests
2025-12-04T15:22:22.8164734Z Test results will be stored in test-reports/python-pytest/test_ops/test_ops-c9beaf04e8863d0f.xml
2025-12-04T15:22:22.8164792Z ============================= test session starts ==============================
2025-12-04T15:22:22.8164902Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python
2025-12-04T15:22:22.8164953Z cachedir: .pytest_cache
2025-12-04T15:22:22.8165119Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow]
2025-12-04T15:22:22.8165177Z rootdir: /var/lib/jenkins/pytorch
2025-12-04T15:22:22.8165220Z configfile: pytest.ini
2025-12-04T15:22:22.8165380Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0
2025-12-04T15:22:22.8165463Z collecting ... collected 33666 items / 3537 deselected / 30129 selected
2025-12-04T15:22:22.8165519Z stepcurrent: skipping 3537 already run items.
2025-12-04T15:22:22.8165566Z Running 3154 items in this shard 2025-12-04T15:22:22.8165568Z 2025-12-04T15:22:22.8165681Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_bool PASSED [0.9513s] [ 0%] 2025-12-04T15:22:22.8165800Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_complex32 PASSED [0.7600s] [ 0%] 2025-12-04T15:22:22.8165917Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_complex32 PASSED [0.0496s] [ 0%] 2025-12-04T15:22:22.8166029Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int16 PASSED [0.0065s] [ 0%] 2025-12-04T15:22:22.8166139Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int64 PASSED [0.0066s] [ 0%] 2025-12-04T15:22:22.8166281Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float16 PASSED [0.0621s] [ 0%] 2025-12-04T15:22:22.8166406Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_bfloat16 PASSED [0.1403s] [ 0%] 2025-12-04T15:22:22.8166530Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_float32 PASSED [0.0451s] [ 0%] 2025-12-04T15:22:22.8166654Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_float16 PASSED [0.0229s] [ 0%] 2025-12-04T15:22:22.8166779Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_float32 PASSED [0.0089s] [ 0%] 2025-12-04T15:22:22.8166902Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_float64 PASSED [0.0652s] [ 0%] 2025-12-04T15:22:22.8167036Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float32 PASSED [0.0504s] [ 0%] 2025-12-04T15:22:22.8167180Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float64 PASSED [0.0240s] [ 0%] 2025-12-04T15:22:22.8167306Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_int16 
PASSED [0.0470s] [ 0%] 2025-12-04T15:22:22.8167437Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float16 PASSED [0.7427s] [ 0%] 2025-12-04T15:22:22.8167567Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float64 PASSED [0.7495s] [ 0%] 2025-12-04T15:22:22.8167698Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_bfloat16 PASSED [0.0122s] [ 0%] 2025-12-04T15:22:22.8167826Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_float16 PASSED [0.7259s] [ 0%] 2025-12-04T15:22:22.8167961Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_bfloat16 PASSED [0.7196s] [ 0%] 2025-12-04T15:22:22.8168092Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float32 PASSED [0.7119s] [ 0%] 2025-12-04T15:22:22.8168223Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float64 PASSED [0.0102s] [ 0%] 2025-12-04T15:22:22.8168373Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_bfloat16 PASSED [0.0351s] [ 0%] 2025-12-04T15:22:22.8168520Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_float16 PASSED [0.0075s] [ 0%] 2025-12-04T15:22:22.8168688Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_float64 PASSED [0.0058s] [ 0%] 2025-12-04T15:22:22.8168834Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_int32 PASSED [0.0072s] [ 0%] 2025-12-04T15:22:22.8168991Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_uint8 PASSED [0.0076s] [ 0%] 2025-12-04T15:22:22.8169137Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float32 PASSED [0.0343s] [ 0%] 
2025-12-04T15:22:22.8169282Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float64 PASSED [0.7453s] [ 0%] 2025-12-04T15:22:22.8169423Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int32 PASSED [0.7418s] [ 0%] 2025-12-04T15:22:22.8169564Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int8 PASSED [0.0294s] [ 0%] 2025-12-04T15:22:22.8169692Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_float16 PASSED [0.0824s] [ 0%] 2025-12-04T15:22:22.8169816Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_float32 PASSED [0.0394s] [ 1%] 2025-12-04T15:22:22.8169948Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_bfloat16 PASSED [0.0345s] [ 1%] 2025-12-04T15:22:22.8170077Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float32 PASSED [0.0056s] [ 1%] 2025-12-04T15:22:22.8170250Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_float64 PASSED [0.1238s] [ 1%] 2025-12-04T15:22:22.8170390Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_complex128 PASSED [0.0128s] [ 1%] 2025-12-04T15:22:22.8170530Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_complex64 PASSED [0.0118s] [ 1%] 2025-12-04T15:22:22.8170669Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_float64 PASSED [0.0111s] [ 1%] 2025-12-04T15:22:22.8170808Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_float32 PASSED [0.8343s] [ 1%] 2025-12-04T15:22:22.8170934Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_float64 PASSED [0.1427s] [ 1%] 2025-12-04T15:22:22.8171078Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float16 PASSED [0.0431s] [ 1%] 2025-12-04T15:22:22.8171202Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float32 PASSED [0.0432s] [ 1%] 2025-12-04T15:22:22.8171325Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int64 PASSED [0.0415s] [ 1%] 2025-12-04T15:22:22.8171448Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float16 PASSED [0.0467s] [ 1%] 2025-12-04T15:22:22.8171569Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_int32 PASSED [0.0423s] [ 1%] 2025-12-04T15:22:22.8171693Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_selu_cuda_bfloat16 PASSED [0.0453s] [ 1%] 2025-12-04T15:22:22.8171838Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float16 PASSED [0.0059s] [ 1%] 2025-12-04T15:22:22.8171980Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int32 PASSED [0.0047s] [ 1%] 2025-12-04T15:22:22.8172118Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int8 PASSED [0.0062s] [ 1%] 2025-12-04T15:22:22.8172261Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_bfloat16 PASSED [0.0062s] [ 1%] 2025-12-04T15:22:22.8172400Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int64 PASSED [0.0074s] [ 1%] 2025-12-04T15:22:22.8172560Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float32 PASSED [0.0718s] [ 1%] 2025-12-04T15:22:22.8172692Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float64 PASSED [0.0736s] [ 1%] 2025-12-04T15:22:22.8172839Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_float16 PASSED 
[0.0322s] [ 1%] 2025-12-04T15:22:22.8172967Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_uint8 PASSED [0.0348s] [ 1%] 2025-12-04T15:22:22.8173098Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int64 PASSED [0.0407s] [ 1%] 2025-12-04T15:22:22.8173225Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_uint8 PASSED [0.0388s] [ 1%] 2025-12-04T15:22:22.8173372Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_bfloat16 PASSED [0.7812s] [ 1%] 2025-12-04T15:22:22.8173520Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_complex64 PASSED [0.7239s] [ 1%] 2025-12-04T15:22:22.8173634Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_complex128 PASSED [0.7550s] [ 1%] 2025-12-04T15:22:22.8173744Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_float16 PASSED [0.7413s] [ 1%] 2025-12-04T15:22:22.8173857Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_bfloat16 PASSED [0.7327s] [ 1%] 2025-12-04T15:22:22.8173971Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_int32 PASSED [0.7667s] [ 1%] 2025-12-04T15:22:22.8174086Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_uint8 PASSED [0.7677s] [ 2%] 2025-12-04T15:22:22.8174198Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_complex32 PASSED [0.7679s] [ 2%] 2025-12-04T15:22:22.8174309Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_float32 PASSED [0.7770s] [ 2%] 2025-12-04T15:22:22.8174422Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float64 PASSED [0.7369s] [ 2%] 2025-12-04T15:22:22.8174532Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int8 PASSED [0.7418s] [ 2%] 2025-12-04T15:22:22.8174639Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float16 PASSED [0.1539s] [ 2%] 2025-12-04T15:22:22.8174757Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float64 PASSED [0.1146s] [ 2%] 2025-12-04T15:22:22.8174864Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int32 PASSED [0.8288s] [ 2%] 2025-12-04T15:22:22.8174973Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex128 PASSED [0.7432s] [ 2%] 2025-12-04T15:22:22.8175081Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_float64 PASSED [0.0258s] [ 2%] 2025-12-04T15:22:22.8175190Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_float16 PASSED [0.0308s] [ 2%] 2025-12-04T15:22:22.8175301Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_float32 PASSED [0.0238s] [ 2%] 2025-12-04T15:22:22.8175410Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int64 PASSED [0.0288s] [ 2%] 2025-12-04T15:22:22.8175517Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int8 PASSED [0.7548s] [ 2%] 2025-12-04T15:22:22.8175627Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float32 PASSED [0.7313s] [ 2%] 2025-12-04T15:22:22.8175737Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_bfloat16 PASSED [0.7332s] [ 2%] 2025-12-04T15:22:22.8175842Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_bool PASSED [0.7414s] [ 2%] 2025-12-04T15:22:22.8175955Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_complex128 PASSED [0.7260s] [ 2%] 2025-12-04T15:22:22.8176076Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bfloat16 PASSED [0.7468s] [ 2%] 2025-12-04T15:22:22.8176193Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bool PASSED [0.7658s] [ 2%] 2025-12-04T15:22:22.8176318Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_float16 PASSED [0.7471s] [ 2%] 
2025-12-04T15:22:22.8176422Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_int64 PASSED [0.7386s] [ 2%] 2025-12-04T15:22:22.8176544Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_complex128 PASSED [0.8264s] [ 2%] 2025-12-04T15:22:22.8176661Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_complex64 PASSED [0.0482s] [ 2%] 2025-12-04T15:22:22.8176773Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int32 PASSED [0.0371s] [ 2%] 2025-12-04T15:22:22.8176885Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_float64 PASSED [0.1312s] [ 2%] 2025-12-04T15:22:22.8176996Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float64 PASSED [0.7376s] [ 2%] 2025-12-04T15:22:22.8177105Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_bool PASSED [0.7604s] [ 2%] 2025-12-04T15:22:22.8177217Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_uint8 PASSED [0.7853s] [ 2%] 2025-12-04T15:22:22.8177327Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_bfloat16 PASSED [0.0450s] [ 2%] 2025-12-04T15:22:22.8177435Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_bool PASSED [0.0274s] [ 2%] 2025-12-04T15:22:22.8177546Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_complex32 PASSED [0.0436s] [ 3%] 2025-12-04T15:22:22.8177656Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float16 PASSED [0.0436s] [ 3%] 2025-12-04T15:22:22.8177764Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int16 PASSED [0.0454s] [ 3%] 2025-12-04T15:22:22.8177873Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_uint8 PASSED [0.7648s] [ 3%] 2025-12-04T15:22:22.8177982Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_bfloat16 PASSED [0.7617s] [ 3%] 2025-12-04T15:22:22.8178093Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex128 PASSED [0.0147s] [ 3%] 2025-12-04T15:22:22.8178202Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex64 PASSED [0.0124s] [ 3%] 2025-12-04T15:22:22.8178318Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_float64 PASSED [0.0120s] [ 3%] 2025-12-04T15:22:22.8178428Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_bfloat16 PASSED [0.0150s] [ 3%] 2025-12-04T15:22:22.8178534Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_float16 PASSED [0.0147s] [ 3%] 2025-12-04T15:22:22.8178644Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_bfloat16 PASSED [0.0302s] [ 3%] 2025-12-04T15:22:22.8178752Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_float16 PASSED [0.7629s] [ 3%] 2025-12-04T15:22:22.8178859Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_float64 PASSED [0.7635s] [ 3%] 2025-12-04T15:22:22.8178965Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int16 PASSED [0.7682s] [ 3%] 2025-12-04T15:22:22.8179073Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_bfloat16 PASSED [0.7578s] [ 3%] 2025-12-04T15:22:22.8179183Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex32 PASSED [0.8024s] [ 3%] 2025-12-04T15:22:22.8179289Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int64 PASSED [0.0257s] [ 3%] 2025-12-04T15:22:22.8179393Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_uint8 PASSED [0.7590s] [ 3%] 2025-12-04T15:22:22.8179499Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float64 PASSED [0.8282s] [ 3%] 2025-12-04T15:22:22.8179626Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int16 PASSED [0.0910s] [ 3%] 2025-12-04T15:22:22.8179732Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int8 PASSED [0.0871s] [ 3%] 
2025-12-04T15:22:22.8179849Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_uint8 PASSED [0.0866s] [ 3%] 2025-12-04T15:22:22.8179968Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_int32 PASSED [0.0100s] [ 3%] 2025-12-04T15:22:22.8180086Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_int8 PASSED [0.0071s] [ 3%] 2025-12-04T15:22:22.8180232Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_bool PASSED [0.0247s] [ 3%] 2025-12-04T15:22:22.8180337Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_float16 PASSED [0.0238s] [ 3%] 2025-12-04T15:22:22.8180440Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int32 PASSED [0.0216s] [ 3%] 2025-12-04T15:22:22.8180553Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_float64 PASSED [0.0527s] [ 3%] 2025-12-04T15:22:22.8180660Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int16 PASSED [0.0606s] [ 3%] 2025-12-04T15:22:22.8180768Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int8 PASSED [0.7838s] [ 3%] 2025-12-04T15:22:22.8180872Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_uint8 PASSED [0.7642s] [ 3%] 2025-12-04T15:22:22.8180975Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_bool PASSED [0.7792s] [ 3%] 2025-12-04T15:22:22.8181080Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_float64 PASSED [0.0258s] [ 4%] 2025-12-04T15:22:22.8181189Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_complex64 PASSED [0.0686s] [ 4%] 2025-12-04T15:22:22.8181292Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int16 PASSED [0.0493s] [ 4%] 2025-12-04T15:22:22.8181403Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_complex64 PASSED [0.2421s] [ 4%] 2025-12-04T15:22:22.8181507Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int16 
PASSED [0.0285s] [ 4%] 2025-12-04T15:22:22.8181635Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_bfloat16 PASSED [0.0217s] [ 4%] 2025-12-04T15:22:22.8181759Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_uint8 PASSED [0.0113s] [ 4%] 2025-12-04T15:22:22.8181892Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_uint8 PASSED [0.0255s] [ 4%] 2025-12-04T15:22:22.8182013Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_int16 PASSED [0.0266s] [ 4%] 2025-12-04T15:22:22.8182131Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_float32 PASSED [0.0715s] [ 4%] 2025-12-04T15:22:22.8182246Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int16 PASSED [0.0476s] [ 4%] 2025-12-04T15:22:22.8182365Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_float32 PASSED [0.1870s] [ 4%] 2025-12-04T15:22:22.8182481Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_float16 PASSED [0.0306s] [ 4%] 2025-12-04T15:22:22.8182593Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_uint8 PASSED [0.7617s] [ 4%] 2025-12-04T15:22:22.8182705Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_uint8 PASSED [0.7641s] [ 4%] 2025-12-04T15:22:22.8182819Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_float16 PASSED [0.0333s] [ 4%] 2025-12-04T15:22:22.8182934Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_float64 PASSED [0.2967s] [ 4%] 2025-12-04T15:22:22.8183045Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int64 PASSED [0.7642s] [ 4%] 2025-12-04T15:22:22.8183177Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_uint8 PASSED [0.8046s] [ 4%] 2025-12-04T15:22:22.8183325Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_bool PASSED [0.0082s] [ 4%] 2025-12-04T15:22:22.8183480Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float16 PASSED [0.0050s] [ 4%] 2025-12-04T15:22:22.8183618Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int16 PASSED [0.0063s] [ 4%] 2025-12-04T15:22:22.8183754Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_uint8 PASSED [0.7363s] [ 4%] 2025-12-04T15:22:22.8183873Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_bfloat16 PASSED [0.8160s] [ 4%] 2025-12-04T15:22:22.8183991Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_float16 PASSED [0.7816s] [ 4%] 2025-12-04T15:22:22.8184108Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int64 PASSED [0.7855s] [ 4%] 2025-12-04T15:22:22.8184223Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int8 PASSED [0.7764s] [ 4%] 2025-12-04T15:22:22.8184371Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_float64 PASSED [0.0807s] [ 4%] 2025-12-04T15:22:22.8184519Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16 PASSED [0.0840s] [ 4%] 2025-12-04T15:22:22.8184637Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float64 PASSED [0.7841s] [ 4%] 2025-12-04T15:22:22.8184754Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_float32 PASSED [0.8879s] [ 4%] 2025-12-04T15:22:22.8184887Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int32 PASSED [0.7437s] [ 5%] 2025-12-04T15:22:22.8185019Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_bool PASSED [0.7789s] [ 5%] 
2025-12-04T15:22:22.8185157Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_float32 PASSED [0.3222s] [ 5%] 2025-12-04T15:22:22.8185293Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int16 PASSED [0.0221s] [ 5%] 2025-12-04T15:22:22.8185413Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_float16 PASSED [0.1797s] [ 5%] 2025-12-04T15:22:22.8185540Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int32 PASSED [0.1230s] [ 5%] 2025-12-04T15:22:22.8185653Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_uint8 PASSED [0.1126s] [ 5%] 2025-12-04T15:22:22.8185777Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex32 PASSED [0.0057s] [ 5%] 2025-12-04T15:22:22.8185895Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_int32 PASSED [0.0054s] [ 5%] 2025-12-04T15:22:22.8186014Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_int8 PASSED [0.0053s] [ 5%] 2025-12-04T15:22:22.8186120Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int64 PASSED [0.0236s] [ 5%] 2025-12-04T15:22:22.8186231Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float16 PASSED [0.0349s] [ 5%] 2025-12-04T15:22:22.8186343Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_bool PASSED [0.0085s] [ 5%] 2025-12-04T15:22:22.8186461Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_float32 PASSED [0.7564s] [ 5%] 2025-12-04T15:22:22.8186573Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_int16 PASSED [0.7523s] [ 5%] 2025-12-04T15:22:22.8186686Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_uint8 PASSED [0.7524s] [ 5%] 2025-12-04T15:22:22.8186796Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_float64 
PASSED [0.7583s] [ 5%] 2025-12-04T15:22:22.8186928Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int16 PASSED [0.7630s] [ 5%] 2025-12-04T15:22:22.8187037Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int8 PASSED [0.7643s] [ 5%] 2025-12-04T15:22:22.8187165Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_bfloat16 PASSED [0.0218s] [ 5%] 2025-12-04T15:22:22.8187272Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_int32 PASSED [0.0107s] [ 5%] 2025-12-04T15:22:22.8187387Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_complex128 PASSED [0.7807s] [ 5%] 2025-12-04T15:22:22.8187500Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_complex64 PASSED [0.7554s] [ 5%] 2025-12-04T15:22:22.8187610Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_float64 PASSED [0.7589s] [ 5%] 2025-12-04T15:22:22.8187719Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_complex64 PASSED [1.8251s] [ 5%] 2025-12-04T15:22:22.8187826Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_float32 PASSED [0.1137s] [ 5%] 2025-12-04T15:22:22.8187930Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_int16 PASSED [0.1105s] [ 5%] 2025-12-04T15:22:22.8188035Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_float16 PASSED [0.0206s] [ 5%] 2025-12-04T15:22:22.8188142Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int16 PASSED [0.0174s] [ 5%] 2025-12-04T15:22:22.8188245Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int64 PASSED [0.0102s] [ 5%] 2025-12-04T15:22:22.8188356Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int8 PASSED [0.0088s] [ 5%] 2025-12-04T15:22:22.8188464Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_float16 PASSED [0.7473s] [ 5%] 2025-12-04T15:22:22.8188570Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_int32 PASSED [0.7583s] [ 6%] 2025-12-04T15:22:22.8188674Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_bool PASSED [0.7547s] [ 6%] 2025-12-04T15:22:22.8188779Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_complex64 PASSED [0.7592s] [ 6%] 2025-12-04T15:22:22.8188881Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int8 PASSED [0.7579s] [ 6%] 2025-12-04T15:22:22.8189011Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float16 PASSED [0.8256s] [ 6%] 2025-12-04T15:22:22.8189132Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float64 PASSED [0.0102s] [ 6%] 2025-12-04T15:22:22.8189248Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_int64 PASSED [0.0081s] [ 6%] 2025-12-04T15:22:22.8189358Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_complex128 PASSED [0.0582s] [ 6%] 2025-12-04T15:22:22.8189463Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float32 PASSED [0.7833s] [ 6%] 2025-12-04T15:22:22.8189569Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float64 PASSED [0.7800s] [ 6%] 2025-12-04T15:22:22.8189677Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex32 PASSED [0.8088s] [ 6%] 2025-12-04T15:22:22.8189785Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float64 PASSED [0.0281s] [ 6%] 2025-12-04T15:22:22.8189890Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int16 PASSED [0.0246s] [ 6%] 2025-12-04T15:22:22.8189996Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_uint8 PASSED [0.0203s] [ 6%] 2025-12-04T15:22:22.8190144Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int32 PASSED [0.0078s] [ 6%] 2025-12-04T15:22:22.8190255Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_complex128 PASSED 
[0.0152s] [ 6%] 2025-12-04T15:22:22.8190360Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_float16 PASSED [0.0144s] [ 6%] 2025-12-04T15:22:22.8190499Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_bool PASSED [0.7607s] [ 6%] 2025-12-04T15:22:22.8190609Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_float16 PASSED [0.7540s] [ 6%] 2025-12-04T15:22:22.8190727Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int32 PASSED [0.7500s] [ 6%] 2025-12-04T15:22:22.8190832Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int8 PASSED [0.7656s] [ 6%] 2025-12-04T15:22:22.8190949Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_bool PASSED [0.7672s] [ 6%] 2025-12-04T15:22:22.8191075Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_complex128 PASSED [0.7571s] [ 6%] 2025-12-04T15:22:22.8191198Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_complex32 PASSED [0.7629s] [ 6%] 2025-12-04T15:22:22.8191318Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_float32 PASSED [0.7559s] [ 6%] 2025-12-04T15:22:22.8191428Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_bool PASSED [0.7546s] [ 6%] 2025-12-04T15:22:22.8191542Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float64 PASSED [0.7488s] [ 6%] 2025-12-04T15:22:22.8191652Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int64 PASSED [0.7545s] [ 6%] 2025-12-04T15:22:22.8191762Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_float64 PASSED [0.7543s] [ 6%] 2025-12-04T15:22:22.8191865Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int16 PASSED [0.7557s] [ 6%] 2025-12-04T15:22:22.8191975Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_complex128 PASSED [0.7607s] [ 6%] 2025-12-04T15:22:22.8192079Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int8 PASSED [0.7557s] [ 7%] 2025-12-04T15:22:22.8192194Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_indices_cuda_int32 PASSED [0.7905s] [ 7%] 2025-12-04T15:22:22.8192308Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_indices_cuda_int64 PASSED [0.0190s] [ 7%] 2025-12-04T15:22:22.8192422Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_int64 PASSED [0.1305s] [ 7%] 2025-12-04T15:22:22.8192531Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_bfloat16 PASSED [0.0280s] [ 7%] 2025-12-04T15:22:22.8192653Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_float16 PASSED [0.7732s] [ 7%] 2025-12-04T15:22:22.8192759Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int8 PASSED [0.7773s] [ 7%] 2025-12-04T15:22:22.8192873Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_float16 PASSED [0.7554s] [ 7%] 2025-12-04T15:22:22.8192988Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_float64 PASSED [0.7572s] [ 7%] 2025-12-04T15:22:22.8193101Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_int64 PASSED [0.7679s] [ 7%] 2025-12-04T15:22:22.8193210Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_float32 PASSED [0.7733s] [ 7%] 2025-12-04T15:22:22.8193318Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_float64 PASSED [0.7748s] [ 7%] 2025-12-04T15:22:22.8193434Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex64 PASSED [0.0112s] [ 7%] 2025-12-04T15:22:22.8193545Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int16 PASSED [0.0093s] [ 7%] 2025-12-04T15:22:22.8193656Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_uint8 PASSED [0.0092s] [ 7%] 2025-12-04T15:22:22.8193774Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_complex128 PASSED [0.0157s] [ 7%] 2025-12-04T15:22:22.8193890Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_float16 PASSED [0.7654s] [ 7%] 2025-12-04T15:22:22.8194023Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int32 PASSED [0.7700s] [ 7%] 2025-12-04T15:22:22.8194134Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_bfloat16 PASSED [0.7627s] [ 7%] 2025-12-04T15:22:22.8194255Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex32 PASSED [0.7676s] [ 7%] 2025-12-04T15:22:22.8194366Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex64 PASSED [0.7594s] [ 7%] 2025-12-04T15:22:22.8194474Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int32 PASSED [0.7674s] [ 7%] 2025-12-04T15:22:22.8194580Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_uint8 PASSED [0.7647s] [ 7%] 2025-12-04T15:22:22.8194698Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_int16 PASSED [0.7555s] [ 7%] 2025-12-04T15:22:22.8194810Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_float32 PASSED [0.7675s] [ 7%] 2025-12-04T15:22:22.8194922Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int16 PASSED [0.7654s] [ 7%] 2025-12-04T15:22:22.8195032Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int32 PASSED [0.7645s] [ 7%] 2025-12-04T15:22:22.8195145Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_complex64 PASSED [0.7645s] [ 7%] 2025-12-04T15:22:22.8195260Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_bfloat16 PASSED [0.7770s] [ 7%] 2025-12-04T15:22:22.8195367Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float64 PASSED [0.7555s] [ 7%] 2025-12-04T15:22:22.8195480Z 
test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex32 PASSED [0.7753s] [ 7%] 2025-12-04T15:22:22.8195591Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex64 PASSED [0.7948s] [ 7%] 2025-12-04T15:22:22.8195702Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_int32 PASSED [0.7650s] [ 8%] 2025-12-04T15:22:22.8195812Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_int8 PASSED [0.7674s] [ 8%] 2025-12-04T15:22:22.8195923Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_complex32 PASSED [0.7808s] [ 8%] 2025-12-04T15:22:22.8196028Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_int8 PASSED [0.8063s] [ 8%] 2025-12-04T15:22:22.8196151Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_complex128 PASSED [0.7792s] [ 8%] 2025-12-04T15:22:22.8196261Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_float16 PASSED [0.7728s] [ 8%] 2025-12-04T15:22:22.8196369Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int32 PASSED [0.7800s] [ 8%] 2025-12-04T15:22:22.8196475Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_uint8 PASSED [0.7781s] [ 8%] 2025-12-04T15:22:22.8196581Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_bool PASSED [0.7808s] [ 8%] 2025-12-04T15:22:22.8196688Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int32 PASSED [0.7728s] [ 8%] 2025-12-04T15:22:22.8196794Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_uint8 PASSED [0.7876s] [ 8%] 2025-12-04T15:22:22.8196905Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_bfloat16 PASSED [0.7755s] [ 8%] 2025-12-04T15:22:22.8197015Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_float64 PASSED [0.7680s] [ 8%] 2025-12-04T15:22:22.8197128Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int8 PASSED 
[0.0040s] [ 8%] 2025-12-04T15:22:22.8197278Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_complex128 PASSED [0.0274s] [ 8%] 2025-12-04T15:22:22.8197427Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_float16 PASSED [0.7622s] [ 8%] 2025-12-04T15:22:22.8197593Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_float64 PASSED [0.0166s] [ 8%] 2025-12-04T15:22:22.8197737Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int16 PASSED [0.7694s] [ 8%] 2025-12-04T15:22:22.8197888Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_uint8 PASSED [0.0143s] [ 8%] 2025-12-04T15:22:22.8198030Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_complex32 PASSED [0.7812s] [ 8%] 2025-12-04T15:22:22.8198170Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_float64 PASSED [0.0135s] [ 8%] 2025-12-04T15:22:22.8198306Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int8 PASSED [0.7841s] [ 8%] 2025-12-04T15:22:22.8198453Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_complex32 PASSED [0.0297s] [ 8%] 2025-12-04T15:22:22.8198596Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int64 PASSED [0.7775s] [ 8%] 2025-12-04T15:22:22.8198733Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int8 PASSED [0.0151s] [ 8%] 2025-12-04T15:22:22.8198873Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_float32 PASSED [0.7739s] [ 8%] 2025-12-04T15:22:22.8199011Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_int64 PASSED [0.0159s] [ 8%] 
2025-12-04T15:22:22.8199147Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_uint8 PASSED [0.7810s] [ 8%] 2025-12-04T15:22:22.8199283Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int8 PASSED [0.0111s] [ 8%] 2025-12-04T15:22:22.8199425Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float16 PASSED [0.0402s] [ 8%] 2025-12-04T15:22:22.8199570Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float32 PASSED [0.0401s] [ 8%] 2025-12-04T15:22:22.8199716Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_complex128 PASSED [0.0272s] [ 9%] 2025-12-04T15:22:22.8199857Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float64 PASSED [0.7899s] [ 9%] 2025-12-04T15:22:22.8200005Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_int16 PASSED [0.0151s] [ 9%] 2025-12-04T15:22:22.8200184Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_int8 PASSED [0.7762s] [ 9%] 2025-12-04T15:22:22.8200324Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_bfloat16 PASSED [0.0167s] [ 9%] 2025-12-04T15:22:22.8200470Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_complex32 PASSED [0.8017s] [ 9%] 2025-12-04T15:22:22.8200609Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int16 PASSED [0.0148s] [ 9%] 2025-12-04T15:22:22.8200747Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int64 PASSED [0.7774s] [ 9%] 2025-12-04T15:22:22.8200883Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_float16 PASSED [0.0167s] [ 9%] 2025-12-04T15:22:22.8201019Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_int64 PASSED [0.7928s] [ 9%] 2025-12-04T15:22:22.8201158Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_polar_cuda_float64 PASSED [0.0480s] [ 9%] 2025-12-04T15:22:22.8201296Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_float32 PASSED [0.0117s] [ 9%] 2025-12-04T15:22:22.8201451Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_float64 PASSED [0.7763s] [ 9%] 2025-12-04T15:22:22.8201598Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int8 PASSED [0.0114s] [ 9%] 2025-12-04T15:22:22.8201739Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_complex128 PASSED [0.0298s] [ 9%] 2025-12-04T15:22:22.8201860Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_float32 PASSED [0.7804s] [ 9%] 2025-12-04T15:22:22.8201982Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_float64 PASSED [0.0168s] [ 9%] 2025-12-04T15:22:22.8202099Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int16 PASSED [0.0110s] [ 9%] 2025-12-04T15:22:22.8202217Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int8 PASSED [0.0194s] [ 9%] 2025-12-04T15:22:22.8202336Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_uint8 PASSED [0.7809s] [ 9%] 2025-12-04T15:22:22.8202463Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_complex32 PASSED [0.0384s] [ 9%] 2025-12-04T15:22:22.8202581Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int64 PASSED [0.0386s] [ 9%] 2025-12-04T15:22:22.8202697Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int8 PASSED [0.0373s] [ 9%] 2025-12-04T15:22:22.8202827Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_complex64 PASSED [0.0272s] [ 9%] 2025-12-04T15:22:22.8202942Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_bool PASSED [0.7981s] [ 9%] 2025-12-04T15:22:22.8203062Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int32 PASSED [0.0046s] [ 9%] 2025-12-04T15:22:22.8203178Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int64 PASSED [0.7623s] [ 9%] 2025-12-04T15:22:22.8203295Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int16 PASSED [0.0077s] [ 9%] 2025-12-04T15:22:22.8203426Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_complex128 PASSED [0.7722s] [ 9%] 2025-12-04T15:22:22.8203548Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float64 PASSED [0.0109s] [ 9%] 2025-12-04T15:22:22.8203664Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int16 PASSED [0.0075s] [ 9%] 2025-12-04T15:22:22.8203793Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_bool PASSED [0.0141s] [ 9%] 2025-12-04T15:22:22.8203913Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float32 PASSED [0.7762s] [ 10%] 2025-12-04T15:22:22.8204034Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float64 PASSED [0.0110s] [ 10%] 2025-12-04T15:22:22.8204149Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_bool PASSED [0.0065s] [ 10%] 2025-12-04T15:22:22.8204266Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_int32 PASSED [0.7728s] [ 10%] 2025-12-04T15:22:22.8204390Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float64 PASSED [0.0117s] [ 10%] 2025-12-04T15:22:22.8204511Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int16 PASSED 
[0.0079s] [ 10%] 2025-12-04T15:22:22.8204632Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_uint8 PASSED [0.0075s] [ 10%] 2025-12-04T15:22:22.8204770Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_complex128 PASSED [0.0040s] [ 10%] 2025-12-04T15:22:22.8204908Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_complex64 PASSED [0.7699s] [ 10%] 2025-12-04T15:22:22.8205039Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_int16 PASSED [0.0046s] [ 10%] 2025-12-04T15:22:22.8205214Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_bfloat16 PASSED [0.7640s] [ 10%] 2025-12-04T15:22:22.8205350Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_bool PASSED [0.0052s] [ 10%] 2025-12-04T15:22:22.8205499Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_float16 PASSED [0.7673s] [ 10%] 2025-12-04T15:22:22.8205633Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int32 PASSED [0.0052s] [ 10%] 2025-12-04T15:22:22.8205769Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int64 PASSED [0.7607s] [ 10%] 2025-12-04T15:22:22.8205887Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int16 PASSED [0.0205s] [ 10%] 2025-12-04T15:22:22.8206004Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int32 PASSED [0.0132s] [ 10%] 2025-12-04T15:22:22.8206133Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_complex32 PASSED [0.0320s] [ 10%] 2025-12-04T15:22:22.8206255Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_float16 PASSED [0.7831s] [ 10%] 2025-12-04T15:22:22.8206376Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int64 PASSED [0.0153s] [ 10%] 2025-12-04T15:22:22.8206495Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_uint8 PASSED [0.0124s] [ 10%] 2025-12-04T15:22:22.8206619Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_float32 PASSED [0.0566s] [ 10%] 2025-12-04T15:22:22.8206736Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_bool PASSED [0.0193s] [ 10%] 2025-12-04T15:22:22.8206856Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_float32 PASSED [0.7820s] [ 10%] 2025-12-04T15:22:22.8206975Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_uint8 PASSED [0.0142s] [ 10%] 2025-12-04T15:22:22.8207104Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex128 PASSED [0.0307s] [ 10%] 2025-12-04T15:22:22.8207229Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex32 PASSED [0.0308s] [ 10%] 2025-12-04T15:22:22.8207349Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_uint8 PASSED [0.7875s] [ 10%] 2025-12-04T15:22:22.8207492Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_complex128 PASSED [0.0063s] [ 10%] 2025-12-04T15:22:22.8207621Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_float32 PASSED [0.7565s] [ 10%] 2025-12-04T15:22:22.8207749Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int32 PASSED [0.0051s] [ 10%] 2025-12-04T15:22:22.8207873Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int64 PASSED [0.7655s] [ 11%] 2025-12-04T15:22:22.8208000Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_uint8 PASSED [0.0050s] [ 11%] 2025-12-04T15:22:22.8208127Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_float32 PASSED [0.0052s] [ 11%] 2025-12-04T15:22:22.8208256Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_float64 PASSED [0.7727s] [ 11%] 2025-12-04T15:22:22.8208380Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_uint8 PASSED [0.0053s] [ 11%] 2025-12-04T15:22:22.8208505Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_int64 PASSED [0.0044s] [ 11%] 2025-12-04T15:22:22.8208630Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_bool PASSED [0.0355s] [ 11%] 2025-12-04T15:22:22.8208759Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int32 PASSED [0.0363s] [ 11%] 2025-12-04T15:22:22.8208904Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int8 PASSED [0.0355s] [ 11%] 2025-12-04T15:22:22.8209041Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int64 PASSED [0.8204s] [ 11%] 2025-12-04T15:22:22.8209176Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_bool PASSED [0.0367s] [ 11%] 2025-12-04T15:22:22.8209309Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_complex32 PASSED [0.0061s] [ 11%] 2025-12-04T15:22:22.8209438Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float16 PASSED [0.7740s] [ 11%] 2025-12-04T15:22:22.8209565Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float32 PASSED [0.0052s] [ 11%] 2025-12-04T15:22:22.8209690Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_int16 PASSED [0.7714s] [ 11%] 2025-12-04T15:22:22.8209830Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_bfloat16 PASSED [0.0074s] [ 11%] 
2025-12-04T15:22:22.8209969Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_float16 PASSED [0.7730s] [ 11%] 2025-12-04T15:22:22.8210144Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_float64 PASSED [0.7549s] [ 11%] 2025-12-04T15:22:22.8210275Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int16 PASSED [0.7664s] [ 11%] 2025-12-04T15:22:22.8210398Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex128 PASSED [0.0090s] [ 11%] 2025-12-04T15:22:22.8210522Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex64 PASSED [0.0072s] [ 11%] 2025-12-04T15:22:22.8210641Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_float16 PASSED [0.0067s] [ 11%] 2025-12-04T15:22:22.8210762Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_float64 PASSED [0.0066s] [ 11%] 2025-12-04T15:22:22.8210879Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int8 PASSED [0.0058s] [ 11%] 2025-12-04T15:22:22.8211057Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_float32 SKIPPED [0.0002s] (Expected: cauchy is not comparable) [ 11%] 2025-12-04T15:22:22.8211198Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_complex32 PASSED [0.7861s] [ 11%] 2025-12-04T15:22:22.8211319Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int16 PASSED [0.0101s] [ 11%] 2025-12-04T15:22:22.8211439Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int64 PASSED [0.7749s] [ 11%] 2025-12-04T15:22:22.8211556Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int8 PASSED [0.0103s] [ 11%] 2025-12-04T15:22:22.8211679Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float64 PASSED [0.7750s] [ 11%] 
2025-12-04T15:22:22.8211798Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int8 PASSED [0.0116s] [ 11%] 2025-12-04T15:22:22.8211925Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_bool PASSED [0.0243s] [ 11%] 2025-12-04T15:22:22.8212050Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_float32 PASSED [0.0350s] [ 12%] 2025-12-04T15:22:22.8212176Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int32 PASSED [0.8085s] [ 12%] 2025-12-04T15:22:22.8212299Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int8 PASSED [0.0264s] [ 12%] 2025-12-04T15:22:22.8212426Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_float32 PASSED [0.0357s] [ 12%] 2025-12-04T15:22:22.8212550Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_uint8 PASSED [0.0245s] [ 12%] 2025-12-04T15:22:22.8212705Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_bool PASSED [0.7776s] [ 12%] 2025-12-04T15:22:22.8212843Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_complex128 PASSED [0.0054s] [ 12%] 2025-12-04T15:22:22.8212985Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int64 PASSED [0.7647s] [ 12%] 2025-12-04T15:22:22.8213115Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_uint8 PASSED [0.0045s] [ 12%] 2025-12-04T15:22:22.8213238Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex32 PASSED [0.0348s] [ 12%] 2025-12-04T15:22:22.8213361Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_float32 PASSED [0.7915s] [ 12%] 2025-12-04T15:22:22.8213480Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int64 PASSED [0.0109s] [ 12%] 2025-12-04T15:22:22.8213610Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_bool PASSED [0.7815s] [ 12%] 2025-12-04T15:22:22.8213745Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex32 PASSED [0.0290s] [ 12%] 2025-12-04T15:22:22.8213879Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_float32 PASSED [0.7926s] [ 12%] 2025-12-04T15:22:22.8214008Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int16 PASSED [0.0103s] [ 12%] 2025-12-04T15:22:22.8214144Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float32 PASSED [0.0208s] [ 12%] 2025-12-04T15:22:22.8214278Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float64 PASSED [0.0197s] [ 12%] 2025-12-04T15:22:22.8214409Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_bfloat16 PASSED [0.0204s] [ 12%] 2025-12-04T15:22:22.8214540Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float16 PASSED [0.7994s] [ 12%] 2025-12-04T15:22:22.8214667Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float32 PASSED [0.0223s] [ 12%] 2025-12-04T15:22:22.8214795Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int64 PASSED [0.7906s] [ 12%] 2025-12-04T15:22:22.8214929Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_bool PASSED [0.0760s] [ 12%] 2025-12-04T15:22:22.8215049Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int32 PASSED [0.7837s] [ 12%] 2025-12-04T15:22:22.8215173Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex64 PASSED [0.4028s] [ 12%] 2025-12-04T15:22:22.8215293Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int64 PASSED [0.0186s] [ 12%] 2025-12-04T15:22:22.8215420Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_bool PASSED [0.7642s] [ 12%] 2025-12-04T15:22:22.8215557Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_complex64 PASSED [0.0077s] [ 12%] 2025-12-04T15:22:22.8215686Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_int16 PASSED [0.7655s] [ 12%] 2025-12-04T15:22:22.8215811Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float16 PASSED [0.7946s] [ 12%] 2025-12-04T15:22:22.8215934Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_bfloat16 PASSED [0.7791s] [ 12%] 2025-12-04T15:22:22.8216061Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_complex128 PASSED [0.7654s] [ 12%] 2025-12-04T15:22:22.8216183Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_int32 PASSED [0.0051s] [ 13%] 2025-12-04T15:22:22.8216304Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_uint8 PASSED [0.7586s] [ 13%] 2025-12-04T15:22:22.8216448Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float16 PASSED [0.0157s] [ 13%] 2025-12-04T15:22:22.8216571Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float32 PASSED [0.0133s] [ 13%] 2025-12-04T15:22:22.8216704Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float16 PASSED [0.7744s] [ 13%] 2025-12-04T15:22:22.8216822Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int8 PASSED [0.0066s] [ 13%] 2025-12-04T15:22:22.8216956Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_complex128 PASSED [0.7653s] [ 13%] 2025-12-04T15:22:22.8217083Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_float16 PASSED [0.0126s] [ 13%] 2025-12-04T15:22:22.8217210Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_int32 PASSED [0.7760s] [ 13%] 2025-12-04T15:22:22.8217349Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_complex128 PASSED [0.0114s] [ 13%] 2025-12-04T15:22:22.8217483Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float64 PASSED [0.0093s] [ 13%] 2025-12-04T15:22:22.8217614Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_uint8 PASSED [0.7707s] [ 13%] 2025-12-04T15:22:22.8217739Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_int8 PASSED [0.0083s] [ 13%] 2025-12-04T15:22:22.8217871Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_bool PASSED [0.0066s] [ 13%] 2025-12-04T15:22:22.8218008Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_float32 PASSED [0.0079s] [ 13%] 2025-12-04T15:22:22.8218134Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_bfloat16 PASSED [0.4116s] [ 13%] 2025-12-04T15:22:22.8218258Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int32 PASSED [0.0171s] [ 13%] 2025-12-04T15:22:22.8218380Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int8 PASSED [0.7751s] [ 13%] 2025-12-04T15:22:22.8218518Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float16 PASSED [0.2262s] [ 13%] 2025-12-04T15:22:22.8218672Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int32 PASSED [0.0664s] [ 13%] 2025-12-04T15:22:22.8218816Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex128 PASSED [0.0703s] [ 13%] 2025-12-04T15:22:22.8218960Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex64 PASSED 
[0.0698s] [ 13%] 2025-12-04T15:22:22.8219096Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_float32 PASSED [0.0591s] [ 13%] 2025-12-04T15:22:22.8219238Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_float64 PASSED [0.0594s] [ 13%] 2025-12-04T15:22:22.8219360Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_complex64 PASSED [0.7974s] [ 13%] 2025-12-04T15:22:22.8219488Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex32 PASSED [0.0049s] [ 13%] 2025-12-04T15:22:22.8219612Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_float64 PASSED [0.7924s] [ 13%] 2025-12-04T15:22:22.8219731Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int8 PASSED [0.0040s] [ 13%] 2025-12-04T15:22:22.8219852Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_bool PASSED [0.0039s] [ 13%] 2025-12-04T15:22:22.8219978Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_complex128 PASSED [0.0040s] [ 13%] 2025-12-04T15:22:22.8220143Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_float64 PASSED [0.7961s] [ 13%] 2025-12-04T15:22:22.8220276Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int64 PASSED [0.0048s] [ 14%] 2025-12-04T15:22:22.8220410Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int8 PASSED [0.0035s] [ 14%] 2025-12-04T15:22:22.8220530Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_uint8 PASSED [0.0033s] [ 14%] 2025-12-04T15:22:22.8220706Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_complex32 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 14%] 2025-12-04T15:22:22.8220876Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_float64 SKIPPED 
[0.0001s] (Expected: empty is not comparable) [ 14%] 2025-12-04T15:22:22.8221043Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int32 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 14%] 2025-12-04T15:22:22.8221215Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_bool SKIPPED [0.0001s] (Expected: empty is not comparable) [ 14%] 2025-12-04T15:22:22.8221397Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_complex128 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 14%] 2025-12-04T15:22:22.8221573Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_float64 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 14%] 2025-12-04T15:22:22.8221744Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int32 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 14%] 2025-12-04T15:22:22.8221915Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_uint8 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 14%] 2025-12-04T15:22:22.8222099Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_bool SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 14%] 2025-12-04T15:22:22.8222296Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_complex128 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 14%] 2025-12-04T15:22:22.8222488Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_complex64 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 14%] 2025-12-04T15:22:22.8222691Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_float16 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 14%] 2025-12-04T15:22:22.8222812Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_bfloat16 
PASSED [0.8290s] [ 14%] 2025-12-04T15:22:22.8222932Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float32 PASSED [0.0403s] [ 14%] 2025-12-04T15:22:22.8223049Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int64 PASSED [0.8279s] [ 14%] 2025-12-04T15:22:22.8223164Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int8 PASSED [0.0379s] [ 14%] 2025-12-04T15:22:22.8223287Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_float16 PASSED [0.0045s] [ 14%] 2025-12-04T15:22:22.8223406Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int32 PASSED [0.7960s] [ 14%] 2025-12-04T15:22:22.8223530Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_bfloat16 PASSED [0.0172s] [ 14%] 2025-12-04T15:22:22.8223648Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_float16 PASSED [0.0152s] [ 14%] 2025-12-04T15:22:22.8223765Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_bool PASSED [0.0195s] [ 14%] 2025-12-04T15:22:22.8223883Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_float64 PASSED [0.0176s] [ 14%] 2025-12-04T15:22:22.8224013Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int32 PASSED [0.8173s] [ 14%] 2025-12-04T15:22:22.8224139Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int64 PASSED [0.0167s] [ 14%] 2025-12-04T15:22:22.8224272Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_float32 PASSED [0.1898s] [ 14%] 2025-12-04T15:22:22.8224388Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_bool PASSED [0.0201s] [ 14%] 2025-12-04T15:22:22.8224507Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int16 PASSED [0.0143s] [ 14%] 2025-12-04T15:22:22.8224623Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int32 PASSED [0.8130s] [ 14%] 2025-12-04T15:22:22.8224740Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int8 PASSED [0.0155s] [ 14%] 2025-12-04T15:22:22.8224865Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_complex128 PASSED [0.0332s] [ 15%] 2025-12-04T15:22:22.8224986Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float16 PASSED [0.0163s] [ 15%] 2025-12-04T15:22:22.8225102Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_int16 PASSED [0.8081s] [ 15%] 2025-12-04T15:22:22.8225227Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_uint8 PASSED [0.0040s] [ 15%] 2025-12-04T15:22:22.8225359Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_float64 PASSED [0.7831s] [ 15%] 2025-12-04T15:22:22.8225482Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_bfloat16 PASSED [0.0075s] [ 15%] 2025-12-04T15:22:22.8225603Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_bool PASSED [0.0048s] [ 15%] 2025-12-04T15:22:22.8225729Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_complex64 PASSED [0.0059s] [ 15%] 2025-12-04T15:22:22.8225852Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_float32 PASSED [0.7996s] [ 15%] 2025-12-04T15:22:22.8225972Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int64 PASSED [0.0060s] [ 15%] 2025-12-04T15:22:22.8226093Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int32 PASSED [0.0140s] [ 15%] 2025-12-04T15:22:22.8226293Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_bfloat16 SKIPPED [0.0002s] (Expected: exponential is not comparable) [ 15%] 2025-12-04T15:22:22.8226417Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_complex64 PASSED [0.8215s] [ 15%] 2025-12-04T15:22:22.8226537Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float16 PASSED [0.0294s] [ 15%] 2025-12-04T15:22:22.8226660Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float8_e5m2 PASSED [0.8134s] [ 15%] 2025-12-04T15:22:22.8226790Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_complex128 PASSED [2.3975s] [ 15%] 2025-12-04T15:22:22.8226917Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_complex64 PASSED [3.1978s] [ 15%] 2025-12-04T15:22:22.8227041Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int64 PASSED [0.8105s] [ 15%] 2025-12-04T15:22:22.8227161Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int16 PASSED [0.9040s] [ 15%] 2025-12-04T15:22:22.8227281Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int8 PASSED [0.8152s] [ 15%] 2025-12-04T15:22:22.8227409Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_complex128 PASSED [2.1105s] [ 15%] 2025-12-04T15:22:22.8227535Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float16 PASSED [4.2643s] [ 15%] 2025-12-04T15:22:22.8227669Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_complex32 PASSED [0.8206s] [ 15%] 2025-12-04T15:22:22.8227820Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_complex32 PASSED [1.8577s] [ 15%] 2025-12-04T15:22:22.8227955Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_float32 PASSED [2.8745s] [ 15%] 2025-12-04T15:22:22.8228078Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int16 PASSED [0.8182s] [ 15%] 2025-12-04T15:22:22.8228201Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int32 PASSED [0.0063s] [ 15%] 2025-12-04T15:22:22.8228324Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int64 PASSED [1.2899s] [ 15%] 2025-12-04T15:22:22.8228452Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex32 PASSED [1.4446s] [ 15%] 2025-12-04T15:22:22.8228580Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex64 PASSED [0.8396s] [ 15%] 2025-12-04T15:22:22.8228704Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int8 PASSED [0.0062s] [ 15%] 2025-12-04T15:22:22.8228825Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int32 PASSED [0.8273s] [ 16%] 2025-12-04T15:22:22.8228947Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int64 PASSED [0.0080s] [ 16%] 2025-12-04T15:22:22.8229068Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int8 PASSED [0.8257s] [ 16%] 2025-12-04T15:22:22.8229205Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex128 PASSED [0.0052s] [ 16%] 2025-12-04T15:22:22.8229336Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_float64 PASSED [0.8341s] [ 16%] 2025-12-04T15:22:22.8229464Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_int8 PASSED [0.0047s] [ 16%] 2025-12-04T15:22:22.8229590Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float64 PASSED [1.5729s] [ 16%] 2025-12-04T15:22:22.8229715Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int16 PASSED [0.8706s] [ 16%] 2025-12-04T15:22:22.8229838Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int8 PASSED [0.8301s] [ 16%] 2025-12-04T15:22:22.8229974Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_uint8 PASSED [0.0081s] [ 16%] 2025-12-04T15:22:22.8230136Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_bool PASSED [0.3184s] [ 16%] 2025-12-04T15:22:22.8230260Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_uint8 PASSED [1.2913s] [ 16%] 2025-12-04T15:22:22.8230392Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex128 PASSED [2.1656s] [ 16%] 2025-12-04T15:22:22.8230516Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_bool PASSED [0.8291s] [ 16%] 2025-12-04T15:22:22.8230643Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float16 PASSED [0.7506s] [ 16%] 2025-12-04T15:22:22.8230770Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float64 PASSED [1.1244s] [ 16%] 2025-12-04T15:22:22.8230895Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int32 PASSED [0.8258s] [ 16%] 2025-12-04T15:22:22.8231019Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_float64 PASSED [0.6950s] [ 16%] 2025-12-04T15:22:22.8231143Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_uint8 PASSED [1.3562s] [ 16%] 2025-12-04T15:22:22.8231264Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_bool PASSED [0.8054s] [ 16%] 2025-12-04T15:22:22.8231387Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_bool PASSED [0.0080s] [ 16%] 2025-12-04T15:22:22.8231540Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int64 PASSED [0.8241s] [ 16%] 2025-12-04T15:22:22.8231685Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int8 PASSED [0.0082s] [ 16%] 2025-12-04T15:22:22.8231803Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_bool PASSED [0.0140s] [ 16%] 2025-12-04T15:22:22.8231923Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float16 PASSED [0.0158s] [ 16%] 2025-12-04T15:22:22.8232043Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float32 PASSED [0.8347s] [ 16%] 2025-12-04T15:22:22.8232161Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float64 PASSED [0.0181s] [ 16%] 2025-12-04T15:22:22.8232289Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex32 PASSED [0.0182s] [ 16%] 2025-12-04T15:22:22.8232416Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex64 PASSED [0.0177s] [ 16%] 2025-12-04T15:22:22.8232540Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float16 PASSED [0.0166s] [ 16%] 2025-12-04T15:22:22.8232663Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float32 PASSED [0.0166s] [ 16%] 2025-12-04T15:22:22.8232784Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_int8 PASSED [0.0131s] [ 16%] 2025-12-04T15:22:22.8232904Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_uint8 PASSED [0.0131s] [ 17%] 2025-12-04T15:22:22.8233024Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_float32 PASSED [0.0048s] [ 17%] 2025-12-04T15:22:22.8233142Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int64 PASSED [0.0043s] [ 17%] 2025-12-04T15:22:22.8233260Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int8 PASSED [0.0044s] [ 17%] 2025-12-04T15:22:22.8233385Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_bfloat16 PASSED [0.0026s] [ 17%] 2025-12-04T15:22:22.8233505Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_uint8 PASSED [0.0026s] [ 17%] 2025-12-04T15:22:22.8233625Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int64 PASSED [0.0024s] [ 17%] 2025-12-04T15:22:22.8233773Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_complex128 PASSED [0.8957s] [ 17%] 2025-12-04T15:22:22.8233902Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int16 PASSED [0.0537s] [ 17%] 2025-12-04T15:22:22.8234027Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int32 PASSED [0.0511s] [ 17%] 2025-12-04T15:22:22.8234153Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int8 PASSED [0.0500s] [ 17%] 2025-12-04T15:22:22.8234276Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_float32 PASSED [0.0145s] [ 17%] 2025-12-04T15:22:22.8234396Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int32 PASSED [0.0104s] [ 17%] 2025-12-04T15:22:22.8234516Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int64 PASSED [0.8422s] [ 17%] 2025-12-04T15:22:22.8234646Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_int16 PASSED [0.0533s] [ 17%] 2025-12-04T15:22:22.8234762Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_bool PASSED [0.0307s] [ 17%] 2025-12-04T15:22:22.8234882Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_float32 PASSED [0.0432s] [ 17%] 2025-12-04T15:22:22.8235002Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_bfloat16 PASSED [0.0474s] [ 17%] 2025-12-04T15:22:22.8235144Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_float32 PASSED [0.0429s] [ 17%] 2025-12-04T15:22:22.8235265Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_float64 PASSED [0.0430s] [ 17%] 2025-12-04T15:22:22.8235392Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int32 PASSED [0.0403s] [ 17%] 2025-12-04T15:22:22.8235518Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_bfloat16 PASSED [0.0175s] [ 17%] 2025-12-04T15:22:22.8235633Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int16 PASSED [0.1524s] [ 17%] 2025-12-04T15:22:22.8235748Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int64 PASSED [0.0344s] [ 17%] 2025-12-04T15:22:22.8235865Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_float16 PASSED [0.0390s] [ 17%] 2025-12-04T15:22:22.8235980Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int8 PASSED [0.0347s] [ 17%] 2025-12-04T15:22:22.8236161Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_float16 SKIPPED [0.0001s] (Expected: geometric is not comparable) [ 17%] 2025-12-04T15:22:22.8236338Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int8 SKIPPED [0.0001s] (Expected: geometric is not comparable) [ 17%] 2025-12-04T15:22:22.8236458Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_bool PASSED [0.8233s] [ 17%] 2025-12-04T15:22:22.8236585Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_complex128 PASSED [0.0051s] [ 17%] 2025-12-04T15:22:22.8236704Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int16 PASSED [0.8114s] [ 17%] 2025-12-04T15:22:22.8236824Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_uint8 PASSED [0.0041s] [ 18%] 2025-12-04T15:22:22.8236950Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_bfloat16 PASSED [0.8148s] [ 18%] 
2025-12-04T15:22:22.8237069Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int16 PASSED [0.0042s] [ 18%] 2025-12-04T15:22:22.8237190Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int64 PASSED [0.8114s] [ 18%] 2025-12-04T15:22:22.8237307Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int8 PASSED [0.0043s] [ 18%] 2025-12-04T15:22:22.8237440Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float32 PASSED [0.0447s] [ 18%] 2025-12-04T15:22:22.8237561Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float64 PASSED [0.0433s] [ 18%] 2025-12-04T15:22:22.8237674Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_bool PASSED [0.0194s] [ 18%] 2025-12-04T15:22:22.8237791Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_float16 PASSED [1.1605s] [ 18%] 2025-12-04T15:22:22.8237909Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_int16 PASSED [0.0161s] [ 18%] 2025-12-04T15:22:22.8238035Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex128 PASSED [0.0290s] [ 18%] 2025-12-04T15:22:22.8238159Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex64 PASSED [0.0282s] [ 18%] 2025-12-04T15:22:22.8238285Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_uint8 PASSED [0.8258s] [ 18%] 2025-12-04T15:22:22.8238414Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_bfloat16 PASSED [0.0057s] [ 18%] 2025-12-04T15:22:22.8238547Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_complex128 PASSED [0.8131s] [ 18%] 2025-12-04T15:22:22.8238672Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_bool PASSED [0.0041s] [ 18%] 2025-12-04T15:22:22.8238824Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_complex64 PASSED [0.8183s] [ 18%] 2025-12-04T15:22:22.8238952Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_int16 PASSED [0.0042s] [ 18%] 2025-12-04T15:22:22.8239083Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_bool PASSED [0.1183s] [ 18%] 2025-12-04T15:22:22.8239211Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_complex128 PASSED [0.1231s] [ 18%] 2025-12-04T15:22:22.8239332Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int32 PASSED [0.1195s] [ 18%] 2025-12-04T15:22:22.8239452Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int64 PASSED [0.1192s] [ 18%] 2025-12-04T15:22:22.8239573Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_uint8 PASSED [0.1172s] [ 18%] 2025-12-04T15:22:22.8239698Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_float32 PASSED [0.8324s] [ 18%] 2025-12-04T15:22:22.8239823Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int32 PASSED [0.0122s] [ 18%] 2025-12-04T15:22:22.8239942Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_bool PASSED [0.0114s] [ 18%] 2025-12-04T15:22:22.8240062Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_uint8 PASSED [0.8329s] [ 18%] 2025-12-04T15:22:22.8240216Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_bool PASSED [0.0143s] [ 18%] 2025-12-04T15:22:22.8240337Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_float16 PASSED [0.0124s] [ 18%] 2025-12-04T15:22:22.8240463Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_bfloat16 PASSED [0.8346s] [ 18%] 2025-12-04T15:22:22.8240585Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int64 PASSED [0.0113s] [ 18%] 2025-12-04T15:22:22.8240712Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_float16 PASSED [0.0115s] [ 18%] 2025-12-04T15:22:22.8240832Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int8 PASSED [0.8396s] [ 19%] 2025-12-04T15:22:22.8240959Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex32 PASSED [0.0265s] [ 19%] 2025-12-04T15:22:22.8241100Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex64 PASSED [0.0242s] [ 19%] 2025-12-04T15:22:22.8241222Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_float32 PASSED [0.8335s] [ 19%] 2025-12-04T15:22:22.8241341Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_uint8 PASSED [0.0128s] [ 19%] 2025-12-04T15:22:22.8241466Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_istft_cuda_complex128 PASSED [2.0416s] [ 19%] 2025-12-04T15:22:22.8241590Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_istft_cuda_complex64 PASSED [1.4453s] [ 19%] 2025-12-04T15:22:22.8241711Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float16 PASSED [0.8098s] [ 19%] 2025-12-04T15:22:22.8241828Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_uint8 PASSED [0.0047s] [ 19%] 2025-12-04T15:22:22.8241942Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_bool PASSED [0.0349s] [ 19%] 2025-12-04T15:22:22.8242058Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_int16 PASSED [0.0354s] [ 19%] 2025-12-04T15:22:22.8242171Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_int8 PASSED [0.0349s] [ 19%] 2025-12-04T15:22:22.8242290Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int64 PASSED [0.8322s] [ 19%] 2025-12-04T15:22:22.8242422Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_complex64 PASSED [0.0109s] [ 19%] 2025-12-04T15:22:22.8242593Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_bfloat16 PASSED [0.8239s] [ 19%] 2025-12-04T15:22:22.8242734Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex128 PASSED [0.0084s] [ 19%] 2025-12-04T15:22:22.8242884Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex64 PASSED [0.8143s] [ 19%] 2025-12-04T15:22:22.8243016Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int16 PASSED [0.0061s] [ 19%] 2025-12-04T15:22:22.8243147Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int8 PASSED [0.8290s] [ 19%] 2025-12-04T15:22:22.8243284Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_float32 PASSED [0.1345s] [ 19%] 2025-12-04T15:22:22.8243416Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_complex64 PASSED [0.1240s] [ 19%] 2025-12-04T15:22:22.8243545Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_float32 PASSED [2.0581s] [ 19%] 2025-12-04T15:22:22.8243673Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_float64 PASSED [1.1456s] [ 19%] 2025-12-04T15:22:22.8243816Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_complex64 PASSED [0.8666s] [ 19%] 2025-12-04T15:22:22.8243953Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_float32 PASSED [0.0549s] [ 19%] 2025-12-04T15:22:22.8244082Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_complex64 PASSED [0.0201s] [ 19%] 2025-12-04T15:22:22.8244203Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int64 XFAIL [0.0095s] [ 19%] 2025-12-04T15:22:22.8244324Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int8 XFAIL [0.8330s] [ 19%] 2025-12-04T15:22:22.8244469Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int64 XFAIL [0.0301s] [ 19%] 2025-12-04T15:22:22.8244593Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_complex128 PASSED [1.1427s] [ 19%] 2025-12-04T15:22:22.8244713Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_float32 PASSED [0.0164s] [ 19%] 2025-12-04T15:22:22.8244844Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int16 PASSED [0.0148s] [ 20%] 2025-12-04T15:22:22.8244965Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float16 PASSED [0.0149s] [ 20%] 2025-12-04T15:22:22.8245086Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float32 PASSED [0.8338s] [ 20%] 2025-12-04T15:22:22.8245205Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_bfloat16 PASSED [0.0190s] [ 20%] 2025-12-04T15:22:22.8245325Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_float32 PASSED [0.0164s] [ 20%] 2025-12-04T15:22:22.8245507Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_float32 SKIPPED [0.0001s] (Expected: log_normal is not comparable) [ 20%] 2025-12-04T15:22:22.8245652Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float16 PASSED [0.8155s] [ 20%] 2025-12-04T15:22:22.8245795Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float64 PASSED [0.8131s] [ 
20%] 2025-12-04T15:22:22.8245935Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int64 PASSED [0.8181s] [ 20%] 2025-12-04T15:22:22.8246073Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int8 PASSED [0.8213s] [ 20%] 2025-12-04T15:22:22.8246204Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_bfloat16 PASSED [0.0142s] [ 20%] 2025-12-04T15:22:22.8246363Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_float32 PASSED [0.8230s] [ 20%] 2025-12-04T15:22:22.8246492Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_complex32 XFAIL [0.0262s] [ 20%] 2025-12-04T15:22:22.8246630Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float64 PASSED [0.8632s] [ 20%] 2025-12-04T15:22:22.8246761Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_bfloat16 PASSED [0.0333s] [ 20%] 2025-12-04T15:22:22.8246887Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int16 PASSED [0.0299s] [ 20%] 2025-12-04T15:22:22.8247012Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int64 PASSED [0.0299s] [ 20%] 2025-12-04T15:22:22.8247138Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_int32 PASSED [0.0104s] [ 20%] 2025-12-04T15:22:22.8247272Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_complex128 PASSED [0.0538s] [ 20%] 2025-12-04T15:22:22.8247396Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_bool PASSED [0.0272s] [ 20%] 2025-12-04T15:22:22.8247526Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_float32 PASSED [0.0329s] [ 20%] 2025-12-04T15:22:22.8247654Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_complex64 PASSED [0.1136s] [ 20%] 2025-12-04T15:22:22.8247776Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int16 XFAIL [0.0134s] [ 20%] 2025-12-04T15:22:22.8247896Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int8 PASSED [0.8618s] [ 20%] 2025-12-04T15:22:22.8248026Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_complex64 PASSED [0.0103s] [ 20%] 2025-12-04T15:22:22.8248153Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_float32 PASSED [0.0091s] [ 20%] 2025-12-04T15:22:22.8248274Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float16 PASSED [0.8461s] [ 20%] 2025-12-04T15:22:22.8248393Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float32 PASSED [0.0396s] [ 20%] 2025-12-04T15:22:22.8248522Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int32 PASSED [0.0359s] [ 20%] 2025-12-04T15:22:22.8248636Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int64 PASSED [0.0358s] [ 20%] 2025-12-04T15:22:22.8248762Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int64 PASSED [0.8237s] [ 20%] 2025-12-04T15:22:22.8248881Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int8 PASSED [0.0340s] [ 20%] 2025-12-04T15:22:22.8249006Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_complex128 PASSED [0.0116s] [ 21%] 2025-12-04T15:22:22.8249130Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float32 PASSED [0.0108s] [ 21%] 2025-12-04T15:22:22.8249278Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_bfloat16 PASSED [0.0081s] [ 21%] 2025-12-04T15:22:22.8249426Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_float16 PASSED [0.0079s] [ 21%] 2025-12-04T15:22:22.8249572Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_uint8 PASSED [0.8217s] [ 21%] 2025-12-04T15:22:22.8249722Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_bfloat16 PASSED [0.0098s] [ 21%] 2025-12-04T15:22:22.8249868Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_float32 PASSED [0.0082s] [ 21%] 2025-12-04T15:22:22.8250035Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_int16 PASSED [0.0062s] [ 21%] 2025-12-04T15:22:22.8250195Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float64 PASSED [0.0441s] [ 21%] 2025-12-04T15:22:22.8250338Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_complex32 PASSED [0.8176s] [ 21%] 2025-12-04T15:22:22.8250454Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_bool PASSED [0.0372s] [ 21%] 2025-12-04T15:22:22.8250575Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_complex32 XFAIL [0.0331s] [ 21%] 2025-12-04T15:22:22.8250696Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_complex64 PASSED [0.8917s] [ 21%] 2025-12-04T15:22:22.8250824Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_bfloat16 PASSED [0.0157s] [ 21%] 2025-12-04T15:22:22.8250951Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_float16 PASSED [0.8364s] [ 21%] 2025-12-04T15:22:22.8251076Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int32 PASSED [0.0115s] [ 21%] 2025-12-04T15:22:22.8251211Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_complex128 
PASSED [0.0096s] [ 21%] 2025-12-04T15:22:22.8251330Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_uint8 PASSED [0.8357s] [ 21%] 2025-12-04T15:22:22.8251469Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_bfloat16 PASSED [0.0239s] [ 21%] 2025-12-04T15:22:22.8251605Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_float16 PASSED [0.0140s] [ 21%] 2025-12-04T15:22:22.8251726Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_complex128 PASSED [0.0507s] [ 21%] 2025-12-04T15:22:22.8251843Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float16 PASSED [0.0390s] [ 21%] 2025-12-04T15:22:22.8251959Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int64 PASSED [0.0357s] [ 21%] 2025-12-04T15:22:22.8252134Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_float16 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 21%] 2025-12-04T15:22:22.8252321Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_float32 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 21%] 2025-12-04T15:22:22.8252493Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_int16 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 21%] 2025-12-04T15:22:22.8252662Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_int8 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 21%] 2025-12-04T15:22:22.8252859Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex64 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 21%] 2025-12-04T15:22:22.8253052Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float32 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 21%] 2025-12-04T15:22:22.8253242Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_uint8 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 21%] 2025-12-04T15:22:22.8253369Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_bfloat16 PASSED [0.8156s] [ 21%] 2025-12-04T15:22:22.8253499Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_complex128 PASSED [0.0061s] [ 22%] 2025-12-04T15:22:22.8253623Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_float64 PASSED [0.8205s] [ 22%] 2025-12-04T15:22:22.8253746Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int64 PASSED [0.0056s] [ 22%] 2025-12-04T15:22:22.8253900Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex128 PASSED [0.8137s] [ 22%] 2025-12-04T15:22:22.8254029Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex64 PASSED [0.0062s] [ 22%] 2025-12-04T15:22:22.8254167Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_float32 PASSED [0.8195s] [ 22%] 2025-12-04T15:22:22.8254291Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int32 PASSED [0.0054s] [ 22%] 2025-12-04T15:22:22.8254415Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_uint8 PASSED [0.8198s] [ 22%] 2025-12-04T15:22:22.8254614Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_float16 SKIPPED [0.0002s] (Expected: dropout is not comparable) [ 22%] 2025-12-04T15:22:22.8254767Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_bool PASSED [0.8130s] [ 22%] 2025-12-04T15:22:22.8254921Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_float16 PASSED [0.0046s] [ 22%] 2025-12-04T15:22:22.8255073Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_float32 PASSED [0.8222s] [ 22%] 2025-12-04T15:22:22.8255222Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_int8 PASSED [0.0042s] [ 22%] 2025-12-04T15:22:22.8255374Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_uint8 PASSED [0.8273s] [ 22%] 2025-12-04T15:22:22.8255563Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float16 SKIPPED [0.0002s] (Expected: dropout is not comparable) [ 22%] 2025-12-04T15:22:22.8255753Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float64 SKIPPED [0.0001s] (Expected: dropout is not comparable) [ 22%] 2025-12-04T15:22:22.8255894Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_bfloat16 PASSED [0.8434s] [ 22%] 2025-12-04T15:22:22.8256031Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float32 PASSED [0.8384s] [ 22%] 2025-12-04T15:22:22.8256180Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_bfloat16 PASSED [0.8350s] [ 22%] 2025-12-04T15:22:22.8256336Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_bfloat16 PASSED [0.0238s] [ 22%] 2025-12-04T15:22:22.8256481Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_bfloat16 PASSED [0.0164s] [ 22%] 2025-12-04T15:22:22.8256624Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_float16 PASSED [0.8370s] [ 22%] 2025-12-04T15:22:22.8256765Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_int8 PASSED [0.0128s] [ 22%] 2025-12-04T15:22:22.8256926Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float64 PASSED [0.8317s] [ 22%] 2025-12-04T15:22:22.8257069Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_bfloat16 PASSED [0.0063s] [ 22%] 2025-12-04T15:22:22.8257234Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_complex128 PASSED [0.8269s] [ 22%] 2025-12-04T15:22:22.8257399Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_complex64 PASSED [0.8411s] [ 22%] 2025-12-04T15:22:22.8257555Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int8 PASSED [0.8203s] [ 22%] 2025-12-04T15:22:22.8257713Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_bfloat16 PASSED [0.0164s] [ 22%] 2025-12-04T15:22:22.8257887Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int8 PASSED [0.8334s] [ 22%] 2025-12-04T15:22:22.8258050Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_uint8 PASSED [0.0134s] [ 22%] 2025-12-04T15:22:22.8258196Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_bfloat16 PASSED [0.8419s] [ 22%] 2025-12-04T15:22:22.8258339Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float32 PASSED [0.8231s] [ 23%] 2025-12-04T15:22:22.8258482Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float64 PASSED [0.0058s] [ 23%] 2025-12-04T15:22:22.8258632Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int16 PASSED [0.8271s] [ 23%] 2025-12-04T15:22:22.8258789Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_complex64 PASSED [0.0050s] [ 23%] 2025-12-04T15:22:22.8258941Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float32 PASSED [0.0035s] [ 23%] 2025-12-04T15:22:22.8259093Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int64 PASSED [0.8495s] [ 23%] 2025-12-04T15:22:22.8259234Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float64 PASSED [0.0592s] [ 23%] 2025-12-04T15:22:22.8259372Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_float32 PASSED [0.8312s] [ 23%] 2025-12-04T15:22:22.8259510Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int16 PASSED [0.0117s] [ 23%] 2025-12-04T15:22:22.8259645Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int64 PASSED [0.8303s] [ 23%] 2025-12-04T15:22:22.8259786Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_float16 PASSED [0.0181s] [ 23%] 2025-12-04T15:22:22.8259922Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_float64 PASSED [0.0159s] [ 23%] 2025-12-04T15:22:22.8260086Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_bool PASSED [0.8287s] [ 23%] 2025-12-04T15:22:22.8260278Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float64 PASSED [0.8223s] [ 23%] 2025-12-04T15:22:22.8260431Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int64 PASSED [0.7733s] [ 23%] 2025-12-04T15:22:22.8260581Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int8 PASSED 
[0.7628s] [ 23%] 2025-12-04T15:22:22.8260727Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softplus_cuda_float32 PASSED [0.0177s] [ 23%] 2025-12-04T15:22:22.8260877Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_complex64 PASSED [0.0285s] [ 23%] 2025-12-04T15:22:22.8261020Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int16 PASSED [0.7884s] [ 23%] 2025-12-04T15:22:22.8261168Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_bfloat16 PASSED [0.0317s] [ 23%] 2025-12-04T15:22:22.8261327Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float32 PASSED [0.7799s] [ 23%] 2025-12-04T15:22:22.8261484Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float64 PASSED [0.0067s] [ 23%] 2025-12-04T15:22:22.8261608Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_bfloat16 PASSED [0.7839s] [ 23%] 2025-12-04T15:22:22.8261825Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_bfloat16 SKIPPED [0.0002s] (Expected: normal is not comparable) [ 23%] 2025-12-04T15:22:22.8262024Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_complex128 SKIPPED [0.0001s] (Expected: normal is not comparable) [ 23%] 2025-12-04T15:22:22.8262211Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_complex64 SKIPPED [0.0001s] (Expected: normal is not comparable) [ 23%] 2025-12-04T15:22:22.8262393Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_float32 SKIPPED [0.0001s] (Expected: normal is not comparable) [ 23%] 2025-12-04T15:22:22.8262563Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_cuda_float16 SKIPPED [0.0001s] 
(Expected: normal is not comparable) [ 23%] 2025-12-04T15:22:22.8262689Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_bfloat16 PASSED [0.7630s] [ 23%] 2025-12-04T15:22:22.8262814Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_complex128 PASSED [0.0041s] [ 23%] 2025-12-04T15:22:22.8262943Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_bool PASSED [0.0162s] [ 23%] 2025-12-04T15:22:22.8263078Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_complex128 PASSED [0.0216s] [ 24%] 2025-12-04T15:22:22.8263212Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_complex64 PASSED [0.0217s] [ 24%] 2025-12-04T15:22:22.8263344Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_float64 PASSED [0.0206s] [ 24%] 2025-12-04T15:22:22.8263473Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_uint8 PASSED [0.0157s] [ 24%] 2025-12-04T15:22:22.8263596Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_uint8 PASSED [0.0178s] [ 24%] 2025-12-04T15:22:22.8263721Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_float32 PASSED [0.7795s] [ 24%] 2025-12-04T15:22:22.8263844Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_uint8 PASSED [0.0097s] [ 24%] 2025-12-04T15:22:22.8263967Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex128 PASSED [0.0649s] [ 24%] 2025-12-04T15:22:22.8264101Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex32 XFAIL [0.0209s] [ 24%] 2025-12-04T15:22:22.8264223Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex64 PASSED [0.8339s] [ 24%] 2025-12-04T15:22:22.8264342Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_float16 PASSED 
[0.0523s] [ 24%] 2025-12-04T15:22:22.8264459Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int16 PASSED [0.0365s] [ 24%] 2025-12-04T15:22:22.8264578Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int32 PASSED [0.0364s] [ 24%] 2025-12-04T15:22:22.8264693Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_bool PASSED [0.0144s] [ 24%] 2025-12-04T15:22:22.8264817Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_complex64 PASSED [0.0175s] [ 24%] 2025-12-04T15:22:22.8264936Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int16 PASSED [0.0138s] [ 24%] 2025-12-04T15:22:22.8265051Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int8 PASSED [0.0136s] [ 24%] 2025-12-04T15:22:22.8265168Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_uint8 PASSED [0.0137s] [ 24%] 2025-12-04T15:22:22.8265294Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_bfloat16 PASSED [0.0138s] [ 24%] 2025-12-04T15:22:22.8265430Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_bfloat16 PASSED [0.7711s] [ 24%] 2025-12-04T15:22:22.8265566Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_complex64 PASSED [0.0049s] [ 24%] 2025-12-04T15:22:22.8265697Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_float16 PASSED [0.7729s] [ 24%] 2025-12-04T15:22:22.8265821Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_complex64 PASSED [0.0055s] [ 24%] 2025-12-04T15:22:22.8265942Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int64 PASSED [0.7821s] [ 24%] 2025-12-04T15:22:22.8266060Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_uint8 PASSED [0.0041s] [ 24%] 2025-12-04T15:22:22.8266193Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_complex64 PASSED [0.0294s] [ 24%] 2025-12-04T15:22:22.8266320Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_float32 PASSED [0.0157s] [ 24%] 2025-12-04T15:22:22.8266446Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int8 PASSED [0.7846s] [ 24%] 2025-12-04T15:22:22.8266568Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_float32 PASSED [0.0104s] [ 24%] 2025-12-04T15:22:22.8266688Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_bool PASSED [0.0110s] [ 24%] 2025-12-04T15:22:22.8266814Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_complex128 PASSED [0.0135s] [ 24%] 2025-12-04T15:22:22.8266936Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_float32 PASSED [0.0128s] [ 24%] 2025-12-04T15:22:22.8267055Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int32 PASSED [0.0105s] [ 25%] 2025-12-04T15:22:22.8267184Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_bfloat16 PASSED [0.7714s] [ 25%] 2025-12-04T15:22:22.8267310Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_int16 PASSED [0.0093s] [ 25%] 2025-12-04T15:22:22.8267435Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_int8 PASSED [0.7726s] [ 25%] 2025-12-04T15:22:22.8267561Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_bfloat16 PASSED [0.0195s] [ 25%] 2025-12-04T15:22:22.8267692Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_complex32 PASSED [0.7760s] [ 25%] 2025-12-04T15:22:22.8267810Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int32 PASSED [0.0069s] [ 25%] 2025-12-04T15:22:22.8267926Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int8 PASSED [0.7797s] [ 25%] 2025-12-04T15:22:22.8268044Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int32 PASSED [0.0100s] [ 25%] 2025-12-04T15:22:22.8268162Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_bool PASSED [0.0176s] [ 25%] 2025-12-04T15:22:22.8268288Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex128 PASSED [0.4067s] [ 25%] 2025-12-04T15:22:22.8268412Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex64 PASSED [0.8052s] [ 25%] 2025-12-04T15:22:22.8268529Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int8 PASSED [0.0157s] [ 25%] 2025-12-04T15:22:22.8268652Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_complex64 PASSED [0.0551s] [ 25%] 2025-12-04T15:22:22.8268772Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_float64 PASSED [0.0402s] [ 25%] 2025-12-04T15:22:22.8268887Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_uint8 PASSED [0.0281s] [ 25%] 2025-12-04T15:22:22.8269022Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_bfloat16 PASSED [0.7820s] [ 25%] 2025-12-04T15:22:22.8269179Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_float16 PASSED [0.0052s] [ 25%] 2025-12-04T15:22:22.8269310Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int64 PASSED [0.7750s] [ 25%] 2025-12-04T15:22:22.8269451Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int8 PASSED [0.0045s] [ 25%] 2025-12-04T15:22:22.8269571Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_bfloat16 PASSED [0.7768s] [ 25%] 2025-12-04T15:22:22.8269695Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex128 PASSED [0.0281s] [ 25%] 2025-12-04T15:22:22.8269815Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_bool PASSED [0.0251s] [ 25%] 2025-12-04T15:22:22.8269936Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_uint8 PASSED [0.0203s] [ 25%] 2025-12-04T15:22:22.8270054Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_int16 PASSED [0.7795s] [ 25%] 2025-12-04T15:22:22.8270211Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_float32 PASSED [0.0166s] [ 25%] 2025-12-04T15:22:22.8270355Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_complex64 PASSED [0.0071s] [ 25%] 2025-12-04T15:22:22.8270495Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float32 PASSED [0.0066s] [ 25%] 2025-12-04T15:22:22.8270632Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float64 PASSED [0.0065s] [ 25%] 2025-12-04T15:22:22.8270769Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int64 PASSED [0.0066s] [ 25%] 2025-12-04T15:22:22.8270901Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_bool PASSED [0.0194s] [ 25%] 2025-12-04T15:22:22.8271039Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_float64 PASSED [0.0176s] [ 25%] 2025-12-04T15:22:22.8271172Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_int16 PASSED [0.7840s] [ 26%] 2025-12-04T15:22:22.8271308Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_float64 PASSED [0.2945s] [ 26%] 2025-12-04T15:22:22.8271454Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_bool PASSED [0.0200s] [ 26%] 
2025-12-04T15:22:22.8271586Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_bfloat16 PASSED [0.3440s] [ 26%] 2025-12-04T15:22:22.8271713Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int16 PASSED [0.9848s] [ 26%] 2025-12-04T15:22:22.8271839Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int32 PASSED [0.0160s] [ 26%] 2025-12-04T15:22:22.8271970Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_bfloat16 PASSED [0.3596s] [ 26%] 2025-12-04T15:22:22.8272093Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_int64 PASSED [0.0164s] [ 26%] 2025-12-04T15:22:22.8272218Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_uint8 PASSED [0.7906s] [ 26%] 2025-12-04T15:22:22.8272344Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int16 PASSED [0.0187s] [ 26%] 2025-12-04T15:22:22.8272471Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_uint8 PASSED [0.0134s] [ 26%] 2025-12-04T15:22:22.8272604Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_bool PASSED [0.7906s] [ 26%] 2025-12-04T15:22:22.8272736Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int16 PASSED [0.0169s] [ 26%] 2025-12-04T15:22:22.8272894Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int32 PASSED [0.0153s] [ 26%] 2025-12-04T15:22:22.8273049Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_float64 PASSED [0.7782s] [ 26%] 2025-12-04T15:22:22.8273194Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_bfloat16 PASSED [0.0206s] [ 26%] 2025-12-04T15:22:22.8273323Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_int16 PASSED [0.0163s] [ 26%] 2025-12-04T15:22:22.8273453Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_uint8 PASSED [0.7915s] [ 26%] 2025-12-04T15:22:22.8273609Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int16 PASSED [0.0235s] [ 26%] 2025-12-04T15:22:22.8273767Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int32 PASSED [0.0213s] [ 26%] 2025-12-04T15:22:22.8273924Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int64 PASSED [0.0211s] [ 26%] 2025-12-04T15:22:22.8274079Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8 PASSED [0.7868s] [ 26%] 2025-12-04T15:22:22.8274238Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float16 PASSED [0.8163s] [ 26%] 2025-12-04T15:22:22.8274394Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int16 PASSED [0.7990s] [ 26%] 2025-12-04T15:22:22.8274554Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16 PASSED [0.8046s] [ 26%] 2025-12-04T15:22:22.8274707Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int32 PASSED [0.7964s] [ 26%] 2025-12-04T15:22:22.8274864Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int64 PASSED [0.0233s] [ 26%] 2025-12-04T15:22:22.8275016Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int8 PASSED [0.0203s] [ 26%] 2025-12-04T15:22:22.8275150Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_float32 PASSED [0.0152s] [ 26%] 2025-12-04T15:22:22.8275295Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_uint8 PASSED [0.7871s] [ 26%] 2025-12-04T15:22:22.8275425Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_bool PASSED [0.0214s] [ 26%] 2025-12-04T15:22:22.8275554Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int64 PASSED [0.0149s] [ 27%] 2025-12-04T15:22:22.8275682Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int8 PASSED [0.0135s] [ 27%] 2025-12-04T15:22:22.8275830Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_bool PASSED [0.7922s] [ 27%] 2025-12-04T15:22:22.8275978Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int64 PASSED [0.7807s] [ 27%] 2025-12-04T15:22:22.8276126Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_bool PASSED [0.0220s] [ 27%] 2025-12-04T15:22:22.8276277Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_float32 PASSED [0.0162s] [ 27%] 2025-12-04T15:22:22.8276423Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int32 PASSED [0.0145s] [ 27%] 2025-12-04T15:22:22.8276555Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_int64 PASSED [0.0466s] [ 27%] 2025-12-04T15:22:22.8276683Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int32 PASSED [0.0544s] [ 27%] 2025-12-04T15:22:22.8276839Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_bfloat16 PASSED [0.7793s] [ 27%] 2025-12-04T15:22:22.8276981Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_int64 PASSED [0.0046s] [ 27%] 2025-12-04T15:22:22.8277103Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_bfloat16 PASSED [0.0161s] [ 27%] 2025-12-04T15:22:22.8277220Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_bool PASSED [0.7926s] [ 27%] 2025-12-04T15:22:22.8277341Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float16 PASSED [0.0171s] [ 27%] 2025-12-04T15:22:22.8277461Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float32 PASSED [0.0148s] [ 27%] 2025-12-04T15:22:22.8277579Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int64 PASSED [0.0134s] [ 27%] 2025-12-04T15:22:22.8277700Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_uint8 PASSED [0.0125s] [ 27%] 2025-12-04T15:22:22.8277832Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_float16 PASSED [0.7830s] [ 27%] 2025-12-04T15:22:22.8277962Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_int32 PASSED [0.0049s] [ 27%] 2025-12-04T15:22:22.8278090Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_complex32 PASSED [0.0055s] [ 27%] 2025-12-04T15:22:22.8278214Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_float64 PASSED [0.7691s] [ 27%] 2025-12-04T15:22:22.8278335Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int64 PASSED [0.0056s] [ 27%] 2025-12-04T15:22:22.8278455Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_uint8 PASSED [0.0044s] [ 27%] 2025-12-04T15:22:22.8278594Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_bfloat16 PASSED [0.0046s] [ 27%] 2025-12-04T15:22:22.8278733Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex64 PASSED [0.7690s] [ 27%] 2025-12-04T15:22:22.8278870Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_float16 PASSED [0.0059s] [ 27%] 2025-12-04T15:22:22.8279010Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int16 PASSED [0.0039s] [ 27%] 2025-12-04T15:22:22.8279142Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int8 PASSED [0.7721s] [ 27%] 2025-12-04T15:22:22.8279261Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int64 PASSED [0.0053s] [ 27%] 2025-12-04T15:22:22.8279383Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_complex64 PASSED [0.7819s] [ 27%] 2025-12-04T15:22:22.8279505Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_float32 PASSED [0.3324s] [ 27%] 2025-12-04T15:22:22.8279620Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int8 PASSED [0.0079s] [ 27%] 2025-12-04T15:22:22.8279752Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_complex128 PASSED [0.7798s] [ 28%] 2025-12-04T15:22:22.8279881Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_float32 PASSED [0.0087s] [ 28%] 2025-12-04T15:22:22.8280010Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_float64 PASSED [0.0071s] [ 28%] 2025-12-04T15:22:22.8280177Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int32 PASSED [0.7878s] [ 28%] 2025-12-04T15:22:22.8280303Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_complex128 PASSED [0.0047s] [ 28%] 2025-12-04T15:22:22.8280422Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_complex128 PASSED [0.7681s] [ 28%] 2025-12-04T15:22:22.8280564Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_float16 PASSED [0.0044s] [ 28%] 2025-12-04T15:22:22.8280691Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_uint8 PASSED [0.7726s] [ 28%] 2025-12-04T15:22:22.8280824Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_float32 PASSED [0.0070s] [ 28%] 2025-12-04T15:22:22.8280946Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_bfloat16 PASSED [0.0168s] [ 28%] 2025-12-04T15:22:22.8281065Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_float16 PASSED [0.0161s] [ 28%] 2025-12-04T15:22:22.8281186Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_bfloat16 PASSED [0.7855s] [ 28%] 2025-12-04T15:22:22.8281308Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex64 PASSED [0.0310s] [ 28%] 2025-12-04T15:22:22.8281426Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_int8 PASSED [0.0134s] [ 28%] 2025-12-04T15:22:22.8281557Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_float16 PASSED [0.7818s] [ 28%] 2025-12-04T15:22:22.8281677Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_bfloat16 PASSED [0.0129s] [ 28%] 2025-12-04T15:22:22.8281792Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_uint8 PASSED [0.0115s] [ 28%] 2025-12-04T15:22:22.8281912Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float16 PASSED [0.7769s] [ 28%] 2025-12-04T15:22:22.8282030Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int16 PASSED [0.0033s] [ 28%] 2025-12-04T15:22:22.8282149Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int8 PASSED [0.7680s] [ 28%] 2025-12-04T15:22:22.8282287Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_complex32 PASSED [0.0062s] [ 28%] 2025-12-04T15:22:22.8282421Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_float16 PASSED [0.7847s] [ 28%] 2025-12-04T15:22:22.8282555Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_float64 PASSED [0.0061s] [ 28%] 2025-12-04T15:22:22.8282678Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_bool PASSED [0.7688s] [ 28%] 2025-12-04T15:22:22.8282820Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_complex64 PASSED [0.0062s] [ 28%] 2025-12-04T15:22:22.8282948Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int32 PASSED [0.7795s] [ 28%] 2025-12-04T15:22:22.8283072Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int64 PASSED [0.0049s] [ 28%] 2025-12-04T15:22:22.8283192Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_float16 PASSED [0.7999s] [ 28%] 2025-12-04T15:22:22.8283312Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int16 PASSED [0.0056s] [ 28%] 2025-12-04T15:22:22.8283430Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int32 PASSED [0.7632s] [ 28%] 2025-12-04T15:22:22.8283548Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int64 PASSED [0.0055s] [ 28%] 2025-12-04T15:22:22.8283672Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_complex128 PASSED [0.7770s] [ 29%] 2025-12-04T15:22:22.8283805Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex64 PASSED [0.0724s] [ 29%] 2025-12-04T15:22:22.8283933Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float32 PASSED [0.8190s] [ 29%] 2025-12-04T15:22:22.8284059Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int16 PASSED [0.0571s] [ 29%] 2025-12-04T15:22:22.8284207Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int64 PASSED [0.8247s] [ 29%] 2025-12-04T15:22:22.8284330Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_bfloat16 PASSED [0.0172s] [ 29%] 2025-12-04T15:22:22.8284462Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_float32 PASSED [0.0146s] [ 29%] 2025-12-04T15:22:22.8284581Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_int8 PASSED [0.0101s] [ 29%] 2025-12-04T15:22:22.8284702Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_uint8 PASSED [0.7846s] [ 29%] 2025-12-04T15:22:22.8284826Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_int32 PASSED [0.0060s] [ 29%] 2025-12-04T15:22:22.8284953Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_int64 PASSED [0.7904s] [ 29%] 2025-12-04T15:22:22.8285077Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_complex64 PASSED [0.0083s] [ 29%] 2025-12-04T15:22:22.8285200Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_float64 PASSED [0.7656s] [ 29%] 2025-12-04T15:22:22.8285320Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int64 PASSED [0.0058s] [ 29%] 2025-12-04T15:22:22.8285446Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_float32 PASSED [0.0052s] [ 29%] 2025-12-04T15:22:22.8285570Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int16 PASSED [0.0042s] [ 29%] 2025-12-04T15:22:22.8285694Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int32 PASSED [0.0042s] [ 29%] 2025-12-04T15:22:22.8285816Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int64 PASSED [0.0040s] [ 29%] 2025-12-04T15:22:22.8285946Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_bfloat16 PASSED [0.7793s] [ 29%] 2025-12-04T15:22:22.8286080Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_complex128 PASSED [0.0107s] [ 29%] 2025-12-04T15:22:22.8286211Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_float16 PASSED [0.7773s] [ 29%] 2025-12-04T15:22:22.8286337Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int32 PASSED [0.0082s] [ 29%] 2025-12-04T15:22:22.8286466Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_bool PASSED [0.7802s] [ 29%] 2025-12-04T15:22:22.8286589Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_uint8 PASSED [0.0077s] [ 29%] 2025-12-04T15:22:22.8286712Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_uint8 PASSED [0.0048s] [ 29%] 2025-12-04T15:22:22.8286831Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_float32 PASSED [0.7858s] [ 29%] 2025-12-04T15:22:22.8286950Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_float64 PASSED [0.0095s] [ 29%] 2025-12-04T15:22:22.8287077Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_complex64 PASSED [0.0128s] [ 29%] 2025-12-04T15:22:22.8287201Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_float64 PASSED [0.0110s] [ 29%] 2025-12-04T15:22:22.8287327Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_bfloat16 PASSED [0.7938s] [ 29%] 2025-12-04T15:22:22.8287453Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_complex128 PASSED [0.0123s] [ 29%] 2025-12-04T15:22:22.8287580Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_complex64 PASSED [0.7916s] [ 29%] 2025-12-04T15:22:22.8287702Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_float64 PASSED [0.0121s] [ 30%] 2025-12-04T15:22:22.8287842Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_int8 PASSED [0.7978s] [ 30%] 2025-12-04T15:22:22.8287970Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_bfloat16 PASSED [0.0060s] [ 30%] 2025-12-04T15:22:22.8288113Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_float64 PASSED [0.7768s] [ 30%] 2025-12-04T15:22:22.8288235Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_float32 PASSED [0.0195s] [ 30%] 2025-12-04T15:22:22.8288357Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_float32 PASSED [0.7789s] [ 30%] 2025-12-04T15:22:22.8288479Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_float64 PASSED [0.0047s] [ 30%] 2025-12-04T15:22:22.8288601Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_bfloat16 PASSED [0.7979s] [ 30%] 2025-12-04T15:22:22.8288724Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_float32 PASSED [0.0049s] [ 30%] 2025-12-04T15:22:22.8288846Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex64 PASSED [0.0114s] [ 30%] 2025-12-04T15:22:22.8288967Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_uint8 PASSED [0.7926s] [ 30%] 2025-12-04T15:22:22.8289087Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_float16 PASSED [0.0526s] [ 30%] 2025-12-04T15:22:22.8289209Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_float64 PASSED [0.8273s] [ 30%] 2025-12-04T15:22:22.8289329Z 
test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float32 PASSED [0.0038s] [ 30%] 2025-12-04T15:22:22.8289446Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_min_reduction_no_dim_cuda PASSED [0.7834s] [ 30%] 2025-12-04T15:22:22.8289552Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_std_mean_cuda PASSED [0.0046s] [ 30%] 2025-12-04T15:22:22.8289668Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_std_mean_unbiased_cuda PASSED [0.7737s] [ 30%] 2025-12-04T15:22:22.8289772Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_var_mean_cuda PASSED [0.0049s] [ 30%] 2025-12-04T15:22:22.8289887Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_var_mean_unbiased_cuda PASSED [0.7854s] [ 30%] 2025-12-04T15:22:22.8290002Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_H_cuda_float32 PASSED [0.0043s] [ 30%] 2025-12-04T15:22:22.8290167Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmod___cuda_float32 PASSED [0.7904s] [ 30%] 2025-12-04T15:22:22.8290291Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rpow___cuda_complex64 PASSED [0.7990s] [ 30%] 2025-12-04T15:22:22.8290409Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_abs_cuda_complex64 PASSED [0.7778s] [ 30%] 2025-12-04T15:22:22.8290526Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_add_cuda_complex64 PASSED [0.8104s] [ 30%] 2025-12-04T15:22:22.8290642Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_add_cuda_float32 PASSED [0.0183s] [ 30%] 2025-12-04T15:22:22.8290778Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addbmm_cuda_float32 SKIPPED [0.0002s] (Skipped!) 
[ 30%] 2025-12-04T15:22:22.8290898Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcdiv_cuda_complex64 XFAIL [0.0207s] [ 30%] 2025-12-04T15:22:22.8291016Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcmul_cuda_float32 XFAIL [0.8074s] [ 30%] 2025-12-04T15:22:22.8291150Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_decomposed_cuda_complex64 PASSED [0.8038s] [ 30%] 2025-12-04T15:22:22.8291265Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmv_cuda_float32 PASSED [0.0353s] [ 30%] 2025-12-04T15:22:22.8291377Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_all_cuda_float32 PASSED [0.7850s] [ 30%] 2025-12-04T15:22:22.8291513Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_allclose_cuda_complex64 PASSED [0.7966s] [ 31%] 2025-12-04T15:22:22.8291639Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_amax_cuda_float32 PASSED [0.0158s] [ 31%] 2025-12-04T15:22:22.8291770Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argmax_cuda_float32 PASSED [0.7989s] [ 31%] 2025-12-04T15:22:22.8291958Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_copy_cuda_complex64 SKIPPED [0.0002s] (Errors when storage_offset is included) [ 31%] 2025-12-04T15:22:22.8292097Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_partial_views_cuda_float32 XFAIL [0.0063s] [ 31%] 2025-12-04T15:22:22.8292214Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asin_cuda_float32 PASSED [1.5402s] [ 31%] 2025-12-04T15:22:22.8292328Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atan_cuda_float32 PASSED [0.0060s] [ 31%] 2025-12-04T15:22:22.8292452Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_2d_cuda_float32 PASSED [0.7745s] [ 31%] 2025-12-04T15:22:22.8292601Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_shapes_cuda_float32 SKIPPED [0.0002s] (Skipped!) 
[ 31%] 2025-12-04T15:22:22.8292716Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_byte_cuda_float32 PASSED [0.7776s] [ 31%] 2025-12-04T15:22:22.8292846Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_complex64 PASSED [0.7825s] [ 31%] 2025-12-04T15:22:22.8292963Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cat_cuda_complex64 PASSED [0.7948s] [ 31%] 2025-12-04T15:22:22.8293075Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ceil_cuda_float32 PASSED [0.0046s] [ 31%] 2025-12-04T15:22:22.8293190Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chalf_cuda_float32 PASSED [0.7823s] [ 31%] 2025-12-04T15:22:22.8293308Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chunk_cuda_complex64 PASSED [0.7817s] [ 31%] 2025-12-04T15:22:22.8293425Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clamp_cuda_float32 PASSED [0.0191s] [ 31%] 2025-12-04T15:22:22.8293546Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumsum_cuda_complex64 PASSED [0.0116s] [ 31%] 2025-12-04T15:22:22.8293658Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_cuda_float32 PASSED [0.7718s] [ 31%] 2025-12-04T15:22:22.8293793Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_embed_cuda_float32 PASSED [0.0118s] [ 31%] 2025-12-04T15:22:22.8293912Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagflat_cuda_float32 PASSED [0.7730s] [ 31%] 2025-12-04T15:22:22.8294045Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_scatter_cuda_float32 PASSED [0.0123s] [ 31%] 2025-12-04T15:22:22.8294179Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_float32 PASSED [0.7805s] [ 31%] 2025-12-04T15:22:22.8294297Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dsplit_cuda_float32 PASSED [0.0043s] [ 31%] 2025-12-04T15:22:22.8294419Z 
test_ops.py::TestCommonCUDA::test_variant_consistency_eager_einsum_cuda_complex64 PASSED [0.1308s] [ 31%] 2025-12-04T15:22:22.8294557Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_cuda_complex64 SKIPPED [0.0002s] (Skipped!) [ 31%] 2025-12-04T15:22:22.8294687Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 31%] 2025-12-04T15:22:22.8294802Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eq_cuda_float32 PASSED [0.0050s] [ 31%] 2025-12-04T15:22:22.8294924Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exponential_cuda_float32 XFAIL [0.0083s] [ 31%] 2025-12-04T15:22:22.8295055Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eye_cuda_complex64 SKIPPED [0.0001s] (Skipped!) [ 31%] 2025-12-04T15:22:22.8295174Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fft_cuda_float32 PASSED [1.3879s] [ 31%] 2025-12-04T15:22:22.8295317Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftn_cuda_complex64 PASSED [1.3915s] [ 31%] 2025-12-04T15:22:22.8295436Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftn_cuda_float32 PASSED [0.7668s] [ 31%] 2025-12-04T15:22:22.8295570Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfft2_cuda_complex64 PASSED [1.0865s] [ 32%] 2025-12-04T15:22:22.8295694Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfftn_cuda_complex64 PASSED [1.0931s] [ 32%] 2025-12-04T15:22:22.8295814Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft2_cuda_float32 PASSED [0.9287s] [ 32%] 2025-12-04T15:22:22.8295927Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fill_cuda_float32 PASSED [0.7752s] [ 32%] 2025-12-04T15:22:22.8296045Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flip_cuda_complex64 PASSED [0.7796s] [ 32%] 2025-12-04T15:22:22.8296164Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_cuda_complex64 
PASSED [0.7704s] [ 32%] 2025-12-04T15:22:22.8296281Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_floor_cuda_float32 PASSED [0.0046s] [ 32%] 2025-12-04T15:22:22.8296396Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_frac_cuda_float32 PASSED [0.7695s] [ 32%] 2025-12-04T15:22:22.8296516Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_full_like_cuda_float32 PASSED [0.0040s] [ 32%] 2025-12-04T15:22:22.8296637Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gather_cuda_complex64 PASSED [0.0145s] [ 32%] 2025-12-04T15:22:22.8296753Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gather_cuda_float32 PASSED [0.7838s] [ 32%] 2025-12-04T15:22:22.8296871Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_geqrf_cuda_complex64 PASSED [0.8236s] [ 32%] 2025-12-04T15:22:22.8296992Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_heaviside_cuda_float32 PASSED [0.7858s] [ 32%] 2025-12-04T15:22:22.8297108Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_histc_cuda_float32 PASSED [0.0356s] [ 32%] 2025-12-04T15:22:22.8297225Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_igamma_cuda_float32 PASSED [0.7872s] [ 32%] 2025-12-04T15:22:22.8297342Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_igammac_cuda_float32 PASSED [0.0080s] [ 32%] 2025-12-04T15:22:22.8297459Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_imag_cuda_complex64 PASSED [0.7630s] [ 32%] 2025-12-04T15:22:22.8297593Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_add_cuda_complex64 PASSED [0.7943s] [ 32%] 2025-12-04T15:22:22.8297712Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_inner_cuda_complex64 PASSED [0.7687s] [ 32%] 2025-12-04T15:22:22.8297827Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_inner_cuda_float32 PASSED [0.0050s] [ 32%] 2025-12-04T15:22:22.8297951Z 
test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isfinite_cuda_complex64 PASSED [0.7727s] [ 32%] 2025-12-04T15:22:22.8298069Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isnan_cuda_complex64 PASSED [0.7637s] [ 32%] 2025-12-04T15:22:22.8298184Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isnan_cuda_float32 PASSED [0.0033s] [ 32%] 2025-12-04T15:22:22.8298304Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isreal_cuda_complex64 PASSED [0.7675s] [ 32%] 2025-12-04T15:22:22.8298422Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_item_cuda_complex64 PASSED [0.7691s] [ 32%] 2025-12-04T15:22:22.8298580Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_4inputs_with_extra_args_cuda_complex64 PASSED [1.1044s] [ 32%] 2025-12-04T15:22:22.8298711Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_cuda_float32 PASSED [2.4019s] [ 32%] 2025-12-04T15:22:22.8298837Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eig_cuda_complex64 PASSED [0.9729s] [ 32%] 2025-12-04T15:22:22.8298982Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigh_cuda_float32 PASSED [0.9024s] [ 32%] 2025-12-04T15:22:22.8299115Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvalsh_cuda_complex64 PASSED [0.8574s] [ 32%] 2025-12-04T15:22:22.8299358Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_householder_product_cuda_float32 SKIPPED [0.0011s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 32%] 2025-12-04T15:22:22.8299489Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_ex_cuda_complex64 PASSED [0.0547s] [ 33%] 2025-12-04T15:22:22.8299614Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_ex_cuda_float32 PASSED [0.8762s] [ 33%] 2025-12-04T15:22:22.8299754Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_ex_cuda_complex64 PASSED 
[1.4984s] [ 33%] 2025-12-04T15:22:22.8299897Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_grad_oriented_cuda_float32 PASSED [1.5898s] [ 33%] 2025-12-04T15:22:22.8300038Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_ex_cuda_complex64 PASSED [0.0511s] [ 33%] 2025-12-04T15:22:22.8300203Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_solve_cuda_float32 PASSED [0.4202s] [ 33%] 2025-12-04T15:22:22.8300340Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_norm_cuda_complex64 PASSED [1.4242s] [ 33%] 2025-12-04T15:22:22.8300474Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_norm_cuda_float32 PASSED [0.0343s] [ 33%] 2025-12-04T15:22:22.8300607Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_rank_cuda_float32 PASSED [0.0303s] [ 33%] 2025-12-04T15:22:22.8300732Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_cuda_float32 PASSED [0.0294s] [ 33%] 2025-12-04T15:22:22.8300869Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_hermitian_cuda_float32 PASSED [1.3513s] [ 33%] 2025-12-04T15:22:22.8300996Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_qr_cuda_complex64 PASSED [1.3772s] [ 33%] 2025-12-04T15:22:22.8301126Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_ex_cuda_complex64 PASSED [1.4006s] [ 33%] 2025-12-04T15:22:22.8301268Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_triangular_cuda_float32 PASSED [0.0993s] [ 33%] 2025-12-04T15:22:22.8301404Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_svd_cuda_float32 PASSED [0.0814s] [ 33%] 2025-12-04T15:22:22.8301524Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_long_cuda_complex64 PASSED [1.3629s] [ 33%] 2025-12-04T15:22:22.8301639Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_long_cuda_float32 PASSED 
[0.0043s] [ 33%] 2025-12-04T15:22:22.8301760Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_solve_cuda_float32 PASSED [0.0482s] [ 33%] 2025-12-04T15:22:22.8301874Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mH_cuda_float32 PASSED [1.3796s] [ 33%] 2025-12-04T15:22:22.8301998Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_amin_cuda_float32 PASSED [0.0272s] [ 33%] 2025-12-04T15:22:22.8302123Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_argmin_cuda_float32 PASSED [0.0201s] [ 33%] 2025-12-04T15:22:22.8302246Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_mean_cuda_float32 PASSED [0.0268s] [ 33%] 2025-12-04T15:22:22.8302372Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_select_cuda_float32 PASSED [1.4557s] [ 33%] 2025-12-04T15:22:22.8302493Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_sum_cuda_float32 PASSED [0.0244s] [ 33%] 2025-12-04T15:22:22.8302615Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_matrix_exp_cuda_float32 PASSED [1.4075s] [ 33%] 2025-12-04T15:22:22.8302736Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_binary_cuda_float32 PASSED [1.3679s] [ 33%] 2025-12-04T15:22:22.8302887Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_maximum_cuda_float32 PASSED [0.0105s] [ 33%] 2025-12-04T15:22:22.8303031Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_variadic_tensors_cuda_float32 PASSED [1.3845s] [ 33%] 2025-12-04T15:22:22.8303160Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mode_cuda_float32 PASSED [0.1356s] [ 33%] 2025-12-04T15:22:22.8303275Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_msort_cuda_float32 PASSED [1.5114s] [ 33%] 2025-12-04T15:22:22.8303391Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mul_cuda_complex64 PASSED [0.0542s] [ 33%] 2025-12-04T15:22:22.8303514Z 
test_ops.py::TestCommonCUDA::test_variant_consistency_eager_multinomial_cuda_float32 PASSED [1.3725s] [ 33%] 2025-12-04T15:22:22.8303635Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanmean_cuda_complex64 PASSED [0.0638s] [ 34%] 2025-12-04T15:22:22.8303753Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_cuda_float32 PASSED [1.3832s] [ 34%] 2025-12-04T15:22:22.8303885Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_native_layer_norm_cuda_float32 PASSED [0.0061s] [ 34%] 2025-12-04T15:22:22.8304001Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_neg_cuda_complex64 PASSED [1.3804s] [ 34%] 2025-12-04T15:22:22.8304197Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_strided_cuda_float32 SKIPPED [0.0002s] (Expected: new_empty_strided is not comparable) [ 34%] 2025-12-04T15:22:22.8304319Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_full_cuda_complex64 PASSED [1.3631s] [ 34%] 2025-12-04T15:22:22.8304439Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nextafter_cuda_float32 PASSED [0.0070s] [ 34%] 2025-12-04T15:22:22.8304593Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [1.3922s] [ 34%] 2025-12-04T15:22:22.8304736Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_batch_norm_cuda_float32 PASSED [0.2537s] [ 34%] 2025-12-04T15:22:22.8304911Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [1.3740s] [ 34%] 2025-12-04T15:22:22.8305062Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_gaussian_nll_loss_cuda_float32 PASSED [0.1683s] [ 34%] 2025-12-04T15:22:22.8305208Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_glu_cuda_float32 PASSED [1.3764s] [ 34%] 2025-12-04T15:22:22.8305352Z 
test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardsigmoid_cuda_float32 PASSED [1.3773s] [ 34%] 2025-12-04T15:22:22.8305492Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardswish_cuda_float32 PASSED [1.4003s] [ 34%] 2025-12-04T15:22:22.8305643Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_area_cuda_float32 PASSED [1.4196s] [ 34%] 2025-12-04T15:22:22.8305785Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_layer_norm_cuda_float32 PASSED [1.4054s] [ 34%] 2025-12-04T15:22:22.8305925Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_pool3d_cuda_float32 PASSED [0.1090s] [ 34%] 2025-12-04T15:22:22.8306069Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool2d_cuda_float32 PASSED [0.0963s] [ 34%] 2025-12-04T15:22:22.8306219Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool2d_grad_cuda_float32 PASSED [0.0501s] [ 34%] 2025-12-04T15:22:22.8306361Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool3d_cuda_float32 PASSED [0.0281s] [ 34%] 2025-12-04T15:22:22.8306510Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool3d_grad_cuda_float32 PASSED [1.3870s] [ 34%] 2025-12-04T15:22:22.8306696Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.0175s] [ 34%] 2025-12-04T15:22:22.8306841Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_normalize_cuda_complex64 PASSED [0.0100s] [ 34%] 2025-12-04T15:22:22.8306992Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_reflect_cuda_float32 PASSED [1.3697s] [ 34%] 2025-12-04T15:22:22.8307142Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_cuda_complex64 PASSED [1.3899s] [ 34%] 
2025-12-04T15:22:22.8307287Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_shuffle_cuda_float32 PASSED [1.3670s] [ 34%] 2025-12-04T15:22:22.8307444Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_static_cuda_complex64 SKIPPED [0.0012s] (Only runs on cpu) [ 34%] 2025-12-04T15:22:22.8307568Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_fro_cuda_complex64 PASSED [1.3692s] [ 34%] 2025-12-04T15:22:22.8307684Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_cuda_float32 XFAIL [0.0033s] [ 34%] 2025-12-04T15:22:22.8307809Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_like_cuda_complex64 PASSED [1.3602s] [ 34%] 2025-12-04T15:22:22.8307928Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_outer_cuda_complex64 PASSED [1.3625s] [ 34%] 2025-12-04T15:22:22.8308058Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_permute_copy_cuda_complex64 PASSED [1.3677s] [ 35%] 2025-12-04T15:22:22.8308180Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pinverse_cuda_complex64 PASSED [1.4205s] [ 35%] 2025-12-04T15:22:22.8308296Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pow_cuda_complex64 PASSED [1.3783s] [ 35%] 2025-12-04T15:22:22.8308409Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pow_cuda_float32 PASSED [0.0164s] [ 35%] 2025-12-04T15:22:22.8308527Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rad2deg_cuda_float32 PASSED [1.3457s] [ 35%] 2025-12-04T15:22:22.8308643Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_real_cuda_float32 PASSED [0.0041s] [ 35%] 2025-12-04T15:22:22.8308764Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_remainder_cuda_float32 PASSED [1.3711s] [ 35%] 2025-12-04T15:22:22.8308885Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_repeat_cuda_complex64 PASSED [1.3953s] [ 35%] 2025-12-04T15:22:22.8309022Z 
test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_as_cuda_complex64 PASSED [1.3628s] [ 35%] 2025-12-04T15:22:22.8309158Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_prod_cuda_float32 PASSED [0.0363s] [ 35%] 2025-12-04T15:22:22.8309278Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sigmoid_cuda_float32 PASSED [1.3880s] [ 35%] 2025-12-04T15:22:22.8309434Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_blackman_cuda_float32 SKIPPED [0.0003s] (Skipped!) [ 35%] 2025-12-04T15:22:22.8309596Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_exponential_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 35%] 2025-12-04T15:22:22.8309750Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_gaussian_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 35%] 2025-12-04T15:22:22.8309902Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_kaiser_cuda_float32 SKIPPED [0.0001s] (Skipped!) 
[ 35%] 2025-12-04T15:22:22.8310033Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_airy_ai_cuda_float32 PASSED [1.3609s] [ 35%] 2025-12-04T15:22:22.8310221Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_chebyshev_polynomial_t_cuda_float32 PASSED [1.3622s] [ 35%] 2025-12-04T15:22:22.8310348Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_erfcx_cuda_float32 PASSED [1.3728s] [ 35%] 2025-12-04T15:22:22.8310473Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_ndtr_cuda_float32 PASSED [0.0047s] [ 35%] 2025-12-04T15:22:22.8310645Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_spherical_bessel_j0_cuda_float32 PASSED [1.3529s] [ 35%] 2025-12-04T15:22:22.8310785Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_list_args_cuda_float32 PASSED [0.0048s] [ 35%] 2025-12-04T15:22:22.8310900Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sqrt_cuda_float32 PASSED [1.3526s] [ 35%] 2025-12-04T15:22:22.8311017Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_square_cuda_float32 PASSED [0.0065s] [ 35%] 2025-12-04T15:22:22.8311152Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_multiple_cuda_complex64 PASSED [0.0154s] [ 35%] 2025-12-04T15:22:22.8311270Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stack_cuda_complex64 PASSED [1.3647s] [ 35%] 2025-12-04T15:22:22.8311389Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_cuda_float32 PASSED [0.0051s] [ 35%] 2025-12-04T15:22:22.8311522Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_unbiased_cuda_float32 PASSED [1.3497s] [ 35%] 2025-12-04T15:22:22.8311638Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sub_cuda_complex64 PASSED [1.4165s] [ 35%] 2025-12-04T15:22:22.8311754Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sum_cuda_complex64 PASSED [1.3854s] [ 35%] 2025-12-04T15:22:22.8311877Z 
test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sum_to_size_cuda_float32 PASSED [0.0122s] [ 35%] 2025-12-04T15:22:22.8311992Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_svd_cuda_complex64 PASSED [1.6740s] [ 35%] 2025-12-04T15:22:22.8312105Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_t_cuda_complex64 PASSED [1.3787s] [ 35%] 2025-12-04T15:22:22.8312219Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tan_cuda_complex64 PASSED [1.3814s] [ 36%] 2025-12-04T15:22:22.8312332Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tanh_cuda_float32 PASSED [0.0051s] [ 36%] 2025-12-04T15:22:22.8312461Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tensor_split_cuda_complex64 PASSED [0.0147s] [ 36%] 2025-12-04T15:22:22.8312587Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tensor_split_cuda_float32 PASSED [1.3628s] [ 36%] 2025-12-04T15:22:22.8312711Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_sparse_cuda_complex64 PASSED [1.4127s] [ 36%] 2025-12-04T15:22:22.8312836Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_topk_cuda_float32 PASSED [1.3711s] [ 36%] 2025-12-04T15:22:22.8313093Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_torch_ops_aten__efficient_attention_forward_cuda_float32 SKIPPED [0.0011s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 36%] 2025-12-04T15:22:22.8313213Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapz_cuda_complex64 PASSED [1.3665s] [ 36%] 2025-12-04T15:22:22.8313328Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trunc_cuda_float32 PASSED [0.0059s] [ 36%] 2025-12-04T15:22:22.8313453Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unbind_copy_cuda_float32 PASSED [1.3663s] [ 36%] 2025-12-04T15:22:22.8313574Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unfold_cuda_complex64 PASSED [1.3895s] [ 36%] 2025-12-04T15:22:22.8313709Z 
test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unique_consecutive_cuda_float32 PASSED [0.1297s] [ 36%] 2025-12-04T15:22:22.8313837Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_copy_cuda_float32 PASSED [1.3578s] [ 36%] 2025-12-04T15:22:22.8313958Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_cuda_float32 PASSED [0.0115s] [ 36%] 2025-12-04T15:22:22.8314073Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_cuda_complex64 PASSED [0.0206s] [ 36%] 2025-12-04T15:22:22.8314194Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_cuda_complex64 PASSED [1.3848s] [ 36%] 2025-12-04T15:22:22.8314342Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_complex_cuda_float32 PASSED [0.0039s] [ 36%] 2025-12-04T15:22:22.8314458Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_where_cuda_float32 PASSED [1.3572s] [ 36%] 2025-12-04T15:22:22.8314585Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_cuda_complex64 XFAIL [0.0045s] [ 36%] 2025-12-04T15:22:22.8314700Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_cuda_float32 XFAIL [0.0027s] [ 36%] 2025-12-04T15:22:22.8314826Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_like_cuda_complex64 PASSED [0.0059s] [ 36%] 2025-12-04T15:22:22.8314940Z test_ops.py::TestCompositeComplianceCUDA::test_backward_H_cuda_float32 PASSED [0.0080s] [ 36%] 2025-12-04T15:22:22.8315055Z test_ops.py::TestCompositeComplianceCUDA::test_backward_T_cuda_float32 PASSED [0.0056s] [ 36%] 2025-12-04T15:22:22.8315197Z test_ops.py::TestCompositeComplianceCUDA::test_backward__native_batch_norm_legit_cuda_float32 PASSED [0.4173s] [ 36%] 2025-12-04T15:22:22.8315340Z test_ops.py::TestCompositeComplianceCUDA::test_backward__segment_reduce_lengths_cuda_float32 PASSED [0.4361s] [ 36%] 2025-12-04T15:22:22.8315497Z 
test_ops.py::TestCompositeComplianceCUDA::test_backward__unsafe_masked_index_put_accumulate_cuda_float32 PASSED [0.1366s] [ 36%] 2025-12-04T15:22:22.8315641Z test_ops.py::TestCompositeComplianceCUDA::test_backward__upsample_bilinear2d_aa_cuda_float32 PASSED [0.0218s] [ 36%] 2025-12-04T15:22:22.8315758Z test_ops.py::TestCompositeComplianceCUDA::test_backward_abs_cuda_float32 PASSED [0.0041s] [ 36%] 2025-12-04T15:22:22.8315877Z test_ops.py::TestCompositeComplianceCUDA::test_backward_addmm_cuda_float32 PASSED [0.0845s] [ 36%] 2025-12-04T15:22:22.8316004Z test_ops.py::TestCompositeComplianceCUDA::test_backward_broadcast_to_cuda_float32 PASSED [0.0139s] [ 36%] 2025-12-04T15:22:22.8316125Z test_ops.py::TestCompositeComplianceCUDA::test_backward_clamp_max_cuda_float32 PASSED [0.0544s] [ 36%] 2025-12-04T15:22:22.8316243Z test_ops.py::TestCompositeComplianceCUDA::test_backward_clone_cuda_float32 PASSED [0.0046s] [ 37%] 2025-12-04T15:22:22.8316368Z test_ops.py::TestCompositeComplianceCUDA::test_backward_contiguous_cuda_float32 PASSED [1.3377s] [ 37%] 2025-12-04T15:22:22.8316484Z test_ops.py::TestCompositeComplianceCUDA::test_backward_cosh_cuda_float32 PASSED [0.0102s] [ 37%] 2025-12-04T15:22:22.8316609Z test_ops.py::TestCompositeComplianceCUDA::test_backward_cov_cuda_float32 PASSED [0.8728s] [ 37%] 2025-12-04T15:22:22.8316749Z test_ops.py::TestCompositeComplianceCUDA::test_backward_cumulative_trapezoid_cuda_float32 PASSED [0.0728s] [ 37%] 2025-12-04T15:22:22.8316864Z test_ops.py::TestCompositeComplianceCUDA::test_backward_diag_cuda_float32 PASSED [1.3468s] [ 37%] 2025-12-04T15:22:22.8316987Z test_ops.py::TestCompositeComplianceCUDA::test_backward_diag_embed_cuda_float32 PASSED [0.0291s] [ 37%] 2025-12-04T15:22:22.8317118Z test_ops.py::TestCompositeComplianceCUDA::test_backward_diagonal_scatter_cuda_float32 PASSED [0.0693s] [ 37%] 2025-12-04T15:22:22.8317236Z test_ops.py::TestCompositeComplianceCUDA::test_backward_diff_cuda_float32 PASSED [0.7483s] [ 37%] 
2025-12-04T15:22:22.8317371Z test_ops.py::TestCompositeComplianceCUDA::test_backward_div_no_rounding_mode_cuda_float32 PASSED [0.0568s] [ 37%] 2025-12-04T15:22:22.8317493Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifft2_cuda_float32 PASSED [0.0177s] [ 37%] 2025-12-04T15:22:22.8317611Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fliplr_cuda_float32 PASSED [0.0052s] [ 37%] 2025-12-04T15:22:22.8317734Z test_ops.py::TestCompositeComplianceCUDA::test_backward_index_copy_cuda_float32 PASSED [0.0313s] [ 37%] 2025-12-04T15:22:22.8317856Z test_ops.py::TestCompositeComplianceCUDA::test_backward_index_put_cuda_float32 PASSED [0.0431s] [ 37%] 2025-12-04T15:22:22.8317983Z test_ops.py::TestCompositeComplianceCUDA::test_backward_index_select_cuda_float32 PASSED [0.0129s] [ 37%] 2025-12-04T15:22:22.8318125Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cond_cuda_float32 PASSED [0.0240s] [ 37%] 2025-12-04T15:22:22.8318260Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cross_cuda_float32 PASSED [0.0172s] [ 37%] 2025-12-04T15:22:22.8318400Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_eigvalsh_cuda_float32 PASSED [0.0327s] [ 37%] 2025-12-04T15:22:22.8318527Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_inv_ex_cuda_float32 PASSED [0.0307s] [ 37%] 2025-12-04T15:22:22.8318653Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lstsq_cuda_float32 PASSED [0.6281s] [ 37%] 2025-12-04T15:22:22.8318798Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lstsq_grad_oriented_cuda_float32 PASSED [0.6552s] [ 37%] 2025-12-04T15:22:22.8318930Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_multi_dot_cuda_float32 PASSED [0.0356s] [ 37%] 2025-12-04T15:22:22.8319054Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_norm_cuda_float32 PASSED [0.2930s] [ 37%] 2025-12-04T15:22:22.8319180Z 
test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_svd_cuda_float32 PASSED [2.1487s] [ 37%] 2025-12-04T15:22:22.8319314Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_vector_norm_cuda_float32 PASSED [0.5016s] [ 37%] 2025-12-04T15:22:22.8319432Z test_ops.py::TestCompositeComplianceCUDA::test_backward_log10_cuda_float32 PASSED [0.8239s] [ 37%] 2025-12-04T15:22:22.8319554Z test_ops.py::TestCompositeComplianceCUDA::test_backward_logaddexp_cuda_float32 PASSED [0.0721s] [ 37%] 2025-12-04T15:22:22.8319682Z test_ops.py::TestCompositeComplianceCUDA::test_backward_logcumsumexp_cuda_float32 PASSED [0.0385s] [ 37%] 2025-12-04T15:22:22.8319803Z test_ops.py::TestCompositeComplianceCUDA::test_backward_lu_unpack_cuda_float32 PASSED [0.3245s] [ 37%] 2025-12-04T15:22:22.8319918Z test_ops.py::TestCompositeComplianceCUDA::test_backward_mH_cuda_float32 PASSED [0.0106s] [ 37%] 2025-12-04T15:22:22.8320048Z test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_softmin_cuda_float32 PASSED [0.1413s] [ 37%] 2025-12-04T15:22:22.8320208Z test_ops.py::TestCompositeComplianceCUDA::test_backward_median_cuda_float32 PASSED [0.8366s] [ 37%] 2025-12-04T15:22:22.8320331Z test_ops.py::TestCompositeComplianceCUDA::test_backward_min_binary_cuda_float32 PASSED [0.0774s] [ 38%] 2025-12-04T15:22:22.8320444Z test_ops.py::TestCompositeComplianceCUDA::test_backward_mm_cuda_float32 PASSED [0.0183s] [ 38%] 2025-12-04T15:22:22.8320598Z test_ops.py::TestCompositeComplianceCUDA::test_backward_mvlgamma_mvlgamma_p_3_cuda_float32 PASSED [0.0289s] [ 38%] 2025-12-04T15:22:22.8320721Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nan_to_num_cuda_float32 PASSED [0.7926s] [ 38%] 2025-12-04T15:22:22.8320840Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nanmean_cuda_float32 PASSED [0.1128s] [ 38%] 2025-12-04T15:22:22.8320960Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nanmedian_cuda_float32 PASSED [0.0344s] [ 38%] 2025-12-04T15:22:22.8321119Z 
test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [0.0227s] [ 38%] 2025-12-04T15:22:22.8321261Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool2d_cuda_float32 PASSED [0.8063s] [ 38%] 2025-12-04T15:22:22.8321404Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool3d_cuda_float32 PASSED [0.0290s] [ 38%] 2025-12-04T15:22:22.8321545Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_batch_norm_cuda_float32 PASSED [0.4776s] [ 38%] 2025-12-04T15:22:22.8321867Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv2d_cuda_float32 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x780dd8c00a00 size: 1024 2025-12-04T15:22:22.8322051Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x780dd8c00a00 size: 1024 2025-12-04T15:22:22.8322270Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x780dd8c00e00 size: 1024 2025-12-04T15:22:22.8322451Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x780dd8c00e00 size: 1024 2025-12-04T15:22:22.8322663Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x780dd8c01000 size: 1024 2025-12-04T15:22:22.8322851Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x780dd8c01000 size: 1024 2025-12-04T15:22:22.8322892Z PASSED [0.6729s] [ 38%] 2025-12-04T15:22:22.8323211Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv3d_cuda_float32 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x780dd8003000 size: 11008 
2025-12-04T15:22:22.8323394Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x780dd8003000 size: 11008 2025-12-04T15:22:22.8323588Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x780dd8005c00 size: 11008 2025-12-04T15:22:22.8323769Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x780dd8005c00 size: 11008 2025-12-04T15:22:22.8323970Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x780dd8005c00 size: 11008 2025-12-04T15:22:22.8324159Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x780dd8005c00 size: 11008 2025-12-04T15:22:22.8324355Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x780dd8004c00 size: 12544 2025-12-04T15:22:22.8324537Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x780dd8004c00 size: 12544 2025-12-04T15:22:22.8324732Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x780dd8007e00 size: 12544 2025-12-04T15:22:22.8324926Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x780dd8007e00 size: 12544 2025-12-04T15:22:22.8325129Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x780dd8007e00 size: 12544 2025-12-04T15:22:22.8325320Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x780dd8007e00 size: 12544 2025-12-04T15:22:22.8325360Z PASSED [0.3107s] [ 38%] 2025-12-04T15:22:22.8325518Z 
test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_cosine_similarity_cuda_float32 PASSED [0.1316s] [ 38%] 2025-12-04T15:22:22.8325662Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout2d_cuda_float32 PASSED [0.0374s] [ 38%] 2025-12-04T15:22:22.8325801Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout_cuda_float32 PASSED [0.0481s] [ 38%] 2025-12-04T15:22:22.8325936Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_elu_cuda_float32 PASSED [0.0075s] [ 38%] 2025-12-04T15:22:22.8326108Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_feature_alpha_dropout_with_train_cuda_float32 PASSED [0.8005s] [ 38%] 2025-12-04T15:22:22.8326266Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_fractional_max_pool2d_cuda_float32 PASSED [0.1081s] [ 38%] 2025-12-04T15:22:22.8326428Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_group_norm_cuda_float32 PASSED [0.3145s] [ 38%] 2025-12-04T15:22:22.8326571Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardshrink_cuda_float32 PASSED [0.0246s] [ 38%] 2025-12-04T15:22:22.8326720Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardswish_cuda_float32 PASSED [0.0100s] [ 38%] 2025-12-04T15:22:22.8326878Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [0.0822s] [ 38%] 2025-12-04T15:22:22.8327037Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_bilinear_cuda_float32 PASSED [0.0477s] [ 38%] 2025-12-04T15:22:22.8327195Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_linear_cuda_float32 PASSED [0.0415s] [ 38%] 2025-12-04T15:22:22.8327350Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_nearest_cuda_float32 PASSED [0.0576s] [ 38%] 2025-12-04T15:22:22.8327494Z 
test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_layer_norm_cuda_float32 PASSED [0.0660s] [ 38%] 2025-12-04T15:22:22.8329310Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_logsigmoid_cuda_float32 PASSED [0.0080s] [ 38%] 2025-12-04T15:22:22.8329479Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_margin_ranking_loss_cuda_float32 PASSED [0.2888s] [ 38%] 2025-12-04T15:22:22.8329624Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_pool1d_cuda_float32 PASSED [1.9992s] [ 38%] 2025-12-04T15:22:22.8329769Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool2d_cuda_float32 PASSED [0.9495s] [ 38%] 2025-12-04T15:22:22.8329913Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool3d_cuda_float32 PASSED [0.3146s] [ 38%] 2025-12-04T15:22:22.8330066Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multi_margin_loss_cuda_float32 PASSED [0.0338s] [ 38%] 2025-12-04T15:22:22.8330252Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_circular_cuda_float32 PASSED [0.0359s] [ 39%] 2025-12-04T15:22:22.8330396Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_reflect_cuda_float32 PASSED [0.0165s] [ 39%] 2025-12-04T15:22:22.8330533Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pdist_cuda_float32 PASSED [0.0208s] [ 39%] 2025-12-04T15:22:22.8330708Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pixel_shuffle_cuda_float32 PASSED [0.0073s] [ 39%] 2025-12-04T15:22:22.8330860Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_poisson_nll_loss_cuda_float32 PASSED [0.5182s] [ 39%] 2025-12-04T15:22:22.8331002Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_rms_norm_cuda_float32 PASSED [0.0226s] [ 39%] 2025-12-04T15:22:22.8331151Z 
test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_smooth_l1_loss_cuda_float32 PASSED [0.0317s] [ 39%] 2025-12-04T15:22:22.8331304Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_soft_margin_loss_cuda_float32 PASSED [0.0177s] [ 39%] 2025-12-04T15:22:22.8331445Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_threshold_cuda_float32 PASSED [0.0071s] [ 39%] 2025-12-04T15:22:22.8331599Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_triplet_margin_loss_cuda_float32 PASSED [0.1705s] [ 39%] 2025-12-04T15:22:22.8331773Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [0.1715s] [ 39%] 2025-12-04T15:22:22.8331924Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_upsample_nearest_cuda_float32 PASSED [0.0171s] [ 39%] 2025-12-04T15:22:22.8332046Z test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_fro_cuda_float32 PASSED [0.0072s] [ 39%] 2025-12-04T15:22:22.8332174Z test_ops.py::TestCompositeComplianceCUDA::test_backward_permute_copy_cuda_float32 PASSED [0.0068s] [ 39%] 2025-12-04T15:22:22.8332323Z test_ops.py::TestCompositeComplianceCUDA::test_backward_pinverse_cuda_float32 PASSED [0.0653s] [ 39%] 2025-12-04T15:22:22.8332456Z test_ops.py::TestCompositeComplianceCUDA::test_backward_positive_cuda_float32 PASSED [1.3839s] [ 39%] 2025-12-04T15:22:22.8332575Z test_ops.py::TestCompositeComplianceCUDA::test_backward_repeat_cuda_float32 PASSED [0.0353s] [ 39%] 2025-12-04T15:22:22.8332696Z test_ops.py::TestCompositeComplianceCUDA::test_backward_round_cuda_float32 PASSED [0.0037s] [ 39%] 2025-12-04T15:22:22.8332822Z test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_add_cuda_float32 PASSED [0.0474s] [ 39%] 2025-12-04T15:22:22.8332940Z test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_cuda_float32 PASSED [0.0606s] [ 39%] 2025-12-04T15:22:22.8333076Z 
test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_mean_cuda_float32 PASSED [0.2650s] [ 39%] 2025-12-04T15:22:22.8333194Z test_ops.py::TestCompositeComplianceCUDA::test_backward_sign_cuda_float32 PASSED [0.0033s] [ 39%] 2025-12-04T15:22:22.8333312Z test_ops.py::TestCompositeComplianceCUDA::test_backward_sinc_cuda_float32 PASSED [0.0093s] [ 39%] 2025-12-04T15:22:22.8333430Z test_ops.py::TestCompositeComplianceCUDA::test_backward_slice_cuda_float32 PASSED [1.3782s] [ 39%] 2025-12-04T15:22:22.8333586Z test_ops.py::TestCompositeComplianceCUDA::test_backward_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0003s] (Skipped!) [ 39%] 2025-12-04T15:22:22.8333715Z test_ops.py::TestCompositeComplianceCUDA::test_backward_special_ndtri_cuda_float32 PASSED [1.3741s] [ 39%] 2025-12-04T15:22:22.8333845Z test_ops.py::TestCompositeComplianceCUDA::test_backward_special_xlog1py_cuda_float32 PASSED [0.0545s] [ 39%] 2025-12-04T15:22:22.8333970Z test_ops.py::TestCompositeComplianceCUDA::test_backward_tensor_split_cuda_float32 XFAIL [0.0098s] [ 39%] 2025-12-04T15:22:22.8334085Z test_ops.py::TestCompositeComplianceCUDA::test_backward_tile_cuda_float32 PASSED [1.4013s] [ 39%] 2025-12-04T15:22:22.8334203Z test_ops.py::TestCompositeComplianceCUDA::test_backward_to_cuda_float32 PASSED [0.0403s] [ 39%] 2025-12-04T15:22:22.8334354Z test_ops.py::TestCompositeComplianceCUDA::test_backward_to_sparse_cuda_float32 SKIPPED [0.0002s] (Allowed exception) [ 39%] 2025-12-04T15:22:22.8334630Z test_ops.py::TestCompositeComplianceCUDA::test_backward_torch_ops_aten__efficient_attention_forward_cuda_float32 SKIPPED [0.0006s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 40%] 2025-12-04T15:22:22.8334789Z test_ops.py::TestCompositeComplianceCUDA::test_backward_torch_ops_aten__safe_softmax_default_cuda_float32 PASSED [0.0143s] [ 40%] 2025-12-04T15:22:22.8334907Z test_ops.py::TestCompositeComplianceCUDA::test_backward_trace_cuda_float32 PASSED [1.3971s] [ 40%] 
2025-12-04T15:22:22.8335038Z test_ops.py::TestCompositeComplianceCUDA::test_backward_triangular_solve_cuda_float32 PASSED [0.1971s] [ 40%] 2025-12-04T15:22:22.8335154Z test_ops.py::TestCompositeComplianceCUDA::test_backward_tril_cuda_float32 PASSED [0.0119s] [ 40%] 2025-12-04T15:22:22.8335273Z test_ops.py::TestCompositeComplianceCUDA::test_backward_unfold_cuda_float32 PASSED [0.0300s] [ 40%] 2025-12-04T15:22:22.8335401Z test_ops.py::TestCompositeComplianceCUDA::test_backward_unsafe_chunk_cuda_float32 PASSED [0.0341s] [ 40%] 2025-12-04T15:22:22.8335526Z test_ops.py::TestCompositeComplianceCUDA::test_backward_unsqueeze_cuda_float32 PASSED [0.0122s] [ 40%] 2025-12-04T15:22:22.8335653Z test_ops.py::TestCompositeComplianceCUDA::test_backward_var_unbiased_cuda_float32 PASSED [0.0058s] [ 40%] 2025-12-04T15:22:22.8335768Z test_ops.py::TestCompositeComplianceCUDA::test_backward_vdot_cuda_float32 PASSED [0.0068s] [ 40%] 2025-12-04T15:22:22.8335884Z test_ops.py::TestCompositeComplianceCUDA::test_backward_zero__cuda_float32 PASSED [0.0062s] [ 40%] 2025-12-04T15:22:22.8335999Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_H_cuda_float32 PASSED [0.0029s] [ 40%] 2025-12-04T15:22:22.8336123Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_T_cuda_float32 PASSED [1.3901s] [ 40%] 2025-12-04T15:22:22.8336258Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input___getitem___cuda_float32 PASSED [0.0119s] [ 40%] 2025-12-04T15:22:22.8336387Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rmul___cuda_float32 PASSED [0.0068s] [ 40%] 2025-12-04T15:22:22.8336505Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addbmm_cuda_float32 PASSED [1.3633s] [ 40%] 2025-12-04T15:22:22.8336622Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addmm_cuda_float32 PASSED [0.0084s] [ 40%] 2025-12-04T15:22:22.8336741Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_allclose_cuda_float32 PASSED [0.0101s] [ 40%] 2025-12-04T15:22:22.8336857Z 
test_ops.py::TestCompositeComplianceCUDA::test_cow_input_amax_cuda_float32 PASSED [0.0093s] [ 40%] 2025-12-04T15:22:22.8336976Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argsort_cuda_float32 PASSED [0.0097s] [ 40%] 2025-12-04T15:22:22.8337096Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argwhere_cuda_float32 PASSED [1.3747s] [ 40%] 2025-12-04T15:22:22.8337219Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_cuda_float32 PASSED [0.0059s] [ 40%] 2025-12-04T15:22:22.8337362Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_partial_views_cuda_float32 PASSED [1.3663s] [ 40%] 2025-12-04T15:22:22.8337480Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_asinh_cuda_float32 PASSED [0.0041s] [ 40%] 2025-12-04T15:22:22.8337603Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_block_diag_cuda_float32 PASSED [1.3646s] [ 40%] 2025-12-04T15:22:22.8337738Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_broadcast_tensors_cuda_float32 PASSED [0.0047s] [ 40%] 2025-12-04T15:22:22.8337854Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_byte_cuda_float32 PASSED [1.3488s] [ 40%] 2025-12-04T15:22:22.8337972Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cfloat_cuda_float32 PASSED [0.0058s] [ 40%] 2025-12-04T15:22:22.8338090Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_chalf_cuda_float32 PASSED [1.3706s] [ 40%] 2025-12-04T15:22:22.8338211Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clamp_max_cuda_float32 PASSED [0.0088s] [ 40%] 2025-12-04T15:22:22.8338338Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_column_stack_cuda_float32 PASSED [1.3608s] [ 40%] 2025-12-04T15:22:22.8338469Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_complex_cuda_float32 PASSED [0.0084s] [ 40%] 2025-12-04T15:22:22.8338688Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cross_cuda_float32 PASSED [1.3805s] [ 41%] 2025-12-04T15:22:22.8338805Z 
test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cummax_cuda_float32 PASSED [0.0133s] [ 41%] 2025-12-04T15:22:22.8338922Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumprod_cuda_float32 PASSED [1.3812s] [ 41%] 2025-12-04T15:22:22.8339039Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_deg2rad_cuda_float32 PASSED [0.0041s] [ 41%] 2025-12-04T15:22:22.8339157Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_empty_cuda_float32 PASSED [0.0031s] [ 41%] 2025-12-04T15:22:22.8339282Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_exponential_cuda_float32 PASSED [1.3658s] [ 41%] 2025-12-04T15:22:22.8339401Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft_cuda_float32 PASSED [0.0152s] [ 41%] 2025-12-04T15:22:22.8339523Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_hfft2_cuda_float32 PASSED [1.3986s] [ 41%] 2025-12-04T15:22:22.8339643Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifftn_cuda_float32 PASSED [1.3836s] [ 41%] 2025-12-04T15:22:22.8339764Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_irfft_cuda_float32 PASSED [0.0121s] [ 41%] 2025-12-04T15:22:22.8339880Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fill_cuda_float32 PASSED [1.3677s] [ 41%] 2025-12-04T15:22:22.8340012Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flatten_cuda_float32 PASSED [0.0060s] [ 41%] 2025-12-04T15:22:22.8340177Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_float_cuda_float32 PASSED [1.3716s] [ 41%] 2025-12-04T15:22:22.8340308Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_floor_cuda_float32 PASSED [0.0044s] [ 41%] 2025-12-04T15:22:22.8340426Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_full_cuda_float32 PASSED [1.3548s] [ 41%] 2025-12-04T15:22:22.8340543Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_half_cuda_float32 PASSED [0.0058s] [ 41%] 2025-12-04T15:22:22.8340667Z 
test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hash_tensor_cuda_float32 PASSED [0.0061s] [ 41%] 2025-12-04T15:22:22.8340784Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_histc_cuda_float32 PASSED [0.0216s] [ 41%] 2025-12-04T15:22:22.8340899Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hypot_cuda_float32 PASSED [0.0066s] [ 41%] 2025-12-04T15:22:22.8341015Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_i0_cuda_float32 PASSED [1.3756s] [ 41%] 2025-12-04T15:22:22.8341136Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_add_cuda_float32 PASSED [0.0095s] [ 41%] 2025-12-04T15:22:22.8341259Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_put_cuda_float32 PASSED [1.3652s] [ 41%] 2025-12-04T15:22:22.8341391Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_mean_cuda_float32 PASSED [0.0098s] [ 41%] 2025-12-04T15:22:22.8341525Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_prod_cuda_float32 PASSED [1.3582s] [ 41%] 2025-12-04T15:22:22.8341651Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_select_cuda_float32 PASSED [0.0055s] [ 41%] 2025-12-04T15:22:22.8341766Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_int_cuda_float32 PASSED [1.3644s] [ 41%] 2025-12-04T15:22:22.8341885Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isfinite_cuda_float32 PASSED [0.0043s] [ 41%] 2025-12-04T15:22:22.8342017Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_unary_cuda_float32 PASSED [1.3817s] [ 41%] 2025-12-04T15:22:22.8342134Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_kron_cuda_float32 PASSED [0.0043s] [ 41%] 2025-12-04T15:22:22.8342248Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_le_cuda_float32 PASSED [0.0050s] [ 41%] 2025-12-04T15:22:22.8342364Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lerp_cuda_float32 PASSED [1.4236s] [ 41%] 2025-12-04T15:22:22.8342507Z 
test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cholesky_cuda_float32 PASSED [0.0122s] [ 42%] 2025-12-04T15:22:22.8342634Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cross_cuda_float32 PASSED [0.0039s] [ 42%] 2025-12-04T15:22:22.8342771Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_factor_ex_cuda_float32 PASSED [0.0220s] [ 42%] 2025-12-04T15:22:22.8342896Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_pinv_cuda_float32 PASSED [0.0176s] [ 42%] 2025-12-04T15:22:22.8343032Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_tensorsolve_cuda_float32 PASSED [1.3887s] [ 42%] 2025-12-04T15:22:22.8343173Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log_softmax_with_dtype_cuda_float32 PASSED [0.0067s] [ 42%] 2025-12-04T15:22:22.8343295Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logaddexp_cuda_float32 PASSED [0.0069s] [ 42%] 2025-12-04T15:22:22.8343425Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logcumsumexp_cuda_float32 PASSED [1.4062s] [ 42%] 2025-12-04T15:22:22.8343542Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logdet_cuda_float32 PASSED [0.0124s] [ 42%] 2025-12-04T15:22:22.8343667Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_not_cuda_float32 PASSED [1.3958s] [ 42%] 2025-12-04T15:22:22.8343788Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lu_unpack_cuda_float32 PASSED [0.0268s] [ 42%] 2025-12-04T15:22:22.8343938Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_cumsum_cuda_float32 PASSED [0.0105s] [ 42%] 2025-12-04T15:22:22.8344063Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_fill_cuda_float32 PASSED [1.3929s] [ 42%] 2025-12-04T15:22:22.8344207Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_logaddexp_cuda_float32 PASSED [0.0119s] [ 42%] 2025-12-04T15:22:22.8344335Z 
test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_median_cuda_float32 PASSED [0.0101s] [ 42%] 2025-12-04T15:22:22.8344463Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_softmax_cuda_float32 PASSED [0.0119s] [ 42%] 2025-12-04T15:22:22.8344585Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_sum_cuda_float32 PASSED [0.0341s] [ 42%] 2025-12-04T15:22:22.8344706Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_binary_cuda_float32 PASSED [0.0064s] [ 42%] 2025-12-04T15:22:22.8344846Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_reduction_with_dim_cuda_float32 PASSED [1.3890s] [ 42%] 2025-12-04T15:22:22.8344964Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mean_cuda_float32 PASSED [0.0109s] [ 42%] 2025-12-04T15:22:22.8345110Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_meshgrid_variadic_tensors_cuda_float32 PASSED [0.0082s] [ 42%] 2025-12-04T15:22:22.8345225Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mul_cuda_float32 PASSED [0.0064s] [ 42%] 2025-12-04T15:22:22.8345344Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_narrow_cuda_float32 PASSED [1.3809s] [ 42%] 2025-12-04T15:22:22.8345487Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_native_dropout_backward_cuda_float32 PASSED [0.0081s] [ 42%] 2025-12-04T15:22:22.8345619Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_native_layer_norm_cuda_float32 PASSED [0.0117s] [ 42%] 2025-12-04T15:22:22.8345761Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_avg_pool1d_cuda_float32 PASSED [1.3979s] [ 42%] 2025-12-04T15:22:22.8345933Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_binary_cross_entropy_with_logits_cuda_float32 PASSED [0.0195s] [ 42%] 2025-12-04T15:22:22.8346086Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv_transpose2d_cuda_float32 PASSED [0.0172s] [ 42%] 2025-12-04T15:22:22.8346228Z 
test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout2d_cuda_float32 PASSED [1.3933s] [ 42%] 2025-12-04T15:22:22.8346380Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout3d_cuda_float32 PASSED [0.0111s] [ 42%] 2025-12-04T15:22:22.8346513Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_glu_cuda_float32 PASSED [0.0183s] [ 42%] 2025-12-04T15:22:22.8346657Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_grid_sample_cuda_float32 PASSED [1.4148s] [ 42%] 2025-12-04T15:22:22.8346798Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_huber_loss_cuda_float32 PASSED [0.0086s] [ 43%] 2025-12-04T15:22:22.8346957Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_linear_cuda_float32 PASSED [0.0090s] [ 43%] 2025-12-04T15:22:22.8347116Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_trilinear_cuda_float32 PASSED [0.0171s] [ 43%] 2025-12-04T15:22:22.8347259Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_layer_norm_cuda_float32 PASSED [1.3676s] [ 43%] 2025-12-04T15:22:22.8347415Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_local_response_norm_cuda_float32 PASSED [0.0101s] [ 43%] 2025-12-04T15:22:22.8347560Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool1d_cuda_float32 PASSED [0.0763s] [ 43%] 2025-12-04T15:22:22.8347703Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool2d_cuda_float32 PASSED [0.1435s] [ 43%] 2025-12-04T15:22:22.8347835Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_mish_cuda_float32 PASSED [0.0163s] [ 43%] 2025-12-04T15:22:22.8348009Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multi_margin_loss_cuda_float32 PASSED [1.4040s] [ 43%] 2025-12-04T15:22:22.8348166Z 
test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_normalize_cuda_float32 PASSED [0.0080s] [ 43%] 2025-12-04T15:22:22.8348308Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_circular_cuda_float32 PASSED [1.4127s] [ 43%] 2025-12-04T15:22:22.8348461Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pairwise_distance_cuda_float32 PASSED [0.0082s] [ 43%] 2025-12-04T15:22:22.8348614Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_poisson_nll_loss_cuda_float32 PASSED [0.0450s] [ 43%] 2025-12-04T15:22:22.8348751Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_prelu_cuda_float32 PASSED [1.4079s] [ 43%] 2025-12-04T15:22:22.8348885Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_relu_cuda_float32 PASSED [0.0055s] [ 43%] 2025-12-04T15:22:22.8349135Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_scaled_dot_product_attention_cuda_float32 SKIPPED [0.0002s] (test_cow_input does not work with efficient attention on ROCM) [ 43%] 2025-12-04T15:22:22.8349284Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_smooth_l1_loss_cuda_float32 PASSED [0.0065s] [ 43%] 2025-12-04T15:22:22.8349422Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softmin_cuda_float32 PASSED [1.3855s] [ 43%] 2025-12-04T15:22:22.8349562Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softsign_cuda_float32 PASSED [0.0053s] [ 43%] 2025-12-04T15:22:22.8349790Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nonzero_static_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 43%] 2025-12-04T15:22:22.8349910Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_outer_cuda_float32 PASSED [1.3782s] [ 43%] 2025-12-04T15:22:22.8350038Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_randint_like_cuda_float32 PASSED [0.0089s] [ 43%] 2025-12-04T15:22:22.8350212Z 
test_ops.py::TestCompositeComplianceCUDA::test_cow_input_renorm_cuda_float32 PASSED [0.0048s] [ 43%] 2025-12-04T15:22:22.8350345Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_repeat_interleave_cuda_float32 PASSED [1.3924s] [ 43%] 2025-12-04T15:22:22.8350479Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_roll_cuda_float32 PASSED [0.0097s] [ 43%] 2025-12-04T15:22:22.8350617Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_round_decimals_neg_3_cuda_float32 PASSED [0.0035s] [ 43%] 2025-12-04T15:22:22.8350741Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_add_cuda_float32 PASSED [1.3777s] [ 43%] 2025-12-04T15:22:22.8350859Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_cuda_float32 PASSED [0.0102s] [ 43%] 2025-12-04T15:22:22.8350996Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_sum_cuda_float32 PASSED [0.0152s] [ 43%] 2025-12-04T15:22:22.8351126Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_select_scatter_cuda_float32 PASSED [1.4137s] [ 43%] 2025-12-04T15:22:22.8351243Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sinc_cuda_float32 PASSED [0.0052s] [ 43%] 2025-12-04T15:22:22.8351401Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sparse_mm_reduce_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 44%] 2025-12-04T15:22:22.8351555Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_legendre_polynomial_p_cuda_float32 PASSED [0.0069s] [ 44%] 2025-12-04T15:22:22.8351718Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_v_cuda_float32 PASSED [0.0059s] [ 44%] 2025-12-04T15:22:22.8351836Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_cuda_float32 PASSED [1.3820s] [ 44%] 2025-12-04T15:22:22.8351982Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_mean_unbiased_cuda_float32 PASSED [0.0050s] [ 44%] 2025-12-04T15:22:22.8352114Z 
test_ops.py::TestCompositeComplianceCUDA::test_cow_input_svd_cuda_float32 PASSED [0.1338s] [ 44%] 2025-12-04T15:22:22.8352254Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_svd_lowrank_cuda_float32 PASSED [0.0540s] [ 44%] 2025-12-04T15:22:22.8352379Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tensor_split_cuda_float32 PASSED [1.3911s] [ 44%] 2025-12-04T15:22:22.8352535Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_torch_ops_aten__safe_softmax_default_cuda_float32 PASSED [0.0076s] [ 44%] 2025-12-04T15:22:22.8352659Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unflatten_cuda_float32 PASSED [0.0059s] [ 44%] 2025-12-04T15:22:22.8352775Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vstack_cuda_float32 PASSED [1.3981s] [ 44%] 2025-12-04T15:22:22.8352892Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zero__cuda_float32 PASSED [0.0051s] [ 44%] 2025-12-04T15:22:22.8353009Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zeros_cuda_float32 PASSED [1.3671s] [ 44%] 2025-12-04T15:22:22.8353136Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___getitem___cuda_float32 PASSED [0.0988s] [ 44%] 2025-12-04T15:22:22.8353258Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rdiv___cuda_float32 PASSED [0.1268s] [ 44%] 2025-12-04T15:22:22.8353384Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmatmul___cuda_float32 PASSED [0.2552s] [ 44%] 2025-12-04T15:22:22.8353503Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmul___cuda_float32 PASSED [0.1041s] [ 44%] 2025-12-04T15:22:22.8353622Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rsub___cuda_float32 PASSED [0.0947s] [ 44%] 2025-12-04T15:22:22.8353760Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__unsafe_masked_index_cuda_float32 PASSED [0.0701s] [ 44%] 2025-12-04T15:22:22.8353904Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__upsample_bilinear2d_aa_cuda_float32 PASSED 
[0.0193s] [ 44%] 2025-12-04T15:22:22.8354026Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addcdiv_cuda_float32 PASSED [0.7376s] [ 44%] 2025-12-04T15:22:22.8354153Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_alias_copy_cuda_float32 PASSED [0.0067s] [ 44%] 2025-12-04T15:22:22.8354316Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_allclose_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 44%] 2025-12-04T15:22:22.8354443Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_amin_cuda_float32 PASSED [0.0624s] [ 44%] 2025-12-04T15:22:22.8354575Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_copy_cuda_float32 PASSED [0.0149s] [ 44%] 2025-12-04T15:22:22.8354698Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_cuda_float32 PASSED [0.0147s] [ 44%] 2025-12-04T15:22:22.8354843Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_partial_views_cuda_float32 PASSED [0.0109s] [ 44%] 2025-12-04T15:22:22.8354961Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_asin_cuda_float32 PASSED [0.0052s] [ 44%] 2025-12-04T15:22:22.8355078Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atan2_cuda_float32 PASSED [0.1247s] [ 44%] 2025-12-04T15:22:22.8355195Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atan_cuda_float32 PASSED [0.0047s] [ 44%] 2025-12-04T15:22:22.8355354Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_byte_cuda_float32 SKIPPED [0.0011s] (Does not support autograd) [ 44%] 2025-12-04T15:22:22.8355473Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_chunk_cuda_float32 PASSED [0.0188s] [ 44%] 2025-12-04T15:22:22.8355598Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_contiguous_cuda_float32 PASSED [0.0056s] [ 45%] 2025-12-04T15:22:22.8355715Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cross_cuda_float32 PASSED [0.0371s] [ 45%] 2025-12-04T15:22:22.8355847Z 
test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumprod_cuda_float32 PASSED [0.0381s] [ 45%] 2025-12-04T15:22:22.8355996Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumulative_trapezoid_cuda_float32 PASSED [0.1591s] [ 45%] 2025-12-04T15:22:22.8356140Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_scatter_cuda_float32 PASSED [0.1544s] [ 45%] 2025-12-04T15:22:22.8356261Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_digamma_cuda_float32 PASSED [0.0099s] [ 45%] 2025-12-04T15:22:22.8356395Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_trunc_rounding_cuda_float32 PASSED [0.0947s] [ 45%] 2025-12-04T15:22:22.8356514Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_dsplit_cuda_float32 PASSED [0.0114s] [ 45%] 2025-12-04T15:22:22.8356672Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 45%] 2025-12-04T15:22:22.8356795Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_as_cuda_float32 PASSED [0.0138s] [ 45%] 2025-12-04T15:22:22.8356950Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_eye_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 45%] 2025-12-04T15:22:22.8357074Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfft2_cuda_float32 PASSED [0.0284s] [ 45%] 2025-12-04T15:22:22.8357194Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfft_cuda_float32 PASSED [0.0277s] [ 45%] 2025-12-04T15:22:22.8357312Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fmax_cuda_float32 PASSED [0.1169s] [ 45%] 2025-12-04T15:22:22.8357465Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ge_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 45%] 2025-12-04T15:22:22.8357624Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_geqrf_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 45%] 2025-12-04T15:22:22.8357745Z 
test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_gradient_cuda_float32 PASSED [0.4902s] [ 45%] 2025-12-04T15:22:22.8357919Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_grid_sampler_2d_cuda_float32 SKIPPED [0.0013s] (Does not support forward_ad) [ 45%] 2025-12-04T15:22:22.8358076Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_histc_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 45%] 2025-12-04T15:22:22.8358195Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_hsplit_cuda_float32 PASSED [1.4009s] [ 45%] 2025-12-04T15:22:22.8358378Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_reduce_prod_cuda_float32 SKIPPED [0.0017s] (Does not support forward_ad) [ 45%] 2025-12-04T15:22:22.8358507Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_select_cuda_float32 PASSED [0.0177s] [ 45%] 2025-12-04T15:22:22.8358692Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_2inputs_2outputs_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 45%] 2025-12-04T15:22:22.8358866Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_binary_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 45%] 2025-12-04T15:22:22.8358986Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lgamma_cuda_float32 PASSED [0.0092s] [ 45%] 2025-12-04T15:22:22.8359204Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_ldl_solve_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 45%] 2025-12-04T15:22:22.8359372Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lstsq_cuda_float32 SKIPPED [0.0009s] (Does not support forward_ad) [ 45%] 2025-12-04T15:22:22.8359519Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lstsq_grad_oriented_cuda_float32 PASSED [1.9254s] [ 45%] 2025-12-04T15:22:22.8359694Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_rank_cuda_float32 
SKIPPED [0.0014s] (Does not support autograd) [ 45%] 2025-12-04T15:22:22.8359822Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_pinv_cuda_float32 PASSED [0.2606s] [ 45%] 2025-12-04T15:22:22.8359975Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_solve_ex_cuda_float32 PASSED [0.3820s] [ 45%] 2025-12-04T15:22:22.8360161Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_solve_triangular_cuda_float32 PASSED [3.8832s] [ 46%] 2025-12-04T15:22:22.8360306Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_vander_cuda_float32 PASSED [0.0554s] [ 46%] 2025-12-04T15:22:22.8360470Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_normal_cuda_float32 SKIPPED [0.0011s] (Does not support autograd) [ 46%] 2025-12-04T15:22:22.8360594Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logaddexp_cuda_float32 PASSED [0.1326s] [ 46%] 2025-12-04T15:22:22.8360723Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logcumsumexp_cuda_float32 PASSED [0.0214s] [ 46%] 2025-12-04T15:22:22.8360878Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_long_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 46%] 2025-12-04T15:22:22.8361033Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lt_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 46%] 2025-12-04T15:22:22.8361154Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_solve_cuda_float32 PASSED [1.9020s] [ 46%] 2025-12-04T15:22:22.8361278Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_unpack_cuda_float32 PASSED [0.3390s] [ 46%] 2025-12-04T15:22:22.8361409Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_cumprod_cuda_float32 PASSED [0.1929s] [ 46%] 2025-12-04T15:22:22.8361535Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_fill_cuda_float32 PASSED [0.1012s] [ 46%] 2025-12-04T15:22:22.8361669Z 
test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_logsumexp_cuda_float32 PASSED [0.7126s] [ 46%] 2025-12-04T15:22:22.8361797Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_median_cuda_float32 PASSED [0.1208s] [ 46%] 2025-12-04T15:22:22.8361925Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_binary_cuda_float32 PASSED [0.1271s] [ 46%] 2025-12-04T15:22:22.8362068Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_reduction_with_dim_cuda_float32 PASSED [0.0137s] [ 46%] 2025-12-04T15:22:22.8362187Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mul_cuda_float32 PASSED [0.1092s] [ 46%] 2025-12-04T15:22:22.8362319Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mv_cuda_float32 PASSED [0.0140s] [ 46%] 2025-12-04T15:22:22.8362461Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_3_cuda_float32 PASSED [0.0309s] [ 46%] 2025-12-04T15:22:22.8362585Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nanmedian_cuda_float32 PASSED [0.0392s] [ 46%] 2025-12-04T15:22:22.8362768Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_native_dropout_backward_cuda_float32 SKIPPED [0.0010s] (Does not support forward_ad) [ 46%] 2025-12-04T15:22:22.8362923Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ne_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 46%] 2025-12-04T15:22:22.8363085Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_empty_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 46%] 2025-12-04T15:22:22.8363228Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_bilinear_cuda_float32 PASSED [5.5346s] [ 46%] 2025-12-04T15:22:22.8363367Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv1d_cuda_float32 PASSED [0.5209s] [ 46%] 2025-12-04T15:22:22.8363555Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_embedding_bag_cuda_float32 SKIPPED 
[0.0011s] (Does not support forward_ad) [ 46%] 2025-12-04T15:22:22.8363730Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [0.0331s] [ 46%] 2025-12-04T15:22:22.8363913Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_grid_sample_cuda_float32 SKIPPED [0.0010s] (Does not support forward_ad) [ 46%] 2025-12-04T15:22:22.8367103Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardswish_cuda_float32 PASSED [0.0112s] [ 46%] 2025-12-04T15:22:22.8367284Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [0.2254s] [ 46%] 2025-12-04T15:22:22.8367447Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_trilinear_cuda_float32 PASSED [0.0590s] [ 46%] 2025-12-04T15:22:22.8367603Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_local_response_norm_cuda_float32 PASSED [0.0981s] [ 46%] 2025-12-04T15:22:22.8367757Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_margin_ranking_loss_cuda_float32 PASSED [0.6746s] [ 46%] 2025-12-04T15:22:22.8367897Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_mse_loss_cuda_float32 PASSED [0.0841s] [ 47%] 2025-12-04T15:22:22.8368046Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_circular_cuda_float32 PASSED [0.0490s] [ 47%] 2025-12-04T15:22:22.8368190Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_reflect_cuda_float32 PASSED [0.0279s] [ 47%] 2025-12-04T15:22:22.8368352Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_replicate_negative_cuda_float32 PASSED [0.0155s] [ 47%] 2025-12-04T15:22:22.8368506Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pairwise_distance_cuda_float32 PASSED [0.1269s] [ 47%] 2025-12-04T15:22:22.8368655Z 
test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_smooth_l1_loss_cuda_float32 PASSED [0.1133s] [ 47%] 2025-12-04T15:22:22.8368795Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softplus_cuda_float32 PASSED [1.3952s] [ 47%] 2025-12-04T15:22:22.8368935Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softsign_cuda_float32 PASSED [0.0177s] [ 47%] 2025-12-04T15:22:22.8369081Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_tanhshrink_cuda_float32 PASSED [0.0102s] [ 47%] 2025-12-04T15:22:22.8369243Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nonzero_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 47%] 2025-12-04T15:22:22.8369366Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_inf_cuda_float32 PASSED [0.0183s] [ 47%] 2025-12-04T15:22:22.8369502Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_nuc_cuda_float32 PASSED [1.3888s] [ 47%] 2025-12-04T15:22:22.8369664Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_normal_cuda_float32 SKIPPED [0.0017s] (Does not support forward_ad) [ 47%] 2025-12-04T15:22:22.8369819Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ones_cuda_float32 SKIPPED [0.0013s] (Does not support autograd) [ 47%] 2025-12-04T15:22:22.8369938Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_put_cuda_float32 PASSED [0.5653s] [ 47%] 2025-12-04T15:22:22.8370060Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rad2deg_cuda_float32 PASSED [0.0046s] [ 47%] 2025-12-04T15:22:22.8370262Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randn_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 47%] 2025-12-04T15:22:22.8370381Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ravel_cuda_float32 PASSED [0.0096s] [ 47%] 2025-12-04T15:22:22.8370504Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_reshape_cuda_float32 PASSED [0.0177s] [ 47%] 
2025-12-04T15:22:22.8370624Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_cuda_float32 PASSED [0.1729s] [ 47%] 2025-12-04T15:22:22.8370800Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_prod_cuda_float32 SKIPPED [0.0009s] (Does not support forward_ad) [ 47%] 2025-12-04T15:22:22.8370981Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_blackman_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 47%] 2025-12-04T15:22:22.8371196Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_exponential_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 47%] 2025-12-04T15:22:22.8371394Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_general_cosine_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 47%] 2025-12-04T15:22:22.8371514Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sin_cuda_float32 PASSED [1.3524s] [ 47%] 2025-12-04T15:22:22.8371633Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sinc_cuda_float32 PASSED [0.0143s] [ 47%] 2025-12-04T15:22:22.8371752Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sinh_cuda_float32 PASSED [0.0049s] [ 47%] 2025-12-04T15:22:22.8371873Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_softmax_cuda_float32 PASSED [0.0258s] [ 47%] 2025-12-04T15:22:22.8371989Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sort_cuda_float32 PASSED [0.0960s] [ 47%] 2025-12-04T15:22:22.8372161Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_airy_ai_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 47%] 2025-12-04T15:22:22.8372333Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_j0_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 47%] 2025-12-04T15:22:22.8372504Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_j1_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 
48%] 2025-12-04T15:22:22.8372673Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_y0_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 48%] 2025-12-04T15:22:22.8372863Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_v_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 48%] 2025-12-04T15:22:22.8372993Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_erfcx_cuda_float32 PASSED [0.0094s] [ 48%] 2025-12-04T15:22:22.8373186Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_scaled_modified_bessel_k1_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 48%] 2025-12-04T15:22:22.8373387Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_t_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 48%] 2025-12-04T15:22:22.8373597Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_v_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 48%] 2025-12-04T15:22:22.8373783Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_spherical_bessel_j0_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 48%] 2025-12-04T15:22:22.8373902Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_cuda_float32 PASSED [0.0089s] [ 48%] 2025-12-04T15:22:22.8374044Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_with_sizes_copy_cuda_float32 PASSED [0.0238s] [ 48%] 2025-12-04T15:22:22.8374179Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_with_sizes_cuda_float32 PASSED [0.0202s] [ 48%] 2025-12-04T15:22:22.8374298Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_stack_cuda_float32 PASSED [0.0253s] [ 48%] 2025-12-04T15:22:22.8374424Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sum_to_size_cuda_float32 PASSED [0.0371s] [ 48%] 2025-12-04T15:22:22.8374542Z 
test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tanh_cuda_float32 PASSED [0.0043s] [ 48%] 2025-12-04T15:22:22.8374665Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tensordot_cuda_float32 PASSED [0.0695s] [ 48%] 2025-12-04T15:22:22.8374922Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_torch_ops_aten__efficient_attention_forward_cuda_float32 SKIPPED [0.0005s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 48%] 2025-12-04T15:22:22.8375040Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_trace_cuda_float32 PASSED [0.0043s] [ 48%] 2025-12-04T15:22:22.8375177Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_trapz_cuda_float32 PASSED [0.1619s] [ 48%] 2025-12-04T15:22:22.8375311Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_triangular_solve_cuda_float32 PASSED [0.3156s] [ 48%] 2025-12-04T15:22:22.8375482Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_uniform_cuda_float32 SKIPPED [0.0011s] (Does not support autograd) [ 48%] 2025-12-04T15:22:22.8375610Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unsafe_split_cuda_float32 PASSED [0.0088s] [ 48%] 2025-12-04T15:22:22.8375731Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_mean_cuda_float32 PASSED [0.0618s] [ 48%] 2025-12-04T15:22:22.8375858Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_unbiased_cuda_float32 PASSED [0.0077s] [ 48%] 2025-12-04T15:22:22.8375987Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_as_complex_cuda_float32 PASSED [1.3609s] [ 48%] 2025-12-04T15:22:22.8376107Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_cuda_float32 PASSED [0.0207s] [ 48%] 2025-12-04T15:22:22.8376223Z test_ops.py::TestCompositeComplianceCUDA::test_operator_H_cuda_float32 PASSED [0.0040s] [ 48%] 2025-12-04T15:22:22.8376354Z test_ops.py::TestCompositeComplianceCUDA::test_operator___getitem___cuda_float32 PASSED [0.0189s] [ 48%] 2025-12-04T15:22:22.8376473Z 
test_ops.py::TestCompositeComplianceCUDA::test_operator___radd___cuda_float32 PASSED [0.0120s] [ 48%] 2025-12-04T15:22:22.8376593Z test_ops.py::TestCompositeComplianceCUDA::test_operator___rmod___cuda_float32 PASSED [0.0118s] [ 48%] 2025-12-04T15:22:22.8376710Z test_ops.py::TestCompositeComplianceCUDA::test_operator___rmul___cuda_float32 PASSED [0.0117s] [ 48%] 2025-12-04T15:22:22.8376847Z test_ops.py::TestCompositeComplianceCUDA::test_operator__unsafe_masked_index_cuda_float32 PASSED [0.0180s] [ 48%] 2025-12-04T15:22:22.8376964Z test_ops.py::TestCompositeComplianceCUDA::test_operator_argmax_cuda_float32 PASSED [0.0082s] [ 48%] 2025-12-04T15:22:22.8377087Z test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_cuda_float32 PASSED [0.0059s] [ 49%] 2025-12-04T15:22:22.8377229Z test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_partial_views_cuda_float32 PASSED [0.0044s] [ 49%] 2025-12-04T15:22:22.8377347Z test_ops.py::TestCompositeComplianceCUDA::test_operator_atanh_cuda_float32 PASSED [1.3612s] [ 49%] 2025-12-04T15:22:22.8377481Z test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_1d_cuda_float32 PASSED [0.0097s] [ 49%] 2025-12-04T15:22:22.8377601Z test_ops.py::TestCompositeComplianceCUDA::test_operator_bfloat16_cuda_float32 PASSED [0.0064s] [ 49%] 2025-12-04T15:22:22.8377717Z test_ops.py::TestCompositeComplianceCUDA::test_operator_bmm_cuda_float32 PASSED [0.0038s] [ 49%] 2025-12-04T15:22:22.8377846Z test_ops.py::TestCompositeComplianceCUDA::test_operator_cartesian_prod_cuda_float32 PASSED [0.0099s] [ 49%] 2025-12-04T15:22:22.8377968Z test_ops.py::TestCompositeComplianceCUDA::test_operator_cdouble_cuda_float32 PASSED [0.0062s] [ 49%] 2025-12-04T15:22:22.8378084Z test_ops.py::TestCompositeComplianceCUDA::test_operator_cfloat_cuda_float32 PASSED [0.0060s] [ 49%] 2025-12-04T15:22:22.8378206Z test_ops.py::TestCompositeComplianceCUDA::test_operator_clamp_min_cuda_float32 PASSED [0.0120s] [ 49%] 2025-12-04T15:22:22.8378323Z 
test_ops.py::TestCompositeComplianceCUDA::test_operator_conj_cuda_float32 PASSED [0.0037s] [ 49%] 2025-12-04T15:22:22.8378449Z test_ops.py::TestCompositeComplianceCUDA::test_operator_copysign_cuda_float32 PASSED [0.0126s] [ 49%] 2025-12-04T15:22:22.8378567Z test_ops.py::TestCompositeComplianceCUDA::test_operator_cos_cuda_float32 PASSED [0.0040s] [ 49%] 2025-12-04T15:22:22.8378685Z test_ops.py::TestCompositeComplianceCUDA::test_operator_cross_cuda_float32 PASSED [0.0055s] [ 49%] 2025-12-04T15:22:22.8378803Z test_ops.py::TestCompositeComplianceCUDA::test_operator_cummax_cuda_float32 PASSED [0.0048s] [ 49%] 2025-12-04T15:22:22.8378936Z test_ops.py::TestCompositeComplianceCUDA::test_operator_diag_embed_cuda_float32 PASSED [0.0135s] [ 49%] 2025-12-04T15:22:22.8379068Z test_ops.py::TestCompositeComplianceCUDA::test_operator_diagflat_cuda_float32 PASSED [0.0065s] [ 49%] 2025-12-04T15:22:22.8379195Z test_ops.py::TestCompositeComplianceCUDA::test_operator_dstack_cuda_float32 PASSED [0.0044s] [ 49%] 2025-12-04T15:22:22.8379314Z test_ops.py::TestCompositeComplianceCUDA::test_operator_erfinv_cuda_float32 PASSED [0.0028s] [ 49%] 2025-12-04T15:22:22.8379431Z test_ops.py::TestCompositeComplianceCUDA::test_operator_exp_cuda_float32 PASSED [0.0040s] [ 49%] 2025-12-04T15:22:22.8379556Z test_ops.py::TestCompositeComplianceCUDA::test_operator_expand_as_cuda_float32 PASSED [0.0052s] [ 49%] 2025-12-04T15:22:22.8379675Z test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fft2_cuda_float32 PASSED [0.0089s] [ 49%] 2025-12-04T15:22:22.8379801Z test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfft2_cuda_float32 PASSED [0.0085s] [ 49%] 2025-12-04T15:22:22.8379923Z test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfft_cuda_float32 PASSED [0.0092s] [ 49%] 2025-12-04T15:22:22.8380045Z test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_rfft_cuda_float32 PASSED [0.0088s] [ 49%] 2025-12-04T15:22:22.8380199Z 
test_ops.py::TestCompositeComplianceCUDA::test_operator_fill_cuda_float32 PASSED [0.0041s] [ 49%] 2025-12-04T15:22:22.8380315Z test_ops.py::TestCompositeComplianceCUDA::test_operator_fmod_cuda_float32 PASSED [0.0121s] [ 49%] 2025-12-04T15:22:22.8380435Z test_ops.py::TestCompositeComplianceCUDA::test_operator_histc_cuda_float32 PASSED [0.0779s] [ 49%] 2025-12-04T15:22:22.8380553Z test_ops.py::TestCompositeComplianceCUDA::test_operator_igamma_cuda_float32 PASSED [0.0117s] [ 49%] 2025-12-04T15:22:22.8380681Z test_ops.py::TestCompositeComplianceCUDA::test_operator_index_fill_cuda_float32 PASSED [0.0117s] [ 49%] 2025-12-04T15:22:22.8380816Z test_ops.py::TestCompositeComplianceCUDA::test_operator_index_reduce_mean_cuda_float32 PASSED [0.0212s] [ 49%] 2025-12-04T15:22:22.8380947Z test_ops.py::TestCompositeComplianceCUDA::test_operator_index_select_cuda_float32 PASSED [0.0054s] [ 50%] 2025-12-04T15:22:22.8381065Z test_ops.py::TestCompositeComplianceCUDA::test_operator_inner_cuda_float32 PASSED [0.0048s] [ 50%] 2025-12-04T15:22:22.8381185Z test_ops.py::TestCompositeComplianceCUDA::test_operator_isin_cuda_float32 PASSED [1.3802s] [ 50%] 2025-12-04T15:22:22.8381360Z test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_2inputs_2outputs_cuda_float32 SKIPPED [0.0003s] (skip) [ 50%] 2025-12-04T15:22:22.8381527Z test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_binary_return_by_ref_cuda_float32 SKIPPED [0.0002s] (skip) [ 50%] 2025-12-04T15:22:22.8381652Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_eig_cuda_float32 PASSED [0.1648s] [ 50%] 2025-12-04T15:22:22.8381782Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lstsq_cuda_float32 PASSED [0.7020s] [ 50%] 2025-12-04T15:22:22.8381921Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_factor_ex_cuda_float32 PASSED [0.0477s] [ 50%] 2025-12-04T15:22:22.8382059Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_norm_cuda_float32 PASSED 
[1.3937s] [ 50%] 2025-12-04T15:22:22.8382198Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_power_cuda_float32 PASSED [0.0380s] [ 50%] 2025-12-04T15:22:22.8382332Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_rank_cuda_float32 PASSED [0.1338s] [ 50%] 2025-12-04T15:22:22.8382483Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_rank_hermitian_cuda_float32 PASSED [0.0138s] [ 50%] 2025-12-04T15:22:22.8382639Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_norm_subgradients_at_zero_cuda_float32 PASSED [0.0835s] [ 50%] 2025-12-04T15:22:22.8382763Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_qr_cuda_float32 PASSED [0.0532s] [ 50%] 2025-12-04T15:22:22.8382894Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_slogdet_cuda_float32 PASSED [0.0163s] [ 50%] 2025-12-04T15:22:22.8383057Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_solve_cuda_float32 PASSED [0.0431s] [ 50%] 2025-12-04T15:22:22.8383187Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_svdvals_cuda_float32 PASSED [0.0358s] [ 50%] 2025-12-04T15:22:22.8383335Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_tensorsolve_cuda_float32 PASSED [1.3273s] [ 50%] 2025-12-04T15:22:22.8383479Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linspace_tensor_overload_cuda_float32 PASSED [0.1798s] [ 50%] 2025-12-04T15:22:22.8383621Z test_ops.py::TestCompositeComplianceCUDA::test_operator_log_softmax_with_dtype_cuda_float32 PASSED [0.0083s] [ 50%] 2025-12-04T15:22:22.8383747Z test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_not_cuda_float32 PASSED [0.0036s] [ 50%] 2025-12-04T15:22:22.8383873Z test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_xor_cuda_float32 PASSED [0.0088s] [ 50%] 2025-12-04T15:22:22.8383999Z test_ops.py::TestCompositeComplianceCUDA::test_operator_logsumexp_cuda_float32 PASSED [0.0162s] [ 50%] 
2025-12-04T15:22:22.8384116Z test_ops.py::TestCompositeComplianceCUDA::test_operator_long_cuda_float32 PASSED [0.0046s] [ 50%] 2025-12-04T15:22:22.8384240Z test_ops.py::TestCompositeComplianceCUDA::test_operator_lu_unpack_cuda_float32 PASSED [0.0963s] [ 50%] 2025-12-04T15:22:22.8384354Z test_ops.py::TestCompositeComplianceCUDA::test_operator_mH_cuda_float32 PASSED [1.3242s] [ 50%] 2025-12-04T15:22:22.8384483Z test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_argmin_cuda_float32 PASSED [0.1085s] [ 50%] 2025-12-04T15:22:22.8384610Z test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_cumsum_cuda_float32 PASSED [0.0335s] [ 50%] 2025-12-04T15:22:22.8384739Z test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_softmin_cuda_float32 PASSED [0.0425s] [ 50%] 2025-12-04T15:22:22.8384862Z test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_std_cuda_float32 PASSED [0.2491s] [ 50%] 2025-12-04T15:22:22.8384983Z test_ops.py::TestCompositeComplianceCUDA::test_operator_matmul_cuda_float32 PASSED [0.0286s] [ 50%] 2025-12-04T15:22:22.8385106Z test_ops.py::TestCompositeComplianceCUDA::test_operator_max_binary_cuda_float32 PASSED [0.0119s] [ 50%] 2025-12-04T15:22:22.8385249Z test_ops.py::TestCompositeComplianceCUDA::test_operator_min_reduction_with_dim_cuda_float32 PASSED [0.0058s] [ 51%] 2025-12-04T15:22:22.8385376Z test_ops.py::TestCompositeComplianceCUDA::test_operator_mode_cuda_float32 PASSED [0.0098s] [ 51%] 2025-12-04T15:22:22.8385518Z test_ops.py::TestCompositeComplianceCUDA::test_operator_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0101s] [ 51%] 2025-12-04T15:22:22.8385638Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nanmean_cuda_float32 PASSED [0.0457s] [ 51%] 2025-12-04T15:22:22.8385760Z test_ops.py::TestCompositeComplianceCUDA::test_operator_new_full_cuda_float32 PASSED [0.0078s] [ 51%] 2025-12-04T15:22:22.8385920Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_max_pool1d_cuda_float32 
PASSED [1.3418s] [ 51%] 2025-12-04T15:22:22.8386063Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_avg_pool1d_cuda_float32 PASSED [0.0129s] [ 51%] 2025-12-04T15:22:22.8386209Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_avg_pool2d_cuda_float32 PASSED [0.0080s] [ 51%] 2025-12-04T15:22:22.8386352Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_batch_norm_cuda_float32 PASSED [0.0801s] [ 51%] 2025-12-04T15:22:22.8386515Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_batch_norm_without_cudnn_cuda_float32 PASSED [0.0806s] [ 51%] 2025-12-04T15:22:22.8386667Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose3d_cuda_float32 PASSED [0.0440s] [ 51%] 2025-12-04T15:22:22.8386822Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cosine_similarity_cuda_float32 PASSED [0.0266s] [ 51%] 2025-12-04T15:22:22.8386984Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_dropout3d_cuda_float32 PASSED [0.0222s] [ 51%] 2025-12-04T15:22:22.8387163Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [0.0143s] [ 51%] 2025-12-04T15:22:22.8387332Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_fractional_max_pool2d_cuda_float32 PASSED [0.0649s] [ 51%] 2025-12-04T15:22:22.8387470Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_glu_cuda_float32 PASSED [0.0341s] [ 51%] 2025-12-04T15:22:22.8387614Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_grid_sample_cuda_float32 PASSED [0.0372s] [ 51%] 2025-12-04T15:22:22.8387772Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [0.0293s] [ 51%] 2025-12-04T15:22:22.8387921Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_instance_norm_cuda_float32 PASSED [0.1615s] [ 
51%] 2025-12-04T15:22:22.8388075Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_area_cuda_float32 PASSED [0.0207s] [ 51%] 2025-12-04T15:22:22.8388232Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_bicubic_cuda_float32 PASSED [0.0268s] [ 51%] 2025-12-04T15:22:22.8388390Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_bilinear_cuda_float32 PASSED [0.0209s] [ 51%] 2025-12-04T15:22:22.8388553Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_trilinear_cuda_float32 PASSED [0.0212s] [ 51%] 2025-12-04T15:22:22.8388689Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_kl_div_cuda_float32 PASSED [0.0293s] [ 51%] 2025-12-04T15:22:22.8388874Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multi_head_attention_forward_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 51%] 2025-12-04T15:22:22.8389026Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multi_margin_loss_cuda_float32 PASSED [0.0148s] [ 51%] 2025-12-04T15:22:22.8389172Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_reflect_cuda_float32 PASSED [0.0105s] [ 51%] 2025-12-04T15:22:22.8389327Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pairwise_distance_cuda_float32 PASSED [1.3298s] [ 51%] 2025-12-04T15:22:22.8389476Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_prelu_cuda_float32 PASSED [0.0366s] [ 51%] 2025-12-04T15:22:22.8389619Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_rms_norm_cuda_float32 PASSED [0.0092s] [ 51%] 2025-12-04T15:22:22.8389754Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_selu_cuda_float32 PASSED [1.3117s] [ 51%] 2025-12-04T15:22:22.8389890Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_silu_cuda_float32 PASSED [0.0189s] [ 51%] 
2025-12-04T15:22:22.8390046Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softmin_with_dtype_cuda_float32 PASSED [0.0093s] [ 52%] 2025-12-04T15:22:22.8390223Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softplus_cuda_float32 PASSED [1.3198s] [ 52%] 2025-12-04T15:22:22.8390398Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [0.0419s] [ 52%] 2025-12-04T15:22:22.8390554Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_upsample_nearest_cuda_float32 PASSED [0.0119s] [ 52%] 2025-12-04T15:22:22.8390673Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nonzero_cuda_float32 PASSED [0.0201s] [ 52%] 2025-12-04T15:22:22.8390791Z test_ops.py::TestCompositeComplianceCUDA::test_operator_ones_cuda_float32 PASSED [0.0031s] [ 52%] 2025-12-04T15:22:22.8390908Z test_ops.py::TestCompositeComplianceCUDA::test_operator_outer_cuda_float32 PASSED [1.3502s] [ 52%] 2025-12-04T15:22:22.8391054Z test_ops.py::TestCompositeComplianceCUDA::test_operator_permute_copy_cuda_float32 PASSED [0.0071s] [ 52%] 2025-12-04T15:22:22.8391184Z test_ops.py::TestCompositeComplianceCUDA::test_operator_polar_cuda_float32 PASSED [0.0134s] [ 52%] 2025-12-04T15:22:22.8391314Z test_ops.py::TestCompositeComplianceCUDA::test_operator_randn_cuda_float32 PASSED [0.0044s] [ 52%] 2025-12-04T15:22:22.8391440Z test_ops.py::TestCompositeComplianceCUDA::test_operator_randn_like_cuda_float32 PASSED [0.0127s] [ 52%] 2025-12-04T15:22:22.8391556Z test_ops.py::TestCompositeComplianceCUDA::test_operator_real_cuda_float32 PASSED [1.3404s] [ 52%] 2025-12-04T15:22:22.8391681Z test_ops.py::TestCompositeComplianceCUDA::test_operator_reciprocal_cuda_float32 PASSED [0.0062s] [ 52%] 2025-12-04T15:22:22.8391804Z test_ops.py::TestCompositeComplianceCUDA::test_operator_remainder_cuda_float32 PASSED [0.0129s] [ 52%] 2025-12-04T15:22:22.8391925Z 
test_ops.py::TestCompositeComplianceCUDA::test_operator_renorm_cuda_float32 PASSED [1.3392s] [ 52%] 2025-12-04T15:22:22.8392076Z test_ops.py::TestCompositeComplianceCUDA::test_operator_resize__cuda_float32 SKIPPED [0.0003s] (Allowed exception) [ 52%] 2025-12-04T15:22:22.8392205Z test_ops.py::TestCompositeComplianceCUDA::test_operator_resolve_neg_cuda_float32 PASSED [1.3518s] [ 52%] 2025-12-04T15:22:22.8392338Z test_ops.py::TestCompositeComplianceCUDA::test_operator_round_decimals_3_cuda_float32 PASSED [0.0060s] [ 52%] 2025-12-04T15:22:22.8392458Z test_ops.py::TestCompositeComplianceCUDA::test_operator_rsqrt_cuda_float32 PASSED [1.3320s] [ 52%] 2025-12-04T15:22:22.8392595Z test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_mean_cuda_float32 PASSED [0.0548s] [ 52%] 2025-12-04T15:22:22.8392725Z test_ops.py::TestCompositeComplianceCUDA::test_operator_searchsorted_cuda_float32 PASSED [0.3224s] [ 52%] 2025-12-04T15:22:22.8392845Z test_ops.py::TestCompositeComplianceCUDA::test_operator_sigmoid_cuda_float32 PASSED [1.3219s] [ 52%] 2025-12-04T15:22:22.8392962Z test_ops.py::TestCompositeComplianceCUDA::test_operator_sign_cuda_float32 PASSED [0.0047s] [ 52%] 2025-12-04T15:22:22.8393107Z test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_bartlett_cuda_float32 PASSED [0.0102s] [ 52%] 2025-12-04T15:22:22.8393250Z test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_hamming_cuda_float32 PASSED [0.0128s] [ 52%] 2025-12-04T15:22:22.8393389Z test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_hann_cuda_float32 PASSED [0.0123s] [ 52%] 2025-12-04T15:22:22.8393537Z test_ops.py::TestCompositeComplianceCUDA::test_operator_softmax_with_dtype_cuda_float32 PASSED [0.0080s] [ 52%] 2025-12-04T15:22:22.8393673Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_y0_cuda_float32 PASSED [1.3267s] [ 52%] 2025-12-04T15:22:22.8393805Z 
test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_y1_cuda_float32 PASSED [0.0073s] [ 52%] 2025-12-04T15:22:22.8393957Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_hermite_polynomial_h_cuda_float32 PASSED [0.0140s] [ 52%] 2025-12-04T15:22:22.8394112Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_hermite_polynomial_he_cuda_float32 PASSED [0.2918s] [ 52%] 2025-12-04T15:22:22.8394242Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_i0e_cuda_float32 PASSED [1.3707s] [ 53%] 2025-12-04T15:22:22.8394367Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_i1e_cuda_float32 PASSED [0.0054s] [ 53%] 2025-12-04T15:22:22.8394503Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_log_ndtr_cuda_float32 PASSED [1.3492s] [ 53%] 2025-12-04T15:22:22.8394651Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_k0_cuda_float32 PASSED [0.0074s] [ 53%] 2025-12-04T15:22:22.8394809Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_scaled_modified_bessel_k0_cuda_float32 PASSED [1.3327s] [ 53%] 2025-12-04T15:22:22.8394928Z test_ops.py::TestCompositeComplianceCUDA::test_operator_stack_cuda_float32 PASSED [0.0109s] [ 53%] 2025-12-04T15:22:22.8395085Z test_ops.py::TestCompositeComplianceCUDA::test_operator_std_mean_unbiased_cuda_float32 PASSED [1.3402s] [ 53%] 2025-12-04T15:22:22.8395203Z test_ops.py::TestCompositeComplianceCUDA::test_operator_stft_cuda_float32 PASSED [0.0219s] [ 53%] 2025-12-04T15:22:22.8395330Z test_ops.py::TestCompositeComplianceCUDA::test_operator_sum_cuda_float32 PASSED [0.0173s] [ 53%] 2025-12-04T15:22:22.8395462Z test_ops.py::TestCompositeComplianceCUDA::test_operator_take_along_dim_cuda_float32 PASSED [1.3330s] [ 53%] 2025-12-04T15:22:22.8395588Z test_ops.py::TestCompositeComplianceCUDA::test_operator_tensor_split_cuda_float32 XFAIL [0.0037s] [ 53%] 2025-12-04T15:22:22.8395703Z 
test_ops.py::TestCompositeComplianceCUDA::test_operator_tile_cuda_float32 PASSED [1.3722s] [ 53%] 2025-12-04T15:22:22.8395855Z test_ops.py::TestCompositeComplianceCUDA::test_operator_to_sparse_cuda_float32 SKIPPED [0.0002s] (Allowed exception) [ 53%] 2025-12-04T15:22:22.8395972Z test_ops.py::TestCompositeComplianceCUDA::test_operator_topk_cuda_float32 PASSED [0.0155s] [ 53%] 2025-12-04T15:22:22.8396130Z test_ops.py::TestCompositeComplianceCUDA::test_operator_torch_ops_aten__safe_softmax_default_cuda_float32 PASSED [0.0092s] [ 53%] 2025-12-04T15:22:22.8396262Z test_ops.py::TestCompositeComplianceCUDA::test_operator_transpose_copy_cuda_float32 PASSED [0.0080s] [ 53%] 2025-12-04T15:22:22.8396386Z test_ops.py::TestCompositeComplianceCUDA::test_operator_trapezoid_cuda_float32 PASSED [0.0208s] [ 53%] 2025-12-04T15:22:22.8396506Z test_ops.py::TestCompositeComplianceCUDA::test_operator_trapz_cuda_float32 PASSED [0.0209s] [ 53%] 2025-12-04T15:22:22.8396633Z test_ops.py::TestCompositeComplianceCUDA::test_operator_unbind_copy_cuda_float32 PASSED [0.0142s] [ 53%] 2025-12-04T15:22:22.8396753Z test_ops.py::TestCompositeComplianceCUDA::test_operator_unique_cuda_float32 PASSED [0.7537s] [ 53%] 2025-12-04T15:22:22.8396872Z test_ops.py::TestCompositeComplianceCUDA::test_operator_view_as_cuda_float32 PASSED [0.0065s] [ 53%] 2025-12-04T15:22:22.8396992Z test_ops.py::TestCompositeComplianceCUDA::test_operator_vsplit_cuda_float32 PASSED [0.0052s] [ 53%] 2025-12-04T15:22:22.8397111Z test_ops.py::TestCompositeComplianceCUDA::test_operator_xlogy_cuda_float32 PASSED [0.0118s] [ 53%] 2025-12-04T15:22:22.8397240Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay___getitem___cuda_float32 PASSED [0.0084s] [ 53%] 2025-12-04T15:22:22.8397364Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rpow___cuda_float32 PASSED [0.0038s] [ 53%] 2025-12-04T15:22:22.8397499Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_acosh_cuda_float32 PASSED [1.3202s] [ 53%] 
2025-12-04T15:22:22.8397620Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_amax_cuda_float32 PASSED [0.0053s] [ 53%] 2025-12-04T15:22:22.8397743Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argsort_cuda_float32 PASSED [1.3365s] [ 53%] 2025-12-04T15:22:22.8397878Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_as_strided_copy_cuda_float32 PASSED [0.0038s] [ 53%] 2025-12-04T15:22:22.8397997Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atan_cuda_float32 PASSED [1.3225s] [ 53%] 2025-12-04T15:22:22.8398117Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bmm_cuda_float32 PASSED [0.0034s] [ 53%] 2025-12-04T15:22:22.8398255Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_broadcast_tensors_cuda_float32 PASSED [1.3129s] [ 53%] 2025-12-04T15:22:22.8398373Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_char_cuda_float32 PASSED [0.0038s] [ 54%] 2025-12-04T15:22:22.8398501Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clamp_max_cuda_float32 PASSED [0.0035s] [ 54%] 2025-12-04T15:22:22.8398633Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_combinations_cuda_float32 PASSED [1.3264s] [ 54%] 2025-12-04T15:22:22.8398833Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_conj_cuda_float32 PASSED [0.0034s] [ 54%] 2025-12-04T15:22:22.8398963Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_count_nonzero_cuda_float32 PASSED [1.3182s] [ 54%] 2025-12-04T15:22:22.8399106Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_deg2rad_cuda_float32 PASSED [0.0032s] [ 54%] 2025-12-04T15:22:22.8399239Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagonal_copy_cuda_float32 PASSED [1.3260s] [ 54%] 2025-12-04T15:22:22.8399388Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_div_floor_rounding_cuda_float32 PASSED [0.0051s] [ 54%] 2025-12-04T15:22:22.8399508Z 
test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dsplit_cuda_float32 PASSED [1.3327s] [ 54%] 2025-12-04T15:22:22.8399630Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_equal_cuda_float32 PASSED [0.0044s] [ 54%] 2025-12-04T15:22:22.8399747Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_exp_cuda_float32 PASSED [1.3341s] [ 54%] 2025-12-04T15:22:22.8399871Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fft_cuda_float32 PASSED [0.0046s] [ 54%] 2025-12-04T15:22:22.8399996Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfftn_cuda_float32 PASSED [1.3927s] [ 54%] 2025-12-04T15:22:22.8400159Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_rfft2_cuda_float32 PASSED [0.0044s] [ 54%] 2025-12-04T15:22:22.8400284Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_rfftn_cuda_float32 PASSED [1.3560s] [ 54%] 2025-12-04T15:22:22.8400404Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_float_cuda_float32 PASSED [0.0037s] [ 54%] 2025-12-04T15:22:22.8400534Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_floor_divide_cuda_float32 PASSED [0.0040s] [ 54%] 2025-12-04T15:22:22.8400661Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_gradient_cuda_float32 PASSED [1.3779s] [ 54%] 2025-12-04T15:22:22.8400781Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_histc_cuda_float32 PASSED [0.0158s] [ 54%] 2025-12-04T15:22:22.8400904Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hstack_cuda_float32 PASSED [1.3715s] [ 54%] 2025-12-04T15:22:22.8401023Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_i0_cuda_float32 PASSED [0.0034s] [ 54%] 2025-12-04T15:22:22.8401142Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isreal_cuda_float32 PASSED [1.3684s] [ 54%] 2025-12-04T15:22:22.8401263Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_item_cuda_float32 PASSED [0.0042s] [ 54%] 2025-12-04T15:22:22.8401399Z 
test_ops.py::TestCompositeComplianceCUDA::test_view_replay_kthvalue_cuda_float32 PASSED [1.3851s] [ 54%] 2025-12-04T15:22:22.8401521Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lgamma_cuda_float32 PASSED [0.0034s] [ 54%] 2025-12-04T15:22:22.8401649Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_cond_cuda_float32 PASSED [1.3683s] [ 54%] 2025-12-04T15:22:22.8401786Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_ldl_factor_cuda_float32 PASSED [0.0044s] [ 54%] 2025-12-04T15:22:22.8401914Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lstsq_cuda_float32 PASSED [1.4733s] [ 54%] 2025-12-04T15:22:22.8402067Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lstsq_grad_oriented_cuda_float32 PASSED [0.1089s] [ 54%] 2025-12-04T15:22:22.8402205Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_norm_cuda_float32 PASSED [1.3683s] [ 54%] 2025-12-04T15:22:22.8402358Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_rank_hermitian_cuda_float32 PASSED [0.0076s] [ 54%] 2025-12-04T15:22:22.8402484Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_norm_cuda_float32 PASSED [1.3995s] [ 55%] 2025-12-04T15:22:22.8402643Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_norm_subgradients_at_zero_cuda_float32 PASSED [0.0151s] [ 55%] 2025-12-04T15:22:22.8402771Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_pinv_cuda_float32 PASSED [1.3746s] [ 55%] 2025-12-04T15:22:22.8402898Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_qr_cuda_float32 PASSED [0.0149s] [ 55%] 2025-12-04T15:22:22.8403059Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_svdvals_cuda_float32 PASSED [0.0083s] [ 55%] 2025-12-04T15:22:22.8403198Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_tensorsolve_cuda_float32 PASSED [1.3849s] [ 55%] 2025-12-04T15:22:22.8403337Z 
test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linspace_cuda_float32 PASSED [0.0093s] [ 55%] 2025-12-04T15:22:22.8403457Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_cuda_float32 PASSED [1.3598s] [ 55%] 2025-12-04T15:22:22.8403586Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_or_cuda_float32 PASSED [0.0045s] [ 55%] 2025-12-04T15:22:22.8403714Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_xor_cuda_float32 PASSED [0.0034s] [ 55%] 2025-12-04T15:22:22.8403839Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logspace_cuda_float32 PASSED [0.0414s] [ 55%] 2025-12-04T15:22:22.8403959Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_long_cuda_float32 PASSED [1.3687s] [ 55%] 2025-12-04T15:22:22.8404079Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_cuda_float32 PASSED [0.0213s] [ 55%] 2025-12-04T15:22:22.8404212Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_cumprod_cuda_float32 PASSED [0.0056s] [ 55%] 2025-12-04T15:22:22.8404346Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_select_cuda_float32 PASSED [1.3552s] [ 55%] 2025-12-04T15:22:22.8404473Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_sum_cuda_float32 PASSED [0.0179s] [ 55%] 2025-12-04T15:22:22.8404601Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_binary_cuda_float32 PASSED [0.0036s] [ 55%] 2025-12-04T15:22:22.8404746Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_reduction_with_dim_cuda_float32 PASSED [1.3656s] [ 55%] 2025-12-04T15:22:22.8404870Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_maximum_cuda_float32 PASSED [0.0045s] [ 55%] 2025-12-04T15:22:22.8405022Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_meshgrid_variadic_tensors_cuda_float32 PASSED [1.3754s] [ 55%] 2025-12-04T15:22:22.8405296Z 
test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_reduction_no_dim_cuda_float32 PASSED [0.0033s] [ 55%] 2025-12-04T15:22:22.8405444Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_reduction_with_dim_cuda_float32 PASSED [1.3832s] [ 55%] 2025-12-04T15:22:22.8405583Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_multinomial_cuda_float32 PASSED [0.0056s] [ 55%] 2025-12-04T15:22:22.8405702Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mv_cuda_float32 PASSED [1.3792s] [ 55%] 2025-12-04T15:22:22.8405821Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_narrow_cuda_float32 PASSED [0.0092s] [ 55%] 2025-12-04T15:22:22.8405968Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_native_dropout_backward_cuda_float32 PASSED [1.3749s] [ 55%] 2025-12-04T15:22:22.8406096Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_empty_cuda_float32 PASSED [0.0041s] [ 55%] 2025-12-04T15:22:22.8406221Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_full_cuda_float32 PASSED [1.3698s] [ 55%] 2025-12-04T15:22:22.8406381Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_avg_pool3d_cuda_float32 PASSED [0.0046s] [ 55%] 2025-12-04T15:22:22.8406533Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_alpha_dropout_cuda_float32 PASSED [1.3610s] [ 55%] 2025-12-04T15:22:22.8406679Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool3d_cuda_float32 PASSED [0.0041s] [ 55%] 2025-12-04T15:22:22.8406821Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv3d_cuda_float32 PASSED [1.3792s] [ 55%] 2025-12-04T15:22:22.8406967Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout2d_cuda_float32 PASSED [0.0064s] [ 56%] 2025-12-04T15:22:22.8407150Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_bilinear_cuda_float32 PASSED [1.3659s] [ 56%] 
2025-12-04T15:22:22.8407309Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_linear_cuda_float32 PASSED [0.0054s] [ 56%] 2025-12-04T15:22:22.8407481Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_trilinear_cuda_float32 PASSED [1.3810s] [ 56%] 2025-12-04T15:22:22.8407625Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_l1_loss_cuda_float32 PASSED [0.0041s] [ 56%] 2025-12-04T15:22:22.8407774Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_layer_norm_cuda_float32 PASSED [1.3478s] [ 56%] 2025-12-04T15:22:22.8407919Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_pool3d_cuda_float32 PASSED [0.0725s] [ 56%] 2025-12-04T15:22:22.8408067Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool1d_cuda_float32 PASSED [1.4140s] [ 56%] 2025-12-04T15:22:22.8408225Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool1d_grad_cuda_float32 PASSED [0.0223s] [ 56%] 2025-12-04T15:22:22.8408395Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multi_head_attention_forward_cuda_float32 PASSED [1.4681s] [ 56%] 2025-12-04T15:22:22.8408566Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.0041s] [ 56%] 2025-12-04T15:22:22.8408713Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_normalize_cuda_float32 PASSED [1.3905s] [ 56%] 2025-12-04T15:22:22.8408864Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_circular_cuda_float32 PASSED [0.0043s] [ 56%] 2025-12-04T15:22:22.8409023Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pairwise_distance_cuda_float32 PASSED [1.3710s] [ 56%] 2025-12-04T15:22:22.8409165Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_relu6_cuda_float32 PASSED 
[0.0035s] [ 56%] 2025-12-04T15:22:22.8409310Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_threshold_cuda_float32 PASSED [1.3602s] [ 56%] 2025-12-04T15:22:22.8409452Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_unfold_cuda_float32 PASSED [0.0222s] [ 56%] 2025-12-04T15:22:22.8409619Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_upsample_bilinear_cuda_float32 PASSED [1.3689s] [ 56%] 2025-12-04T15:22:22.8409741Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ormqr_cuda_float32 PASSED [0.0278s] [ 56%] 2025-12-04T15:22:22.8409874Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_permute_copy_cuda_float32 PASSED [1.3613s] [ 56%] 2025-12-04T15:22:22.8410020Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_0_cuda_float32 PASSED [0.0056s] [ 56%] 2025-12-04T15:22:22.8410198Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_2_cuda_float32 PASSED [1.3668s] [ 56%] 2025-12-04T15:22:22.8410341Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_4_cuda_float32 PASSED [0.0043s] [ 56%] 2025-12-04T15:22:22.8410470Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_reciprocal_cuda_float32 PASSED [1.3662s] [ 56%] 2025-12-04T15:22:22.8410610Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_repeat_interleave_cuda_float32 PASSED [0.0041s] [ 56%] 2025-12-04T15:22:22.8410731Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_roll_cuda_float32 PASSED [1.3838s] [ 56%] 2025-12-04T15:22:22.8410867Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_3_cuda_float32 PASSED [0.0035s] [ 56%] 2025-12-04T15:22:22.8411005Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_amax_cuda_float32 PASSED [1.3918s] [ 56%] 2025-12-04T15:22:22.8411142Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_short_cuda_float32 PASSED 
[0.0037s] [ 56%] 2025-12-04T15:22:22.8411295Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_nuttall_cuda_float32 PASSED [1.3672s] [ 56%] 2025-12-04T15:22:22.8411436Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signbit_cuda_float32 PASSED [0.0031s] [ 56%] 2025-12-04T15:22:22.8411596Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sparse_mm_reduce_cuda_float32 SKIPPED [0.0007s] (Only runs on cpu) [ 57%] 2025-12-04T15:22:22.8411733Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_bessel_j1_cuda_float32 PASSED [1.3612s] [ 57%] 2025-12-04T15:22:22.8411891Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_chebyshev_polynomial_v_cuda_float32 PASSED [0.0064s] [ 57%] 2025-12-04T15:22:22.8412027Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_log_ndtr_cuda_float32 PASSED [0.0026s] [ 57%] 2025-12-04T15:22:22.8412177Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_modified_bessel_k0_cuda_float32 PASSED [1.3545s] [ 57%] 2025-12-04T15:22:22.8412311Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_zeta_cuda_float32 PASSED [0.0044s] [ 57%] 2025-12-04T15:22:22.8412435Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_cuda_float32 PASSED [1.3624s] [ 57%] 2025-12-04T15:22:22.8412575Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_with_sizes_copy_cuda_float32 PASSED [0.0036s] [ 57%] 2025-12-04T15:22:22.8412699Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_square_cuda_float32 PASSED [1.3643s] [ 57%] 2025-12-04T15:22:22.8412821Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_squeeze_cuda_float32 PASSED [0.0084s] [ 57%] 2025-12-04T15:22:22.8412941Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_stft_cuda_float32 PASSED [1.3844s] [ 57%] 2025-12-04T15:22:22.8413058Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sum_cuda_float32 PASSED [0.0053s] [ 57%] 
2025-12-04T15:22:22.8413188Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_svd_lowrank_cuda_float32 PASSED [1.3901s] [ 57%] 2025-12-04T15:22:22.8413307Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_take_cuda_float32 PASSED [0.0045s] [ 57%] 2025-12-04T15:22:22.8413426Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tanh_cuda_float32 PASSED [1.4005s] [ 57%] 2025-12-04T15:22:22.8413574Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_triangular_solve_cuda_float32 PASSED [0.0090s] [ 57%] 2025-12-04T15:22:22.8413694Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_triu_cuda_float32 PASSED [1.3832s] [ 57%] 2025-12-04T15:22:22.8413813Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trunc_cuda_float32 PASSED [0.0032s] [ 57%] 2025-12-04T15:22:22.8413942Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unfold_copy_cuda_float32 PASSED [1.3740s] [ 57%] 2025-12-04T15:22:22.8414067Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unsqueeze_cuda_float32 PASSED [0.0087s] [ 57%] 2025-12-04T15:22:22.8414204Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_mean_unbiased_cuda_float32 PASSED [1.3750s] [ 57%] 2025-12-04T15:22:22.8414336Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_unbiased_cuda_float32 PASSED [0.0033s] [ 57%] 2025-12-04T15:22:22.8414455Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vdot_cuda_float32 PASSED [1.3702s] [ 57%] 2025-12-04T15:22:22.8414556Z test_ops.py::TestMathBitsCUDA::test_conj_view_T_cuda_complex64 PASSED [1.3570s] [ 57%] 2025-12-04T15:22:22.8414662Z test_ops.py::TestMathBitsCUDA::test_conj_view___rmul___cuda_complex64 PASSED [0.0140s] [ 57%] 2025-12-04T15:22:22.8414767Z test_ops.py::TestMathBitsCUDA::test_conj_view___rsub___cuda_complex64 PASSED [0.0108s] [ 57%] 2025-12-04T15:22:22.8414869Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_T_cuda_complex64 PASSED [1.3750s] [ 57%] 2025-12-04T15:22:22.8414997Z 
test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_float_cuda_complex64 PASSED [0.0055s] [ 57%] 2025-12-04T15:22:22.8415122Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_abs_cuda_complex64 PASSED [1.3951s] [ 57%] 2025-12-04T15:22:22.8415232Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_acos_cuda_complex64 PASSED [0.0050s] [ 57%] 2025-12-04T15:22:22.8415350Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addcmul_cuda_complex64 PASSED [1.3875s] [ 57%] 2025-12-04T15:22:22.8415459Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_all_cuda_complex64 PASSED [0.0079s] [ 57%] 2025-12-04T15:22:22.8415583Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_scatter_cuda_complex64 PASSED [1.3679s] [ 58%] 2025-12-04T15:22:22.8415698Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atleast_1d_cuda_complex64 PASSED [0.0065s] [ 58%] 2025-12-04T15:22:22.8415822Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_broadcast_tensors_cuda_complex64 PASSED [1.3769s] [ 58%] 2025-12-04T15:22:22.8415931Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_chunk_cuda_complex64 PASSED [0.0063s] [ 58%] 2025-12-04T15:22:22.8416040Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_clone_cuda_complex64 PASSED [1.3761s] [ 58%] 2025-12-04T15:22:22.8416161Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_constant_pad_nd_cuda_complex64 PASSED [0.0248s] [ 58%] 2025-12-04T15:22:22.8416270Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cos_cuda_complex64 PASSED [1.3748s] [ 58%] 2025-12-04T15:22:22.8416375Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cosh_cuda_complex64 PASSED [0.0050s] [ 58%] 2025-12-04T15:22:22.8416499Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diagonal_scatter_cuda_complex64 PASSED [0.0091s] [ 58%] 2025-12-04T15:22:22.8416655Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_cuda_complex64 SKIPPED [0.0002s] (Expected: empty is not comparable) [ 58%] 2025-12-04T15:22:22.8416761Z 
test_ops.py::TestMathBitsCUDA::test_conj_view__refs_eq_cuda_complex64 PASSED [1.3724s] [ 58%] 2025-12-04T15:22:22.8416868Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_flip_cuda_complex64 PASSED [0.0069s] [ 58%] 2025-12-04T15:22:22.8416975Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_imag_cuda_complex64 PASSED [1.3530s] [ 58%] 2025-12-04T15:22:22.8417086Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_copy_cuda_complex64 PASSED [0.0048s] [ 58%] 2025-12-04T15:22:22.8417203Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_select_cuda_complex64 PASSED [1.3744s] [ 58%] 2025-12-04T15:22:22.8417321Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isnan_cuda_complex64 PASSED [0.0039s] [ 58%] 2025-12-04T15:22:22.8417429Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_lerp_cuda_complex64 PASSED [0.0156s] [ 58%] 2025-12-04T15:22:22.8417544Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_cross_cuda_complex64 PASSED [1.3524s] [ 58%] 2025-12-04T15:22:22.8417660Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_and_cuda_complex64 PASSED [0.0067s] [ 58%] 2025-12-04T15:22:22.8417775Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_not_cuda_complex64 PASSED [1.3451s] [ 58%] 2025-12-04T15:22:22.8417887Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logsumexp_cuda_complex64 PASSED [0.0107s] [ 58%] 2025-12-04T15:22:22.8418001Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_narrow_copy_cuda_complex64 PASSED [1.3743s] [ 58%] 2025-12-04T15:22:22.8418106Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_neg_cuda_complex64 PASSED [0.0042s] [ 58%] 2025-12-04T15:22:22.8418291Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_empty_strided_cuda_complex64 SKIPPED [0.0002s] (Expected: empty_strided is not comparable) [ 58%] 2025-12-04T15:22:22.8418401Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_full_cuda_complex64 PASSED [1.3776s] [ 58%] 2025-12-04T15:22:22.8418552Z 
test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_log_softmax_with_dtype_cuda_complex64 PASSED [0.0057s] [ 58%] 2025-12-04T15:22:22.8418713Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_pairwise_distance_cuda_complex64 PASSED [1.3758s] [ 58%] 2025-12-04T15:22:22.8418830Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_permute_copy_cuda_complex64 PASSED [0.0050s] [ 58%] 2025-12-04T15:22:22.8418950Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_positive_cuda_complex64 PASSED [1.3690s] [ 58%] 2025-12-04T15:22:22.8419058Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_prod_cuda_complex64 PASSED [0.0207s] [ 58%] 2025-12-04T15:22:22.8419162Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sgn_cuda_complex64 PASSED [1.3713s] [ 58%] 2025-12-04T15:22:22.8419268Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sinh_cuda_complex64 PASSED [0.0041s] [ 59%] 2025-12-04T15:22:22.8419389Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_split_with_sizes_cuda_complex64 PASSED [1.3769s] [ 59%] 2025-12-04T15:22:22.8419494Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sqrt_cuda_complex64 PASSED [0.0059s] [ 59%] 2025-12-04T15:22:22.8419606Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_square_cuda_complex64 PASSED [1.3790s] [ 59%] 2025-12-04T15:22:22.8419714Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_squeeze_cuda_complex64 PASSED [0.0071s] [ 59%] 2025-12-04T15:22:22.8419840Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_squeeze_multiple_cuda_complex64 PASSED [1.3647s] [ 59%] 2025-12-04T15:22:22.8419951Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sum_to_size_cuda_complex64 PASSED [0.0094s] [ 59%] 2025-12-04T15:22:22.8420056Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_t_cuda_complex64 PASSED [1.3954s] [ 59%] 2025-12-04T15:22:22.8420188Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_to_cuda_complex64 PASSED [0.0148s] [ 59%] 2025-12-04T15:22:22.8420293Z 
test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tril_cuda_complex64 PASSED [1.3682s] [ 59%] 2025-12-04T15:22:22.8420404Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unbind_copy_cuda_complex64 PASSED [0.0086s] [ 59%] 2025-12-04T15:22:22.8420526Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unsqueeze_copy_cuda_complex64 PASSED [1.3649s] [ 59%] 2025-12-04T15:22:22.8420634Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_var_mean_cuda_complex64 PASSED [0.0165s] [ 59%] 2025-12-04T15:22:22.8420744Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_vsplit_cuda_complex64 PASSED [1.3823s] [ 59%] 2025-12-04T15:22:22.8420844Z test_ops.py::TestMathBitsCUDA::test_conj_view_acos_cuda_complex64 PASSED [0.0079s] [ 59%] 2025-12-04T15:22:22.8420956Z test_ops.py::TestMathBitsCUDA::test_conj_view_add_cuda_complex64 PASSED [1.3913s] [ 59%] 2025-12-04T15:22:22.8421057Z test_ops.py::TestMathBitsCUDA::test_conj_view_addcdiv_cuda_complex64 PASSED [0.0214s] [ 59%] 2025-12-04T15:22:22.8421158Z test_ops.py::TestMathBitsCUDA::test_conj_view_byte_cuda_complex64 PASSED [1.3752s] [ 59%] 2025-12-04T15:22:22.8421253Z test_ops.py::TestMathBitsCUDA::test_conj_view_cat_cuda_complex64 PASSED [0.0132s] [ 59%] 2025-12-04T15:22:22.8421365Z test_ops.py::TestMathBitsCUDA::test_conj_view_column_stack_cuda_complex64 PASSED [1.3779s] [ 59%] 2025-12-04T15:22:22.8421475Z test_ops.py::TestMathBitsCUDA::test_conj_view_count_nonzero_cuda_complex64 PASSED [0.0078s] [ 59%] 2025-12-04T15:22:22.8421574Z test_ops.py::TestMathBitsCUDA::test_conj_view_cross_cuda_complex64 PASSED [1.3663s] [ 59%] 2025-12-04T15:22:22.8421677Z test_ops.py::TestMathBitsCUDA::test_conj_view_dsplit_cuda_complex64 PASSED [0.0050s] [ 59%] 2025-12-04T15:22:22.8421777Z test_ops.py::TestMathBitsCUDA::test_conj_view_dstack_cuda_complex64 PASSED [1.3760s] [ 59%] 2025-12-04T15:22:22.8421878Z test_ops.py::TestMathBitsCUDA::test_conj_view_equal_cuda_complex64 PASSED [0.0050s] [ 59%] 2025-12-04T15:22:22.8421975Z 
test_ops.py::TestMathBitsCUDA::test_conj_view_exp2_cuda_complex64 PASSED [1.3838s] [ 59%] 2025-12-04T15:22:22.8422071Z test_ops.py::TestMathBitsCUDA::test_conj_view_exp_cuda_complex64 PASSED [0.0075s] [ 59%] 2025-12-04T15:22:22.8422181Z test_ops.py::TestMathBitsCUDA::test_conj_view_eye_cuda_complex64 SKIPPED [0.0002s] (Skipped!) [ 59%] 2025-12-04T15:22:22.8422307Z test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fft_cuda_complex64 PASSED [1.4072s] [ 59%] 2025-12-04T15:22:22.8422404Z test_ops.py::TestMathBitsCUDA::test_conj_view_fill_cuda_complex64 PASSED [1.3701s] [ 59%] 2025-12-04T15:22:22.8422515Z test_ops.py::TestMathBitsCUDA::test_conj_view_flip_cuda_complex64 PASSED [0.0125s] [ 59%] 2025-12-04T15:22:22.8422611Z test_ops.py::TestMathBitsCUDA::test_conj_view_full_cuda_complex64 XFAIL [0.0027s] [ 59%] 2025-12-04T15:22:22.8422719Z test_ops.py::TestMathBitsCUDA::test_conj_view_index_copy_cuda_complex64 PASSED [1.3690s] [ 60%] 2025-12-04T15:22:22.8422815Z test_ops.py::TestMathBitsCUDA::test_conj_view_item_cuda_complex64 PASSED [0.0054s] [ 60%] 2025-12-04T15:22:22.8422954Z test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_4inputs_with_extra_args_cuda_complex64 XFAIL [0.0032s] [ 60%] 2025-12-04T15:22:22.8423065Z test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_binary_cuda_complex64 XFAIL [1.4980s] [ 60%] 2025-12-04T15:22:22.8423164Z test_ops.py::TestMathBitsCUDA::test_conj_view_ldexp_cuda_complex64 XFAIL [0.0040s] [ 60%] 2025-12-04T15:22:22.8423276Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_diagonal_cuda_complex64 PASSED [2.7933s] [ 60%] 2025-12-04T15:22:22.8423397Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_factor_ex_cuda_complex64 PASSED [0.0343s] [ 60%] 2025-12-04T15:22:22.8423508Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_solve_cuda_complex64 PASSED [0.2913s] [ 60%] 2025-12-04T15:22:22.8423622Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_norm_cuda_complex64 PASSED [1.4279s] [ 60%] 
2025-12-04T15:22:22.8423821Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_pinv_singular_cuda_complex64 SKIPPED [0.0012s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 60%] 2025-12-04T15:22:22.8423931Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vander_cuda_complex64 PASSED [1.3737s] [ 60%] 2025-12-04T15:22:22.8424041Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vecdot_cuda_complex64 PASSED [0.0460s] [ 60%] 2025-12-04T15:22:22.8424140Z test_ops.py::TestMathBitsCUDA::test_conj_view_log2_cuda_complex64 PASSED [1.3947s] [ 60%] 2025-12-04T15:22:22.8424251Z test_ops.py::TestMathBitsCUDA::test_conj_view_logcumsumexp_cuda_complex64 PASSED [0.0112s] [ 60%] 2025-12-04T15:22:22.8424350Z test_ops.py::TestMathBitsCUDA::test_conj_view_logdet_cuda_complex64 PASSED [1.4093s] [ 60%] 2025-12-04T15:22:22.8424466Z test_ops.py::TestMathBitsCUDA::test_conj_view_lu_unpack_cuda_complex64 PASSED [0.0287s] [ 60%] 2025-12-04T15:22:22.8424563Z test_ops.py::TestMathBitsCUDA::test_conj_view_mH_cuda_complex64 PASSED [0.0079s] [ 60%] 2025-12-04T15:22:22.8424670Z test_ops.py::TestMathBitsCUDA::test_conj_view_masked_prod_cuda_complex64 PASSED [0.1186s] [ 60%] 2025-12-04T15:22:22.8424775Z test_ops.py::TestMathBitsCUDA::test_conj_view_masked_std_cuda_complex64 PASSED [0.1077s] [ 60%] 2025-12-04T15:22:22.8424877Z test_ops.py::TestMathBitsCUDA::test_conj_view_nanmean_cuda_complex64 PASSED [1.3958s] [ 60%] 2025-12-04T15:22:22.8424978Z test_ops.py::TestMathBitsCUDA::test_conj_view_narrow_cuda_complex64 XFAIL [0.0065s] [ 60%] 2025-12-04T15:22:22.8425098Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv1d_cuda_complex64 PASSED [2.8087s] [ 60%] 2025-12-04T15:22:22.8425226Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_circular_cuda_complex64 PASSED [1.3975s] [ 60%] 2025-12-04T15:22:22.8425369Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_replicate_negative_cuda_complex64 PASSED [0.0104s] [ 60%] 
2025-12-04T15:22:22.8425495Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_silu_complex_cuda_complex64 PASSED [1.3712s] [ 60%] 2025-12-04T15:22:22.8425618Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_tanhshrink_cuda_complex64 PASSED [0.0094s] [ 60%] 2025-12-04T15:22:22.8425716Z test_ops.py::TestMathBitsCUDA::test_conj_view_pow_cuda_complex64 PASSED [0.0152s] [ 60%] 2025-12-04T15:22:22.8425826Z test_ops.py::TestMathBitsCUDA::test_conj_view_ravel_cuda_complex64 PASSED [1.3675s] [ 60%] 2025-12-04T15:22:22.8425938Z test_ops.py::TestMathBitsCUDA::test_conj_view_roll_cuda_complex64 PASSED [0.0183s] [ 60%] 2025-12-04T15:22:22.8426046Z test_ops.py::TestMathBitsCUDA::test_conj_view_rsub_cuda_complex64 PASSED [1.3920s] [ 60%] 2025-12-04T15:22:22.8426142Z test_ops.py::TestMathBitsCUDA::test_conj_view_sinh_cuda_complex64 PASSED [0.0054s] [ 60%] 2025-12-04T15:22:22.8426239Z test_ops.py::TestMathBitsCUDA::test_conj_view_stack_cuda_complex64 XFAIL [0.0047s] [ 61%] 2025-12-04T15:22:22.8426344Z test_ops.py::TestMathBitsCUDA::test_conj_view_sum_to_size_cuda_complex64 PASSED [1.3729s] [ 61%] 2025-12-04T15:22:22.8426448Z test_ops.py::TestMathBitsCUDA::test_conj_view_to_sparse_cuda_complex64 PASSED [0.0054s] [ 61%] 2025-12-04T15:22:22.8426549Z test_ops.py::TestMathBitsCUDA::test_conj_view_trapz_cuda_complex64 PASSED [1.3797s] [ 61%] 2025-12-04T15:22:22.8426666Z test_ops.py::TestMathBitsCUDA::test_conj_view_triangular_solve_cuda_complex64 PASSED [0.0138s] [ 61%] 2025-12-04T15:22:22.8426766Z test_ops.py::TestMathBitsCUDA::test_conj_view_tril_cuda_complex64 PASSED [1.3772s] [ 61%] 2025-12-04T15:22:22.8426861Z test_ops.py::TestMathBitsCUDA::test_conj_view_triu_cuda_complex64 PASSED [0.0130s] [ 61%] 2025-12-04T15:22:22.8426973Z test_ops.py::TestMathBitsCUDA::test_conj_view_unsqueeze_copy_cuda_complex64 PASSED [1.3742s] [ 61%] 2025-12-04T15:22:22.8427082Z test_ops.py::TestMathBitsCUDA::test_conj_view_var_unbiased_cuda_complex64 PASSED [0.0058s] [ 61%] 
2025-12-04T15:22:22.8427184Z test_ops.py::TestMathBitsCUDA::test_conj_view_view_copy_cuda_complex64 PASSED [1.3721s] [ 61%] 2025-12-04T15:22:22.8427285Z test_ops.py::TestMathBitsCUDA::test_conj_view_vsplit_cuda_complex64 PASSED [0.0050s] [ 61%] 2025-12-04T15:22:22.8427384Z test_ops.py::TestMathBitsCUDA::test_conj_view_vstack_cuda_complex64 PASSED [1.4129s] [ 61%] 2025-12-04T15:22:22.8427484Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_T_cuda_complex128 PASSED [0.0065s] [ 61%] 2025-12-04T15:22:22.8427597Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rmatmul___cuda_complex128 PASSED [1.3710s] [ 61%] 2025-12-04T15:22:22.8427705Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rpow___cuda_complex128 PASSED [0.0080s] [ 61%] 2025-12-04T15:22:22.8427841Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_bfloat16_cuda_complex128 PASSED [1.3733s] [ 61%] 2025-12-04T15:22:22.8427978Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_byte_cuda_complex128 PASSED [0.0035s] [ 61%] 2025-12-04T15:22:22.8428113Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_cdouble_cuda_complex128 PASSED [1.3713s] [ 61%] 2025-12-04T15:22:22.8428240Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_half_cuda_complex128 PASSED [0.0039s] [ 61%] 2025-12-04T15:22:22.8428365Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_long_cuda_complex128 PASSED [1.3715s] [ 61%] 2025-12-04T15:22:22.8428476Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_add_cuda_complex128 PASSED [0.0043s] [ 61%] 2025-12-04T15:22:22.8428597Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_alias_copy_cuda_complex128 PASSED [1.3665s] [ 61%] 2025-12-04T15:22:22.8428727Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_scatter_cuda_complex128 PASSED [0.0039s] [ 61%] 2025-12-04T15:22:22.8428845Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atleast_2d_cuda_complex128 PASSED 
[1.3642s] [ 61%] 2025-12-04T15:22:22.8428972Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_broadcast_tensors_cuda_complex128 PASSED [0.0044s] [ 61%] 2025-12-04T15:22:22.8429093Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_broadcast_to_cuda_complex128 PASSED [1.3823s] [ 61%] 2025-12-04T15:22:22.8429202Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_cat_cuda_complex128 PASSED [0.0043s] [ 61%] 2025-12-04T15:22:22.8429315Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_chunk_cuda_complex128 PASSED [1.3730s] [ 61%] 2025-12-04T15:22:22.8429459Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_column_stack_cuda_complex128 PASSED [0.0042s] [ 61%] 2025-12-04T15:22:22.8429569Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_conj_cuda_complex128 PASSED [1.3726s] [ 61%] 2025-12-04T15:22:22.8429699Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_conj_physical_cuda_complex128 PASSED [0.0040s] [ 61%] 2025-12-04T15:22:22.8429808Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diag_cuda_complex128 PASSED [1.3789s] [ 61%] 2025-12-04T15:22:22.8429930Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_copy_cuda_complex128 PASSED [0.0037s] [ 62%] 2025-12-04T15:22:22.8430056Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_scatter_cuda_complex128 PASSED [1.3657s] [ 62%] 2025-12-04T15:22:22.8430199Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_exp_cuda_complex128 PASSED [0.0040s] [ 62%] 2025-12-04T15:22:22.8430313Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fftn_cuda_complex128 PASSED [1.3708s] [ 62%] 2025-12-04T15:22:22.8430431Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfft2_cuda_complex128 PASSED [0.0039s] [ 62%] 2025-12-04T15:22:22.8430540Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_imag_cuda_complex128 PASSED [1.3885s] [ 62%] 2025-12-04T15:22:22.8430657Z 
test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_add_cuda_complex128 PASSED [0.0039s] [ 62%] 2025-12-04T15:22:22.8430773Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isfinite_cuda_complex128 PASSED [1.3665s] [ 62%] 2025-12-04T15:22:22.8430881Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_lerp_cuda_complex128 PASSED [0.0043s] [ 62%] 2025-12-04T15:22:22.8430997Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logaddexp_cuda_complex128 PASSED [1.3648s] [ 62%] 2025-12-04T15:22:22.8431116Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_not_cuda_complex128 PASSED [0.0036s] [ 62%] 2025-12-04T15:22:22.8431225Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_mul_cuda_complex128 PASSED [1.3866s] [ 62%] 2025-12-04T15:22:22.8431338Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_narrow_cuda_complex128 PASSED [0.0038s] [ 62%] 2025-12-04T15:22:22.8431504Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_empty_cuda_complex128 SKIPPED [0.0002s] (Expected: empty is not comparable) [ 62%] 2025-12-04T15:22:22.8431655Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_l1_loss_cuda_complex128 PASSED [1.3929s] [ 62%] 2025-12-04T15:22:22.8431804Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_pairwise_distance_cuda_complex128 PASSED [0.0039s] [ 62%] 2025-12-04T15:22:22.8431952Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_pixel_unshuffle_cuda_complex128 PASSED [1.3811s] [ 62%] 2025-12-04T15:22:22.8432097Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_softmin_with_dtype_cuda_complex128 PASSED [0.0038s] [ 62%] 2025-12-04T15:22:22.8432238Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_tanhshrink_cuda_complex128 PASSED [1.3651s] [ 62%] 2025-12-04T15:22:22.8432347Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_norm_cuda_complex128 PASSED [0.0037s] [ 62%] 
2025-12-04T15:22:22.8432463Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_positive_cuda_complex128 PASSED [1.3863s] [ 62%] 2025-12-04T15:22:22.8432577Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_repeat_cuda_complex128 PASSED [0.0037s] [ 62%] 2025-12-04T15:22:22.8432691Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_reshape_cuda_complex128 PASSED [1.3871s] [ 62%] 2025-12-04T15:22:22.8432838Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_special_log_softmax_with_dtype_cuda_complex128 PASSED [0.0037s] [ 62%] 2025-12-04T15:22:22.8432946Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_stft_cuda_complex128 PASSED [1.4087s] [ 62%] 2025-12-04T15:22:22.8433077Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sum_to_size_cuda_complex128 PASSED [1.3678s] [ 62%] 2025-12-04T15:22:22.8433198Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_t_copy_cuda_complex128 PASSED [0.0037s] [ 62%] 2025-12-04T15:22:22.8433331Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_tensor_split_cuda_complex128 PASSED [1.3778s] [ 62%] 2025-12-04T15:22:22.8433452Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_true_divide_cuda_complex128 PASSED [0.0042s] [ 62%] 2025-12-04T15:22:22.8433571Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unsqueeze_cuda_complex128 PASSED [1.3786s] [ 62%] 2025-12-04T15:22:22.8433683Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_var_mean_cuda_complex128 PASSED [0.0046s] [ 62%] 2025-12-04T15:22:22.8433792Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_vdot_cuda_complex128 PASSED [1.4146s] [ 62%] 2025-12-04T15:22:22.8433938Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__unsafe_masked_index_put_accumulate_cuda_complex128 PASSED [0.0064s] [ 63%] 2025-12-04T15:22:22.8434049Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addcdiv_cuda_complex128 PASSED [1.3806s] [ 63%] 2025-12-04T15:22:22.8434155Z 
test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addr_cuda_complex128 PASSED [1.3828s] [ 63%] 2025-12-04T15:22:22.8434265Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_allclose_cuda_complex128 PASSED [0.0036s] [ 63%] 2025-12-04T15:22:22.8434368Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_any_cuda_complex128 PASSED [1.3978s] [ 63%] 2025-12-04T15:22:22.8434478Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_1d_cuda_complex128 PASSED [0.0050s] [ 63%] 2025-12-04T15:22:22.8434585Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cdouble_cuda_complex128 PASSED [1.3629s] [ 63%] 2025-12-04T15:22:22.8434690Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cfloat_cuda_complex128 PASSED [0.0050s] [ 63%] 2025-12-04T15:22:22.8434797Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_clone_cuda_complex128 PASSED [1.3927s] [ 63%] 2025-12-04T15:22:22.8434902Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_conj_cuda_complex128 PASSED [0.0051s] [ 63%] 2025-12-04T15:22:22.8435005Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cos_cuda_complex128 PASSED [1.3835s] [ 63%] 2025-12-04T15:22:22.8435120Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_count_nonzero_cuda_complex128 PASSED [1.3746s] [ 63%] 2025-12-04T15:22:22.8435228Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumprod_cuda_complex128 PASSED [0.0057s] [ 63%] 2025-12-04T15:22:22.8435346Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumsum_cuda_complex128 PASSED [1.3877s] [ 63%] 2025-12-04T15:22:22.8435474Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumulative_trapezoid_cuda_complex128 PASSED [0.0060s] [ 63%] 2025-12-04T15:22:22.8435583Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diag_embed_cuda_complex128 PASSED [1.3867s] [ 63%] 2025-12-04T15:22:22.8435687Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dist_cuda_complex128 PASSED [0.0049s] [ 63%] 2025-12-04T15:22:22.8435792Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dot_cuda_complex128 PASSED [1.3817s] [ 
63%] 2025-12-04T15:22:22.8435897Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dsplit_cuda_complex128 PASSED [0.0043s] [ 63%] 2025-12-04T15:22:22.8436026Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_like_cuda_complex128 SKIPPED [0.0002s] (Skipped!) [ 63%] 2025-12-04T15:22:22.8436161Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_permuted_cuda_complex128 SKIPPED [0.0001s] (Skipped!) [ 63%] 2025-12-04T15:22:22.8436262Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_eq_cuda_complex128 PASSED [0.0026s] [ 63%] 2025-12-04T15:22:22.8436374Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_copy_cuda_complex128 PASSED [1.4005s] [ 63%] 2025-12-04T15:22:22.8436479Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_cuda_complex128 PASSED [0.0051s] [ 63%] 2025-12-04T15:22:22.8436591Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_fftshift_cuda_complex128 PASSED [1.3880s] [ 63%] 2025-12-04T15:22:22.8436723Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_hfftn_cuda_complex128 PASSED [0.5583s] [ 63%] 2025-12-04T15:22:22.8436833Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_irfftn_cuda_complex128 PASSED [1.3856s] [ 63%] 2025-12-04T15:22:22.8436950Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_flip_cuda_complex128 PASSED [1.3839s] [ 63%] 2025-12-04T15:22:22.8437056Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fliplr_cuda_complex128 PASSED [0.0050s] [ 63%] 2025-12-04T15:22:22.8437168Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_float_power_cuda_complex128 PASSED [1.3780s] [ 63%] 2025-12-04T15:22:22.8437272Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_hsplit_cuda_complex128 PASSED [0.0040s] [ 63%] 2025-12-04T15:22:22.8437382Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_fill_cuda_complex128 PASSED [1.3744s] [ 64%] 2025-12-04T15:22:22.8437565Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_put_cuda_complex128 SKIPPED [0.0017s] (Operation not tested with tensors with negative 
bit.) [ 64%] 2025-12-04T15:22:22.8437669Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_int_cuda_complex128 PASSED [1.3877s] [ 64%] 2025-12-04T15:22:22.8437775Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isinf_cuda_complex128 PASSED [0.0035s] [ 64%] 2025-12-04T15:22:22.8437878Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_item_cuda_complex128 PASSED [1.3801s] [ 64%] 2025-12-04T15:22:22.8438002Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cholesky_ex_cuda_complex128 PASSED [0.0133s] [ 64%] 2025-12-04T15:22:22.8438222Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_householder_product_cuda_complex128 SKIPPED [0.0009s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 64%] 2025-12-04T15:22:22.8438357Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lstsq_grad_oriented_cuda_complex128 PASSED [1.4998s] [ 64%] 2025-12-04T15:22:22.8438481Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_factor_ex_cuda_complex128 PASSED [0.0494s] [ 64%] 2025-12-04T15:22:22.8438605Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_power_cuda_complex128 PASSED [1.3770s] [ 64%] 2025-12-04T15:22:22.8438730Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_pinv_hermitian_cuda_complex128 PASSED [0.0595s] [ 64%] 2025-12-04T15:22:22.8438945Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_pinv_singular_cuda_complex128 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 64%] 2025-12-04T15:22:22.8439063Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_slogdet_cuda_complex128 PASSED [0.0027s] [ 64%] 2025-12-04T15:22:22.8439180Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_svdvals_cuda_complex128 PASSED [1.3985s] [ 64%] 2025-12-04T15:22:22.8439303Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_tensorsolve_cuda_complex128 PASSED [0.0185s] [ 64%] 2025-12-04T15:22:22.8439424Z 
test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_vector_norm_cuda_complex128 PASSED [1.3797s] [ 64%] 2025-12-04T15:22:22.8439534Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linspace_cuda_complex128 XFAIL [0.0031s] [ 64%] 2025-12-04T15:22:22.8439645Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logaddexp_cuda_complex128 PASSED [1.3691s] [ 64%] 2025-12-04T15:22:22.8439757Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_xor_cuda_complex128 PASSED [0.1745s] [ 64%] 2025-12-04T15:22:22.8439865Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logspace_cuda_complex128 XFAIL [0.0029s] [ 64%] 2025-12-04T15:22:22.8439997Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logspace_tensor_overload_cuda_complex128 PASSED [1.3668s] [ 64%] 2025-12-04T15:22:22.8440127Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_cuda_complex128 PASSED [0.0113s] [ 64%] 2025-12-04T15:22:22.8440237Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_solve_cuda_complex128 PASSED [1.3943s] [ 64%] 2025-12-04T15:22:22.8440367Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_unpack_cuda_complex128 PASSED [0.0043s] [ 64%] 2025-12-04T15:22:22.8440482Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mT_cuda_complex128 PASSED [1.3702s] [ 64%] 2025-12-04T15:22:22.8440610Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_cumprod_cuda_complex128 PASSED [0.0061s] [ 64%] 2025-12-04T15:22:22.8440731Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_logsumexp_cuda_complex128 PASSED [1.3790s] [ 64%] 2025-12-04T15:22:22.8440862Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_meshgrid_list_of_tensors_cuda_complex128 PASSED [0.0038s] [ 64%] 2025-12-04T15:22:22.8440974Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_narrow_copy_cuda_complex128 PASSED [1.3726s] [ 64%] 2025-12-04T15:22:22.8441099Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv3d_cuda_complex128 PASSED [0.0078s] [ 64%] 2025-12-04T15:22:22.8441237Z 
test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv_transpose1d_cuda_complex128 PASSED [1.3871s] [ 64%] 2025-12-04T15:22:22.8441372Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_constant_cuda_complex128 PASSED [0.0050s] [ 64%] 2025-12-04T15:22:22.8441503Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_silu_complex_cuda_complex128 PASSED [1.3856s] [ 65%] 2025-12-04T15:22:22.8441615Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_inf_cuda_complex128 PASSED [0.0048s] [ 65%] 2025-12-04T15:22:22.8441724Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_nuc_cuda_complex128 PASSED [1.3788s] [ 65%] 2025-12-04T15:22:22.8441829Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_pow_cuda_complex128 PASSED [0.0059s] [ 65%] 2025-12-04T15:22:22.8441932Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_prod_cuda_complex128 PASSED [1.3868s] [ 65%] 2025-12-04T15:22:22.8442038Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_renorm_cuda_complex128 PASSED [0.0058s] [ 65%] 2025-12-04T15:22:22.8442140Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sin_cuda_complex128 PASSED [1.3696s] [ 65%] 2025-12-04T15:22:22.8442247Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_slice_cuda_complex128 PASSED [0.0048s] [ 65%] 2025-12-04T15:22:22.8442350Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sqrt_cuda_complex128 PASSED [1.3982s] [ 65%] 2025-12-04T15:22:22.8442456Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_square_cuda_complex128 PASSED [0.0070s] [ 65%] 2025-12-04T15:22:22.8442569Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_cuda_complex128 PASSED [1.3177s] [ 65%] 2025-12-04T15:22:22.8442681Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_svd_lowrank_cuda_complex128 PASSED [0.0530s] [ 65%] 2025-12-04T15:22:22.8442785Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_t_copy_cuda_complex128 PASSED [1.3318s] [ 65%] 2025-12-04T15:22:22.8442889Z 
test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tan_cuda_complex128 PASSED [0.0053s] [ 65%] 2025-12-04T15:22:22.8442994Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tanh_cuda_complex128 PASSED [1.5254s] [ 65%] 2025-12-04T15:22:22.8443098Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tile_cuda_complex128 PASSED [1.3210s] [ 65%] 2025-12-04T15:22:22.8443209Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_transpose_cuda_complex128 PASSED [0.0050s] [ 65%] 2025-12-04T15:22:22.8443320Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unflatten_cuda_complex128 PASSED [1.3288s] [ 65%] 2025-12-04T15:22:22.8443438Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unsqueeze_copy_cuda_complex128 PASSED [0.0049s] [ 65%] 2025-12-04T15:22:22.8443548Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unsqueeze_cuda_complex128 PASSED [1.3189s] [ 65%] 2025-12-04T15:22:22.8443667Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_mean_unbiased_cuda_complex128 PASSED [0.0038s] [ 65%] 2025-12-04T15:22:22.8443778Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_unbiased_cuda_complex128 PASSED [1.3179s] [ 65%] 2025-12-04T15:22:22.8443873Z test_ops.py::TestMathBitsCUDA::test_neg_view_H_cuda_float64 PASSED [0.0058s] [ 65%] 2025-12-04T15:22:22.8443986Z test_ops.py::TestMathBitsCUDA::test_neg_view_T_cuda_float64 PASSED [1.3343s] [ 65%] 2025-12-04T15:22:22.8444086Z test_ops.py::TestMathBitsCUDA::test_neg_view___radd___cuda_float64 PASSED [0.0126s] [ 65%] 2025-12-04T15:22:22.8444195Z test_ops.py::TestMathBitsCUDA::test_neg_view___rsub___cuda_float64 PASSED [0.0099s] [ 65%] 2025-12-04T15:22:22.8444292Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_T_cuda_float64 PASSED [1.3853s] [ 65%] 2025-12-04T15:22:22.8444410Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_byte_cuda_float64 PASSED [0.0045s] [ 65%] 2025-12-04T15:22:22.8444511Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_abs_cuda_float64 PASSED [1.3870s] [ 65%] 
2025-12-04T15:22:22.8444612Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_acos_cuda_float64 PASSED [0.0049s] [ 65%] 2025-12-04T15:22:22.8444712Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_add_cuda_float64 PASSED [0.0078s] [ 65%] 2025-12-04T15:22:22.8444823Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_alias_copy_cuda_float64 PASSED [1.4088s] [ 66%] 2025-12-04T15:22:22.8444922Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_all_cuda_float64 PASSED [0.0080s] [ 66%] 2025-12-04T15:22:22.8445024Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_amin_cuda_float64 PASSED [0.0100s] [ 66%] 2025-12-04T15:22:22.8445132Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atleast_1d_cuda_float64 PASSED [1.3976s] [ 66%] 2025-12-04T15:22:22.8445252Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_broadcast_tensors_cuda_float64 PASSED [0.0045s] [ 66%] 2025-12-04T15:22:22.8445357Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_copysign_cuda_float64 PASSED [0.0116s] [ 66%] 2025-12-04T15:22:22.8445462Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cumprod_cuda_float64 PASSED [0.0064s] [ 66%] 2025-12-04T15:22:22.8445564Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_dstack_cuda_float64 PASSED [0.0040s] [ 66%] 2025-12-04T15:22:22.8445720Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_cuda_float64 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 66%] 2025-12-04T15:22:22.8445890Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_strided_cuda_float64 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 66%] 2025-12-04T15:22:22.8445992Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_eq_cuda_float64 PASSED [0.0059s] [ 66%] 2025-12-04T15:22:22.8446108Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expand_as_cuda_float64 PASSED [1.3909s] [ 66%] 2025-12-04T15:22:22.8446212Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expm1_cuda_float64 PASSED [0.0040s] [ 66%] 2025-12-04T15:22:22.8446315Z 
test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fft_cuda_float64 PASSED [1.4355s] [ 66%] 2025-12-04T15:22:22.8446421Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ifftn_cuda_float64 PASSED [1.3989s] [ 66%] 2025-12-04T15:22:22.8446528Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_irfftn_cuda_float64 PASSED [0.0080s] [ 66%] 2025-12-04T15:22:22.8446634Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_flatten_cuda_float64 PASSED [0.0046s] [ 66%] 2025-12-04T15:22:22.8446737Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fliplr_cuda_float64 PASSED [0.0026s] [ 66%] 2025-12-04T15:22:22.8446840Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_igamma_cuda_float64 PASSED [0.0064s] [ 66%] 2025-12-04T15:22:22.8446945Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_igammac_cuda_float64 PASSED [0.0063s] [ 66%] 2025-12-04T15:22:22.8447051Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_add_cuda_float64 PASSED [1.3826s] [ 66%] 2025-12-04T15:22:22.8447156Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isfinite_cuda_float64 PASSED [0.0047s] [ 66%] 2025-12-04T15:22:22.8447258Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isinf_cuda_float64 PASSED [1.3832s] [ 66%] 2025-12-04T15:22:22.8447361Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_lgamma_cuda_float64 PASSED [0.0049s] [ 66%] 2025-12-04T15:22:22.8447495Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_diagonal_cuda_float64 PASSED [1.4146s] [ 66%] 2025-12-04T15:22:22.8447598Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log10_cuda_float64 PASSED [0.0050s] [ 66%] 2025-12-04T15:22:22.8447707Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log2_cuda_float64 PASSED [1.3921s] [ 66%] 2025-12-04T15:22:22.8447816Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logaddexp2_cuda_float64 PASSED [0.0055s] [ 66%] 2025-12-04T15:22:22.8447923Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logsumexp_cuda_float64 PASSED [1.3829s] [ 66%] 
2025-12-04T15:22:22.8448030Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nan_to_num_cuda_float64 PASSED [0.0046s] [ 66%] 2025-12-04T15:22:22.8448129Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_neg_cuda_float64 PASSED [1.3630s] [ 66%] 2025-12-04T15:22:22.8448306Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_empty_strided_cuda_float64 SKIPPED [0.0002s] (Expected: empty_strided is not comparable) [ 66%] 2025-12-04T15:22:22.8448413Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_ones_cuda_float64 PASSED [1.3789s] [ 67%] 2025-12-04T15:22:22.8448519Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nextafter_cuda_float64 PASSED [0.0080s] [ 67%] 2025-12-04T15:22:22.8448701Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_alpha_dropout_cuda_float64 SKIPPED [0.0002s] (Expected: dropout is not comparable) [ 67%] 2025-12-04T15:22:22.8448819Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_elu_cuda_float64 PASSED [0.0034s] [ 67%] 2025-12-04T15:22:22.8448947Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_group_norm_cuda_float64 PASSED [1.3993s] [ 67%] 2025-12-04T15:22:22.8449073Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_huber_loss_cuda_float64 PASSED [0.0076s] [ 67%] 2025-12-04T15:22:22.8449198Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_layer_norm_cuda_float64 PASSED [1.3805s] [ 67%] 2025-12-04T15:22:22.8449324Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_leaky_relu_cuda_float64 PASSED [0.0070s] [ 67%] 2025-12-04T15:22:22.8449466Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_log_softmax_with_dtype_cuda_float64 PASSED [1.4184s] [ 67%] 2025-12-04T15:22:22.8449589Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_nll_loss_cuda_float64 PASSED [0.0349s] [ 67%] 2025-12-04T15:22:22.8449718Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_relu6_cuda_float64 PASSED [1.4054s] [ 67%] 
2025-12-04T15:22:22.8449849Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_smooth_l1_loss_cuda_float64 PASSED [0.0074s] [ 67%] 2025-12-04T15:22:22.8449963Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_normal__in_place_cuda_float64 XFAIL [0.0031s] [ 67%] 2025-12-04T15:22:22.8450075Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_permute_copy_cuda_float64 PASSED [1.3945s] [ 67%] 2025-12-04T15:22:22.8450214Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_positive_cuda_float64 PASSED [0.0037s] [ 67%] 2025-12-04T15:22:22.8450318Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ravel_cuda_float64 PASSED [1.4058s] [ 67%] 2025-12-04T15:22:22.8450428Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_reciprocal_cuda_float64 PASSED [0.0049s] [ 67%] 2025-12-04T15:22:22.8450540Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_select_scatter_cuda_float64 PASSED [1.3720s] [ 67%] 2025-12-04T15:22:22.8450641Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sign_cuda_float64 PASSED [0.0042s] [ 67%] 2025-12-04T15:22:22.8450759Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_bessel_j1_cuda_float64 PASSED [1.4070s] [ 67%] 2025-12-04T15:22:22.8450869Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_erfcx_cuda_float64 PASSED [0.0065s] [ 67%] 2025-12-04T15:22:22.8450978Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_i1_cuda_float64 PASSED [1.4058s] [ 67%] 2025-12-04T15:22:22.8451152Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_5_cuda_float64 PASSED [0.0076s] [ 67%] 2025-12-04T15:22:22.8451264Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_ndtri_cuda_float64 PASSED [1.4023s] [ 67%] 2025-12-04T15:22:22.8451380Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_squeeze_cuda_float64 PASSED [0.0073s] [ 67%] 2025-12-04T15:22:22.8451482Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_stft_cuda_float64 PASSED [1.7096s] [ 67%] 2025-12-04T15:22:22.8451580Z 
test_ops.py::TestMathBitsCUDA::test_neg_view__refs_t_cuda_float64 PASSED [1.3708s] [ 67%] 2025-12-04T15:22:22.8451692Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_take_along_dim_cuda_float64 PASSED [0.0076s] [ 67%] 2025-12-04T15:22:22.8451790Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_to_cuda_float64 PASSED [1.3639s] [ 67%] 2025-12-04T15:22:22.8451893Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_where_cuda_float64 PASSED [0.0074s] [ 67%] 2025-12-04T15:22:22.8451996Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_xlogy_cuda_float64 PASSED [0.0060s] [ 67%] 2025-12-04T15:22:22.8452096Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_zeros_cuda_float64 XFAIL [0.0024s] [ 68%] 2025-12-04T15:22:22.8452192Z test_ops.py::TestMathBitsCUDA::test_neg_view_addbmm_cuda_float64 PASSED [2.6990s] [ 68%] 2025-12-04T15:22:22.8452290Z test_ops.py::TestMathBitsCUDA::test_neg_view_allclose_cuda_float64 PASSED [0.0110s] [ 68%] 2025-12-04T15:22:22.8452385Z test_ops.py::TestMathBitsCUDA::test_neg_view_amin_cuda_float64 PASSED [1.3825s] [ 68%] 2025-12-04T15:22:22.8452479Z test_ops.py::TestMathBitsCUDA::test_neg_view_any_cuda_float64 PASSED [0.0073s] [ 68%] 2025-12-04T15:22:22.8452576Z test_ops.py::TestMathBitsCUDA::test_neg_view_argsort_cuda_float64 PASSED [1.3618s] [ 68%] 2025-12-04T15:22:22.8452738Z test_ops.py::TestMathBitsCUDA::test_neg_view_as_strided_partial_views_cuda_float64 SKIPPED [0.0002s] (Test changes in memory layout) [ 68%] 2025-12-04T15:22:22.8452834Z test_ops.py::TestMathBitsCUDA::test_neg_view_asin_cuda_float64 PASSED [1.3601s] [ 68%] 2025-12-04T15:22:22.8452931Z test_ops.py::TestMathBitsCUDA::test_neg_view_atan2_cuda_float64 PASSED [0.0158s] [ 68%] 2025-12-04T15:22:22.8453026Z test_ops.py::TestMathBitsCUDA::test_neg_view_bmm_cuda_float64 PASSED [1.3572s] [ 68%] 2025-12-04T15:22:22.8453120Z test_ops.py::TestMathBitsCUDA::test_neg_view_bool_cuda_float64 PASSED [0.0046s] [ 68%] 2025-12-04T15:22:22.8453246Z 
test_ops.py::TestMathBitsCUDA::test_neg_view_broadcast_to_cuda_float64 PASSED [1.3775s] [ 68%] 2025-12-04T15:22:22.8453354Z test_ops.py::TestMathBitsCUDA::test_neg_view_cartesian_prod_cuda_float64 XFAIL [0.0067s] [ 68%] 2025-12-04T15:22:22.8453449Z test_ops.py::TestMathBitsCUDA::test_neg_view_cauchy_cuda_float64 XFAIL [1.3383s] [ 68%] 2025-12-04T15:22:22.8453547Z test_ops.py::TestMathBitsCUDA::test_neg_view_cdouble_cuda_float64 PASSED [1.3247s] [ 68%] 2025-12-04T15:22:22.8453641Z test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_cuda_float64 PASSED [0.0138s] [ 68%] 2025-12-04T15:22:22.8453744Z test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_max_cuda_float64 PASSED [0.0124s] [ 68%] 2025-12-04T15:22:22.8453850Z test_ops.py::TestMathBitsCUDA::test_neg_view_conj_physical_cuda_float64 PASSED [1.3527s] [ 68%] 2025-12-04T15:22:22.8453944Z test_ops.py::TestMathBitsCUDA::test_neg_view_cosh_cuda_float64 PASSED [0.0068s] [ 68%] 2025-12-04T15:22:22.8454039Z test_ops.py::TestMathBitsCUDA::test_neg_view_cummax_cuda_float64 PASSED [1.3352s] [ 68%] 2025-12-04T15:22:22.8454137Z test_ops.py::TestMathBitsCUDA::test_neg_view_cumprod_cuda_float64 PASSED [0.0205s] [ 68%] 2025-12-04T15:22:22.8454248Z test_ops.py::TestMathBitsCUDA::test_neg_view_empty_cuda_float64 SKIPPED [0.0002s] (Skipped!) [ 68%] 2025-12-04T15:22:22.8454366Z test_ops.py::TestMathBitsCUDA::test_neg_view_empty_like_cuda_float64 SKIPPED [0.0001s] (Skipped!) 
[ 68%] 2025-12-04T15:22:22.8454467Z test_ops.py::TestMathBitsCUDA::test_neg_view_expand_as_cuda_float64 PASSED [0.0051s] [ 68%] 2025-12-04T15:22:22.8454573Z test_ops.py::TestMathBitsCUDA::test_neg_view_expm1_cuda_float64 PASSED [1.3332s] [ 68%] 2025-12-04T15:22:22.8454685Z test_ops.py::TestMathBitsCUDA::test_neg_view_exponential_cuda_float64 XFAIL [0.0048s] [ 68%] 2025-12-04T15:22:22.8454784Z test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fftn_cuda_float64 PASSED [2.7280s] [ 68%] 2025-12-04T15:22:22.8454895Z test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfft2_cuda_float64 PASSED [1.9793s] [ 68%] 2025-12-04T15:22:22.8454996Z test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfftn_cuda_float64 PASSED [1.6807s] [ 68%] 2025-12-04T15:22:22.8455095Z test_ops.py::TestMathBitsCUDA::test_neg_view_fft_rfft2_cuda_float64 PASSED [1.3342s] [ 68%] 2025-12-04T15:22:22.8455193Z test_ops.py::TestMathBitsCUDA::test_neg_view_flatten_cuda_float64 PASSED [0.0097s] [ 68%] 2025-12-04T15:22:22.8455287Z test_ops.py::TestMathBitsCUDA::test_neg_view_flip_cuda_float64 PASSED [0.0093s] [ 68%] 2025-12-04T15:22:22.8455381Z test_ops.py::TestMathBitsCUDA::test_neg_view_fmin_cuda_float64 PASSED [0.0114s] [ 69%] 2025-12-04T15:22:22.8455475Z test_ops.py::TestMathBitsCUDA::test_neg_view_full_cuda_float64 XFAIL [0.0026s] [ 69%] 2025-12-04T15:22:22.8455572Z test_ops.py::TestMathBitsCUDA::test_neg_view_gather_cuda_float64 PASSED [2.6667s] [ 69%] 2025-12-04T15:22:22.8455665Z test_ops.py::TestMathBitsCUDA::test_neg_view_ge_cuda_float64 PASSED [0.0055s] [ 69%] 2025-12-04T15:22:22.8455757Z test_ops.py::TestMathBitsCUDA::test_neg_view_gt_cuda_float64 PASSED [0.0042s] [ 69%] 2025-12-04T15:22:22.8455851Z test_ops.py::TestMathBitsCUDA::test_neg_view_half_cuda_float64 PASSED [1.3440s] [ 69%] 2025-12-04T15:22:22.8455945Z test_ops.py::TestMathBitsCUDA::test_neg_view_histc_cuda_float64 PASSED [0.0328s] [ 69%] 2025-12-04T15:22:22.8456041Z test_ops.py::TestMathBitsCUDA::test_neg_view_hstack_cuda_float64 PASSED 
[1.3267s] [ 69%] 2025-12-04T15:22:22.8456138Z test_ops.py::TestMathBitsCUDA::test_neg_view_igammac_cuda_float64 PASSED [0.0074s] [ 69%] 2025-12-04T15:22:22.8456250Z test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_amax_cuda_float64 PASSED [1.3410s] [ 69%] 2025-12-04T15:22:22.8456363Z test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_mean_cuda_float64 PASSED [0.0164s] [ 69%] 2025-12-04T15:22:22.8456474Z test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_prod_cuda_float64 PASSED [1.3545s] [ 69%] 2025-12-04T15:22:22.8456573Z test_ops.py::TestMathBitsCUDA::test_neg_view_isfinite_cuda_float64 PASSED [0.0042s] [ 69%] 2025-12-04T15:22:22.8456666Z test_ops.py::TestMathBitsCUDA::test_neg_view_isin_cuda_float64 PASSED [1.3255s] [ 69%] 2025-12-04T15:22:22.8456771Z test_ops.py::TestMathBitsCUDA::test_neg_view_lerp_cuda_float64 PASSED [0.0195s] [ 69%] 2025-12-04T15:22:22.8456877Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigh_cuda_float64 PASSED [0.0089s] [ 69%] 2025-12-04T15:22:22.8456987Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigvals_cuda_float64 PASSED [0.1335s] [ 69%] 2025-12-04T15:22:22.8457202Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_householder_product_cuda_float64 SKIPPED [0.0010s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 69%] 2025-12-04T15:22:22.8457313Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_ldl_factor_cuda_float64 PASSED [0.0043s] [ 69%] 2025-12-04T15:22:22.8457437Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lstsq_grad_oriented_cuda_float64 PASSED [1.0884s] [ 69%] 2025-12-04T15:22:22.8457539Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_cuda_float64 PASSED [0.0628s] [ 69%] 2025-12-04T15:22:22.8457650Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_norm_cuda_float64 PASSED [0.8917s] [ 69%] 2025-12-04T15:22:22.8457763Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_power_cuda_float64 PASSED [0.0992s] [ 69%] 
2025-12-04T15:22:22.8457874Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_rank_cuda_float64 PASSED [0.0396s] [ 69%] 2025-12-04T15:22:22.8457989Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_pinv_hermitian_cuda_float64 PASSED [0.0136s] [ 69%] 2025-12-04T15:22:22.8458105Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_slogdet_cuda_float64 PASSED [0.0080s] [ 69%] 2025-12-04T15:22:22.8458212Z test_ops.py::TestMathBitsCUDA::test_neg_view_log1p_cuda_float64 PASSED [0.8089s] [ 69%] 2025-12-04T15:22:22.8458314Z test_ops.py::TestMathBitsCUDA::test_neg_view_log2_cuda_float64 PASSED [0.0072s] [ 69%] 2025-12-04T15:22:22.8458418Z test_ops.py::TestMathBitsCUDA::test_neg_view_log_softmax_cuda_float64 PASSED [0.8246s] [ 69%] 2025-12-04T15:22:22.8458535Z test_ops.py::TestMathBitsCUDA::test_neg_view_log_softmax_with_dtype_cuda_float64 PASSED [0.0102s] [ 69%] 2025-12-04T15:22:22.8458638Z test_ops.py::TestMathBitsCUDA::test_neg_view_logaddexp2_cuda_float64 PASSED [0.8012s] [ 69%] 2025-12-04T15:22:22.8458743Z test_ops.py::TestMathBitsCUDA::test_neg_view_logcumsumexp_cuda_float64 PASSED [0.0113s] [ 70%] 2025-12-04T15:22:22.8458847Z test_ops.py::TestMathBitsCUDA::test_neg_view_logical_not_cuda_float64 PASSED [0.8166s] [ 70%] 2025-12-04T15:22:22.8458943Z test_ops.py::TestMathBitsCUDA::test_neg_view_logit_cuda_float64 PASSED [0.0084s] [ 70%] 2025-12-04T15:22:22.8459040Z test_ops.py::TestMathBitsCUDA::test_neg_view_long_cuda_float64 PASSED [0.8071s] [ 70%] 2025-12-04T15:22:22.8459139Z test_ops.py::TestMathBitsCUDA::test_neg_view_lu_unpack_cuda_float64 PASSED [0.0265s] [ 70%] 2025-12-04T15:22:22.8459243Z test_ops.py::TestMathBitsCUDA::test_neg_view_masked_amin_cuda_float64 PASSED [0.0861s] [ 70%] 2025-12-04T15:22:22.8459350Z test_ops.py::TestMathBitsCUDA::test_neg_view_masked_cumprod_cuda_float64 PASSED [0.0206s] [ 70%] 2025-12-04T15:22:22.8459455Z test_ops.py::TestMathBitsCUDA::test_neg_view_masked_softmin_cuda_float64 PASSED [0.0260s] [ 70%] 
2025-12-04T15:22:22.8459576Z test_ops.py::TestMathBitsCUDA::test_neg_view_meshgrid_list_of_tensors_cuda_float64 PASSED [0.0068s] [ 70%] 2025-12-04T15:22:22.8459670Z test_ops.py::TestMathBitsCUDA::test_neg_view_msort_cuda_float64 PASSED [0.8388s] [ 70%] 2025-12-04T15:22:22.8459771Z test_ops.py::TestMathBitsCUDA::test_neg_view_nan_to_num_cuda_float64 PASSED [0.0074s] [ 70%] 2025-12-04T15:22:22.8459870Z test_ops.py::TestMathBitsCUDA::test_neg_view_nanmean_cuda_float64 PASSED [0.8719s] [ 70%] 2025-12-04T15:22:22.8459975Z test_ops.py::TestMathBitsCUDA::test_neg_view_nanquantile_cuda_float64 PASSED [0.0970s] [ 70%] 2025-12-04T15:22:22.8460070Z test_ops.py::TestMathBitsCUDA::test_neg_view_nansum_cuda_float64 PASSED [0.8521s] [ 70%] 2025-12-04T15:22:22.8460214Z test_ops.py::TestMathBitsCUDA::test_neg_view_native_batch_norm_cuda_float64 PASSED [0.0096s] [ 70%] 2025-12-04T15:22:22.8460321Z test_ops.py::TestMathBitsCUDA::test_neg_view_ne_cuda_float64 PASSED [0.0042s] [ 70%] 2025-12-04T15:22:22.8460437Z test_ops.py::TestMathBitsCUDA::test_neg_view_new_empty_cuda_float64 SKIPPED [0.0001s] (Skipped!) 
[ 70%] 2025-12-04T15:22:22.8460537Z test_ops.py::TestMathBitsCUDA::test_neg_view_new_zeros_cuda_float64 PASSED [0.8152s] [ 70%] 2025-12-04T15:22:22.8460672Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool1d_cuda_float64 PASSED [0.0101s] [ 70%] 2025-12-04T15:22:22.8460805Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool2d_cuda_float64 PASSED [0.8221s] [ 70%] 2025-12-04T15:22:22.8460937Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_max_pool3d_cuda_float64 PASSED [0.0236s] [ 70%] 2025-12-04T15:22:22.8461088Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_binary_cross_entropy_with_logits_cuda_float64 PASSED [0.8373s] [ 70%] 2025-12-04T15:22:22.8461217Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose2d_cuda_float64 PASSED [0.0223s] [ 70%] 2025-12-04T15:22:22.8461342Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_embedding_bag_cuda_float64 PASSED [0.8739s] [ 70%] 2025-12-04T15:22:22.8461495Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_feature_alpha_dropout_without_train_cuda_float64 PASSED [0.0176s] [ 70%] 2025-12-04T15:22:22.8461616Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_group_norm_cuda_float64 PASSED [0.8359s] [ 70%] 2025-12-04T15:22:22.8461761Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardshrink_cuda_float64 PASSED [0.0092s] [ 70%] 2025-12-04T15:22:22.8461880Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardswish_cuda_float64 PASSED [0.8073s] [ 70%] 2025-12-04T15:22:22.8462024Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_bicubic_cuda_float64 PASSED [0.0257s] [ 70%] 2025-12-04T15:22:22.8462159Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_bilinear_cuda_float64 PASSED [0.8347s] [ 70%] 2025-12-04T15:22:22.8462290Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_margin_ranking_loss_cuda_float64 PASSED [0.0295s] 
[ 70%] 2025-12-04T15:22:22.8462406Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_mse_loss_cuda_float64 PASSED [0.8258s] [ 70%] 2025-12-04T15:22:22.8462543Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multilabel_margin_loss_cuda_float64 PASSED [0.0159s] [ 71%] 2025-12-04T15:22:22.8462664Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_reflect_cuda_float64 PASSED [0.8380s] [ 71%] 2025-12-04T15:22:22.8462774Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_relu_cuda_float64 PASSED [0.0080s] [ 71%] 2025-12-04T15:22:22.8462901Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_soft_margin_loss_cuda_float64 PASSED [0.8261s] [ 71%] 2025-12-04T15:22:22.8463016Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softmin_cuda_float64 PASSED [0.0102s] [ 71%] 2025-12-04T15:22:22.8463135Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_tanhshrink_cuda_float64 PASSED [0.8181s] [ 71%] 2025-12-04T15:22:22.8463265Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_upsample_bilinear_cuda_float64 PASSED [0.0077s] [ 71%] 2025-12-04T15:22:22.8463366Z test_ops.py::TestMathBitsCUDA::test_neg_view_ones_like_cuda_float64 PASSED [0.8118s] [ 71%] 2025-12-04T15:22:22.8463462Z test_ops.py::TestMathBitsCUDA::test_neg_view_ormqr_cuda_float64 PASSED [0.1925s] [ 71%] 2025-12-04T15:22:22.8463562Z test_ops.py::TestMathBitsCUDA::test_neg_view_permute_cuda_float64 PASSED [0.8243s] [ 71%] 2025-12-04T15:22:22.8463662Z test_ops.py::TestMathBitsCUDA::test_neg_view_pinverse_cuda_float64 PASSED [0.0162s] [ 71%] 2025-12-04T15:22:22.8463757Z test_ops.py::TestMathBitsCUDA::test_neg_view_pow_cuda_float64 PASSED [0.0139s] [ 71%] 2025-12-04T15:22:22.8463852Z test_ops.py::TestMathBitsCUDA::test_neg_view_prod_cuda_float64 PASSED [0.8580s] [ 71%] 2025-12-04T15:22:22.8463959Z test_ops.py::TestMathBitsCUDA::test_neg_view_randint_cuda_float64 XFAIL [0.0032s] [ 71%] 2025-12-04T15:22:22.8464055Z 
test_ops.py::TestMathBitsCUDA::test_neg_view_ravel_cuda_float64 PASSED [1.6257s] [ 71%] 2025-12-04T15:22:22.8464156Z test_ops.py::TestMathBitsCUDA::test_neg_view_reciprocal_cuda_float64 PASSED [0.0068s] [ 71%] 2025-12-04T15:22:22.8464268Z test_ops.py::TestMathBitsCUDA::test_neg_view_round_decimals_0_cuda_float64 PASSED [0.8230s] [ 71%] 2025-12-04T15:22:22.8464390Z test_ops.py::TestMathBitsCUDA::test_neg_view_scalar_tensor_cuda_float64 SKIPPED [0.0002s] (Skipped!) [ 71%] 2025-12-04T15:22:22.8464496Z test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_add_cuda_float64 PASSED [0.8192s] [ 71%] 2025-12-04T15:22:22.8464607Z test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_sum_cuda_float64 PASSED [0.0299s] [ 71%] 2025-12-04T15:22:22.8464741Z test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_cosine_cuda_float64 SKIPPED [0.0002s] (Skipped!) [ 71%] 2025-12-04T15:22:22.8464884Z test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_general_cosine_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 71%] 2025-12-04T15:22:22.8465029Z test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_general_hamming_cuda_float64 SKIPPED [0.0001s] (Skipped!) 
[ 71%] 2025-12-04T15:22:22.8465124Z test_ops.py::TestMathBitsCUDA::test_neg_view_sin_cuda_float64 PASSED [0.8199s] [ 71%] 2025-12-04T15:22:22.8465232Z test_ops.py::TestMathBitsCUDA::test_neg_view_slice_scatter_cuda_float64 PASSED [0.0137s] [ 71%] 2025-12-04T15:22:22.8465356Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_j0_cuda_float64 PASSED [0.8133s] [ 71%] 2025-12-04T15:22:22.8465498Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_chebyshev_polynomial_v_cuda_float64 PASSED [0.3499s] [ 71%] 2025-12-04T15:22:22.8465644Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_hermite_polynomial_h_cuda_float64 PASSED [0.0064s] [ 71%] 2025-12-04T15:22:22.8465750Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_ndtri_cuda_float64 PASSED [0.8224s] [ 71%] 2025-12-04T15:22:22.8465885Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_scaled_modified_bessel_k1_cuda_float64 PASSED [0.0053s] [ 71%] 2025-12-04T15:22:22.8465991Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_zeta_cuda_float64 PASSED [0.0062s] [ 71%] 2025-12-04T15:22:22.8466097Z test_ops.py::TestMathBitsCUDA::test_neg_view_tensor_split_cuda_float64 PASSED [0.8242s] [ 72%] 2025-12-04T15:22:22.8466192Z test_ops.py::TestMathBitsCUDA::test_neg_view_trapz_cuda_float64 PASSED [0.0164s] [ 72%] 2025-12-04T15:22:22.8466286Z test_ops.py::TestMathBitsCUDA::test_neg_view_tril_cuda_float64 PASSED [0.8356s] [ 72%] 2025-12-04T15:22:22.8466380Z test_ops.py::TestMathBitsCUDA::test_neg_view_triu_cuda_float64 PASSED [0.0124s] [ 72%] 2025-12-04T15:22:22.8466485Z test_ops.py::TestMathBitsCUDA::test_neg_view_unfold_copy_cuda_float64 PASSED [0.8401s] [ 72%] 2025-12-04T15:22:22.8466582Z test_ops.py::TestMathBitsCUDA::test_neg_view_unfold_cuda_float64 PASSED [0.0240s] [ 72%] 2025-12-04T15:22:22.8466678Z test_ops.py::TestMathBitsCUDA::test_neg_view_unique_cuda_float64 PASSED [0.2669s] [ 72%] 2025-12-04T15:22:22.8466785Z test_ops.py::TestMathBitsCUDA::test_neg_view_unsqueeze_copy_cuda_float64 PASSED 
[0.8070s] [ 72%] 2025-12-04T15:22:22.8466893Z test_ops.py::TestMathBitsCUDA::test_neg_view_var_mean_unbiased_cuda_float64 PASSED [0.0042s] [ 72%] 2025-12-04T15:22:22.8466989Z test_ops.py::TestFakeTensorCUDA::test_fake___radd___cuda_float32 PASSED [0.0200s] [ 72%] 2025-12-04T15:22:22.8467084Z test_ops.py::TestFakeTensorCUDA::test_fake___rdiv___cuda_float32 PASSED [0.0128s] [ 72%] 2025-12-04T15:22:22.8467183Z test_ops.py::TestFakeTensorCUDA::test_fake___rpow___cuda_float32 PASSED [0.0099s] [ 72%] 2025-12-04T15:22:22.8467299Z test_ops.py::TestFakeTensorCUDA::test_fake__batch_norm_with_update_cuda_float32 PASSED [0.0508s] [ 72%] 2025-12-04T15:22:22.8467401Z test_ops.py::TestFakeTensorCUDA::test_fake__chunk_cat_cuda_float32 PASSED [0.0382s] [ 72%] 2025-12-04T15:22:22.8467508Z test_ops.py::TestFakeTensorCUDA::test_fake_abs_cuda_float32 PASSED [0.0028s] [ 72%] 2025-12-04T15:22:22.8467618Z test_ops.py::TestFakeTensorCUDA::test_fake_addmm_decomposed_cuda_float32 PASSED [0.0146s] [ 72%] 2025-12-04T15:22:22.8467712Z test_ops.py::TestFakeTensorCUDA::test_fake_amin_cuda_float32 PASSED [0.0154s] [ 72%] 2025-12-04T15:22:22.8467838Z test_ops.py::TestFakeTensorCUDA::test_fake_aminmax_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 72%] 2025-12-04T15:22:22.8467934Z test_ops.py::TestFakeTensorCUDA::test_fake_angle_cuda_float32 PASSED [0.0029s] [ 72%] 2025-12-04T15:22:22.8468033Z test_ops.py::TestFakeTensorCUDA::test_fake_argwhere_cuda_float32 PASSED [0.0059s] [ 72%] 2025-12-04T15:22:22.8468140Z test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_copy_cuda_float32 PASSED [0.0056s] [ 72%] 2025-12-04T15:22:22.8468236Z test_ops.py::TestFakeTensorCUDA::test_fake_asinh_cuda_float32 PASSED [0.0029s] [ 72%] 2025-12-04T15:22:22.8468328Z test_ops.py::TestFakeTensorCUDA::test_fake_atan_cuda_float32 PASSED [0.8152s] [ 72%] 2025-12-04T15:22:22.8468423Z test_ops.py::TestFakeTensorCUDA::test_fake_atanh_cuda_float32 PASSED [0.0047s] [ 72%] 2025-12-04T15:22:22.8468524Z 
test_ops.py::TestFakeTensorCUDA::test_fake_autocast_T_cuda_float32 PASSED [0.0042s] [ 72%] 2025-12-04T15:22:22.8468637Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast___getitem___cuda_float32 PASSED [0.0239s] [ 72%] 2025-12-04T15:22:22.8468746Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rmod___cuda_float32 PASSED [0.0101s] [ 72%] 2025-12-04T15:22:22.8468864Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rxor___cuda_int64 PASSED [0.0100s] [ 72%] 2025-12-04T15:22:22.8468986Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast__chunk_cat_cuda_float32 PASSED [0.8482s] [ 72%] 2025-12-04T15:22:22.8469089Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_add_cuda_float32 PASSED [0.0143s] [ 72%] 2025-12-04T15:22:22.8469206Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmm_cuda_float32 PASSED [0.3865s] [ 72%] 2025-12-04T15:22:22.8469314Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_arange_cuda_float32 PASSED [0.0206s] [ 72%] 2025-12-04T15:22:22.8469448Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_partial_views_cuda_float32 PASSED [0.8272s] [ 73%] 2025-12-04T15:22:22.8469570Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_scatter_cuda_float32 PASSED [0.0100s] [ 73%] 2025-12-04T15:22:22.8469675Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_asin_cuda_float32 PASSED [0.8183s] [ 73%] 2025-12-04T15:22:22.8469781Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_asinh_cuda_float32 PASSED [0.0046s] [ 73%] 2025-12-04T15:22:22.8469895Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atleast_1d_cuda_float32 PASSED [0.0066s] [ 73%] 2025-12-04T15:22:22.8470008Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bernoulli_cuda_float32 PASSED [0.0061s] [ 73%] 2025-12-04T15:22:22.8470168Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_left_shift_cuda_int64 PASSED [0.0100s] [ 73%] 2025-12-04T15:22:22.8470281Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_xor_cuda_int64 
PASSED [0.0099s] [ 73%] 2025-12-04T15:22:22.8470386Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bmm_cuda_float32 PASSED [0.0042s] [ 73%] 2025-12-04T15:22:22.8470498Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bucketize_cuda_float32 PASSED [0.0207s] [ 73%] 2025-12-04T15:22:22.8470616Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cartesian_prod_cuda_float32 PASSED [0.0104s] [ 73%] 2025-12-04T15:22:22.8470722Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cdist_cuda_float32 PASSED [0.2593s] [ 73%] 2025-12-04T15:22:22.8470831Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cdouble_cuda_float32 PASSED [0.0075s] [ 73%] 2025-12-04T15:22:22.8470936Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ceil_cuda_float32 PASSED [0.0029s] [ 73%] 2025-12-04T15:22:22.8471046Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cholesky_cuda_float32 PASSED [0.0141s] [ 73%] 2025-12-04T15:22:22.8471169Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_chunk_cuda_float32 PASSED [0.0054s] [ 73%] 2025-12-04T15:22:22.8471275Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_cuda_float32 PASSED [0.0108s] [ 73%] 2025-12-04T15:22:22.8471393Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_combinations_cuda_float32 PASSED [0.0935s] [ 73%] 2025-12-04T15:22:22.8471509Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_conj_physical_cuda_float32 PASSED [0.0026s] [ 73%] 2025-12-04T15:22:22.8471617Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumprod_cuda_float32 PASSED [0.0112s] [ 73%] 2025-12-04T15:22:22.8471730Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diag_embed_cuda_float32 PASSED [0.0169s] [ 73%] 2025-12-04T15:22:22.8471840Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagflat_cuda_float32 PASSED [0.0086s] [ 73%] 2025-12-04T15:22:22.8471950Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagonal_cuda_float32 PASSED [0.0121s] [ 73%] 2025-12-04T15:22:22.8472056Z 
test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diff_cuda_float32 PASSED [0.1704s] [ 73%] 2025-12-04T15:22:22.8472161Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dist_cuda_float32 PASSED [0.0705s] [ 73%] 2025-12-04T15:22:22.8472269Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_einsum_cuda_float32 PASSED [0.2873s] [ 73%] 2025-12-04T15:22:22.8472386Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_empty_permuted_cuda_float32 PASSED [0.0225s] [ 73%] 2025-12-04T15:22:22.8472491Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_erfc_cuda_float32 PASSED [0.0044s] [ 73%] 2025-12-04T15:22:22.8472619Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_exp_cuda_float32 PASSED [0.0042s] [ 73%] 2025-12-04T15:22:22.8472733Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_expand_copy_cuda_float32 PASSED [0.0084s] [ 73%] 2025-12-04T15:22:22.8472861Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fftshift_cuda_float32 PASSED [0.0068s] [ 73%] 2025-12-04T15:22:22.8472974Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfft2_cuda_float32 PASSED [0.0134s] [ 74%] 2025-12-04T15:22:22.8473085Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfft_cuda_float32 PASSED [0.0131s] [ 74%] 2025-12-04T15:22:22.8473196Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfftn_cuda_float32 PASSED [0.0154s] [ 74%] 2025-12-04T15:22:22.8473305Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfft_cuda_float32 PASSED [0.0078s] [ 74%] 2025-12-04T15:22:22.8473412Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fill_cuda_float32 PASSED [0.0041s] [ 74%] 2025-12-04T15:22:22.8473523Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fliplr_cuda_float32 PASSED [0.0044s] [ 74%] 2025-12-04T15:22:22.8473624Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_gcd_cuda_int64 PASSED [0.0110s] [ 74%] 2025-12-04T15:22:22.8473731Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_histc_cuda_float32 PASSED [0.0639s] [ 74%] 
2025-12-04T15:22:22.8473842Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_add_cuda_float32 PASSED [0.0155s] [ 74%] 2025-12-04T15:22:22.8473956Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_copy_cuda_float32 PASSED [0.0057s] [ 74%] 2025-12-04T15:22:22.8474059Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_int_cuda_float32 PASSED [0.0075s] [ 74%] 2025-12-04T15:22:22.8474168Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isneginf_cuda_float32 PASSED [0.0029s] [ 74%] 2025-12-04T15:22:22.8474307Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_istft_cuda_complex64 SKIPPED [0.0009s] (Skip failing test) [ 74%] 2025-12-04T15:22:22.8474471Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_2inputs_2outputs_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 74%] 2025-12-04T15:22:22.8474619Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_unary_cuda_float32 SKIPPED [0.0011s] (Skip failing test) [ 74%] 2025-12-04T15:22:22.8474723Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_le_cuda_float32 PASSED [0.0097s] [ 74%] 2025-12-04T15:22:22.8474859Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cholesky_cuda_float32 PASSED [0.0175s] [ 74%] 2025-12-04T15:22:22.8474984Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cholesky_ex_cuda_float32 PASSED [0.0156s] [ 74%] 2025-12-04T15:22:22.8475099Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cross_cuda_float32 PASSED [0.8454s] [ 74%] 2025-12-04T15:22:22.8475212Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eig_cuda_float32 PASSED [0.0491s] [ 74%] 2025-12-04T15:22:22.8475328Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eigh_cuda_float32 PASSED [0.0114s] [ 74%] 2025-12-04T15:22:22.8475551Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_householder_product_cuda_float32 SKIPPED [0.0008s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 74%] 2025-12-04T15:22:22.8475697Z 
test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_solve_cuda_float32 SKIPPED [0.0011s] (Skip failing test) [ 74%] 2025-12-04T15:22:22.8475815Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_solve_ex_cuda_float32 PASSED [0.8326s] [ 74%] 2025-12-04T15:22:22.8475929Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logical_xor_cuda_float32 PASSED [0.0132s] [ 74%] 2025-12-04T15:22:22.8476034Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mT_cuda_float32 PASSED [0.0057s] [ 74%] 2025-12-04T15:22:22.8476146Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_fill_cuda_float32 PASSED [0.8162s] [ 74%] 2025-12-04T15:22:22.8476279Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_logaddexp_cuda_float32 PASSED [0.0372s] [ 74%] 2025-12-04T15:22:22.8476404Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_softmax_cuda_float32 PASSED [0.0339s] [ 74%] 2025-12-04T15:22:22.8476523Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_maximum_cuda_float32 PASSED [0.0100s] [ 74%] 2025-12-04T15:22:22.8476630Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_median_cuda_float32 PASSED [0.0119s] [ 74%] 2025-12-04T15:22:22.8476736Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mm_cuda_float32 PASSED [0.0069s] [ 74%] 2025-12-04T15:22:22.8476891Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mvlgamma_mvlgamma_p_3_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 75%] 2025-12-04T15:22:22.8477035Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nanquantile_cuda_float32 SKIPPED [0.0011s] (Skip failing test) [ 75%] 2025-12-04T15:22:22.8477142Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nansum_cuda_float32 PASSED [0.0251s] [ 75%] 2025-12-04T15:22:22.8477249Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_neg_cuda_float32 PASSED [0.8203s] [ 75%] 2025-12-04T15:22:22.8477360Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_empty_cuda_float32 PASSED [0.0098s] [ 75%] 2025-12-04T15:22:22.8477484Z 
test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_empty_strided_cuda_float32 PASSED [0.8296s] [ 75%] 2025-12-04T15:22:22.8477630Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_avg_pool1d_cuda_float32 PASSED [0.0124s] [ 75%] 2025-12-04T15:22:22.8477767Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_alpha_dropout_cuda_float32 PASSED [0.0219s] [ 75%] 2025-12-04T15:22:22.8477899Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_batch_norm_cuda_float32 PASSED [0.0348s] [ 75%] 2025-12-04T15:22:22.8478060Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_binary_cross_entropy_with_logits_cuda_float32 PASSED [0.0961s] [ 75%] 2025-12-04T15:22:22.8478197Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_cross_entropy_cuda_float32 PASSED [0.1111s] [ 75%] 2025-12-04T15:22:22.8478351Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_ctc_loss_cuda_float32 SKIPPED [0.0011s] (Skip failing test) [ 75%] 2025-12-04T15:22:22.8478499Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_fractional_max_pool2d_cuda_float32 PASSED [0.0485s] [ 75%] 2025-12-04T15:22:22.8478641Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_grid_sample_cuda_float32 PASSED [1.4514s] [ 75%] 2025-12-04T15:22:22.8478773Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_group_norm_cuda_float32 PASSED [0.0356s] [ 75%] 2025-12-04T15:22:22.8478900Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardtanh_cuda_float32 PASSED [0.0080s] [ 75%] 2025-12-04T15:22:22.8479031Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_huber_loss_cuda_float32 PASSED [0.0140s] [ 75%] 2025-12-04T15:22:22.8479178Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_bilinear_cuda_float32 PASSED [0.2539s] [ 75%] 2025-12-04T15:22:22.8479305Z 
test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_l1_loss_cuda_float32 PASSED [0.0102s] [ 75%] 2025-12-04T15:22:22.8479449Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_local_response_norm_cuda_float32 PASSED [0.0292s] [ 75%] 2025-12-04T15:22:22.8479582Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_logsigmoid_cuda_float32 PASSED [0.0092s] [ 75%] 2025-12-04T15:22:22.8479717Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool1d_cuda_float32 PASSED [0.5848s] [ 75%] 2025-12-04T15:22:22.8479858Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool3d_grad_cuda_float32 PASSED [0.0764s] [ 75%] 2025-12-04T15:22:22.8479999Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multi_margin_loss_cuda_float32 PASSED [0.0400s] [ 75%] 2025-12-04T15:22:22.8480210Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multilabel_margin_loss_cuda_float32 PASSED [0.0484s] [ 75%] 2025-12-04T15:22:22.8480365Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.0240s] [ 75%] 2025-12-04T15:22:22.8480533Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_nll_loss_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 75%] 2025-12-04T15:22:22.8480665Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_normalize_cuda_float32 PASSED [0.0181s] [ 75%] 2025-12-04T15:22:22.8480790Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pdist_cuda_float32 PASSED [0.0083s] [ 75%] 2025-12-04T15:22:22.8480926Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pixel_shuffle_cuda_float32 PASSED [0.0055s] [ 75%] 2025-12-04T15:22:22.8481049Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_rrelu_cuda_float32 PASSED [0.0270s] [ 75%] 2025-12-04T15:22:22.8481188Z 
test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_silu_complex_cuda_complex64 PASSED [0.0045s] [ 75%] 2025-12-04T15:22:22.8481330Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softmin_with_dtype_cuda_float32 PASSED [0.0123s] [ 76%] 2025-12-04T15:22:22.8481461Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_tanhshrink_cuda_float32 PASSED [0.0052s] [ 76%] 2025-12-04T15:22:22.8481606Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nonzero_static_cuda_float32 SKIPPED [0.0006s] (Only runs on cpu) [ 76%] 2025-12-04T15:22:22.8481714Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polar_cuda_float32 PASSED [0.0120s] [ 76%] 2025-12-04T15:22:22.8481820Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_qr_cuda_float32 PASSED [0.0407s] [ 76%] 2025-12-04T15:22:22.8481961Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_quantile_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 76%] 2025-12-04T15:22:22.8482073Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randint_cuda_float32 PASSED [0.0153s] [ 76%] 2025-12-04T15:22:22.8482179Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randn_cuda_float32 PASSED [0.0038s] [ 76%] 2025-12-04T15:22:22.8482294Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randn_like_cuda_float32 PASSED [0.0113s] [ 76%] 2025-12-04T15:22:22.8482444Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_repeat_interleave_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 76%] 2025-12-04T15:22:22.8482565Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_reshape_cuda_float32 PASSED [0.0091s] [ 76%] 2025-12-04T15:22:22.8482672Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_roll_cuda_float32 PASSED [0.0155s] [ 76%] 2025-12-04T15:22:22.8482778Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rsub_cuda_float32 PASSED [0.0116s] [ 76%] 2025-12-04T15:22:22.8482903Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_amin_cuda_float32 PASSED [0.8443s] [ 
76%] 2025-12-04T15:22:22.8483030Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_prod_cuda_float32 PASSED [0.0242s] [ 76%] 2025-12-04T15:22:22.8483136Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sign_cuda_float32 PASSED [0.0032s] [ 76%] 2025-12-04T15:22:22.8483263Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_cosine_cuda_float32 PASSED [0.0112s] [ 76%] 2025-12-04T15:22:22.8483369Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sinc_cuda_float32 PASSED [0.0051s] [ 76%] 2025-12-04T15:22:22.8483476Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_slice_cuda_float32 PASSED [0.0055s] [ 76%] 2025-12-04T15:22:22.8483592Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_slice_scatter_cuda_float32 PASSED [0.0093s] [ 76%] 2025-12-04T15:22:22.8483714Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_y1_cuda_float32 PASSED [0.0042s] [ 76%] 2025-12-04T15:22:22.8483853Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_legendre_polynomial_p_cuda_float32 PASSED [0.0098s] [ 76%] 2025-12-04T15:22:22.8484032Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_polygamma_special_polygamma_n_0_cuda_float32 PASSED [0.0090s] [ 76%] 2025-12-04T15:22:22.8484164Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_squeeze_multiple_cuda_float32 PASSED [0.0063s] [ 76%] 2025-12-04T15:22:22.8484283Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_mean_unbiased_cuda_float32 PASSED [0.0041s] [ 76%] 2025-12-04T15:22:22.8484387Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_t_cuda_float32 PASSED [0.0041s] [ 76%] 2025-12-04T15:22:22.8484492Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tanh_cuda_float32 PASSED [0.8263s] [ 76%] 2025-12-04T15:22:22.8484635Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tensor_split_cuda_float32 SKIPPED [0.0015s] (Skip failing test) [ 76%] 2025-12-04T15:22:22.8484785Z 
test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch_ops_aten__flash_attention_forward_cuda_float16 PASSED [0.0323s] [ 76%] 2025-12-04T15:22:22.8484904Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_transpose_copy_cuda_float32 PASSED [0.0078s] [ 76%] 2025-12-04T15:22:22.8485018Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trapezoid_cuda_float32 PASSED [0.0268s] [ 76%] 2025-12-04T15:22:22.8485125Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_where_cuda_float32 PASSED [0.0085s] [ 77%] 2025-12-04T15:22:22.8485231Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_zero__cuda_float32 PASSED [0.0045s] [ 77%] 2025-12-04T15:22:22.8485345Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_zeros_like_cuda_float32 PASSED [0.0077s] [ 77%] 2025-12-04T15:22:22.8485446Z test_ops.py::TestFakeTensorCUDA::test_fake_clamp_min_cuda_float32 PASSED [0.0117s] [ 77%] 2025-12-04T15:22:22.8485550Z test_ops.py::TestFakeTensorCUDA::test_fake_combinations_cuda_float32 PASSED [0.0851s] [ 77%] 2025-12-04T15:22:22.8485645Z test_ops.py::TestFakeTensorCUDA::test_fake_cosh_cuda_float32 PASSED [0.0043s] [ 77%] 2025-12-04T15:22:22.8485776Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___getitem___cuda_float32 PASSED [0.0391s] [ 77%] 2025-12-04T15:22:22.8485902Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rdiv___cuda_float32 PASSED [0.0634s] [ 77%] 2025-12-04T15:22:22.8486051Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__batch_norm_with_update_cuda_float32 PASSED [0.1794s] [ 77%] 2025-12-04T15:22:22.8486208Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__segment_reduce_offsets_cuda_float32 PASSED [0.2308s] [ 77%] 2025-12-04T15:22:22.8486331Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addmm_cuda_float32 PASSED [0.7385s] [ 77%] 2025-12-04T15:22:22.8486468Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_copy_cuda_float32 PASSED [0.0260s] [ 77%] 
2025-12-04T15:22:22.8486606Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_scatter_cuda_float32 PASSED [0.0277s] [ 77%] 2025-12-04T15:22:22.8486731Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atanh_cuda_float32 PASSED [0.8270s] [ 77%] 2025-12-04T15:22:22.8486856Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_baddbmm_cuda_float32 PASSED [0.0813s] [ 77%] 2025-12-04T15:22:22.8486985Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_bernoulli_cuda_float32 PASSED [0.0095s] [ 77%] 2025-12-04T15:22:22.8487126Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_broadcast_tensors_cuda_float32 PASSED [0.0097s] [ 77%] 2025-12-04T15:22:22.8487246Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cat_cuda_float32 PASSED [0.0236s] [ 77%] 2025-12-04T15:22:22.8487370Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cfloat_cuda_float32 PASSED [0.0175s] [ 77%] 2025-12-04T15:22:22.8487491Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_chunk_cuda_float32 PASSED [0.0124s] [ 77%] 2025-12-04T15:22:22.8487622Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_column_stack_cuda_float32 PASSED [0.0146s] [ 77%] 2025-12-04T15:22:22.8487766Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_complex_cuda_float32 PASSED [0.0523s] [ 77%] 2025-12-04T15:22:22.8487910Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_conj_physical_cuda_float32 PASSED [0.0028s] [ 77%] 2025-12-04T15:22:22.8488046Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_constant_pad_nd_cuda_float32 PASSED [0.0857s] [ 77%] 2025-12-04T15:22:22.8488168Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cos_cuda_float32 PASSED [0.0106s] [ 77%] 2025-12-04T15:22:22.8488293Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cummax_cuda_float32 PASSED [0.0091s] [ 77%] 
2025-12-04T15:22:22.8488417Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cummin_cuda_float32 PASSED [0.0091s] [ 77%] 2025-12-04T15:22:22.8488540Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cumsum_cuda_float32 PASSED [0.0130s] [ 77%] 2025-12-04T15:22:22.8488663Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diag_cuda_float32 PASSED [0.0329s] [ 77%] 2025-12-04T15:22:22.8488791Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagonal_cuda_float32 PASSED [0.0299s] [ 77%] 2025-12-04T15:22:22.8488928Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagonal_scatter_cuda_float32 PASSED [0.0421s] [ 77%] 2025-12-04T15:22:22.8489051Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dist_cuda_float32 PASSED [0.6232s] [ 77%] 2025-12-04T15:22:22.8489189Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_trunc_rounding_cuda_float32 PASSED [0.0322s] [ 78%] 2025-12-04T15:22:22.8489310Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_erfc_cuda_float32 PASSED [0.0140s] [ 78%] 2025-12-04T15:22:22.8489436Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfft2_cuda_float32 PASSED [0.0658s] [ 78%] 2025-12-04T15:22:22.8489566Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_irfftn_cuda_float32 PASSED [0.0609s] [ 78%] 2025-12-04T15:22:22.8489691Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_rfftn_cuda_float32 PASSED [0.0423s] [ 78%] 2025-12-04T15:22:22.8489813Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_flip_cuda_float32 PASSED [0.0187s] [ 78%] 2025-12-04T15:22:22.8489944Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_floor_cuda_float32 PASSED [0.0039s] [ 78%] 2025-12-04T15:22:22.8490066Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fmod_cuda_float32 PASSED [0.0387s] [ 78%] 2025-12-04T15:22:22.8490219Z 
test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_frac_cuda_float32 PASSED [0.0037s] [ 78%] 2025-12-04T15:22:22.8490348Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_gradient_cuda_float32 PASSED [0.6230s] [ 78%] 2025-12-04T15:22:22.8490485Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_grid_sampler_2d_cuda_float32 PASSED [2.3343s] [ 78%] 2025-12-04T15:22:22.8490610Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hsplit_cuda_float32 PASSED [0.0188s] [ 78%] 2025-12-04T15:22:22.8490734Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hstack_cuda_float32 PASSED [0.0089s] [ 78%] 2025-12-04T15:22:22.8490857Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hypot_cuda_float32 PASSED [0.0516s] [ 78%] 2025-12-04T15:22:22.8490987Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_put_cuda_float32 PASSED [0.0129s] [ 78%] 2025-12-04T15:22:22.8491108Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_inner_cuda_float32 PASSED [0.0137s] [ 78%] 2025-12-04T15:22:22.8491237Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_det_cuda_float32 PASSED [0.0519s] [ 78%] 2025-12-04T15:22:22.8491366Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eig_cuda_float32 PASSED [0.3876s] [ 78%] 2025-12-04T15:22:22.8491530Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eigvals_cuda_float32 PASSED [0.1329s] [ 78%] 2025-12-04T15:22:22.8491662Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lstsq_cuda_float32 PASSED [0.2860s] [ 78%] 2025-12-04T15:22:22.8491812Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_factor_cuda_float32 PASSED [1.1185s] [ 78%] 2025-12-04T15:22:22.8491948Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_solve_cuda_float32 PASSED [5.0670s] [ 78%] 2025-12-04T15:22:22.8492105Z 
test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_matrix_norm_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 78%] 2025-12-04T15:22:22.8492236Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_solve_cuda_float32 PASSED [0.2704s] [ 78%] 2025-12-04T15:22:22.8492356Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log2_cuda_float32 PASSED [0.0099s] [ 78%] 2025-12-04T15:22:22.8492489Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_softmax_cuda_float32 PASSED [0.0336s] [ 78%] 2025-12-04T15:22:22.8492634Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_softmax_with_dtype_cuda_float32 PASSED [0.0460s] [ 78%] 2025-12-04T15:22:22.8492764Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logsumexp_cuda_float32 PASSED [0.0800s] [ 78%] 2025-12-04T15:22:22.8492890Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lu_unpack_cuda_float32 PASSED [0.2347s] [ 78%] 2025-12-04T15:22:22.8493009Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mT_cuda_float32 PASSED [0.0101s] [ 78%] 2025-12-04T15:22:22.8493141Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_cumsum_cuda_float32 PASSED [0.1661s] [ 78%] 2025-12-04T15:22:22.8493299Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_max_pool2d_with_indices_backward_cuda_float32 PASSED [4.7631s] [ 79%] 2025-12-04T15:22:22.8493447Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_meshgrid_list_of_tensors_cuda_float32 PASSED [0.0789s] [ 79%] 2025-12-04T15:22:22.8493573Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_movedim_cuda_float32 PASSED [0.0056s] [ 79%] 2025-12-04T15:22:22.8493695Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_msort_cuda_float32 PASSED [0.0081s] [ 79%] 2025-12-04T15:22:22.8493827Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mul_cuda_float32 PASSED [0.0343s] [ 
79%] 2025-12-04T15:22:22.8493947Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mv_cuda_float32 PASSED [0.0150s] [ 79%] 2025-12-04T15:22:22.8494091Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0519s] [ 79%] 2025-12-04T15:22:22.8494220Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nanmedian_cuda_float32 PASSED [0.0523s] [ 79%] 2025-12-04T15:22:22.8494381Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool3d_cuda_float32 PASSED [0.0245s] [ 79%] 2025-12-04T15:22:22.8494543Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [0.0470s] [ 79%] 2025-12-04T15:22:22.8494703Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_max_pool3d_cuda_float32 PASSED [0.0399s] [ 79%] 2025-12-04T15:22:22.8494844Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_celu_cuda_float32 PASSED [0.0141s] [ 79%] 2025-12-04T15:22:22.8495037Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv3d_cuda_float32 GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495113Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495183Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495270Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495346Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495414Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495490Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495558Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495624Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495692Z 
GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495758Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495825Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495891Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8495957Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496024Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496092Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496159Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496225Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496291Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496357Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496425Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496490Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496557Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496623Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496690Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496756Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496824Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496890Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8496957Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497024Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497090Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497155Z GridwiseOp: Problemsize descriptor dimension check failure 
2025-12-04T15:22:22.8497231Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497297Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497364Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497430Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497497Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497562Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497631Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497698Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497764Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497831Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497899Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8497965Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8498032Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8498098Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8498165Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8498230Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8498298Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8498375Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8498451Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8500229Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8500325Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8500395Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8500461Z GridwiseOp: Problemsize descriptor 
dimension check failure 2025-12-04T15:22:22.8500531Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8500598Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8500665Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8500731Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8500798Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8500865Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8500934Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501001Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501067Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501135Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501202Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501268Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501335Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501401Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501468Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501534Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501602Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501667Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501735Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501802Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501870Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8501937Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502005Z 
GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502086Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502155Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502221Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502288Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502354Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502422Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502490Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502558Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502625Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502693Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502760Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502826Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502895Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8502962Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503028Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503094Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503160Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503226Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503307Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503385Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503454Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503530Z GridwiseOp: Problemsize descriptor dimension check failure 
2025-12-04T15:22:22.8503598Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503663Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503732Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503798Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503865Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503931Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8503998Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8504064Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8504133Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8504199Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8504401Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 26400, provided ptr: 0x780d6ae03200 size: 5888 2025-12-04T15:22:22.8504588Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 26400, provided ptr: 0x780d6ae03200 size: 5888 2025-12-04T15:22:22.8504783Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 26400, provided ptr: 0x780d6ae04a00 size: 5888 2025-12-04T15:22:22.8504963Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 26400, provided ptr: 0x780d6ae04a00 size: 5888 2025-12-04T15:22:22.8505166Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 26400, provided ptr: 0x780d6ae04c00 size: 6144 2025-12-04T15:22:22.8505238Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8505306Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8505376Z GridwiseOp: Problemsize descriptor dimension check failure 
2025-12-04T15:22:22.8505443Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8505523Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8505590Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8505657Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8505723Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8505790Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8505857Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8505925Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8505993Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506060Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506128Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506194Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506260Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506330Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506396Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506463Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506529Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506596Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506662Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8506877Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 26400, provided ptr: 0x780d6ae04c00 size: 6144 2025-12-04T15:22:22.8507071Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 168960, provided ptr: 0x780d6ae05000 
size: 6656 2025-12-04T15:22:22.8507266Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 168960, provided ptr: 0x780d6ae05000 size: 6656 2025-12-04T15:22:22.8507462Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 168960, provided ptr: 0x780d6ae05000 size: 6656 2025-12-04T15:22:22.8507643Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 168960, provided ptr: 0x780d6ae05000 size: 6656 2025-12-04T15:22:22.8507847Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 168960, provided ptr: 0x780d6ae05000 size: 6912 2025-12-04T15:22:22.8507917Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8507986Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508054Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508121Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508188Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508258Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508324Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508390Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508456Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508523Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508589Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508657Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508724Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508791Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8508858Z GridwiseOp: Problemsize descriptor dimension 
check failure 2025-12-04T15:22:22.8508926Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509015Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509083Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509149Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509216Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509282Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509349Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509543Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 168960, provided ptr: 0x780d6ae05000 size: 6912 2025-12-04T15:22:22.8509611Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509680Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509747Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509814Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509881Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8509948Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510014Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510082Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510189Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510256Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510340Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510423Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510490Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510569Z GridwiseOp: Problemsize 
descriptor dimension check failure 2025-12-04T15:22:22.8510636Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510703Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510770Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510837Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510903Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8510970Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511036Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511102Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511169Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511237Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511303Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511371Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511436Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511509Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511576Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511642Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511709Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511776Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511844Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511911Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8511980Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512046Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512115Z 
GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512183Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512249Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512329Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512395Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512463Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512529Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512596Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512662Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512730Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512796Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512863Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512930Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8512997Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513063Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513132Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513198Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513265Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513331Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513397Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513474Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513552Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513619Z GridwiseOp: Problemsize descriptor dimension check failure 
2025-12-04T15:22:22.8513700Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513766Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513833Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513900Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8513967Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8514033Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8514100Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8514165Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8514232Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8514299Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8514358Z GridwiseOp: Problemsize dPASSED [0.6834s] [ 79%] 2025-12-04T15:22:22.8514511Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_dropout2d_cuda_float32 PASSED [0.9365s] [ 79%] 2025-12-04T15:22:22.8514695Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [0.0132s] [ 79%] 2025-12-04T15:22:22.8514854Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_gaussian_nll_loss_cuda_float32 PASSED [7.1105s] [ 79%] 2025-12-04T15:22:22.8514994Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_glu_cuda_float32 PASSED [0.2111s] [ 79%] 2025-12-04T15:22:22.8515142Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardswish_cuda_float32 PASSED [0.0214s] [ 79%] 2025-12-04T15:22:22.8515287Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardtanh_cuda_float32 PASSED [0.0286s] [ 79%] 2025-12-04T15:22:22.8515452Z 
test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_nearest_cuda_float32 PASSED [0.0386s] [ 79%] 2025-12-04T15:22:22.8515595Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_kl_div_cuda_float32 PASSED [0.1768s] [ 79%] 2025-12-04T15:22:22.8515753Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool1d_cuda_float32 PASSED [4.9084s] [ 79%] 2025-12-04T15:22:22.8515924Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.1084s] [ 79%] 2025-12-04T15:22:22.8516068Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_nll_loss_cuda_float32 PASSED [0.4692s] [ 79%] 2025-12-04T15:22:22.8516218Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_circular_cuda_float32 PASSED [0.1543s] [ 79%] 2025-12-04T15:22:22.8516373Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_replicate_cuda_float32 PASSED [0.0233s] [ 79%] 2025-12-04T15:22:22.8516513Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_relu6_cuda_float32 PASSED [0.0122s] [ 79%] 2025-12-04T15:22:22.8516654Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_relu_cuda_float32 PASSED [0.0147s] [ 79%] 2025-12-04T15:22:22.8516801Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_rms_norm_cuda_float32 PASSED [0.0400s] [ 79%] 2025-12-04T15:22:22.8516945Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softplus_cuda_float32 PASSED [0.0144s] [ 79%] 2025-12-04T15:22:22.8517091Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_threshold_cuda_float32 PASSED [0.0129s] [ 79%] 2025-12-04T15:22:22.8517254Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_triplet_margin_loss_cuda_float32 PASSED 
[0.1989s] [ 79%] 2025-12-04T15:22:22.8517460Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [0.2008s] [ 80%] 2025-12-04T15:22:22.8517595Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ormqr_cuda_float32 PASSED [6.8012s] [ 80%] 2025-12-04T15:22:22.8517744Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_1_cuda_float32 PASSED [0.0257s] [ 80%] 2025-12-04T15:22:22.8517890Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_2_cuda_float32 PASSED [0.0250s] [ 80%] 2025-12-04T15:22:22.8518037Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0245s] [ 80%] 2025-12-04T15:22:22.8518159Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_prod_cuda_float32 PASSED [0.2628s] [ 80%] 2025-12-04T15:22:22.8518293Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_resolve_conj_cuda_float32 PASSED [0.0037s] [ 80%] 2025-12-04T15:22:22.8518433Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_roll_cuda_float32 SKIPPED [0.0001s] (Skipped!) 
[ 80%] 2025-12-04T15:22:22.8518557Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rot90_cuda_float32 PASSED [0.0589s] [ 80%] 2025-12-04T15:22:22.8518679Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_cuda_float32 PASSED [0.9367s] [ 80%] 2025-12-04T15:22:22.8518801Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rsqrt_cuda_float32 PASSED [0.0144s] [ 80%] 2025-12-04T15:22:22.8518942Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_amax_cuda_float32 PASSED [0.2218s] [ 80%] 2025-12-04T15:22:22.8519082Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_amin_cuda_float32 PASSED [0.2191s] [ 80%] 2025-12-04T15:22:22.8519254Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_polygamma_special_polygamma_n_0_cuda_float32 PASSED [0.0251s] [ 80%] 2025-12-04T15:22:22.8519375Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sqrt_cuda_float32 PASSED [0.0050s] [ 80%] 2025-12-04T15:22:22.8519509Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_squeeze_copy_cuda_float32 PASSED [0.0170s] [ 80%] 2025-12-04T15:22:22.8519642Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sum_cuda_float32 PASSED [0.0441s] [ 80%] 2025-12-04T15:22:22.8519790Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_svd_lowrank_cuda_float32 SKIPPED [0.0001s] (Skipped!) 
[ 80%] 2025-12-04T15:22:22.8519913Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_t_copy_cuda_float32 PASSED [0.0071s] [ 80%] 2025-12-04T15:22:22.8520036Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_take_cuda_float32 PASSED [0.0255s] [ 80%] 2025-12-04T15:22:22.8520186Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tile_cuda_float32 PASSED [0.0592s] [ 80%] 2025-12-04T15:22:22.8520315Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_to_sparse_cuda_float32 PASSED [0.9552s] [ 80%] 2025-12-04T15:22:22.8520436Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_topk_cuda_float32 PASSED [0.0366s] [ 80%] 2025-12-04T15:22:22.8520557Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trapz_cuda_float32 PASSED [0.1487s] [ 80%] 2025-12-04T15:22:22.8520678Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tril_cuda_float32 PASSED [0.0203s] [ 80%] 2025-12-04T15:22:22.8520807Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_true_divide_cuda_float32 PASSED [0.0489s] [ 80%] 2025-12-04T15:22:22.8520928Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trunc_cuda_float32 PASSED [0.0040s] [ 80%] 2025-12-04T15:22:22.8521058Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unbind_copy_cuda_float32 PASSED [0.0235s] [ 80%] 2025-12-04T15:22:22.8521212Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unbind_cuda_float32 PASSED [0.0231s] [ 80%] 2025-12-04T15:22:22.8521340Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unflatten_cuda_float32 PASSED [0.0237s] [ 80%] 2025-12-04T15:22:22.8521491Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsafe_chunk_cuda_float32 PASSED [0.0129s] [ 80%] 2025-12-04T15:22:22.8521625Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsafe_split_cuda_float32 PASSED [0.0070s] [ 81%] 2025-12-04T15:22:22.8521754Z 
test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsqueeze_cuda_float32 PASSED [0.0174s] [ 81%] 2025-12-04T15:22:22.8521882Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rpow___cuda_float32 PASSED [0.1007s] [ 81%] 2025-12-04T15:22:22.8522030Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__segment_reduce_lengths_cuda_float32 PASSED [0.2258s] [ 81%] 2025-12-04T15:22:22.8522180Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__segment_reduce_offsets_cuda_float32 PASSED [0.2216s] [ 81%] 2025-12-04T15:22:22.8522323Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__unsafe_masked_index_cuda_float32 PASSED [0.0526s] [ 81%] 2025-12-04T15:22:22.8522451Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_acosh_cuda_float32 PASSED [0.0120s] [ 81%] 2025-12-04T15:22:22.8522577Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addmm_cuda_float32 PASSED [0.0458s] [ 81%] 2025-12-04T15:22:22.8522710Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_alias_copy_cuda_float32 PASSED [0.0043s] [ 81%] 2025-12-04T15:22:22.8522834Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_asin_cuda_float32 PASSED [0.0062s] [ 81%] 2025-12-04T15:22:22.8522960Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_asinh_cuda_float32 PASSED [0.0057s] [ 81%] 2025-12-04T15:22:22.8523103Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_broadcast_tensors_cuda_float32 PASSED [0.0092s] [ 81%] 2025-12-04T15:22:22.8523230Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cfloat_cuda_float32 PASSED [0.9511s] [ 81%] 2025-12-04T15:22:22.8523355Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_chalf_cuda_float32 PASSED [0.0259s] [ 81%] 2025-12-04T15:22:22.8523509Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_solve_cuda_float32 PASSED 
[0.1799s] [ 81%] 2025-12-04T15:22:22.8523646Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_column_stack_cuda_float32 PASSED [0.9439s] [ 81%] 2025-12-04T15:22:22.8523778Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_corrcoef_cuda_float32 PASSED [0.1671s] [ 81%] 2025-12-04T15:22:22.8523904Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cos_cuda_float32 PASSED [0.0108s] [ 81%] 2025-12-04T15:22:22.8524028Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cov_cuda_float32 PASSED [1.5511s] [ 81%] 2025-12-04T15:22:22.8524154Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cross_cuda_float32 PASSED [0.0101s] [ 81%] 2025-12-04T15:22:22.8524282Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cumprod_cuda_float32 PASSED [0.2325s] [ 81%] 2025-12-04T15:22:22.8524412Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagflat_cuda_float32 PASSED [0.0214s] [ 81%] 2025-12-04T15:22:22.8524549Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_copy_cuda_float32 PASSED [0.0288s] [ 81%] 2025-12-04T15:22:22.8524679Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_cuda_float32 PASSED [0.9616s] [ 81%] 2025-12-04T15:22:22.8524819Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_scatter_cuda_float32 PASSED [0.0476s] [ 81%] 2025-12-04T15:22:22.8524959Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_digamma_cuda_float32 PASSED [0.9532s] [ 81%] 2025-12-04T15:22:22.8525093Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dist_cuda_float32 PASSED [0.6146s] [ 81%] 2025-12-04T15:22:22.8525228Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dot_cuda_float32 PASSED [0.0052s] [ 81%] 2025-12-04T15:22:22.8525355Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_expand_cuda_float32 PASSED 
[0.0194s] [ 81%] 2025-12-04T15:22:22.8525484Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfft_cuda_float32 PASSED [0.0628s] [ 81%] 2025-12-04T15:22:22.8525620Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifftshift_cuda_float32 PASSED [0.0127s] [ 81%] 2025-12-04T15:22:22.8525751Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfft2_cuda_float32 PASSED [0.0626s] [ 81%] 2025-12-04T15:22:22.8525882Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfft_cuda_float32 PASSED [0.0473s] [ 82%] 2025-12-04T15:22:22.8526011Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_flipud_cuda_float32 PASSED [0.0055s] [ 82%] 2025-12-04T15:22:22.8526138Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_gather_cuda_float32 PASSED [0.0192s] [ 82%] 2025-12-04T15:22:22.8526264Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_hsplit_cuda_float32 PASSED [0.0146s] [ 82%] 2025-12-04T15:22:22.8526407Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_amin_cuda_float32 PASSED [0.0715s] [ 82%] 2025-12-04T15:22:22.8526544Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cholesky_cuda_float32 PASSED [0.2879s] [ 82%] 2025-12-04T15:22:22.8526677Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_det_cuda_float32 PASSED [0.0530s] [ 82%] 2025-12-04T15:22:22.8526813Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_diagonal_cuda_float32 PASSED [0.0305s] [ 82%] 2025-12-04T15:22:22.8527055Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_householder_product_cuda_float32 SKIPPED [0.0007s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 82%] 2025-12-04T15:22:22.8527194Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lu_solve_cuda_float32 PASSED [4.4779s] [ 82%] 
2025-12-04T15:22:22.8527339Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_cuda_float32 PASSED [0.2319s] [ 82%] 2025-12-04T15:22:22.8527477Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_ex_cuda_float32 PASSED [0.2220s] [ 82%] 2025-12-04T15:22:22.8527615Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_tensorinv_cuda_float32 PASSED [0.0280s] [ 82%] 2025-12-04T15:22:22.8527750Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vander_cuda_float32 PASSED [0.2959s] [ 82%] 2025-12-04T15:22:22.8527879Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logdet_cuda_float32 PASSED [0.0816s] [ 82%] 2025-12-04T15:22:22.8528003Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mH_cuda_float32 PASSED [0.0104s] [ 82%] 2025-12-04T15:22:22.8528138Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_prod_cuda_float32 PASSED [1.3496s] [ 82%] 2025-12-04T15:22:22.8528338Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_binary_cuda_float32 PASSED [0.0748s] [ 82%] 2025-12-04T15:22:22.8528499Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_pool2d_with_indices_backward_cuda_float32 PASSED [4.7534s] [ 82%] 2025-12-04T15:22:22.8528645Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_reduction_no_dim_cuda_float32 PASSED [0.0159s] [ 82%] 2025-12-04T15:22:22.8528777Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nan_to_num_cuda_float32 PASSED [0.0140s] [ 82%] 2025-12-04T15:22:22.8528926Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nanmean_cuda_float32 PASSED [0.2421s] [ 82%] 2025-12-04T15:22:22.8529057Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_narrow_cuda_float32 PASSED [0.9752s] [ 82%] 2025-12-04T15:22:22.8529208Z 
test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_native_batch_norm_cuda_float32 PASSED [0.1139s] [ 82%] 2025-12-04T15:22:22.8529366Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_alpha_dropout_cuda_float32 PASSED [0.0379s] [ 82%] 2025-12-04T15:22:22.8529516Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_avg_pool1d_cuda_float32 PASSED [0.0475s] [ 82%] 2025-12-04T15:22:22.8529675Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv_transpose2d_cuda_float32 PASSED [0.0540s] [ 82%] 2025-12-04T15:22:22.8529834Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_cosine_similarity_cuda_float32 PASSED [0.2887s] [ 82%] 2025-12-04T15:22:22.8529985Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_dropout3d_cuda_float32 PASSED [0.0411s] [ 82%] 2025-12-04T15:22:22.8530216Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [0.9476s] [ 82%] 2025-12-04T15:22:22.8530370Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_group_norm_cuda_float32 PASSED [0.2796s] [ 82%] 2025-12-04T15:22:22.8530521Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardshrink_cuda_float32 PASSED [0.0142s] [ 83%] 2025-12-04T15:22:22.8530670Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardswish_cuda_float32 PASSED [0.9538s] [ 83%] 2025-12-04T15:22:22.8530828Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_area_cuda_float32 PASSED [0.0619s] [ 83%] 2025-12-04T15:22:22.8530993Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_bicubic_cuda_float32 PASSED [1.0969s] [ 83%] 2025-12-04T15:22:22.8531158Z 
test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_bilinear_cuda_float32 PASSED [0.2777s] [ 83%] 2025-12-04T15:22:22.8531324Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_layer_norm_cuda_float32 PASSED [0.0476s] [ 83%] 2025-12-04T15:22:22.8531469Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_mish_cuda_float32 PASSED [0.0153s] [ 83%] 2025-12-04T15:22:22.8531636Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multilabel_margin_loss_cuda_float32 PASSED [0.0601s] [ 83%] 2025-12-04T15:22:22.8531809Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.1012s] [ 83%] 2025-12-04T15:22:22.8531974Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_replicate_negative_cuda_float32 PASSED [0.0126s] [ 83%] 2025-12-04T15:22:22.8532129Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pixel_shuffle_cuda_float32 PASSED [0.0102s] [ 83%] 2025-12-04T15:22:22.8532304Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_scaled_dot_product_attention_cuda_float32 PASSED [0.6949s] [ 83%] 2025-12-04T15:22:22.8532454Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softplus_cuda_float32 PASSED [0.0121s] [ 83%] 2025-12-04T15:22:22.8532602Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softsign_cuda_float32 PASSED [0.0280s] [ 83%] 2025-12-04T15:22:22.8532728Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_cuda_float32 PASSED [0.2603s] [ 83%] 2025-12-04T15:22:22.8532885Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_fro_cuda_float32 PASSED [0.0190s] [ 83%] 2025-12-04T15:22:22.8533015Z 
test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_inf_cuda_float32 PASSED [0.0331s] [ 83%] 2025-12-04T15:22:22.8533154Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_normal_cuda_float32 PASSED [0.9648s] [ 83%] 2025-12-04T15:22:22.8533279Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_put_cuda_float32 PASSED [0.0847s] [ 83%] 2025-12-04T15:22:22.8533408Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rad2deg_cuda_float32 PASSED [0.0043s] [ 83%] 2025-12-04T15:22:22.8533534Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_renorm_cuda_float32 PASSED [0.0468s] [ 83%] 2025-12-04T15:22:22.8533676Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_repeat_interleave_cuda_float32 PASSED [0.0290s] [ 83%] 2025-12-04T15:22:22.8533802Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rsqrt_cuda_float32 PASSED [0.0118s] [ 83%] 2025-12-04T15:22:22.8533931Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_cuda_float32 PASSED [0.0244s] [ 83%] 2025-12-04T15:22:22.8534077Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_amin_cuda_float32 PASSED [0.2225s] [ 83%] 2025-12-04T15:22:22.8534216Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_select_scatter_cuda_float32 PASSED [0.0167s] [ 83%] 2025-12-04T15:22:22.8534342Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sign_cuda_float32 PASSED [0.0040s] [ 83%] 2025-12-04T15:22:22.8534464Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sin_cuda_float32 PASSED [0.0045s] [ 83%] 2025-12-04T15:22:22.8534647Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_slice_scatter_cuda_float32 PASSED [0.0271s] [ 83%] 2025-12-04T15:22:22.8534775Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_softmax_cuda_float32 PASSED [0.0247s] [ 83%] 
2025-12-04T15:22:22.8534909Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_i1_cuda_float32 PASSED [0.0248s] [ 83%] 2025-12-04T15:22:22.8535048Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_log_ndtr_cuda_float32 PASSED [0.9670s] [ 83%] 2025-12-04T15:22:22.8535175Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_cuda_float32 PASSED [0.0095s] [ 84%] 2025-12-04T15:22:22.8535326Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_with_sizes_cuda_float32 PASSED [0.0129s] [ 84%] 2025-12-04T15:22:22.8535466Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_squeeze_multiple_cuda_float32 PASSED [0.9524s] [ 84%] 2025-12-04T15:22:22.8535593Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_stack_cuda_float32 PASSED [0.0288s] [ 84%] 2025-12-04T15:22:22.8535722Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_mean_cuda_float32 PASSED [0.1344s] [ 84%] 2025-12-04T15:22:22.8535865Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_mean_unbiased_cuda_float32 PASSED [0.0198s] [ 84%] 2025-12-04T15:22:22.8535991Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_stft_cuda_float32 PASSED [0.1961s] [ 84%] 2025-12-04T15:22:22.8536124Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sum_to_size_cuda_float32 PASSED [0.0254s] [ 84%] 2025-12-04T15:22:22.8536250Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_svd_cuda_float32 PASSED [7.8279s] [ 84%] 2025-12-04T15:22:22.8536390Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_take_along_dim_cuda_float32 PASSED [0.0266s] [ 84%] 2025-12-04T15:22:22.8536514Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_topk_cuda_float32 PASSED [0.0333s] [ 84%] 2025-12-04T15:22:22.8536640Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_trace_cuda_float32 
PASSED [0.0055s] [ 84%] 2025-12-04T15:22:22.8536801Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unfold_copy_cuda_float32 PASSED [0.0865s] [ 84%] 2025-12-04T15:22:22.8536939Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsqueeze_copy_cuda_float32 PASSED [0.0170s] [ 84%] 2025-12-04T15:22:22.8537080Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vsplit_cuda_float32 PASSED [0.9701s] [ 84%] 2025-12-04T15:22:22.8537207Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vstack_cuda_float32 PASSED [0.0160s] [ 84%] 2025-12-04T15:22:22.8537308Z test_ops.py::TestFakeTensorCUDA::test_fake_cumprod_cuda_float32 PASSED [0.0114s] [ 84%] 2025-12-04T15:22:22.8537406Z test_ops.py::TestFakeTensorCUDA::test_fake_cumsum_cuda_float32 PASSED [0.9574s] [ 84%] 2025-12-04T15:22:22.8537506Z test_ops.py::TestFakeTensorCUDA::test_fake_diagonal_cuda_float32 PASSED [0.0139s] [ 84%] 2025-12-04T15:22:22.8537602Z test_ops.py::TestFakeTensorCUDA::test_fake_dist_cuda_float32 PASSED [0.0607s] [ 84%] 2025-12-04T15:22:22.8537700Z test_ops.py::TestFakeTensorCUDA::test_fake_dstack_cuda_float32 PASSED [0.0058s] [ 84%] 2025-12-04T15:22:22.8537797Z test_ops.py::TestFakeTensorCUDA::test_fake_empty_cuda_float32 PASSED [0.0051s] [ 84%] 2025-12-04T15:22:22.8537893Z test_ops.py::TestFakeTensorCUDA::test_fake_equal_cuda_float32 PASSED [0.9508s] [ 84%] 2025-12-04T15:22:22.8537988Z test_ops.py::TestFakeTensorCUDA::test_fake_erfinv_cuda_float32 PASSED [0.0046s] [ 84%] 2025-12-04T15:22:22.8538083Z test_ops.py::TestFakeTensorCUDA::test_fake_exp_cuda_float32 PASSED [0.0049s] [ 84%] 2025-12-04T15:22:22.8538184Z test_ops.py::TestFakeTensorCUDA::test_fake_expand_as_cuda_float32 PASSED [0.0046s] [ 84%] 2025-12-04T15:22:22.8538281Z test_ops.py::TestFakeTensorCUDA::test_fake_expand_cuda_float32 PASSED [0.0082s] [ 84%] 2025-12-04T15:22:22.8538387Z test_ops.py::TestFakeTensorCUDA::test_fake_exponential_cuda_float32 PASSED [0.0061s] [ 84%] 
2025-12-04T15:22:22.8538483Z test_ops.py::TestFakeTensorCUDA::test_fake_eye_cuda_float32 PASSED [0.0593s] [ 84%] 2025-12-04T15:22:22.8538577Z test_ops.py::TestFakeTensorCUDA::test_fake_frexp_cuda_float32 PASSED [0.0046s] [ 84%] 2025-12-04T15:22:22.8538674Z test_ops.py::TestFakeTensorCUDA::test_fake_gather_cuda_float32 PASSED [0.0084s] [ 84%] 2025-12-04T15:22:22.8538799Z test_ops.py::TestFakeTensorCUDA::test_fake_grid_sampler_3d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 85%] 2025-12-04T15:22:22.8538902Z test_ops.py::TestFakeTensorCUDA::test_fake_i0_cuda_float32 PASSED [0.0035s] [ 85%] 2025-12-04T15:22:22.8539001Z test_ops.py::TestFakeTensorCUDA::test_fake_imag_cuda_complex64 PASSED [0.0051s] [ 85%] 2025-12-04T15:22:22.8539112Z test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_amax_cuda_float32 PASSED [0.0090s] [ 85%] 2025-12-04T15:22:22.8539213Z test_ops.py::TestFakeTensorCUDA::test_fake_isposinf_cuda_float32 PASSED [0.0029s] [ 85%] 2025-12-04T15:22:22.8539371Z test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_4inputs_with_extra_args_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 85%] 2025-12-04T15:22:22.8539512Z test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_binary_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 85%] 2025-12-04T15:22:22.8539622Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cholesky_cuda_float32 PASSED [0.0178s] [ 85%] 2025-12-04T15:22:22.8539727Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cond_cuda_float32 PASSED [0.0094s] [ 85%] 2025-12-04T15:22:22.8539831Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cross_cuda_float32 PASSED [0.0045s] [ 85%] 2025-12-04T15:22:22.8539968Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eigvals_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 85%] 2025-12-04T15:22:22.8540205Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_factor_cuda_float32 PASSED [0.0064s] [ 85%] 2025-12-04T15:22:22.8540320Z 
test_ops.py::TestFakeTensorCUDA::test_fake_linalg_matrix_rank_cuda_float32 PASSED [0.1545s] [ 85%] 2025-12-04T15:22:22.8540500Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_matrix_rank_hermitian_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 85%] 2025-12-04T15:22:22.8540695Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_pinv_singular_cuda_float32 SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 85%] 2025-12-04T15:22:22.8540814Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_svd_cuda_float32 PASSED [0.2342s] [ 85%] 2025-12-04T15:22:22.8540957Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_tensorsolve_cuda_float32 SKIPPED [0.0011s] (Skip failing test) [ 85%] 2025-12-04T15:22:22.8541053Z test_ops.py::TestFakeTensorCUDA::test_fake_log1p_cuda_float32 PASSED [0.0032s] [ 85%] 2025-12-04T15:22:22.8541147Z test_ops.py::TestFakeTensorCUDA::test_fake_log_cuda_float32 PASSED [0.0043s] [ 85%] 2025-12-04T15:22:22.8541243Z test_ops.py::TestFakeTensorCUDA::test_fake_logdet_cuda_float32 PASSED [0.0191s] [ 85%] 2025-12-04T15:22:22.8541335Z test_ops.py::TestFakeTensorCUDA::test_fake_lu_cuda_float32 PASSED [0.0415s] [ 85%] 2025-12-04T15:22:22.8541463Z test_ops.py::TestFakeTensorCUDA::test_fake_lu_solve_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 85%] 2025-12-04T15:22:22.8541555Z test_ops.py::TestFakeTensorCUDA::test_fake_mT_cuda_float32 PASSED [0.0054s] [ 85%] 2025-12-04T15:22:22.8541663Z test_ops.py::TestFakeTensorCUDA::test_fake_masked_argmax_cuda_float32 PASSED [0.0917s] [ 85%] 2025-12-04T15:22:22.8541766Z test_ops.py::TestFakeTensorCUDA::test_fake_masked_fill_cuda_float32 PASSED [0.0107s] [ 85%] 2025-12-04T15:22:22.8541879Z test_ops.py::TestFakeTensorCUDA::test_fake_masked_logsumexp_cuda_float32 PASSED [0.1725s] [ 85%] 2025-12-04T15:22:22.8541985Z test_ops.py::TestFakeTensorCUDA::test_fake_masked_scatter_cuda_float32 PASSED [0.0064s] [ 85%] 2025-12-04T15:22:22.8542092Z 
test_ops.py::TestFakeTensorCUDA::test_fake_masked_softmax_cuda_float32 PASSED [0.0331s] [ 85%] 2025-12-04T15:22:22.8542192Z test_ops.py::TestFakeTensorCUDA::test_fake_masked_std_cuda_float32 PASSED [0.2803s] [ 85%] 2025-12-04T15:22:22.8542289Z test_ops.py::TestFakeTensorCUDA::test_fake_matmul_cuda_float32 PASSED [0.0308s] [ 85%] 2025-12-04T15:22:22.8542419Z test_ops.py::TestFakeTensorCUDA::test_fake_max_pool2d_with_indices_backward_cuda_float32 PASSED [1.8440s] [ 85%] 2025-12-04T15:22:22.8542536Z test_ops.py::TestFakeTensorCUDA::test_fake_max_reduction_no_dim_cuda_float32 PASSED [0.0038s] [ 85%] 2025-12-04T15:22:22.8542634Z test_ops.py::TestFakeTensorCUDA::test_fake_median_cuda_float32 PASSED [0.0119s] [ 86%] 2025-12-04T15:22:22.8542768Z test_ops.py::TestFakeTensorCUDA::test_fake_min_reduction_with_dim_cuda_float32 PASSED [0.0052s] [ 86%] 2025-12-04T15:22:22.8542862Z test_ops.py::TestFakeTensorCUDA::test_fake_mm_cuda_float32 PASSED [0.0045s] [ 86%] 2025-12-04T15:22:22.8542958Z test_ops.py::TestFakeTensorCUDA::test_fake_msort_cuda_float32 PASSED [0.0038s] [ 86%] 2025-12-04T15:22:22.8543103Z test_ops.py::TestFakeTensorCUDA::test_fake_mvlgamma_mvlgamma_p_1_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 86%] 2025-12-04T15:22:22.8543246Z test_ops.py::TestFakeTensorCUDA::test_fake_mvlgamma_mvlgamma_p_5_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 86%] 2025-12-04T15:22:22.8543350Z test_ops.py::TestFakeTensorCUDA::test_fake_narrow_copy_cuda_float32 PASSED [0.9765s] [ 86%] 2025-12-04T15:22:22.8543460Z test_ops.py::TestFakeTensorCUDA::test_fake_native_layer_norm_cuda_float32 PASSED [0.0572s] [ 86%] 2025-12-04T15:22:22.8543554Z test_ops.py::TestFakeTensorCUDA::test_fake_ne_cuda_float32 PASSED [0.0100s] [ 86%] 2025-12-04T15:22:22.8543648Z test_ops.py::TestFakeTensorCUDA::test_fake_neg_cuda_float32 PASSED [0.0029s] [ 86%] 2025-12-04T15:22:22.8543780Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_avg_pool1d_cuda_float32 PASSED [0.0090s] [ 86%] 
2025-12-04T15:22:22.8543900Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_batch_norm_cuda_float32 PASSED [0.0289s] [ 86%] 2025-12-04T15:22:22.8544013Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_celu_cuda_float32 PASSED [0.0048s] [ 86%] 2025-12-04T15:22:22.8544161Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_cosine_similarity_cuda_float32 PASSED [0.0356s] [ 86%] 2025-12-04T15:22:22.8544280Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_dropout3d_cuda_float32 PASSED [0.0180s] [ 86%] 2025-12-04T15:22:22.8544409Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_embedding_cuda_float32 PASSED [0.0117s] [ 86%] 2025-12-04T15:22:22.8544536Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_gaussian_nll_loss_cuda_float32 PASSED [0.4292s] [ 86%] 2025-12-04T15:22:22.8544647Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_glu_cuda_float32 PASSED [0.0383s] [ 86%] 2025-12-04T15:22:22.8544766Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_grid_sample_cuda_float32 PASSED [0.9758s] [ 86%] 2025-12-04T15:22:22.8544883Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardtanh_cuda_float32 PASSED [0.0083s] [ 86%] 2025-12-04T15:22:22.8545005Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_instance_norm_cuda_float32 PASSED [0.0524s] [ 86%] 2025-12-04T15:22:22.8545125Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_layer_norm_cuda_float32 PASSED [0.0150s] [ 86%] 2025-12-04T15:22:22.8545256Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_margin_ranking_loss_cuda_float32 PASSED [0.0539s] [ 86%] 2025-12-04T15:22:22.8545369Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_mish_cuda_float32 PASSED [0.0051s] [ 86%] 2025-12-04T15:22:22.8545485Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_mse_loss_cuda_float32 PASSED [0.0125s] [ 86%] 2025-12-04T15:22:22.8545628Z 
test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.0268s] [ 86%] 2025-12-04T15:22:22.8545767Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_one_hot_cuda_int64 SKIPPED [0.0012s] (Skip failing test) [ 86%] 2025-12-04T15:22:22.8545889Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_constant_cuda_float32 PASSED [0.0359s] [ 86%] 2025-12-04T15:22:22.8546009Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_reflect_cuda_float32 PASSED [0.0103s] [ 86%] 2025-12-04T15:22:22.8546137Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pixel_unshuffle_cuda_float32 PASSED [0.0058s] [ 86%] 2025-12-04T15:22:22.8546251Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_relu_cuda_float32 PASSED [0.0055s] [ 86%] 2025-12-04T15:22:22.8546365Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_rrelu_cuda_float32 PASSED [0.0080s] [ 87%] 2025-12-04T15:22:22.8546519Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_scaled_dot_product_attention_cuda_float32 PASSED [0.1341s] [ 87%] 2025-12-04T15:22:22.8546645Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_silu_complex_cuda_complex64 PASSED [0.0042s] [ 87%] 2025-12-04T15:22:22.8546765Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softshrink_cuda_float32 PASSED [0.0073s] [ 87%] 2025-12-04T15:22:22.8546891Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_upsample_nearest_cuda_float32 PASSED [0.0103s] [ 87%] 2025-12-04T15:22:22.8546991Z test_ops.py::TestFakeTensorCUDA::test_fake_norm_cuda_float32 PASSED [0.0382s] [ 87%] 2025-12-04T15:22:22.8547118Z test_ops.py::TestFakeTensorCUDA::test_fake_normal_number_mean_cuda_float32 SKIPPED [0.0001s] (Skipped!) 
[ 87%] 2025-12-04T15:22:22.8547238Z test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_1_cuda_float32 PASSED [0.0087s] [ 87%] 2025-12-04T15:22:22.8547356Z test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_2_cuda_float32 PASSED [0.0087s] [ 87%] 2025-12-04T15:22:22.8547474Z test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0087s] [ 87%] 2025-12-04T15:22:22.8547569Z test_ops.py::TestFakeTensorCUDA::test_fake_prod_cuda_float32 PASSED [0.0283s] [ 87%] 2025-12-04T15:22:22.8547696Z test_ops.py::TestFakeTensorCUDA::test_fake_quantile_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 87%] 2025-12-04T15:22:22.8547795Z test_ops.py::TestFakeTensorCUDA::test_fake_randint_cuda_float32 PASSED [1.0004s] [ 87%] 2025-12-04T15:22:22.8547922Z test_ops.py::TestFakeTensorCUDA::test_fake_randint_like_cuda_float32 PASSED [0.0198s] [ 87%] 2025-12-04T15:22:22.8548025Z test_ops.py::TestFakeTensorCUDA::test_fake_randn_like_cuda_float32 PASSED [0.0114s] [ 87%] 2025-12-04T15:22:22.8548129Z test_ops.py::TestFakeTensorCUDA::test_fake_real_cuda_float32 PASSED [0.0035s] [ 87%] 2025-12-04T15:22:22.8548230Z test_ops.py::TestFakeTensorCUDA::test_fake_resize_as__cuda_float32 PASSED [0.0048s] [ 87%] 2025-12-04T15:22:22.8548335Z test_ops.py::TestFakeTensorCUDA::test_fake_resolve_neg_cuda_float32 PASSED [0.0030s] [ 87%] 2025-12-04T15:22:22.8548433Z test_ops.py::TestFakeTensorCUDA::test_fake_round_cuda_float32 PASSED [0.9600s] [ 87%] 2025-12-04T15:22:22.8548552Z test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_nuttall_cuda_float32 PASSED [0.0262s] [ 87%] 2025-12-04T15:22:22.8548652Z test_ops.py::TestFakeTensorCUDA::test_fake_signbit_cuda_float32 PASSED [0.0031s] [ 87%] 2025-12-04T15:22:22.8548746Z test_ops.py::TestFakeTensorCUDA::test_fake_sinc_cuda_float32 PASSED [0.0051s] [ 87%] 2025-12-04T15:22:22.8548843Z test_ops.py::TestFakeTensorCUDA::test_fake_slice_cuda_float32 PASSED [0.0051s] [ 87%] 2025-12-04T15:22:22.8548949Z 
test_ops.py::TestFakeTensorCUDA::test_fake_slice_scatter_cuda_float32 PASSED [0.0091s] [ 87%] 2025-12-04T15:22:22.8549091Z test_ops.py::TestFakeTensorCUDA::test_fake_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 87%] 2025-12-04T15:22:22.8549200Z test_ops.py::TestFakeTensorCUDA::test_fake_special_airy_ai_cuda_float32 PASSED [0.9802s] [ 87%] 2025-12-04T15:22:22.8549330Z test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_t_cuda_float32 PASSED [0.0117s] [ 87%] 2025-12-04T15:22:22.8549457Z test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_w_cuda_float32 PASSED [0.0117s] [ 87%] 2025-12-04T15:22:22.8549584Z test_ops.py::TestFakeTensorCUDA::test_fake_special_hermite_polynomial_he_cuda_float32 PASSED [0.0095s] [ 87%] 2025-12-04T15:22:22.8549689Z test_ops.py::TestFakeTensorCUDA::test_fake_special_i0e_cuda_float32 PASSED [0.0036s] [ 87%] 2025-12-04T15:22:22.8549791Z test_ops.py::TestFakeTensorCUDA::test_fake_special_i1_cuda_float32 PASSED [0.0035s] [ 87%] 2025-12-04T15:22:22.8549915Z test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_i0_cuda_float32 PASSED [0.0049s] [ 87%] 2025-12-04T15:22:22.8550066Z test_ops.py::TestFakeTensorCUDA::test_fake_special_shifted_chebyshev_polynomial_t_cuda_float32 PASSED [0.0107s] [ 88%] 2025-12-04T15:22:22.8550209Z test_ops.py::TestFakeTensorCUDA::test_fake_split_list_args_cuda_float32 PASSED [0.0045s] [ 88%] 2025-12-04T15:22:22.8550319Z test_ops.py::TestFakeTensorCUDA::test_fake_squeeze_multiple_cuda_float32 PASSED [0.0061s] [ 88%] 2025-12-04T15:22:22.8550417Z test_ops.py::TestFakeTensorCUDA::test_fake_stack_cuda_float32 PASSED [0.0098s] [ 88%] 2025-12-04T15:22:22.8550519Z test_ops.py::TestFakeTensorCUDA::test_fake_sum_to_size_cuda_float32 PASSED [0.0129s] [ 88%] 2025-12-04T15:22:22.8550652Z test_ops.py::TestFakeTensorCUDA::test_fake_tensor_split_cuda_float32 SKIPPED [0.0020s] (Skip failing test) [ 88%] 2025-12-04T15:22:22.8550777Z 
test_ops.py::TestFakeTensorCUDA::test_fake_to_sparse_cuda_float32 SKIPPED [0.0016s] (Skip failing test) [ 88%] 2025-12-04T15:22:22.8550933Z test_ops.py::TestFakeTensorCUDA::test_fake_torch__scaled_mm_v2_cuda_float8_e4m3fn SKIPPED [0.0011s] (Requires CUDA SM >= 8.9) [ 88%] 2025-12-04T15:22:22.8551029Z test_ops.py::TestFakeTensorCUDA::test_fake_trace_cuda_float32 PASSED [0.9837s] [ 88%] 2025-12-04T15:22:22.8551138Z test_ops.py::TestFakeTensorCUDA::test_fake_transpose_copy_cuda_float32 PASSED [0.0096s] [ 88%] 2025-12-04T15:22:22.8551239Z test_ops.py::TestFakeTensorCUDA::test_fake_trapezoid_cuda_float32 PASSED [0.9947s] [ 88%] 2025-12-04T15:22:22.8551334Z test_ops.py::TestFakeTensorCUDA::test_fake_tril_cuda_float32 PASSED [0.0145s] [ 88%] 2025-12-04T15:22:22.8551435Z test_ops.py::TestFakeTensorCUDA::test_fake_true_divide_cuda_float32 PASSED [0.0110s] [ 88%] 2025-12-04T15:22:22.8551585Z test_ops.py::TestFakeTensorCUDA::test_fake_unique_consecutive_cuda_float32 PASSED [0.1613s] [ 88%] 2025-12-04T15:22:22.8551688Z test_ops.py::TestFakeTensorCUDA::test_fake_unravel_index_cuda_int64 PASSED [0.0372s] [ 88%] 2025-12-04T15:22:22.8551806Z test_ops.py::TestFakeTensorCUDA::test_fake_unsafe_split_cuda_float32 PASSED [0.0042s] [ 88%] 2025-12-04T15:22:22.8551913Z test_ops.py::TestFakeTensorCUDA::test_fake_unsqueeze_copy_cuda_float32 PASSED [0.0080s] [ 88%] 2025-12-04T15:22:22.8552011Z test_ops.py::TestFakeTensorCUDA::test_fake_var_mean_cuda_float32 PASSED [0.0147s] [ 88%] 2025-12-04T15:22:22.8552120Z test_ops.py::TestFakeTensorCUDA::test_fake_var_mean_unbiased_cuda_float32 PASSED [0.0039s] [ 88%] 2025-12-04T15:22:22.8552226Z test_ops.py::TestFakeTensorCUDA::test_fake_view_as_complex_cuda_float32 PASSED [0.0027s] [ 88%] 2025-12-04T15:22:22.8552333Z test_ops.py::TestFakeTensorCUDA::test_fake_view_as_real_cuda_complex64 PASSED [0.0034s] [ 88%] 2025-12-04T15:22:22.8552428Z test_ops.py::TestFakeTensorCUDA::test_fake_zero__cuda_float32 PASSED [0.0045s] [ 88%] 2025-12-04T15:22:22.8552545Z 
test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmatmul___cuda_float32 PASSED [0.0417s] [ 88%] 2025-12-04T15:22:22.8552655Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmod___cuda_float32 PASSED [0.0117s] [ 88%] 2025-12-04T15:22:22.8552765Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmul___cuda_float32 PASSED [0.0117s] [ 88%] 2025-12-04T15:22:22.8552896Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__segment_reduce_lengths_cuda_float32 PASSED [0.0751s] [ 88%] 2025-12-04T15:22:22.8553047Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__unsafe_masked_index_put_accumulate_cuda_float32 PASSED [0.0352s] [ 88%] 2025-12-04T15:22:22.8553154Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmm_cuda_float32 PASSED [0.0188s] [ 88%] 2025-12-04T15:22:22.8553262Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmv_cuda_float32 PASSED [0.0226s] [ 88%] 2025-12-04T15:22:22.8553373Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_allclose_cuda_float32 PASSED [0.0145s] [ 88%] 2025-12-04T15:22:22.8553512Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_aminmax_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 88%] 2025-12-04T15:22:22.8553622Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_arange_cuda_float32 PASSED [0.0176s] [ 88%] 2025-12-04T15:22:22.8553733Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_argsort_cuda_float32 PASSED [0.0340s] [ 89%] 2025-12-04T15:22:22.8553863Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_as_strided_cuda_float32 PASSED [0.0062s] [ 89%] 2025-12-04T15:22:22.8553986Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_as_strided_scatter_cuda_float32 PASSED [0.0079s] [ 89%] 2025-12-04T15:22:22.8554094Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atan2_cuda_float32 PASSED [0.0117s] [ 89%] 2025-12-04T15:22:22.8554200Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atanh_cuda_float32 PASSED [0.9752s] [ 89%] 2025-12-04T15:22:22.8554315Z 
test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_not_cuda_int64 PASSED [0.0063s] [ 89%] 2025-12-04T15:22:22.8554425Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_or_cuda_int64 PASSED [0.0116s] [ 89%] 2025-12-04T15:22:22.8554538Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_xor_cuda_int64 PASSED [0.0115s] [ 89%] 2025-12-04T15:22:22.8554654Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_broadcast_to_cuda_float32 PASSED [0.0074s] [ 89%] 2025-12-04T15:22:22.8554767Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bucketize_cuda_float32 PASSED [0.0221s] [ 89%] 2025-12-04T15:22:22.8554878Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cholesky_cuda_float32 PASSED [0.0140s] [ 89%] 2025-12-04T15:22:22.8554982Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cos_cuda_float32 PASSED [0.0046s] [ 89%] 2025-12-04T15:22:22.8555098Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_count_nonzero_cuda_float32 PASSED [0.0194s] [ 89%] 2025-12-04T15:22:22.8555250Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumulative_trapezoid_cuda_float32 PASSED [0.0439s] [ 89%] 2025-12-04T15:22:22.8555357Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diag_cuda_float32 PASSED [0.0134s] [ 89%] 2025-12-04T15:22:22.8555494Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_no_rounding_mode_cuda_float32 PASSED [0.0123s] [ 89%] 2025-12-04T15:22:22.8555609Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_like_cuda_float32 PASSED [0.0079s] [ 89%] 2025-12-04T15:22:22.8555718Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fft2_cuda_float32 PASSED [0.0130s] [ 89%] 2025-12-04T15:22:22.8555828Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fftn_cuda_float32 PASSED [0.0159s] [ 89%] 2025-12-04T15:22:22.8555939Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ihfft2_cuda_float32 PASSED [0.0130s] [ 89%] 2025-12-04T15:22:22.8556054Z 
test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_irfft_cuda_float32 PASSED [0.0136s] [ 89%] 2025-12-04T15:22:22.8556162Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fill_cuda_float32 PASSED [0.0043s] [ 89%] 2025-12-04T15:22:22.8556271Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_float_cuda_float32 PASSED [0.9765s] [ 89%] 2025-12-04T15:22:22.8556378Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_frexp_cuda_float32 PASSED [0.0070s] [ 89%] 2025-12-04T15:22:22.8556491Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_full_like_cuda_float32 PASSED [0.0087s] [ 89%] 2025-12-04T15:22:22.8556598Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_geqrf_cuda_float32 PASSED [0.0515s] [ 89%] 2025-12-04T15:22:22.8556719Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_grid_sampler_2d_cuda_float32 PASSED [2.3797s] [ 89%] 2025-12-04T15:22:22.8556828Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_imag_cuda_complex64 PASSED [0.0061s] [ 89%] 2025-12-04T15:22:22.8556941Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_put_cuda_float32 PASSED [0.0063s] [ 89%] 2025-12-04T15:22:22.8557066Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_reduce_prod_cuda_float32 PASSED [0.0096s] [ 89%] 2025-12-04T15:22:22.8557177Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isclose_cuda_float32 PASSED [0.1186s] [ 89%] 2025-12-04T15:22:22.8557288Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isposinf_cuda_float32 PASSED [0.9688s] [ 90%] 2025-12-04T15:22:22.8557462Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_2inputs_2outputs_cuda_float32 SKIPPED [0.0016s] (Skip failing test) [ 90%] 2025-12-04T15:22:22.8557613Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_unary_cuda_float32 SKIPPED [0.0013s] (Skip failing test) [ 90%] 2025-12-04T15:22:22.8557716Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lcm_cuda_int64 PASSED [0.0183s] [ 90%] 2025-12-04T15:22:22.8557825Z 
test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ldexp_cuda_float32 PASSED [0.0167s] [ 90%] 2025-12-04T15:22:22.8557938Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_inv_cuda_float32 PASSED [0.0131s] [ 90%] 2025-12-04T15:22:22.8558055Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_inv_ex_cuda_float32 PASSED [0.0105s] [ 90%] 2025-12-04T15:22:22.8558171Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lstsq_cuda_float32 PASSED [0.9823s] [ 90%] 2025-12-04T15:22:22.8558283Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_cuda_float32 PASSED [0.0392s] [ 90%] 2025-12-04T15:22:22.8558405Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_factor_cuda_float32 PASSED [0.0645s] [ 90%] 2025-12-04T15:22:22.8558525Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_solve_cuda_float32 PASSED [0.1082s] [ 90%] 2025-12-04T15:22:22.8558649Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_matrix_norm_cuda_float32 PASSED [0.1102s] [ 90%] 2025-12-04T15:22:22.8558794Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_norm_subgradients_at_zero_cuda_float32 PASSED [0.1199s] [ 90%] 2025-12-04T15:22:22.8558927Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_pinv_cuda_float32 PASSED [0.0344s] [ 90%] 2025-12-04T15:22:22.8559047Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_solve_ex_cuda_float32 PASSED [0.0354s] [ 90%] 2025-12-04T15:22:22.8559172Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_vander_cuda_float32 PASSED [0.0301s] [ 90%] 2025-12-04T15:22:22.8559281Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log1p_cuda_float32 PASSED [0.9847s] [ 90%] 2025-12-04T15:22:22.8559391Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logdet_cuda_float32 PASSED [0.0209s] [ 90%] 2025-12-04T15:22:22.8559503Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logspace_cuda_float32 PASSED [0.1476s] [ 90%] 2025-12-04T15:22:22.8559607Z 
test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lt_cuda_float32 PASSED [0.0125s] [ 90%] 2025-12-04T15:22:22.8559710Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lu_cuda_float32 PASSED [0.0450s] [ 90%] 2025-12-04T15:22:22.8559814Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mT_cuda_float32 PASSED [0.0058s] [ 90%] 2025-12-04T15:22:22.8559929Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_norm_cuda_float32 PASSED [0.4852s] [ 90%] 2025-12-04T15:22:22.8560043Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_var_cuda_float32 PASSED [0.0867s] [ 90%] 2025-12-04T15:22:22.8560202Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_reduction_no_dim_cuda_float32 PASSED [0.0036s] [ 90%] 2025-12-04T15:22:22.8560331Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_reduction_with_dim_cuda_float32 PASSED [0.0053s] [ 90%] 2025-12-04T15:22:22.8560440Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mean_cuda_float32 PASSED [0.9951s] [ 90%] 2025-12-04T15:22:22.8560549Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_median_cuda_float32 PASSED [0.0139s] [ 90%] 2025-12-04T15:22:22.8560652Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mm_cuda_float32 PASSED [0.0048s] [ 90%] 2025-12-04T15:22:22.8560761Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_msort_cuda_float32 PASSED [0.0042s] [ 90%] 2025-12-04T15:22:22.8560917Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mvlgamma_mvlgamma_p_5_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 90%] 2025-12-04T15:22:22.8561033Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanmedian_cuda_float32 PASSED [0.0119s] [ 90%] 2025-12-04T15:22:22.8561191Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanquantile_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 91%] 2025-12-04T15:22:22.8561295Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ne_cuda_float32 PASSED [0.0118s] [ 91%] 2025-12-04T15:22:22.8561409Z 
test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nextafter_cuda_float32 PASSED [0.0114s] [ 91%] 2025-12-04T15:22:22.8561553Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_max_pool2d_cuda_float32 PASSED [0.0221s] [ 91%] 2025-12-04T15:22:22.8561685Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_avg_pool1d_cuda_float32 PASSED [0.0168s] [ 91%] 2025-12-04T15:22:22.8561827Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose2d_cuda_float32 PASSED [0.0198s] [ 91%] 2025-12-04T15:22:22.8561952Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_elu_cuda_float32 PASSED [0.9824s] [ 91%] 2025-12-04T15:22:22.8562084Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_grid_sample_cuda_float32 PASSED [1.9133s] [ 91%] 2025-12-04T15:22:22.8562215Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardswish_cuda_float32 PASSED [0.0157s] [ 91%] 2025-12-04T15:22:22.8562347Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_huber_loss_cuda_float32 PASSED [0.0182s] [ 91%] 2025-12-04T15:22:22.8562507Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_pool1d_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 91%] 2025-12-04T15:22:22.8562647Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool1d_grad_cuda_float32 PASSED [0.1042s] [ 91%] 2025-12-04T15:22:22.8562809Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool2d_cuda_float32 PASSED [1.6481s] [ 91%] 2025-12-04T15:22:22.8562962Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool3d_grad_cuda_float32 PASSED [0.0433s] [ 91%] 2025-12-04T15:22:22.8563095Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_constant_cuda_float32 PASSED [0.0373s] [ 91%] 2025-12-04T15:22:22.8563228Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_reflect_cuda_float32 PASSED [0.0108s] 
[ 91%] 2025-12-04T15:22:22.8563350Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_relu_cuda_float32 PASSED [0.9900s] [ 91%] 2025-12-04T15:22:22.8563477Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_rms_norm_cuda_float32 PASSED [0.0254s] [ 91%] 2025-12-04T15:22:22.8563600Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_silu_cuda_float32 PASSED [0.0046s] [ 91%] 2025-12-04T15:22:22.8563739Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_smooth_l1_loss_cuda_float32 PASSED [0.0353s] [ 91%] 2025-12-04T15:22:22.8563882Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softmin_with_dtype_cuda_float32 PASSED [0.0157s] [ 91%] 2025-12-04T15:22:22.8564014Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softshrink_cuda_float32 PASSED [0.9943s] [ 91%] 2025-12-04T15:22:22.8564158Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_upsample_bilinear_cuda_float32 PASSED [0.0621s] [ 91%] 2025-12-04T15:22:22.8564268Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_cuda_float32 PASSED [0.0481s] [ 91%] 2025-12-04T15:22:22.8564379Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_fro_cuda_float32 PASSED [0.0056s] [ 91%] 2025-12-04T15:22:22.8564494Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pca_lowrank_cuda_float32 PASSED [0.0448s] [ 91%] 2025-12-04T15:22:22.8564606Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pinverse_cuda_float32 PASSED [0.0127s] [ 91%] 2025-12-04T15:22:22.8564738Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_0_cuda_float32 PASSED [0.0097s] [ 91%] 2025-12-04T15:22:22.8564870Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_1_cuda_float32 PASSED [0.9998s] [ 91%] 2025-12-04T15:22:22.8564994Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_prod_cuda_float32 PASSED [0.0330s] [ 91%] 2025-12-04T15:22:22.8565107Z 
test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rad2deg_cuda_float32 PASSED [0.9937s] [ 91%] 2025-12-04T15:22:22.8565220Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randn_like_cuda_float32 PASSED [0.0151s] [ 92%] 2025-12-04T15:22:22.8565335Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reciprocal_cuda_float32 PASSED [0.0048s] [ 92%] 2025-12-04T15:22:22.8565447Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reshape_as_cuda_float32 PASSED [0.0067s] [ 92%] 2025-12-04T15:22:22.8565562Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resize_as__cuda_float32 PASSED [0.0052s] [ 92%] 2025-12-04T15:22:22.8565677Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scalar_tensor_cuda_float32 PASSED [0.0033s] [ 92%] 2025-12-04T15:22:22.8565788Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sigmoid_cuda_float32 PASSED [0.0043s] [ 92%] 2025-12-04T15:22:22.8565895Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sinc_cuda_float32 PASSED [0.0060s] [ 92%] 2025-12-04T15:22:22.8566003Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sinh_cuda_float32 PASSED [0.9842s] [ 92%] 2025-12-04T15:22:22.8566127Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_softmax_with_dtype_cuda_float32 PASSED [0.0148s] [ 92%] 2025-12-04T15:22:22.8566247Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_airy_ai_cuda_float32 PASSED [0.0049s] [ 92%] 2025-12-04T15:22:22.8566369Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_j0_cuda_float32 PASSED [0.0047s] [ 92%] 2025-12-04T15:22:22.8566535Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_t_cuda_float32 PASSED [0.0109s] [ 92%] 2025-12-04T15:22:22.8566678Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_v_cuda_float32 PASSED [0.0108s] [ 92%] 2025-12-04T15:22:22.8566805Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_erfcx_cuda_float32 PASSED [0.9830s] [ 92%] 2025-12-04T15:22:22.8566913Z 
test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sum_cuda_float32 PASSED [0.0191s] [ 92%] 2025-12-04T15:22:22.8567019Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tile_cuda_float32 PASSED [0.0388s] [ 92%] 2025-12-04T15:22:22.8567170Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_torch_ops_aten__flash_attention_forward_cuda_float16 PASSED [0.0205s] [ 92%] 2025-12-04T15:22:22.8567288Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_transpose_copy_cuda_float32 PASSED [0.0078s] [ 92%] 2025-12-04T15:22:22.8567397Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trunc_cuda_float32 PASSED [0.9798s] [ 92%] 2025-12-04T15:22:22.8567512Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsafe_split_cuda_float32 PASSED [0.0069s] [ 92%] 2025-12-04T15:22:22.8567624Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_mean_cuda_float32 PASSED [0.9972s] [ 92%] 2025-12-04T15:22:22.8567745Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_mean_unbiased_cuda_float32 PASSED [0.0064s] [ 92%] 2025-12-04T15:22:22.8567861Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_unbiased_cuda_float32 PASSED [0.9922s] [ 92%] 2025-12-04T15:22:22.8567969Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vstack_cuda_float32 PASSED [0.0075s] [ 92%] 2025-12-04T15:22:22.8568091Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_bfloat16 PASSED [0.0130s] [ 92%] 2025-12-04T15:22:22.8568208Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_float32 PASSED [0.0118s] [ 92%] 2025-12-04T15:22:22.8568329Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_float64 PASSED [0.0114s] [ 92%] 2025-12-04T15:22:22.8568445Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int32 PASSED [0.0112s] [ 92%] 2025-12-04T15:22:22.8568561Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_uint8 PASSED [0.0076s] [ 92%] 2025-12-04T15:22:22.8568722Z 
test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_complex64 PASSED [0.0393s] [ 92%] 2025-12-04T15:22:22.8568860Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int16 PASSED [0.0388s] [ 92%] 2025-12-04T15:22:22.8568986Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_complex128 PASSED [0.0588s] [ 92%] 2025-12-04T15:22:22.8569129Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_complex64 PASSED [0.2520s] [ 93%] 2025-12-04T15:22:22.8569271Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_float64 PASSED [0.2367s] [ 93%] 2025-12-04T15:22:22.8569408Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_int8 PASSED [0.0974s] [ 93%] 2025-12-04T15:22:22.8569523Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_int16 PASSED [0.9910s] [ 93%] 2025-12-04T15:22:22.8569639Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex64 PASSED [0.0033s] [ 93%] 2025-12-04T15:22:22.8569755Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_float32 PASSED [0.9960s] [ 93%] 2025-12-04T15:22:22.8569866Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_int32 PASSED [0.0034s] [ 93%] 2025-12-04T15:22:22.8569978Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_bfloat16 PASSED [0.0056s] [ 93%] 2025-12-04T15:22:22.8570167Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float64 PASSED [0.0054s] [ 93%] 2025-12-04T15:22:22.8570308Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_bfloat16 PASSED [0.0027s] [ 93%] 2025-12-04T15:22:22.8570413Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_bool PASSED [0.9993s] [ 93%] 2025-12-04T15:22:22.8570540Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_complex64 PASSED [0.0037s] [ 
93%] 2025-12-04T15:22:22.8570649Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_int64 PASSED [0.9960s] [ 93%] 2025-12-04T15:22:22.8570755Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_uint8 PASSED [0.0036s] [ 93%] 2025-12-04T15:22:22.8570865Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int32 PASSED [0.0098s] [ 93%] 2025-12-04T15:22:22.8570973Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int8 PASSED [0.0096s] [ 93%] 2025-12-04T15:22:22.8571107Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_float16 PASSED [0.0359s] [ 93%] 2025-12-04T15:22:22.8571220Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float64 PASSED [0.0505s] [ 93%] 2025-12-04T15:22:22.8571359Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_complex64 PASSED [0.2217s] [ 93%] 2025-12-04T15:22:22.8571463Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_bool PASSED [0.9927s] [ 93%] 2025-12-04T15:22:22.8571571Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float32 PASSED [0.0031s] [ 93%] 2025-12-04T15:22:22.8571677Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float64 PASSED [0.9826s] [ 93%] 2025-12-04T15:22:22.8571782Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int16 PASSED [0.0032s] [ 93%] 2025-12-04T15:22:22.8571892Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_complex64 PASSED [0.9874s] [ 93%] 2025-12-04T15:22:22.8572000Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float16 PASSED [0.0032s] [ 93%] 2025-12-04T15:22:22.8572108Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float64 PASSED [0.9909s] [ 93%] 2025-12-04T15:22:22.8572219Z test_ops.py::TestTagsCUDA::test_tags___rxor___cuda_int64 SKIPPED [0.0015s] (Only runs on cpu) [ 93%] 2025-12-04T15:22:22.8572359Z 
test_ops.py::TestTagsCUDA::test_tags__refs__conversions_double_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 93%] 2025-12-04T15:22:22.8572504Z test_ops.py::TestTagsCUDA::test_tags__refs__conversions_int_cuda_float32 SKIPPED [0.0012s] (Only runs on cpu) [ 93%] 2025-12-04T15:22:22.8572641Z test_ops.py::TestTagsCUDA::test_tags__refs__conversions_long_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T15:22:22.8572760Z test_ops.py::TestTagsCUDA::test_tags__refs_acosh_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T15:22:22.8572879Z test_ops.py::TestTagsCUDA::test_tags__refs_addcdiv_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8573000Z test_ops.py::TestTagsCUDA::test_tags__refs_alias_copy_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8573115Z test_ops.py::TestTagsCUDA::test_tags__refs_amax_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8573227Z test_ops.py::TestTagsCUDA::test_tags__refs_amin_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8573343Z test_ops.py::TestTagsCUDA::test_tags__refs_arange_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8573455Z test_ops.py::TestTagsCUDA::test_tags__refs_asin_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8573572Z test_ops.py::TestTagsCUDA::test_tags__refs_atan2_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8573692Z test_ops.py::TestTagsCUDA::test_tags__refs_atleast_3d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8573813Z test_ops.py::TestTagsCUDA::test_tags__refs_block_diag_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8573946Z test_ops.py::TestTagsCUDA::test_tags__refs_cat_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8574059Z 
test_ops.py::TestTagsCUDA::test_tags__refs_ceil_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8574198Z test_ops.py::TestTagsCUDA::test_tags__refs_column_stack_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8574313Z test_ops.py::TestTagsCUDA::test_tags__refs_cosh_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8574430Z test_ops.py::TestTagsCUDA::test_tags__refs_cumsum_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8574548Z test_ops.py::TestTagsCUDA::test_tags__refs_deg2rad_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8574666Z test_ops.py::TestTagsCUDA::test_tags__refs_digamma_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8574786Z test_ops.py::TestTagsCUDA::test_tags__refs_empty_like_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8574896Z test_ops.py::TestTagsCUDA::test_tags__refs_eq_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8575008Z test_ops.py::TestTagsCUDA::test_tags__refs_erf_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8575124Z test_ops.py::TestTagsCUDA::test_tags__refs_erfinv_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8575236Z test_ops.py::TestTagsCUDA::test_tags__refs_exp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8575356Z test_ops.py::TestTagsCUDA::test_tags__refs_expand_as_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8575472Z test_ops.py::TestTagsCUDA::test_tags__refs_expand_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8575589Z test_ops.py::TestTagsCUDA::test_tags__refs_expm1_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8575715Z test_ops.py::TestTagsCUDA::test_tags__refs_exponential_cuda_float32 SKIPPED [0.0009s] (Only 
runs on cpu) [ 94%] 2025-12-04T15:22:22.8575833Z test_ops.py::TestTagsCUDA::test_tags__refs_fft_hfft2_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8575953Z test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifft2_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8576093Z test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifftshift_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8576217Z test_ops.py::TestTagsCUDA::test_tags__refs_float_power_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8576330Z test_ops.py::TestTagsCUDA::test_tags__refs_fmod_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8576445Z test_ops.py::TestTagsCUDA::test_tags__refs_frexp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8576556Z test_ops.py::TestTagsCUDA::test_tags__refs_ge_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T15:22:22.8576666Z test_ops.py::TestTagsCUDA::test_tags__refs_gt_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8576784Z test_ops.py::TestTagsCUDA::test_tags__refs_hsplit_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8576904Z test_ops.py::TestTagsCUDA::test_tags__refs_index_add_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8577019Z test_ops.py::TestTagsCUDA::test_tags__refs_isreal_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8577131Z test_ops.py::TestTagsCUDA::test_tags__refs_item_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8577263Z test_ops.py::TestTagsCUDA::test_tags__refs_linalg_matrix_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8577406Z test_ops.py::TestTagsCUDA::test_tags__refs_linspace_tensor_overload_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8577557Z 
test_ops.py::TestTagsCUDA::test_tags__refs_log_normal_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8577695Z test_ops.py::TestTagsCUDA::test_tags__refs_log_softmax_with_dtype_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8577835Z test_ops.py::TestTagsCUDA::test_tags__refs_logaddexp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8577958Z test_ops.py::TestTagsCUDA::test_tags__refs_logical_xor_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8578099Z test_ops.py::TestTagsCUDA::test_tags__refs_logspace_tensor_overload_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8578212Z test_ops.py::TestTagsCUDA::test_tags__refs_mean_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8578334Z test_ops.py::TestTagsCUDA::test_tags__refs_narrow_copy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8578465Z test_ops.py::TestTagsCUDA::test_tags__refs_native_layer_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8578617Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_channel_shuffle_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8578760Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hardshrink_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8578912Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_margin_ranking_loss_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8579044Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_mish_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8579178Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_mse_loss_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8579329Z 
test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pairwise_distance_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8579479Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pixel_unshuffle_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8579613Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_relu6_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8579752Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_threshold_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8579879Z test_ops.py::TestTagsCUDA::test_tags__refs_normal_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8580004Z test_ops.py::TestTagsCUDA::test_tags__refs_permute_copy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8580158Z test_ops.py::TestTagsCUDA::test_tags__refs_renorm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8580273Z test_ops.py::TestTagsCUDA::test_tags__refs_repeat_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8580389Z test_ops.py::TestTagsCUDA::test_tags__refs_roll_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8580506Z test_ops.py::TestTagsCUDA::test_tags__refs_sigmoid_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8580639Z test_ops.py::TestTagsCUDA::test_tags__refs_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T15:22:22.8580772Z test_ops.py::TestTagsCUDA::test_tags__refs_special_bessel_j1_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8580895Z test_ops.py::TestTagsCUDA::test_tags__refs_special_entr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8581018Z test_ops.py::TestTagsCUDA::test_tags__refs_special_i1e_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 
2025-12-04T15:22:22.8581145Z test_ops.py::TestTagsCUDA::test_tags__refs_special_log_ndtr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8581301Z test_ops.py::TestTagsCUDA::test_tags__refs_special_ndtr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8581413Z test_ops.py::TestTagsCUDA::test_tags__refs_sub_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8581549Z test_ops.py::TestTagsCUDA::test_tags__refs_triu_indices_cuda_int64 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8581669Z test_ops.py::TestTagsCUDA::test_tags__refs_unflatten_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8581780Z test_ops.py::TestTagsCUDA::test_tags__refs_var_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8581897Z test_ops.py::TestTagsCUDA::test_tags__refs_var_mean_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8582009Z test_ops.py::TestTagsCUDA::test_tags__refs_vdot_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8582124Z test_ops.py::TestTagsCUDA::test_tags__refs_where_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8582274Z test_ops.py::TestTagsCUDA::test_tags__unsafe_masked_index_put_accumulate_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8582408Z test_ops.py::TestTagsCUDA::test_tags__upsample_bilinear2d_aa_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8582521Z test_ops.py::TestTagsCUDA::test_tags_argwhere_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8582630Z test_ops.py::TestTagsCUDA::test_tags_atan2_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8582737Z test_ops.py::TestTagsCUDA::test_tags_atan_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8582852Z 
test_ops.py::TestTagsCUDA::test_tags_atleast_1d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8582965Z test_ops.py::TestTagsCUDA::test_tags_atleast_2d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8583090Z test_ops.py::TestTagsCUDA::test_tags_bitwise_left_shift_cuda_int64 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8583204Z test_ops.py::TestTagsCUDA::test_tags_bitwise_not_cuda_int64 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8583325Z test_ops.py::TestTagsCUDA::test_tags_cartesian_prod_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8583461Z test_ops.py::TestTagsCUDA::test_tags_cholesky_inverse_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8583582Z test_ops.py::TestTagsCUDA::test_tags_count_nonzero_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8583690Z test_ops.py::TestTagsCUDA::test_tags_cross_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8583803Z test_ops.py::TestTagsCUDA::test_tags_diagflat_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8583927Z test_ops.py::TestTagsCUDA::test_tags_diagonal_scatter_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8584053Z test_ops.py::TestTagsCUDA::test_tags_div_no_rounding_mode_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8584180Z test_ops.py::TestTagsCUDA::test_tags_div_trunc_rounding_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8584291Z test_ops.py::TestTagsCUDA::test_tags_dsplit_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8584401Z test_ops.py::TestTagsCUDA::test_tags_dstack_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 96%] 2025-12-04T15:22:22.8584509Z test_ops.py::TestTagsCUDA::test_tags_empty_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 
96%] 2025-12-04T15:22:22.8584617Z test_ops.py::TestTagsCUDA::test_tags_exp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8584732Z test_ops.py::TestTagsCUDA::test_tags_expand_as_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8584875Z test_ops.py::TestTagsCUDA::test_tags_expand_copy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8584984Z test_ops.py::TestTagsCUDA::test_tags_expm1_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8585110Z test_ops.py::TestTagsCUDA::test_tags_fft_irfft2_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8585223Z test_ops.py::TestTagsCUDA::test_tags_flatten_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8585335Z test_ops.py::TestTagsCUDA::test_tags_fliplr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8585453Z test_ops.py::TestTagsCUDA::test_tags_floor_divide_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8585562Z test_ops.py::TestTagsCUDA::test_tags_geqrf_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8585676Z test_ops.py::TestTagsCUDA::test_tags_hash_tensor_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8585786Z test_ops.py::TestTagsCUDA::test_tags_isnan_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8585898Z test_ops.py::TestTagsCUDA::test_tags_isneginf_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8586024Z test_ops.py::TestTagsCUDA::test_tags_jiterator_binary_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8586143Z test_ops.py::TestTagsCUDA::test_tags_linalg_cross_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8586265Z test_ops.py::TestTagsCUDA::test_tags_linalg_ldl_factor_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 
97%] 2025-12-04T15:22:22.8586389Z test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8586513Z test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_power_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8586636Z test_ops.py::TestTagsCUDA::test_tags_linalg_multi_dot_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8586768Z test_ops.py::TestTagsCUDA::test_tags_linalg_solve_triangular_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8586894Z test_ops.py::TestTagsCUDA::test_tags_linalg_tensorinv_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8587025Z test_ops.py::TestTagsCUDA::test_tags_log_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8587156Z test_ops.py::TestTagsCUDA::test_tags_log_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8587272Z test_ops.py::TestTagsCUDA::test_tags_logical_not_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8587386Z test_ops.py::TestTagsCUDA::test_tags_logical_or_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8587497Z test_ops.py::TestTagsCUDA::test_tags_lu_unpack_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8587604Z test_ops.py::TestTagsCUDA::test_tags_mH_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8587729Z test_ops.py::TestTagsCUDA::test_tags_masked_logsumexp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8587848Z test_ops.py::TestTagsCUDA::test_tags_masked_median_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8587969Z test_ops.py::TestTagsCUDA::test_tags_masked_scatter_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8588088Z 
test_ops.py::TestTagsCUDA::test_tags_masked_softmax_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8588201Z test_ops.py::TestTagsCUDA::test_tags_masked_var_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 97%] 2025-12-04T15:22:22.8588309Z test_ops.py::TestTagsCUDA::test_tags_matmul_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8588447Z test_ops.py::TestTagsCUDA::test_tags_max_binary_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8588591Z test_ops.py::TestTagsCUDA::test_tags_max_pool2d_with_indices_backward_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8588710Z test_ops.py::TestTagsCUDA::test_tags_mean_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8588821Z test_ops.py::TestTagsCUDA::test_tags_median_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8588930Z test_ops.py::TestTagsCUDA::test_tags_mode_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8589043Z test_ops.py::TestTagsCUDA::test_tags_movedim_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8589149Z test_ops.py::TestTagsCUDA::test_tags_mul_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8589260Z test_ops.py::TestTagsCUDA::test_tags_nan_to_num_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8589395Z test_ops.py::TestTagsCUDA::test_tags_native_dropout_backward_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8589521Z test_ops.py::TestTagsCUDA::test_tags_new_empty_strided_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8589668Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool1d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8589814Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool2d_cuda_float32 
SKIPPED [0.0011s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8589952Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_cross_entropy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8590085Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_dropout3d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8590273Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_embedding_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8590436Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_feature_alpha_dropout_with_train_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8590587Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_trilinear_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8590715Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_kl_div_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8590864Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_layer_norm_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8591000Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool3d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8591143Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool3d_grad_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8591270Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_prelu_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8591395Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_relu_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8591522Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_rrelu_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8591665Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_softmin_with_dtype_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 
2025-12-04T15:22:22.8591798Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_softplus_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8591930Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_threshold_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8592040Z test_ops.py::TestTagsCUDA::test_tags_nonzero_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8592161Z test_ops.py::TestTagsCUDA::test_tags_normal_in_place_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8592313Z test_ops.py::TestTagsCUDA::test_tags_ones_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8592422Z test_ops.py::TestTagsCUDA::test_tags_ormqr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T15:22:22.8592567Z test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_4_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8592676Z test_ops.py::TestTagsCUDA::test_tags_prod_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8592783Z test_ops.py::TestTagsCUDA::test_tags_randn_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8592890Z test_ops.py::TestTagsCUDA::test_tags_real_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8592999Z test_ops.py::TestTagsCUDA::test_tags_repeat_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8593108Z test_ops.py::TestTagsCUDA::test_tags_round_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8593232Z test_ops.py::TestTagsCUDA::test_tags_round_decimals_0_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8593341Z test_ops.py::TestTagsCUDA::test_tags_rsqrt_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8593467Z test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_prod_cuda_float32 SKIPPED [0.0009s] 
(Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8593573Z test_ops.py::TestTagsCUDA::test_tags_sgn_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8593711Z test_ops.py::TestTagsCUDA::test_tags_signal_windows_exponential_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8593851Z test_ops.py::TestTagsCUDA::test_tags_signal_windows_general_cosine_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8593963Z test_ops.py::TestTagsCUDA::test_tags_signbit_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8594071Z test_ops.py::TestTagsCUDA::test_tags_slice_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8594196Z test_ops.py::TestTagsCUDA::test_tags_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8594321Z test_ops.py::TestTagsCUDA::test_tags_special_bessel_j0_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8594452Z test_ops.py::TestTagsCUDA::test_tags_special_entr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8594566Z test_ops.py::TestTagsCUDA::test_tags_special_i1_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8594683Z test_ops.py::TestTagsCUDA::test_tags_special_ndtr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8594839Z test_ops.py::TestTagsCUDA::test_tags_special_polygamma_special_polygamma_n_0_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8594984Z test_ops.py::TestTagsCUDA::test_tags_special_scaled_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8595131Z test_ops.py::TestTagsCUDA::test_tags_special_scaled_modified_bessel_k1_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8595285Z test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_t_cuda_float32 
SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8595438Z test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8595575Z test_ops.py::TestTagsCUDA::test_tags_special_spherical_bessel_j0_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8595699Z test_ops.py::TestTagsCUDA::test_tags_std_mean_unbiased_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8595806Z test_ops.py::TestTagsCUDA::test_tags_stft_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8595933Z test_ops.py::TestTagsCUDA::test_tags_t_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8596055Z test_ops.py::TestTagsCUDA::test_tags_unsqueeze_copy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8596171Z test_ops.py::TestTagsCUDA::test_tags_view_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8596279Z test_ops.py::TestTagsCUDA::test_tags_xlogy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T15:22:22.8596422Z test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_add_cuda_float32 PASSED [0.0018s] [100%] 2025-12-04T15:22:22.8596425Z 2025-12-04T15:22:22.8596594Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_ops/test_ops-c9beaf04e8863d0f.xml - 2025-12-04T15:22:22.8596684Z = 2766 passed, 357 skipped, 3537 deselected, 31 xfailed in 1126.65s (0:18:46) == 2025-12-04T15:22:22.8596734Z escriptor dimension check failure 2025-12-04T15:22:22.8596810Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8596881Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8596949Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597017Z GridwiseOp: Problemsize descriptor dimension check failure 
2025-12-04T15:22:22.8597084Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597151Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597217Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597284Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597350Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597417Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597483Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597551Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597618Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597686Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597753Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597821Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597887Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8597972Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T15:22:22.8598158Z The following tests failed and then succeeded when run in a new process['test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_uint8'] 2025-12-04T15:22:22.8598161Z 2025-12-04T15:22:22.8598276Z FINISHED PRINTING LOG FILE of test_ops 2/5 (test/test-reports/test_ops_2.5_5d2d9f84f109f206_.log) 2025-12-04T15:22:22.8598278Z 2025-12-04T15:22:22.8598365Z Finished test_ops 2/5 ... 
[2025-12-04 15:22:22.606774][2211205.067760023], took 49.31min 2025-12-04T15:22:22.8598597Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T15:22:22.8598686Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T15:22:22.8598779Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T15:22:22.8598830Z Uploading artifacts took 0.00 seconds 2025-12-04T15:22:22.8598944Z Running torch_np/numpy_tests/core/test_dtype 1/1 ... [2025-12-04 15:22:22.613398][2211205.074385912] 2025-12-04T15:22:22.8598993Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T15:22:22.8599313Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/numpy_tests/core/test_dtype.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 15:22:22.613619] 2025-12-04T15:22:24.8815277Z 2025-12-04T15:22:24.8816898Z torch_np/numpy_tests/core/test_dtype 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.numpy_tests.core.test_dtype_1.1_29851c89d609f0fe_.log 2025-12-04T15:22:24.8835039Z Running 102 items in this shard: test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_equivalent_dtype_hashing, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_invalid_types, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Bool, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Bytes0, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Complex128, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Complex32, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Complex64, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Datetime64, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Float128, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Float16, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Float32, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Float64, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Int16, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Int32, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Int64, 
test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Int8, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Object0, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Str0, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Timedelta64, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_UInt16, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_UInt32, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_UInt64, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_UInt8, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Uint32, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Uint64, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_numeric_style_types_are_invalid_dtype_Void0, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_richcompare_invalid_dtype_comparison_operation0, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_richcompare_invalid_dtype_comparison_operation1, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_richcompare_invalid_dtype_comparison_operation2, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_richcompare_invalid_dtype_comparison_operation3, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_richcompare_invalid_dtype_equality, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_run_t0, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_run_t1, test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_run_t2, 
test/torch_np/numpy_tests/core/test_dtype.py::TestBuiltin::test_run_t3, test/torch_np/numpy_tests/core/test_dtype.py::TestDtypeAttributeDeletion::test_dtype_non_writable_attributes_deletion, test/torch_np/numpy_tests/core/test_dtype.py::TestDtypeAttributeDeletion::test_dtype_writable_attributes_deletion, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_builtin_t0, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_builtin_t1, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_builtin_t2, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_builtin_t3, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_builtin_t4, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_DType11, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_bool__10, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_complex128_4, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_complex64_3, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_float16_0, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_float32_1, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_float64_2, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_int16_7, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_int32_8, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_int64_9, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_int8_6, test/torch_np/numpy_tests/core/test_dtype.py::TestPickling::test_pickle_types_uint8_5, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_other_value_based_complex64_complex64_None, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_other_value_based_float16_complex64_None, 
test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_other_value_based_float32_complex64_None, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_other_value_based_other_4294967295_expected1_expected_weak1, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_other_value_based_other_65535_expected0_expected_weak0, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_scalar_value_based_other0_expected0, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_scalar_value_based_other1_expected1, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_scalar_value_based_other2_expected2, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_scalar_value_based_other3_expected3, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_scalar_value_based_other4_expected4, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_scalar_value_based_other5_expected5, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_complex_scalar_value_based_other6_expected6, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_permutations_do_not_influence_result_dtypes0_expected0, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_permutations_do_not_influence_result_dtypes1_expected1, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_permutations_do_not_influence_result_dtypes2_expected2, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_permutations_do_not_influence_result_dtypes3_expected3, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_permutations_do_not_influence_result_dtypes4_expected4, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_permutations_do_not_influence_result_dtypes5_expected5, 
test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_permutations_do_not_influence_result_dtypes6_expected6, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_permutations_do_not_influence_result_dtypes7_expected7, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_permutations_do_not_influence_result_dtypes8_expected8, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_permutations_do_not_influence_result_dtypes9_expected9, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_python_integer_promotion_val_18446744073709551616, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_python_integer_promotion_val_2, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_python_integer_promotion_val_200, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_python_integer_promotion_val_4294967296, test/torch_np/numpy_tests/core/test_dtype.py::TestPromotion::test_python_integer_promotion_val_9223372036854775808, test/torch_np/numpy_tests/core/test_dtype.py::TestMisc::test_dtypes_are_true, test/torch_np/numpy_tests/core/test_dtype.py::TestMisc::test_keyword_argument, test/torch_np/numpy_tests/core/test_dtype.py::TestFromDTypeAttribute::test_recursion, test/torch_np/numpy_tests/core/test_dtype.py::TestFromDTypeAttribute::test_simple, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_?, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_B, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_D, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_F, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_b, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_d, 
test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_e, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_f, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_h, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_i, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_dtype_subclass_code_l, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_subscript_scalar, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_subscript_tuple_arg_len_0, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_subscript_tuple_arg_len_1, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_subscript_tuple_arg_len_2, test/torch_np/numpy_tests/core/test_dtype.py::TestClassGetItem::test_subscript_tuple_arg_len_3 2025-12-04T15:22:24.8848601Z 2025-12-04T15:22:24.8848755Z Finished torch_np/numpy_tests/core/test_dtype 1/1 ... [2025-12-04 15:22:24.881361][2211207.342346704], took 0.04min 2025-12-04T15:22:24.8849170Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T15:22:24.8878051Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T15:22:24.8881173Z Running lazy/test_debug_util 1/1 ... [2025-12-04 15:22:24.887844][2211207.348833316] 2025-12-04T15:22:24.8881742Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T15:22:24.8882686Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'lazy/test_debug_util.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2025-12-04 15:22:24.888068]
2025-12-04T15:22:27.0562536Z
2025-12-04T15:22:27.0563512Z lazy/test_debug_util 1/1 was successful, full logs can be found in artifacts with path test/test-reports/lazy.test_debug_util_1.1_fe8c7ed33eba50cb_.log
2025-12-04T15:22:27.0564437Z Running 1 items in this shard: test/lazy/test_debug_util.py::DebugUtilTest::test_get_python_frames
2025-12-04T15:22:27.0564793Z
2025-12-04T15:22:27.0565046Z Finished lazy/test_debug_util 1/1 ... [2025-12-04 15:22:27.055961][2211209.516944703], took 0.04min
2025-12-04T15:22:27.0585638Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T15:22:27.0632376Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T15:22:27.0634103Z Running nn/test_load_state_dict 1/1 ... [2025-12-04 15:22:27.063276][2211209.524265152]
2025-12-04T15:22:27.0634553Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T15:22:27.0636267Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'nn/test_load_state_dict.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ...
[2025-12-04 15:22:27.063488] 2025-12-04T15:22:30.4338670Z 2025-12-04T15:22:30.4339349Z nn/test_load_state_dict 1/1 was successful, full logs can be found in artifacts with path test/test-reports/nn.test_load_state_dict_1.1_804b043022957592_.log 2025-12-04T15:22:30.4343695Z Running 29 items in this shard: test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_BC_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_BC_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_assign_meta_swap_False_keep_vars_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_assign_meta_swap_False_keep_vars_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_assign_meta_swap_True_keep_vars_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_assign_meta_swap_True_keep_vars_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_assign_shape_stride_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_assign_shape_stride_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_assign_with_optimizer_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_assign_with_optimizer_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_child_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_child_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_custom_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_custom_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_invalid_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_invalid_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_ref_cycle_swap_False, 
test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_type_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_type_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_warn_assign_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_warn_assign_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_with_unexpected_key_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_load_state_dict_with_unexpected_key_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDict::test_scalar_param_1d_tensor_raises_swap_False, test/nn/test_load_state_dict.py::TestLoadStateDict::test_scalar_param_1d_tensor_raises_swap_True, test/nn/test_load_state_dict.py::TestLoadStateDictSwap::test_swap_subclass_swap_True_assign_False, test/nn/test_load_state_dict.py::TestLoadStateDictSwap::test_swap_subclass_swap_True_assign_True 2025-12-04T15:22:30.4347332Z 2025-12-04T15:22:30.4347448Z Finished nn/test_load_state_dict 1/1 ... [2025-12-04 15:22:30.433618][2211212.894602717], took 0.06min 2025-12-04T15:22:30.4354448Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T15:22:30.4401217Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T15:22:30.4402998Z Running test_shape_ops 1/1 ... 
[2025-12-04 15:22:30.440196][2211212.901185247] 2025-12-04T15:22:30.4403231Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T15:22:30.4405025Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_shape_ops.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:22:30.440411] 2025-12-04T15:24:41.2678008Z 2025-12-04T15:24:41.2678773Z test_shape_ops 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_shape_ops_1.1_cea01e4d71d36e7f_.log 2025-12-04T15:24:41.2688387Z Running 99 items in this shard: test/test_shape_ops.py::TestShapeOpsCUDA::test_clamp_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_clamp_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_clamp_propagates_nans_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_clamp_raises_arg_errors_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_complex_rot90_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_complex_rot90_cuda_complex64, test/test_shape_ops.py::TestShapeOpsCUDA::test_diag_cuda_bool, test/test_shape_ops.py::TestShapeOpsCUDA::test_diag_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_diagonal_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_diagonal_multidim_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_bfloat16, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_bool, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_complex64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_float16, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_float64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_int16, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_int32, 
test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_int8, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_cuda_uint8, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_bfloat16, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_bool, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_complex64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_float16, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_float64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_int16, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_int32, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_int8, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_errors_cuda_uint8, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_large_tensor_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_bfloat16, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_bool, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_complex64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_float16, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_float64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_int16, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_int32, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_int8, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_numpy_cuda_uint8, 
test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_unsupported_dtype_cuda_quint2x4, test/test_shape_ops.py::TestShapeOpsCUDA::test_flip_unsupported_dtype_cuda_quint4x2, test/test_shape_ops.py::TestShapeOpsCUDA::test_fliplr_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_fliplr_cuda_float64, test/test_shape_ops.py::TestShapeOpsCUDA::test_fliplr_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_fliplr_invalid_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_fliplr_invalid_cuda_float64, test/test_shape_ops.py::TestShapeOpsCUDA::test_fliplr_invalid_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flipud_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_flipud_cuda_float64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flipud_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flipud_invalid_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_flipud_invalid_cuda_float64, test/test_shape_ops.py::TestShapeOpsCUDA::test_flipud_invalid_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_movedim_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_movedim_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_movedim_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_movedim_invalid_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_movedim_invalid_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_movedim_invalid_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_astuple_out_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_cuda_bfloat16, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_cuda_bool, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_cuda_float16, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_cuda_float64, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_cuda_int16, 
test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_cuda_int32, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_cuda_int8, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_cuda_uint8, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_discontiguous_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_no_warning_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_nonzero_non_diff_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_rot90_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_sparse_dense_dim_cuda_complex128, test/test_shape_ops.py::TestShapeOpsCUDA::test_sparse_dense_dim_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_sparse_dense_dim_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_tolist_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_trace_cuda_float16, test/test_shape_ops.py::TestShapeOpsCUDA::test_trace_cuda_float32, test/test_shape_ops.py::TestShapeOpsCUDA::test_trace_cuda_float64, test/test_shape_ops.py::TestShapeOpsCUDA::test_trace_cuda_int16, test/test_shape_ops.py::TestShapeOpsCUDA::test_trace_cuda_int32, test/test_shape_ops.py::TestShapeOpsCUDA::test_trace_cuda_int64, test/test_shape_ops.py::TestShapeOpsCUDA::test_trace_cuda_int8, test/test_shape_ops.py::TestShapeOpsCUDA::test_trace_cuda_uint8, test/test_shape_ops.py::TestShapeOpsCUDA::test_unbind_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_unfold_all_devices_and_dtypes_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_unfold_backward_errors_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_unfold_errors_cuda, test/test_shape_ops.py::TestShapeOpsCUDA::test_unfold_scalars_cuda 2025-12-04T15:24:41.2697330Z 2025-12-04T15:24:41.2697435Z Finished test_shape_ops 1/1 ... 
[2025-12-04 15:24:41.267658][2211343.728643443], took 2.18min 2025-12-04T15:24:41.2697818Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T15:24:41.2747899Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T15:24:41.2749007Z Running functorch/test_ops 1/4 ... [2025-12-04 15:24:41.274791][2211343.735779585] 2025-12-04T15:24:41.2749254Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T15:24:41.2750774Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'functorch/test_ops.py', '--shard-id=1', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:24:41.274980] 2025-12-04T15:39:41.5602465Z 2025-12-04T15:39:41.5603406Z functorch/test_ops 1/4 was successful, full logs can be found in artifacts with path test/test-reports/functorch.test_ops_1.4_de82ddc936cc71fe_.log 2025-12-04T15:39:41.5938050Z Running 2549 items in this shard: test/functorch/test_ops.py::TestOperatorsCUDA::test_data_write_errors_under_transform_cuda, test/functorch/test_ops.py::TestOperatorsCUDA::test_extremal_numerics_cross_entropy_cuda, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad___rdiv___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad___rmod___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad__unsafe_masked_index_put_accumulate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_addcdiv_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_atleast_1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_atleast_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_baddbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_bernoulli_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_bfloat16_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_broadcast_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_ceil_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_char_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_corrcoef_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_diag_embed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_div_no_rounding_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_einsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_empty_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_eq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_equal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_erfinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_exp2_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_hfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_hfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_irfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_rfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_float_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_floor_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_full_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_gradient_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_grid_sampler_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_heaviside_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_index_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_isnan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_jiterator_4inputs_with_extra_args_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_kron_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_ldexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_lgamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_cholesky_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_cond_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_det_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_householder_product_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_inv_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_ldl_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_ldl_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_multi_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_norm_subgradients_at_zero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_slogdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_log_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_long_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_matmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_max_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_maximum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_meshgrid_variadic_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_min_reduction_with_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_msort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_native_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv2d_stride_no_bias_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv2d_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv_transpose2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_cosine_similarity_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_dropout3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_elu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_embedding_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_gaussian_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_hardtanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_kl_div_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_max_unpool1d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_max_unpool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_pad_replicate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_pixel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_selu_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_softsign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_norm_nuc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_positive_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_pow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_randint_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_randn_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_reciprocal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_repeat_interleave_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_roll_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_scatter_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_scatter_reduce_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_signal_windows_bartlett_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_laguerre_polynomial_l_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_ndtri_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_polygamma_special_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_spherical_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_square_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_std_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_tan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_topk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_transpose_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_triu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_trunc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_unsqueeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_var_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_view_as_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_view_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_vstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_zeros_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp___getitem___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp___rmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp___rsub___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp__native_batch_norm_legit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp__segment_reduce_lengths_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_addcmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_addmv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_allclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_angle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_arange_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_as_strided_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_as_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_as_strided_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_atleast_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_bmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_broadcast_shapes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_bucketize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cartesian_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_chalf_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_clamp_max_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_clone_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_conj_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_count_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_deg2rad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_diag_embed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_diagflat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_diagonal_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_div_no_rounding_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_double_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_double_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_eq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_erf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_erfinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_expand_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_fft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_fft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_hfft2_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_ifft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_irfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_float_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_grid_sampler_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_hypot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_igamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_index_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_index_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_le_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_cond_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_lstsq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_matrix_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_multi_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_pinv_hermitian_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_pinv_singular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_slogdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_vander_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_log10_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_logical_xor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_logspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_lt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_max_pool2d_with_indices_backward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_max_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_max_reduction_with_dim_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_minimum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_movedim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_msort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_multinomial_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nan_to_num_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_narrow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_ctc_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_dropout2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_fractional_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_gaussian_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_instance_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_kl_div_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_logsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_max_unpool1d_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_max_unpool1d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_max_unpool2d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_mse_loss_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_multi_head_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_pad_circular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_pad_replicate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_pixel_unshuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_poisson_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_smooth_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_softmin_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_unfold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nonzero_static_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ones_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_pca_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_polygamma_polygamma_n_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_positive_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_randint_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_randn_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ravel_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_reciprocal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_remainder_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_renorm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_resolve_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_signal_windows_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_signal_windows_general_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_slice_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_airy_ai_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_bessel_y0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_i1e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_shifted_chebyshev_polynomial_u_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_spherical_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_zeta_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_sub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_sum_to_size_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_take_along_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_tanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_topk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_transpose_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_transpose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_triu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_view_as_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_vsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpjvpvmap_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpjvpvmap_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpjvpvmap_NumpyMulAutogradFunction_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpjvpvmap_ScaleGradGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpjvpvmap_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp___rmatmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp___rmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp___rpow___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp__chunk_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp__segment_reduce_offsets_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp__softmax_backward_data_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_allclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_angle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_argwhere_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_as_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_as_strided_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_asinh_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_atleast_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_block_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_bool_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_byte_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cdouble_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cfloat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_chalf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cumulative_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_digamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_div_floor_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_div_no_rounding_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_empty_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_eq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_equal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_expand_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_eye_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_fft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_ifft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_ihfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_ihfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_irfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_irfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fliplr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_floor_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_frac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_geometric_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_gradient_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_half_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_hsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_index_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_index_reduce_amax_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_index_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_int_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_isneginf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_lgamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_cholesky_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_eigvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_inv_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_ldl_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_matrix_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_norm_subgradients_at_zero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_slogdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_solve_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_solve_triangular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_vector_norm_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_logcumsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_logical_or_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_meshgrid_list_of_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_min_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_movedim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_mvlgamma_mvlgamma_p_5_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nan_to_num_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nanmean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv2d_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv2d_stride_no_bias_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv2d_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_grid_sample_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_group_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_instance_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_interpolate_area_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_local_response_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_multi_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_pad_replicate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_rms_norm_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_silu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_softsign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_threshold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_upsample_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_norm_nuc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_normal_in_place_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_ops_aten__new_zeros_with_same_feature_meta_functorchonly_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_polygamma_polygamma_n_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_randn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_reshape_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_resolve_conj_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_resolve_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_rot90_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_rsub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_scatter_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_scatter_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_searchsorted_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_select_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_signal_windows_blackman_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_sparse_mm_reduce_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_bessel_j1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_bessel_y0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_chebyshev_polynomial_v_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_erfcx_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_i0e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_log_ndtr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_ndtri_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_shifted_chebyshev_polynomial_v_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_split_with_sizes_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_svd_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_take_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_torch_ops_aten__safe_softmax_default_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_unsafe_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_unsqueeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_var_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_view_as_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_vsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmap_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmap_NumpyExpMarkDirtyAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmap_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmap_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmapvmap_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmapvmap_NumpyExpMarkDirtyAutogradFunction_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmapvmap_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmapvmap_ScaleGradGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_bool_raises_topk_cuda_bool, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_argmax_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_ceil_cuda_complex32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_gt_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_maximum_cuda_complex128, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_sort_cuda_complex32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_sort_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_contiguous_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_expand_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_flatten_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_flatten_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_list_return_dsplit_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_list_return_split_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_list_return_unbind_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_mH_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_mT_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_movedim_grad_op_jvp_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_permute_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_positive_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_resolve_neg_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_special_grad_op_jvp_cuda, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_special_grad_op_vjp_cuda, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_squeeze_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_squeeze_multiple_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_unfold_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_unsqueeze_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_view_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp___getitem___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp__batch_norm_with_update_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp__chunk_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp__segment_reduce_lengths_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_addbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_addmm_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_addmm_decomposed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_addr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_argwhere_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_as_strided_partial_views_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_bfloat16_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_bfloat16_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_bmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_broadcast_shapes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_broadcast_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_bucketize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cauchy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_ceil_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cfloat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_char_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_clamp_max_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_clone_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_column_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cov_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cummin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cumulative_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_diag_embed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_diff_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_irfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_irfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_flip_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_floor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_geometric_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_index_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_inner_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_isclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_isin_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_isposinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_le_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_cholesky_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_eigvalsh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_ldl_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_lstsq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_matrix_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_pinv_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_pinv_singular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_slogdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_solve_triangular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_logaddexp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_logcumsumexp_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_logical_xor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_lu_unpack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_mH_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_matmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_matrix_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_max_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_meshgrid_variadic_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_movedim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_mv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nanmean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nanmedian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_conv2d_stride_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_conv2d_stride_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_conv2d_with_bias_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_conv_transpose1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_cosine_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_elu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_embedding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_fractional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_instance_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_interpolate_area_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_leaky_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_logsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_max_unpool1d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_max_unpool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_max_unpool3d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_pixel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_pixel_unshuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_softmin_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_tanhshrink_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_threshold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_triplet_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_unfold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_ormqr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_permute_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_pinverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_polar_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_polygamma_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_positive_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_randint_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_real_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_resolve_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_rot90_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_round_decimals_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_signal_windows_bartlett_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_signal_windows_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_signal_windows_gaussian_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_signal_windows_general_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_signal_windows_general_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_signal_windows_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sparse_mm_reduce_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_bessel_j1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_entr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_i1e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_log_ndtr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_modified_bessel_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_scaled_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_spherical_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_xlog1py_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_split_list_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_split_with_sizes_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_split_with_sizes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_squeeze_copy_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_std_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_svd_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_take_along_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_torch_ops_aten__efficient_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_unflatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_unsafe_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_unsqueeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_vsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_vstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_zeros_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp__native_batch_norm_legit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_acosh_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_allclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_arange_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_as_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_as_strided_partial_views_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_asinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_atleast_1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_bfloat16_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_bool_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_cdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_clamp_max_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_column_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_count_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_diagflat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_digamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_dot_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_erfinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_fftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_ihfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_ge_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_grid_sampler_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_half_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_igammac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_index_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_isposinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_isreal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_jiterator_2inputs_2outputs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_lgamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_eig_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_inv_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_ldl_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_matrix_rank_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_multi_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_solve_triangular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_vecdot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_logical_and_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_max_reduction_with_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_meshgrid_variadic_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_min_reduction_no_dim_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nanmedian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_narrow_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_ne_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_new_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_new_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_alpha_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_batch_norm_without_cudnn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_channel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_stride_padding_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_stride_padding_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_ctc_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_embedding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_feature_alpha_dropout_with_train_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_fractional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_group_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_interpolate_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_kl_div_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_local_response_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_logsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_margin_ranking_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_max_unpool1d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_mish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_rms_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_scaled_dot_product_attention_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_selu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_upsample_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_upsample_nearest_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_normal_in_place_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_polygamma_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_polygamma_polygamma_n_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_polygamma_polygamma_n_4_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_quantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_randn_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_real_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_reciprocal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_scatter_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_sgn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_signal_windows_gaussian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_signal_windows_general_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_signal_windows_general_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_signal_windows_hamming_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_slice_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_modified_bessel_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_scaled_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_shifted_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_split_list_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_split_with_sizes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_std_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_svd_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_tanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_topk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_tril_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_unfold_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_var_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_view_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_ScaleGradGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_T_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap___getitem___functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap___rmod___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap__batch_norm_with_update_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap__segment_reduce_offsets_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap__unsafe_masked_index_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_addcmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_addmm_decomposed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_alias_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_angle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_arange_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_as_strided_partial_views_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_as_strided_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_asinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_bfloat16_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_bool_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_broadcast_shapes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_broadcast_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_byte_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_cdouble_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_clone_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_copysign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_cos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_cosh_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_double_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_equal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_expand_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_fftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_ifft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_ifftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_ihfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_ihfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_irfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_rfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_float_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_floor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_gather_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_gt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_hsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_int_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_isnan_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_isposinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_jiterator_4inputs_with_extra_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_jiterator_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_jiterator_binary_return_by_ref_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_le_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_lerp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_cholesky_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_cond_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_eig_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_ldl_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_lstsq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_matrix_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_norm_subgradients_at_zero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_solve_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_vander_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linspace_tensor_overload_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_log2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_log_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_logcumsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_logical_or_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_lu_unpack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_mH_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_max_pool2d_with_indices_backward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_min_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_min_reduction_with_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nansum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_narrow_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_native_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_native_dropout_backward_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_native_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_new_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_alpha_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_binary_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv2d_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv_transpose3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_cosine_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_interpolate_bicubic_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_interpolate_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_interpolate_trilinear_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_leaky_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_max_unpool2d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_max_unpool3d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_mse_loss_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_pad_reflect_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_pdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_pixel_unshuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_scaled_dot_product_attention_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_selu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_smooth_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_softplus_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_pca_lowrank_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_permute_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_polygamma_polygamma_n_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_pow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_randint_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_randint_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_randn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_renorm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_scalar_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_searchsorted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_signal_windows_bartlett_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_signal_windows_general_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_signal_windows_general_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_signal_windows_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_signal_windows_hann_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_slice_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_sort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_airy_ai_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_bessel_j1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_hermite_polynomial_h_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_legendre_polynomial_p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_ndtri_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_shifted_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_shifted_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_xlog1py_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_squeeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_svd_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_take_along_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_tan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_trunc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_unflatten_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_unique_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_unsafe_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_vstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmapvmap_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_NumpyMulAutogradFunction_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_NumpySortAutogradFunction_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___getitem___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___radd___cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___rmatmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___rmatmul___cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___rmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___rsub___cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__batch_norm_with_update_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__chunk_cat_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__segment_reduce_lengths_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__softmax_backward_data_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_acos_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_addbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_addcmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_addmm_decomposed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_alias_copy_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_all_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_allclose_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_any_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_arange_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_argwhere_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_argwhere_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_as_strided_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_as_strided_partial_views_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_atan_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_atanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_bernoulli_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_bfloat16_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_bfloat16_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_bmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_bucketize_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_byte_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_byte_functorch_no_channels_last_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cat_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cdist_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cdouble_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cfloat_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_column_stack_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_combinations_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_contiguous_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_corrcoef_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_corrcoef_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cov_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cummin_cuda_float64, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cumsum_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cumulative_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_deg2rad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_diag_embed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_diagonal_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_diagonal_copy_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_diff_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_dist_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_div_floor_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_div_no_rounding_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_einsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_empty_like_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_erfinv_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_expand_as_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_expand_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_expm1_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_fft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_fft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_fftshift_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_hfft2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_ihfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_irfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_irfft2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_irfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_rfft2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fill_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_flip_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fliplr_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_flipud_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_float_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_floor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_frac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_frexp_cuda_float64, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ge_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_grid_sampler_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_hash_tensor_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_hypot_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_igamma_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_fill_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_reduce_amax_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_reduce_amin_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_inner_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_int_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_isfinite_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_jiterator_4inputs_with_extra_args_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_jiterator_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_kron_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_cholesky_ex_cuda_float64, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_cond_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_cross_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_eigh_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_eigvals_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_eigvalsh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_inv_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_inv_ex_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_ldl_factor_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_lstsq_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_lu_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_lu_factor_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_lu_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_lu_factor_ex_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_norm_subgradients_at_zero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_norm_subgradients_at_zero_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_pinv_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_pinv_singular_cuda_float64, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_qr_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_slogdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_solve_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_solve_triangular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_svdvals_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_vector_norm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linspace_tensor_overload_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_log1p_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_log_normal_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_logaddexp_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_logcumsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_logit_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_logspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_logsumexp_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_long_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_lu_unpack_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_lu_unpack_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_amin_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_argmin_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_log_softmax_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_prod_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_scatter_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_softmax_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_softmin_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_matrix_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_max_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_max_reduction_no_dim_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_min_reduction_with_dim_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_movedim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mv_cuda_float64, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mvlgamma_mvlgamma_p_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mvlgamma_mvlgamma_p_5_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mvlgamma_mvlgamma_p_5_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nanmean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nanquantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_narrow_copy_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_native_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_neg_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_new_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_new_empty_strided_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_new_ones_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_adaptive_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_adaptive_max_pool3d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_binary_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_binary_cross_entropy_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_binary_cross_entropy_with_logits_cuda_float64, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_celu_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_channel_shuffle_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv1d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_no_bias_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_stride_groups_with_bias_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_with_bias_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv_transpose3d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_embedding_functorch_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_gaussian_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_gaussian_nll_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_group_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_hardsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_hardswish_cuda_float64, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_hinge_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_huber_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_huber_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_instance_norm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_interpolate_area_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_interpolate_area_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_interpolate_bicubic_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_interpolate_bilinear_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_interpolate_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_interpolate_trilinear_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_kl_div_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_linear_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_local_response_norm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_margin_ranking_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_max_pool1d_cuda_float64, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_max_unpool1d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_max_unpool1d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_max_unpool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_max_unpool3d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_mish_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_multi_head_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_multi_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pad_constant_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pad_reflect_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pad_reflect_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pdist_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_poisson_nll_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_relu6_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_relu_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_rrelu_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_silu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_smooth_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_softmin_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_softshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_softshrink_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_tanhshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_tanhshrink_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_threshold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_upsample_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_upsample_bilinear_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nonzero_static_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_normal_number_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ones_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_polygamma_polygamma_n_3_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_polygamma_polygamma_n_4_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_qr_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_randint_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_randint_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_randint_like_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ravel_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_renorm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_repeat_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_reshape_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_resize__cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_resolve_conj_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_resolve_conj_cuda_float64, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_round_decimals_3_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_round_decimals_neg_3_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_scatter_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_searchsorted_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_select_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_select_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sigmoid_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_bartlett_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_bartlett_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_blackman_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_exponential_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_general_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_kaiser_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signbit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sinc_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_slice_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sort_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_chebyshev_polynomial_t_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_chebyshev_polynomial_u_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_chebyshev_polynomial_w_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_entr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_erfcx_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_hermite_polynomial_h_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_hermite_polynomial_he_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_hermite_polynomial_he_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_i0e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_i1e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_i1e_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_log_ndtr_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_modified_bessel_i0_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_ndtr_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_scaled_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_shifted_chebyshev_polynomial_t_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_xlog1py_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_split_with_sizes_copy_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_squeeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_std_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_std_unbiased_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sub_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sum_to_size_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_svd_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_svd_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_svd_lowrank_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_t_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_take_along_dim_cuda_float64, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_take_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_tensor_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_to_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_torch_ops_aten__safe_softmax_default_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_transpose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_trapz_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_triu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_true_divide_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unfold_copy_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unsafe_chunk_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unsqueeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unsqueeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unsqueeze_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_view_as_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_zero__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall___radd___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall__batch_norm_with_update_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall__chunk_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall__native_batch_norm_legit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall__segment_reduce_offsets_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_addbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_argwhere_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_as_strided_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_atan2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_baddbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_bfloat16_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_bool_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_bucketize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_byte_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_cdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_cholesky_inverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_clamp_max_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_conj_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_count_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_deg2rad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_dsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_eq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_erfinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_expm1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_eye_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_fft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_ifft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_ihfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_irfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_irfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_rfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_rfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fliplr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_floor_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_frexp_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_ge_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_grid_sampler_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_gt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_half_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule___rdiv___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule___rpow___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule__batch_norm_with_update_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule__segment_reduce_offsets_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_addbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_addmm_decomposed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_angle_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_any_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_argwhere_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_as_strided_partial_views_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_as_strided_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_asinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_atan2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_atanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_atleast_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_bfloat16_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_broadcast_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_chalf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cholesky_inverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_clone_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_digamma_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_double_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_erf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_erfc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_exp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_fftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_fftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_irfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_rfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_float_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_ge_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_grid_sampler_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_gt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_half_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_heaviside_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_histc_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_index_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_index_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_index_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_int_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_isnan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_isreal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_jiterator_unary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_kron_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_eigh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_ldl_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_pinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_solve_ex_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_vecdot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_vector_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_logical_and_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_logical_or_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_long_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_long_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_lu_unpack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_matrix_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_min_reduction_with_dim_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_mm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_movedim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_msort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nanmedian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nanquantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_native_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_native_dropout_backward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_ne_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_batch_norm_without_cudnn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv2d_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv2d_stride_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv2d_stride_padding_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv_transpose1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_embedding_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_interpolate_area_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_leaky_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_mse_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_softplus_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_softshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_threshold_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_triplet_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_normal_in_place_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_ops_aten__new_zeros_with_same_feature_meta_functorchonly_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_outer_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_permute_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_polygamma_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_polygamma_polygamma_n_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_polygamma_polygamma_n_2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_positive_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_randint_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_randn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_randn_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_renorm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_repeat_interleave_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_reshape_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_resize__cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_roll_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_round_decimals_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_rsqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_scatter_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_select_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_sigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_signal_windows_general_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_sinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_airy_ai_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_bessel_y0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_i0e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_modified_bessel_i1_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_ndtri_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_xlog1py_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_squeeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_sub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_sum_to_size_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_take_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_tan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_tensor_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_tile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_to_sparse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_topk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_view_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_view_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_xlogy_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_zero__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_heaviside_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_igammac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_jiterator_4inputs_with_extra_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_jiterator_unary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_ldexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_cond_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_det_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_eigvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_householder_product_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_lu_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_lu_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_lu_solve_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_multi_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_vector_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_long_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_lt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_lu_unpack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_mH_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_matmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_max_pool2d_with_indices_backward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_meshgrid_variadic_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_min_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_multinomial_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_mvlgamma_mvlgamma_p_5_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_new_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_adaptive_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_alpha_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_celu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv2d_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv2d_stride_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv2d_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv_transpose1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_hardsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_hinge_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_huber_loss_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_logsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_max_unpool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_mse_loss_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_pad_circular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_smooth_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_softmin_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_softplus_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_softsign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_pca_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_permute_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_polar_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_polygamma_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_quantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_real_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_reciprocal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_repeat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_resize_as__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_rsub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_scalar_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_signal_windows_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_signal_windows_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_signal_windows_kaiser_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_sparse_sampled_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_bessel_y0_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_chebyshev_polynomial_v_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_i0e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_legendre_polynomial_p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_modified_bessel_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_ndtr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_shifted_chebyshev_polynomial_v_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_spherical_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_std_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_take_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_tile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_torch_ops_aten__safe_softmax_default_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_trapz_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_tril_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_true_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_trunc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_unsafe_chunk_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_vstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_where_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_zero__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp___rmatmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp___rmod___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp__chunk_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp__native_batch_norm_legit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp__segment_reduce_lengths_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp__unsafe_masked_index_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp__unsafe_masked_index_put_accumulate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_addcdiv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_addr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_arange_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_argwhere_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_as_strided_scatter_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_asinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_atleast_1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_atleast_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_block_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_broadcast_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_ceil_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_chalf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cholesky_inverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_clone_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_count_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cumulative_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_div_floor_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_exp_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_fftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_fftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_irfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_irfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_float_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_float_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fmod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_frexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_full_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_hash_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_hstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_igammac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_index_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_index_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_index_reduce_amax_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_index_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_int_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_isin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_isposinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_jiterator_binary_return_by_ref_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_eig_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_eigvalsh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_inv_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_ldl_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_lu_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_pinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_pinv_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_tensorinv_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_log10_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_log_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_logical_and_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_logical_or_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_long_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_lt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_mT_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_min_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_min_reduction_with_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_mvlgamma_mvlgamma_p_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_mvlgamma_mvlgamma_p_5_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nanmedian_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_new_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_conv2d_stride_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_conv2d_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_conv_transpose3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_embedding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_gaussian_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_grid_sample_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_group_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_hardshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_hardsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_interpolate_bicubic_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_interpolate_linear_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_kl_div_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_local_response_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_max_unpool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_max_unpool3d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_mse_loss_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_multi_head_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_poisson_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_prelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_triplet_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_unfold_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_upsample_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_normal_in_place_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_pca_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_pinverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_positive_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_remainder_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_reshape_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_roll_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_round_decimals_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_scatter_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_scatter_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_signal_windows_blackman_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_signal_windows_general_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_signal_windows_general_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_signal_windows_kaiser_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_sparse_mm_reduce_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_erfcx_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_laguerre_polynomial_l_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_ndtr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_scaled_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_shifted_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_spherical_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_square_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_sub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_svd_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_take_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_tile_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_tril_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_triu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_unbind_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_unsqueeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_view_as_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_vstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvmap_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvmap_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvmap_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_NumpyExpMarkDirtyAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_SelectGenVmapAutogradFunction_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp___getitem___functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp__chunk_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp__native_batch_norm_legit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp__segment_reduce_lengths_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp__segment_reduce_offsets_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_addbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_addcmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_arange_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_atan2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_atleast_1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_atleast_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_bernoulli_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_bfloat16_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_block_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cartesian_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cdouble_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cfloat_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_char_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_contiguous_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_erfinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_fftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_hfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_ifft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_ifftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_ihfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_irfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_flip_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_floor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fmod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_ge_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_grid_sampler_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_CubeGenVmapAutogradFunction_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_ScaleGradGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule___getitem___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule___getitem___functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule___rmatmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule__batch_norm_with_update_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule__segment_reduce_lengths_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule__unsafe_masked_index_put_accumulate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_addmm_decomposed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_allclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_any_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_argwhere_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_atan2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_atan_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_atanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_atleast_1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_atleast_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_bmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cartesian_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cauchy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cummin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cumulative_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_diagonal_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_div_no_rounding_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_double_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_expand_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_expand_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_fftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_ifft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_ifft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_ihfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_irfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_rfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_gather_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_ge_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_geqrf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_grid_sampler_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_gt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_hash_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_heaviside_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_hypot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_igamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_index_fill_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_index_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_isin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_isreal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_jiterator_2inputs_2outputs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_ldl_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_lu_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_matrix_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_vander_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_vecdot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_log_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_logdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_logical_or_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_lt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_max_pool2d_with_indices_backward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_mm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_mv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nansum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_native_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv2d_stride_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv2d_stride_with_bias_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_ctc_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_embedding_bag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_embedding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_gaussian_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_hardshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_hinge_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_interpolate_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_kl_div_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_leaky_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_mse_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_mse_loss_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_pad_circular_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_pixel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_pixel_unshuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_prelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_rms_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_tanhshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nonzero_static_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_normal_in_place_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_ops_aten__new_zeros_with_same_feature_meta_functorchonly_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_pinverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_polygamma_polygamma_n_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_remainder_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_repeat_interleave_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_resize__cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_rot90_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_scatter_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_scatter_reduce_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_signal_windows_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_signal_windows_general_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_signal_windows_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_sin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_sinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_bessel_j1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_laguerre_polynomial_l_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_legendre_polynomial_p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_modified_bessel_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_shifted_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_spherical_bessel_j0_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_zeta_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_split_with_sizes_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_std_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_tanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_tensor_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_topk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_torch_ops_aten__efficient_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_transpose_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_tril_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_uniform_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_unsafe_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_unsqueeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_view_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_hstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_hypot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_igammac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_index_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_index_copy_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_inner_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_int_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_isreal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_jiterator_unary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_ldexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_le_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_lerp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_det_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_lu_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_multi_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_pinv_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_pinv_singular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_solve_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_vector_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_log10_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_log_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_long_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_mH_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_matrix_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_max_reduction_with_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_meshgrid_variadic_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_min_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_movedim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_mv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_ne_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_new_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_new_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_new_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_new_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_adaptive_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_adaptive_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_channel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_conv2d_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_cosine_similarity_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_dropout3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_elu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_embedding_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_grid_sample_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_hardsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_instance_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_interpolate_area_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_interpolate_bicubic_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_interpolate_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_interpolate_trilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_leaky_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_max_unpool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_max_unpool2d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_multi_head_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_pad_reflect_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_pixel_unshuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_scaled_dot_product_attention_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_softmin_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_softshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nonzero_static_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_ops_aten__new_zeros_with_same_feature_meta_functorchonly_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_permute_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_pinverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_positive_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_randint_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_reciprocal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_resize_as__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_round_decimals_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_rsqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_signal_windows_bartlett_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_signal_windows_blackman_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_signal_windows_gaussian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_signal_windows_general_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_signal_windows_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_signbit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_sin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_sinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_airy_ai_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_chebyshev_polynomial_w_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_erfcx_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_hermite_polynomial_he_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_i0e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_i1e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_log_ndtr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_ndtri_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_scaled_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_shifted_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_split_list_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_squeeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_squeeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_std_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_std_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_torch_ops_aten__safe_softmax_default_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_transpose_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_trapz_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_tril_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_zero__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_zeros_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp___rdiv___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp__batch_norm_with_update_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp__segment_reduce_lengths_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp__unsafe_masked_index_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_addbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_addcdiv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_as_strided_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_as_strided_partial_views_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_asin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_atleast_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_block_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_broadcast_shapes_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_chalf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cumulative_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_diag_embed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_digamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_div_floor_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_double_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_einsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_expand_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_expand_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_eye_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_fft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_hfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_ihfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_irfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_floor_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fmod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_hash_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_hstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_index_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_index_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_index_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_inner_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_jiterator_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_jiterator_binary_return_by_ref_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_le_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_cond_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_det_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_eig_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_eigvalsh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_ldl_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_matrix_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_norm_subgradients_at_zero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_pinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_log_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_logcumsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_logdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_long_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_masked_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_masked_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_max_reduction_no_dim_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_max_reduction_with_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_mvlgamma_mvlgamma_p_5_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nanquantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_narrow_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_new_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_adaptive_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_channel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_conv1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_cosine_similarity_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_embedding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_gelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_hardshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_hinge_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_logsigmoid_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_pad_circular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_pixel_unshuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_scaled_dot_product_attention_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_silu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_smooth_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_triplet_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_unfold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_upsample_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_normal_in_place_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_normal_number_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ormqr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_pinverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_polygamma_polygamma_n_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_polygamma_polygamma_n_4_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_positive_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_pow_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_randn_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ravel_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_reciprocal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_remainder_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_reshape_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_round_decimals_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_rsqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_scatter_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_signal_windows_gaussian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_signal_windows_hann_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_sparse_sampled_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_bessel_y0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_chebyshev_polynomial_v_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_i0e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_modified_bessel_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_ndtri_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_shifted_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_spherical_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_zeta_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_split_list_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_squeeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_squeeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_sum_to_size_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_tile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_transpose_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_triu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_true_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_unsafe_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_unsqueeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_unsqueeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_var_mean_cuda_float32, 
test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_view_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_SelectAutogradFunction_cuda_float32
2025-12-04T15:39:41.6250375Z
2025-12-04T15:39:41.6250498Z Finished functorch/test_ops 1/4 ... [2025-12-04 15:39:41.562235][2212244.02322027], took 15.00min
2025-12-04T15:39:41.6250881Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml
2025-12-04T15:39:41.6251235Z Failed to parse and upload json test reports: Unable to locate credentials
2025-12-04T15:39:41.6251433Z Running test_nn 2/2 ... [2025-12-04 15:39:41.568783][2212244.029772521]
2025-12-04T15:39:41.6251591Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2025-12-04T15:39:41.6251954Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_nn.py', '--shard-id=2', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:39:41.568997]
2025-12-04T15:52:53.6381511Z
2025-12-04T15:52:53.6382229Z test_nn 2/2 was successful, full logs can be found in artifacts with path test/test-reports/test_nn_2.2_31b848336d30a6d2_.log
2025-12-04T15:52:53.6593215Z Running 1240 items in this shard: test/test_nn.py::TestNN::test_AdaptiveLogSoftmax_cuda_fp32, test/test_nn.py::TestNN::test_AdaptiveLogSoftmax_cuda_tf32, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_reduce, test/test_nn.py::TestNN::test_BCELoss_no_reduce_scalar, test/test_nn.py::TestNN::test_BCELoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_scalar, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_legacy_enum, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_scalar, test/test_nn.py::TestNN::test_CTCLoss_critical_target_len, test/test_nn.py::TestNN::test_CTCLoss_lengthchecks_cpu, test/test_nn.py::TestNN::test_CTCLoss_lengthchecks_cuda, 
test/test_nn.py::TestNN::test_CTCLoss_long_targets, test/test_nn.py::TestNN::test_CTCLoss_typechecks, test/test_nn.py::TestNN::test_CTCLoss_zero_infinity, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_dilated_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_groups, test/test_nn.py::TestNN::test_Conv1d_pad1_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad1_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad1size1, test/test_nn.py::TestNN::test_Conv1d_pad1size1_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad2, test/test_nn.py::TestNN::test_Conv1d_pad2size1, test/test_nn.py::TestNN::test_Conv1d_pad2size1_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad_same, test/test_nn.py::TestNN::test_Conv1d_pad_same2_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad_same_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad_same_dilated_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad_valid, test/test_nn.py::TestNN::test_Conv1d_pad_valid_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad_valid_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_zero_batch, test/test_nn.py::TestNN::test_Conv1d_zero_batch_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2_cuda_fp32, 
test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_depthwise_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_depthwise_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_depthwise_padded, test/test_nn.py::TestNN::test_Conv2d_depthwise_padded_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_depthwise_padded_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_depthwise_strided_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_dilated_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_groups_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_no_bias_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_no_bias_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv2d_pad_valid, test/test_nn.py::TestNN::test_Conv2d_pad_valid_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_pad_valid_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_padding, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_strided, 
test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_zero_batch, test/test_nn.py::TestNN::test_Conv2d_zero_batch_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_zero_batch_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_circular_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_dilated, test/test_nn.py::TestNN::test_Conv3d_dilated_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_dilated_strided, test/test_nn.py::TestNN::test_Conv3d_dilated_strided_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_dilated_strided_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_groups, test/test_nn.py::TestNN::test_Conv3d_groups_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_groups_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_no_bias, test/test_nn.py::TestNN::test_Conv3d_no_bias_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_no_bias_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_pad_same_cuda_fp32, 
test/test_nn.py::TestNN::test_Conv3d_pad_same_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_pad_valid_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_stride_padding_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose1d_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose1d_dilated_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose1d_groups_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose1d_groups_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose1d_no_bias_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose2d, test/test_nn.py::TestNN::test_ConvTranspose2d_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor_cuda_tf32, 
test/test_nn.py::TestNN::test_ConvTranspose2d_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose3d, test/test_nn.py::TestNN::test_ConvTranspose3d_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose3d_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated_cuda_tf32, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_CrossMapLRN2d_cuda, test/test_nn.py::TestNN::test_ELU_no_batch_dim, test/test_nn.py::TestNN::test_Embedding, test/test_nn.py::TestNN::test_EmbeddingBag_discontiguous, test/test_nn.py::TestNN::test_EmbeddingBag_max, test/test_nn.py::TestNN::test_EmbeddingBag_max_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_max_padding_idx_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_mean_padding_idx_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sum_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sum_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_sum_padding_idx_cuda, test/test_nn.py::TestNN::test_Embedding_cuda, test/test_nn.py::TestNN::test_Embedding_sparse_cuda, test/test_nn.py::TestNN::test_Flatten, test/test_nn.py::TestNN::test_Flatten_no_batch_dim, test/test_nn.py::TestNN::test_Flatten_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Fold, test/test_nn.py::TestNN::test_Fold_int_input, test/test_nn.py::TestNN::test_Fold_int_input_cuda, 
test/test_nn.py::TestNN::test_Fold_no_batch_dim_input_cuda, test/test_nn.py::TestNN::test_GELU_no_batch_dim, test/test_nn.py::TestNN::test_GLU_no_batch_dim, test/test_nn.py::TestNN::test_GLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardshrink_no_batch_dim, test/test_nn.py::TestNN::test_Hardsigmoid_no_batch_dim, test/test_nn.py::TestNN::test_Hardsigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardswish_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardtanh_no_batch_dim, test/test_nn.py::TestNN::test_Hardtanh_no_batch_dim_cuda, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_margin_no_reduce, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_margin_no_reduce_cuda, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_reduce, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_HuberLoss_delta, test/test_nn.py::TestNN::test_HuberLoss_delta_cuda, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_fp32, 
test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_log_target_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_KLDivLoss_with_log_target_no_reduce_cuda, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_reduce_complex, test/test_nn.py::TestNN::test_L1Loss_no_reduce_cuda, test/test_nn.py::TestNN::test_L1Loss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_LSTM_cell_forward_hidden_size, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_eval, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_eval_cuda, test/test_nn.py::TestNN::test_LeakyReLU_no_batch_dim, test/test_nn.py::TestNN::test_Linear, 
test/test_nn.py::TestNN::test_Linear_cuda_fp32, test/test_nn.py::TestNN::test_Linear_no_batch_dim, test/test_nn.py::TestNN::test_Linear_no_bias, test/test_nn.py::TestNN::test_LogSigmoid_no_batch_dim, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MSELoss_no_reduce_scalar, test/test_nn.py::TestNN::test_MSELoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_MaxUnpool1d_net, test/test_nn.py::TestNN::test_MaxUnpool2d_net, test/test_nn.py::TestNN::test_MaxUnpool2d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MaxUnpool3d_net_no_batch_dim, test/test_nn.py::TestNN::test_Mish_no_batch_dim, test/test_nn.py::TestNN::test_ModuleDict, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_0d_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_0d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_1d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_index_neg, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean, 
test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_MultiMarginLoss_1d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_margin_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_p_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_p_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_fp32, 
test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_neg, test/test_nn.py::TestNN::test_PReLU_backward_requires_grad_false, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_lhs, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_lhs_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_rhs, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_rhs_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_no_batch_dim, test/test_nn.py::TestNN::test_PairwiseDistance_no_batch_dim_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_with_non_default_args, test/test_nn.py::TestNN::test_ParameterDict_replication, test/test_nn.py::TestNN::test_ParameterList_replication, test/test_nn.py::TestNN::test_PixelShuffle, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_half, 
test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_RNN_cell, test/test_nn.py::TestNN::test_RNN_change_dropout, test/test_nn.py::TestNN::test_RNN_cpu_vs_cudnn_no_dropout, test/test_nn.py::TestNN::test_RNN_cpu_vs_cudnn_with_dropout, test/test_nn.py::TestNN::test_RNN_cudnn_weight_norm, test/test_nn.py::TestNN::test_RNN_dropout, test/test_nn.py::TestNN::test_RNN_input_size_zero, test/test_nn.py::TestNN::test_RNN_nonlinearity, test/test_nn.py::TestNN::test_RNN_nonlinearity_passed_as_arg, test/test_nn.py::TestNN::test_RReLU, test/test_nn.py::TestNN::test_RReLU_with_up_down, test/test_nn.py::TestNN::test_RReLU_with_up_down_scalar_cuda, test/test_nn.py::TestNN::test_ReLU6_no_batch_dim, test/test_nn.py::TestNN::test_ReLU6_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ReLU_no_batch_dim, test/test_nn.py::TestNN::test_ReplicationPad3d, test/test_nn.py::TestNN::test_ReplicationPad3d_complex, test/test_nn.py::TestNN::test_ReplicationPad3d_no_batch_dim, test/test_nn.py::TestNN::test_SELU_no_batch_dim, test/test_nn.py::TestNN::test_Sequential_add, test/test_nn.py::TestNN::test_Sequential_append, test/test_nn.py::TestNN::test_Sequential_delitem, test/test_nn.py::TestNN::test_Sequential_imul, test/test_nn.py::TestNN::test_Sequential_insert, test/test_nn.py::TestNN::test_Sequential_insert_fail_case, test/test_nn.py::TestNN::test_Sequential_mul, test/test_nn.py::TestNN::test_Sequential_pop, test/test_nn.py::TestNN::test_Sequential_setitem, test/test_nn.py::TestNN::test_Sequential_setitem_named, test/test_nn.py::TestNN::test_SiLU_no_batch_dim, test/test_nn.py::TestNN::test_Sigmoid_no_batch_dim, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_half, 
test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_scalar, test/test_nn.py::TestNN::test_SmoothL1Loss_zero_beta, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_reduce, test/test_nn.py::TestNN::test_Softplus_no_batch_dim, test/test_nn.py::TestNN::test_Softshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Softsign_no_batch_dim, test/test_nn.py::TestNN::test_Tanh_no_batch_dim, test/test_nn.py::TestNN::test_Tanh_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Tanhshrink_no_batch_dim, test/test_nn.py::TestNN::test_Threshold_no_batch_dim, test/test_nn.py::TestNN::test_Threshold_no_batch_dim_cuda, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation_cuda_fp32, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation_cuda_tf32, 
test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation, test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation_cuda_fp32, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation_cuda_fp32, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation_cuda_tf32, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation_cuda_tf32, test/test_nn.py::TestNN::test_Transformer_multilayer_coder_cuda_tf32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_Unflatten_no_batch_dim, test/test_nn.py::TestNN::test_Unflatten_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Unfold_int_input, test/test_nn.py::TestNN::test_Unfold_int_input_cuda, test/test_nn.py::TestNN::test_adaptive_log_softmax, test/test_nn.py::TestNN::test_add_module, test/test_nn.py::TestNN::test_affine_grid, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cpu_nd_3, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cuda_nd_2, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NCHW_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_NCHW_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_cpu_mixed_bfloat16, 
test/test_nn.py::TestNN::test_batchnorm_2D_inference_NHWC_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NCHW_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_2D_train_NCHW_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NCHW_vs_native_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_NCHW_float32, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_NCHW_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_2D_train_NHWC_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NCHW_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NCHW_vs_cpu_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NCHW_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_NCHW_float32, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_NCHW_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_cpu_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_inference_NHWC_vs_native_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NCHW_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_NCHW_mixed_float16, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_cpu_float32, test/test_nn.py::TestNN::test_batchnorm_3D_train_NHWC_vs_cpu_mixed_bfloat16, test/test_nn.py::TestNN::test_batchnorm_half_overflow, test/test_nn.py::TestNN::test_batchnorm_non_contig_cpu_SyncBatchNorm, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_mean_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_var_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_var_or_running_mean_have_forward_grad, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_weight_is_not_same_size_as_input, 
test/test_nn.py::TestNN::test_bce_loss_always_nonnegative, test/test_nn.py::TestNN::test_bce_loss_broadcasts_weights, test/test_nn.py::TestNN::test_bce_loss_input_range, test/test_nn.py::TestNN::test_bce_loss_size_mismatch, test/test_nn.py::TestNN::test_bce_with_logits_broadcasts_pos_weights, test/test_nn.py::TestNN::test_bce_with_logits_has_correct_grad_at_zero, test/test_nn.py::TestNN::test_bce_with_logits_raises_if_target_and_input_are_different_size, test/test_nn.py::TestNN::test_bce_with_logits_stability, test/test_nn.py::TestNN::test_bilinear_broadcasting, test/test_nn.py::TestNN::test_bilinear_no_bias, test/test_nn.py::TestNN::test_bilinear_non_contiguous, test/test_nn.py::TestNN::test_bilinear_value_error, test/test_nn.py::TestNN::test_broadcast_not_requiring_grad, test/test_nn.py::TestNN::test_buffer_not_persistent, test/test_nn.py::TestNN::test_buffer_not_persistent_assign, test/test_nn.py::TestNN::test_buffer_not_persistent_del, test/test_nn.py::TestNN::test_buffer_not_persistent_overwrite, test/test_nn.py::TestNN::test_call_supports_python_dict_output, test/test_nn.py::TestNN::test_channel_shuffle_input_checks, test/test_nn.py::TestNN::test_channel_shuffle_return_alias_of_self, test/test_nn.py::TestNN::test_children, test/test_nn.py::TestNN::test_container_copy, test/test_nn.py::TestNN::test_convert_sync_batchnorm, test/test_nn.py::TestNN::test_cosine_embedding_loss_error_on_diff_shapes, test/test_nn.py::TestNN::test_cosine_embedding_loss_error_on_nonexpandable_shapes, test/test_nn.py::TestNN::test_cosine_embedding_loss_invalid_shape, test/test_nn.py::TestNN::test_cosine_similarity, test/test_nn.py::TestNN::test_cross_entropy_loss, test/test_nn.py::TestNN::test_cross_entropy_loss_zero_div, test/test_nn.py::TestNN::test_cudnn_forward_exception, test/test_nn.py::TestNN::test_cudnn_weight_format, test/test_nn.py::TestNN::test_dir, test/test_nn.py::TestNN::test_dir_digit, test/test_nn.py::TestNN::test_elu_inplace_gradgrad, 
test/test_nn.py::TestNN::test_elu_inplace_on_view, test/test_nn.py::TestNN::test_error_RNN_seq_len_zero, test/test_nn.py::TestNN::test_extra_state_missing_get_extra_state, test/test_nn.py::TestNN::test_extra_state_non_dict, test/test_nn.py::TestNN::test_fold_invalid_arg, test/test_nn.py::TestNN::test_get_buffer_from_submodules, test/test_nn.py::TestNN::test_getattr_with_property, test/test_nn.py::TestNN::test_grid_sample_nearest_neighbor_rounding_mode_consistency, test/test_nn.py::TestNN::test_interpolate, test/test_nn.py::TestNN::test_interpolate_bicubic_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_shared_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_shared_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_buffer_overflow, test/test_nn.py::TestNN::test_interpolate_illegal_memory_access, test/test_nn.py::TestNN::test_interpolate_linear_1d, test/test_nn.py::TestNN::test_interpolate_linear_1d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_cuda, test/test_nn.py::TestNN::test_interpolate_linear_tuple_1d, test/test_nn.py::TestNN::test_interpolate_nearest_1d, 
test/test_nn.py::TestNN::test_interpolate_nearest_1d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d, test/test_nn.py::TestNN::test_interpolate_nearest_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_1d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_zero_dim, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d, test/test_nn.py::TestNN::test_kl_div_log_softmax_target, test/test_nn.py::TestNN::test_kl_div_with_diff_type, test/test_nn.py::TestNN::test_kl_div_with_diff_type_log_target, test/test_nn.py::TestNN::test_layer_norm_backwards_eps, test/test_nn.py::TestNN::test_layer_norm_eps, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCSR, test/test_nn.py::TestNN::test_linear_broadcasting, test/test_nn.py::TestNN::test_linear_raise_on_scalar_input, 
test/test_nn.py::TestNN::test_log_softmax_dim0, test/test_nn.py::TestNN::test_log_softmax_dim0_cuda, test/test_nn.py::TestNN::test_log_softmax_dim3, test/test_nn.py::TestNN::test_log_softmax_dim3_cuda, test/test_nn.py::TestNN::test_log_softmax_lastdim, test/test_nn.py::TestNN::test_log_softmax_lastdim_cuda, test/test_nn.py::TestNN::test_log_softmax_scalar_cuda, test/test_nn.py::TestNN::test_log_softmax_spatial, test/test_nn.py::TestNN::test_log_softmax_spatial_cuda, test/test_nn.py::TestNN::test_log_softmax_spatial_special, test/test_nn.py::TestNN::test_margin_ranking_loss_margin_no_reduce, test/test_nn.py::TestNN::test_max_pool1d_invalid_output_size, test/test_nn.py::TestNN::test_module_apply_inplace_op, test/test_nn.py::TestNN::test_module_to_argparse, test/test_nn.py::TestNN::test_mse_loss_size_warning, test/test_nn.py::TestNN::test_multimarginloss_1d_input_0d_target_no_reduce_cuda, test/test_nn.py::TestNN::test_named_children, test/test_nn.py::TestNN::test_native_channel_shuffle_return_alias_of_self, test/test_nn.py::TestNN::test_nested_tensor_from_mask_error, test/test_nn.py::TestNN::test_no_grad, test/test_nn.py::TestNN::test_non_leaf_parameters, test/test_nn.py::TestNN::test_normalize, test/test_nn.py::TestNN::test_pad_scalar_error, test/test_nn.py::TestNN::test_pairwise_distance, test/test_nn.py::TestNN::test_parameter_assignment, test/test_nn.py::TestNN::test_parameterlistdict_pickle, test/test_nn.py::TestNN::test_parameters_and_named_parameters, test/test_nn.py::TestNN::test_parameters_to_vector, test/test_nn.py::TestNN::test_parse_to, test/test_nn.py::TestNN::test_partial_flat_weights, test/test_nn.py::TestNN::test_pdist, test/test_nn.py::TestNN::test_pdist_cpu_gradgrad_unimplemented, test/test_nn.py::TestNN::test_pdist_cuda_gradgrad_unimplemented, test/test_nn.py::TestNN::test_pdist_empty_row, test/test_nn.py::TestNN::test_pdist_large, test/test_nn.py::TestNN::test_pdist_zeros, test/test_nn.py::TestNN::test_pointwise_loss_target_grad_none_reduction, 
test/test_nn.py::TestNN::test_projections_lstm_check_device, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_name_is_not_string, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_not_tensor, test/test_nn.py::TestNN::test_register_parameter_allows_overwriting_with_same_name, test/test_nn.py::TestNN::test_register_parameter_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_repr, test/test_nn.py::TestNN::test_requires_grad_, test/test_nn.py::TestNN::test_rnn_args_check, test/test_nn.py::TestNN::test_share_memory, test/test_nn.py::TestNN::test_smoothl1loss_negative_beta_not_supported, test/test_nn.py::TestNN::test_softmax_functional_dim3_cuda, test/test_nn.py::TestNN::test_softmax_functional_scalar, test/test_nn.py::TestNN::test_softmax_functional_scalar_cuda, test/test_nn.py::TestNN::test_softmax_lastdim_cuda, test/test_nn.py::TestNN::test_softmax_lastdim_dtype_cuda, test/test_nn.py::TestNN::test_softmax_spatial_cuda, test/test_nn.py::TestNN::test_softmax_spatial_dtype, test/test_nn.py::TestNN::test_softmax_spatial_special_cuda, test/test_nn.py::TestNN::test_sync_batchnorm_accuracy_cuda, test/test_nn.py::TestNN::test_sync_batchnorm_backward_elemt, test/test_nn.py::TestNN::test_threshold_bfloat16_half, test/test_nn.py::TestNN::test_threshold_int, test/test_nn.py::TestNN::test_train_errors_for_invalid_mode, test/test_nn.py::TestNN::test_transformer_layer_args_check, test/test_nn.py::TestNN::test_transformerdecoder, test/test_nn.py::TestNN::test_transformerdecoderlayer, test/test_nn.py::TestNN::test_triplet_margin_loss, test/test_nn.py::TestNN::test_triplet_margin_loss_swap_no_reduce, test/test_nn.py::TestNN::test_unflatten_invalid_arg, test/test_nn.py::TestNN::test_upsamplingLinear1d, test/test_nn.py::TestNN::test_upsamplingTrilinear3d_spatial_invariance, test/test_nn.py::TestNN::test_vector_to_parameters, test/test_nn.py::TestNN::test_weight_norm, test/test_nn.py::TestNN::test_weight_norm_pickle, 
test/test_nn.py::TestNN::test_zero_grad, test/test_nn.py::TestConstantPadNd::test_preserves_memory_format, test/test_nn.py::TestAddRelu::test_add_relu_broadcasting, test/test_nn.py::TestFusionUtils::test_fuse_conv_bn_requires_grad, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_empty_target_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_mean_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_none_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_sum_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GRU_grad_and_gradgrad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_memory_format_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm2d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm3d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_differentiable_backward_using_oneDNN_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_grad_and_gradgrad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_LayerNorm_numeric_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_race_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_warnings_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad2d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad2d_large_deterministic_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad1d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad2d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad3d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad_empty_cuda_float64, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerDecoderLayer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerDecoder_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Transformer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Unfold_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_half_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_half_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_adaptiveavg_pool1d_shmem_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate45_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate90_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_3d_rotateRandom_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_avg_pool_large_tensor2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_avg_pool_large_tensor_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_large_batch_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_large_batch_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_channel_shuffle_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_error_if_nonfinite_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_1_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_multi_device_foreach_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_multi_device_foreach_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_value_foreach_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_value_foreach_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_weight_ignore_indices_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_with_probs_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_2d_out_of_bounds_class_index_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_2d_out_of_bounds_class_index_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_index_target_unit_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_one_hot_target_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_mean_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_mean_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_none_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_tensor_cpu_length_cuda_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_tensor_cuda_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cudnn_rnn_cuda_bfloat16, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_device_mask_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_glu_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_bfloat16_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_3d_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_nan_inf_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardsigmoid_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_grad_corner_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_False_affine_False_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_invalid_reduction_strings_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_large_max_pool_contig_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_large_reflect_pad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_linear_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_big_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_big_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_logsigmoid_out_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_lstmcell_backward_only_one_output_grad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_TxT_layout_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_devices_parity_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_lowp_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_lowp_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_mish_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_mse_loss_error_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_mean_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_none_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_invalid_target_dim_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_mismatched_batch_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_total_weight_is_zero_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_scalars_reductions_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_overwrite_module_params_on_conversion_cpu_device_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_pad_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_prelu_backward_32bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_replicatepad_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rmsnorm_epsilon_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rmsnorm_numeric_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rmsnorm_numeric_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_fused_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_silu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smooth_l1_loss_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smoothl1loss_backward_zero_beta_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_backward_without_fully_vectorized_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_results_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_softplus_low_threshold_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float64, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_triplet_margin_with_distance_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_triplet_margin_with_distance_loss_default_parity_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_False_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_False_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int8_cuda_int8, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int64_cuda_int64, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_uint8_cuda_uint8, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_float64_cuda_float64, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_aa_correctness_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_aa_correctness_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_correctness_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_correctness_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_fail_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format0_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format0_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format1_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format0_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format1_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_correctness_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_rescale_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_False_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_True_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_True_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsampling_64bit_indexing_channels_last_cuda_bfloat16, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float64 2025-12-04T15:52:53.6796831Z 2025-12-04T15:52:53.6796931Z Finished test_nn 2/2 ... [2025-12-04 15:52:53.638963][2213036.0999504], took 13.20min 2025-12-04T15:52:53.6797308Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-24caf5a151463e15.xml 2025-12-04T15:52:53.6797671Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T15:52:53.6797895Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T15:52:53.6798079Z Uploading artifacts took 0.00 seconds 2025-12-04T15:52:55.7918838Z Running test batch 'tests to run' cost 23565.34 seconds 2025-12-04T15:52:55.7929930Z Emitting td_test_failure_stats_v2 2025-12-04T15:52:55.7934074Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764863575_4ca06c50d12911f096500ef8a4e6025b 2025-12-04T15:52:57.8135372Z /var/lib/jenkins/pytorch/tools/stats/upload_metrics.py:156: UserWarning: Error uploading metric td_test_failure_stats_v2 to DynamoDB: Unable to locate credentials 2025-12-04T15:52:57.8148796Z warn(f"Error uploading metric {metric_name} to DynamoDB: {e}") 2025-12-04T15:52:57.8149141Z Emitting td_test_failure_stats_v2 2025-12-04T15:52:57.8149576Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764863577_4dd4cdd2d12911f096500ef8a4e6025b 2025-12-04T15:52:57.8172289Z dynamo/test_misc 1/1 failed! 2025-12-04T15:52:57.8172458Z inductor/test_fp8 1/1 failed! 
2025-12-04T15:52:58.6749144Z 2025-12-04T15:52:58.6749581Z real 392m51.416s 2025-12-04T15:52:58.6749902Z user 2399m23.236s 2025-12-04T15:52:58.6750286Z sys 273m57.833s 2025-12-04T15:52:58.6750481Z + sccache_epilogue 2025-12-04T15:52:58.6750752Z + echo '::group::Sccache Compilation Log' 2025-12-04T15:52:58.6751375Z ##[group]Sccache Compilation Log 2025-12-04T15:52:58.6751669Z + echo '=================== sccache compilation log ===================' 2025-12-04T15:52:58.6751989Z =================== sccache compilation log =================== 2025-12-04T15:52:58.6752442Z + python /var/lib/jenkins/pytorch/.ci/pytorch/print_sccache_log.py /var/lib/jenkins/sccache_error.log 2025-12-04T15:52:58.6828697Z + echo '=========== If your build fails, please take a look at the log above for possible reasons ===========' 2025-12-04T15:52:58.6829127Z =========== If your build fails, please take a look at the log above for possible reasons =========== 2025-12-04T15:52:58.6831127Z + sccache --show-stats 2025-12-04T15:52:58.6853793Z Compile requests 7785 2025-12-04T15:52:58.6854012Z Compile requests executed 399 2025-12-04T15:52:58.6854246Z Cache hits 24 2025-12-04T15:52:58.6854441Z Cache hits (C/C++) 24 2025-12-04T15:52:58.6854633Z Cache misses 375 2025-12-04T15:52:58.6854823Z Cache misses (C/C++) 368 2025-12-04T15:52:58.6855013Z Cache misses (HIP) 7 2025-12-04T15:52:58.6855210Z Cache hits rate 6.02 % 2025-12-04T15:52:58.6855412Z Cache hits rate (C/C++) 6.12 % 2025-12-04T15:52:58.6856132Z Cache hits rate (HIP) 0.00 % 2025-12-04T15:52:58.6856338Z Cache timeouts 0 2025-12-04T15:52:58.6856534Z Cache read errors 0 2025-12-04T15:52:58.6856729Z Forced recaches 0 2025-12-04T15:52:58.6856923Z Cache write errors 0 2025-12-04T15:52:58.6857117Z Cache errors 0 2025-12-04T15:52:58.6857309Z Compilations 375 2025-12-04T15:52:58.6857502Z Compilation failures 0 2025-12-04T15:52:58.6857703Z Non-cacheable compilations 0 2025-12-04T15:52:58.6857909Z Non-cacheable calls 206 2025-12-04T15:52:58.6858109Z 
Non-compilation calls 7180 2025-12-04T15:52:58.6870644Z Unsupported compiler calls 0 2025-12-04T15:52:58.6870989Z Average cache write 0.000 s 2025-12-04T15:52:58.6871221Z Average compiler 2.443 s 2025-12-04T15:52:58.6871463Z Average cache read hit 0.000 s 2025-12-04T15:52:58.6871730Z Failed distributed compilations 0 2025-12-04T15:52:58.6871884Z 2025-12-04T15:52:58.6871964Z Non-cacheable reasons: 2025-12-04T15:52:58.6872173Z unknown source language 167 2025-12-04T15:52:58.6872384Z -E 39 2025-12-04T15:52:58.6872524Z 2025-12-04T15:52:58.6872668Z Cache location Local disk: "/var/lib/jenkins/.cache/sccache" 2025-12-04T15:52:58.6872968Z Use direct/preprocessor mode? yes 2025-12-04T15:52:58.6873458Z Version (client) 0.10.0 2025-12-04T15:52:58.6873678Z Cache size 38 MiB 2025-12-04T15:52:58.6873960Z Max cache size 10 GiB 2025-12-04T15:52:58.6874174Z + sccache --stop-server 2025-12-04T15:52:58.6874428Z Stopping sccache server... 2025-12-04T15:52:58.6877386Z Compile requests 7785 2025-12-04T15:52:58.6877571Z Compile requests executed 399 2025-12-04T15:52:58.6877728Z Cache hits 24 2025-12-04T15:52:58.6877898Z Cache hits (C/C++) 24 2025-12-04T15:52:58.6878054Z Cache misses 375 2025-12-04T15:52:58.6878209Z Cache misses (C/C++) 368 2025-12-04T15:52:58.6878364Z Cache misses (HIP) 7 2025-12-04T15:52:58.6878532Z Cache hits rate 6.02 % 2025-12-04T15:52:58.6878700Z Cache hits rate (C/C++) 6.12 % 2025-12-04T15:52:58.6878863Z Cache hits rate (HIP) 0.00 % 2025-12-04T15:52:58.6879025Z Cache timeouts 0 2025-12-04T15:52:58.6879183Z Cache read errors 0 2025-12-04T15:52:58.6879349Z Forced recaches 0 2025-12-04T15:52:58.6879506Z Cache write errors 0 2025-12-04T15:52:58.6879670Z Cache errors 0 2025-12-04T15:52:58.6879824Z Compilations 375 2025-12-04T15:52:58.6879978Z Compilation failures 0 2025-12-04T15:52:58.6880176Z Non-cacheable compilations 0 2025-12-04T15:52:58.6880340Z Non-cacheable calls 206 2025-12-04T15:52:58.6880501Z Non-compilation calls 7180 2025-12-04T15:52:58.6880671Z Unsupported 
compiler calls 0 2025-12-04T15:52:58.6880847Z Average cache write 0.000 s 2025-12-04T15:52:58.6881012Z Average compiler 2.443 s 2025-12-04T15:52:58.6881179Z Average cache read hit 0.000 s 2025-12-04T15:52:58.6881361Z Failed distributed compilations 0 2025-12-04T15:52:58.6881478Z 2025-12-04T15:52:58.6881535Z Non-cacheable reasons: 2025-12-04T15:52:58.6881687Z unknown source language 167 2025-12-04T15:52:58.6881848Z -E 39 2025-12-04T15:52:58.6881955Z 2025-12-04T15:52:58.6882069Z Cache location Local disk: "/var/lib/jenkins/.cache/sccache" 2025-12-04T15:52:58.6882286Z Use direct/preprocessor mode? yes 2025-12-04T15:52:58.6882457Z Version (client) 0.10.0 2025-12-04T15:52:58.6882698Z Cache size 38 MiB 2025-12-04T15:52:58.6882869Z Max cache size 10 GiB 2025-12-04T15:52:58.6883063Z + echo ::endgroup:: 2025-12-04T15:52:58.6883436Z ##[endgroup] 2025-12-04T15:52:58.6932682Z ##[error]Process completed with exit code 1. 2025-12-04T15:52:58.6961979Z ##[group]Run # copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2025-12-04T15:52:58.6962305Z # copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2025-12-04T15:52:58.6962700Z docker exec -t "f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e" sh -c "cd ../pytorch && sudo cp -R test/test-reports ../workspace/test" 2025-12-04T15:52:58.6967074Z shell: /usr/bin/bash -e {0} 2025-12-04T15:52:58.6967188Z env: 2025-12-04T15:52:58.6967283Z GIT_DEFAULT_BRANCH: main 2025-12-04T15:52:58.6967422Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T15:52:58.6967601Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T15:52:58.6967771Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T15:52:58.6968164Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt 
seccomp=unconfined --network=host 2025-12-04T15:52:58.6968532Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T15:52:58.6968648Z AWS_REGION: us-east-1 2025-12-04T15:52:58.6968904Z AWS_ACCESS_KEY_ID: *** 2025-12-04T15:52:58.6969057Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T15:52:58.6971183Z AWS_SESSION_TOKEN: *** 2025-12-04T15:52:58.6971357Z CONTAINER_NAME: f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T15:52:58.6971618Z ##[endgroup] 2025-12-04T15:52:58.7671359Z ##[group]Run docker exec -t "f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e" sh -c "sudo chown -R 1001:1001 test" 2025-12-04T15:52:58.7671759Z docker exec -t "f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e" sh -c "sudo chown -R 1001:1001 test" 2025-12-04T15:52:58.7675821Z shell: /usr/bin/bash -e {0} 2025-12-04T15:52:58.7675934Z env: 2025-12-04T15:52:58.7676026Z GIT_DEFAULT_BRANCH: main 2025-12-04T15:52:58.7676163Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T15:52:58.7676340Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T15:52:58.7676507Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T15:52:58.7676895Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T15:52:58.7677280Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T15:52:58.7677395Z AWS_REGION: us-east-1 2025-12-04T15:52:58.7677551Z AWS_ACCESS_KEY_ID: *** 2025-12-04T15:52:58.7677699Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T15:52:58.7679767Z AWS_SESSION_TOKEN: *** 2025-12-04T15:52:58.7679935Z CONTAINER_NAME: f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T15:52:58.7680171Z ##[endgroup] 2025-12-04T15:52:58.8470998Z ##[group]Run cat test/**/*_toprint.log || true 2025-12-04T15:52:58.8471218Z cat test/**/*_toprint.log || true 
2025-12-04T15:52:58.8475070Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T15:52:58.8475290Z env: 2025-12-04T15:52:58.8475445Z GIT_DEFAULT_BRANCH: main 2025-12-04T15:52:58.8475667Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T15:52:58.8475875Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T15:52:58.8476069Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T15:52:58.8476513Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T15:52:58.8476944Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T15:52:58.8477078Z AWS_REGION: us-east-1 2025-12-04T15:52:58.8477239Z AWS_ACCESS_KEY_ID: *** 2025-12-04T15:52:58.8477430Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T15:52:58.8479819Z AWS_SESSION_TOKEN: *** 2025-12-04T15:52:58.8480021Z CONTAINER_NAME: f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T15:52:58.8480284Z ##[endgroup] 2025-12-04T15:52:58.8526275Z cat: 'test/**/*_toprint.log': No such file or directory 2025-12-04T15:52:58.8593132Z Prepare all required actions 2025-12-04T15:52:58.8593514Z Getting action download info 2025-12-04T15:52:59.2061679Z Download action repository 'seemethere/upload-artifact-s3@v5' (SHA:baba72d0712b404f646cebe0730933554ebce96a) 2025-12-04T15:53:00.0182649Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-12-04T15:53:00.9234258Z ##[group]Run ./.github/actions/upload-test-artifacts 2025-12-04T15:53:00.9234423Z with: 2025-12-04T15:53:00.9234522Z use-gha: true 2025-12-04T15:53:00.9234677Z file-suffix: test-default-2-6-linux.rocm.gpu.gfx942.1.b_57116213140 2025-12-04T15:53:00.9234851Z s3-bucket: gha-artifacts 2025-12-04T15:53:00.9234962Z env: 2025-12-04T15:53:00.9235055Z GIT_DEFAULT_BRANCH: main 
2025-12-04T15:53:00.9235192Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T15:53:00.9235371Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T15:53:00.9235651Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T15:53:00.9236041Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T15:53:00.9236501Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T15:53:00.9236617Z AWS_REGION: us-east-1 2025-12-04T15:53:00.9236777Z AWS_ACCESS_KEY_ID: *** 2025-12-04T15:53:00.9236927Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T15:53:00.9238986Z AWS_SESSION_TOKEN: *** 2025-12-04T15:53:00.9239157Z CONTAINER_NAME: f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T15:53:00.9239340Z ##[endgroup] 2025-12-04T15:53:00.9270196Z ##[group]Run actions/upload-artifact@v4 2025-12-04T15:53:00.9270335Z with: 2025-12-04T15:53:00.9270515Z name: test-jsons-runattempt1-test-default-2-6-linux.rocm.gpu.gfx942.1.b_57116213140.zip 2025-12-04T15:53:00.9270723Z retention-days: 14 2025-12-04T15:53:00.9270834Z if-no-files-found: warn 2025-12-04T15:53:00.9270940Z path: test/**/*.json 2025-12-04T15:53:00.9271042Z compression-level: 6 2025-12-04T15:53:00.9271141Z overwrite: false 2025-12-04T15:53:00.9271246Z include-hidden-files: false 2025-12-04T15:53:00.9271353Z env: 2025-12-04T15:53:00.9271443Z GIT_DEFAULT_BRANCH: main 2025-12-04T15:53:00.9271576Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T15:53:00.9271752Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T15:53:00.9271918Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T15:53:00.9272299Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin 
--cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T15:53:00.9272669Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T15:53:00.9272783Z AWS_REGION: us-east-1 2025-12-04T15:53:00.9272932Z AWS_ACCESS_KEY_ID: *** 2025-12-04T15:53:00.9273085Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T15:53:00.9275138Z AWS_SESSION_TOKEN: *** 2025-12-04T15:53:00.9275306Z CONTAINER_NAME: f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T15:53:00.9275487Z ##[endgroup] 2025-12-04T15:53:01.2941079Z With the provided path, there will be 6 files uploaded 2025-12-04T15:53:01.2944408Z Artifact name is valid! 2025-12-04T15:53:01.2945254Z Root directory input is valid! 2025-12-04T15:53:01.5157220Z Beginning upload of artifact content to blob storage 2025-12-04T15:53:01.8833452Z Uploaded bytes 46621 2025-12-04T15:53:01.9513157Z Finished uploading artifact content to blob storage! 2025-12-04T15:53:01.9514349Z SHA256 digest of uploaded artifact zip is 7d15d86c4abf819ca2a397e229ec804b98f313b05bb9b3e81c3321ad3895d6b0 2025-12-04T15:53:01.9514997Z Finalizing artifact upload 2025-12-04T15:53:02.1053099Z Artifact test-jsons-runattempt1-test-default-2-6-linux.rocm.gpu.gfx942.1.b_57116213140.zip.zip successfully finalized. Artifact ID 4766314764 2025-12-04T15:53:02.1054201Z Artifact test-jsons-runattempt1-test-default-2-6-linux.rocm.gpu.gfx942.1.b_57116213140.zip has been successfully uploaded! Final size is 46621 bytes. 
Artifact ID is 4766314764 2025-12-04T15:53:02.1058586Z Artifact download URL: https://github.com/pytorch/pytorch/actions/runs/19922849170/artifacts/4766314764 2025-12-04T15:53:02.1177449Z ##[group]Run actions/upload-artifact@v4 2025-12-04T15:53:02.1177614Z with: 2025-12-04T15:53:02.1177815Z name: test-reports-runattempt1-test-default-2-6-linux.rocm.gpu.gfx942.1.b_57116213140.zip 2025-12-04T15:53:02.1178035Z retention-days: 14 2025-12-04T15:53:02.1178152Z if-no-files-found: ignore 2025-12-04T15:53:02.1178279Z path: test/**/*.xml test/**/*.csv 2025-12-04T15:53:02.1178408Z compression-level: 6 2025-12-04T15:53:02.1178519Z overwrite: false 2025-12-04T15:53:02.1178630Z include-hidden-files: false 2025-12-04T15:53:02.1178745Z env: 2025-12-04T15:53:02.1178842Z GIT_DEFAULT_BRANCH: main 2025-12-04T15:53:02.1179083Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T15:53:02.1179266Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T15:53:02.1179441Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T15:53:02.1179906Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T15:53:02.1180564Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T15:53:02.1180684Z AWS_REGION: us-east-1 2025-12-04T15:53:02.1180867Z AWS_ACCESS_KEY_ID: *** 2025-12-04T15:53:02.1181026Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T15:53:02.1183164Z AWS_SESSION_TOKEN: *** 2025-12-04T15:53:02.1183344Z CONTAINER_NAME: f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T15:53:02.1183536Z ##[endgroup] 2025-12-04T15:53:02.4943493Z With the provided path, there will be 310 files uploaded 2025-12-04T15:53:02.4945282Z Artifact name is valid! 2025-12-04T15:53:02.4945926Z Root directory input is valid! 
2025-12-04T15:53:02.7175206Z Beginning upload of artifact content to blob storage 2025-12-04T15:53:03.7410549Z Uploaded bytes 1915361 2025-12-04T15:53:03.8083269Z Finished uploading artifact content to blob storage! 2025-12-04T15:53:03.8084712Z SHA256 digest of uploaded artifact zip is acd9b47dccd7676094fffd45d9094d143c293a21c0e98d8e6a7ffcfdd4a1e519 2025-12-04T15:53:03.8085445Z Finalizing artifact upload 2025-12-04T15:53:03.9731434Z Artifact test-reports-runattempt1-test-default-2-6-linux.rocm.gpu.gfx942.1.b_57116213140.zip.zip successfully finalized. Artifact ID 4766315184 2025-12-04T15:53:03.9732862Z Artifact test-reports-runattempt1-test-default-2-6-linux.rocm.gpu.gfx942.1.b_57116213140.zip has been successfully uploaded! Final size is 1915361 bytes. Artifact ID is 4766315184 2025-12-04T15:53:03.9736543Z Artifact download URL: https://github.com/pytorch/pytorch/actions/runs/19922849170/artifacts/4766315184 2025-12-04T15:53:03.9867353Z ##[group]Run actions/upload-artifact@v4 2025-12-04T15:53:03.9867574Z with: 2025-12-04T15:53:03.9867862Z name: logs-runattempt1-test-default-2-6-linux.rocm.gpu.gfx942.1.b_57116213140.zip 2025-12-04T15:53:03.9868173Z retention-days: 14 2025-12-04T15:53:03.9868334Z if-no-files-found: ignore 2025-12-04T15:53:03.9868521Z path: usage_log.txt test/**/*.log 2025-12-04T15:53:03.9868711Z compression-level: 6 2025-12-04T15:53:03.9868871Z overwrite: false 2025-12-04T15:53:03.9869029Z include-hidden-files: false 2025-12-04T15:53:03.9869200Z env: 2025-12-04T15:53:03.9869339Z GIT_DEFAULT_BRANCH: main 2025-12-04T15:53:03.9869551Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T15:53:03.9869816Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T15:53:03.9870069Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T15:53:03.9870869Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin 
--cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T15:53:03.9871438Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T15:53:03.9871615Z AWS_REGION: us-east-1 2025-12-04T15:53:03.9871867Z AWS_ACCESS_KEY_ID: *** 2025-12-04T15:53:03.9872103Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T15:53:03.9875206Z AWS_SESSION_TOKEN: *** 2025-12-04T15:53:03.9875476Z CONTAINER_NAME: f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T15:53:03.9875762Z ##[endgroup] 2025-12-04T15:53:04.3940565Z Multiple search paths detected. Calculating the least common ancestor of all paths 2025-12-04T15:53:04.3941624Z The least common ancestor is /home/runner/_work/pytorch/pytorch. This will be the root directory of the artifact 2025-12-04T15:53:04.3942039Z With the provided path, there will be 94 files uploaded 2025-12-04T15:53:04.3944566Z Artifact name is valid! 2025-12-04T15:53:04.3945194Z Root directory input is valid! 2025-12-04T15:53:04.6191500Z Beginning upload of artifact content to blob storage 2025-12-04T15:53:05.5166547Z Uploaded bytes 1690230 2025-12-04T15:53:05.5863975Z Finished uploading artifact content to blob storage! 2025-12-04T15:53:05.5865105Z SHA256 digest of uploaded artifact zip is 7e97238c97205dc76a260538d79e767c9b2b0cf7408c90cf2a2ee28ea1b7a12c 2025-12-04T15:53:05.5866312Z Finalizing artifact upload 2025-12-04T15:53:05.8025538Z Artifact logs-runattempt1-test-default-2-6-linux.rocm.gpu.gfx942.1.b_57116213140.zip.zip successfully finalized. Artifact ID 4766315596 2025-12-04T15:53:05.8026477Z Artifact logs-runattempt1-test-default-2-6-linux.rocm.gpu.gfx942.1.b_57116213140.zip has been successfully uploaded! Final size is 1690230 bytes. 
Artifact ID is 4766315596 2025-12-04T15:53:05.8031788Z Artifact download URL: https://github.com/pytorch/pytorch/actions/runs/19922849170/artifacts/4766315596 2025-12-04T15:53:05.8167318Z ##[group]Run # shellcheck disable=SC2156 2025-12-04T15:53:05.8167534Z # shellcheck disable=SC2156 2025-12-04T15:53:05.8167772Z find . -iname "core.[1-9]*" -exec docker exec "${CONTAINER_NAME}" sh -c "gdb python {} -ex 'bt' -ex 'q'" \; 2025-12-04T15:53:05.8172383Z shell: /usr/bin/bash -e {0} 2025-12-04T15:53:05.8172522Z env: 2025-12-04T15:53:05.8172628Z GIT_DEFAULT_BRANCH: main 2025-12-04T15:53:05.8172785Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T15:53:05.8172985Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T15:53:05.8173165Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T15:53:05.8173563Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T15:53:05.8173957Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T15:53:05.8174108Z AWS_REGION: us-east-1 2025-12-04T15:53:05.8174304Z AWS_ACCESS_KEY_ID: *** 2025-12-04T15:53:05.8174475Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T15:53:05.8176599Z AWS_SESSION_TOKEN: *** 2025-12-04T15:53:05.8176790Z CONTAINER_NAME: f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T15:53:05.8176985Z ##[endgroup] 2025-12-04T15:53:05.9512414Z ##[group]Run actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02 2025-12-04T15:53:05.9512642Z with: 2025-12-04T15:53:05.9512797Z name: coredumps-default-2-6-linux.rocm.gpu.gfx942.1.b 2025-12-04T15:53:05.9512975Z retention-days: 14 2025-12-04T15:53:05.9513106Z if-no-files-found: ignore 2025-12-04T15:53:05.9513238Z path: ./**/core.[1-9]* 2025-12-04T15:53:05.9513373Z compression-level: 6 2025-12-04T15:53:05.9513493Z overwrite: false 
2025-12-04T15:53:05.9513618Z include-hidden-files: false 2025-12-04T15:53:05.9513750Z env: 2025-12-04T15:53:05.9513858Z GIT_DEFAULT_BRANCH: main 2025-12-04T15:53:05.9514024Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T15:53:05.9514238Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T15:53:05.9514433Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T15:53:05.9514894Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD136 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T15:53:05.9515324Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T15:53:05.9515466Z AWS_REGION: us-east-1 2025-12-04T15:53:05.9515665Z AWS_ACCESS_KEY_ID: *** 2025-12-04T15:53:05.9515834Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T15:53:05.9517901Z AWS_SESSION_TOKEN: *** 2025-12-04T15:53:05.9518083Z CONTAINER_NAME: f00254eecce07fc9cb80a4e4078c448fa3665224b6ea05a49c63ff315941870e 2025-12-04T15:53:05.9518272Z ##[endgroup] 2025-12-04T15:53:09.4139601Z No files were found with the provided path: ./**/core.[1-9]*. No artifacts will be uploaded. 2025-12-04T15:53:09.4323758Z Post job cleanup. 2025-12-04T15:53:09.4336790Z Post job cleanup. 2025-12-04T15:53:09.4536430Z Logging out of registry 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T15:53:09.4709656Z Post job cleanup. 2025-12-04T15:53:09.5318699Z Post job cleanup. 2025-12-04T15:53:09.5338544Z Post job cleanup. 
2025-12-04T15:53:09.5806034Z [command]/usr/bin/git version 2025-12-04T15:53:09.5830919Z git version 2.52.0 2025-12-04T15:53:09.5851476Z Copying '/home/runner/.gitconfig' to '/home/runner/_work/_temp/85a35ba5-c1ed-42fb-af33-83ad46f2888e/.gitconfig' 2025-12-04T15:53:09.5857779Z Temporarily overriding HOME='/home/runner/_work/_temp/85a35ba5-c1ed-42fb-af33-83ad46f2888e' before making global git config changes 2025-12-04T15:53:09.5858348Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T15:53:09.5860552Z [command]/usr/bin/git config --global --add safe.directory /home/runner/_work/pytorch/pytorch 2025-12-04T15:53:09.5891447Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T15:53:09.5918703Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T15:53:09.6151755Z Entering 'android/libs/fbjni' 2025-12-04T15:53:09.6189530Z Entering 'third_party/FP16' 2025-12-04T15:53:09.6224242Z Entering 'third_party/FXdiv' 2025-12-04T15:53:09.6249876Z Entering 'third_party/NNPACK' 2025-12-04T15:53:09.6275056Z Entering 'third_party/NVTX' 2025-12-04T15:53:09.6306119Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T15:53:09.6329886Z Entering 'third_party/XNNPACK' 2025-12-04T15:53:09.6357582Z Entering 'third_party/aiter' 2025-12-04T15:53:09.6391499Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T15:53:09.6431056Z Entering 'third_party/benchmark' 2025-12-04T15:53:09.6459469Z Entering 'third_party/composable_kernel' 2025-12-04T15:53:09.6485332Z Entering 'third_party/cpp-httplib' 2025-12-04T15:53:09.6521621Z Entering 'third_party/cpuinfo' 2025-12-04T15:53:09.6552945Z Entering 'third_party/cudnn_frontend' 2025-12-04T15:53:09.6579641Z Entering 'third_party/cutlass' 2025-12-04T15:53:09.6607134Z Entering 'third_party/fbgemm' 2025-12-04T15:53:09.6633863Z 
Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T15:53:09.6661583Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T15:53:09.6686425Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T15:53:09.6719370Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T15:53:09.6751264Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T15:53:09.6781630Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T15:53:09.6803756Z Entering 'third_party/fbgemm/external/json' 2025-12-04T15:53:09.6827246Z Entering 'third_party/flash-attention' 2025-12-04T15:53:09.6853530Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T15:53:09.6880482Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T15:53:09.6912986Z Entering 'third_party/flatbuffers' 2025-12-04T15:53:09.6942583Z Entering 'third_party/fmt' 2025-12-04T15:53:09.6968788Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T15:53:09.6993611Z Entering 'third_party/gloo' 2025-12-04T15:53:09.7017502Z Entering 'third_party/googletest' 2025-12-04T15:53:09.7041135Z Entering 'third_party/ideep' 2025-12-04T15:53:09.7064436Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T15:53:09.7107731Z Entering 'third_party/ittapi' 2025-12-04T15:53:09.7134323Z Entering 'third_party/kineto' 2025-12-04T15:53:09.7157135Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T15:53:09.7183094Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T15:53:09.7205902Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T15:53:09.7228969Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T15:53:09.7249149Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T15:53:09.7270150Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T15:53:09.7294484Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T15:53:09.7315051Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T15:53:09.7337413Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T15:53:09.7362573Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T15:53:09.7385658Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T15:53:09.7408342Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:09.7431351Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:09.7456776Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T15:53:09.7487345Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T15:53:09.7512891Z Entering 'third_party/kleidiai' 2025-12-04T15:53:09.7543098Z Entering 'third_party/mimalloc' 2025-12-04T15:53:09.7570578Z Entering 'third_party/nlohmann' 2025-12-04T15:53:09.7595850Z Entering 'third_party/onnx' 2025-12-04T15:53:09.7625342Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T15:53:09.7659951Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T15:53:09.7688213Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T15:53:09.7708775Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T15:53:09.7730458Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T15:53:09.7752080Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T15:53:09.7774665Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T15:53:09.7795364Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T15:53:09.7816938Z Entering 
'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T15:53:09.7839244Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:09.7861380Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:09.7889414Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T15:53:09.7918121Z Entering 'third_party/pocketfft' 2025-12-04T15:53:09.7941723Z Entering 'third_party/protobuf' 2025-12-04T15:53:09.7973239Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T15:53:09.8010996Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T15:53:09.8045826Z Entering 'third_party/psimd' 2025-12-04T15:53:09.8079726Z Entering 'third_party/pthreadpool' 2025-12-04T15:53:09.8107743Z Entering 'third_party/pybind11' 2025-12-04T15:53:09.8140334Z Entering 'third_party/python-peachpy' 2025-12-04T15:53:09.8169967Z Entering 'third_party/sleef' 2025-12-04T15:53:09.8202695Z Entering 'third_party/tensorpipe' 2025-12-04T15:53:09.8227925Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T15:53:09.8260292Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T15:53:09.8283757Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T15:53:09.8305731Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T15:53:09.8340770Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T15:53:09.8383699Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T15:53:09.8399023Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8409266Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-12-04T15:53:09.8430411Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 
'http.https://github.com/.extraheader' || :" 2025-12-04T15:53:09.8589082Z Entering 'android/libs/fbjni' 2025-12-04T15:53:09.8603394Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8622107Z Entering 'third_party/FP16' 2025-12-04T15:53:09.8636315Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8652047Z Entering 'third_party/FXdiv' 2025-12-04T15:53:09.8664938Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8684415Z Entering 'third_party/NNPACK' 2025-12-04T15:53:09.8697259Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8719527Z Entering 'third_party/NVTX' 2025-12-04T15:53:09.8733046Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8752553Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T15:53:09.8766472Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8782208Z Entering 'third_party/XNNPACK' 2025-12-04T15:53:09.8795570Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8818688Z Entering 'third_party/aiter' 2025-12-04T15:53:09.8831619Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8848744Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T15:53:09.8862090Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8884340Z Entering 'third_party/benchmark' 2025-12-04T15:53:09.8897545Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8920844Z Entering 'third_party/composable_kernel' 2025-12-04T15:53:09.8935450Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8954794Z Entering 'third_party/cpp-httplib' 2025-12-04T15:53:09.8971904Z http.https://github.com/.extraheader 2025-12-04T15:53:09.8991746Z Entering 'third_party/cpuinfo' 2025-12-04T15:53:09.9006012Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9022309Z Entering 'third_party/cudnn_frontend' 2025-12-04T15:53:09.9035084Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9051696Z Entering 'third_party/cutlass' 2025-12-04T15:53:09.9064512Z http.https://github.com/.extraheader 
2025-12-04T15:53:09.9092278Z Entering 'third_party/fbgemm' 2025-12-04T15:53:09.9105569Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9133278Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T15:53:09.9146928Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9164070Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T15:53:09.9179923Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9202731Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T15:53:09.9217742Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9232964Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T15:53:09.9245268Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9266731Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T15:53:09.9282136Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9310034Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T15:53:09.9312245Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9347661Z Entering 'third_party/fbgemm/external/json' 2025-12-04T15:53:09.9362658Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9392833Z Entering 'third_party/flash-attention' 2025-12-04T15:53:09.9414014Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9433238Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T15:53:09.9451230Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9476659Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T15:53:09.9492366Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9517849Z Entering 'third_party/flatbuffers' 2025-12-04T15:53:09.9531837Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9549415Z Entering 'third_party/fmt' 2025-12-04T15:53:09.9566490Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9594774Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T15:53:09.9610285Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9627749Z Entering 
'third_party/gloo' 2025-12-04T15:53:09.9641811Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9657833Z Entering 'third_party/googletest' 2025-12-04T15:53:09.9669851Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9692191Z Entering 'third_party/ideep' 2025-12-04T15:53:09.9704158Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9721863Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T15:53:09.9741046Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9775446Z Entering 'third_party/ittapi' 2025-12-04T15:53:09.9788453Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9806622Z Entering 'third_party/kineto' 2025-12-04T15:53:09.9820534Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9839663Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T15:53:09.9853338Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9881539Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T15:53:09.9895083Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9916626Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T15:53:09.9943888Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9962935Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T15:53:09.9976024Z http.https://github.com/.extraheader 2025-12-04T15:53:09.9993195Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T15:53:10.0014584Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0036553Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T15:53:10.0049567Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0078090Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T15:53:10.0099391Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0121460Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T15:53:10.0143171Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0164960Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T15:53:10.0179260Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0198486Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T15:53:10.0222879Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0240321Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T15:53:10.0260448Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0280215Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:10.0293328Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0312795Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:10.0327622Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0349961Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T15:53:10.0362136Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0379757Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T15:53:10.0396016Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0414573Z Entering 'third_party/kleidiai' 2025-12-04T15:53:10.0427472Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0452148Z Entering 'third_party/mimalloc' 2025-12-04T15:53:10.0465942Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0488996Z Entering 'third_party/nlohmann' 2025-12-04T15:53:10.0506550Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0530455Z Entering 'third_party/onnx' 2025-12-04T15:53:10.0546739Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0576511Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T15:53:10.0590042Z 
http.https://github.com/.extraheader 2025-12-04T15:53:10.0611465Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T15:53:10.0624521Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0642555Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T15:53:10.0667921Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0685612Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T15:53:10.0701870Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0724639Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T15:53:10.0744696Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0767745Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T15:53:10.0781062Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0802505Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T15:53:10.0814892Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0833977Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T15:53:10.0855645Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0878070Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T15:53:10.0892774Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0910480Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:10.0928976Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0951274Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:10.0965733Z http.https://github.com/.extraheader 2025-12-04T15:53:10.0989817Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T15:53:10.1002417Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1028909Z Entering 'third_party/pocketfft' 2025-12-04T15:53:10.1042177Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1059630Z Entering 
'third_party/protobuf' 2025-12-04T15:53:10.1073262Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1090752Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T15:53:10.1103878Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1126109Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T15:53:10.1139979Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1169461Z Entering 'third_party/psimd' 2025-12-04T15:53:10.1182473Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1199372Z Entering 'third_party/pthreadpool' 2025-12-04T15:53:10.1213182Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1229375Z Entering 'third_party/pybind11' 2025-12-04T15:53:10.1242196Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1262036Z Entering 'third_party/python-peachpy' 2025-12-04T15:53:10.1275755Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1300257Z Entering 'third_party/sleef' 2025-12-04T15:53:10.1313912Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1332475Z Entering 'third_party/tensorpipe' 2025-12-04T15:53:10.1345407Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1361745Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T15:53:10.1381174Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1399198Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T15:53:10.1416675Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1435351Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T15:53:10.1455336Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1479846Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T15:53:10.1498490Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1515012Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T15:53:10.1535923Z http.https://github.com/.extraheader 2025-12-04T15:53:10.1575997Z [command]/usr/bin/git config --local --name-only 
--get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.1599620Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T15:53:10.1797797Z Entering 'android/libs/fbjni' 2025-12-04T15:53:10.1812334Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T15:53:10.1823205Z Entering 'third_party/FP16' 2025-12-04T15:53:10.1835402Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T15:53:10.1847300Z Entering 'third_party/FXdiv' 2025-12-04T15:53:10.1863655Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T15:53:10.1873717Z Entering 'third_party/NNPACK' 2025-12-04T15:53:10.1889789Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T15:53:10.1899712Z Entering 'third_party/NVTX' 2025-12-04T15:53:10.1910881Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T15:53:10.1922019Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T15:53:10.1933778Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T15:53:10.1941423Z Entering 'third_party/XNNPACK' 2025-12-04T15:53:10.1951525Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T15:53:10.1968929Z Entering 'third_party/aiter' 2025-12-04T15:53:10.1980009Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T15:53:10.1990070Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T15:53:10.2001448Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T15:53:10.2023333Z Entering 
'third_party/benchmark' 2025-12-04T15:53:10.2037624Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T15:53:10.2049018Z Entering 'third_party/composable_kernel' 2025-12-04T15:53:10.2060286Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T15:53:10.2082684Z Entering 'third_party/cpp-httplib' 2025-12-04T15:53:10.2097684Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T15:53:10.2107580Z Entering 'third_party/cpuinfo' 2025-12-04T15:53:10.2118887Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T15:53:10.2129802Z Entering 'third_party/cudnn_frontend' 2025-12-04T15:53:10.2140249Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T15:53:10.2153764Z Entering 'third_party/cutlass' 2025-12-04T15:53:10.2164658Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T15:53:10.2177774Z Entering 'third_party/fbgemm' 2025-12-04T15:53:10.2196386Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T15:53:10.2207680Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T15:53:10.2228597Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T15:53:10.2238713Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T15:53:10.2252153Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T15:53:10.2273868Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T15:53:10.2285087Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config 
remote.origin.url 2025-12-04T15:53:10.2294457Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T15:53:10.2306492Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T15:53:10.2321308Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T15:53:10.2336092Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T15:53:10.2345608Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T15:53:10.2357258Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T15:53:10.2367889Z Entering 'third_party/fbgemm/external/json' 2025-12-04T15:53:10.2385985Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T15:53:10.2398529Z Entering 'third_party/flash-attention' 2025-12-04T15:53:10.2411207Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T15:53:10.2420825Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T15:53:10.2432348Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T15:53:10.2448185Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T15:53:10.2459262Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T15:53:10.2480386Z Entering 'third_party/flatbuffers' 2025-12-04T15:53:10.2497329Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T15:53:10.2509014Z Entering 'third_party/fmt' 2025-12-04T15:53:10.2521546Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config 
remote.origin.url 2025-12-04T15:53:10.2537372Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T15:53:10.2554195Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T15:53:10.2564771Z Entering 'third_party/gloo' 2025-12-04T15:53:10.2579929Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T15:53:10.2589799Z Entering 'third_party/googletest' 2025-12-04T15:53:10.2601244Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T15:53:10.2617152Z Entering 'third_party/ideep' 2025-12-04T15:53:10.2629274Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T15:53:10.2643394Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T15:53:10.2660353Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T15:53:10.2676423Z Entering 'third_party/ittapi' 2025-12-04T15:53:10.2689256Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T15:53:10.2698607Z Entering 'third_party/kineto' 2025-12-04T15:53:10.2709193Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T15:53:10.2717658Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T15:53:10.2732221Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T15:53:10.2744673Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T15:53:10.2757267Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T15:53:10.2770195Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T15:53:10.2780964Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T15:53:10.2793693Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T15:53:10.2811426Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T15:53:10.2821596Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T15:53:10.2834273Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T15:53:10.2845564Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T15:53:10.2856129Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T15:53:10.2871536Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T15:53:10.2886597Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T15:53:10.2897596Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T15:53:10.2910157Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T15:53:10.2924678Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T15:53:10.2936017Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T15:53:10.2945779Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T15:53:10.2961050Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T15:53:10.2970311Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T15:53:10.2983337Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T15:53:10.2997131Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:10.3012217Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T15:53:10.3023008Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:10.3037724Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T15:53:10.3050802Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T15:53:10.3063225Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T15:53:10.3077554Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T15:53:10.3090379Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 
2025-12-04T15:53:10.3104576Z Entering 'third_party/kleidiai' 2025-12-04T15:53:10.3119621Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T15:53:10.3130839Z Entering 'third_party/mimalloc' 2025-12-04T15:53:10.3146730Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T15:53:10.3157544Z Entering 'third_party/nlohmann' 2025-12-04T15:53:10.3168141Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T15:53:10.3182600Z Entering 'third_party/onnx' 2025-12-04T15:53:10.3194628Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T15:53:10.3213169Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T15:53:10.3225393Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T15:53:10.3238010Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T15:53:10.3248575Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T15:53:10.3261452Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T15:53:10.3278145Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T15:53:10.3289340Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T15:53:10.3305777Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T15:53:10.3315199Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T15:53:10.3331690Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T15:53:10.3340862Z Entering 
'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T15:53:10.3352138Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T15:53:10.3363949Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T15:53:10.3378245Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T15:53:10.3386589Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T15:53:10.3401128Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T15:53:10.3410079Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T15:53:10.3426940Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T15:53:10.3437847Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:10.3451199Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T15:53:10.3462333Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:10.3474181Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T15:53:10.3489493Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T15:53:10.3502871Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T15:53:10.3522729Z Entering 'third_party/pocketfft' 2025-12-04T15:53:10.3535025Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T15:53:10.3544682Z Entering 'third_party/protobuf' 2025-12-04T15:53:10.3558626Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T15:53:10.3571255Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T15:53:10.3582740Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T15:53:10.3592008Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T15:53:10.3605161Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T15:53:10.3618001Z Entering 'third_party/psimd' 2025-12-04T15:53:10.3631354Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T15:53:10.3647256Z Entering 'third_party/pthreadpool' 2025-12-04T15:53:10.3660418Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T15:53:10.3672011Z Entering 'third_party/pybind11' 2025-12-04T15:53:10.3683296Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T15:53:10.3693891Z Entering 'third_party/python-peachpy' 2025-12-04T15:53:10.3703451Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T15:53:10.3712100Z Entering 'third_party/sleef' 2025-12-04T15:53:10.3736082Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T15:53:10.3748530Z Entering 'third_party/tensorpipe' 2025-12-04T15:53:10.3763985Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T15:53:10.3781518Z Entering 
'third_party/tensorpipe/third_party/googletest' 2025-12-04T15:53:10.3794438Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T15:53:10.3807444Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T15:53:10.3823101Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T15:53:10.3833424Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T15:53:10.3846794Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T15:53:10.3857354Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T15:53:10.3869103Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T15:53:10.3879229Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T15:53:10.3889413Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T15:53:10.3931887Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.3955479Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.3976122Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.3993738Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4013128Z 
[command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4031920Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4051735Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4069613Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4089452Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4109764Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4128682Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4145829Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4162141Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4179336Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4197310Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4214141Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4234601Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4251596Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4268388Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4290769Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4312832Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4335449Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4352428Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4374699Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 
2025-12-04T15:53:10.4392060Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4409069Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4425395Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4446534Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4464817Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4485715Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4503597Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4521974Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4542097Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4558436Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4575368Z [command]/usr/bin/git config 
--file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4592099Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4609500Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4628718Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4645398Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4669489Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4688313Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4708943Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4724531Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4742061Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4758915Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4779948Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4800328Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4817942Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4835820Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4852821Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4870573Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4887393Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4904177Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4925146Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4943104Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4960810Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4979122Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.4996964Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5015116Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5036651Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 
2025-12-04T15:53:10.5054075Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5076829Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5098838Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5116744Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5134242Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5154970Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5173927Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5196864Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5216382Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 
2025-12-04T15:53:10.5233794Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5251511Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5276485Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5296746Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5314593Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5333385Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5351641Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5373359Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5391180Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5408789Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp 
^includeIf\.gitdir: 2025-12-04T15:53:10.5430438Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5448010Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:10.5561124Z Post job cleanup. 2025-12-04T15:53:10.6013481Z [command]/usr/bin/git version 2025-12-04T15:53:10.6034626Z git version 2.52.0 2025-12-04T15:53:10.6056281Z Copying '/home/runner/.gitconfig' to '/home/runner/_work/_temp/c60032c7-77c5-4ce1-a8a7-474008c04fe1/.gitconfig' 2025-12-04T15:53:10.6056659Z Temporarily overriding HOME='/home/runner/_work/_temp/c60032c7-77c5-4ce1-a8a7-474008c04fe1' before making global git config changes 2025-12-04T15:53:10.6056988Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T15:53:10.6057780Z [command]/usr/bin/git config --global --add safe.directory /home/runner/_work/pytorch/pytorch 2025-12-04T15:53:10.6085965Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T15:53:10.6106674Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T15:53:10.6329139Z Entering 'android/libs/fbjni' 2025-12-04T15:53:10.6359074Z Entering 'third_party/FP16' 2025-12-04T15:53:10.6388677Z Entering 'third_party/FXdiv' 2025-12-04T15:53:10.6413697Z Entering 'third_party/NNPACK' 2025-12-04T15:53:10.6444256Z Entering 'third_party/NVTX' 2025-12-04T15:53:10.6473076Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T15:53:10.6499997Z Entering 'third_party/XNNPACK' 2025-12-04T15:53:10.6538829Z Entering 'third_party/aiter' 2025-12-04T15:53:10.6566805Z 
Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T15:53:10.6602343Z Entering 'third_party/benchmark' 2025-12-04T15:53:10.6627300Z Entering 'third_party/composable_kernel' 2025-12-04T15:53:10.6658692Z Entering 'third_party/cpp-httplib' 2025-12-04T15:53:10.6684864Z Entering 'third_party/cpuinfo' 2025-12-04T15:53:10.6709183Z Entering 'third_party/cudnn_frontend' 2025-12-04T15:53:10.6730257Z Entering 'third_party/cutlass' 2025-12-04T15:53:10.6754953Z Entering 'third_party/fbgemm' 2025-12-04T15:53:10.6779109Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T15:53:10.6801571Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T15:53:10.6832028Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T15:53:10.6859367Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T15:53:10.6898971Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T15:53:10.6924736Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T15:53:10.6949510Z Entering 'third_party/fbgemm/external/json' 2025-12-04T15:53:10.6975559Z Entering 'third_party/flash-attention' 2025-12-04T15:53:10.6999469Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T15:53:10.7024356Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T15:53:10.7052098Z Entering 'third_party/flatbuffers' 2025-12-04T15:53:10.7076019Z Entering 'third_party/fmt' 2025-12-04T15:53:10.7098539Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T15:53:10.7121197Z Entering 'third_party/gloo' 2025-12-04T15:53:10.7145641Z Entering 'third_party/googletest' 2025-12-04T15:53:10.7167909Z Entering 'third_party/ideep' 2025-12-04T15:53:10.7189917Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T15:53:10.7217740Z Entering 'third_party/ittapi' 2025-12-04T15:53:10.7239155Z Entering 'third_party/kineto' 2025-12-04T15:53:10.7261643Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T15:53:10.7288395Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T15:53:10.7314123Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T15:53:10.7339526Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T15:53:10.7362468Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T15:53:10.7384185Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T15:53:10.7420019Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T15:53:10.7452524Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T15:53:10.7481306Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T15:53:10.7515936Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T15:53:10.7546631Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T15:53:10.7572010Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:10.7608633Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:10.7645303Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T15:53:10.7673623Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T15:53:10.7699783Z Entering 'third_party/kleidiai' 2025-12-04T15:53:10.7726189Z Entering 'third_party/mimalloc' 2025-12-04T15:53:10.7753653Z Entering 'third_party/nlohmann' 2025-12-04T15:53:10.7785316Z Entering 'third_party/onnx' 2025-12-04T15:53:10.7818349Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T15:53:10.7849705Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T15:53:10.7873667Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T15:53:10.7896554Z 
Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T15:53:10.7927326Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T15:53:10.7952369Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T15:53:10.7976480Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T15:53:10.8004451Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T15:53:10.8030792Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T15:53:10.8052826Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:10.8080822Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:10.8106897Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T15:53:10.8143119Z Entering 'third_party/pocketfft' 2025-12-04T15:53:10.8166866Z Entering 'third_party/protobuf' 2025-12-04T15:53:10.8191275Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T15:53:10.8224397Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T15:53:10.8250477Z Entering 'third_party/psimd' 2025-12-04T15:53:10.8273127Z Entering 'third_party/pthreadpool' 2025-12-04T15:53:10.8295062Z Entering 'third_party/pybind11' 2025-12-04T15:53:10.8315635Z Entering 'third_party/python-peachpy' 2025-12-04T15:53:10.8338417Z Entering 'third_party/sleef' 2025-12-04T15:53:10.8364026Z Entering 'third_party/tensorpipe' 2025-12-04T15:53:10.8386401Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T15:53:10.8414445Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T15:53:10.8444345Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T15:53:10.8469851Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T15:53:10.8492813Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T15:53:10.8552031Z 
[command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T15:53:10.8570496Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T15:53:10.8736177Z Entering 'android/libs/fbjni' 2025-12-04T15:53:10.8765723Z Entering 'third_party/FP16' 2025-12-04T15:53:10.8798785Z Entering 'third_party/FXdiv' 2025-12-04T15:53:10.8828182Z Entering 'third_party/NNPACK' 2025-12-04T15:53:10.8848381Z Entering 'third_party/NVTX' 2025-12-04T15:53:10.8871153Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T15:53:10.8893231Z Entering 'third_party/XNNPACK' 2025-12-04T15:53:10.8925885Z Entering 'third_party/aiter' 2025-12-04T15:53:10.8953056Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T15:53:10.9004858Z Entering 'third_party/benchmark' 2025-12-04T15:53:10.9032092Z Entering 'third_party/composable_kernel' 2025-12-04T15:53:10.9068813Z Entering 'third_party/cpp-httplib' 2025-12-04T15:53:10.9101785Z Entering 'third_party/cpuinfo' 2025-12-04T15:53:10.9126260Z Entering 'third_party/cudnn_frontend' 2025-12-04T15:53:10.9146747Z Entering 'third_party/cutlass' 2025-12-04T15:53:10.9183010Z Entering 'third_party/fbgemm' 2025-12-04T15:53:10.9215948Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T15:53:10.9243353Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T15:53:10.9278595Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T15:53:10.9304738Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T15:53:10.9328514Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T15:53:10.9355209Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T15:53:10.9377414Z Entering 'third_party/fbgemm/external/json' 2025-12-04T15:53:10.9407459Z Entering 'third_party/flash-attention' 
2025-12-04T15:53:10.9433395Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T15:53:10.9468804Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T15:53:10.9503568Z Entering 'third_party/flatbuffers' 2025-12-04T15:53:10.9530852Z Entering 'third_party/fmt' 2025-12-04T15:53:10.9560364Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T15:53:10.9588892Z Entering 'third_party/gloo' 2025-12-04T15:53:10.9611694Z Entering 'third_party/googletest' 2025-12-04T15:53:10.9635565Z Entering 'third_party/ideep' 2025-12-04T15:53:10.9657499Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T15:53:10.9690476Z Entering 'third_party/ittapi' 2025-12-04T15:53:10.9716552Z Entering 'third_party/kineto' 2025-12-04T15:53:10.9748103Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T15:53:10.9778113Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T15:53:10.9802427Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T15:53:10.9824365Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T15:53:10.9847918Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T15:53:10.9872957Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T15:53:10.9906549Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T15:53:10.9933950Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T15:53:10.9955031Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T15:53:10.9978076Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T15:53:11.0004074Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T15:53:11.0031923Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:11.0055818Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:11.0081514Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T15:53:11.0103675Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T15:53:11.0129992Z Entering 'third_party/kleidiai' 2025-12-04T15:53:11.0152842Z Entering 'third_party/mimalloc' 2025-12-04T15:53:11.0177685Z Entering 'third_party/nlohmann' 2025-12-04T15:53:11.0199735Z Entering 'third_party/onnx' 2025-12-04T15:53:11.0229503Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T15:53:11.0267322Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T15:53:11.0295161Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T15:53:11.0329218Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T15:53:11.0353476Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T15:53:11.0377803Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T15:53:11.0400148Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T15:53:11.0428017Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T15:53:11.0450615Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T15:53:11.0475418Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:11.0497598Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:11.0523499Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T15:53:11.0558807Z Entering 'third_party/pocketfft' 2025-12-04T15:53:11.0583807Z Entering 'third_party/protobuf' 2025-12-04T15:53:11.0607717Z Entering 'third_party/protobuf/third_party/benchmark' 
2025-12-04T15:53:11.0640590Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T15:53:11.0667098Z Entering 'third_party/psimd' 2025-12-04T15:53:11.0690293Z Entering 'third_party/pthreadpool' 2025-12-04T15:53:11.0712960Z Entering 'third_party/pybind11' 2025-12-04T15:53:11.0735950Z Entering 'third_party/python-peachpy' 2025-12-04T15:53:11.0757479Z Entering 'third_party/sleef' 2025-12-04T15:53:11.0781146Z Entering 'third_party/tensorpipe' 2025-12-04T15:53:11.0806312Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T15:53:11.0834095Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T15:53:11.0857517Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T15:53:11.0878431Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T15:53:11.0898715Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T15:53:11.0947655Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.0966959Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T15:53:11.1129884Z Entering 'android/libs/fbjni' 2025-12-04T15:53:11.1144604Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T15:53:11.1157666Z Entering 'third_party/FP16' 2025-12-04T15:53:11.1170444Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T15:53:11.1189677Z Entering 'third_party/FXdiv' 2025-12-04T15:53:11.1206482Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T15:53:11.1215157Z Entering 'third_party/NNPACK' 2025-12-04T15:53:11.1228665Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T15:53:11.1241606Z Entering 'third_party/NVTX' 2025-12-04T15:53:11.1255196Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T15:53:11.1271204Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T15:53:11.1284805Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T15:53:11.1298824Z Entering 'third_party/XNNPACK' 2025-12-04T15:53:11.1316353Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T15:53:11.1332194Z Entering 'third_party/aiter' 2025-12-04T15:53:11.1348142Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T15:53:11.1363543Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T15:53:11.1373061Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T15:53:11.1387190Z Entering 'third_party/benchmark' 2025-12-04T15:53:11.1399593Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T15:53:11.1413922Z Entering 'third_party/composable_kernel' 2025-12-04T15:53:11.1425200Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T15:53:11.1438132Z Entering 'third_party/cpp-httplib' 2025-12-04T15:53:11.1450304Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T15:53:11.1459671Z Entering 'third_party/cpuinfo' 2025-12-04T15:53:11.1476352Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T15:53:11.1489664Z Entering 'third_party/cudnn_frontend' 2025-12-04T15:53:11.1505618Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T15:53:11.1520683Z Entering 'third_party/cutlass' 2025-12-04T15:53:11.1538060Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T15:53:11.1551819Z Entering 'third_party/fbgemm' 2025-12-04T15:53:11.1564302Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T15:53:11.1578856Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T15:53:11.1595132Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T15:53:11.1606812Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T15:53:11.1618565Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T15:53:11.1630667Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T15:53:11.1643811Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T15:53:11.1658287Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T15:53:11.1670525Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T15:53:11.1687366Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T15:53:11.1700609Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T15:53:11.1712201Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T15:53:11.1725803Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T15:53:11.1736640Z Entering 'third_party/fbgemm/external/json' 2025-12-04T15:53:11.1748075Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T15:53:11.1760178Z Entering 'third_party/flash-attention' 2025-12-04T15:53:11.1771996Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T15:53:11.1782296Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T15:53:11.1798879Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T15:53:11.1811799Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T15:53:11.1822863Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T15:53:11.1844242Z Entering 'third_party/flatbuffers' 2025-12-04T15:53:11.1856918Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T15:53:11.1867187Z Entering 'third_party/fmt' 2025-12-04T15:53:11.1877565Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T15:53:11.1886741Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T15:53:11.1900409Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T15:53:11.1910614Z Entering 'third_party/gloo' 2025-12-04T15:53:11.1922781Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T15:53:11.1933707Z Entering 'third_party/googletest' 2025-12-04T15:53:11.1945911Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T15:53:11.1957215Z Entering 'third_party/ideep' 2025-12-04T15:53:11.1975282Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T15:53:11.1987228Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T15:53:11.1998636Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T15:53:11.2011914Z Entering 
'third_party/ittapi' 2025-12-04T15:53:11.2023973Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T15:53:11.2034326Z Entering 'third_party/kineto' 2025-12-04T15:53:11.2049460Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T15:53:11.2059167Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T15:53:11.2074498Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T15:53:11.2087793Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T15:53:11.2103941Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T15:53:11.2114685Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T15:53:11.2125038Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T15:53:11.2133793Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T15:53:11.2147675Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T15:53:11.2156367Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T15:53:11.2167042Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T15:53:11.2174466Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T15:53:11.2183778Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T15:53:11.2195481Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T15:53:11.2204854Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T15:53:11.2214183Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T15:53:11.2228349Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T15:53:11.2237431Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T15:53:11.2248850Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T15:53:11.2260710Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T15:53:11.2271962Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T15:53:11.2280938Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T15:53:11.2292191Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T15:53:11.2300599Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:11.2309564Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T15:53:11.2323816Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:11.2335916Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T15:53:11.2349254Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T15:53:11.2360323Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T15:53:11.2370959Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T15:53:11.2384616Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T15:53:11.2398264Z Entering 'third_party/kleidiai' 2025-12-04T15:53:11.2411706Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T15:53:11.2424293Z Entering 'third_party/mimalloc' 2025-12-04T15:53:11.2436799Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T15:53:11.2447176Z Entering 'third_party/nlohmann' 2025-12-04T15:53:11.2462696Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T15:53:11.2479508Z Entering 'third_party/onnx' 2025-12-04T15:53:11.2491130Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T15:53:11.2507252Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T15:53:11.2521230Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T15:53:11.2534974Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T15:53:11.2550507Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T15:53:11.2560935Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T15:53:11.2578075Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T15:53:11.2589239Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T15:53:11.2604957Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T15:53:11.2614799Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T15:53:11.2627199Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T15:53:11.2639980Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T15:53:11.2650605Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T15:53:11.2661082Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T15:53:11.2673181Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T15:53:11.2684057Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T15:53:11.2696666Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T15:53:11.2706137Z Entering 
'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T15:53:11.2722755Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T15:53:11.2734012Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T15:53:11.2746701Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T15:53:11.2757538Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T15:53:11.2770436Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T15:53:11.2787397Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T15:53:11.2800175Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T15:53:11.2819271Z Entering 'third_party/pocketfft' 2025-12-04T15:53:11.2829638Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T15:53:11.2839044Z Entering 'third_party/protobuf' 2025-12-04T15:53:11.2854020Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T15:53:11.2865687Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T15:53:11.2876171Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T15:53:11.2885441Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T15:53:11.2899056Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T15:53:11.2913773Z Entering 
'third_party/psimd' 2025-12-04T15:53:11.2925072Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T15:53:11.2935907Z Entering 'third_party/pthreadpool' 2025-12-04T15:53:11.2947681Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T15:53:11.2957958Z Entering 'third_party/pybind11' 2025-12-04T15:53:11.2972889Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T15:53:11.2983696Z Entering 'third_party/python-peachpy' 2025-12-04T15:53:11.2995892Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T15:53:11.3003514Z Entering 'third_party/sleef' 2025-12-04T15:53:11.3022649Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T15:53:11.3037976Z Entering 'third_party/tensorpipe' 2025-12-04T15:53:11.3052675Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T15:53:11.3063701Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T15:53:11.3074158Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T15:53:11.3083206Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T15:53:11.3096404Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T15:53:11.3110845Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T15:53:11.3124254Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T15:53:11.3136612Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T15:53:11.3148212Z 
file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T15:53:11.3156624Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T15:53:11.3169907Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T15:53:11.3202674Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3222303Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3239739Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3256440Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3280795Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3304083Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3319907Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3338486Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3353041Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3367333Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3379963Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3397535Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3412194Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3428858Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3443304Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3455827Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3469034Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3481803Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3496713Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3517566Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3532669Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3547513Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3564713Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3579107Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3597992Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3612928Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3628755Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3643692Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 
2025-12-04T15:53:11.3658541Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3673039Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3687234Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3701542Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3715587Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3738036Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3753604Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3767739Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3782914Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3804573Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only 
--get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3819892Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3837827Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3855082Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3871829Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3894470Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3909266Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3929973Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3944767Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config 
--name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3958486Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3977569Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.3993832Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4009807Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4024226Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4044608Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4059164Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4073083Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4087983Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 
2025-12-04T15:53:11.4104037Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4118096Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4133664Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4148153Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4163240Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4177526Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4196700Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4212228Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4227478Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config 
--name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4244117Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4258008Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4276747Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4290721Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4304672Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4320437Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4339314Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4357835Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4372013Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4391829Z [command]/usr/bin/git config --file 
/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4405777Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4419874Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4434071Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4447701Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4462137Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4476463Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4493178Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T15:53:11.4591491Z Cleaning up orphan processes